Abstract
Data warehousing and on-line analytical processing (OLAP) are essential elements of decision support, which has increasingly become a focus of the database industry. Many commercial products and services are now available, and all of the principal database management system vendors now have offerings in these areas. Decision support places some rather different requirements on database technology compared to traditional on-line transaction processing applications. This paper provides an overview of data warehousing and OLAP technologies, with an emphasis on their new requirements. We describe back end tools for extracting, cleaning and loading data into a data warehouse; multidimensional data models typical of OLAP; front end client tools for querying and data analysis; server extensions for efficient query processing; and tools for metadata management and for managing the warehouse. In addition to surveying the state of the art, this paper also identifies some promising research issues, some of which are related to problems that the database research community has worked on for years, but others are only just beginning to be addressed. This overview is based on a tutorial that the authors presented at the VLDB Conference, 1996.
- 1 Inmon, W. H., Building the Data Warehouse. John Wiley, 1992. Google ScholarDigital Library
- 2 http://www.olapcouncil.orgGoogle Scholar
- 3 Codd, E. F., S. B. Codd, C. T. Salley, "Providing OLAP (On-Line Analytical Processing) to User Analyst: An IT Mandate." Available from Arbor Software's web site http://www.arborsoft.com/OLAP.html.Google Scholar
- 4 http://pwp.starnetinc.com/larryg/articles.htmlGoogle Scholar
- 5 Kimball, R. The Data Warehouse Toolkit. John Wiley, 1996.Google Scholar
- 6 Barclay, T., R. Barnes, J. Gray, P. Sundaresan, "Loading Databases using Dataflow Parallelism." SIGMOD Record, Vol. 23, No. 4, Dec.1994. Google ScholarDigital Library
- 7 Blakeley, J. A., N. Coburn, P. Larson. "Updating Derived Relations: Detecting Irrelevant and Autonomously Computable Updates." ACM TODS, Vol. 4, No. 3, 1989. Google ScholarDigital Library
- 8 Gupta, A., I. S. Mumick, "Maintenance of Materialized Views: Problems, Techniques, and Applications." Data Eng. Bulletin, Vol. 18, No. 2, June 1995.Google Scholar
- 9 Zhuge, Y., H. Garcia-Molina, J. Hammer, J. Widom, "View Maintenance in a Warehousing Environment," Proc. of SIGMOD Conf., 1995. Google ScholarDigital Library
- 10 Roussopoulos, N., et al., "The Maryland ADMS Project: Views R Us." Data Eng. Bulletin, Vol. 18, No. 2, June 1995.Google Scholar
- 11 O'Neil P., Quass D. "Improved Query Performance with Variant Indices", To appear in Proc. of SIGMOD Conf., 1997. Google ScholarDigital Library
- 12 O'Neil P., Graefe G. "Multi-Table Joins through Bitmapped Join Indices" SIGMOD Record, Sep. 1995. Google ScholarDigital Library
- 13 Harinarayan V., Rajaraman A., Ullman J. D. "Implementing Data Cubes Efficiently" Proc. of SIGMOD Conf., 1996. Google ScholarDigital Library
- 14 Chaudhuri S., Krishnamurthy R., Potamianos S., Shim K. "Optimizing Queries with Materialized Views" Intl. Conference on Data Engineering, 1995. Google ScholarDigital Library
- 15 Levy A., Mendelzon A., Sagiv Y. "Answering Queries Using Views" Proc. of PODS, 1995. Google ScholarDigital Library
- 16 Yang H. Z., Larson P. A. "Query Transformations for PSJ Queries", Proc. of VLDB, 1987. Google ScholarDigital Library
- 17 Kim W. "On Optimizing a SQL-like Nested Query" ACM TODS, Sep. 1982. Google ScholarDigital Library
- 18 Ganski, R., Wong H. K. T., "Optimization of Nested SQL Queries Revisited" Proc. of SIGMOD Conf., 1987. Google ScholarDigital Library
- 19 Dayal, U., "Of Nests and Trees: A Unified Approach to Processing Queries that Contain Nested Subqueries, Aggregates and Quantifiers" Proc. VLDB Conf., 1987. Google ScholarDigital Library
- 20 Murlaikrishna, "Improved Unnesting Algorithms for Join Aggregate SQL Queries" Proc. VLDB Conf., 1992. Google ScholarDigital Library
- 21 Seshadri P., Pirahesh H., Leung T. "Complex Query Decorrelation" Intl. Conference on Data Engineering, 1996. Google ScholarDigital Library
- 22 Mumick I. S., Pirahesh H. "Implementation of Magic Sets in Starburst" Proc. of SIGMOD Conf., 1994. Google ScholarDigital Library
- 23 Chaudhuri S., Shim K. "Optimizing Queries with Aggregate Views", Proc. of EDBT, 1996. Google ScholarDigital Library
- 24 Chaudhuri S., Shim K. "Including Group By in Query Optimization", Proc. of VLDB, 1994. Google ScholarDigital Library
- 25 Yan P., Larson P. A. "Eager Aggregation and Lazy Aggregation", Proc. of VLDB, 1995. Google ScholarDigital Library
- 26 Gupta A., Harinarayan V., Quass D. "Aggregate-Query Processing in Data Warehouse Environments", Proc. of VLDB, 1995. Google ScholarDigital Library
- 27 Chaudhuri S., Shim K. "An Overview of Cost-based Optimization of Queries with Aggregates" IEEE Data Enginering Bulletin, Sep. 1995.Google Scholar
- 28 Dewitt D. J., Gray J. "Parallel Database Systems: The Future of High Performance Database Systems" CACM, June 1992. Google ScholarDigital Library
- 29 Gray J. et.al. "Data Cube: A Relational Aggregation Operator Generalizing Group-by, Cross-Tab and Sub Totals" Data Mining and Knowledge Discovery Journal, Vol. 1, No. 1, 1997. Google ScholarDigital Library
- 30 Agrawal S. et.al. "On the Computation of Multidimensional Aggregates" Proc. of VLDB Conf., 1996.Google Scholar
- 31 Kimball R., Strehlo., "Why decision support fails and how to fix it", reprinted in SIGMOD Record, 24(3), 1995. Google ScholarDigital Library
- 32 Chatziantoniou D., Ross K. "Querying Multiple Features in Relational Databases" Proc. of VLDB Conf., 1996. Google ScholarDigital Library
- 33 Widom, J. "Research Problems in Data Warehousing." Proc. 4th Intl. CIKM Conf., 1995. Google ScholarDigital Library
- 34 Wu, M-C., A. P. Buchmann. "Research Issues in Data Warehousing." Submitted for publication.Google Scholar
Index Terms
- An overview of data warehousing and OLAP technology
Recommendations
Data warehousing and OLAP over big data: current challenges and future research directions
DOLAP '13: Proceedings of the sixteenth international workshop on Data warehousing and OLAPIn this paper, we highlight open problems and actual research trends in the field of Data Warehousing and OLAP over Big Data, an emerging term in Data Warehousing and OLAP research. We also derive several novel research directions arising in this field, ...
Data warehousing tool's architecture: from multidimensional analysis to data mining
DEXA '97: Proceedings of the 8th International Workshop on Database and Expert Systems ApplicationsDecision support tools evolve quickly. End user needs for analyses are more and more sophisticated. That is why it is necessary to set up departmental data marts from the data warehouse. Usually these data marts use the OLAP architecture, offering a ...
Comments