Designing data warehouses

Dimitri Theodoratos, Timos Sellis

Research output: Contribution to journalConference articlepeer-review

69 Scopus citations

Abstract

A Data Warehouse (DW) is a database that collects and stores data from multiple remote and heterogeneous information sources. When a query is posed, it is evaluated locally, without accessing the original information sources. In this paper we deal with the issue of designing a DW, in the context of the relational model, by selecting a set of views to materialize in the DW. First, we briefly present a theoretical framework for the DW design problem, which concerns the selection of a set of views that (a) fit in the space allocated to the DW. (b) answer all the queries of interest, and (c) minimize the total query evaluation and view maintenance cost. We then formalize the DW design problem as a state space search problem by taking into account multiquery optimization over the maintenance queries (i.e., queries that compute changes to the materialized views) and the use of auxiliary views for reducing the view maintenance cost. Finally, incremental algorithms and heuristics for pruning the search space are presented.

Original languageEnglish (US)
Pages (from-to)279-301
Number of pages23
JournalData and Knowledge Engineering
Volume31
Issue number3
DOIs
StatePublished - Nov 1999
Externally publishedYes
EventProceedings of the 1998 17th International Conference on Conceptual Modeling (ER'98) - Singapore, Singapore
Duration: Nov 16 1998Nov 19 1998

All Science Journal Classification (ASJC) codes

  • Information Systems and Management

Fingerprint Dive into the research topics of 'Designing data warehouses'. Together they form a unique fingerprint.

Cite this