A randomized approach for the incremental design of an evolving data warehouse

Dimitri Theodoratos, Theodore Dalamagas, Alkis Simitsis, Manos Stavropoulos

Research output: Chapter in Book/Report/Conference proceedingConference contribution

17 Scopus citations

Abstract

A Data Warehouse (DW) can be used to integrate data from multiple distributed data sources. A DW can be seen as a set of materialized views that determine its schema and its content in terms of the schema and the content of the data sources. DW applications require high query performance. For this reason, the design of a typical DW consists of selecting views to materialize that are able to answer a set of input user queries. However, the cost of answering the queries has to be balanced against the cost of maintaining the materialized views. In an evolving DW application, new queries need to be answered by the DW. An incremental selection of materialized views uses the materialized views already in the DW to answer parts of the new queries, and avoids the re-implementation of the DW from scratch. This incremental design is complex and an exhaustive approach is not feasible. We have developed a randomized approach for incrementally selecting a set of views that are able to answer a set of input user queries locally while minimizing a combination of the query evaluation and view maintenance cost. In this process we exploit “common sub-expressions” among new queries and between new queries and old views. Our approach is implemented and we report on its experimental evaluation.

Original languageEnglish (US)
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
EditorsHideko S. Kunii, Sushil Jajodia, Arne Solvberg
PublisherSpringer Verlag
Pages325-338
Number of pages14
ISBN (Print)3540428666, 9783540428664
DOIs
StatePublished - 2001
Externally publishedYes
Event20th International Conference on Conceptual Modeling, ER 2001 - Yokohama, Japan
Duration: Nov 27 2001Nov 30 2001

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2224
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other20th International Conference on Conceptual Modeling, ER 2001
Country/TerritoryJapan
CityYokohama
Period11/27/0111/30/01

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'A randomized approach for the incremental design of an evolving data warehouse'. Together they form a unique fingerprint.

Cite this