TY - GEN
T1 - Efficient storage and temporal query evaluation in hierarchical data archiving systems
AU - Wang, Hui
AU - Liu, Ruilin
AU - Theodoratos, Dimitri
AU - Wu, Xiaoying
PY - 2011
Y1 - 2011
N2 - Data archiving has been commonly used in many fields for data backup and analysis purposes. Although comprehensive application software, new computing and storage technologies, and the Internet have made it easier to create, collect and store all types of data, the meaningful storing, accessing, and managing of database archives in a cost-effective way remains extremely challenging. In this paper, we focus on hierarchical data archiving that has been popularly used in the scientific field and web data management. First, we propose a novel compaction scheme for archiving hierarchical data. By compacting both data and timestamps, our scheme substantially reduces not only the amount of needed storage, but also the incremental archiving time. Second, we design a temporal query language to support data retrieval from the compact data archives. Third, as compaction on data and timestamps may bring significant overhead to query evaluation, we investigate how to optimize such overhead by exploiting the characteristics of the queries and of the archived hierarchical data. Finally, we conduct an extensive experimentation to demonstrate the effectiveness and efficiency of both our efficient storage and query optimization techniques.
AB - Data archiving has been commonly used in many fields for data backup and analysis purposes. Although comprehensive application software, new computing and storage technologies, and the Internet have made it easier to create, collect and store all types of data, the meaningful storing, accessing, and managing of database archives in a cost-effective way remains extremely challenging. In this paper, we focus on hierarchical data archiving that has been popularly used in the scientific field and web data management. First, we propose a novel compaction scheme for archiving hierarchical data. By compacting both data and timestamps, our scheme substantially reduces not only the amount of needed storage, but also the incremental archiving time. Second, we design a temporal query language to support data retrieval from the compact data archives. Third, as compaction on data and timestamps may bring significant overhead to query evaluation, we investigate how to optimize such overhead by exploiting the characteristics of the queries and of the archived hierarchical data. Finally, we conduct an extensive experimentation to demonstrate the effectiveness and efficiency of both our efficient storage and query optimization techniques.
UR - http://www.scopus.com/inward/record.url?scp=79961187831&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79961187831&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-22351-8_7
DO - 10.1007/978-3-642-22351-8_7
M3 - Conference contribution
AN - SCOPUS:79961187831
SN - 9783642223501
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 109
EP - 128
BT - Scientific and Statistical Database Management - 23rd International Conference, SSDBM 2011, Proceedings
T2 - 23rd International Conference on Scientific and Statistical Database Management, SSDBM 2011
Y2 - 20 July 2011 through 22 July 2011
ER -