TY - GEN
T1 - Eager evaluation of partial tree-pattern queries on XML streams
AU - Theodoratos, Dimitri
AU - Wu, Xiaoying
PY - 2009
Y1 - 2009
N2 - Current streaming applications have stringent requirements on query response time and memory consumption because of the large (possibly unbounded)size of data they handle. Further, known query evaluation algorithms on streaming XML documents focus almost exclusively on tree-pattern queries(TPQs). However recently, requirements for flexible querying of XML data have motivated the introduction of query languages that are more general and flexible than TPQs. These languages are not supported by known algorithms. In this paper, we consider a language which generalizes and strictly contains TPQs. Queries in this language can be represented as dags enhanced with constraints.We explore this representation to design an original polynomial time streaming algorithm for these queries. Our algorithm avoids storing and processing matches of the query dag that do not contribute to new solutions (redundant matches). Its key feature is that it applies an eager evaluation strategy to quickly determine when node matches should be returned as solutions to the user and also to proactively detect redundant matches. We experimentally test its time and space performance. The results show the superiority of the eager algorithm compared to the only known algorithm for this class of queries which is a lazy algorithm.
AB - Current streaming applications have stringent requirements on query response time and memory consumption because of the large (possibly unbounded)size of data they handle. Further, known query evaluation algorithms on streaming XML documents focus almost exclusively on tree-pattern queries(TPQs). However recently, requirements for flexible querying of XML data have motivated the introduction of query languages that are more general and flexible than TPQs. These languages are not supported by known algorithms. In this paper, we consider a language which generalizes and strictly contains TPQs. Queries in this language can be represented as dags enhanced with constraints.We explore this representation to design an original polynomial time streaming algorithm for these queries. Our algorithm avoids storing and processing matches of the query dag that do not contribute to new solutions (redundant matches). Its key feature is that it applies an eager evaluation strategy to quickly determine when node matches should be returned as solutions to the user and also to proactively detect redundant matches. We experimentally test its time and space performance. The results show the superiority of the eager algorithm compared to the only known algorithm for this class of queries which is a lazy algorithm.
UR - http://www.scopus.com/inward/record.url?scp=67650109835&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=67650109835&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-00887-0_20
DO - 10.1007/978-3-642-00887-0_20
M3 - Conference contribution
AN - SCOPUS:67650109835
SN - 9783642008863
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 241
EP - 246
BT - Database Systems for Advanced Applications - 14th International Conference, DASFAA 2009, Proceedings
T2 - 14th International Conference on Database Systems for Advanced Applications, DASFAA 2009
Y2 - 21 April 2009 through 23 April 2009
ER -