TY - GEN
T1 - Evaluation of partial path queries on XML data
AU - Souldatos, Stefanos
AU - Wu, Xiaoying
AU - Theodoratos, Dimitri
AU - Dalamagas, Theodore
AU - Sellis, Timos
PY - 2007
Y1 - 2007
N2 - XML query languages typically allow the specification of structural patterns of elements. Finding the occurrences of such patterns in an XML tree is the key operation in XML query processing. Many algorithms have been presented for this operation. These algorithms focus mainly on the evaluation of path-pattern or tree-pattern queries. In this paper, we define a partial path-pattern query language, and we address the problem of its efficient evaluation on XML data. In order to process partial path-pattern queries, we introduce a set of sound and complete inference rules to characterize structural relationship derivation. We provide necessary and sufficient conditions for detecting query unsatisfiability and node redundancy. We show how partial path-pattern queries can be equivalently put in a canonical directed acyclic graph form. We developed two stack-based algorithms for the evaluation of partial path-pattern queries, PartialMJ and PartialPathStack. PartialMJ computes answers to the query by merge-joining the results of the root-to-leaf paths of a spanning tree of the query. PartialPathStack exploits a topological order of the nodes of the query graph to match the query pattern as a whole to the XML tree. The experimental evaluation of our algorithms shows that PartialPathStack is independent of intermediate results and largely outperforms PartialMJ.
AB - XML query languages typically allow the specification of structural patterns of elements. Finding the occurrences of such patterns in an XML tree is the key operation in XML query processing. Many algorithms have been presented for this operation. These algorithms focus mainly on the evaluation of path-pattern or tree-pattern queries. In this paper, we define a partial path-pattern query language, and we address the problem of its efficient evaluation on XML data. In order to process partial path-pattern queries, we introduce a set of sound and complete inference rules to characterize structural relationship derivation. We provide necessary and sufficient conditions for detecting query unsatisfiability and node redundancy. We show how partial path-pattern queries can be equivalently put in a canonical directed acyclic graph form. We developed two stack-based algorithms for the evaluation of partial path-pattern queries, PartialMJ and PartialPathStack. PartialMJ computes answers to the query by merge-joining the results of the root-to-leaf paths of a spanning tree of the query. PartialPathStack exploits a topological order of the nodes of the query graph to match the query pattern as a whole to the XML tree. The experimental evaluation of our algorithms shows that PartialPathStack is independent of intermediate results and largely outperforms PartialMJ.
KW - Partial path-pattern query
KW - Query evaluation
KW - Tree-structured data
UR - http://www.scopus.com/inward/record.url?scp=63449119872&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=63449119872&partnerID=8YFLogxK
U2 - 10.1145/1321440.1321446
DO - 10.1145/1321440.1321446
M3 - Conference contribution
AN - SCOPUS:63449119872
SN - 9781595938039
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 21
EP - 30
BT - CIKM 2007 - Proceedings of the 16th ACM Conference on Information and Knowledge Management
T2 - 16th ACM Conference on Information and Knowledge Management, CIKM 2007
Y2 - 6 November 2007 through 9 November 2007
ER -