TY - GEN
T1 - Efficient In-Memory Evaluation of Reachability Graph Pattern Queries on Data Graphs
AU - Wu, Xiaoying
AU - Theodoratos, Dimitri
AU - Skoutas, Dimitrios
AU - Lan, Michael
N1 - Funding Information:
The research of the first author was supported by the National Natural Science Foundation of China under Grant No. 61872276.
Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - Graphs are a widely used data model in modern data-intensive applications. Graph pattern matching is a fundamental operation for the exploration and analysis of large data graphs. In this paper, we present a novel approach for efficiently finding homomorphic matches of graph pattern queries, where pattern edges denote reachability relationships between nodes in the data graph. We first propose the concept of query reachability graph to compactly encode all the possible homomorphisms from a query pattern to the data graph. Then, we design a graph traversal-based filtering method to prune nodes from the data graph which violate reachability conditions induced by the pattern edges. We use the pruned data graph to generate a refined query reachability graph which serves as a compact search space for the pattern query answer. Finally, we design a multiway join algorithm to enumerate answer tuples from the query reachability graph without generating an excessive number of redundant intermediate results (a drawback of previous approaches). We experimentally verify the efficiency of our approach and demonstrate that it outperforms by far existing approaches and a recent graph DBMS on evaluating reachability graph pattern queries.
AB - Graphs are a widely used data model in modern data-intensive applications. Graph pattern matching is a fundamental operation for the exploration and analysis of large data graphs. In this paper, we present a novel approach for efficiently finding homomorphic matches of graph pattern queries, where pattern edges denote reachability relationships between nodes in the data graph. We first propose the concept of query reachability graph to compactly encode all the possible homomorphisms from a query pattern to the data graph. Then, we design a graph traversal-based filtering method to prune nodes from the data graph which violate reachability conditions induced by the pattern edges. We use the pruned data graph to generate a refined query reachability graph which serves as a compact search space for the pattern query answer. Finally, we design a multiway join algorithm to enumerate answer tuples from the query reachability graph without generating an excessive number of redundant intermediate results (a drawback of previous approaches). We experimentally verify the efficiency of our approach and demonstrate that it outperforms by far existing approaches and a recent graph DBMS on evaluating reachability graph pattern queries.
KW - Edge-to-path homomorphism
KW - Graph pattern matching
KW - Multi-way join
UR - http://www.scopus.com/inward/record.url?scp=85129868314&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85129868314&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-00123-9_4
DO - 10.1007/978-3-031-00123-9_4
M3 - Conference contribution
AN - SCOPUS:85129868314
SN - 9783031001222
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 55
EP - 71
BT - Database Systems for Advanced Applications - 27th International Conference, DASFAA 2022, Proceedings
A2 - Bhattacharya, Arnab
A2 - Lee Mong Li, Janice
A2 - Agrawal, Divyakant
A2 - Reddy, P. Krishna
A2 - Mohania, Mukesh
A2 - Mondal, Anirban
A2 - Goyal, Vikram
A2 - Uday Kiran, Rage
PB - Springer Science and Business Media Deutschland GmbH
T2 - 27th International Conference on Database Systems for Advanced Applications, DASFAA 2022
Y2 - 11 April 2022 through 14 April 2022
ER -