TY - GEN
T1 - Scalable Optimization of Graph Pattern Queries Using Summary Graphs
AU - Wu, Xiaoying
AU - Lan, Michael
AU - Hasan, Md Rakibul
AU - Theodoratos, Dimitri
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - Graph pattern matching is a fundamental operation for querying, exploring and analyzing graph data, widely used not only on web applications like social networks but also in various other domains. We focus on evaluating graph pattern queries involving direct (edge-to-edge matching) and reachability (edge-to-path matching) relationships under homomorphisms on data graphs. Most existing algorithms focus on isomorphic matching of patterns involving only direct edges. Given that this problem is NP-hard even for the restricted case of isomorphic patterns, these algorithms are space- and time-inefficient for general graphs. We address the problem of optimizing pattern queries on graphs using materialized views to prune the pattern matching search space. We propose a compact way of materializing the views that losslessly summarizes all the homomorphic matches of the query without explicitly storing all the query results. We design an algorithm for optimizing pattern queries in the presence of materialized views. We conducted experiments on various data graphs, query patterns, and materialized views. Our results demonstrate that our optimization algorithm can substantially speed up query evaluation, achieving improvements of several orders of magnitude depending on the query coverage provided by the materialized views. In addition, our approach scales smoothly, outperforming a previous graph simulation-based method.
AB - Graph pattern matching is a fundamental operation for querying, exploring and analyzing graph data, widely used not only on web applications like social networks but also in various other domains. We focus on evaluating graph pattern queries involving direct (edge-to-edge matching) and reachability (edge-to-path matching) relationships under homomorphisms on data graphs. Most existing algorithms focus on isomorphic matching of patterns involving only direct edges. Given that this problem is NP-hard even for the restricted case of isomorphic patterns, these algorithms are space- and time-inefficient for general graphs. We address the problem of optimizing pattern queries on graphs using materialized views to prune the pattern matching search space. We propose a compact way of materializing the views that losslessly summarizes all the homomorphic matches of the query without explicitly storing all the query results. We design an algorithm for optimizing pattern queries in the presence of materialized views. We conducted experiments on various data graphs, query patterns, and materialized views. Our results demonstrate that our optimization algorithm can substantially speed up query evaluation, achieving improvements of several orders of magnitude depending on the query coverage provided by the materialized views. In addition, our approach scales smoothly, outperforming a previous graph simulation-based method.
UR - http://www.scopus.com/inward/record.url?scp=85211952794&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85211952794&partnerID=8YFLogxK
U2 - 10.1007/978-981-96-0567-5_29
DO - 10.1007/978-981-96-0567-5_29
M3 - Conference contribution
AN - SCOPUS:85211952794
SN - 9789819605668
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 415
EP - 426
BT - Web Information Systems Engineering – WISE 2024 - 25th International Conference, Proceedings
A2 - Barhamgi, Mahmoud
A2 - Wang, Hua
A2 - Wang, Xin
PB - Springer Science and Business Media Deutschland GmbH
T2 - 25th International Conference on Web Information Systems Engineering, WISE 2024
Y2 - 2 December 2024 through 5 December 2024
ER -