TY - GEN
T1 - Efficiently Computing Homomorphic Matches of Hybrid Pattern Queries on Large Graphs
AU - Wu, Xiaoying
AU - Theodoratos, Dimitri
AU - Skoutas, Dimitrios
AU - Lan, Michael
N1 - Publisher Copyright:
© 2019, Springer Nature Switzerland AG.
PY - 2019
Y1 - 2019
N2 - In this paper, we address the problem of efficiently finding homomorphic matches for hybrid patterns over large data graphs. Finding matches for patterns in data graphs is of fundamental importance for graph analytics. In hybrid patterns, each edge may correspond either to an edge or a path in the data graph, thus allowing for higher expressiveness and flexibility in query formulation. We introduce the concept of answer graph to compactly represent the query results and exploit computation sharing. We design a holistic bottom-up algorithm called GPM, which greatly reduces the number of intermediate results, leading to significant performance gains. GPM directly processes child constraints in the given query instead of resorting to a post-processing procedure. An extensive experimental evaluation using both real and synthetic datasets shows that our methods evaluate hybrid patterns up to several orders of magnitude faster than existing algorithms and exhibit much better scalability.
AB - In this paper, we address the problem of efficiently finding homomorphic matches for hybrid patterns over large data graphs. Finding matches for patterns in data graphs is of fundamental importance for graph analytics. In hybrid patterns, each edge may correspond either to an edge or a path in the data graph, thus allowing for higher expressiveness and flexibility in query formulation. We introduce the concept of answer graph to compactly represent the query results and exploit computation sharing. We design a holistic bottom-up algorithm called GPM, which greatly reduces the number of intermediate results, leading to significant performance gains. GPM directly processes child constraints in the given query instead of resorting to a post-processing procedure. An extensive experimental evaluation using both real and synthetic datasets shows that our methods evaluate hybrid patterns up to several orders of magnitude faster than existing algorithms and exhibit much better scalability.
UR - http://www.scopus.com/inward/record.url?scp=85077117619&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85077117619&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-27520-4_20
DO - 10.1007/978-3-030-27520-4_20
M3 - Conference contribution
AN - SCOPUS:85077117619
SN - 9783030275198
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 279
EP - 295
BT - Big Data Analytics and Knowledge Discovery - 21st International Conference, DaWaK 2019, Proceedings
A2 - Ordonez, Carlos
A2 - Song, Il-Yeol
A2 - Anderst-Kotsis, Gabriele
A2 - Khalil, Ismail
A2 - Tjoa, A Min
PB - Springer
T2 - 21st International Conference on Big Data Analytics and Knowledge Discovery, DaWaK 2019
Y2 - 26 August 2019 through 29 August 2019
ER -