Scalable Optimization of Graph Pattern Queries Using Summary Graphs

Xiaoying Wu, Michael Lan, Md Rakibul Hasan, Dimitri Theodoratos

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Graph pattern matching is a fundamental operation for querying, exploring and analyzing graph data, widely used not only on web applications like social networks but also in various other domains. We focus on evaluating graph pattern queries involving direct (edge-to-edge matching) and reachability (edge-to-path matching) relationships under homomorphisms on data graphs. Most existing algorithms focus on isomorphic matching of patterns involving only direct edges. Given that this problem is NP-hard even for the restricted case of isomorphic patterns, these algorithms are space- and time-inefficient for general graphs. We address the problem of optimizing pattern queries on graphs using materialized views to prune the pattern matching search space. We propose a compact way of materializing the views that losslessly summarizes all the homomorphic matches of the query without explicitly storing all the query results. We design an algorithm for optimizing pattern queries in the presence of materialized views. We conducted experiments on various data graphs, query patterns, and materialized views. Our results demonstrate that our optimization algorithm can substantially speed up query evaluation, achieving improvements of several orders of magnitude depending on the query coverage provided by the materialized views. In addition, our approach scales smoothly, outperforming a previous graph simulation-based method.

Original languageEnglish (US)
Title of host publicationWeb Information Systems Engineering – WISE 2024 - 25th International Conference, Proceedings
EditorsMahmoud Barhamgi, Hua Wang, Xin Wang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages415-426
Number of pages12
ISBN (Print)9789819605668
DOIs
StatePublished - 2025
Event25th International Conference on Web Information Systems Engineering, WISE 2024 - Doha, Qatar
Duration: Dec 2 2024Dec 5 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15437 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference25th International Conference on Web Information Systems Engineering, WISE 2024
Country/TerritoryQatar
CityDoha
Period12/2/2412/5/24

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Scalable Optimization of Graph Pattern Queries Using Summary Graphs'. Together they form a unique fingerprint.

Cite this