Semantic querying of tree-structured data sources using partially specified tree patterns

Dimitri Theodoratos, Theodore Dalamagas, Antonis Koufopoulos, Narain Gehani

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

16 Scopus citations

Abstract

Nowadays, huge volumes of data are organized or exported in a tree-structured form. Querying capabilities are provided through queries that are based on branching path expressions. Even within a single knowledge domain, structural differences make it difficult to query data sources in a uniform way. In this paper, we present a method for semantically querying tree-structured data sources using partially specified tree patterns. Based on dimensions, which are sets of semantically related nodes in tree structures, we define dimension graphs. Dimension graphs can be automatically extracted from trees and abstract their structural information. They are semantically rich constructs that support the formulation of queries and their efficient evaluation. We design a tree-pattern query language to query multiple tree-structured data sources. A central feature of this language is that the structure can be specified fully, partially, or not at all in the queries. Therefore, it can be used to query multiple trees with structural differences. We study the derivation of structural expressions in queries by introducing a set of inference rules for structural expressions. We define two types of query unsatisfiability and provide necessary and sufficient conditions for checking each of them. Our approach is validated through experimental evaluation.
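
To make the notion of a dimension graph more concrete, the following is a minimal sketch, not the paper's algorithm. It assumes that each node label belongs to exactly one dimension and that the graph has an edge from dimension A to dimension B whenever some node in A has a child in B. The toy tree, the `DIMENSION_OF` mapping, and the `dimension_graph` function are all hypothetical names introduced for illustration.

```python
from collections import defaultdict

# Hypothetical toy tree: each node is a (label, children) pair.
# "book" and "novel" subtrees use different labels for related concepts.
tree = ("catalog", [
    ("book", [("title", []), ("author", [])]),
    ("novel", [("name", []), ("writer", [])]),
])

# Assumed assignment of labels to dimensions (sets of semantically
# related nodes). In practice this grouping would come from domain knowledge.
DIMENSION_OF = {
    "catalog": "Catalog",
    "book": "Publication", "novel": "Publication",
    "title": "Title", "name": "Title",
    "author": "Author", "writer": "Author",
}


def dimension_graph(root, dimension_of):
    """Abstract a tree into a graph over dimensions.

    Assumption: an edge A -> B exists when some node in dimension A
    has a child node in dimension B.
    """
    edges = defaultdict(set)
    stack = [root]
    while stack:
        label, children = stack.pop()
        parent_dim = dimension_of[label]
        edges[parent_dim]  # register the dimension even if it has no children
        for child in children:
            edges[parent_dim].add(dimension_of[child[0]])
            stack.append(child)
    # Sort targets so the result is deterministic.
    return {dim: sorted(targets) for dim, targets in edges.items()}


graph = dimension_graph(tree, DIMENSION_OF)
```

In this sketch the two structurally different subtrees collapse into the same `Publication` dimension, which illustrates how a single graph could summarize trees whose labels and layouts differ.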

Original language: English (US)
Title of host publication: CIKM'05 - Proceedings of the 14th ACM International Conference on Information and Knowledge Management
Pages: 712-719
Number of pages: 8
State: Published - 2005
Event: CIKM'05 - Proceedings of the 14th ACM International Conference on Information and Knowledge Management - Bremen, Germany
Duration: Oct 31 2005 to Nov 5 2005

Publication series

Name: International Conference on Information and Knowledge Management, Proceedings

Other

Other: CIKM'05 - Proceedings of the 14th ACM International Conference on Information and Knowledge Management
Country/Territory: Germany
City: Bremen
Period: 10/31/05 to 11/5/05

All Science Journal Classification (ASJC) codes

  • General Decision Sciences
  • General Business, Management and Accounting

Keywords

  • Query evaluation
  • Query satisfiability
  • Tree-pattern queries
  • Tree-structured data
  • XML
