Identifying meaningful return information for XML keyword search

Ziyang Liu, Yi Chen

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

232 Scopus citations

Abstract

Keyword search enables web users to easily access XML data without the need to learn a structured query language and to study possibly complex data schemas. Existing work has addressed the problem of selecting qualified data nodes that match keywords and connecting them in a meaningful way, in the spirit of inferring a where clause in XQuery. However, how to infer the return clause for keyword search is an open problem. To address this challenge, we present an XML keyword search engine, XSeek, to infer the semantics of the search and identify return nodes effectively. XSeek recognizes possible entities and attributes inherently represented in the data. It also distinguishes between search predicates and return specifications in the keywords. Then based on the analysis of both XML data structures and keyword match patterns, XSeek generates return nodes. Extensive experimental studies show the effectiveness of XSeek.

Original language: English (US)
Title of host publication: SIGMOD 2007
Subtitle of host publication: Proceedings of the ACM SIGMOD International Conference on Management of Data
Pages: 329-340
Number of pages: 12
State: Published - 2007
Externally published: Yes
Event: SIGMOD 2007: ACM SIGMOD International Conference on Management of Data - Beijing, China
Duration: Jun 12 2007 to Jun 14 2007

Publication series

Name: Proceedings of the ACM SIGMOD International Conference on Management of Data
ISSN (Print): 0730-8078

Other

Other: SIGMOD 2007: ACM SIGMOD International Conference on Management of Data
Country/Territory: China
City: Beijing
Period: 6/12/07 to 6/14/07

All Science Journal Classification (ASJC) codes

  • Software
  • Information Systems

Keywords

  • Keyword search
  • XML
