Improving XML search by generating and utilizing informative result snippets

Ziyang Liu, Yu Huang, Yi Chen

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

Snippets are used by almost every text search engine to complement the ranking scheme in order to effectively handle user searches, which are inherently ambiguous and whose relevance semantics are difficult to assess. Despite the fact that XML is a standard representation format of Web data, research on generating result snippets for XML search remains limited. To tackle this important yet open problem, in this article we present a system, eXtract, that generates snippets for XML search results. We identify that a good XML result snippet should be a meaningful information unit of a small size that effectively summarizes the query result and differentiates it from others, so that users can quickly assess the relevance of the query result. We have designed and implemented a novel algorithm to satisfy these requirements. Furthermore, we propose to cluster the query results based on their snippets. Since XML result clustering can only be done at query time, snippet-based clustering significantly improves efficiency while compromising little clustering accuracy. We verified the efficiency and effectiveness of our approach through experiments.
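
The idea of clustering query results by their snippets, rather than by the full results, can be illustrated with a minimal sketch. This is not the eXtract algorithm, which the abstract does not detail; it assumes each snippet is available as a plain string and swaps in a simple greedy pass with Jaccard similarity over word sets. The function names, the whitespace tokenization, and the 0.5 threshold are all hypothetical choices made for illustration.

```python
from typing import Dict, List, Set, Tuple


def tokens(snippet: str) -> Set[str]:
    """Lower-cased word set of a snippet (hypothetical tokenization)."""
    return set(snippet.lower().split())


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two token sets; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cluster_by_snippet(
    snippets: Dict[str, str], threshold: float = 0.5
) -> List[List[str]]:
    """Greedy single-pass clustering on snippets only (illustrative assumption).

    Each result joins the first cluster whose representative snippet is at
    least `threshold` similar; otherwise it starts a new cluster.
    """
    clusters: List[Tuple[Set[str], List[str]]] = []
    for result_id, text in snippets.items():
        toks = tokens(text)
        for representative, members in clusters:
            if jaccard(representative, toks) >= threshold:
                members.append(result_id)
                break
        else:
            # No sufficiently similar cluster: this snippet becomes a representative.
            clusters.append((toks, [result_id]))
    return [members for _, members in clusters]


if __name__ == "__main__":
    demo = {
        "r1": "store Texas apparel outwear men",
        "r2": "store Texas apparel outwear women",
        "r3": "book database systems",
    }
    print(cluster_by_snippet(demo))
```

Because the comparison touches only the short snippets, the cost per pair depends on snippet size rather than on the size of the full query results, which is the efficiency argument the abstract makes.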

Original language: English (US)
Article number: 19
Journal: ACM Transactions on Database Systems
Volume: 35
Issue number: 3
State: Published - Jul 1 2010
Externally published: Yes

All Science Journal Classification (ASJC) codes

  • Information Systems

Keywords

  • Algorithms
  • Design
