Trading off popularity for diversity in the results sets of keyword queries on linked data

Ananya Dass, Dimitri Theodoratos

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Keyword search is the most popular technique for querying the ever growing repositories of RDF graph data on the Web. However, keyword queries are ambiguous. As a consequence, they typically produce on linked data a huge number of candidate results corresponding to a plethora of alternative query interpretations. Current approaches ignore the diversity of the result interpretations and might fail to satisfy the users who are looking for less popular results. In this paper, we propose a novel approach for keyword search result diversification on RDF graphs. Our approach instead of diversifying the query results per se, diversifies the interpretations of the query (i.e., pattern graphs). We model the problem as an optimization problem aiming at selecting k pattern graphs which maximize an objective function balancing relevance and diversity. We devise metrics to assess the relevance and diversity of a set of pattern graphs, and we design a greedy heuristic algorithm to generate a relevant and diverse list of k pattern graphs for a given keyword query. The experimental results show the effectiveness of our approach and proposed metrics and also the efficiency of our algorithm.

Original languageEnglish (US)
Title of host publicationWeb Engineering - 17th International Conference, ICWE 2017, Proceedings
EditorsJordi Cabot, Roberto De Virgilio, Riccardo Torlone
PublisherSpringer Verlag
Pages151-170
Number of pages20
ISBN (Print)9783319601304
DOIs
StatePublished - 2017
Event17th International Conference on Web Engineering, ICWE 2017 - Rome, Italy
Duration: Jun 5 2017Jun 8 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10360 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other17th International Conference on Web Engineering, ICWE 2017
Country/TerritoryItaly
CityRome
Period6/5/176/8/17

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Trading off popularity for diversity in the results sets of keyword queries on linked data'. Together they form a unique fingerprint.

Cite this