Interactive exploration of composite items

Sihem Amer-Yahia, Senjuti Basu Roy

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Scopus citations

Abstract

Data exploration is seeing a renewed interest in our community. With the rise of big data analytics, this area is growing to encompass not only approaches and algorithms to find the next best data items to explore but also interactivity, i.e. accounting for feedback from the data scientist during the exploration. Interactivity is essential to account for evolving needs during the exploration and also customize the discovery process. In this tutorial, we focus on the interactive exploration of Composite Items (CIs). CIs address complex information needs and are prevalent in online shopping where products are bundled together to provide discounts, in travel itinerary recommendation where points of interest in a city are combined into a single travel package, and task assignment in crowdsourcing where persoalized micro-tasks are composed and recommended to workers. CI formation is usually expressed as a constrained optimization problem. For instance, in online shopping, package retrieval can retrieve the cheapest smartphones (optimization objective) with compatible accessories (constraints). Similarly, a city tour must be the most popular and conform to a total time and cost budget. A data scientist interested in exploring a variety of CIs has to repeatedly reformulate optimization problems with new constraints and objectives. In this tutorial, we investigate the applicability of interactive data exploration approaches to CI formation. We will first review CI applications and shapes (15mn). We then discuss three big research questions 60mn): (i) algorithms for CI formation, (ii) modes of exploration for CIs, and (iii) human-in-the-loop CIs. We will conclude with research directions (15mn). The proposed tutorial is timely. It brings together several related efforts and addresses unsolved questions in the emerging area of human-in-the-loop exploration of complex information needs. The tutorial is relevant to the general area of data science and more specifically to Scalable Analytics, Data Mining, Clustering and Knowledge Discovery, Indexing, Query Processing and Optimization, and Crowdsourcing. The technical topics covered are constrained optimization, ranking semantics, clustering, algorithms, and empirical evaluations.

Original languageEnglish (US)
Title of host publicationAdvances in Database Technology - EDBT 2018
Subtitle of host publication21st International Conference on Extending Database Technology, Proceedings
EditorsMichael Bohlen, Reinhard Pichler, Norman May, Erhard Rahm, Shan-Hung Wu, Katja Hose
PublisherOpenProceedings.org
Pages513-516
Number of pages4
ISBN (Electronic)9783893180783
DOIs
StatePublished - 2018
Event21st International Conference on Extending Database Technology, EDBT 2018 - Vienna, Austria
Duration: Mar 26 2018Mar 29 2018

Publication series

NameAdvances in Database Technology - EDBT
Volume2018-March
ISSN (Electronic)2367-2005

Conference

Conference21st International Conference on Extending Database Technology, EDBT 2018
Country/TerritoryAustria
CityVienna
Period3/26/183/29/18

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Software
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Interactive exploration of composite items'. Together they form a unique fingerprint.

Cite this