TY - GEN
T1 - The exploratory labeling assistant
T2 - 31st Annual ACM Symposium on User Interface Software and Technology, UIST 2018
AU - Felix, Cristian
AU - Dasgupta, Aritra
AU - Bertini, Enrico
N1 - Publisher Copyright:
© 2018 Copyright held by the owner/author(s). Publication rights licensed to ACM.
PY - 2018/10/11
Y1 - 2018/10/11
N2 - In this paper, we define the concept of exploratory labeling: the use of computational and interactive methods to help analysts categorize groups of documents into a set of unknown and evolving labels. While many computational methods exist to analyze data and build models once the data is organized around a set of predefined categories or labels, few methods address the problem of reliably discovering and curating such labels in the first place. In order to move first steps towards bridging this gap, we propose an interactive visual data analysis method that integrates human-driven label ideation, specification and refinement with machine-driven recommendations. The proposed method enables the user to progressively discover and ideate labels in an exploratory fashion and specify rules that can be used to automatically match sets of documents to labels. To support this process of ideation, specification, as well as evaluation of the labels, we use unsupervised machine learning methods that provide suggestions and data summaries. We evaluate our method by applying it to a real-world labeling problem as well as through controlled user studies to identify and reflect on patterns of interaction emerging from exploratory labeling activities.
AB - In this paper, we define the concept of exploratory labeling: the use of computational and interactive methods to help analysts categorize groups of documents into a set of unknown and evolving labels. While many computational methods exist to analyze data and build models once the data is organized around a set of predefined categories or labels, few methods address the problem of reliably discovering and curating such labels in the first place. In order to move first steps towards bridging this gap, we propose an interactive visual data analysis method that integrates human-driven label ideation, specification and refinement with machine-driven recommendations. The proposed method enables the user to progressively discover and ideate labels in an exploratory fashion and specify rules that can be used to automatically match sets of documents to labels. To support this process of ideation, specification, as well as evaluation of the labels, we use unsupervised machine learning methods that provide suggestions and data summaries. We evaluate our method by applying it to a real-world labeling problem as well as through controlled user studies to identify and reflect on patterns of interaction emerging from exploratory labeling activities.
KW - Document labeling
KW - Exploratory labeling
KW - Text analysis
KW - Visualization
UR - http://www.scopus.com/inward/record.url?scp=85056904824&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85056904824&partnerID=8YFLogxK
U2 - 10.1145/3242587.3242596
DO - 10.1145/3242587.3242596
M3 - Conference contribution
AN - SCOPUS:85056904824
T3 - UIST 2018 - Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology
SP - 153
EP - 164
BT - UIST 2018 - Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology
PB - Association for Computing Machinery, Inc
Y2 - 14 October 2018 through 17 October 2018
ER -