TY - GEN
T1 - VALUE
T2 - 2023 Workshop on Human-In-the-Loop Data Analytics, HILDA 2023 - Co-located with SIGMOD 2023
AU - Bhattacharjee, Kaustav
AU - Dasgupta, Aritra
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/6/18
Y1 - 2023/6/18
N2 - The widespread adoption of open datasets across various domains has emphasized the significance of joining and computing their utility. However, the interplay between computation and human interaction is vital for informed decision-making. To address this issue, we first propose a utility metric to calibrate the usefulness of open datasets when joined with other such datasets. Further, we distill this utility metric through a visual analytic framework called VALUE, which empowers the researchers to identify joinable datasets, prioritize them based on their utility, and inspect the joined dataset. This transparent evaluation of the utility of the joined datasets is implemented through a human-in-the-loop approach where the researchers can adapt and refine the selection criteria according to their mental model of utility. Finally, we demonstrate the effectiveness of our approach through a usage scenario using real-world open datasets.
AB - The widespread adoption of open datasets across various domains has emphasized the significance of joining and computing their utility. However, the interplay between computation and human interaction is vital for informed decision-making. To address this issue, we first propose a utility metric to calibrate the usefulness of open datasets when joined with other such datasets. Further, we distill this utility metric through a visual analytic framework called VALUE, which empowers the researchers to identify joinable datasets, prioritize them based on their utility, and inspect the joined dataset. This transparent evaluation of the utility of the joined datasets is implemented through a human-in-the-loop approach where the researchers can adapt and refine the selection criteria according to their mental model of utility. Finally, we demonstrate the effectiveness of our approach through a usage scenario using real-world open datasets.
KW - linking
KW - open datasets
KW - utility
KW - visual analytics
UR - http://www.scopus.com/inward/record.url?scp=85167947255&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85167947255&partnerID=8YFLogxK
U2 - 10.1145/3597465.3605225
DO - 10.1145/3597465.3605225
M3 - Conference contribution
AN - SCOPUS:85167947255
T3 - HILDA 2023 - Workshop on Human-In-the-Loop Data Analytics - Co-located with SIGMOD 2023
BT - HILDA 2023 - Workshop on Human-In-the-Loop Data Analytics - Co-located with SIGMOD 2023
PB - Association for Computing Machinery, Inc
Y2 - 18 June 2023
ER -