Multiresolution Mutual Information Method for Social Network Entity Resolution

Cong Shi, Rong Duan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

Online Social Networks (OSN) are widely adopted in our daily lives, and it is common for one individual to registerwith multiple sites for different services. Linking the rich contentsof different social network sites is valuable to researchers forunderstanding human behaviors from different perspectives. Forinstance, each OSN has its own group of users and thus, has itsown biases. Linked accounts can be a good calibration dataset toimprove data quality. This Entity Resolution (ER) problem is achallenge in the social network domain that many researchersattempt to tackle. In this paper we take advantage of spatialinformation posted in different social network sites and proposean efficient multiresolution mutual information approach to linkthe entities from those sites. The proposed method significantlyreduces the computing time by utilizing an iterative coarse-tofinemultiresolution approach, yet is robust in dealing with thesparsity of location data. The human location-wise behavior isalso discussed in deciding the resolution level. Public availableTwitter and Instagram data collected from their APIs are usedto illustrate the method, and the performance is evaluated bycomparing it with greedy mutual information approach.

Original languageEnglish (US)
Title of host publicationProceedings - 15th IEEE International Conference on Data Mining Workshop, ICDMW 2015
EditorsXindong Wu, Alexander Tuzhilin, Hui Xiong, Jennifer G. Dy, Charu Aggarwal, Zhi-Hua Zhou, Peng Cui
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages240-247
Number of pages8
ISBN (Electronic)9781467384926
DOIs
StatePublished - Jan 29 2016
Externally publishedYes
Event15th IEEE International Conference on Data Mining Workshop, ICDMW 2015 - Atlantic City, United States
Duration: Nov 14 2015Nov 17 2015

Publication series

NameProceedings - 15th IEEE International Conference on Data Mining Workshop, ICDMW 2015

Other

Other15th IEEE International Conference on Data Mining Workshop, ICDMW 2015
Country/TerritoryUnited States
CityAtlantic City
Period11/14/1511/17/15

All Science Journal Classification (ASJC) codes

  • Electrical and Electronic Engineering
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Multiresolution Mutual Information Method for Social Network Entity Resolution'. Together they form a unique fingerprint.

Cite this