Skip to main navigation Skip to search Skip to main content

Web Mining to Identify People of Similar Background

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

This chapter presents a new approach of mining the Web to identify people of similar background. To find similar people from the Web for a given person, two major research issues are person representation and matching persons. In this chapter, a person representation method which uses a person’s personal Web site to represent this person’s background is proposed. Based on this person representation method, the main proposed algorithm integrates textual content and hyperlink information of all the Web pages belonging to a personal Web site to represent a person and match persons. Other algorithms are also explored and compared to the main proposed algorithm. The evaluation methods and experimental results are presented.

Original languageEnglish (US)
Title of host publicationHandbook of Research on Text and Web Mining Technologies
Subtitle of host publicationVolume I-II
PublisherIGI Global
Pages369-385
Number of pages17
VolumeI
ISBN (Electronic)9781599049915
ISBN (Print)9781599049908
DOIs
StatePublished - Jan 1 2008

All Science Journal Classification (ASJC) codes

  • General Computer Science

Fingerprint

Dive into the research topics of 'Web Mining to Identify People of Similar Background'. Together they form a unique fingerprint.

Cite this