Abstract
This chapter presents a new approach of mining the Web to identify people of similar background. To find similar people from the Web for a given person, two major research issues are person representation and matching persons. In this chapter, a person representation method which uses a person’s personal Web site to represent this person’s background is proposed. Based on this person representation method, the main proposed algorithm integrates textual content and hyperlink information of all the Web pages belonging to a personal Web site to represent a person and match persons. Other algorithms are also explored and compared to the main proposed algorithm. The evaluation methods and experimental results are presented.
| Original language | English (US) |
|---|---|
| Title of host publication | Handbook of Research on Text and Web Mining Technologies |
| Subtitle of host publication | Volume I-II |
| Publisher | IGI Global |
| Pages | 369-385 |
| Number of pages | 17 |
| Volume | I |
| ISBN (Electronic) | 9781599049915 |
| ISBN (Print) | 9781599049908 |
| DOIs | |
| State | Published - Jan 1 2008 |
All Science Journal Classification (ASJC) codes
- General Computer Science
Fingerprint
Dive into the research topics of 'Web Mining to Identify People of Similar Background'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver