A hybrid classifier approach for Web retrieved documents classification

Razvan Stefan Bot, Yi Fang Brook Wu, Xin Chen, Quanzhi Li

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

3 Scopus citations

Abstract

The paper presents a hybrid technique for classifying Web-returned hits into concept hierarchies. The technique combines manual and automatic classifiers: first, all Web-returned documents are assigned to human-defined categories using manual classifiers; then automatic classifiers generate a concept hierarchy for each of these categories. The evaluation shows that (a) for polysemous queries, the system generates meaningful categories corresponding to, but not limited to, the different semantic facets of the queries; (b) as expected, the system generates fewer categories for non-polysemous queries; and (c) the hierarchy precision of the concept hierarchies generated for polysemous queries is significantly better than that obtained with a baseline system.
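The two-stage flow described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the keyword rules in `MANUAL_RULES` stand in for the manual classifiers, a one-level grouping by most frequent terms stands in for the automatic concept-hierarchy generation, and the function names are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical human-defined categories with hand-written keyword rules
# (a stand-in for the manual classifiers; not the paper's actual rules).
MANUAL_RULES = {
    "animal": {"cat", "habitat", "wildlife", "species"},
    "automobile": {"car", "engine", "dealer", "vehicle"},
}

STOPWORDS = {"the", "a", "of", "and", "in", "for", "is", "to"}


def tokenize(text):
    """Lowercase, split on whitespace, and drop stopwords."""
    return [w for w in text.lower().split() if w not in STOPWORDS]


def manual_classify(doc):
    """Stage 1: pick the category whose keywords overlap the document most.

    Documents matching no rule fall into 'other'.
    """
    tokens = set(tokenize(doc))
    best, best_overlap = "other", 0
    for category, keywords in MANUAL_RULES.items():
        overlap = len(tokens & keywords)
        if overlap > best_overlap:
            best, best_overlap = category, overlap
    return best


def build_hierarchy(docs, query, top_n=2):
    """Stage 2 (assumed stand-in): group documents under their most
    frequent non-query terms, giving a one-level concept grouping."""
    query_terms = set(tokenize(query))
    doc_freq = Counter()
    for doc in docs:
        doc_freq.update(set(tokenize(doc)) - query_terms)
    # Sort by descending document frequency, then alphabetically for
    # deterministic output when counts tie.
    ranked = sorted(doc_freq.items(), key=lambda kv: (-kv[1], kv[0]))
    concepts = [term for term, _ in ranked[:top_n]]
    return {c: [d for d in docs if c in tokenize(d)] for c in concepts}


def hybrid_classify(docs, query):
    """Run stage 1 on every document, then stage 2 within each category."""
    by_category = defaultdict(list)
    for doc in docs:
        by_category[manual_classify(doc)].append(doc)
    return {cat: build_hierarchy(ds, query) for cat, ds in by_category.items()}


if __name__ == "__main__":
    hits = [
        "jaguar habitat and wildlife",
        "jaguar car dealer",
        "jaguar engine for the car",
    ]
    for category, concepts in hybrid_classify(hits, "jaguar").items():
        print(category, concepts)
```

For a polysemous query such as "jaguar", the sketch separates the hits into one category per sense before grouping within each category, which mirrors the order of the two stages in the abstract.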

Original language: English (US)
Title of host publication: International Conference on Information Technology
Subtitle of host publication: Coding Computing, ITCC 2004
Editors: P.K. Srimani, A. Abraham, M. Cannataro, J. Domingo-Ferrer, R. Hashemi
Pages: 326-330
Number of pages: 5
State: Published - 2004
Event: International Conference on Information Technology: Coding Computing, ITCC 2004 - Las Vegas, NV, United States
Duration: Apr 5 2004 → Apr 7 2004

Publication series

Name: International Conference on Information Technology: Coding Computing, ITCC
Volume: 1

Other

Other: International Conference on Information Technology: Coding Computing, ITCC 2004
Country/Territory: United States
City: Las Vegas, NV
Period: 4/5/04 → 4/7/04

All Science Journal Classification (ASJC) codes

  • Software
  • General Engineering
  • Information Systems

Keywords

  • Automatic classification
  • Concept hierarchy
  • Hybrid classification
  • Information retrieval
  • Manual classification
