Extracting features from web search returned hits for hierarchical classification

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

11 Scopus citations

Abstract

In this paper, we discuss an approach to classifying documents using features extracted from returned documents that are closely related to the search query. The purpose is to organize the returned documents around a main theme, namely the query. To determine which features to use in classification, we analyze the portions of text in each document that are closely related to the query. The extracted features are then used as attributes in a monothetic classification of the returned documents. This approach has two advantages: (1) only closely related terms are displayed in the hierarchies; and (2) it supports dynamic, query-oriented topical classification.
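The idea in the abstract can be illustrated with a small Python sketch. It is not the authors' implementation: the fixed token window around query matches, the tiny stopword list, the document-frequency ranking, and the names `query_window_terms` and `monothetic_groups` are all assumptions made for illustration. It collects terms that occur near the query in each returned document, then forms monothetic groups, meaning each group is defined by the presence of a single term.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch).
STOPWORDS = {"the", "is", "a", "an", "in", "of", "and", "to", "are", "were"}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def query_window_terms(doc, query, window=5):
    """Return the set of non-stopword terms within `window` tokens of a query term."""
    tokens = tokenize(doc)
    query_terms = set(tokenize(query))
    terms = set()
    for i, tok in enumerate(tokens):
        if tok in query_terms:
            lo = max(0, i - window)
            for t in tokens[lo:i + window + 1]:
                if t not in query_terms and t not in STOPWORDS:
                    terms.add(t)
    return terms


def monothetic_groups(docs, query, window=5, top_k=3):
    """Group documents by single terms found near the query.

    Each group is defined by one term (monothetic): a document belongs
    to the group if that term occurs near the query in the document.
    Terms are ranked by document frequency, ties broken alphabetically.
    """
    doc_terms = [query_window_terms(d, query, window) for d in docs]
    df = Counter(t for terms in doc_terms for t in terms)
    ranked = sorted(df.items(), key=lambda kv: (-kv[1], kv[0]))
    top = [t for t, _ in ranked[:top_k]]
    return {t: [i for i, terms in enumerate(doc_terms) if t in terms] for t in top}
```

Calling `monothetic_groups(docs, "jaguar")` on a list of returned snippets yields a mapping from each selected term to the indices of the documents it covers. A document that never mentions the query contributes no features, which mirrors the claim that only closely related terms appear in the hierarchy.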

Original language: English (US)
Title of host publication: Proceedings of the International Conference on Information and Knowledge Engineering 2003
Editors: N. Goharian, N. Goharian
Pages: 103-108
Number of pages: 6
State: Published - Dec 1 2003
Event: Proceedings of the International Conference on Information and Knowledge Engineering 2003 - Las Vegas, NV, United States
Duration: Jun 23 2003 to Jun 26 2003

Publication series

Name: Proceedings of the International Conference on Information and Knowledge Engineering
Volume: 1

Other

Other: Proceedings of the International Conference on Information and Knowledge Engineering 2003
Country: United States
City: Las Vegas, NV
Period: 6/23/03 to 6/26/03

All Science Journal Classification (ASJC) codes

  • Engineering(all)

Keywords

  • Classification
  • Concept hierarchy
  • Feature extraction
  • Web search
