TY - GEN
T1 - Concept chaining utilizing meronyms in text characterization
AU - Watrous-DeVersterre, Lori
AU - Wang, Chong
AU - Song, Min
PY - 2012
Y1 - 2012
N2 - For most, the web is the first source to answer a question formulated by curiosity, need, or research reasons. This phenomenon is due to the internet's ubiquitous access, ease of use, and the extensive and ever expanding content. The problem is no longer the need to acquire content to encourage use, but to provide organizational tools to support content categorization that will facilitate improved access methods. This paper presents the results of a new text characterization algorithm that combines semantic and linguistic techniques utilizing domain-based ontology background knowledge. It explores the combination of meronym, synonym, and hypernym linguistic relationships to create a set of concept chains used to represent concepts found in a document. The experiments show improved accuracy over bag-of-words based term weighting methods and reveal characteristics of the meronym contribution to document representation.
AB - For most, the web is the first source to answer a question formulated by curiosity, need, or research reasons. This phenomenon is due to the internet's ubiquitous access, ease of use, and the extensive and ever expanding content. The problem is no longer the need to acquire content to encourage use, but to provide organizational tools to support content categorization that will facilitate improved access methods. This paper presents the results of a new text characterization algorithm that combines semantic and linguistic techniques utilizing domain-based ontology background knowledge. It explores the combination of meronym, synonym, and hypernym linguistic relationships to create a set of concept chains used to represent concepts found in a document. The experiments show improved accuracy over bag-of-words based term weighting methods and reveal characteristics of the meronym contribution to document representation.
KW - clustering
KW - concept extraction
KW - digital libraries
KW - machine learning
KW - natural language processing
KW - ontology
KW - text characterization
UR - http://www.scopus.com/inward/record.url?scp=84863549997&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863549997&partnerID=8YFLogxK
U2 - 10.1145/2232817.2232862
DO - 10.1145/2232817.2232862
M3 - Conference contribution
AN - SCOPUS:84863549997
SN - 9781450311540
T3 - Proceedings of the ACM/IEEE Joint Conference on Digital Libraries
SP - 241
EP - 248
BT - JCDL '12 - Proceedings of the 12th ACM/IEEE-CS Joint Conference on Digital Libraries
T2 - 12th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL '12
Y2 - 10 June 2012 through 14 June 2012
ER -