Abstract
Automated medical concept recognition is important for medical informatics such as medical document retrieval and text mining research. In this paper, we present a software tool called keyphrase identification program (KIP) for identifying topical concepts from medical documents. KIP combines two functions: noun phrase extraction and keyphrase identification. The former automatically extracts noun phrases from medical literature as keyphrase candidates. The latter assigns weights to extracted noun phrases for a medical document based on how important they are to that document and how domain specific they are in the medical domain. The experimental results show that our noun phrase extractor is effective in identifying noun phrases from medical documents, so is the keyphrase extractor in identifying important medical conceptual terms. They both performed better than the systems they were compared to.
Original language | English (US) |
---|---|
Pages (from-to) | 668-679 |
Number of pages | 12 |
Journal | Journal of Biomedical Informatics |
Volume | 39 |
Issue number | 6 |
DOIs | |
State | Published - Dec 2006 |
All Science Journal Classification (ASJC) codes
- Computer Science Applications
- Health Informatics
Keywords
- Document keyphrase
- Keyphrase extraction
- Medical concepts
- Medical documents
- Noun phrase extraction
- Text mining