Identifying important concepts from medical documents

Quanzhi Li, Yi Fang Brook Wu

Research output: Contribution to journalArticlepeer-review

39 Scopus citations


Automated medical concept recognition is important for medical informatics such as medical document retrieval and text mining research. In this paper, we present a software tool called keyphrase identification program (KIP) for identifying topical concepts from medical documents. KIP combines two functions: noun phrase extraction and keyphrase identification. The former automatically extracts noun phrases from medical literature as keyphrase candidates. The latter assigns weights to extracted noun phrases for a medical document based on how important they are to that document and how domain specific they are in the medical domain. The experimental results show that our noun phrase extractor is effective in identifying noun phrases from medical documents, so is the keyphrase extractor in identifying important medical conceptual terms. They both performed better than the systems they were compared to.

Original languageEnglish (US)
Pages (from-to)668-679
Number of pages12
JournalJournal of Biomedical Informatics
Issue number6
StatePublished - Dec 2006

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Health Informatics


  • Document keyphrase
  • Keyphrase extraction
  • Medical concepts
  • Medical documents
  • Noun phrase extraction
  • Text mining


Dive into the research topics of 'Identifying important concepts from medical documents'. Together they form a unique fingerprint.

Cite this