TY - JOUR
T1 - Structural group-based auditing of missing hierarchical relationships in UMLS
AU - Chen, Yan
AU - Gu, Huanying (Helen)
AU - Perl, Yehoshua
AU - Geller, James
N1 - Funding Information:
This work was partially supported by the United States National Library of Medicine under grant R01 LM008445-01A2.
PY - 2009/6
Y1 - 2009/6
N2 - The Metathesaurus of the UMLS was created by integrating various source terminologies. The inter-concept relationships were either integrated into the UMLS from the source terminologies or specially generated. Due to the extensive size and inherent complexity of the Metathesaurus, the accidental omission of some hierarchical relationships was inevitable. We present a recursive procedure which allows a human expert, with the support of an algorithm, to locate missing hierarchical relationships. The procedure starts with a group of concepts with exactly the same (correct) semantic type assignments. It then partitions the concepts, based on child-of hierarchical relationships, into smaller, singly rooted, hierarchically connected subgroups. The auditor only needs to focus on the subgroups with very few concepts and their concepts with semantic type reassignments. The procedure was evaluated by comparing it with a comprehensive manual audit and it exhibits a perfect error recall.
AB - The Metathesaurus of the UMLS was created by integrating various source terminologies. The inter-concept relationships were either integrated into the UMLS from the source terminologies or specially generated. Due to the extensive size and inherent complexity of the Metathesaurus, the accidental omission of some hierarchical relationships was inevitable. We present a recursive procedure which allows a human expert, with the support of an algorithm, to locate missing hierarchical relationships. The procedure starts with a group of concepts with exactly the same (correct) semantic type assignments. It then partitions the concepts, based on child-of hierarchical relationships, into smaller, singly rooted, hierarchically connected subgroups. The auditor only needs to focus on the subgroups with very few concepts and their concepts with semantic type reassignments. The procedure was evaluated by comparing it with a comprehensive manual audit and it exhibits a perfect error recall.
KW - Auditing
KW - Hierarchical relationships
KW - Partition
KW - Refined semantic network
KW - Refined semantic type
KW - Semantic refinement
KW - Semantic type assignment
KW - UMLS
UR - http://www.scopus.com/inward/record.url?scp=65649085898&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=65649085898&partnerID=8YFLogxK
U2 - 10.1016/j.jbi.2008.08.006
DO - 10.1016/j.jbi.2008.08.006
M3 - Article
C2 - 18824248
AN - SCOPUS:65649085898
SN - 1532-0464
VL - 42
SP - 452
EP - 467
JO - Journal of Biomedical Informatics
JF - Journal of Biomedical Informatics
IS - 3
ER -