Using Generative Large Language Models for Hierarchical Relationship Prediction in Medical Ontologies

Hao Liu, Shuxin Zhou, Zhehuan Chen, Yehoshua Perl, Jiayin Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This study extends the exploration of ontology enrichment by evaluating the performance of various open-sourced Large Language Models (LLMs) on the task of predicting hierarchical relationships (IS-A) in medical ontologies including SNOMED CT Clinical Finding and Procedure hierarchies and the human Disease Ontology. With the previous finetuned BERT models for hierarchical relationship prediction as the baseline, we assessed eight open-source generative LLMs for the same task. We observed only three models, without finetuning, demonstrated comparable or superior performance compared to the baseline BERT -based models. The best performance model OpenChat achieved a macro average F1 score of 0.96 (0.95) on SNOMED CT Clinical Finding (Procedure) hierarchy, an increase over 7% from the baseline 0.89 (0.85). On human Disease Ontology, OpenChat excels with an F1 score of 0.91, outperforming the second-best performance model Vicuna (0.84). Notably, some LLMs prove unsuitable for hierarchical relationship prediction tasks or appliable for concept placement of medical ontologies. We also explored various prompt templates and ensemble techniques to uncover potential confounding factors in applying LLMs for IS-A relation predictions for medical ontologies.

Original languageEnglish (US)
Title of host publicationProceedings - 2024 IEEE 12th International Conference on Healthcare Informatics, ICHI 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages248-256
Number of pages9
ISBN (Electronic)9798350383737
DOIs
StatePublished - 2024
Externally publishedYes
Event12th IEEE International Conference on Healthcare Informatics, ICHI 2024 - Orlando, United States
Duration: Jun 3 2024Jun 6 2024

Publication series

NameProceedings - 2024 IEEE 12th International Conference on Healthcare Informatics, ICHI 2024

Conference

Conference12th IEEE International Conference on Healthcare Informatics, ICHI 2024
Country/TerritoryUnited States
CityOrlando
Period6/3/246/6/24

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
  • Information Systems and Management
  • Statistics, Probability and Uncertainty
  • Health Informatics

Keywords

  • Hieratical Relation Prediction
  • Large Language Models
  • Medical Ontology
  • Prompts Design
  • SNOMED CT

Fingerprint

Dive into the research topics of 'Using Generative Large Language Models for Hierarchical Relationship Prediction in Medical Ontologies'. Together they form a unique fingerprint.

Cite this