Biomedical text readability after hypernym substitution with fine-tuned large language models

Karl Swanson; Shuhan He; Josh Calvano; David Chen; Talar Telvizian; Lawrence Jiang; Paul Chong; Jacob Schwell; Gin Mak; Jarone Lee

doi:10.1371/journal.pdig.0000489

PLOS Digit Health. 2024 Apr 16;3(4):e0000489. doi: 10.1371/journal.pdig.0000489. eCollection 2024 Apr.

Authors

Karl Swanson¹, Shuhan He², Josh Calvano³, David Chen⁴, Talar Telvizian⁵, Lawrence Jiang⁶, Paul Chong⁷, Jacob Schwell⁸, Gin Mak⁹, Jarone Lee²

Affiliations

¹ Department of Medicine-Clinical Informatics, University of California-San Francisco, San Francisco, United States of America.
² Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States of America.
³ Department of Anesthesiology and Critical Care, University of New Mexico Hospital, Albuquerque, New Mexico, United States of America.
⁴ Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada.
⁵ Department of Internal Medicine, Main Line Health Lankenau Medical Center, Wynnewood, Pennsylvania, United States of America.
⁶ Department of Computer Science, Duke University, Durham, North Carolina, United States of America.
⁷ School of Osteopathic Medicine, Campbell University, Lillington, North Carolina, United States of America.
⁸ Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, Pennsylvania, United States of America.
⁹ Department of Psychology, Neuroscience & Behaviour, McMaster University, Hamilton, Ontario, Canada.

Abstract

The advent of patient access to complex medical information online has highlighted the need for simplification of biomedical text to improve patient understanding and engagement in taking ownership of their health. However, comprehension of biomedical text remains a difficult task due to the need for domain-specific expertise. We aimed to study the simplification of biomedical text via large language models (LLMs), which are commonly used for general natural language processing tasks involving text comprehension, summarization, generation, and prediction of new text from prompts. Specifically, we fine-tuned three variants of large language models to substitute complex words and phrases in biomedical text with a related hypernym. The output of the text substitution process using LLMs was evaluated by comparing the pre- and post-substitution texts using four readability metrics and two measures of sentence complexity. A sample of 1,000 biomedical definitions in the National Library of Medicine's Unified Medical Language System (UMLS) was processed with three LLM approaches, and each showed an improvement in readability and sentence complexity after hypernym substitution. Readability scores improved from a collegiate reading level before processing to a US high-school level after processing. Comparison between the three LLMs showed that the GPT-J-6b approach had the best improvement in measures of sentence complexity. This study demonstrates the merit of hypernym substitution to improve readability of complex biomedical text for the public and highlights the use case for fine-tuning open-access large language models for biomedical natural language processing.
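
The pre- and post-substitution comparison described in the abstract can be illustrated with a minimal sketch. The abstract does not name the four readability metrics, so the Flesch-Kincaid grade level used here is an assumption chosen only because it is a widely known formula. The lookup table `HYPERNYMS`, the helper names, and the example sentence are hypothetical. The dictionary stands in for the fine-tuned LLM and is not the authors' method. The syllable counter is a rough vowel-group heuristic.

```python
import re

# Hypothetical term-to-hypernym table; a stand-in for the fine-tuned LLM.
HYPERNYMS = {
    "thrombocytopenia": "a blood disorder",
    "hepatocellular carcinoma": "liver cancer",
}


def count_syllables(word):
    """Approximate syllables as the number of vowel groups (heuristic)."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def flesch_kincaid_grade(text):
    """Flesch-Kincaid grade level: 0.39*W/S + 11.8*Syl/W - 15.59."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+", text)
    if not sentences or not words:
        return 0.0
    syllables = sum(count_syllables(w) for w in words)
    return (0.39 * len(words) / len(sentences)
            + 11.8 * syllables / len(words)
            - 15.59)


def substitute_hypernyms(text, table):
    """Replace each known term with its hypernym, longest terms first."""
    for term in sorted(table, key=len, reverse=True):
        text = re.sub(re.escape(term), table[term], text, flags=re.IGNORECASE)
    return text


original = "The patient developed thrombocytopenia after hepatocellular carcinoma."
simplified = substitute_hypernyms(original, HYPERNYMS)

print(simplified)
print("grade before:", round(flesch_kincaid_grade(original), 2))
print("grade after: ", round(flesch_kincaid_grade(simplified), 2))
```

A lower grade level after substitution corresponds to the kind of readability improvement the abstract reports, although the study's actual metrics and models may behave differently from this toy example.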

Copyright: © 2024 Swanson et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Grants and funding

The author(s) received no specific funding for this work.