A survey of recent methods for addressing AI fairness and bias in biomedicine

Yifan Yang; Mingquan Lin; Han Zhao; Yifan Peng; Furong Huang; Zhiyong Lu

doi:10.1016/j.jbi.2024.104646

A survey of recent methods for addressing AI fairness and bias in biomedicine

J Biomed Inform. 2024 Jun:154:104646. doi: 10.1016/j.jbi.2024.104646. Epub 2024 Apr 25.

Authors

Yifan Yang¹, Mingquan Lin², Han Zhao³, Yifan Peng², Furong Huang⁴, Zhiyong Lu⁵

Affiliations

¹ National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, USA; Department of Computer Science, University of Maryland, College Park, USA.
² Department of Population Health Sciences, Weill Cornell Medicine, NY, USA.
³ Department of Computer Science, University of Illinois at Urbana-Champaign, Champaign, IL, USA.
⁴ Department of Computer Science, University of Maryland, College Park, USA.
⁵ National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, USA. Electronic address: zhiyong.lu@nih.gov.

PMID: 38677633
PMCID: PMC11129918 (available on 2025-06-01)
DOI: 10.1016/j.jbi.2024.104646

Abstract

Objectives: Artificial intelligence (AI) systems have the potential to revolutionize clinical practices, including improving diagnostic accuracy and surgical decision-making, while also reducing costs and manpower. However, it is important to recognize that these systems may perpetuate social inequities or demonstrate biases, such as those based on race or gender. Such biases can occur before, during, or after the development of AI models, making it critical to understand and address potential biases to enable the accurate and reliable application of AI models in clinical settings. To mitigate bias concerns during model development, we surveyed recent publications on different debiasing methods in the fields of biomedical natural language processing (NLP) or computer vision (CV). Then we discussed the methods, such as data perturbation and adversarial learning, that have been applied in the biomedical domain to address bias.

Methods: We performed our literature search on PubMed, ACM digital library, and IEEE Xplore of relevant articles published between January 2018 and December 2023 using multiple combinations of keywords. We then filtered the result of 10,041 articles automatically with loose constraints, and manually inspected the abstracts of the remaining 890 articles to identify the 55 articles included in this review. Additional articles in the references are also included in this review. We discuss each method and compare its strengths and weaknesses. Finally, we review other potential methods from the general domain that could be applied to biomedicine to address bias and improve fairness.

Results: The bias of AIs in biomedicine can originate from multiple sources such as insufficient data, sampling bias and the use of health-irrelevant features or race-adjusted algorithms. Existing debiasing methods that focus on algorithms can be categorized into distributional or algorithmic. Distributional methods include data augmentation, data perturbation, data reweighting methods, and federated learning. Algorithmic approaches include unsupervised representation learning, adversarial learning, disentangled representation learning, loss-based methods and causality-based methods.

Keywords: AI; Bias; Biomedicine; Fairness.

Published by Elsevier Inc.

Publication types

Review

MeSH terms

Algorithms
Artificial Intelligence*
Bias*
Humans
Machine Learning
Natural Language Processing*
Surveys and Questionnaires

Grants and funding

ZIA LM010021/ImNIH/Intramural NIH HHS/United States