Screening mammography performance according to breast density: a comparison between radiologists versus standalone intelligence detection

Mi-Ri Kwon; Yoosoo Chang; Soo-Youn Ham; Yoosun Cho; Eun Young Kim; Jeonggyu Kang; Eun Kyung Park; Ki Hwan Kim; Minjeong Kim; Tae Soo Kim; Hyeonsoo Lee; Ria Kwon; Ga-Young Lim; Hye Rin Choi; JunHyeok Choi; Shin Ho Kook; Seungho Ryu

doi:10.1186/s13058-024-01821-w

Screening mammography performance according to breast density: a comparison between radiologists versus standalone intelligence detection

Breast Cancer Res. 2024 Apr 22;26(1):68. doi: 10.1186/s13058-024-01821-w.

Authors

Mi-Ri Kwon^#¹, Yoosoo Chang^#^{2

3

4}, Soo-Youn Ham^#¹, Yoosun Cho⁵, Eun Young Kim⁶, Jeonggyu Kang⁵, Eun Kyung Park⁷, Ki Hwan Kim⁷, Minjeong Kim^{7

8}, Tae Soo Kim⁷, Hyeonsoo Lee⁷, Ria Kwon^{5

9}, Ga-Young Lim^{5

9}, Hye Rin Choi^{5

9}, JunHyeok Choi¹⁰, Shin Ho Kook¹, Seungho Ryu^#^{11

12

13}

Affiliations

¹ Department of Radiology, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine, Seoul, South Korea.
² Center for Cohort Studies, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine, Samsung Main Building B2, 250, Taepyung-ro 2ga, Jung-gu, 04514, Seoul, South Korea. yoosoo.chang@gmail.com.
³ Department of Occupational and Environmental Medicine, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea. yoosoo.chang@gmail.com.
⁴ Department of Clinical Research Design & Evaluation, Samsung Advanced Institute for Health Sciences & Technology, Sungkyunkwan University, Seoul, Republic of Korea. yoosoo.chang@gmail.com.
⁵ Center for Cohort Studies, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine, Samsung Main Building B2, 250, Taepyung-ro 2ga, Jung-gu, 04514, Seoul, South Korea.
⁶ Department of Surgery, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea.
⁷ Lunit Inc, Seoul, Republic of Korea.
⁸ Department of Statistics, Ewha Womans University, Seoul, Republic of Korea.
⁹ Institute of Medical Research, Sungkyunkwan University School of Medicine, Suwon, Republic of Korea.
¹⁰ School of Mechanical Engineering, Sunkyungkwan University, Seoul, Republic of Korea.
¹¹ Center for Cohort Studies, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine, Samsung Main Building B2, 250, Taepyung-ro 2ga, Jung-gu, 04514, Seoul, South Korea. sh703.yoo@gmail.com.
¹² Department of Occupational and Environmental Medicine, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea. sh703.yoo@gmail.com.
¹³ Department of Clinical Research Design & Evaluation, Samsung Advanced Institute for Health Sciences & Technology, Sungkyunkwan University, Seoul, Republic of Korea. sh703.yoo@gmail.com.

^# Contributed equally.

Abstract

Background: Artificial intelligence (AI) algorithms for the independent assessment of screening mammograms have not been well established in a large screening cohort of Asian women. We compared the performance of screening digital mammography considering breast density, between radiologists and AI standalone detection among Korean women.

Methods: We retrospectively included 89,855 Korean women who underwent their initial screening digital mammography from 2009 to 2020. Breast cancer within 12 months of the screening mammography was the reference standard, according to the National Cancer Registry. Lunit software was used to determine the probability of malignancy scores, with a cutoff of 10% for breast cancer detection. The AI's performance was compared with that of the final Breast Imaging Reporting and Data System category, as recorded by breast radiologists. Breast density was classified into four categories (A-D) based on the radiologist and AI-based assessments. The performance metrics (cancer detection rate [CDR], sensitivity, specificity, positive predictive value [PPV], recall rate, and area under the receiver operating characteristic curve [AUC]) were compared across breast density categories.

Results: Mean participant age was 43.5 ± 8.7 years; 143 breast cancer cases were identified within 12 months. The CDRs (1.1/1000 examination) and sensitivity values showed no significant differences between radiologist and AI-based results (69.9% [95% confidence interval [CI], 61.7-77.3] vs. 67.1% [95% CI, 58.8-74.8]). However, the AI algorithm showed better specificity (93.0% [95% CI, 92.9-93.2] vs. 77.6% [95% CI, 61.7-77.9]), PPV (1.5% [95% CI, 1.2-1.9] vs. 0.5% [95% CI, 0.4-0.6]), recall rate (7.1% [95% CI, 6.9-7.2] vs. 22.5% [95% CI, 22.2-22.7]), and AUC values (0.8 [95% CI, 0.76-0.84] vs. 0.74 [95% CI, 0.7-0.78]) (all P < 0.05). Radiologist and AI-based results showed the best performance in the non-dense category; the CDR and sensitivity were higher for radiologists in the heterogeneously dense category (P = 0.059). However, the specificity, PPV, and recall rate consistently favored AI-based results across all categories, including the extremely dense category.

Conclusions: AI-based software showed slightly lower sensitivity, although the difference was not statistically significant. However, it outperformed radiologists in recall rate, specificity, PPV, and AUC, with disparities most prominent in extremely dense breast tissue.

Keywords: Asian women; Breast; Intelligence; Mammography; Screening.

Publication types

Research Support, Non-U.S. Gov't
Comparative Study

MeSH terms

Adult
Algorithms
Artificial Intelligence*
Breast / diagnostic imaging
Breast / pathology
Breast Density*
Breast Neoplasms* / diagnosis
Breast Neoplasms* / diagnostic imaging
Breast Neoplasms* / epidemiology
Breast Neoplasms* / pathology
Early Detection of Cancer* / methods
Female
Humans
Mammography* / methods
Mass Screening / methods
Middle Aged
ROC Curve
Radiologists*
Republic of Korea / epidemiology
Retrospective Studies
Sensitivity and Specificity

Grants and funding

SKKU Excellence in Research Award Research Fund/Sungkyunkwan University