Going beyond a First Reader: A Machine Learning Methodology for Optimizing Cost and Performance in Breast Ultrasound Diagnosis

Ultrasound Med Biol. 2015 Dec;41(12):3148-62. doi: 10.1016/j.ultrasmedbio.2015.07.020. Epub 2015 Sep 6.

Abstract

The goal of this study was to devise a machine learning methodology as a viable low-cost alternative to a second reader to help augment physicians' interpretations of breast ultrasound images in differentiating benign and malignant masses. Two independent feature sets consisting of visual features based on a radiologist's interpretation of images and computer-extracted features when used as first and second readers and combined by adaptive boosting (AdaBoost) and a pruning classifier resulted in a very high level of diagnostic performance (area under the receiver operating characteristic curve = 0.98) at a cost of pruning a fraction (20%) of the cases for further evaluation by independent methods. AdaBoost also improved the diagnostic performance of the individual human observers and increased the agreement between their analyses. Pairing AdaBoost with selective pruning is a principled methodology for achieving high diagnostic performance without the added cost of an additional reader for differentiating solid breast masses by ultrasound.

Keywords: Adaptive boosting; Artificial intelligence; Breast cancer; Breast ultrasound; Computer-aided diagnosis.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Area Under Curve
  • Breast Neoplasms / diagnostic imaging*
  • Female
  • Humans
  • Image Processing, Computer-Assisted / methods*
  • Machine Learning*
  • Middle Aged
  • Observer Variation
  • Sensitivity and Specificity
  • Ultrasonography, Mammary / methods*