Support vector machine for breast cancer classification using diffusion-weighted MRI histogram features: Preliminary study

J Magn Reson Imaging. 2018 May;47(5):1205-1216. doi: 10.1002/jmri.25873. Epub 2017 Oct 16.

Abstract

Background: Diffusion-weighted MRI (DWI) is currently one of the fastest developing MRI-based techniques in oncology. Histogram properties from model fitting of DWI are useful features for differentiation of lesions, and classification can potentially be improved by machine learning.

Purpose: To evaluate classification of malignant and benign tumors and breast cancer subtypes using support vector machine (SVM).

Study type: Prospective.

Subjects: Fifty-one patients with benign (n = 23) and malignant (n = 28) breast tumors (26 ER+, whereof six were HER2+).

Field strength/sequence: Patients were imaged with DW-MRI (3T) using twice refocused spin-echo echo-planar imaging with echo time / repetition time (TR/TE) = 9000/86 msec, 90 × 90 matrix size, 2 × 2 mm in-plane resolution, 2.5 mm slice thickness, and 13 b-values.

Assessment: Apparent diffusion coefficient (ADC), relative enhanced diffusivity (RED), and the intravoxel incoherent motion (IVIM) parameters diffusivity (D), pseudo-diffusivity (D*), and perfusion fraction (f) were calculated. The histogram properties (median, mean, standard deviation, skewness, kurtosis) were used as features in SVM (10-fold cross-validation) for differentiation of lesions and subtyping.

Statistical tests: Accuracies of the SVM classifications were calculated to find the combination of features with highest prediction accuracy. Mann-Whitney tests were performed for univariate comparisons.

Results: For benign versus malignant tumors, univariate analysis found 11 histogram properties to be significant differentiators. Using SVM, the highest accuracy (0.96) was achieved from a single feature (mean of RED), or from three feature combinations of IVIM or ADC. Combining features from all models gave perfect classification. No single feature predicted HER2 status of ER + tumors (univariate or SVM), although high accuracy (0.90) was achieved with SVM combining several features. Importantly, these features had to include higher-order statistics (kurtosis and skewness), indicating the importance to account for heterogeneity.

Data conclusion: Our findings suggest that SVM, using features from a combination of diffusion models, improves prediction accuracy for differentiation of benign versus malignant breast tumors, and may further assist in subtyping of breast cancer.

Level of evidence: 3 Technical Efficacy: Stage 3 J. Magn. Reson. Imaging 2018;47:1205-1216.

Keywords: breast MR; diffusion weighted MRI; intravoxel incoherent motion; prognostic factors; support vector machine; tumor heterogeneity.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Aged
  • Algorithms
  • Breast / diagnostic imaging
  • Breast Neoplasms / diagnostic imaging*
  • Diffusion
  • Diffusion Magnetic Resonance Imaging*
  • Echo-Planar Imaging
  • Estrogen Receptor alpha / metabolism
  • Female
  • Humans
  • Image Interpretation, Computer-Assisted / methods
  • Image Processing, Computer-Assisted / methods*
  • Machine Learning
  • Middle Aged
  • Motion
  • Prospective Studies
  • Receptor, ErbB-2 / metabolism
  • Reproducibility of Results
  • Support Vector Machine*

Substances

  • ESR1 protein, human
  • Estrogen Receptor alpha
  • ERBB2 protein, human
  • Receptor, ErbB-2