Performance of deep learning for differentiating pancreatic diseases on contrast-enhanced magnetic resonance imaging: A preliminary study

Diagn Interv Imaging. 2020 Feb;101(2):91-100. doi: 10.1016/j.diii.2019.07.002. Epub 2019 Jul 30.

Abstract

Purpose: The purpose of this study was to evaluate the ability of deep learning to differentiate pancreatic diseases on contrast-enhanced magnetic resonance (MR) images with the aid of a generative adversarial network (GAN).

Materials and methods: A total of 504 patients who underwent T1-weighted contrast-enhanced MR examinations before any treatment were included in this retrospective study. First, the MRI examinations of 398 patients (215 men, 183 women; mean age, 59.14±12.07 [SD] years [range: 16-85 years]) from one hospital were used as the training set. Then the MRI examinations of 50 (26 men, 24 women; mean age, 58.58±13.64 [SD] years [range: 24-85 years]) and 56 (30 men, 26 women; mean age, 59.13±11.35 [SD] years [range: 26-80 years]) consecutive patients from two hospitals were separately collected as the internal and external validation sets. An InceptionV4 network was trained on the training set augmented by synthetic images from GANs. The classification performance of the trained InceptionV4 network was evaluated for every patch and every patient on both validation sets. The prediction agreement between the convolutional neural network (CNN) and the radiologist was measured by Cohen's kappa coefficient.
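
As an illustration of the agreement measure named above, the following is a minimal sketch of Cohen's kappa for two raters, computed as (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and p_e the agreement expected by chance. The function name, the class labels, and the example ratings are hypothetical; they are not data or code from the study.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(rater_a)
    # Observed agreement: fraction of cases where both raters give the same label.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of the two raters' marginal counts, summed over labels.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    labels = freq_a.keys() | freq_b.keys()
    p_e = sum(freq_a[k] * freq_b[k] for k in labels) / (n * n)
    if p_e == 1.0:
        return 1.0  # both raters used one and the same label throughout
    return (p_o - p_e) / (1.0 - p_e)


# Hypothetical patient-level labels (illustrative only, not study data).
cnn = ["A", "A", "B", "B", "C", "C"]
reader = ["A", "A", "B", "C", "C", "C"]
kappa = cohens_kappa(cnn, reader)
print(f"kappa = {kappa:.4f}")
```

In this toy example the two raters agree on five of six cases (p_o = 5/6) and the chance agreement is 1/3, which gives a kappa of 0.75.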

Results: The patch-level average accuracy and the micro-averaging area under receiver operating characteristic curve (AUC) of the InceptionV4 network were 71.56% and 0.9204 (95% confidence interval [CI]: 0.9165-0.9308) for the internal validation set, and 79.46% and 0.9451 (95%CI: 0.9320-0.9523) for the external validation set, respectively. The patient-level average accuracy and the micro-averaging AUC of the InceptionV4 network were 70.00% and 0.8250 (95%CI: 0.8147-0.8326) for the internal validation set, and 76.79% and 0.8646 (95%CI: 0.8489-0.8772) for the external validation set, respectively. For the human reader, the average accuracy and micro-averaging AUC were 82.00% and 0.8950 (95%CI: 0.8817-0.9083) for the internal validation set, and 83.93% and 0.9063 (95%CI: 0.8968-0.9212) for the external validation set, respectively. The Cohen's kappa coefficients between the InceptionV4 network and the human reader for the internal and external validation sets were 0.8339 (95%CI: 0.6991-0.9447) and 0.8862 (95%CI: 0.7759-0.9738), respectively.
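
For readers unfamiliar with micro-averaging, the sketch below shows one common way a micro-averaged AUC is computed for a multi-class classifier: every (sample, class) pair is pooled into a single binary problem, and the AUC is then the probability that a positive pair receives a higher score than a negative pair. The abstract does not state the number of classes or the software used, so the function names, the three-class setup, and the scores are all assumptions made for illustration.

```python
def binary_auc(labels, scores):
    """AUC as the probability that a positive outscores a negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative")
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def micro_average_auc(true_classes, class_scores, n_classes):
    """Pool every (sample, class) pair into one binary problem, then compute AUC."""
    flat_labels, flat_scores = [], []
    for true_class, scores in zip(true_classes, class_scores):
        for k in range(n_classes):
            # One-vs-rest indicator: 1 only for the sample's true class.
            flat_labels.append(1 if k == true_class else 0)
            flat_scores.append(scores[k])
    return binary_auc(flat_labels, flat_scores)


# Hypothetical three-class scores for four samples (illustrative only).
y_true = [0, 1, 2, 1]
y_score = [
    [0.7, 0.2, 0.1],
    [0.3, 0.5, 0.2],
    [0.2, 0.2, 0.6],
    [0.5, 0.4, 0.1],
]
micro_auc = micro_average_auc(y_true, y_score, n_classes=3)
print(f"micro-averaged AUC = {micro_auc:.4f}")
```

The pairwise comparison is quadratic in the number of pooled pairs, which is acceptable for a small illustration; a rank-based implementation would be preferable for thousands of patches.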

Conclusion: Deep learning using CNN and GAN had the potential to differentiate pancreatic diseases on contrast-enhanced MR images.

Keywords: Convolutional neural network (CNN); Deep learning; Generative adversarial network (GAN); Magnetic resonance imaging (MRI); Pancreatic diseases.

Publication types

  • Multicenter Study
  • Validation Study

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Aged, 80 and over
  • Contrast Media*
  • Deep Learning*
  • Diagnosis, Differential
  • Female
  • Humans
  • Magnetic Resonance Imaging / methods*
  • Male
  • Middle Aged
  • Pancreatic Diseases / diagnostic imaging*
  • Retrospective Studies
  • Young Adult

Substances

  • Contrast Media