Leveraging 2D Deep Learning ImageNet-trained models for Native 3D Medical Image Analysis

Bhakti Baheti; Sarthak Pati; Bjoern Menze; Spyridon Bakas

doi:10.1007/978-3-031-33842-7_6

Leveraging 2D Deep Learning ImageNet-trained models for Native 3D Medical Image Analysis

Brainlesion. 2023:13769:68-79. doi: 10.1007/978-3-031-33842-7_6. Epub 2023 Jul 18.

Authors

Bhakti Baheti^{1

2

3}, Sarthak Pati^{1

2

3

4}, Bjoern Menze^{4

5}, Spyridon Bakas^{1

2

3}

Affiliations

¹ Center for Biomedical Image Computing and Analytics (CBICA), University of Pennsylvania, Philadelphia, PA, USA.
² Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
³ Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
⁴ Department of Informatics, Technical University of Munich, Munich, Germany.
⁵ Department of Quantitative Biomedicine, University of Zurich, Zurich, Switzerland.

Abstract

Convolutional neural networks (CNNs) have shown promising performance in various 2D computer vision tasks due to availability of large amounts of 2D training data. Contrarily, medical imaging deals with 3D data and usually lacks the equivalent extent and diversity of data, for developing AI models. Transfer learning provides the means to use models trained for one application as a starting point to another application. In this work, we leverage 2D pre-trained models as a starting point in 3D medical applications by exploring the concept of Axial-Coronal-Sagittal (ACS) convolutions. We have incorporated ACS as an alternative of native 3D convolutions in the Generally Nuanced Deep Learning Framework (GaNDLF), providing various well-established and state-of-the-art network architectures with the availability of pre-trained encoders from 2D data. Results of our experimental evaluation on 3D MRI data of brain tumor patients for i) tumor segmentation and ii) radiogenomic classification, show model size reduction by ~22% and improvement in validation accuracy by ~33%. Our findings support the advantage of ACS convolutions in pre-trained 2D CNNs over 3D CNN without pre-training, for 3D segmentation and classification tasks, democratizing existing models trained in datasets of unprecedented size and showing promise in the field of healthcare.

Keywords: Deep learning; ImageNet; MRI; Transfer learning; classification; segmentation.

Abstract

Grants and funding