[Medical image segmentation data augmentation method based on channel weight and data-efficient features]

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2024 Apr 25;41(2):220-227. doi: 10.7507/1001-5515.202302024.
[Article in Chinese]

Abstract

In computer-aided medical diagnosis, obtaining labeled medical image data is expensive, while there is a high demand for model interpretability. However, most deep learning models currently require a large amount of data and lack interpretability. To address these challenges, this paper proposes a novel data augmentation method for medical image segmentation. The uniqueness and advantages of this method lie in the utilization of gradient-weighted class activation mapping to extract data efficient features, which are then fused with the original image. Subsequently, a new channel weight feature extractor is constructed to learn the weights between different channels. This approach achieves non-destructive data augmentation effects, enhancing the model's performance, data efficiency, and interpretability. Applying the method of this paper to the Hyper-Kvasir dataset, the intersection over union (IoU) and Dice of the U-net were improved, respectively; and on the ISIC-Archive dataset, the IoU and Dice of the DeepLabV3+ were also improved respectively. Furthermore, even when the training data is reduced to 70 %, the proposed method can still achieve performance that is 95 % of that achieved with the entire dataset, indicating its good data efficiency. Moreover, the data-efficient features used in the method have interpretable information built-in, which enhances the interpretability of the model. The method has excellent universality, is plug-and-play, applicable to various segmentation methods, and does not require modification of the network structure, thus it is easy to integrate into existing medical image segmentation method, enhancing the convenience of future research and applications.

在计算机辅助医疗诊断领域,获取含标签的医学数据代价昂贵,同时对模型的可解释性要求较高,而目前大多数深度学习模型存在数据缺乏和可解释性差的局限。为此,本文提出一种新颖的用于医学图像分割的数据增强方法,其优势和新颖之处在于,通过梯度类激活热力图提取数据效用特征并与原图像进行融合,然后构建新的通道权重特征提取器来学习不同通道间的权重,最终实现了不具有破坏性的数据增强效果,提升了模型的性能、数据效用和可解释性。将本文方法应用于超光谱-克瓦希尔(Hyper-Kvasir)数据集,U型网络(U-net)模型的交并比(IoU)和戴斯(Dice)系数分别有所提升;在国际皮肤成像合作组织(ISIC)档案文件(Archive)数据集(ISIC-Archive)上,深度研究实验室V3+网络(DeepLabV3+)模型的指标IoU和Dice系数也分别有所提升。此外,在仅使用70%的训练数据的情况下,依然取得了原模型在整个数据集上训练所得性能的95%,表现出良好的数据效用。而且,该方法所使用的数据效用特征具有内置的可解释信息,有助于提高模型的可解释性。本文所提方法普适性较好,可以即插即用,适用于不同的分割方法,且无需修改网络结构,因此易于集成到现有的医学图像分割工作中,可提高今后研究和应用的便利性。.

Keywords: Data augmentation; Data efficiency; Deep learning; Interpretability; Medical image segmentation.

Publication types

  • English Abstract

MeSH terms

  • Algorithms*
  • Deep Learning*
  • Diagnosis, Computer-Assisted / methods
  • Diagnostic Imaging / methods
  • Humans
  • Image Processing, Computer-Assisted* / methods
  • Neural Networks, Computer