Transformer-CNN hybrid network for improving PET time of flight prediction

Phys Med Biol. 2024 May 30;69(11). doi: 10.1088/1361-6560/ad4c4d.

Abstract

Objective.In positron emission tomography (PET) reconstruction, the integration of time-of-flight (TOF) information, known as TOF-PET, has been a major research focus. Compared to traditional reconstruction methods, the introduction of TOF enhances the signal-to-noise ratio of images. Precision in TOF is measured by full width at half maximum (FWHM) and the offset from ground truth, referred to as coincidence time resolution (CTR) and bias.Approach.This study proposes a network combining transformer and convolutional neural network (CNN) to utilize TOF information from detector waveforms, using event waveform pairs as inputs. This approach integrates the global self-attention mechanism of Transformer, which focuses on temporal relationships, with the local receptive field of CNN. The combination of global and local information allows the network to assign greater weight to the rising edges of waveforms, thereby extracting valuable temporal information for precise TOF predictions. Experiments were conducted using lutetium yttrium oxyorthosilicate (LYSO) scintillators and silicon photomultiplier (SiPM) detectors. The network was trained and tested using the waveform datasets after cropping.Main results.Compared to the constant fraction discriminator (CFD), CNN, CNN with attention, long short-term memory (LSTM) and Transformer, our network achieved an average CTR of 189 ps, reducing it by 82 ps (more than 30%), 13 ps (6.4%), 12 ps (6.0%), 16 ps (7.8%) and 9 ps (4.6%), respectively. Additionally, a reduction of 10.3, 8.7, 6.7 and 4 ps in average bias was achieved compared to CNN, CNN with attention, LSTM and Transformer.Significance.This work demonstrates the potential of applying the Transformer for PET TOF estimation using real experimental data. Through the integration of both CNN and Transformer with local and global attention, it achieves optimal performance, thereby presenting a novel direction for future research in this field.

Keywords: CTR; TOF-PET; deep learning; transformer.

MeSH terms

  • Image Processing, Computer-Assisted* / methods
  • Neural Networks, Computer*
  • Positron-Emission Tomography* / instrumentation
  • Positron-Emission Tomography* / methods
  • Time Factors