| Literature DB >> 35312031 |
Cameron Dennis Pain1,2, Gary F Egan3,4, Zhaolin Chen3,5.
Abstract
Image processing plays a crucial role in maximising diagnostic quality of positron emission tomography (PET) images. Recently, deep learning methods developed across many fields have shown tremendous potential when applied to medical image enhancement, resulting in a rich and rapidly advancing literature surrounding this subject. This review encapsulates methods for integrating deep learning into PET image reconstruction and post-processing for low-dose imaging and resolution enhancement. A brief introduction to conventional image processing techniques in PET is firstly presented. We then review methods which integrate deep learning into the image reconstruction framework as either deep learning-based regularisation or as a fully data-driven mapping from measured signal to images. Deep learning-based post-processing methods for low-dose imaging, temporal resolution enhancement and spatial resolution enhancement are also reviewed. Finally, the challenges associated with applying deep learning to enhance PET images in the clinical setting are discussed and future research directions to address these challenges are presented.Entities:
Keywords: Deep learning; Denoising; Dynamic PET; Image reconstruction; Low-dose; PET; Super resolution
Mesh:
Year: 2022 PMID: 35312031 PMCID: PMC9250483 DOI: 10.1007/s00259-022-05746-4
Source DB: PubMed Journal: Eur J Nucl Med Mol Imaging ISSN: 1619-7070 Impact factor: 10.057
Fig. 1Image processing in the context of PET as a whole. Advancements in various fields contribute holistically to improvements in PET as a modality. This review considers data-driven deep learning-based techniques in the image processing pipeline
Fig. 2Description of deep learning-based PET image reconstruction methods. A End-to-end methods are fully data-driven and do not require an instrument-based system matrix. B Regularisation-based methods utilise a neural network in combination with data consistency, retaining the system matrix. C Unrolling iterative algorithms into a sequence of processing steps makes it feasible to train iteration-specific regularisation terms
Fig. 3Images from Mehranian et al. comparing reference thirty minute maximum a posteriori expectation maximisation reconstruction with a 2-min reconstruction with FBSEM-Net using PET and MR inputs (FBSEM-pm), standard data consistency-based reconstructions (MAPEM) and deep learning-based post-processing with PET and MR inputs (Unet-pm) (images from [68])
Summary of deep learning based low-dose to full-dose post-processing implementations reviewed in this work. Details from each
source are: the neural network architecture, dimensions of the input data, additional input information, tracer, anatomical region, activity and acquisition time, the dose or time reduction factor, and the evaluation metrics used to convey performance
| Network architecture | PET input dimensions | Additional input data | Tracers | Anatomy | Activity/Acq, time (MBq, min) | Dose/time reduction factor | Evaluation metrics | |
|---|---|---|---|---|---|---|---|---|
| [ | CNN | 2D Patch | T1 | 18F-FDG | Brain | (203,12) | 4 | PSNR, nMSE |
| [ | Unet | 2.5D | None | 18F-FDG | Brain | (370,40) | 200 | SSIM, PSNR, NRMSE |
| [ | Residual Unet | 2D | T1, T2, FLAIR | 18F-FBB | Brain | (330, 20) | 100 | PSNR, RMSE, SSIM, QCS, rSUV, CD |
| [ | Unet | 3D | CT | 18F-FDG | Cardiac | (300,10) | 10, 100 | LVEF, ESV, EDV |
| [ | Modified Unet | 2.5D | LAVA | 18F-FDG | Whole body | Site 1: (3 kg−1, 3.5 bed−1) Site 2: (3 kg−1, 4 bed−1) | 16 | PSNR, NRMSE, SSIM, rSUV, CTD |
| [ | Unet | 3D Patch | None | 18F-FDG | Brain | (5.18 kg−1, 5 bed−1) | 7.5, 30 | SNR, SSIM |
| [ | CNN (Dilating convolutional kernels) | 2D | None | 18F-FDG | Brain | (166.5, 10) | 10 | MAE, PSNR, SSIM, rMAE |
| [ | Unet | 3D | None | 18F-FDG | Brain | (205, 20) | 20 | PSNR, RMSE, SSIM, rSUV, QCS |
| [ | FFNN | 2D Patch | None | Sim 82Rb, 82Rb | Cardiac | (N/A, 7) | 7, 3.5, 1.5 | NMSE, ROI Contrast |
| [ | Unet | 3D Patch | None | 18F-FDG | Whole body | (225.3, 10) | 6.7, 9.1, 13.3, 17.5, 26.3, 66.7, 125, 250, 500 | Lesion SUV, QCS, CTD |
| [ | Modified Unet | 2D | Sim T1 | Sim 18F-FDG | Brain | (N/A, N/A) | N/A | MSE, Lesion CR |
| [ | CNN | 3D | T1 | 18F-FDG | Brain | (N/A, N/A) | 10, 100 | NRMSE, SUV bias |
| [ | cycleGAN | 2D Patch | None | 18F-FDG | Brain | (218.3, 20) | 125 | PSNR, NRMSE, SSIM, SUV bias |
| [ | GAN | 2D | None | 18F-FBB | Brain | (300, 20) | 10 | PSNR, NRMSE, SSIM, rSUV, QCS, CD |
| [ | GAN | 3D Patch | None | 18F-FDG | Whole body | (5.55 kg−1, 20) | 2 | SSIM, PSNR |
| [ | cycleGAN | 3D Patch | None | 18F-FDG | Whole body | BMI 30 | 8 | MAE, NRMSE, rPSNR |
| [ | cycleGAN | 2D Patch | None | 18F-FDG | Whole Body | (370, 5) | 3.3, 10 | PSNR, NRMSE SUV bias |
| [ | GAN | 2D Patch | None | 18F-FDG | Whole body | (N/A, N/A) | 10 | PSNR, RMSE, SSIM Lesion SUV |
| [ | GAN | 2.5D | None | 18F-FBB | Brain | (330, 20) | 100 | PSNR, RMSE, SSIM, FBM, EBM, CD |
| [ | GAN | 3D Patch | None | 18F-FDG | Brain | (203, 12) | 4 | PSNR, nMSE, rSUV |
| [ | GAN | 3D Patch | T1, DT | 18F-FDG | Brain | (203, 12) | 4 | PSNR, SSIM, rCR |
| [ | GAN | 3D Patch | None | 18F-FDG | Whole body | (5.55 kg−1, 20) | 5 | NRMSE, PSNR, RFSIM, VIF |
| [ | CAE, Unet, GAN | 2D, 2.5D, 3D | None | 18F-FDG | Thoracic | (370, 20) | 10 | PSNR, nMSE, Lesion SUV bias |
| [ | Residual Unet | 2D | T1, T2, FLAIR | 18F-FBB | Brain | LD: (8, 30) FD: (334, 20) | 42 | PSNR, RMSE, SSIM rSUV, QCS, CD |
| [ | Residual Unet | 2D | T1, T2, FLAIR | 18F-FBB | Brain | Site 1: (330, 20) Site 2: (283, 20) | Site 1: 100 Site 2: 20 | PSNR, RMSE, SSIM rSUV, QCS, CD |
| [ | Unet | 3D Patch | None | 18F-FDG, 18F-FMISO, 68Ga-Dotatate | Whole body | FDG: (340, 20) FMISO: (181, 50) DOTATATE: (130, 21.6) | 10 | PSNR, NRMSE, Lesion SUV bias |
| [ | Residual Unet | 2.5D | None | Sim 18F-FDG, 18F-FDG | Brain | (185, 70) | 4 | CR |
| [ | Unet | 2.5D | None | 18F-FDG | Whole body | Site 1: (481, 3 bed−1) Site 2: (400, 3 bed−1) Site 3: (429, 3 bed−1) | 4 | QCS, CTD, rSUV |
| [ | Residual Unet | 3D | None | 18F-FDG | Whole body | (391, 2.45 bed−1) | 1.33, 2, 4 | CTD, rSUV |
| [ | Modified Unet | 2.5D | T1, T2 | 18F-FDG | Brain | (230, 30) | 180 | PSNR, SSIM |
DT diffusion tensor, PSNR peak signal-to-noise ratio, RMSE root mean square error, NRMSE normalised root mean square error, MSE mean square error, MAE mean absolute error, rSUV regional SUV, CR contrast recovery, SSIM structural similarity index, QCS qualitative clinical score, CD clinical diagnosis, CTD clinical tumour detectability, LVEF left ventricular ejection fraction, EDV end diastolic volume, ESV end systolic volume, LAVA liver acquisition volume acceleration, RFSIM Riesz-transform based feature similarity, VIF visual information fidelity, Sim simulated data
Fig. 4Examples of deep learning-based low-dose to full-dose post-processing using PET only inputs and evaluated across multiple sites using different acquisition protocols and scanners. The neural network used in this case was trained using data sourced independently from the three evaluation sites to provide an unbiased evaluation (Image from [100])
Fig. 5Demonstration of low-dose to full-dose mapping and the benefits of including multi-contrast MRI (PET + MR) as an input to the deep learning-based algorithm as compared to using only PET inputs (PET Only) (Images from [75])
Fig. 6Including task-specific perceptual loss in the form of a pre-trained amyloid classifier improves diagnostic quality of synthesised full-dose images (images from [91])
Fig. 7Deep learning-based spatial resolution enhancement using unpaired training with high-resolution images acquired on a dedicated brain scanner as targets. Optimal results were obtained from pre-training with synthetic data (images from [128])