| Literature DB >> 35330479 |
Francisco Silva1,2, Tania Pereira1, Inês Neves1,3, Joana Morgado1, Cláudia Freitas4,5, Mafalda Malafaia1,6, Joana Sousa1, João Fonseca1,6, Eduardo Negrão4, Beatriz Flor de Lima4, Miguel Correia da Silva4, António J Madureira4,5, Isabel Ramos4,5, José Luis Costa5,7,8, Venceslau Hespanhol4,5, António Cunha1,9, Hélder P Oliveira1,2.
Abstract
Advancements in the development of computer-aided decision (CAD) systems for clinical routines provide unquestionable benefits in connecting human medical expertise with machine intelligence, to achieve better quality healthcare. Considering the large number of incidences and mortality numbers associated with lung cancer, there is a need for the most accurate clinical procedures; thus, the possibility of using artificial intelligence (AI) tools for decision support is becoming a closer reality. At any stage of the lung cancer clinical pathway, specific obstacles are identified and "motivate" the application of innovative AI solutions. This work provides a comprehensive review of the most recent research dedicated toward the development of CAD tools using computed tomography images for lung cancer-related tasks. We discuss the major challenges and provide critical perspectives on future directions. Although we focus on lung cancer in this review, we also provide a more clear definition of the path used to integrate AI in healthcare, emphasizing fundamental research points that are crucial for overcoming current barriers.Entities:
Keywords: CT scan; computer-aided decision; learning models; lung cancer
Year: 2022 PMID: 35330479 PMCID: PMC8950137 DOI: 10.3390/jpm12030480
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Figure 1Prevalence of the major histological subtypes of lung cancer: non-small cell lung cancer and small cell lung cancer [13,14,15].
Figure 2Physiological response to the activation of membrane receptors and immune receptors. Activation of receptor kinases, such as EGFR, FGFR, or HER-2, promotes the activation of RAS and its downstream pathways that facilitate cell growth, proliferation, cell survival, and differentiation. However, mutations in the RAS family lead to its constitutive activation and the hyperactivation of the downstream pathways—leading to uncontrolled cell survival. On the other hand, PD-L1 is present on the cell surface of immune cells and its binding to tumor cells PD-1 inhibits cell proliferation, cytokine secretion, and the expression of anti-apoptotic molecules in immune cells culminating in the escape of cancer from immunosurveillance. The goal of target therapies is to diminish the activation of abnormal signalling pathways, which can be inhibited at every step.
Figure 3Radiomic vs. radiogenomic perspectives for lung cancer assessment. The assessment of lung cancer was first based on the nodule; however, recently, in radiogenomics approaches, other lung structures were shown to have relevant information for cancer characterization. Those approaches brought new challenges, such as the segmentation of lungs, to use this region of interest (ROI) in the AI-based models.
Figure 4Two main perspectives for CADs in lung cancer, focused on the nodule and a more holistic approach that takes into consideration information about the surrounding structures of the nodule.
Overview of published works regarding nodule detection approaches in lung CT images (2020–2021).
| Authors | Year | Dataset | Methods | Performance Results (%) |
|---|---|---|---|---|
| Tan et al. [ | 2020 | LIDC-IDRI | 3D CNNs, based on FCN, DenseNet, and U-Net | TPR = 97.5 |
| Mukherjee et al. [ | 2020 | LIDC-IDRI | Ensemble stacking | ACC = 99.5 |
| Shi et al. [ | 2020 | LUNA16 | 3D Res-I and U-Net network | TPR = 96.4 |
| Khehrah et al. [ | 2020 | LIDC-IDRI | SVM | ACC = 92 |
| Kuo et al. [ | 2020 | LIDC-IDRI Private (320 patients) | SVM | TPR = 92.1 |
| Zheng et al. [ | 2020 | LIDC-IDRI | 3D multiscale dense CNNs | TPR = 94.2 (1.0 FP/scan), |
| Paing et al. [ | 2020 | LIDC-IDRI | Optimized random forest | ACC = 93.1 |
| Liu et al. [ | 2020 | LIDC-IDRI | CNN algorithm: You Only Look Once v3 | TPR = 87.3 |
| Harsono et al. [ | 2020 | LIDC-IDRI Private (546 patients) | I3DR-Net | mAP = 49.6 (LIDC), |
| Xu et al. [ | 2020 | LUNA16 | 3D CNN networks: V-Net and multi-level contextual 3D CNNs | TPR = 93.1 (1.64 FP/scan) |
| Drokin and Ericheva [ | 2020 | LIDC-IDRI | Algorithm for sampling points from a point cloud | FROC = 85.9 |
| El-Regaily et al. [ | 2020 | LIDC-IDRI | Multi-view CNN | ACC = 91.0 |
| Ye et al. [ | 2020 | LUNA16 | Three modified V-Nets with multilevel receptive fields | ACC = 66.7 |
| Baker and Ghadi [ | 2020 | LIDC-IDRI | SVM | NRR = 94.5 |
| Halder et al. [ | 2020 | LIDC-IDRI | SVM | ACC = 88.2 |
| Jain et al. [ | 2020 | LUNA16 | SumNet | ACC = 94.1 |
| Mahersia et al. [ | 2020 | LIDC-IDRI | SVM, Bayesian back-propagation neuronal classifier and neuro-fuzzy classifier | NRR = 97.9 |
| Mittapalli and Thanikaiselvan [ | 2021 | LUNA16 | Multiscale CNN with Compound Fusions | CPM = 94.8 |
| Vipparla et al. [ | 2021 | LUNA16 | 3D Attention-based CNN architectures: MP-ACNN1, MP-ACNN2 and MP-ACNN3 | CPM = 93.1 |
| Luo et al. [ | 2021 | LUNA16 | SCPM-Net | TPR = 92.2 (1 FPs/image), |
| Bhaskar and Ganashree [ | 2021 | DSB-2017 | Gaussian mixture convolutional auto encoder + 3D deep CNN | ACC = 74.0 |
ACC: accuracy; AUC: area under the ROC curve; CPM: competition performance metric; DSC: Sørensen–Dice coefficient; FDR: false discovery rate; FNR: false negative rate; FP: false positive; FPR: false positive rate; FROC: free-response receiver operating characteristic; mAP: mean average precision; MCC: Matthews correlation coefficient; NPV: negative predictive value; NRR: nodule recognition rate; PPV: positive predictive value; TNR: true negative rate; TPR: true positive rate.
Overview of the published works regarding nodule segmentation approaches in lung CT images (2020–2021).
| Authors | Year | Dataset | Methods | Performance Results (%) |
|---|---|---|---|---|
| Sharma et al. [ | 2020 | SPIE-AAPM Lung CT Challenge | SVM + k-NN | ACC = 93.9 |
| Xiao et al. [ | 2020 | LUNA16 | 3D-UNet + Res2Net Neural Network | TPR = 99.1 |
| Singadkar et al. [ | 2020 | LIDC-IDRI | Deep deconvolutional residual network | DSC = 95.0 |
| Kumar and Raman [ | 2020 | LUNA16 | V-Net (3D CNN) | DSC = 96.1 |
| Rocha et al. [ | 2020 | LIDC-IDRI | Sliding Band Filter + U-Net + SegU-Net | DSC = 66.3 (SBF), |
| Hancock and Magnan [ | 2021 | LIDC-IDRI | Level set machine learning method | DSC = 83.6 |
| Savic et al. [ | 2021 | LIDC-IDRI Private—phantom (108 patients) | Algorithm based on the fast marching method | DSC = 93.3 |
ACC: accuracy; DSC: Sørensen–Dice coefficient; GM: Geometric mean; JI: Jaccard index; TPR: true positive rate.
Overview of published works regarding nodule classification approaches in lung CT images (2020–2021).
| Authors | Year | Dataset | Methods | Performance Results (%) |
|---|---|---|---|---|
| Wang et al. [ | 2020 | Private (1478 patients) | Adaptive-boost deep learning strategy with multiple 3D CNN-based weak classifiers | ACC = 73.4 |
| Xiao et al. [ | 2020 | LIDC-IDRI | ResNet-18 + Denoising autoencoder classifier + handcrafted features | ACC = 93.1 |
| Wang et al. [ | 2020 | LUNGx | ConvNet | ACC = 90.4 |
| Lin et al. [ | 2020 | LUNA16 | GVGG + ResCon network | TPR = 92.5 |
| Onishi et al. [ | 2020 | Private (60 patients) | M-Scale 3D CNN | TPR = 90.9 |
| Zhao et al. [ | 2020 | LIDC-IDRI | Multi-stream multi-task network | ACC = 93.9 |
| Zia et al. [ | 2020 | LIDC-IDRI | Multi-deep model | ACC = 90.7 |
| Jiang et al. [ | 2020 | LUNA16 | Ensemble of 3D Dual Path Networks | ACC = 90.2 |
| Bao et al. [ | 2020 | LIDC-IDRI | Global-local residual network | ACC = 90.4 |
| Shah et al. [ | 2020 | LUNA16 | NoduleNet (transfer learning from VGG16 and VGG19 models) | ACC = 95.0 |
| Tong et al. [ | 2020 | LIDC-IDRI | 3D-ResNet + SVM with RBF and polynomial kernels | ACC = 90.6 |
| Xu et al. [ | 2020 | LIDC-IDRI | Multi-scale cost-sensitive methods | ACC = 92.6 |
| Huang et al. [ | 2020 | LIDC-IDRI | Deep transfer convolutional neural network + Extreme learning machine | ACC = 94.6 |
| Naik et al. [ | 2020 | LUNA16 | FractalNet + CNN | ACC = 94.1 |
| Zhang et al. [ | 2020 | LUNA16 | 3D squeeze-and-excitation network and aggregated residual transformations | ACC = 91.7 |
| Liu et al. [ | 2020 | LIDC-IDRI | Multi-model ensemble learning architecture based on 3D CNNs: VggNet, ResNet, and InceptionNet | ACC = 90.6 |
| Afshar et al. [ | 2020 | LIDC-IDRI | 3D Multi-scale Capsule Network | ACC = 93.1 |
| Lyu et al. [ | 2020 | LIDC-IDRI | Multi-level cross ResNet | ACC = 92.2 |
| Wu et al. [ | 2020 | LIDC-IDRI | Deep residual network (ResNet + residual learning + migration learning) | ACC = 98.2 |
| Lin and Li [ | 2020 | LIDC-IDRI | Taguchi-based AlexNet CNN | ACC = 99.6 |
| Kuang et al. [ | 2020 | LIDC-IDRI | Combination of a multi-discriminator generative adversarial network and an encoder | ACC = 95.3 |
| Lima et al. [ | 2020 | LIDC-IDRI | SVM with Gaussian kernel + Relief + Evolutionary Genetic Algorithm | AUC = 85.6 |
| Veasey et al. [ | 2020 | NLST | Recurrent neural network with 2D CNN | PPV = 55.9 (t0), 66.9 (t1) |
| Bansal et al. [ | 2020 | LUNA16 | Deep3DSCan | TPR = 87.1 |
| Zhai et al. [ | 2020 | LUNA16 LIDC-IDRI | Multi-task learning CNN | TPR = 84.0 (LUNA16), |
| Paul et al. [ | 2020 | NLST | Ensemble of CNNs | ACC = 90.3 |
| Ali et al. [ | 2020 | LIDC-IDRI LUNGx | Transferable texture CNN | ACC = 96.6 (LIDC-IDRI), |
| Silva et al. [ | 2020 | LIDC-IDRI | Transfer learning (convolutional autoencoder) | AUC = 93.6 |
| Xia et al. [ | 2021 | LIDC-IDRI | Gradient boosting machine algorithm | ACC = 91.9 |
ACC: accuracy; AUC: area under the ROC curve; FNR: false negative rate; FPR: false positive rate; PPV: positive predictive value; TNR: true negative rate; TPR: true positive rate.
Overview of published works regarding conventional methodologies for the segmentation of lung CT images (2014–2021).
| Authors | Year | Dataset | Methods | Performance Results (%) |
|---|---|---|---|---|
| Lai and Wei [ | 2014 | Private (10 patients) | Filtering process + morphological operations (threshold, region filling, closing) | TPR = 97.0 |
| Li et al. [ | 2015 | Private (15 patients) | Edge-based recursive geometric active contour (GAC) model | OV = 98.0 |
| Shi et al. [ | 2016 | Private (23 patients) | Histogram thresholding + region growing and random walk | OR = 1.87 |
| Zhang et al. [ | 2017 | LIDC-IDRI | Region- and edge-based GAC (REGAC) method | DSC = 97.7 |
| Rebouças Filho et al. [ | 2017 | Private (40 patients) | 3D ACACM | F-score = 99.2 (ACACM), |
| Oliveira et al. [ | 2018 | VISCERAL Anatomy3 | Multi-atlas alignment + label fusion (voting and statistical selection) | DSC = 97.4 (LL), |
| Chen et al. [ | 2021 | LOLA11 Private (65 patients) | Random walker | (Private) |
AAE: average area error; ABD: absolute border distance; ACM: active contour method; DSC: Sørensen–Dice coefficient; LL: left lung; LSCPM: level-set based on coherent propagation method; HD: Hausdorff distance; OR: over-segmentation rate; OV: overlap volume; RG: region growing; RL: right lung; TPR: true positive rate; TNR: true negative rate; UR: under-segmentation rate.
Overview of published works regarding learning-based methodologies for the segmentation of lung CT images (2019–2021).
| Authors | Year | Dataset | Methods | Performance Results (%) |
|---|---|---|---|---|
| Dong et al. [ | 2019 | LCTSC | U-net generator with a FCN discriminator | DSC = 97.0 |
| Feng et al. [ | 2019 | LCTSC | Two-stage segmentation process with 3D U-net | DSC = 97.2 (RL), |
| Park et al. [ | 2019 | LCTSC Private (30 patients) | U-net | DSC = 98.8 |
| Hofmanninger et al. [ | 2020 | LCTSC, LTRC, VISCERAL, VESSEL12 Private (5300 patients) | U-net, ResUNet, Dilated residual network-D-22, DeepLab v3+ | (merged dataset) |
| Yoo et al. [ | 2020 | HUG-ILD Private (203 patients) | 2D and 3D U-net | (Private - 2D; 3D) |
| Khanna et al. [ | 2020 | LUNA16 VESSEL12 2HUG-ILD | ResUNet + false positive removal algorithm | (LUNA16) |
| Shi et al. [ | 2020 | StructSeg 2019 | TA-Net | DSC = 96.8 (LL), |
| Nemoto et al. [ | 2020 | NSCLC-Radiomics | 2D and 3D U-net | DSC = 99.0 (2D/3D U-net) |
| Zhang et al. [ | 2020 | Lung dataset (Kaggle “Finding and Measuring Lungs in CT Data” competition) | Dense-Inception U-net (DIU-net) | DSC = 98.6 |
| Vu et al. [ | 2020 | Private (168 patients) | U-net with pre-trained VGG16 | DSC = 97.0 (RL and LL) |
| Liu et al. [ | 2020 | HUG-ILD | Random forest fusion classification of deep, texture and intensity features | DSC = 96.4 |
| Hu et al. [ | 2020 | Private (39 patients) | Mask R-CNN + supervised and unsupervised classifiers | DSC = 97.3 |
| Han et al. [ | 2020 | Private | Xception + VGG with SVM-RBF Detectron2 + contour fine-tuning | DSC = 97.0 |
| Xu et al. [ | 2021 | Private (217 patients) COVID-19-CT-Seg HUG-ILD VESSEL12 | Boundary-Guided Network (BG-Net) | DSC = 98.6 (Private), |
| Jalali et al. [ | 2021 | LIDC-IDRI | ResBCDU-Net | DSC = 97.1 |
| Wang et al. [ | 2021 | Lung dataset (Kaggle “Finding and Measuring Lungs in CT Data” competition) | HDA-ResUNet | DSC = 97.9 |
| Tan et al. [ | 2021 | LIDC-IDRI QIN lung CT dataset | LGAN | (LIDC-IDRI) |
| Pawar and Talbar [ | 2021 | HUG-ILD | LungSeg-Net | DSC = 96.3 (Fibrosis), |
| Cao et al. [ | 2021 | StructSeg 2019 | C-SE-ResUNet | DCS = 97.0 (LL) |
ACC: accuracy; AUC: area under the ROC curve; DSC: Sørensen–Dice coefficient; HD: Hausdorff distance; IOU: intersection over union; JI: Jaccard index; LL: left lung; OR: over-segmentation rate; PPV: positive predictive vale; RL: right lung; TNR: true negative rate; TPR: true positive rate; UR: under-segmentation rate.
Overview of published studies regarding predictive models for gene mutation status based on nodule features (2017–2021).
| Authors | Year | Dataset | Methods | Performance Results (%) |
|---|---|---|---|---|
| Zou et al. [ | 2017 | Private | Multivariable analyses | |
| Cheng et al. [ | 2017 | Private | Weighted mean difference, inverse variance | |
| Li et al. [ | 2018 | Private | Random forest/CNNs | |
| Koyasu et al. [ | 2019 | NSCLC-radiogenomics | XGBoost/random forest | |
| Wang et al. [ | 2019 | Private | CNNs | |
| Zhao et al. [ | 2019 | TCIA and private | 3D DenseNet | |
| Moreno et al. [ | 2021 | NSCLC-radiogenomics | SCAV with ML/CNN | |
| Zhang et al. [ | 2021 | Private | Machine learning | |
| Le et al. [ | 2021 | NSCLC-radiogenomics | LR / KNN / RF / XGBoost | |
| Cheng et al. [ | 2021 | Private | Pre-trained 3D DenseNet | |
| Zhang et al. [ | 2021 | Private | Logistic regression | |
| Han et al. [ | 2021 | Private | Logistic Regression |
ACC: Accuracy; AUC: area under the ROC curve; KNN: K-nearest neighbors; LR: logistic regression; MLP: multilayer perceptron; OR: odds ratio; RF: random forest; SCAV: selective class average voting; SE-CNN: squeezeand-excitation convolutional neural network; SVM: support vector machine; XGBoost: extreme gradient boosting.
Overview of published studies regarding predictive models for gene mutation status based on nodule and extra nodule features (2017–2021).
| Authors | Year | Dataset | Methods | Performance Results (%) |
|---|---|---|---|---|
| Gevaert et al. [ | 2017 | Private | Decision Tree | |
| Cao et al. [ | 2018 | Private | Principal component analysis | |
| Rizzo et al. [ | 2019 | Private | Univariate analysis | |
| Pinheiro et al. [ | 2019 | NSCLC-radiogenomics | Gradient tree boosting | |
| Xiong et al. [ | 2019 | Private | ResNet 101 | |
| Silva et al. [ | 2021 | LIDC-IDRI | Convolutional autoencoder | |
| Morgado et al. [ | 2021 | NSCLC-radiogenomics | LR, Elastic Net, Linear SVM, RBG SVM, RF, and XGBoost |
AUC: area under the ROC curve; LR: logistic regression; RF: random forest; SVM: support vector machine; TNR: true negative rate; TPR: true positive rate.
Overview of published works regarding the prediction of PD-L1 expression status in lung cancer CT images (2017–2021).
| Authors | Year | Dataset | Methods | Performance Results (%) |
|---|---|---|---|---|
| Toyokawa et al. [ | 2017 | Private | Fisher’s exact test | PD-L1+ statistical |
| Wu et al. [ | 2019 | Private | Univariate/multivariate LR | AUC = 78.3 |
| Zhu et al. [ | 2020 | Private | Univariate/multivariate LR | AUC = 78.0 |
| Jiang et al. [ | 2020 | Private | Random forest | AUC = 97.0 (≥1%) |
| Tian et al. [ | 2021 | Private | Fully connected classifier | AUC = 76.0 |
| Yang et al. [ | 2021 | Private | Simple temporal attention (SimTA) module | AUC = 77.0 (SimTA60) |
| Jiang et al. [ | 2021 | Private | Random forest | (Internal validation) |
ACC: accuracy; AUC: area under the ROC curve; LR: logistic regression; RNN: recurrent neural network; SimTAx: response prediction x days post immunotherapy; TNR: true negative rate; TPR: true positive rate.