| Literature DB >> 30720861 |
Wenya Linda Bi1, Ahmed Hosny2, Matthew B Schabath3, Maryellen L Giger4, Nicolai J Birkbak5,6, Alireza Mehrtash7,8, Tavis Allison9,10, Omar Arnaout1, Christopher Abbosh11,12, Ian F Dunn13, Raymond H Mak14, Rulla M Tamimi15, Clare M Tempany16, Charles Swanton17,18, Udo Hoffmann19, Lawrence H Schwartz20,21, Robert J Gillies22, Raymond Y Huang23, Hugo J W L Aerts24,25.
Abstract
Judgement, as one of the core tenets of medicine, relies upon the integration of multilayered data with nuanced decision making. Cancer offers a unique context for medical decisions given not only its variegated forms with evolution of disease but also the need to take into account the individual condition of patients, their ability to receive treatment, and their responses to treatment. Challenges remain in the accurate detection, characterization, and monitoring of cancers despite improved technologies. Radiographic assessment of disease most commonly relies upon visual evaluations, the interpretations of which may be augmented by advanced computational analyses. In particular, artificial intelligence (AI) promises to make great strides in the qualitative interpretation of cancer imaging by expert clinicians, including volumetric delineation of tumors over time, extrapolation of the tumor genotype and biological course from its radiographic phenotype, prediction of clinical outcome, and assessment of the impact of disease and treatment on adjacent organs. AI may automate processes in the initial interpretation of images and shift the clinical workflow of radiographic detection, management decisions on whether or not to administer an intervention, and subsequent observation to a yet to be envisioned paradigm. Here, the authors review the current state of AI as applied to medical imaging of cancer and describe advances in 4 tumor types (lung, brain, breast, and prostate) to illustrate how common clinical problems are being addressed. Although most studies evaluating AI applications in oncology to date have not been vigorously validated for reproducibility and generalizability, the results do highlight increasingly concerted efforts in pushing AI technology to clinical use and to impact future directions in cancer care.Entities:
Keywords: artificial intelligence; cancer imaging; clinical challenges; deep learning; radiomics
Mesh:
Year: 2019 PMID: 30720861 PMCID: PMC6403009 DOI: 10.3322/caac.21552
Source DB: PubMed Journal: CA Cancer J Clin ISSN: 0007-9235 Impact factor: 508.702
Figure 1Artificial Intelligence Applications in Medical Imaging as Applied to Common Cancers. Artificial intelligence tools can be conceptualized to apply to 3 broad categories of image‐based clinical tasks in oncology: 1) detection of abnormalities; 2) characterization of a suspected lesion by defining its shape or volume, histopathologic diagnosis, stage of disease, or molecular profile; and 3) determination of prognosis or response to treatment over time during monitoring. 2D indicates 2‐dimensional; 3D, 3‐dimensional; CNS, central nervous system.
Figure 2Potential Enhanced Clinical Workflow With Artificial Intelligence (AI) Interventions. The traditional paradigm for patients with tumors entails initial radiologic diagnosis of a mass lesion, a decision to treat or observe based on clinical factors and patient preference, a definitive histopathologic diagnosis only after obtaining tissue, molecular genotyping in centers with such resources, and determination of clinical outcome only after the passage of time. In contrast, AI‐based interventions offer the potential to augment this clinical workflow and decision making at different stages of oncological care. Continuous feedback and optimization from measured outcomes may further improve AI systems.
Summary of Key Studies on the Role of Artificial Intelligence in Imaging of Lung Cancer, as Applied to Detection, Diagnosis, and Characterization, and Predicting Prognosis and Treatment Response
| REFERENCE | TUMOR(S) STUDIED | APPLICATION | NO. OF CASES | IMAGING MODALITY | MACHINE LEARNING ALGORITHM | IMAGING FEATURE TYPE | TYPE OF VALIDATION | RESULTS |
|---|---|---|---|---|---|---|---|---|
| Cancer detection | ||||||||
| Hawkins 2016 | NSCLC | Risk of lung cancer in screening/early detection | 600 | CT | Random forests classifier | Predefined radiomic features | Independent validation within ACRIN 6684 | AUC, 0.83 |
| Liu 2017 | NSCLC | Predict lung cancer in indeterminate pulmonary nodules | 172 | CT | Multiple supervised technique | Semantic | Independent validation with single‐center data | AUC, 0.88; ACC, 81%; Sn, 76.2%; Sp, 91.7% |
| Ciompi 2017 | Benign vs malignant lung lesions | Predict lung cancers in screening | 1411 | CT | SVM | Deep learning radiomics | Independent validation with multicenter data | ACC, 73% |
| Diagnosis and characterization | ||||||||
| Yamamoto 2014 | NSCLC | Discriminate ALK+ from non‐ALK tumors | 172 | CT | Random forests classifier | Semantic | Independent validation with multicenter data | Discriminatory power for ALK+ status: Sn, 83.3%; Sp, 77.9%; ACC, 78.8% |
| Maldonado 2015 | Lung adenocarcinoma | Differentiate indolent vs aggressive adenocarcinoma | 294 | CT | Previously built CANARY model | Semantic (CANARY) | Independent validation with single‐center data | Progression‐free survival curve HR ( |
| Grossmann 2017 | NSCLC | Predict molecular and cellular pathways | 351 | CT | SVM | Predefined radiomic features | Independent validation with multicenter data | Autodegration pathway prediction (AUC, 0.69; |
| Rios Velazquez 2017 | NSCLC | Predict mutational status | 763 | CT | Random forests classifier | Predefined radiomic features | Independent validation with multicenter data | EGFR+ and EGFR− cases (AUC, 0.69); EGFR+ and KRAS+ tumors (AUC, 0.80) |
| Predicting treatment response and prognosis | ||||||||
| Aerts 2014 | NSCLC, head and neck cancer | Prognostic biomarkers | 1019 | CT | Regression | Predefined radiomic features | Independent validation with multicenter data | Prognostic CI, 0.70; CI, 0.69 |
| Coroller 2015 | Lung adenocarcinoma | Predict distant metastasis | 182 | CT | Regression | Predefined radiomic features | Independent validation with single‐center data | CI, 0.61; P = 1.79 × 10−17 |
| Sun 2018 | NSCLC | Predict the immune phenotype of tumors and clinical outcomes | 491 | CT | Regression | Predefined radiomic features | Independent validation with multicenter data | AUC, 0.67; 95% CI, 0.57‐0.77; |
Abbreviations: ACC, accuracy; ACRIN, American College of Radiology Imaging Network; ALK+, anaplastic lymphoma kinase positive; AUC, area under curve; CANARY, Computer‐Aided Nodule Assessment and Risk Yield; CI, concordance index; CT, computed tomography; EGFR+/EGFR−, epidermal growth factor receptor positive/negative; HR, hazard ratio; KRAS, KRAS proto‐oncogene, guanosine‐triphosphatase; NSCLC, non‐small cell lung cancer; Sn, sensitivity; Sp, specificity; SVM, support vector machine.
Figure 3Clinical Applications of Artificial Intelligence in Lung Cancer Screening on Detection of Incidental Pulmonary Nodules. Imaging analysis shows promise in predicting the risk of developing lung cancer on initial detection of an incidental lung nodule and in distinguishing indolent from aggressive lung neoplasms. PFS indicates progression‐free survival; ROC, receiver operating characteristic.
Figure 4Applications of Noninvasive Monitoring During the Course of Cancer Evolution. Cancers share a common theme in developing intratumoral heterogeneity during their natural history. The presence of subclones (represented by different colors) confers significant implications in the response to treatment and may be difficult to capture through standard biopsies. Imaging and blood biomarkers during disease monitoring offer a potential technological solution for detecting the presence of intratumoral heterogeneity through space and time and thereby, perhaps, a direct change in therapeutic strategies.
Summary of Key Studies on the Role of Artificial Intelligence in the Imaging of CNS Tumors, as Applied to Diagnosis, Biologic Characterization, Monitoring Treatment Response, and Predicting Outcome
| REFERENCE | TUMOR(S) STUDIED | APPLICATION | NO. OF PATIENTS | IMAGING MODALITY | MACHINE LEARNING ALGORITHM | IMAGING/RADIOMIC FEATURE TYPE | TYPE OF VALIDATION | PERFORMANCE | |
|---|---|---|---|---|---|---|---|---|---|
| Diagnosis | |||||||||
| Fetit 2015 | Medulloblastoma, pilocytic astrocytoma, ependymoma | Classification of CNS tumor subtype | 48 | MRI conventional | Multiple supervised techniques | Texture | Leave‐one‐out cross‐validation, single center | AUC, 0.91‐0.99 | |
| Coroller 2017 | Meningioma | Differentiate grade 1 vs grade 2‐3 | 175 | MRI conventional | Random forest | Radiomic and semantic features | Independent validation with single‐center data | AUC, 0.76‐0.86 | |
| Zhang 2017 | Glioma (WHO grade 2‐4) | LGG (WHO grade 2) vs HGG (grade 3‐4) | 120 | MRI conventional, perfusion, diffusion, permeability maps | SVM | Histogram, texture | Leave‐one‐out cross validation | ACC, 0.945 | |
| Zhang 2018 | Pituitary adenoma | Null cell adenoma vs other subtypes | 112 | MRI conventional | SVM | Intensity, shape, size, texture | Independent validation with single‐center data | AUC, 0.804 | |
| Kang 2018 | Glioblastoma, lymphoma | Classify glioblastoma vs lymphoma | 198 | MRI conventional, perfusion, diffusion maps | Multiple supervised techniques | Volume, shape, texture | Independent validation with multicenter data | AUC, 0.946 | |
| Biologic characterization | |||||||||
| Korfiatis 2016 | Glioblastoma |
| 155 | MRI conventional | SVM, random forest | Texture | Cross‐validation, single center | AUC, 0.85; Sn, 0.803; Sp, 0.813 | |
| Zhou 2017 | Glioma (WHO grade 3‐4) |
| 120 | MRI conventional, apparent diffusion maps | Random forest | Histogram, texture, shape | Independent validation with single‐center data | ACC, 89%; AUC, 0.923 | |
| Zhang 2017 | Glioma (WHO grade 2‐3) | 1p/19q Chromosomal status, IDH1/IDH2 mutation status | 165 | MRI conventional | Logistic regression | VASARI features | Boot‐strap validation, single center | AUC, 0.86 | |
| Chang 2018 | Glioma (WHO grade 2‐4) |
| 496 | MRI conventional, apparent diffusion maps | Deep learning ResNet | Histogram, texture, shape | Independent validation with multicenter data | ACC, 89%; AUC, 0.95 | |
| Monitoring treatment response | |||||||||
| Larroza 2015 | Brain metastases | Classify tumor vs radiation necrosis | 73 | MRI conventional | SVM | Texture | Cross‐validation, single center | AUC, >0.9 | |
| Tiwari 2016 | Glioma and brain metastases | Classify tumor vs radiation necrosis | 58 | MRI conventional | SVM | Intensity, texture | Independent validation with multicenter data | ACC, 80% | |
| Kim 2017 | High‐grade glioma | Classify tumor vs radiation necrosis | 51 | MR diffusion, perfusion, susceptibility weighted maps | Regression | Intensity, histogram | Single‐center, prospective trial without validation | Sn, 71.9%; Sn, 100%; Sp, 100%; ACC, 82.3% | |
| Kebir 2017 | High‐grade glioma | Classify tumor vs radiation necrosis | 14 | FET PET | Unsupervised consensus clustering | Texture | Single‐center, retrospective trial without validation | Sn, 90%; Sp, 75% for detecting true progression; NPV, 75% | |
| Predicting treatment response and survival | |||||||||
| Chang 2016 | Glioblastoma | Predict OS | 126 | MRI conventional, diffusion | Random forest | Shape, intensity histogram, volume, texture | Single‐center data split into training/testing | HR, 3.64 ( | |
| Grossmann 2017 | Glioblastoma | Predict PFS and OS | 126 | MRI conventional | Unsupervised principle component feature selection, random forest supervised training | Shape, volume, texture | Multicenter, phase 2 clinical trial data split into training/testing | OS: HR, 2.5 ( | |
| Liu 2017 | Glioblastoma | Predict OS | 117 | MRI perfusion | Unsupervised consensus clustering | Histogram of intensity | Single‐center data split into training/testing | HR, >3.0; | |
Abbreviations: ACC, accuracy; AUC, area under the curve; CNS, central nervous system; FET PET, 18F‐fluoro‐ethyl‐tyrosine positron emission tomography; HGG, high‐grade glioma; HR, hazard ratio; IDH1/IDH2, isocitrate dehydrogenase 1/isocitrate dehydrogenase 2; LGG, low‐grade glioma; MGMT, O‐6‐methylguanine‐DNA methyltransferase; MRI, magnetic resonance imaging; NPV, negative predictive value; OS, overall survival; PFS, progression‐free survival; ResNet, residual network; Sn, sensitivity; Sp, specificity; SVM, support vector machine; VASARI, Visually Accessible Rembrandt Images; WHO, World Health Organization. aValidation categories included cross‐validation (within own data set), independent validation with single‐center data, and independent validation with multicenter data.
Figure 5Grad‐CAM Visualizations (Selvaraju et al 2017)127 for a Convolutional Neural Network (Chang et al 201896) Applied to 2 Examples of Isocitrate Dehydrogenase 1 (IDH1)/IDH2 Wild‐Type Glioblastoma and 2 Examples of IDH1‐Mutant Glioblastoma. Color maps are overlaid on original gadolinium‐enhanced, T1‐weighted magnetic resonance images, with red color weighted to the discriminative regions for IDH status classification.
Summary of Key Studies on Imaging Characterization of Breast Lesions, Including Detection, Diagnosis, Biologic Characterization, and Predicting Prognosis and Treatment Response
| REFERENCE | APPLICATION | NO. OF CASES | IMAGING MODALITY | MACHINE LEARNING ALGORITHM (IF APPLICABLE) | IMAGING/RADIOMIC FEATURE TYPE | RESULTS | |
|---|---|---|---|---|---|---|---|
| Detection | |||||||
| Zhang 1994 | Microcalcification detection | 34 | Mammography | Convolutional neural networks | Deep learning characterization followed by conventional image analysis | AUC, 0.91 | |
| Karssemeijer 2006 | Mass lesions | 500 | Mammography | Engineered algorithms | Engineered algorithms | Performance similar to radiology | |
| Reiner 2006 | Mass lesions | 21 | Breast tomosynthesis | Engineered algorithms | Engineered algorithms | Sn, 90% | |
| Sahiner 2012 | Microcalcifications | 72 | Breast tomosynthesis | Engineered algorithms | Engineered algorithms | Sn, 90% | |
| Diagnosis | |||||||
| Gilhuijs 1998 | Mass lesions | 27 | DCE‐MRI | Engineered algorithms | Size, shape, kinetics | AUC, 0.96 | |
| Jiang 1999 | Microcalcifications | 104 | Mammography | Engineered algorithms | Size and shape of individual microcalcifications and clusters | AUC, 0.75 | |
| Chen 2007 | Mass lesions | 121 | DCE‐MRI | Engineered algorithms | Uptake heterogeneity in cancer tumors via 3D texture analysis | 3D better compared with 2D analysis | |
| Booshan 2010 | Differentiate benign vs DCIS vs IDC | 353 | DCE‐MRI | Bayesian neural networks | Size, shape, margin morphology, texture (uptake heterogeneity), kinetics, variance kinetics | AUC, 0.79‐0.85 | |
| Jamieson 2010 | Mass lesions | 1126 | Multimodality: Mammography, breast ultrasound, and breast DCE‐MRI | t‐SNE followed by Bayesian neural networks | Multiradiomic features in nonsupervised data mining | AUC, 0.88 | |
| Nielsen 2011 | Breast cancer risk | 495 | Mammography | — | Texture analysis | AUC, 0.57‐0.66 | |
| Huynh 2016 | Mass lesions | 219 | Mammography | Deep learning | Feature extracted from transfer learning from pretrained CNN | AUC, 0.81 | |
| Andropova 2017 | Mass lesions | 1125 | Multimodality: Mammography, breast ultrasound, and breast DCE‐MRI | Deep learning | Fusion of human‐engineered computer features and those feature extracted from transfer learning from pretrained CNN | AUC: DCE‐MRI, 0.89; FFDM, 0.86; ultrasound, 0.90 | |
| Biologic characterization | |||||||
| Gierach 2014 |
| 237 | Mammography | Bayesian artificial neural network | Texture analysis | AUC, 0.68‐0.72 | |
| Li 2016 | Molecular subtype classification | 91 (from TCGA) | DCE‐MRI | Engineered features, linear discriminant analysis | Multiradiomic tumor signature, including size, shape, margin morphology, texture (uptake heterogeneity), kinetics, variance kinetics | AUC, 0.65‐0.89 | |
| Li 2017 |
| 456 | Mammography | CNNs, computerized radiographic texture analysis, SVM | Texture analysis and deep learning | AUC, 0.73‐0.86 | |
| Predicting treatment response and prognosis | |||||||
| Drukker 2018 | Prediction of recurrence‐free survival | 284 (from ACRIN 6657) | DCE‐MRI | — | .Most‐enhancing tumor volume | HR, 2.28‐4.81 | |
Abbreviations: 2D, 2‐dimensional; 3D, 3‐dimensional; ACC, accuracy; ACRIN, American College of Radiology Imaging Network; AUC, area under the curve; CNN, convolutional neural networks; DCE‐MRI, dynamic contrast‐enhanced magnetic resonance imaging; DCIS, ductal carcinoma in situ; FFDM, full‐field digital mammography; HR, hazard ratio; IDC, invasive ductal carcinoma; Sn, sensitivity; Sp, specificity; SVM, support vector machine; TCGA, The Cancer Genome Atlas; t‐SNE, t‐distributed stochastic neighbor embedding.
Figure 6Significant Associations Between Genomic Features and Radiomic Phenotypes in Breast Carcinoma Imaged With Magnetic Resonance Imaging. Gene‐set enrichment analysis (GSEA) and linear regression analysis were combined to associate genomic features, including microRNA (miRNA) expression, protein expression, and gene somatic mutations among others, with 6 categories of radiomic phenotypes. In this figure, each node represents a genomic feature or a radiomic phenotype. Each line is an identified statistically significant association, whereas nonsignificant associations are not depicted. Node size is proportional to its connectivity relative to other nodes in the category. Reprinted with permission from Maryellen L. Giger, University of Chicago (Zhu Y, Li H, Guo W, et al. Deciphering genomic underpinnings of quantitative MRI‐based radiomic phenotypes of invasive breast carcinoma [serial online]. Sci Rep. 2015;5:17787.170).