| Literature DB >> 33004526 |
Danai Khemasuwan1, Jeffrey S Sorensen2, Henri G Colt3.
Abstract
Artificial intelligence (AI) is transforming healthcare delivery. The digital revolution in medicine and healthcare information is prompting a staggering growth of data intertwined with elements from many digital sources such as genomics, medical imaging and electronic health records. Such massive growth has sparked the development of an increasing number of AI-based applications that can be deployed in clinical practice. Pulmonary specialists who are familiar with the principles of AI and its applications will be empowered and prepared to seize future practice and research opportunities. The goal of this review is to provide pulmonary specialists and other readers with information pertinent to the use of AI in pulmonary medicine. First, we describe the concept of AI and some of the requisites of machine learning and deep learning. Next, we review some of the literature relevant to the use of computer vision in medical imaging, predictive modelling with machine learning, and the use of AI for battling the novel severe acute respiratory syndrome-coronavirus-2 pandemic. We close our review with a discussion of limitations and challenges pertaining to the further incorporation of AI into clinical pulmonary practice.Entities:
Mesh:
Year: 2020 PMID: 33004526 PMCID: PMC7537944 DOI: 10.1183/16000617.0181-2020
Source DB: PubMed Journal: Eur Respir Rev ISSN: 0905-9180
Examples of machine-learning algorithms
| Regularised regression | LASSO | An extension of classic regression algorithms in which a penalty is enforced to the fitted model to minimise its complexity and reduce the risk of overfitting |
| Tree-based model | Classification and regression trees, random forest, gradient boosted trees (XGBoost) | Based on decision trees (a decision support tool which is a sequence of “if-then-else” splits are derived by iteratively separating data into groups based on the relationship between attributes and outcomes) |
| Support vector machine | Linear, hinge loss, radial basis function kernel | Represents data in a multidimensional feature space and fits a “hyperplane” that best separates data based on outcomes of interest |
| KNN | KNN | Represents data in a multidimensional feature space and uses local information about observations closest to a new dataset to predict outcomes for the new dataset |
| Neural network | Deep neural networks, ANNs | Nonlinear algorithms built using multiple layers of nodes that extract features from the data and perform combinations that best predict outcomes |
| Dimensionality reduction algorithms | Principal component analysis, linear discriminant analysis | Exploits inherent structure to transform data from high-dimensional space into a low-dimensional space which retains some meaningful attributes of the original data |
| LCA | LCA | Identifies hidden population subgroups (latent classes) in the data. Used in datasets with complex constructs that have multiple behaviours. The probability of class membership is indirectly estimated by measuring patterns in the data |
| Cluster analysis | K-means, hierarchical cluster analysis | Uses inherent structures in the data to best organise data into subgroups of maximum commonality based on some distance measure between features |
| Reinforcement learning | Markov decision process and Q learning | Provides tools to optimise sequences of decision for the best outcomes or to maximise rewards. Learns by trial and error. An action is reinforced with the action that results in a positive outcome (reward), and |
KNN: K-nearest neighbour; LCA: latent class analysis; LASSO: least absolute shrinkage and selection operator; ANN: artificial neural network.
FIGURE 1An example of an artificial neural network. Simple image classification: the model is trained on a few thousand images of planes and non-planes, then is later able to predict if a given image is a plane or not.
Examples of Conformité Européenne (CE)-marked and US Food and Drug Administration (FDA)-approved artificial intelligence (AI) algorithms
| InferRead CT Lung (InferVision) | CT lung nodule detection, report generation, multi-time point analysis and aims to aid early-stage diagnosis | Yes | Yes |
| Lung AI (Arterys) | Automatic detection of solid, part solid and ground-glass nodules and supports Lung-RADS reporting and multi-time point analysis | Yes | Yes |
| Veolity (MeVis Medical Solutions) | CT lung nodule detection, segmentation, quantification, temporal registration and nodule comparison | Yes | Yes |
| ClearReadCT compare (Riverain) | Compares, tracks nodules and provides nodule volumetric changes over time for solid, part-solid and ground-glass nodules | Yes | Yes |
| JLD-01K (JLK Inc.) | CT lung nodule detection, measures the diameter and volume of the nodules, categorises LungRADS category | Yes | No |
| VUNO Med-LungCT (VUNO) | Quantifies pulmonary nodules and automatically categorises the Lung-RADS | Yes | No |
| Veye Chest (Aidence) | CT lung nodule detection, nodule classification, volume quantification and growth calculation | Yes | No |
| RevealAI Lung (Mindshare Medical) | Provides a malignancy similarity index from lung CT scans that aids risk assessment of lung nodules | Yes | No |
| Icolung (Icometrix) | Objective quantification of disease burden in COVID-19 patients | Yes | No |
| InferRead CT Pneumonia (InferVision) | An alert system that warns if there is a suspected positive case of COVID-19 | Yes | No |
| Aidoc (Aidoc) | Analysis of CT images and flags presence of pulmonary embolism | Yes | Yes |
| LungQ (Thirona) | Lung volume segmentation and quantification, volume density analysis, airway morphology, fissure completeness analysis | Yes | Yes |
| LungPrint Discovery (VIDA) | Provides visual and quantitative information relevant to COPD and ILD. Provides high-density tissue quantification by lobe, trachea analysis and quantification | Yes | Yes |
| Lung Density Analysis (Imbio) | Provides visualisation and quantification of lung regions with abnormal tissue density. Provides a mapping of normal lung, air-trapping and areas of persistent low density | Yes | Yes |
| Lung texture analysis (Imbio) | Transforms a chest CT into a map of the lung textures to identify ILDs and other fibrotic conditions | Yes | No |
| Lung densities (Quibim) | Provides quantification of imaging biomarkers: lung volumes, vessel volumes and emphysema volume ratios | Yes | No |
| Red Dot (behold.ai) | Assessment of adult chest radiographs with features suggestive of pneumothorax | Yes | Yes |
| Triage (Zebra Medical Vision) | Identifies findings suggestive of pneumothorax based on chest radiography; outputs an alert | Yes | Yes |
| ArtiQ.PFT (ArtiQ) | Automated pulmonary function test interpretation | Yes | No |
CT: computed tomography; COVID-19: coronavirus disease 2019; ILD: interstitial lung disease; Lung-RADS: Lung Imaging Reporting and Data System.
Model training, validation, algorithm type and data source for selected studies in pulmonary medicine
| A | CT chest of lung cancer screening patients/retrospective | NLST | Prediction of cancer risk based on CT findings | Histology; follow-up | CNN | 42 290 CT images from 14 851 patients | Not reported/yes |
| B | CT chest of lung cancer screening patients/retrospective | The IDEAL study (Artificial Intelligence and Big Data for Early Lung Cancer Diagnosis) | The AUC for CNN was 89.6% (95% CI 87.6–91.5%), compared with 86.8% (95% CI 84.3–89.1%) for the Brock model (p≤0.005) | Histology; follow-up | CNN | 1397 nodules in 1187 patients | Not reported/yes |
| M | CT chest of lung cancer screening patients/retrospective | NLST for model derivation and internal validation/externally tested on cohorts from two academic institutions | The AUC for CNN was 83.5% (95% CI 75.4–90.7%) and 91.9% (95% CI 88.7–94.7%) on two different cohorts (Vanderbilt and Oxford University) | Histology; follow-up | CNN | 14 761 benign nodules from 5972 patients, and 932 malignant nodules from 575 patients | Not reported/yes |
| C | CT chest of lung cancer screening patients/retrospective | Training dataset from the Multicentric Italian Lung Detection trial and validation dataset from the Danish Lung Cancer Screening Trial | CNN can achieve performance at classifying nodule type within the interobserver variability among human experts (Cohen κ-statistics ranging from 0.58 to 0.65) | Expert consensus | CNN | 1352 nodules for training set and 453 nodules for validation set | Random split sample validation/yes |
| N | Chest radiographs to detect malignant nodule/retrospective | Analysis of data collected from Seoul National University Hospital, Boramae Hospital and National Cancer Center, University of California San Francisco Medical Center | Chest radiograph classification and nodule detection performances of deep learning-based automatic detection were a range of 0.92–0.99 (AUROC) | Expert consensus | CNN | 43 292 chest radiographs from 34 676 patients | Random split sample validation/yes |
| W | Mediastinal lymph node metastasis of NSCLC from 18F-FDG PET/CT images/retrospective | Data collected at the Affiliated Tumor Hospital of Harbin Medical University | The performance of CNN is not significantly different from classic machine-learning methods and expert radiologists | Expert consensus | CNN | 1397 lymph node stations from 168 patients | Resampling method/no |
| W | Solitary pulmonary nodule ≤3 cm, histologically confirmed adenocarcinoma/retrospective | Analysis of data collected from Fudan University Shanghai Cancer Center | Algorithm showed AUROC of 0.892, which was higher than three expert radiologists in classifying invasive adenocarcinoma from pre-invasive lesions | Histology | CNN | CT scan from 1545 patients | Random split sample validation/no |
| Z | Thin-slice chest CT scan before surgical treatment; nodule diameter ≤10 mm/retrospective | Secondary analysis of data from Huadong Hospital affiliated to Fudan University | Based on classification of tumour invasiveness, deep-learning algorithm achieved better classification performance than the radiologists (63.3% | Histology | CNN | Pre-operative thin-slice CT; 523 nodules for training/128 nodules for testing | Not reported/no |
| W | HRCT showing diffuse fibrotic lung disease confirmed by at least two thoracic radiologists/retrospective | Secondary analysis of data from La Fondazione Policlinico, Universitario and University of Parma (Italy) | Interobserver agreement between the algorithm and the radiologists’ majority opinion (n=91) was good (κw=0·69) | Expert consensus | CNN | HRCT; 929 scans for training/89 scans for validation | Not reported/yes |
| C | HRCT showing NSIP or UIP confirmed by two thoracic radiologists/retrospective | HRCT dataset from the Lung Tissue Research Consortium | Interobserver agreements between the algorithm and the radiologists’ opinion were fair to moderate (κw=0.33 and 0.47) | Expert consensus | CNN | HRCT 105 patients (54 of NSIP and 51 for UIP) | Not reported/no |
| R | The whole-transcriptome RNA sequencing data from transbronchial biopsy samples/prospective | Bronchial Sample Collection for a Novel Genomic Test (BRAVE) study in 29 US and European sites | The molecular signatures had high specificity (88%) and sensitivity (70%) against diagnostic reference pathology (ROC-AUC 0.87, 95% CI 0.76–0.98) | Histology | ML; type not reported | 94 patients in clinical utility analysis | Not reported/no |
| S | Peripheral blood biobank/prospective | Peripheral blood biobanked at Stanford University, USA and University of Sheffield, UK | Four distinct immunological clusters were identified. Cluster I had unique sets of upregulated proteins (TRAIL, CCL5, CCL7, CCL4, MIF), which was the cluster with the least favourable 5-year transplant-free survival rates (47.6%, 95% CI 35.4–64.1%) | N/A | Unsupervised ML | Blood biobanked; 281 patients for discovery cohort/104 patients for validation cohort | Resampling method/yes |
| L | Echocardiographic parameters/retrospective | King's College Hospital (UK); University Medical Center Gottingen and University of Regensburg (Germany) | Among five ML algorithms, random forest of regression trees is the best method to identify PH patients (AUC 0.87, 95% CI 0.78–0.96) with accuracy of 0.83 | Right heart catheterisation | Five ML algorithms (random forest of classification trees, random forest of regression trees, lasso-penalised logistic regression, boosted classification trees, SVM) | 90 patients with invasively determined PAP with corresponding echocardiographic estimations of PAP | Resampling method/no |
| W | 100 clinical, physiological, inflammatory and demographic variables/prospective | Severe Asthma Research Program (SARP) cohort from National Heart, Lung, and Blood Institutes (USA) | Four asthma clusters with differing CS responses were identified. Those in CS-responsive cluster were older, more nasal polyps, and high blood eosinophils. After CS, there was the highest increase in lung function in this group | N/A | Unsupervised ML; MML-MKKC | 346 adult asthmatics with paired (before and after CS) sputum data | Random split sample validation/no |
| K | 19 candidate clinical variables from retrospective cohort of patients with pleural infection | A tertiary care, university-affiliated hospital, Utah, USA | Candidate predictors of tPA/DNase failure were the presence of pleural thickening (48% relative importance) and presence of an abscess/necrotising pneumonia (24%) | N/A | Supervised ML (extreme gradient boosting and coupled with decision trees) | 84 patients with pleural infection and received intrapleural tPA/DNase | Random split sample validation/no |
| T | PFT tests and clinical diagnosis/prospective | University Hospital Leuven (Belgium) | Pulmonologists’ interpretation of PFTs matched guideline in 74.4±5.9% of cases and made correct diagnosis in 44.6±8.7% | ATS/ERS guideline and expert panel | ML; type not reported | Dataset based on 1430 historical cases/50 cases in prospective analysis | Not reported/yes |
| W | CT chest of patients with atypical pneumonia/retrospective | Xi'an Jiaotong University First Affiliated Hospital, Nanchang University First Hospital and Xi'An No.8 Hospital of Xi'An Medical College (China) | An internal validation achieved a total accuracy of 82.9% with specificity of 80.5% and sensitivity of 84%. The external testing dataset showed a total accuracy of 73.1% with specificity of 67% and sensitivity of 74% | Confirmed nucleic acid testing of SARS-CoV-2 | CNN | CT images from 99 patients, of which 44 were confirmed cases of SARS-CoV-2 | Random split sample validation/no |
| L | CT chest of patients with atypical pneumonia/retrospective | Six medical centres, China | AUC values for COVID-19 was 0.96 (95% CI 0.94–0.99). Sensitivity of 90% (95% CI 83–94%) and specificity 96% (95% CI 93–98%) | Confirmed nucleic acid testing of SARS-CoV-2 | CNN | 4356 chest CT examinations from 3322 patients | Not reported/yes |
PH: pulmonary hypertension; PFT: pulmonary function test; SARS-CoV-2: severe acute respiratory syndrome-coronavirus-2; CT: computed tomography; NLST: National Lung Screening Trial; ROC: receiver operating characteristic; AUC: area under the curve; CNN: convolutional neural network; AUROC: area under the ROC curve; NSCLC: nonsmall cell lung cancer; 18F-FDG PET: fluorine-18 2-fluoro-2-deoxy-d-glucose positron emission tomography; HRCT: high-resolution computed tomography; κw: weighted κ-coefficient; NSIP: nonspecific interstitial pneumonia; UIP: usual interstitial pneumonia; ML: machine learning; TRAIL: tumor necrosis factor-related apoptosis-inducing ligand; CCL: C-C motif chemokine ligand; MIF: macrophage migration inhibitory factor; N/A: not applicable; SVM: support vector machine; PAP: pulmonary arterial pressure; CS: corticosteroids; MML-MKKC: multiview learning-multiple Kernel k-means clustering; tPA: intrapleural tissue plasminogen activator; DNase: deoxyribonuclease; ATS: American Thoracic Society; ERS: European Respiratory Society; COVID-19: coronavirus disease 2019.
FIGURE 2An example of convolutional neural network (CNN). A sequence of layers. Each layer transforms one volume of activations to another through a differentiable function. Three main types of layers build CNN architectures: convolutional layer, pooling layer and fully connected layer. The convolution layers merge two sets of information with the use of a filter to produce a feature map as an output. The pooling layers reduce the number of parameters and computation in the network. The fully connected output layer provides the final probabilities for each label as a final classification.