| Literature DB >> 34926501 |
Yu-Han Zhang1, Xiao-Fei Hu1, Jie-Chao Ma2, Xian-Qi Wang1, Hao-Ran Luo1, Zi-Feng Wu2, Shu Zhang2, De-Jun Shi2, Yi-Zhou Yu2, Xiao-Ming Qiu3, Wen-Bing Zeng4,5, Wei Chen1, Jian Wang1.
Abstract
Objective: To assess the performance of a novel deep learning (DL)-based artificial intelligence (AI) system in classifying computed tomography (CT) scans of pneumonia patients into different groups, as well as to present an effective clinically relevant machine learning (ML) system based on medical image identification and clinical feature interpretation to assist radiologists in triage and diagnosis.Entities:
Keywords: COVID-19; computed tomography; deep learning; pneumonia; pulmonary infectious disease
Year: 2021 PMID: 34926501 PMCID: PMC8677931 DOI: 10.3389/fmed.2021.753055
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Figure 1A flow diagram of the patient selection process with inclusion and exclusion criteria is shown. Between January 2011 and February 2020, this study included 2,195 patients from three institutions, 1,431 of whom were finally used to construct the classification system.
Figure 2AI framework of the study. (A) Using multi-layer raw pneumonia CT images and associated etiology labels as input, the data will be flipped horizontally as data augmentation to increase the number of training samples and reduce the possibility of over-fitting. (B) The proposed DL system is composed of two models: a classification model and a segmentation model. The classifier can detect abnormal slices and predict pathogen type scores, while the segmentation model extracts CT image features (lung lobe and the contour of the lesions). (C) The voting schema calculates the ratio of particular positive slices and votes on their patient-wise triage base on the image-wise diagnosis score. (D) The ML system is trained with the clinical factors and CT features quantified by the DL system. (E) Patients CT factors and clinical features distribution and probabilities evaluated by characteristics network to assist radiologists in understanding the predicted results produced by the systems.
Summary of training + validation and testing datasets by four pathogenic types.
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| |
| Patients | 688 (51.4%) | 293 (21.9%) | 287 (21.4%) | 70 (5.2%) | – | 47 (50.5%) | 23 (24.7%) | 19 (20.4%) | 4 (4.3%) | – |
| Scans | 1,925 (59.3%) | 722 (22.2%) | 480 (14.8%) | 118 (3.6%) | – | 128 (58.9%) | 55 (25.1%) | 27 (12.3%) | 8 (3.7%) | – |
| Slices | 374,862 | 89,310 | 55,328 | 14,671 | – | 40,446 | 7,630 | 3,159 | 1,216 | – |
| Male | 386 (56.1%) | 154 (52.5%) | 152 (52.9%) | 37 (52.8%) | 0.75 | 24 (51.1%) | 13 (56.5%) | 11 (57.9%) | 2 (50.0%) | 0.17 |
| Age ≥60 years old | 144 (20.9%) | 129 (44.0%) | 122 (42.5%) | 27 (38.6%) | – | 9 (19.1%) | 9 (39.1%) | 8 (42.1%) | 2 (50.0%) | – |
| Age <60 years old | 544 (79.1%) | 173 (56.0%) | 165 (57.5%) | 43 (61.4%) | – | 38 (80.9%) | 14 (60.9%) | 11 (57.9%) | 2 (50.0%) | – |
| Mean age | 47.6 ± 14.7 | 55.2 ± 16.2 | 56.6 ± 14.3 | 52.8 ± 18.8 | <0.001 | 45.8 ± 14.6 | 57.9± 15.0 | 58.9 ± 13.8 | 52.1± 21.4 | <0.001 |
Data presented as n (%) unless otherwise indicated. Mean ages are reported as mean ± standard deviation. COV-19, COVID-19 pneumonia; common, common virus pneumonia.
The performance of the DL system in making multi-pathogenic types classification based on the CT cohort.
|
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| ||
| Testing | AUC (95%CI) | 0.984 | 0.980 | 0.988 | 0.985 | 0.983 | 0.987 | 0.990 | 0.979 |
| Accuracy | 0.939 | 0.926 | 0.953 | 0.937 | 0.931 | 0.932 | 0.938 | 0.944 | |
| Sensitivity | 0.931 | 0.916 | 0.945 | 0.961 | 0.918 | 0.989 | 0.981 | 0.957 | |
| Specificity | 0.945 | 0.937 | 0.953 | 0.938 | 0.946 | 0.927 | 0.936 | 0.943 | |
| Testing | AUC | 0.988 | 0.993 | 0.983 | 0.989 | 0.995 | 0.986 | 0.984 | 0.991 |
| Accuracy | 0.961 | 0.959 | 0.963 | 0.954 | 0.968 | 0.959 | 0.932 | 0.959 | |
| Sensitivity | 0.959 | 0.952 | 0.964 | 0.978 | 0.984 | 0.965 | 0.964 | 1.000 | |
| Specificity | 0.965 | 0.967 | 0.963 | 0.947 | 0.946 | 0.957 | 0.927 | 0.958 | |
Figure 3The individual ROC curves of our image-based DL system and clinical-joint ML system in classifying the four pathogens of pneumonia on testing dataset. In the observer performance test, the AI system performed much better than all reader groups in terms of four type classification.
Lesion characteristics in CT image of different types of pneumonia.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Characteristics | Patients | 670 | 268 | 241 | 70 | – |
| Age (year) | 47.5 + 14.7 | 55.7 + 15.0 | 57.0 + 14.2 | 52.8 + 18.9 | 0.6 | |
| Sex (male) | 371 (55.4%) | 142 (53.0%) | 126 (52.3%) | 37 (52.1%) | 0.80 | |
| Total lesion percent (%) | – | 3.5 (7.9%) | 15.7 (24.1%) | 5.7 (11.4%) | 7.2 (17.0%) | <0.001 |
| GGO percentage in each lung lobe (%) | LUL (%) | 3.4 (9.2%) | 12.1 (20.5%) | 3.3 (9.2%) | 4.8 (13.1%) | <0.001 |
| LLL (%) | 2.7 (9.2%) | 16.2 (23.8%) | 4.6 (12.7%) | 4.8 (13.8%) | <0.001 | |
| RUL (%) | 1.7 (7.3%) | 10.6 (19.8%) | 2.4 (6.6%) | 3.8 (12.5%) | <0.001 | |
| RML (%) | 1.3 (6.7%) | 8.4 (17.6%) | 0.5 (2.9%) | 2.7 (10.6%) | <0.001 | |
| RLL (%) | 2.7 (9.4%) | 14.0 (21.7%) | 2.8 (8.8%) | 3.3 (10.8%) | <0.001 | |
| Consolidation percentage in each lung lobe (%) | LUL (%) | 6.8 (14.5%) | 9.4 (21.5%) | 4.3 (9.4%) | 12.1 (19.2%) | <0.001 |
| LLL (%) | 13.9 (21.1%) | 13.8 (26.7%) | 16.0 (20.0%) | 12.9 (21.5%) | 0.33 | |
| RUL (%) | 8.7 (16.8%) | 11.2 (23.6%) | 6.2 (10.0%) | 13.9 (21.5%) | <0.001 | |
| RML (%) | 8.6 (18.3%) | 9.1 (20.6%) | 3.1 (10.2%) | 11.1 (21.5%) | <0.001 | |
| RLL (%) | 16.3 (22.8%) | 14.4 (25.6%) | 14.9 (18.1%) | 17.4 (25.5%) | 0.17 | |
| Density: HU distribution within lesions | >−200 HU (%) | 9 (0.4%) | 43 (6.5%) | 14 (4.0%) | 9 (4.8%) | 0.12 |
| −400~−200 HU (%) | 96 (4.8%) | 65 (9.8%) | 68 (19.3%) | 44 (23.4%) | 0.25 | |
| −600~−400 HU (%) | 678 (34.0%) | 264 (40.0%) | 171 (48.4%) | 84 (44.7%) | <0.001 | |
| < −600 HU (%) | 1,209 (60.7%) | 287 (43.5%) | 100 (28.3%) | 51 (27.1%) | <0.001 | |
| Location: distance from lesion to pulmonary pleurae | Lesion distance (mm) | 4.8 ± 4.3 | 1.3 ± 2.6 | 2.2 ± 3.4 | 3.0 ± 3.3 | <0.001 |
GGO, ground glass opacity; RUL, right upper lobe; RML, right middle lobe; RLL, right lower lobe; LUL, left upper lobe; LLL, left lower lobe; HU, hounsfield unit. Data presented as n (%) unless otherwise indicated. Lesion distance are reported as mean ± standard deviation.
Receiver operating characteristic (ROC) of the Image-based model and Clinical-joint model.
|
|
|
| |
|---|---|---|---|
| COV-19 | 0.995 (0.990, 0.998) | 0.997 (0.995, 1.000) | 0.032 |
| Common | 0.986 (0.977, 0.995) | 0.994 (0.990, 0.995) | 0.018 |
| Bacterial | 0.984 (0.970, 0.995) | 0.989 (0.979, 0.997) | <0.001 |
| Fungal | 0.991 (0.978, 1.000) | 0.996 (0.989, 1.000) | <0.001 |
Figure 4Illustration of characteristics that contribute to the prediction of pneumonia using CAM and SHAP. (A) A COVID-19 patient's origin image sample. (B,C) Visualize the attention regions of a network for distinct abnormality and disease categories. (D) The relative contribution of each CT or clinical measure to predicting the probability of pneumonia prediction. Features to the right of the risk explanation bar increased the danger, while features on the left decreased it.