| Literature DB >> 35910809 |
Jun Lu1,2, Yiran Zhou1, Wenzhi Lv3, Hongquan Zhu1, Tian Tian1, Su Yan1, Yan Xie1, Di Wu1, Yuanhao Li1, Yufei Liu1, Luyue Gao1, Wei Fan2, Yan Nan2, Shun Zhang1, Xiaolong Peng1,4, Guiling Zhang1, Wenzhen Zhu1.
Abstract
Rationale: Although non-contrast computed tomography (NCCT) is the recommended examination for the suspected acute ischemic stroke (AIS), it cannot detect significant changes in the early infarction. We aimed to develop a deep-learning model to identify early invisible AIS in NCCT and evaluate its diagnostic performance and capacity for assisting radiologists in decision making.Entities:
Keywords: acute ischemic stroke; artificial intelligence; deep-learning; diagnosis; non-contrast computed tomography
Mesh:
Year: 2022 PMID: 35910809 PMCID: PMC9330528 DOI: 10.7150/thno.74125
Source DB: PubMed Journal: Theranostics ISSN: 1838-7640 Impact factor: 11.600
Figure 1Inclusion and exclusion workflow. AIS = acute ischemic stroke, INP = imaging-negative patients
Figure 2Model training process. The deep learning model is comprised of a localization model and a classification model. To train the localization model, the labeled AIS NCCT slices (green box) and negative NCCT slices were input, and the suspected regions were labeled and output (red box). The regions were cropped from the slice and normalized then input the classification model for training, output the diagnosis probabilities of AIS (AIS-risk) in these regions. The final results of this model are the lesions' location and probabilities of AIS (yellow box with AIS-risk). AIS = acute ischemic stroke, INP = imaging-negative patients.
Patient demographic data
| Institution A (n = 986) | Institution B (n = 150) |
| |
|---|---|---|---|
| Subjects characters* | |||
| Diagnosis (n) | < 0.0001¥ | ||
| AIS (slices) | 728 (2580) | 74 (2006) | |
| INP (slices) | 258 (6445) | 76 (1954) | |
| Age, years, median [IQR] | 55 [47-65] | 63 [53-75] | < 0.0001§ |
| Gender (n) | 0.8690¥ | ||
| Male | 664 | 100 | |
| Female | 322 | 50 | |
| Clinical information※ | |||
| TSS, hours, median [IQR] | 8.03 [3.50-18.69] | 9.50 [5.75-16.07] | 0.7020§ |
| NIHSS, median [IQR] | 4.00 [2.00-7.00] | 3.00 [2.00-4.50] | 0.0090§ |
| Lesions features※ | |||
| Number (n) | 0.3010¥ | ||
| Single | 509 | 56 | |
| Multiple | 219 | 18 | |
| Location (n) | 0.0050¥ | ||
| Left | 341 | 32 | |
| Right | 331 | 28 | |
| Bilateral | 56 | 14 | |
| Size (mm) | < 0.0001¥ | ||
| 0-10 | 257 | 42 | |
| 10-30 | 522 | 50 | |
| 30-50 | 151 | 8 | |
| 50-100 | 102 | 3 | |
| > 100 | 32 | 0 | |
| Area (n) | < 0.0001¶ | ||
| Frontal lobe | 176 | 10 | |
| Parietal lobe | 162 | 3 | |
| Temporal lobe | 188 | 8 | |
| Occipital lobe | 70 | 4 | |
| Insular lobe | 105 | 3 | |
| Basal ganglia | 273 | 35 | |
| Corona radiata | 247 | 41 | |
| Centrum semiovale | 88 | 6 | |
| Pons | 54 | 6 | |
| Mesencephalon | 50 | 7 | |
| Cerebellum | 45 | 6 | |
| Cerebral peduncle | 8 | 5 | |
| Thalamus | 64 | 4 | |
| Corpus callosum | 19 | 2 | |
| Hippocampus | 12 | 2 | |
| Periventricular | 20 | 2 |
AIS = acute ischemic stroke, NIP = imaging-negative patients, TTS = time from onset to scan,
NIHSS = National Institute of Health stroke scale, IQR = interquartile range
* For the entire cohort of 1136 subjects. ※ For the lesions in 802 AIS patients.
¥: The Pearson's chi-squared test was performed.
§: The Mann-Whitney U test was performed.
¶: The Fisher's exact test was performed.
Performance of deep learning model and two experienced radiologists in training and internal validation cohort
| Results (n) | Test performance (%) | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| TP | TN | FP | FN | AUC [95%CI] | Sensitivity [95%CI] | Specificity [95%CI] | Accuracy [95%CI] |
| |
| Training | 1368 | 5051 | 113 | 696 | 82.05 [81.01-83.08] | 66.28 (1368/2064) [64.10-68.32] | 97.81 (5051/5164) [97.41-98.20] | 88.81 (6419/7228) [88.06-89.53] | |
| Internal Validation | |||||||||
| Deep-learning model | 356 | 1268 | 23 | 160 | 83.61 [81.58-85.64] | 68.99 (356/516) [65.12-73.06] | 98.22 (1268/1291) [97.44-98.92] | 89.87 (1624/1807) [88.39-91.23] | |
| Radiologist 1 | 181 | 1239 | 52 | 335 | 65.52 [63.40-67.65] | 35.08 (181/516) [30.81-39.34] | 95.97 (1239/1291) [94.89-96.98] | 78.58 (1420/1807) [76.62-80.45] | < 0.0001 |
| Radiologist 2 | 115 | 1248 | 43 | 401 | 59.48 [57.62-61.34] | 22.29 (115/516) [18.60-25.78] | 96.67 (1248/1291) [95.66-97.60] | 75.43 (1363/1807) [73.38-77.40] | < 0.0001 |
TP = true positive, TN = true negative, FP = false positive, FN = false negative
†: compare between radiologists and deep learning model.
Delong's test was used to compare the AUCs.
Performance of the deep learning model and two experienced radiologists in external validation cohort
| Results (n) | Test performance (%) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| TP | TN | FP | FN | AUC [95%CI] | Sensitivity [95%CI] | Specificity [95%CI] | Accuracy [95%CI] |
|
| |
| Based on slices | ||||||||||
| Deep-learning model | 97 | 3412 | 394 | 57 | 76.32 [72.46-80.17] | 62.99 (97/154) [55.19-70.13] | 89.65 (3412/3806) [88.68-90.57] | 88.61 (3509/3960) [87.58-89.58] | ||
| Radiologist 1 | 48 | 3686 | 120 | 106 | 64.01 [60.33-67.69] | 31.17 (48/154) [24.03-38.33] | 96.85 (3686/3806) [96.30-97.40] | 94.29 (3734/3960) [93.52-94.50] | < 0.0001 | < 0.0001 |
| Radiologist 1 | 107 | 3533 | 273 | 47 | 81.15 [77.48-84.83] | 69.48 (107/154) [62.34-76.62] | 92.83 (3533/3806) [91.96-93.59] | 91.92 (3640/3960) [91.03-92.75] | < 0.0001 | |
| Radiologist 2 | 51 | 3641 | 165 | 103 | 64.39 [60.65-68.13] | 33.12 (51/154) [25.97-40.91] | 95.66 (3641/3806) [95.01-96.32] | 93.23 (3692/3960) [92.41-94.00] | < 0.0001 | < 0.0001 |
| Radiologist 2 | 110 | 3510 | 296 | 44 | 81.83 [78.22-85.43] | 71.43 (110/154) [64.29-78.57] | 92.22 (3510/3806) [91.33-93.09] | 91.41 (3620/3960) [90.50-92.27] | < 0.0001 | |
| Based on patients | ||||||||||
| Radiologist 1 | 14 | 46 | 30 | 60 | 39.72 [32.60-46.85] | 63.51 (47/74) [52.67-74.32] | 60.53 (46/76) [48.68-71.05] | 62.00 (93/150) [53.72-69.79] | < 0.0001 | |
| Radiologist 1 | 40 | 66 | 10 | 34 | 70.45 [63.57-77.33] | 97.30 (72/74) [93.24-100.00] | 86.84 (66/76) [78.95-94.74] | 92.00 (138/150) [86.44-95.80] | ||
| Radiologist 2 | 16 | 42 | 34 | 58 | 38.44 [31.10-45.79] | 62.16 (46/74) [51.35-72.97] | 55.26 (42/76) [43.42-67.11] | 58.67 (88/150) [50.35-66.64] | < 0.0001 | |
| Radiologist 2 + model | 41 | 55 | 21 | 33 | 63.89 [56.26-71.51] | 97.30 (72/74) [93.24-100.00] | 72.37 (55/76) [61.84-82.89] | 84.67 (127/150) [77.89-90.02] | ||
TP = true positive, TN = true negative, FP = false positive, FN = false negative
†: compare between radiologists and deep learning model.
$: compare between the radiologists and radiologists +model.
Delong's test was used to compare the AUCs.
Figure 3Comparison of performance between the deep-learning model and experienced radiologists. A. Receiver operating characteristic curve (ROC) of the deep-learning model and that from the two experienced radiologists for AIS detection based on slices in the internal validation cohort; B. ROC of the deep-learning model and that from the two experienced radiologists with and without the assistance of the model for AIS detection based on slices in the external validation cohort; C. ROC of each of the two experienced radiologists with and without the assistance of the model for AIS detection based on patients in the external validation cohort; D. Sensitivity, specificity, and accuracy of the deep-learning model and that from the two experienced radiologists for AIS detection in the internal validation cohort; E. Sensitivity, specificity, and accuracy of each of the two experienced radiologists with and without the assistance of the model for AIS detection based on patients in the external validation cohort.* 0.01 ≤ P < 0.05; **0.001 ≤ P < 0.01; *** P < 0.001.
Figure 4Examples. Images show the results of two experienced radiologists without and with the assistance of the deep-learning model for diagnosing. The first two cases are AIS patients and the last was imaging-negative patient. Within each case, left column shows a NCCT image, second column shows the result of radiologist read, middle column shows result of model diagnose (the yellow box represents the location of AIS and the probabilities calculated by the model in this region are 0.99, 0.93), fourth column shows the result of “radiologist + model” (the green circle or line), and right column shows corresponding DWI.