| Literature DB >> 34411910 |
Xiyue Wang1, Tao Shen2, Sen Yang2, Jun Lan3, Yanming Xu4, Minghui Wang1, Jing Zhang5, Xiao Han6.
Abstract
Acute Intracranial hemorrhage (ICH) is a life-threatening disease that requires emergency medical attention, which is routinely diagnosed using non-contrast head CT imaging. The diagnostic accuracy of acute ICH on CT varies greatly among radiologists due to the difficulty of interpreting subtle findings and the time pressure associated with the ever-increasing workload. The use of artificial intelligence technology may help automate the process and assist radiologists for more prompt and better decision-making. In this work, we design a deep learning approach that mimics the interpretation process of radiologists, and combines a 2D CNN model and two sequence models to achieve accurate acute ICH detection and subtype classification. Being developed using the extensive 2019-RSNA Brain CT Hemorrhage Challenge dataset with over 25000 CT scans, our deep learning algorithm can accurately classify the acute ICH and its five subtypes with AUCs of 0.988 (ICH), 0.984 (EDH), 0.992 (IPH), 0.996 (IVH), 0.985 (SAH), and 0.983 (SDH), respectively, reaching the accuracy level of expert radiologists. Our method won 1st place among 1345 teams from 75 countries in the RSNA challenge. We have further evaluated our algorithm on two independent external validation datasets with 75 and 491 CT scans, respectively, and our method maintained high AUCs of 0.964 and 0.949 for acute ICH detection. These results have demonstrated the high performance and robust generalization ability of our proposed method, which makes it a useful second-read or triage tool that can facilitate routine clinical applications.Entities:
Keywords: Deep learning; Head CT; Image classification; Intracranial hemorrhage (ICH); Sequence model
Mesh:
Year: 2021 PMID: 34411910 PMCID: PMC8377493 DOI: 10.1016/j.nicl.2021.102785
Source DB: PubMed Journal: Neuroimage Clin ISSN: 2213-1582 Impact factor: 4.881
Fig. 1The pipeline of our study. The AI system comprises three stages. In the first stage, a 2D CNN classifier is used to extract features and produce an initial prediction of ICH and its subtypes on each input slice. The generated feature vectors for all slices of a 3D scan are fed into Sequence Model 1 (the second stage) to get more refined and spatially coherent ICH detection results on every image slice. In the third stage, the classification results of the CNN classifier and Sequence Model 1 are assembled together and passed through Sequence Model 2 to perform adaptive model averaging. The output of Sequence Model 2 gives the final ICH prediction results for every slice of the input scan. Each DICOM file represents one 3D scan, which can be considered as a sequence of 2D slices.
Dataset characteristics.
Data distribution characteristics of the utilized datasets.
| Label | Training set | Test sets | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RSNA-train | Batch-1 test set | Batch-2 test set | CQ500 | PhysioNet-ICH | ||||||||||
| Scans | Slices | Scans | Slices | Scans | Slices | Scans | Slices | Scans | Slices | |||||
| ICH | 8003 | 97103 | 879 | 10830 | 1243 | 15902 | 205 | 18774 | 36 | 318 | ||||
| EDH | 313 | 2761 | 41 | 384 | 23 | 208 | 13 | 131 | 21 | 173 | ||||
| IPH | 4796 | 32564 | 525 | 3554 | 758 | 5468 | 134 | 6323 | 16 | 73 | ||||
| IVH | 3313 | 23766 | 379 | 2439 | 616 | 4546 | 28 | 2348 | 5 | 24 | ||||
| SAH | 3549 | 32122 | 383 | 3553 | 528 | 4908 | 60 | 9590 | 7 | 18 | ||||
| SDH | 3442 | 42496 | 370 | 4670 | 503 | 6555 | 53 | 6391 | 4 | 56 | ||||
| None | 11527 | 577155 | 1335 | 67715 | 2285 | 105330 | 286 | 152616 | 39 | 2496 | ||||
| Total | 19530 | 674258 | 2214 | 78545 | 3528 | 121232 | 491 | 171390 | 75 | 2814 | ||||
Performance at different stages of the proposed method (evaluated on the RSNA batch-1 test set).
| Log_loss | AUC | Specificity | Sensitivity | |
|---|---|---|---|---|
| 2D-CNN | 0.064 | 0.961 (0.9594–0.9631) | 0.903 (0.8956–0.9147) | 0.889 (0.8775–0.8775) |
| 2D-CNN + Seq_model 1 | 0.060 | 0.967 (0.9656–0.9690) | 0.914 (0.9024–0.9284) | 0.904 (0.8903–0.9173) |
| 2D-CNN + Seq_model 1 + Seq_model 2 | 0.058 | 0.975 (0.9738–0.9765) | 0.926 (0.9140–0.9301) | 0.919 (0.9301–0.9308) |
| Model_ensemble | 0.054 | 0.988 (0.9873–0.9889) | 0.944 (0.9371–0.9466) | 0.950 (0.9460–0.9575) |
Fig. 2ROC curves of our AI algorithm evaluated on three test data sets: the RSNA batch-1 test set, the PhysioNet-ICH, and the CQ500 datasets for predicting (a) ICH, (b) EDH, (c) IPH, (d) IVH, (e) SAH, and (f) SDH. The “RSNA-1” means the batch-1 test set from the 2019-RSNA Brain CT Hemorrhage Challenge.
Performance of the proposed AI system for automatic ICH detection and subtype classification on the 2019-RSNA challenge, PhysioNet-ICH, and CQ500 datasets.
Fig. 3Visualization of model predictions using saliency maps on the CQ500 dataset (left panel (a)–(e)) and the PhysioNet-ICH dataset (right panel (f)–(j)). In each subfigure, the original CT slice (left column), the corresponding saliency map (middle column), and the manual annotation by experienced radiologists (right column) are shown. In the saliency maps, warmer colors represent more relevant regions for the model prediction of the corresponding bleeding type. The same color scale is used for all the saliency maps, which is indicated by the color bar. The brain window is used for displaying the CT images.