| Literature DB >> 33199977 |
Bo Wang1,2,3, Shuo Jin4,5, Qingsen Yan6,3, Haibo Xu7, Chuan Luo1,8, Lai Wei4,5, Wei Zhao3, Xuexue Hou3, Wenshuo Ma9, Zhengqing Xu3, Zhuozhao Zheng4, Wenbo Sun7, Lan Lan7, Wei Zhang3,10, Xiangdong Mu4,5, Chenxi Shi9, Zhongxiao Wang9, Jihae Lee9, Zijian Jin3, Minggui Lin4, Hongbo Jin3, Liang Zhang11, Jun Guo4, Benqi Zhao4, Zhizhong Ren4, Shuhao Wang9,12, Wei Xu9, Xinghuan Wang13,14, Jianming Wang15,16, Zheng You1,2,8, Jiahong Dong4,5.
Abstract
The sudden outbreak of novel coronavirus 2019 (COVID-19) increased the diagnostic burden of radiologists. In the time of an epidemic crisis, we hope artificial intelligence (AI) to reduce physician workload in regions with the outbreak, and improve the diagnosis accuracy for physicians before they could acquire enough experience with the new disease. In this paper, we present our experience in building and deploying an AI system that automatically analyzes CT images and provides the probability of infection to rapidly detect COVID-19 pneumonia. The proposed system which consists of classification and segmentation will save about 30%-40% of the detection time for physicians and promote the performance of COVID-19 detection. Specifically, working in an interdisciplinary team of over 30 people with medical and/or AI background, geographically distributed in Beijing and Wuhan, we are able to overcome a series of challenges (e.g. data discrepancy, testing time-effectiveness of model, data security, etc.) in this particular situation and deploy the system in four weeks. In addition, since the proposed AI system provides the priority of each CT image with probability of infection, the physicians can confirm and segregate the infected patients in time. Using 1,136 training cases (723 positives for COVID-19) from five hospitals, we are able to achieve a sensitivity of 0.974 and specificity of 0.922 on the test dataset, which included a variety of pulmonary diseases.Entities:
Keywords: COVID-19; Classification; Deep learning; Medical assistance system; Neural network; Segmentation
Year: 2020 PMID: 33199977 PMCID: PMC7654325 DOI: 10.1016/j.asoc.2020.106897
Source DB: PubMed Journal: Appl Soft Comput ISSN: 1568-4946 Impact factor: 6.725
Fig. 1The framework of the proposed system.
Case-level data source distribution of the dataset by hospital.
| Hospital name | Positive cases (#Patients) | Negative cases (#Patients) |
|---|---|---|
| Beijing Tsinghua Changgung Hospital | 5 (5) | 284 (284) |
| Wuhan No.7 Hospital | 188 (187) | 141 (141) |
| Zhongnan Hospital of Wuhan University | 141 (115) | 116 (116) |
| Tianyou Hospital Affiliated to Wuhan University of Science & Technology | 378 (378) | 0 (0) |
| Wuhan’s Leishenshan Hospital | 165 (165) | 0 (0) |
| Total | 877 (850) | 541 (541) |
Fig. 4Patient-level age–gender distribution of positive cases.
Fig. 5Case-level clinical typing distribution of the dataset.
Case-level data source distribution of the dataset by CT scanner models.
| Equipment model | Positive cases (#Patients) | Negative cases (#Patients) |
|---|---|---|
| UIH uCT 760 | 13 (10) | 195 (195) |
| UIH uCT 530 | 253 (252) | 0 (0) |
| GE Optima CT660 | 378 (378) | 27 (27) |
| GE Discovery CT750 HD | 0 (0) | 86 (86) |
| GE Discovery CT | 192 (176) | 0 (0) |
| GE BrightSpeed | 0 (0) | 3 (3) |
| SIEMENS SOMATOM Definition | 22 (19) | 0 (0) |
| SIEMENS Sensation Open | 2 (2) | 0 (0) |
| Philips iCT 256 | 0 (0) | 228 (228) |
| Philips Ingenuity CT | 17 (13) | 0 (0) |
| Philips Brilliance Big Bore | 0 (0) | 2 (2) |
| Total | 877 (850) | 541 (541) |
Case-level dataset division for each model training task.
| Training set | Testing set | |||||
|---|---|---|---|---|---|---|
| Positive cases (#Patients) | Negative (#Patients) | Positive (#Patients) | Negative (#Patients) | |||
| Healthy | Other diseases | Healthy | Other diseases | |||
| Lung region extraction | 361 (360) | 9 (9) | 27 (27) | 93 (93) | 0 (0) | 1 (1) |
| Lesion segmentation | 704 (680) | 7 (7) | 21 (21) | 168 (168) | 2 (2) | 5 (5) |
| Lesion classification | 723 (696) | 70 (70) | 343 (343) | 154 (154) | 21 (21) | 107 (107) |
Fig. 6Demonstration of the deployment workstation.
Dice coefficients of segmentation models.
| Segmentation model | Dice coefficient |
|---|---|
| FCN-8s | 0.681 |
| V-Net | 0.739 |
| U-Net | 0.742 |
| 3D U-Net++ | 0.754 |
Case-level data source distribution of the classification dataset by hospital.
| Hospital name | Training set | Testing set | ||||
|---|---|---|---|---|---|---|
| Positive cases | Negative cases | Positive cases | Negative cases | |||
| Healthy | Other diseases | Healthy | Other diseases | |||
| Beijing Tsinghua Changgung Hospital | 4 | 43 | 183 | 1 | 10 | 48 |
| Wuhan No.7 Hospital | 150 | 15 | 88 | 38 | 6 | 32 |
| Zhongnan Hospital of Wuhan University | 113 | 12 | 72 | 28 | 5 | 27 |
| Tianyou Hospital | 319 | 0 | 0 | 59 | 0 | 0 |
| Wuhan’s Leishenshan Hospital | 132 | 0 | 0 | 33 | 0 | 0 |
| Total | 718 | 70 | 343 | 159 | 21 | 107 |
Case-level data source distribution of the classification dataset by CT scanner models.
| Equipment model | Training set | Testing set | ||||
|---|---|---|---|---|---|---|
| Positive cases | Negative cases | Positive cases | Negative cases | |||
| Healthy | Other diseases | Healthy | Other diseases | |||
| UIH uCT 760 | 12 | 30 | 127 | 1 | 5 | 33 |
| UIH uCT 530 | 201 | 0 | 0 | 52 | 0 | 0 |
| GE Optima CT660 | 319 | 3 | 17 | 59 | 0 | 7 |
| GE Discovery CT750 HD | 0 | 13 | 54 | 0 | 5 | 14 |
| GE Discovery CT | 156 | 0 | 0 | 36 | 0 | 0 |
| GE BrightSpeed | 0 | 0 | 2 | 0 | 0 | 1 |
| SIEMENS SOMATOM Definition | 20 | 0 | 0 | 2 | 0 | 0 |
| SIEMENS Sensation Open | 1 | 0 | 0 | 1 | 0 | 0 |
| Philips iCT 256 | 0 | 24 | 141 | 0 | 11 | 52 |
| Philips Ingenuity CT | 14 | 0 | 0 | 3 | 0 | 0 |
| Philips Brilliance Big Bore | 0 | 0 | 2 | 0 | 0 | 0 |
| Total | 723 | 70 | 343 | 154 | 21 | 107 |
Fig. 2Model performance and highlights of model predictions.a, Receiver operating characteristic (ROC) curves of DPN-92, Inception-v3, ResNet-50, and Attention ResNet-50 with 3D U-Net++, respectively. b, ROC curves of 3D U-Net++ - ResNet-50 trained with different numbers of training cases. c, Typical predictions of the segmentation model.
Training sets distribution in multiple training stages.
| Collection dates | Positive | Negative | Total |
|---|---|---|---|
| 2020.02.07 | 144 | 82 | 226 |
| 2020.02.10 | 289 | 165 | 454 |
| 2020.02.14 | 433 | 247 | 680 |
| 2020.02.17 | 578 | 330 | 908 |
| 2020.02.20 | 723 | 413 | 1136 |
Fig. 3Illustration of the reader study. Five qualified physicians participated in this reader study. A total of 170 cases (89 were positive) were randomly selected from the test set.