| Literature DB >> 34195215 |
Huijing Ma1, Qinghao Ye2,3, Weiping Ding4, Yinghui Jiang2,3, Minhao Wang2,3, Zhangming Niu3, Xi Zhou5, Yuan Gao6,7, Chengjia Wang8, Wade Menpes-Smith7, Evandro Fei Fang9, Jianbo Shao10, Jun Xia6, Guang Yang11,12.
Abstract
The rapid spread of coronavirus 2019 disease (COVID-19) has manifested a global public health crisis, and chest CT has been proven to be a powerful tool for screening, triage, evaluation and prognosis in COVID-19 patients. However, CT is not only costly but also associated with an increased incidence of cancer, in particular for children. This study will question whether clinical symptoms and laboratory results can predict the CT outcomes for the pediatric patients with positive RT-PCR testing results in order to determine the necessity of CT for such a vulnerable group. Clinical data were collected from 244 consecutive pediatric patients (16 years of age and under) treated at Wuhan Children's Hospital with positive RT-PCR testing, and the chest CT were performed within 3 days of clinical data collection, from January 21 to March 8, 2020. This study was approved by the local ethics committee of Wuhan Children's Hospital. Advanced decision tree based machine learning models were developed for the prediction of CT outcomes. Results have shown that age, lymphocyte, neutrophils, ferritin and C-reactive protein are the most related clinical indicators for predicting CT outcomes for pediatric patients with positive RT-PCR testing. Our decision support system has managed to achieve an AUC of 0.84 with 0.82 accuracy and 0.84 sensitivity for predicting CT outcomes. Our model can effectively predict CT outcomes, and our findings have indicated that the use of CT should be reconsidered for pediatric patients, as it may not be indispensable.Entities:
Keywords: COVID-19; RT-PCR—polymerase chain reaction with reverse transcription; artificial intelligence; decision trees; machine learning; pediatric
Year: 2021 PMID: 34195215 PMCID: PMC8236538 DOI: 10.3389/fmed.2021.699984
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Baseline characteristics of children with COVID-19.
| Age [Mean (SD)] | 7.9 (4.5) | 5.6 (4.8) | 6.6 (4.8) |
| Sex (Male/Female) | 54/48 | 85/56 | 139/104 |
| Contact history | 92 | 126 | 218 |
| Fever | 23 | 70 | 93 |
| Cough | 23 | 70 | 93 |
| Vomit | 3 | 13 | 16 |
| Diarrhea | 2 | 9 | 11 |
| Poor spirit | 0 | 7 | 7 |
| Running nose | 4 | 11 | 15 |
| LDH(U/L) | 221 (189–260) | 246 (213–326.5) | 238 (201–294) |
| Ferritin (ng/mL) | 58.1 (36.8–86.6) | 61.6 (40.2–95.3) | 58.9 (39.9–90.2) |
| CK-MB (U/L) | 20 (16–32) | 24 (18–35) | 23 (17–34) |
| Leukocyte (109/L) | 7 (6–8.9) | 6.9 (5.4–8.6) | 6.9 (5.6–8.7) |
| Neutrophils (109/L) | 3.6 (2.5–4.6) | 2.4 (1.7–3.8) | 3 (1.9–4.2) |
| Lymphocyte (109/L) | 2.8 (2.3–3.5) | 2.9 (2–4.5) | 2.9 (2.1–4) |
| C-Reactive protein (mg/L) | 1 (0.8–4) | 3 (1–5.9) | 1.2 (0.8–5) |
| Neutrophil lymphocyte ratio (NLR) | 1.3 (0.9–1.9) | 0.9 (0.5–1.5) | 1 (0.6–1.7) |
LDH, Lactic Dehydrogenase; CK-MB, Creatine Kinase-MB.
Figure 1Flow chart and network architecture of our proposed model.
Odds ratio for features.
| Ferritin | 10.36 | [1.28, 83.69] | 0.0196 |
| Lymphocyte | 3.11 | [1.57, 6.15] | 0.0014 |
| C-reactive protein | 2.40 | [1.42, 4.05] | 0.0014 |
| LDH | 2.30 | [1.12, 4.72] | 0.0322 |
| CK-MB | 1.67 | [0.5, 5.59] | 0.5815 |
| Leukocyte | 0.47 | [0.26, 0.85] | 0.0199 |
| Age | 0.41 | [0.24, 0.69] | 0.0011 |
| Neutrophils | 0.41 | [0.24, 0.7] | 0.0016 |
| Neutrophils lymphocyte ratio (NLR) | 0.37 | [0.15, 0.87] | 0.0322 |
Figure 2Spearman's Correlation for all features.
Comparison of general models.
| TabNet ( | 0.7891 | 0.7755 | 0.7727 | 0.7391 | 0.7559 |
| AutoML ( | 0.7453 | 0.7368 | 0.7143 | 0.7895 | 0.7519 |
| DeepFM ( | 0.6941 | 0.6818 | 0.7273 | 0.6667 | 0.6970 |
| XGBoost ( | 0.7131 | 0.7097 | 0.6429 | 0.6923 | 0.6676 |
| Our Model |
For all the comparison methods please refer to the opensource implementations at TabNet: .
Bold values indicate the best performed method.
Result of all cases where each proposed method can be applied.
| 1 | √ | 0.7081 | 0.6957 | 0.7297 | 0.7105 | 0.7201 | ||
| 2 | √ | √ | 0.7635 | 0.7581 | 0.7941 | 0.7714 | 0.7828 | |
| 3 | √ | √ | 0.7812 | 0.7761 | 0.8158 | 0.7949 | 0.8053 | |
| 4 | √ | √ | √ |
Bold values indicate the best performed method.
Figure 3Combinations of different dual features.
Results of single feature models.
| Age | 0.6683 (0.0806) | 0.6477 (0.0561) | 0.8015 (0.1786) | 0.4371 (0.2707) | 0.7172 (0.0684) |
| C-reactive protein | 0.5981 (0.0864) | 0.6102 (0.0462) | 0.7387 (0.2140) | 0.4352 (0.3635) | 0.6771 (0.0536) |
| Ferritin | 0.5327 (0.1163) | 0.6355 (0.0453) | 0.8613 (0.1161) | 0.3195 (0.2679) | 0.7322 (0.0103) |
| Lymphocyte | 0.6194 (0.1047) | 0.6355 (0.0659) | 0.8500 (0.1225) | 0.3424 (0.2990) | 0.7302 (0.0266) |
| Neutrophils | 0.6726 (0.0813) | 0.6513 (0.0523) | 0.8313 (0.1239) | 0.4048 (0.2651) | 0.7325 (0.0276) |
Results of combined feature models.
| Age-C-reactive protein | 0.8163 (0.1311) | 0.7288 (0.0759) | 0.8589 (0.0854) | 0.5490 (0.2774) | 0.7883 (0.0334) |
| Age-Neutrophils | 0.7915 (0.0326) | 0.7129 (0.0355) | 0.8512 (0.0255) | 0.5243 (0.1097) | 0.7748 (0.0182) |
| Age-Ferritin | 0.7551 (0.0437) | 0.7214 (0.0375) | 0.7803 (0.0465) | 0.6410 (0.0660) | 0.7637 (0.0329) |
| Age-Lymphocyte | 0.7956 (0.0775) | 0.7332 (0.0452) | 0.7724 (0.0873) | 0.6805 (0.0622) | 0.7679 (0.0472) |
| Combination | 0.8412 (0.0982) | 0.8191 (0.0590) | 0.8597 (0.0407) | 0.7767 (0.1853) | 0.8464 (0.0348) |
Figure 4AUC score for all models.