| Literature DB >> 32300971 |
Xiaofeng Chen1, Yanyan Tang2, Yongkang Mo3, Shengkai Li4, Daiying Lin5, Zhijian Yang6, Zhiqi Yang1, Hongfu Sun7, Jinming Qiu2, Yuting Liao8, Jianning Xiao5, Xiangguang Chen1, Xianheng Wu5, Renhua Wu9, Zhuozhi Dai10.
Abstract
OBJECTIVES: Rapid and accurate diagnosis of coronavirus disease 2019 (COVID-19) is critical during the epidemic. We aim to identify differences in CT imaging and clinical manifestations between pneumonia patients with and without COVID-19, and to develop and validate a diagnostic model for COVID-19 based on radiological semantic and clinical features alone.Entities:
Keywords: COVID-19; Diagnosis; Multi-institutional systems; Pneumonia; Radiology
Mesh:
Year: 2020 PMID: 32300971 PMCID: PMC7160614 DOI: 10.1007/s00330-020-06829-2
Source DB: PubMed Journal: Eur Radiol ISSN: 0938-7994 Impact factor: 5.315
Fig. 1A 23-year-old female with a travel history to Wuhan presenting with fever. Axial noncontrast CT image shows a consolidation with ground-glass opacities in the peripheral region by the right upper lobe. Air bronchogram is found in lesion. The maximum diameter of lesion is 2.8 cm. The right upper lobe score is 1 because of the involved lung parenchyma less than 1/4
Fig. 2Workflow of data process and analysis in this study. Radiological semantic features, including qualitative and quantitative imaging features, are extracted from axial lung CT section. The clinical manifestation and laboratory parameters are provided by electronic case system. Statistical analysis is performed for comparing the different features between COVID-19 and non-COVID-19 patients. Univariate analysis, least absolute shrinkage, and selection operator (LASSO) are further performed to determine the COVID-19 risk factors with p < 0.05 in statistical analysis. Three models based on the selected features are established by multivariate logistic regression. These models include radiological mode (R model), clinical model (C model), and the combination of clinical and radiological model (CR model). The performance and clinical benefits of the prediction model are assessed by the area under a receiver operating characteristic (ROC) curve and the decision curve, respectively
Radiological semantic features of patients in COVID-19 and non-COVID-19
| Feature | Non-COVID-19 ( | COVID-19 ( | |
|---|---|---|---|
| Number of pure GGO | |||
| Total# | 1.00 (0.00, 5.05) | 3.50 (0.95, 8.05) | 0.018b* |
| Peripheral area# | 1.00 (0.00, 4.05) | 2.00 (0.00, 6.05) | 0.032b* |
| Central/both peripheral and central area# | 0.00 (0.00, 0.00) | 0.00 (0.00, 2.00) | 0.001b* |
| Number of mixed GGO | |||
| Total# | 1.00 (0.00, 3.05) | 3.00 (1.00, 9.00) | 0.001b* |
| Peripheral area# | 0.00 (0.00, 2.00) | 2.50 (1.00, 6.00) | < 0.001b* |
| Central/both peripheral and central area# | 0.00 (0.00, 1.05) | 0.00 (0.00, 2.00) | 0.657b |
| Total number of consolidation | |||
| Consolidation# | 1.00 (0.00, 3.00) | 0.00 (0.00, 0.05) | 0.001b* |
| Pure solid nodules# | 0.00 (0.00, 0.00) | 0.00 (0.00, 0.00) | 0.309b |
| Solid nodules with GGO# | 0.00 (0.00, 0.00) | 0.00 (0.00, 1.00) | 0.033b* |
| Total number of lesions | |||
| Peripheral area# | 5.00 (2.00, 9.05) | 7.00 (2.00, 13.00) | 0.112b |
| Central area# | 0.00 (0.00, 3.00) | 0.00 (0.00, 1.05) | 0.960b |
| Both peripheral and central area# | 0.00 (0.00, 2.00) | 0.00 (0.00, 2.05) | 0.582b |
| Interlobular septal thickening | 0.009a* | ||
| Negative | 44 (66.67%) | 31 (44.29%) | |
| Positive | 22 (33.33%) | 39 (55.71%) | |
| Crazy paving pattern | < 0.001a* | ||
| Negative | 60 (90.91%) | 32 (45.71%) | |
| Positive | 6 (9.09%) | 38 (54.29%) | |
| Tree-in-bud sign | < 0.001a* | ||
| Negative | 37 (56.06%) | 61 (87.14%) | |
| Positive | 29 (43.94%) | 9 (12.86%) | |
| Pleural thickening | 0.030a* | ||
| Negative | 46 (69.70%) | 36 (51.43%) | |
| Positive | 20 (30.30%) | 34 (48.57%) | |
| Offending vessel augmentation in lesions | < 0.001a* | ||
| Negative | 55 (83.33%) | 17 (24.29%) | |
| Positive | 11 (16.67%) | 53 (75.71%) | |
GGO ground-glass opacities
#Results are median with interquartile range in parentheses, and the remainder results are measurements with corresponding ratio in parentheses
*Data with statistical significance. pa: chi-square test, pb: Student’s t test
Clinical features of patients in COVID-19 and non-COVID-19
| Feature | Non-COVID-19 ( | COVID-19 ( | |
|---|---|---|---|
| Sex | |||
| Male# | 43 (65.15%) | 41 (58.57%) | 0.430a |
| Female# | 23 (34.85%) | 29 (41.43%) | |
| Age (years) | 46.73 ± 25.00 | 42.93 ± 13.32 | 0.275b |
| Vital signs | |||
| Systolic blood pressure (mmHg) | 126.92 ± 23.07 | 127.07 ± 15.16 | 0.965b |
| Diastolic blood pressure (mmHg) | 77.74 ± 15.72 | 80.39 ± 10.51 | 0.254b |
| Respiration rate (bpm) | 25.20 ± 7.29 | 19.86 ± 1.90 | < 0.001b* |
| Heart rate (bpm) | 101.59 ± 20.36 | 86.06 ± 13.34 | < 0.001b* |
| Temperature (°C) | 37.61 ± 1.06 | 37.12 ± 0.83 | 0.003b* |
| Signs | |||
| Dry cough# | 56 (84.85%) | 48 (68.57%) | 0.025a* |
| Fatigue# | 8 (12.12%) | 22 (31.43%) | 0.007a* |
| Sore throat# | 6 (9.09%) | 9 (12.86%) | 0.483a |
| Stuffy# | 4 (6.06%) | 2 (2.86%) | 0.623a |
| Runny nose# | 3 (4.55%) | 3 (4.29%) | 0.731a |
| White blood cell count (× 109/L) | 11.48 ± 5.36 | 5.27 ± 2.33 | < 0.001b* |
| White blood cell count category | < 0.001c* | ||
| Low# | 0 (0.00%) | 2 (2.86%) | |
| Normal# | 27 (40.91%) | 63 (90.00%) | |
| High# | 39 (59.09%) | 5 (7.14%) | |
| Lymphocyte count (× 109/L) | 1.57 ± 1.33 | 1.25 ± 0.68 | 0.086b |
| Lymphocyte count category | < 0.001c* | ||
| Low# | 24 (36.36%) | 32 (45.71%) | |
| Normal# | 35 (53.03%) | 37 (52.86%) | |
| High# | 7 (10.61%) | 1 (1.43%) | |
| Neutrophil count (× 109/L) | 8.97 ± 4.90 | 3.53 ± 2.17 | < 0.001b* |
| Neutrophil count category | < 0.001c* | ||
| Low# | 3 (4.55%) | 8 (11.43%) | |
| Normal# | 23 (34.85%) | 59 (84.29%) | |
| High# | 40 (60.61%) | 3 (4.29%) | |
| C-reactive protein (mg/L) | 69.30 ± 65.88 | 26.37 ± 30.97 | < 0.001b* |
| Procalcitonin (ng/mL) | 3.36 ± 8.98 | 0.26 ± 0.84 | 0.007b* |
*Data with statistical significance. pa: chi-square test, pb: Student’s t test. pc: Kruskal-Wallis H test
#Results are measurements with corresponding ratio in parentheses
Selected features in C, R, and CR models
| Model and individual features | Coefficients |
|---|---|
| R, | |
| Intercept | − 0.307 |
| Total number of mixed GGO in peripheral area | 0.359 |
| Total number of consolidation | − 1.262 |
| Total number of solid nodules with ground-glass opacities | 0.452 |
| Interlobular septal thickening | − 5.559 |
| Crazy paving pattern | 3.566 |
| Tree-in-bud | − 2.548 |
| Pleural thickening | 3.265 |
| Offending vessel augmentation in lesions | 5.504 |
| C, | |
| Intercept | 29.273 |
| Respiration | − 0.359 |
| Heart rate | − 0.054 |
| Temperature | − 0.289 |
| White blood cell count | − 0.175 |
| Cough | − 1.866 |
| Fatigue | 2.855 |
| Lymphocyte count category | − 0.028 |
| CR, | |
| Intercept | 45.117 |
| Total number of mixed GGO in peripheral area | 0.108 |
| Tree-in-bud | − 1.853 |
| Offending vessel augmentation in lesions | 6.000 |
| Respiration | − 0.583 |
| Heart ratio | − 0.084 |
| Temperature | − 0.536 |
| White blood cell count | − 0.471 |
| Cough | − 0.997 |
| Fatigue | − 0.228 |
| Lymphocyte count category | − 2.177 |
C, R, and CR indicate the predicted model based on clinical features, radiological features, and the combination of clinical features and clinical radiological features, respectively
*n means corresponding selected features, and data in parentheses are total features. Coefficients: the estimate value of each feature in multivariate logistic regression model by “glm” package in R
Performance of the individualized prediction models
| Primary cohort ( | Validation cohort ( | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Models | AUC | 95% CI | Accuracy | Specificity | Sensitivity | AUC | 95% CI | Accuracy | Specificity | Sensitivity |
| C model | 0.952 | 0.915~0.988 | 0.888 | 0.894 | 0.882 | 0.967 | 0.919~1.000 | 0.868 | 0.859 | 0.842 |
| R model | 0.969 | 0.940~0.997 | 0.929 | 0.851 | 1.000 | 0.809 | 0.669~0.948 | 0.684 | 0.368 | 1.000 |
| CR model | 0.986 | 0.966~1.000 | 0.959 | 0.957 | 0.961 | 0.936 | 0.866~1.000 | 0.763 | 0.789 | 0.737 |
C, R, and CR indicate the predicted model based on clinical features, radiological features, and the combination of clinical features and clinical radiological features, respectively. CI confidence interval
Fig. 3ROC of the three models in primary and validation cohort curves. Comparison of receiver operating characteristic (ROC) curves among the radiological mode (R model), clinical model (C model), and the combination of clinical and radiological model (CR model) for the diagnosis of COVID-19 in the primary (a) and validation (b) cohorts
Fig. 4Decision curve analysis for each model in the primary dataset. The y-axis measures the net benefit, which is calculated by summing the benefits (true-positive findings) and subtracting the harms (false-positive findings), weighting the latter by a factor related to the relative harm of undetected metastasis compared with the harm of unnecessary treatment. The decision curve shows that if the threshold probability is over 10%, the application of the combination of clinical and radiological model (CR model) to diagnose COVID-19 adds more benefit than the clinical model (C model) and radiological model (R model)
Fig. 5Nomogram of the CR model in the primary cohort. TN_Mixed_GGO_IP represented the total number of mixed GGO in peripheral area. AVAIL represented offending vessel segmentation in lesions. N was a negative result, and P was a positive result. Norm represented normal. Note that in probability scale, 0 = non-COVID-19, 1 = COVID-19