| Literature DB >> 35693121 |
Winston T Wang1, Charlotte L Zhang1, Kang Wei2, Ye Sang3, Jun Shen4, Guangyu Wang5, Alexander X Lozano1.
Abstract
Within COVID-19 there is an urgent unmet need to predict at the time of hospital admission which COVID-19 patients will recover from the disease, and how fast they recover to deliver personalized treatments and to properly allocate hospital resources so that healthcare systems do not become overwhelmed. To this end, we have combined clinically salient CT imaging data synergistically with laboratory testing data in an integrative machine learning model to predict organ-specific recovery of patients from COVID-19. We trained and validated our model in 285 patients on each separate major organ system impacted by COVID-19 including the renal, pulmonary, immune, cardiac, and hepatic systems. To greatly enhance the speed and utility of our model, we applied an artificial intelligence method to segment and classify regions on CT imaging, from which interpretable data could be directly fed into the predictive machine learning model for overall recovery. Across all organ systems we achieved validation set area under the receiver operator characteristic curve (AUC) values for organ-specific recovery ranging from 0.80 to 0.89, and significant overall recovery prediction in Kaplan-Meier analyses. This demonstrates that the synergistic use of an artificial intelligence (AI) framework applied to CT lung imaging and a machine learning model that integrates laboratory test data with imaging data can accurately predict the overall recovery of COVID-19 patients from baseline characteristics.Entities:
Keywords: COVID-19; artificial intelligence; longitudinal evaluation; medical image
Year: 2020 PMID: 35693121 PMCID: PMC7798573 DOI: 10.1093/pcmedi/pbaa040
Source DB: PubMed Journal: Precis Clin Med ISSN: 2516-1571
Figure 1.Proposed framework for NCP diagnosis and prognosis prediction. A large CT and metadata dataset was constructed from 285 patients (247 756 CT images from COVID-19 pneumonia). These NCP images were entered into an AI diagnostic system with patients' medical records to generate a quantitative report of lung lesions. We next analyzed lung-lesion features and clinical metadata to evaluate the changing of bodily functions after the initial hospital discharge using a LightGBM model and conduct a prognosis analysis using a Kaplan-Meier curve
Demographic and clinical statistics of the cohort. Data are mean (range) andn (%). The cohort of these 285 patients as COVID-19 pneumonia is stratified into groups of critically ill and non-critically ill patients. P values indicated comparison of critically ill patients versus non-critically ill patients and were calculated by independent-samples t test via SPSS 22.0.
| Total | Non-critically ill | Critically ill |
| ||||||
|---|---|---|---|---|---|---|---|---|---|
| Variables | sample size | mean (min-max) | sample size | % | mean (min-max) | sample size | % | mean (min-max) | t test |
| Age (Years) | 285 | 49 (0–91) | 101 | 48.8 | 48.1 (17–77) | 184 | 51.2% | 49.5 (0–91) | 0.542 |
| Sex | |||||||||
| Men | 101 | 139 (48.8%) | 51 | 50.5 | 88 | 47.8 | |||
| Women | 184 | 146 (51.2%) | 50 | 49.5 | 96 | 52.2 | |||
| Hemoglobin (g/L) | 269 | 133.3 (69–195) | 99 | 36.8 | 137.4 (89–177) | 170 | 63.2 | 130.99 (69–195) | 0.003** |
| Total protein (g/L) | 218 | 63.5 (50.9–90.5) | 62 | 28.4 | 64.4 (54.8–78.0) | 156 | 71.6 | 63.2 (50.1–90.5) | 0.132 |
| Albumin (g/L) | 218 | 36.1 (24.3–49.5) | 62 | 28.4 | 37.8 (27.2–46.1) | 156 | 71.6 | 35.5 (24.3–49.5) | 0.001** |
| Creatinine ( | 282 | 75.9 (18–1017.9) | 100 | 35.5 | 65.5 (39.8–138.4) | 182 | 64.5 | 81.7 (18–1017) | 0.033* |
| Lactate dehydrogenase (U/L) | 278 | 235.7 (97–559) | 100 | 36.0 | 204.6 (119–358) | 178 | 64.0 | 253.1 (97–559) | 0.000*** |
| C-reactive protein (mg/L) | 272 | 32.1 (0.1–333.1) | 99 | 36.4 | 19.7 (0.1–130.5) | 173 | 63.6 | 39.2 (0.1–333.1) | 0.000*** |
| Creatine kinase (IU/L) | 278 | 146.1 (13–1849) | 100 | 36.0 | 129.4 (13–871)) | 178 | 64.0 | 155.4 (20–1849) | 0.283 |
| Creatine kinase myocardial band (IU/L) | 278 | 12.6 (3–165) | 100 | 36.0 | 11.0 (4–19) | 178 | 64.0 | 13.4 (3–165) | 0.019* |
| Potassium (IU/L) | 282 | 3.8 (2.6–5.8) | 100 | 35.5 | 3.8 (2.6–4.6) | 182 | 64.5 | 3.8 (2.9–5.8) | 0.579 |
| Sodium (IU/L) | 282 | 137.9 (124.5–147.7) | 100 | 35.5 | 139.0 (131.7–147.7) | 182 | 64.5 | 137.3 (124.5–144.3) | 0.000*** |
| Erythrocyte sedimentation rate (mm/h) | 267 | 31.2 (0.5–126) | 99 | 37.1 | 25.9 (0.5–105) | 168 | 62.9 | 34.3 (0.5–126) | 0.012* |
| Globulin (g/L) | 218 | 27.4 (17.7–49.4) | 62 | 28.4 | 26.6 (19.6–36.9) | 156 | 71.6 | 27.7 (17.7–49.4) | 0.144 |
| Blood urea nitrate (mmol/L) | 282 | 4.6 (1.8–26.9) | 100 | 35.5 | 4.0 (1.8–10.9) | 182 | 64.5 | 4.9 (1.9–26.9) | 0.000*** |
| Kidney score | 285 | 1.04 (0–4) | 101 | 35.5 | 0.69 (0–3) | 184 | 64.5 | 1.22 (0–4) | 0.000*** |
| Liver score | 285 | 1.28 (0–5) | 101 | 35.5 | 0.89 (0–4) | 184 | 64.5 | 1.49 (0–5) | 0.000*** |
| Pulmonary score | 285 | 1.93 (0–4) | 101 | 35.5 | 1.42 (0–3) | 184 | 64.5 | 2.21 (0–4) | 0.000*** |
| Immune score | 285 | 2.23 (0–5) | 101 | 35.5 | 1.96 (0–5) | 184 | 64.5 | 2.38 (0–5) | 0.011* |
| Coagulation score | 285 | 1.75 (0–4) | 101 | 35.5 | 1.50 (0–3) | 184 | 64.5 | 1.88 (0–4) | 0.005** |
https://www.graphpad.com/support/faq/what-is-the-meaning-of--or--or--in-reports-of-statistical-significance-from-prism-or-instat/#:~:text=NEJM%20(New%20England%20Journal%20of,than%200.001%20shown%20as%20%22%3C%20.&text=0001%22.-,P%20values%20less%20than%200.001%20are%20summarized%20with%20three%20asterisks,are%20summarized%20with%20four%20asterisks.
Figure 2.ROC curves and confusion matrices for immune and renal systems. (A) The immune system model resulted in an AUC of 0.89. (B) The renal system model resulted in an AUC of 0.83. (C) The confusion matrix for the immune system model on the validation set, with an accuracy of 0.83. (D) The confusion matrix for the renal system model on the validation set, with an accuracy of 0.79.
Figure 3.Kaplan-Meier curves to assess patient recovery. Kaplan-Meier curves for the (A) immune and (B) renal systems. The patient population was stratified into high and low model scores based on a cutpoint of 0.5, the general midpoint of the lightGBM model output range.