| Literature DB >> 35298502 |
Shanmukh Alle1, Akshay Kanakan2, Samreen Siddiqui3, Akshit Garg1, Akshaya Karthikeyan1, Priyanka Mehta2, Neha Mishra2, Partha Chattopadhyay2,4, Priti Devi2,4, Swati Waghdhare3, Akansha Tyagi3, Bansidhar Tarai3, Pranjal Pratim Hazarik3, Poonam Das3, Sandeep Budhiraja3, Vivek Nangia3, Arun Dewan3, Ramanathan Sethuraman5, C Subramanian5, Mashrin Srivastava5, Avinash Chakravarthi5, Johnny Jacob5, Madhuri Namagiri5, Varma Konala5, Debasish Dash2, Tavpritesh Sethi6, Sujeet Jha3, Anurag Agrawal2, Rajesh Pandey2, P K Vinod1, U Deva Priyakumar1.
Abstract
The variability of clinical course and prognosis of COVID-19 highlights the necessity of patient sub-group risk stratification based on clinical data. In this study, clinical data from a cohort of Indian COVID-19 hospitalized patients is used to develop risk stratification and mortality prediction models. We analyzed a set of 70 clinical parameters including physiological and hematological for developing machine learning models to identify biomarkers. We also compared the Indian and Wuhan cohort, and analyzed the role of steroids. A bootstrap averaged ensemble of Bayesian networks was also learned to construct an explainable model for discovering actionable influences on mortality and days to outcome. We discovered blood parameters, diabetes, co-morbidity and SpO2 levels as important risk stratification features, whereas mortality prediction is dependent only on blood parameters. XGboost and logistic regression model yielded the best performance on risk stratification and mortality prediction, respectively (AUC score 0.83, AUC score 0.92). Blood coagulation parameters (ferritin, D-Dimer and INR), immune and inflammation parameters IL6, LDH and Neutrophil (%) are common features for both risk and mortality prediction. Compared with Wuhan patients, Indian patients with extreme blood parameters indicated higher survival rate. Analyses of medications suggest that a higher proportion of survivors and mild patients who were administered steroids had extreme neutrophil and lymphocyte percentages. The ensemble averaged Bayesian network structure revealed serum ferritin to be the most important predictor for mortality and Vitamin D to influence severity independent of days to outcome. The findings are important for effective triage during strains on healthcare infrastructure.Entities:
Mesh:
Year: 2022 PMID: 35298502 PMCID: PMC8929610 DOI: 10.1371/journal.pone.0264785
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 4(A) Comparison of F1 scores for various machine learning models that use patient vitals and lab test results. (B) Performance of the ML models with respect to number of days to outcome.
Fig 5Distribution plots for lymphocyte (%) and neutrophil (%) in steroid administered and non-administered patients having mild and severe disease.
Distribution of the number of patients across various classes.
| Data Distribution | |||
|---|---|---|---|
| Risk Category | Quaternary Stratification | Mortality | Binary Stratification |
|
| 8 (1.47%) | 483 (88.79%) | 244 (44.85%) |
|
| 236 (43.38%) | ||
|
| 239 (43.93%) | 300 (55.15%) | |
|
| 61 (11.21%) | 61 (11.21%) | |