Literature DB >> 33539308

Using Automated Machine Learning to Predict the Mortality of Patients With COVID-19: Prediction Model Development Study.

Kenji Ikemura1,2, Eran Bellin3, Yukako Yagi4, Henny Billett5, Mahmoud Saada2, Katelyn Simone2, Lindsay Stahl3, James Szymanski1, D Y Goldstein1, Morayma Reyes Gil1.   

Abstract

BACKGROUND: During a pandemic, it is important for clinicians to stratify patients and decide who receives limited medical resources. Machine learning models have been proposed to accurately predict COVID-19 disease severity. Previous studies have typically tested only one machine learning algorithm and limited performance evaluation to area under the curve analysis. To obtain the best results possible, it may be important to test different machine learning algorithms to find the best prediction model.
OBJECTIVE: In this study, we aimed to use automated machine learning (autoML) to train various machine learning algorithms. We selected the model that best predicted patients' chances of surviving a SARS-CoV-2 infection. In addition, we identified which variables (ie, vital signs, biomarkers, comorbidities, etc) were the most influential in generating an accurate model.
METHODS: Data were retrospectively collected from all patients who tested positive for COVID-19 at our institution between March 1 and July 3, 2020. We collected 48 variables from each patient within 36 hours before or after the index time (ie, real-time polymerase chain reaction positivity). Patients were followed for 30 days or until death. Patients' data were used to build 20 machine learning models with various algorithms via autoML. The performance of machine learning models was measured by analyzing the area under the precision-recall curve (AUPCR). Subsequently, we established model interpretability via Shapley additive explanation and partial dependence plots to identify and rank variables that drove model predictions. Afterward, we conducted dimensionality reduction to extract the 10 most influential variables. AutoML models were retrained by only using these 10 variables, and the output models were evaluated against the model that used 48 variables.
RESULTS: Data from 4313 patients were used to develop the models. The best model that was generated by using autoML and 48 variables was the stacked ensemble model (AUPRC=0.807). The two best independent models were the gradient boost machine and extreme gradient boost models, which had an AUPRC of 0.803 and 0.793, respectively. The deep learning model (AUPRC=0.73) was substantially inferior to the other models. The 10 most influential variables for generating high-performing models were systolic and diastolic blood pressure, age, pulse oximetry level, blood urea nitrogen level, lactate dehydrogenase level, D-dimer level, troponin level, respiratory rate, and Charlson comorbidity score. After the autoML models were retrained with these 10 variables, the stacked ensemble model still had the best performance (AUPRC=0.791).
CONCLUSIONS: We used autoML to develop high-performing models that predicted the survival of patients with COVID-19. In addition, we identified important variables that correlated with mortality. This is proof of concept that autoML is an efficient, effective, and informative method for generating machine learning-based clinical decision support tools. ©Kenji Ikemura, Eran Bellin, Yukako Yagi, Henny Billett, Mahmoud Saada, Katelyn Simone, Lindsay Stahl, James Szymanski, D Y Goldstein, Morayma Reyes Gil. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 26.02.2021.

Entities:  

Keywords:  COVID-19; Shapley additive explanation; automated machine learning; biomarker; decision support; decision support tool; dimensionality reduction; machine learning; partial dependence plot; ranking

Mesh:

Year:  2021        PMID: 33539308      PMCID: PMC7919846          DOI: 10.2196/23458

Source DB:  PubMed          Journal:  J Med Internet Res        ISSN: 1438-8871            Impact factor:   5.428


  12 in total

1.  Predicting the Travel Distance of Patients to Access Healthcare Using Deep Neural Networks.

Authors:  Li-Chin Chen; Ji-Tian Sheu; Yuh-Jue Chuang; Yu Tsao
Journal:  IEEE J Transl Eng Health Med       Date:  2021-12-08       Impact factor: 3.316

2.  Influence of Co-morbidities During SARS-CoV-2 Infection in an Indian Population.

Authors:  Adrian Matysek; Aneta Studnicka; Wade Menpes Smith; Michał Hutny; Paweł Gajewski; Krzysztof J Filipiak; Jorming Goh; Guang Yang
Journal:  Front Med (Lausanne)       Date:  2022-08-01

3.  Comparing human milk macronutrients measured using analyzers based on mid-infrared spectroscopy and ultrasound and the application of machine learning in data fitting.

Authors:  Huijuan Ruan; Qingya Tang; Yajie Zhang; Xuelin Zhao; Yi Xiang; Yi Feng; Wei Cai
Journal:  BMC Pregnancy Childbirth       Date:  2022-07-14       Impact factor: 3.105

Review 4.  The accuracy of machine learning approaches using non-image data for the prediction of COVID-19: A meta-analysis.

Authors:  Kuang-Ming Kuo; Paul C Talley; Chao-Sheng Chang
Journal:  Int J Med Inform       Date:  2022-05-13       Impact factor: 4.730

5.  Machine Learning Based Clinical Decision Support System for Early COVID-19 Mortality Prediction.

Authors:  Akshaya Karthikeyan; Akshit Garg; P K Vinod; U Deva Priyakumar
Journal:  Front Public Health       Date:  2021-05-12

6.  BACS: blockchain and AutoML-based technology for efficient credit scoring classification.

Authors:  Fan Yang; Yanan Qiao; Yong Qi; Junge Bo; Xiao Wang
Journal:  Ann Oper Res       Date:  2022-01-24       Impact factor: 4.854

Review 7.  COVID Mortality Prediction with Machine Learning Methods: A Systematic Review and Critical Appraisal.

Authors:  Francesca Bottino; Emanuela Tagliente; Luca Pasquini; Alberto Di Napoli; Martina Lucignani; Lorenzo Figà-Talamanca; Antonio Napolitano
Journal:  J Pers Med       Date:  2021-09-07

8.  Explainable machine learning model for predicting spontaneous bacterial peritonitis in cirrhotic patients with ascites.

Authors:  Yingying Hu; Ruijia Chen; Haibing Gao; Haitao Lin; Jinye Wang; Xiaowei Wang; Jingfeng Liu; Yongyi Zeng
Journal:  Sci Rep       Date:  2021-11-04       Impact factor: 4.379

9.  Machine Learning-Based COVID-19 Patients Triage Algorithm Using Patient-Generated Health Data from Nationwide Multicenter Database.

Authors:  Min Sue Park; Hyeontae Jo; Haeun Lee; Se Young Jung; Hyung Ju Hwang
Journal:  Infect Dis Ther       Date:  2022-02-16

10.  Interpretable deep learning for the prediction of ICU admission likelihood and mortality of COVID-19 patients.

Authors:  Amril Nazir; Hyacinth Kwadwo Ampadu
Journal:  PeerJ Comput Sci       Date:  2022-03-17
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.