Literature DB >> 27664505

Building interpretable predictive models for pediatric hospital readmission using Tree-Lasso logistic regression.

Milos Jovanovic1, Sandro Radovanovic1, Milan Vukicevic2, Sven Van Poucke3, Boris Delibasic1.   

Abstract

OBJECTIVES: Quantification and early identification of unplanned readmission risk have the potential to improve the quality of care during hospitalization and after discharge. However, high dimensionality, sparsity, and class imbalance of electronic health data and the complexity of risk quantification, challenge the development of accurate predictive models. Predictive models require a certain level of interpretability in order to be applicable in real settings and create actionable insights. This paper aims to develop accurate and interpretable predictive models for readmission in a general pediatric patient population, by integrating a data-driven model (sparse logistic regression) and domain knowledge based on the international classification of diseases 9th-revision clinical modification (ICD-9-CM) hierarchy of diseases. Additionally, we propose a way to quantify the interpretability of a model and inspect the stability of alternative solutions.
MATERIALS AND METHODS: The analysis was conducted on >66,000 pediatric hospital discharge records from California, State Inpatient Databases, Healthcare Cost and Utilization Project between 2009 and 2011. We incorporated domain knowledge based on the ICD-9-CM hierarchy in a data driven, Tree-Lasso regularized logistic regression model, providing the framework for model interpretation. This approach was compared with traditional Lasso logistic regression resulting in models that are easier to interpret by fewer high-level diagnoses, with comparable prediction accuracy.
RESULTS: The results revealed that the use of a Tree-Lasso model was as competitive in terms of accuracy (measured by area under the receiver operating characteristic curve-AUC) as the traditional Lasso logistic regression, but integration with the ICD-9-CM hierarchy of diseases provided more interpretable models in terms of high-level diagnoses. Additionally, interpretations of models are in accordance with existing medical understanding of pediatric readmission. Best performing models have similar performances reaching AUC values 0.783 and 0.779 for traditional Lasso and Tree-Lasso, respectfully. However, information loss of Lasso models is 0.35 bits higher compared to Tree-Lasso model.
CONCLUSIONS: We propose a method for building predictive models applicable for the detection of readmission risk based on Electronic Health records. Integration of domain knowledge (in the form of ICD-9-CM taxonomy) and a data-driven, sparse predictive algorithm (Tree-Lasso Logistic Regression) resulted in an increase of interpretability of the resulting model. The models are interpreted for the readmission prediction problem in general pediatric population in California, as well as several important subpopulations, and the interpretations of models comply with existing medical understanding of pediatric readmission. Finally, quantitative assessment of the interpretability of the models is given, that is beyond simple counts of selected low-level features.
Copyright © 2016 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Hospital readmission prediction; Lasso regression; Model interpretability; Tree Lasso regression

Mesh:

Year:  2016        PMID: 27664505     DOI: 10.1016/j.artmed.2016.07.003

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  8 in total

1.  E-CatBoost: An efficient machine learning framework for predicting ICU mortality using the eICU Collaborative Research Database.

Authors:  Nima Safaei; Babak Safaei; Seyedhouman Seyedekrami; Mojtaba Talafidaryani; Arezoo Masoud; Shaodong Wang; Qing Li; Mahdi Moqri
Journal:  PLoS One       Date:  2022-05-05       Impact factor: 3.752

2.  The role of specialized hospital units in infection and mortality risk reduction among patients with hematological cancers.

Authors:  Raïsa Carmen; Galit B Yom-Tov; Inneke Van Nieuwenhuyse; Bram Foubert; Yishai Ofran
Journal:  PLoS One       Date:  2019-03-20       Impact factor: 3.240

3.  The utility of LASSO-based models for real time forecasts of endemic infectious diseases: A cross country comparison.

Authors:  Yirong Chen; Collins Wenhan Chu; Mark I C Chen; Alex R Cook
Journal:  J Biomed Inform       Date:  2018-02-27       Impact factor: 6.317

4.  Determining Predictors of Weight Loss in a Behavioral Intervention: A Case Study in the Use of Lasso Regression.

Authors:  Carly Lupton-Smith; Elizabeth A Stuart; Emma E McGinty; Arlene T Dalcin; Gerald J Jerome; Nae-Yuh Wang; Gail L Daumit
Journal:  Front Psychiatry       Date:  2022-02-03       Impact factor: 4.157

5.  Applicability of predictive models for 30-day unplanned hospital readmission risk in paediatrics: a systematic review.

Authors:  Ines Marina Niehaus; Nina Kansy; Stephanie Stock; Jörg Dötsch; Dirk Müller
Journal:  BMJ Open       Date:  2022-03-30       Impact factor: 2.692

6.  Separation of Different Blogs from Skin Disease Data using Artificial Intelligence.

Authors:  Mohammed J Abdulaal; Ibrahim M Mehedi; Abdulah Jeza Aljohani; Ahmad H Milyani; Mohamed Mahmoud; Abdullah M Abusorrah; Rahtul Jannat
Journal:  Comput Intell Neurosci       Date:  2022-08-23

7.  Predicting Readmission Charges Billed by Hospitals: Machine Learning Approach.

Authors:  Deepika Gopukumar; Abhijeet Ghoshal; Huimin Zhao
Journal:  JMIR Med Inform       Date:  2022-08-30

8.  Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences.

Authors:  Julian Hatwell; Mohamed Medhat Gaber; R Muhammad Atif Azad
Journal:  BMC Med Inform Decis Mak       Date:  2020-10-02       Impact factor: 2.796

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.