| Literature DB >> 32082697 |
Hasan Symum1, José L Zayas-Castro1,2.
Abstract
OBJECTIVES: The study aimed to develop and compare predictive models based on supervised machine learning algorithms for predicting the prolonged length of stay (LOS) of hospitalized patients diagnosed with five different chronic conditions.Entities:
Keywords: Chronic Disease; Discharge Planning; Inpatients; Length of Stay; Machine Learning
Year: 2020 PMID: 32082697 PMCID: PMC7010949 DOI: 10.4258/hir.2020.26.1.20
Source DB: PubMed Journal: Healthc Inform Res ISSN: 2093-3681
Descriptive statistics for all common features
Values are presented as number (%) or mean ± standard deviation. For the binary (Yes or No) type variable, descriptive statistics is shown for “Yes” level only.
CHF: congestive heart failure, AMI: acute myocardial infarction, COPD: chronic obstructive pulmonary disease, PN: pneumonia, DB: type 2 diabetes, LOS: length of stay.
Descriptive statistics for disease cohort-specific variables
For the binary (Yes or No) type variable, descriptive statistics is shown for “Yes” level only.
CHF: congestive heart failure, COPD: chronic obstructive pulmonary disease, AMI: acute myocardial infarction, DB: type 2 diabetes, PN: pneumonia.
Figure 1Data preprocessing steps for building predictive model. SVM: support vector machine.
Summary statistics in data preprocessing steps
Values are presented as number (%).
CHF: congestive heart failure, AMI: acute myocardial infarction, COPD: chronic obstructive pulmonary disease, PN: pneumonia, DB: type 2 diabetes.
Figure 2Flowchart of the predictive model building and best performing model selection. CQ: chi-square feature selection, WR: support vector machinebased wrapper feature selection, AUC: area under the curve, SP: specificity, SN: sensitivity, SMOTE: Synthetic Minority Over-sampling Technique.
Performance comparison of predictive models
AMI: acute myocardial infarction, CHF: congestive heart failure, COPD: chronic obstructive pulmonary disease, DB: type 2 diabetes, PN: pneumonia, AUC: area under the curve, SP: specificity, KNN: k-nearest neighbor, LSVM: linear support vector machine, RF: random forest, NN: multi-layer neural network, WR: support vector machine-based wrapper method, WR+ST: wrapper method with SMOTE (Synthetic Minority Over-sampling Technique), CQ: chi-square filtering method, CQ+ST: chi-square with SMOTE.
aBest model based on AUC, bbest model based on specificity.
Figure 3Performance metric changes (%) with and without SMOTE balancing. AMI: acute myocardial infarction, CHF: congestive heart failure, COPD: chronic obstructive pulmonary disease, DB: type 2 diabetes, PN: pneumonia, SP: specificity, SN: sensitivity, KNN: k-nearest neighbor, LSVM: linear support vector machine, RF: random forest, NN: multi-layer neural network, SMOTE: Synthetic Minority Over-sampling Technique.
Best performing model based on several criteria
AMI: acute myocardial infarction, CHF: congestive heart failure, COPD: chronic obstructive pulmonary disease, DB: type 2 diabetes, PN: pneumonia, AUC: area under the curve, SP: specificity, SN: sensitivity, LSVM: linear support vector machine, RF: random forest, NN: multi-layer neural network, KNN: k-nearest neighbor, WR: support vector machine-based wrapper method, WR+ST: wrapper method with SMOTE (Synthetic Minority Over-sampling Technique), CQ: chi-square filtering method, CQ+ST: chi-square with SMOTE.
Example, a+b+c; a = machine learning method, b = feature selection technique, c = presence of SMOTE balancing; LSVM+CQ+ST = linear support vector mechanics with chi-square feature selection and SMOTE date balancing technique.
Important features extracted by LSVM algorithm
LSVM: linear support vector machine, AMI: acute myocardial infarction, CHF: congestive heart failure, COPD: chronic obstructive pulmonary disease, DB: type 2 diabetes, PN: pneumonia.
Important features extracted using regression analysis
CHF: congestive heart failure, AMI: acute myocardial infarction, COPD: chronic obstructive pulmonary disease, DB: type 2 diabetes, PN: pneumonia, ED: emergency department.