| Literature DB >> 34389924 |
Md Shahid Ansari1, Dinesh Jain2, Haripriya Harikumar3,4, Santu Rana3, Sunil Gupta3, Sandeep Budhiraja5, Svetha Venkatesh3.
Abstract
PURPOSE: Our objective is to identify the predictive factors and predict hospital length of stay (LOS) in dengue patients, for efficient utilization of hospital resources.Entities:
Keywords: Dengue; Elastic-net; Healthcare; Patient’s length of stay (LOS); Predictive models; Random forest
Mesh:
Year: 2021 PMID: 34389924 PMCID: PMC8363490 DOI: 10.1007/s10729-021-09571-3
Source DB: PubMed Journal: Health Care Manag Sci ISSN: 1386-9620
Fig. 1Incidence of cases and deaths due to dengue in India (Source: Directorate of National Vector Borne Disease Control Programme, Dte. GHS, Ministry of Health and Family Welfare)
Attributes characteristics of the length of stay prediction dataset of dengue patients (n = 1148)
| Attributes | Unit | Value | Missing data | Method: |
|---|---|---|---|---|
| alternative value | ||||
| Age | Year | 0-87 | 0 | – |
| Gender | – | male; female | 0 | – |
| Length of Stay (LOS) | Days | 1-94 | 0 | – |
| Admission Type | – | emergency; direct | 0 | – |
| Platelet Count | x109/L | 5.0 - 676.0 | 0 | – |
| Haematocrit | % | 18.9 - 58.9 | 10.2 | Median; 40.4 |
| Haemoglobin | g/dL | 4.8 - 19.4 | 15.5 | Median; 13.4 |
| TLC | x109/L | 0.9 - 45.4 | 16.9 | Median; 5.0 |
| Lymphocytes | % | 2.5 - 100.0 | 18.1 | Median; 35 |
| Monocytes | % | 0.0 - 43.7 | 18.1 | Median; 6.0 |
| Neutrophil | % | 7.0 - 95.8 | 18.1 | Median; 56 |
| Eosinophils | % | 0.0 - 36.9 | 18.8 | Median; 1.0 |
| MCH | Pg | 15 – 43 | 18.6 | Median; 29 |
| MCHC | gm/dL | 28.4 - 37.4 | 18.6 | Median; 33.2 |
| MCV | fL | 51.2 - 122.3 | 18.6 | Median; 86.1 |
| RBC Count | x1012/L | 1.8 - 1640.0 | 18.6 | Median; 4.8 |
| RDW | % | 10.6 - 45.5 | 18.6 | Median; 13.7 |
| SGPT (ALT) | IU/L | 8.0 - 6054.0 | 32.4 | Median; 66 |
| SGOT (AST) | IU/L | 13.0 - 18590.0 | 35.1 | Median; 106 |
| Potassium | mmol/L | 2.7 - 6.6 | 34 | Median; 4.1 |
| Sodium | mmol/L | 117.4 - 150.0 | 34.1 | Median; 135.1 |
| Creatinine | mg/dL | 0.1 - 7.6 | 36.4 | Median; 0.7 |
| Albumin | g/dL | 1.3 - 5.3 | 41.5 | Median; 3.6 |
| Bilirubin Total | mg/dL | 0.1 - 17.1 | 44.7 | Median; 0.6 |
| Bilirubin Direct | mg/dL | 0.0 - 7.0 | 44.8 | Median; 0.2 |
| Bilirubin Indirect | mg/dL | 0.0 - 16.6 | 44.8 | Median; 0.4 |
| Globulin | g/dL | 1.0 - 4.8 | 47 | Median; 2.9 |
| Total Protein | g/dL | 2.9 - 8.5 | 46.9 | Median; 6.5 |
| Basophils | % | 0.0 - 3.3 | 49.7 | Removed |
| PT | S | 8.8 - 100.0 | 78.9 | Removed |
| APTT | S | 22.4 - 126.0 | 83 | Removed |
| Left Effusion | – | 1, yes; 0, no | 40 | Regression imputation |
| Right Effusion | – | 1, yes; 0, no | 40 | Regression imputation |
| Bilateral Effusion | – | 1, yes; 0, no | 40 | Regression imputation |
| Abdominal Free Fluid | – | 1, yes; 0, no | 40 | Regression imputation |
| Enlarged Spleen | – | 1, yes; 0, no | 40 | Regression imputation |
| Enlarged Liver | – | 1, yes; 0, no | 0 | – |
| Dialysis | – | 1, yes; 0, no | 0 | – |
| Assisted Ventilation | – | 1, yes; 0, no | 0 | – |
| Blood Component Transfusion | – | 1, yes; 0, no | 0 | – |
The demographic, investigation, clinical and procedure characteristics of the length of stay data set (n = 1148)
| Variable | Value | ||
|---|---|---|---|
| LOS (Days) | 1, > 5; 0,≤ 5 | ||
| Gender | 1, male; 0, female | ||
| Age | 1, 0-10 else 0; 1, 10-20 else 0; 1, 20-30 else 0; 1, 30-40 else 0; 1, 40-50 else 0; 1, 50-60 else 0, 1, > 60 else 0 | ||
| Marital Status | 1, single else 0; 1, married else 0; 1, unknown else 0 | ||
| Address | 1, NCR ; nonNCR, 0 | ||
| Channel | 1, cash else 0; 1, corporate else 0; 1, PSU else 0; 1, TPA else 0 | ||
| Type of admission | 1, emergency else 0; 1, direct else 0 | ||
| Previous admission | 1, yes; 0, no | ||
| Investigations | Normal | Low | High |
| Heamatocrit (%) | if((men & hct≥ 40 & hct≤ 50) or (women & hct≥ 36 & hct≤ 46)), 1 else 0 | if((men & hct< 40) or (women & hct< 36)), 1 else 0 | if((men & hct> 50) or (women & hct> 46)), 1 else 0 |
| Haemoglobin (g/dL) | if((men & age≥ 13yr & hgb≥ 13 & hgb≤ 17) or (women & age≥ 13yr & hgb≥ 12 & hgb≤ 15) or (age< 13yr &hgb≥ 11 & hgb≤ 15)), 1 else 0 | if((men & age≥ 13yr & hgb< 13) or (women & age≥ 13yr & hgb< 12) or (age< 13yr & hgb< 11)), 1 else 0 | if((men & age≥ 13yr & hgb> 15) or (women & age≥ 13yr & hgb< 17) or (age< 13yr & hgb> 15)), 1 else 0 |
| TLC (x109/L) | if((tlc≥ 4) & (age≥ 17yr & tlc≤ 10) or (age< 17yr & tlc≤ 15)), 1 else 0 | if(tlc< 4), 1 else 0 | if((age≥ 17yr & tlc> 10) or (age< 17yr & tlc> 15)), 1 else 0 |
| Lymphocytes (%) | if((age≥ 8yr & lym≥ 20 & lym≤ 40) or (age< 8yr & lym≥ 40 & lym≤ 75)), 1 else 0 | if((age≥ 8yr & lym< 20) or (age< 8yr & lym< 40)), 1 else 0 | if((age≥ 8yr & lym> 40) or (age< 8yr & lym> 75)), 1 else 0 |
| Monocytes (%) | if(mono≥ 2 & mono≤ 10), 1 else 0 | if(mono< 2), 1 else 0 | if(mono> 10), 1 else 0 |
| Neutrophils (%) | if(neutro≥ 20 & neutro≤ 45), 1 else 0 | if(neutro< 20), 1 else 0 | if(neutro> 45), 1 else 0 |
| Eosinophils (%) | if(eos≥ 1 & eos≤ 6), 1 else 0 | if(eos< 1), 1 else 0 | if(eos> 6), 1 else 0 |
| MCH (Pg) | if(mch≥ 26 & mch≤ 34), 1 else 0 | if(mch< 26), 1 else 0 | if(mch> 34), 1 else 0 |
| MCHC (gm/dL) | if(mchc≥ 32 & mchc≤ 36), 1 else 0 | if(mchc< 32), 1 else 0 | if(mchc> 36), 1 else 0 |
| MCV (fL) | if(mcv≥ 80 & mcv≤ 100), 1 else 0 | if(mcv< 80), 1 else 0 | if(mcv> 100), 1 else 0 |
| RBC Count (x1012/L) | if(rbc≥ 4.5 & rbc≤ 5.5), 1 else 0 | if(rbc< 4.5), 1 else 0 | if(rbc> 5.5), 1 else 0 |
| RDW (%) | if(rdw≥ 11.5 & rdw≤ 14.5), 1 else 0 | if(rdw< 11.5), 1 else 0 | if(rdw> 14.5), 1 else 0 |
| ALT (IU/L) | if((men & alt≥ 17 & alt≤ 63) or (women & alt≥ 14 & alt≤ 54)), 1 else 0 | if((men & alt< 17) or (women & alt< 14)), 1 else 0 | if((men & alt> 63) or (women & alt> 54)), 1 else 0 |
| AST (IU/L) | if(ast≥ 15 & ast≤ 41), 1 else 0 | if(ast< 15), 1 else 0 | if(ast> 41), 1 else 0 |
| Potassium (mmol/L) | if(k ≥ 3.6 & k ≤ 5.1), 1 else 0 | if(k < 3.6), 1 else 0 | if(k > 5.1), 1 else 0 |
| Sodium (mmol/L) | if(na≥ 136 & na≤ 144), 1 else 0 | if(na< 136), 1 else 0 | if(na> 144), 1 else 0 |
| Creatinine (mg/dL) | if((men & cr≥ 0.6 & cr≤ 1.2) or (women & cr≥ 0.4 & cr≤ 1)), 1 else 0 | if((men & cr< 0.6) or (women & cr< 1.4)), 1 else 0 | if((men & cr> 1.2) or (women & cr> 1)), 1 else 0 |
| Albumin (g/dL) | if(albm≥ 3.5 & albm≤ 5), 1 else 0 | if(albm< 3.5), 1 else 0 | if(albm> 5), 1 else 0 |
| Bilirubin Total (mg/dL) | if(tbil≥ 0.3 & tbil≤ 1.2), 1 else 0 | if(tbil< 0.3), 1 else 0 | if(tbil> 1.2), 1 else 0 |
| Bilirubin Direct (mg/dL) | if(dbil≥ 0.1 & dbil≤ 0.5), 1 else 0 | if (dbil< 0.1), 1 else 0 | if(dbil> 0.5), 1 else 0 |
| Bilirubin Indirect (mg/dL) | if(ibil≥ 0.1 & ibil≤ 1), 1 else 0 | if(ibil< 0.1), 1 else 0 | if(ibil> 1), 1 else 0 |
| Globulin (g/dL) | if(glb≥ 2.9 & glb≤ 3.3), 1 else 0 | if(glb< 2.9), 1 else 0 | if(glb> 3.3), 1 else 0 |
| Total Protein (g/dL) | if(tp≥ 6.5 & tp≤ 8.1), 1 else 0 | if(tp< 6.5), 1 else 0 | if(tp> 8.1), 1 else 0 |
| Platelet Count (x109/L) | Severe Thrombocytopenia | Moderate Thrombocytopenia | Mild Thrombocytopenia |
| if(plt< 20), 1 else 0 | if(plt> 20 & plt≤ 50), 1 else 0 | if(plt> 50 & plt≤ 100), 1 else 0 | |
1 There are two types of admission (emergency & direct) and we have created two binary independent variables for this and the binary variables only reflect the presence of the label as ’1’ or ’0’ for the individual patients. For example, if the patient is admitted to an emergency then the value is 1, if not then 0. Similarly, for the channel, there are 4 channels (cash, corporate, PSU, TPA) and we have created 4 independent binary variables for this and converted values as 0/1 similar to what we have done for the type of admission, 2 TLC: total leucocyte count; ALT: alanine aminotransferase; AST: aspartate aminotransferase; RBC: red blood cell count; RDW: red cell distribution width; hct: Heamatocrit; hgb:Haemoglobin, lym:Lymphocytes; mono:Monocytes; neutro:Neutrophils; eos:Eosinophils; k:Potassium; na:Sodium; cr:Creatinine; alb:Albumin; tbil:Bilirubin Total; dbil:Bilirubin Direct; ibil:Bilirubin Indirect; glb:Globulin; tp:Total Protein; plt:Platelet Count
Fig. 2Distribution of the Length of Stay (in days) in Dengue data from February 2012 to September 2017
Fig. 3Features with 40% missing data values and their test AUC using regression imputation
Some important features of all 1148 patients in the study population comparing those LOS≤ 5 days with those LOS> 5 days
| Parameter | LOS ≤ 5 days | LOS > 5 days | Overall | p value* | |
|---|---|---|---|---|---|
| (n = 784) | (n = 364) | (n = 1148) | |||
| LOS | n (%) | 784 (68%) | 364 (32%) | 1148 | |
| Age | Median (IQR) | 27 (1-87) | 30 (1-85) | 28 (1-87) | 0.00000 |
| Male Gender | 474 (60.5%) | 218 (59.9%) | 692 (60.3%) | 0.90560 | |
| Admission Type | Emergency | 529 (67.5%) | 274 (75.3%) | 803 (69.9%) | 0.00897 |
| Direct | 255 (32.5%) | 90 (24.7%) | 345 (30.1%) | 0.00934 | |
| Platelet Count (x109/L) | Median (x109/L) | 100 (8-569) | 135 (5-676) | 105 (5-676) | 0.00095 |
| Thrombocytopenia n(%) | 405 (71.6%) | 161 (28.4%) | 566 (49.3%) | 0.02267 | |
| TLC (x109/L) | Mean (x109/L) | 5.5 (1.3-43.6) | 5.5 (0.9-45.4) | 5.5 (0.9-45.4) | 0.00095 |
| Low (%) | 175 (22.3%) | 84 (23.1%) | 259 (22.6%) | 0.03935 | |
| High (%) | 62 (7.9%) | 49 (13.5%) | 111 (9.7%) | 0.00430 | |
| Lymphocytes (%) | Mean | 34 (3.0-92.0) | 30 (2.5-78.8) | 34 (2.5-92.0) | 0.00000 |
| Low | 90 (11.5%) | 102 (28%) | 192 (16.7%) | 0.00000 | |
| High | 212 (27.0%) | 33 (9.1%) | 245 (21.3%) | 0.00000 | |
| Neutrophils (%) | Mean (%) | 55 (7.0-91.9) | 60 (14.8-95.8) | 55 (7.0-95.8) | 0.00000 |
| High | 585 (74.6%) | 326 (89.6%) | 911 (79.4%) | 0.00000 | |
| ALT (IU/L) | Mean (IU/L) | 62.5 (11-6054) | 62.5 (8-3650) | 62.5 (8-6054) | 0.00062 |
| Low | 14 (1.8%) | 16 (4.4%) | 30 (2.6%) | 0.01728 | |
| High | 403 (51.4%) | 159 (43.7%) | 562 (49%) | 0.01769 | |
| AST (IU/L) | Mean (IU/L) | 108 (13-18590) | 108 (14-12559) | 108 (13-18590) | 0.00052 |
| High | 728 (92.9%) | 313 (86%) | 1041 (90.7%) | 0.00030 | |
| Haematocrit (%) | Low | 41.1 ± 5.7 | 39.4 ± 6.1 | 40.6 ± 5.9 | 0.00000 |
| Haemoglobin (g/dL) | Low | 13.3 ± 1.9 | 12.6 ± 2.2 | 13.1 ± 2 | 0.00000 |
| Eosinophils (%) | Low | 1 (0.0-36.9) | 1 (0.0-18.9) | 1 (0.0-36.9) | 0.00003 |
| RBC Count (x1012/L) | Low | 4.67 (1.76-7.7) | 4.67 (2.48-8.6) | 4.67 (1.76-8.6) | 0.00002 |
| RDW (%) | High | 13.8 (10.8-45.5) | 13.8 (10.8-31.2) | 13.8 (10.8-45.5) | 0.00042 |
| Creatinine (mg/dL) | High | 0.7 (6.1-6.3) | 0.7 (0.1-7.6) | 0.7 (0.1-7.6) | 0.00029 |
| Albumin (g/dL) | High | 3.6 (2.0-4.8) | 3.6 (1.3-5.0) | 3.6 (1.3-5.0) | 0.00000 |
| Radiological Findings | Bilateral effusion | 59 (7.5%) | 40 (11%) | 99 (8.6%) | 0.06690 |
| Right effusion | 151 (19.3%) | 92 (25.3%) | 243 (21.2%) | 0.02485 | |
| Left effusion | 66 (8.4%) | 46 (12.6%) | 112 (9.8%) | 0.03277 | |
| Assisted Ventilation | 8 (1.0%) | 21 (5.8%) | 29 (2.5%) | 0.00000 | |
| Blood Transfusion | 130 (16.6%) | 145 (39.8%) | 275 (24%) | 0.00000 |
TLC: total leucocyte count; ALT: alanine aminotransferase; AST: aspartate aminotransferase; RBC: red blood cell count; RDW: red cell distribution width. Categorical variables are summarized as n (%), Continuous variables are presented as mean±SD or median (range) if SD> 50% of the mean. *Pearson chi-square/Fisher exact test applied
Fig. 4Age and gender-wise distribution of dengue confirmed cases
Fig. 5Platelet counts and age-wise distribution of dengue cases
Fig. 6Top risky and safe features extracted by logistic regression with elastic-net model. Positive and negative values represent the risky and safe features respectively and top 18 important variables extracted by the random forest model. A higher relative weight value represents the higher importance of the features
Comparison of the performance measures for the predictive models on test data
| Model | AUC | Sens | Spec | PPV | NPV |
|---|---|---|---|---|---|
| LR with elastic-net | 0.75 | 0.24 | 0.97 | 0.82 | 0.72 |
| Random forest | 0.72 | 0.31 | 0.91 | 0.64 | 0.73 |
This performance result for each model on 345 records which is 30% of the data (n = 1148)
Fig. 7Receiver-operating characteristics curve analysis of both the models on test data predicting prolonged hospitalisation among dengue patients