| Literature DB >> 31961332 |
Gang Luo1, Shan He2, Bryan L Stone3, Flory L Nkoy3, Michael D Johnson3.
Abstract
BACKGROUND: As a major chronic disease, asthma causes many emergency department (ED) visits and hospitalizations each year. Predictive modeling is a key technology to prospectively identify high-risk asthmatic patients and enroll them in care management for preventive care to reduce future hospital encounters, including inpatient stays and ED visits. However, existing models for predicting hospital encounters in asthmatic patients are inaccurate. Usually, they miss over half of the patients who will incur future hospital encounters and incorrectly classify many others who will not. This makes it difficult to match the limited resources of care management to the patients who will incur future hospital encounters, increasing health care costs and degrading patient outcomes.Entities:
Year: 2020 PMID: 31961332 PMCID: PMC7001050 DOI: 10.2196/16080
Source DB: PubMed Journal: JMIR Med Inform
The confusion matrix.
| Class | Future hospital encounters for asthma | No future hospital encounter for asthma |
| Predicted future hospital encounters for asthma | True positive | False positive |
| Predicted no future hospital encounter for asthma | False negative | True negative |
Demographic characteristics of the asthmatic patients at Intermountain Healthcare during 2005 to 2016.
| Characteristics | Data instances (N=315,308), n (%) | Data instances linked to hospital encounters for asthma in the following year (N=11,332), n (%) | Data instances linked to no hospital encounter for asthma in the following year (N=303,976), n (%) | |
|
| ||||
| <6 | 37,826 (12.00) | 3118 (27.52) | 34,708 (11.42) | |
| 6 to <18 | 53,162 (16.86) | 2590 (22.86) | 50,572 (16.64) | |
| 18 to 65 | 177,439 (56.27) | 5003 (44.15) | 172,436 (56.73) | |
| 65+ | 46,881 (14.87) | 621 (5.48) | 46,260 (15.22) | |
|
| ||||
| Male | 127,217 (40.35) | 5169 (45.61) | 122,048 (40.15) | |
| Female | 188,091 (59.65) | 6163 (54.39) | 181,928 (59.85) | |
|
| ||||
| American Indian or Alaskan native | 2509 (0.80) | 214 (1.89) | 2295 (0.76) | |
| Asian | 2197 (0.70) | 77 (0.68) | 2120 (0.70) | |
| Black or African American | 5751 (1.82) | 460 (4.06) | 5291 (1.74) | |
| Native Hawaiian or other Pacific Islander | 4288 (1.36) | 411 (3.63) | 3877 (1.28) | |
| White | 282,626 (89.63) | 9420 (83.13) | 273,206 (89.88) | |
| Unknown or not reported | 17,937 (5.69) | 750 (6.62) | 17,187 (5.65) | |
|
| ||||
| Hispanic | 29,293 (9.29) | 2279 (20.11) | 27,014 (8.89) | |
| Non-Hispanic | 252,599 (80.11) | 8157 (71.98) | 244,442 (80.41) | |
| Unknown or not reported | 33,416 (10.60) | 896 (7.91) | 32,520 (10.70) | |
|
| ||||
| Private | 206,641 (65.54) | 6192 (54.64) | 200,449 (65.94) | |
| Public | 80,154 (25.42) | 3238 (28.57) | 76,916 (25.30) | |
| Self-paid or charity | 28,513 (9.04) | 1902 (16.78) | 26,611 (8.75) | |
|
| ||||
| ≤3 | 234,832 (74.48) | 7666 (67.65) | 227,166 (74.73) | |
| >3 | 80,476 (25.52) | 3666 (32.35) | 76,810 (25.27) | |
|
| ||||
| Inhaled corticosteroid | 78,105 (24.77) | 4539 (40.05) | 73,566 (24.20) | |
| Inhaled steroid and rapid-onset long-acting beta2-agonist combination | 44,992 (14.27) | 2196 (19.38) | 42,796 (14.08) | |
| Leukotriene modifier | 35,507 (11.26) | 2320 (20.47) | 33,187 (10.92) | |
| Long-acting beta2-agonist | 1813 (0.58) | 69 (0.61) | 1744 (0.57) | |
| Mast cell stabilizer | 121 (0.04) | 7 (0.06) | 114 (0.04) | |
| Inhaled short-acting beta2-agonist | 129,528 (41.08) | 7545 (66.58) | 121,983 (40.13) | |
| Systemic corticosteroid | 136,642 (43.34) | 7324 (64.63) | 129,318 (42.54) | |
|
| ||||
| Allergic rhinitis | 4715 (1.50) | 181 (1.60) | 4534 (1.49) | |
| Anxiety or depression | 56,961 (18.07) | 1716 (15.14) | 55,245 (18.17) | |
| Bronchopulmonary dysplasia | 429 (0.14) | 35 (0.31) | 394 (0.13) | |
| Chronic obstructive pulmonary disease | 12,887 (4.09) | 391 (3.45) | 12,496 (4.11) | |
| Cystic fibrosis | 458 (0.15) | 11 (0.10) | 447 (0.15) | |
| Eczema | 4927 (1.56) | 443 (3.91) | 4484 (1.48) | |
| Gastroesophageal reflux | 56,196 (17.82) | 1309 (11.55) | 54,887 (18.06) | |
| Obesity | 36,291 (11.51) | 1076 (9.50) | 35,215 (11.58) | |
| Premature birth | 5542 (1.76) | 440 (3.88) | 5102 (1.68) | |
| Sinusitis | 14,756 (4.68) | 592 (5.22) | 14,164 (4.66) | |
| Sleep apnea | 20,892 (6.63) | 471 (4.16) | 20,421 (6.72) | |
|
| ||||
| Current smoker | 35,551 (11.28) | 1811 (15.98) | 33,740 (11.10) | |
| Former smoker | 19,304 (6.12) | 569 (5.02) | 18,735 (6.16) | |
| Never smoker or unknown | 260,453 (82.60) | 8952 (79.00) | 251,501 (82.74) | |
Demographic characteristics of the asthmatic patients at Intermountain Healthcare in 2017.
| Characteristics | Data instances (N=19,256), n (%) | Data instances linked to hospital encounters for asthma in the following year (N=812), n (%) | Data instances linked to no hospital encounter for asthma in the following year (N=18,444), n (%) | |
|
| ||||
| <6 | 1877 (9.75) | 199 (24.51) | 1678 (9.10) | |
| 6 to <18 | 3235 (16.80) | 181 (22.29) | 3054 (16.56) | |
| 18 to 65 | 10,265 (53.31) | 386 (47.54) | 9879 (53.56) | |
| 65+ | 3879 (20.14) | 46 (5.67) | 3833 (20.78) | |
|
| ||||
| Male | 7816 (40.59) | 373 (45.94) | 7443 (40.35) | |
| Female | 11,440 (59.41) | 439 (54.06) | 11,001 (59.65) | |
|
| ||||
| American Indian or Alaskan native | 159 (0.83) | 13 (1.60) | 146 (0.79) | |
| Asian | 205 (1.06) | 10 (1.23) | 195 (1.06) | |
| Black or African American | 403 (2.09) | 42 (5.17) | 361 (1.96) | |
| Native Hawaiian or other Pacific Islander | 346 (1.80) | 47 (5.79) | 299 (1.62) | |
| White | 17,706 (91.95) | 681 (83.87) | 17,025 (92.31) | |
| Unknown or not reported | 437 (2.27) | 19 (2.34) | 418 (2.27) | |
|
| ||||
| Hispanic | 2212 (11.49) | 192 (23.65) | 2020 (10.95) | |
| Non-Hispanic | 16,860 (87.56) | 618 (76.11) | 16,242 (88.06) | |
| Unknown or not reported | 184 (0.96) | 2 (0.25) | 182 (0.99) | |
|
| ||||
| Private | 12,850 (66.73) | 462 (56.90) | 12,388 (67.17) | |
| Public | 5128 (26.63) | 208 (25.62) | 4920 (26.68) | |
| Self-paid or charity | 1278 (6.64) | 142 (17.49) | 1136 (6.16) | |
|
| ||||
| ≤3 | 11,133 (57.82) | 423 (52.09) | 10,710 (58.07) | |
| >3 | 8123 (42.18) | 389 (47.91) | 7734 (41.93) | |
|
| ||||
| Inhaled corticosteroid | 7241 (37.60) | 424 (52.22) | 6817 (36.96) | |
| Inhaled steroid and rapid-onset long-acting beta2-agonist combination | 4400 (22.85) | 222 (27.34) | 4178 (22.65) | |
| Leukotriene modifier | 3573 (18.56) | 209 (25.74) | 3364 (18.24) | |
| Long-acting beta2-agonist | 52 (0.27) | 5 (0.62) | 47 (0.25) | |
| Mast cell stabilizer | 8 (0.04) | 0 (0.00) | 8 (0.04) | |
| Inhaled short-acting beta2-agonist | 13,785 (71.59) | 739 (91.01) | 13,046 (70.73) | |
| Systemic corticosteroid | 12,020 (62.42) | 693 (85.34) | 11,327 (61.41) | |
|
| ||||
| Allergic rhinitis | 392 (2.04) | 10 (1.23) | 382 (2.07) | |
| Anxiety or depression | 3946 (20.49) | 131 (16.13) | 3815 (20.68) | |
| Bronchopulmonary dysplasia | 15 (0.08) | 3 (0.37) | 12 (0.07) | |
| Chronic obstructive pulmonary disease | 1056 (5.48) | 23 (2.83) | 1033 (5.60) | |
| Cystic fibrosis | 95 (0.49) | 1 (0.12) | 94 (0.51) | |
| Eczema | 307 (1.59) | 34 (4.19) | 273 (1.48) | |
| Gastroesophageal reflux | 3548 (18.43) | 71 (8.74) | 3477 (18.85) | |
| Obesity | 3505 (18.20) | 116 (14.29) | 3389 (18.37) | |
| Premature birth | 476 (2.47) | 41 (5.05) | 435 (2.36) | |
| Sinusitis | 780 (4.05) | 34 (4.19) | 746 (4.04) | |
| Sleep apnea | 3003 (15.60) | 78 (9.61) | 2925 (15.86) | |
|
| ||||
| Current smoker | 2391 (12.42) | 146 (17.98) | 2245 (12.17) | |
| Former smoker | 2326 (12.08) | 83 (10.22) | 2243 (12.16) | |
| Never smoker or unknown | 14,539 (75.50) | 583 (71.80) | 13,956 (75.67) | |
Figure 1Our model’s receiver operating characteristic curve.
Our final model’s performance metrics when differing top percentages of asthmatic patients with the highest predicted risk were used as the cutoff threshold for conducting binary classification.
| Top percentage of asthmatic patients with the highest predicted risk (%) | Accuracy (%) | Sensitivity (%) | Specificity (%) | Positive predictive value (%) | Negative predictive value (%) |
| 1.00 | 95.89 | 13.05 | 99.53 | 55.21 | 96.30 |
| 2.00 | 95.54 | 20.81 | 98.83 | 43.90 | 96.59 |
| 3.00 | 95.00 | 26.23 | 98.03 | 36.92 | 96.79 |
| 4.00 | 94.48 | 32.02 | 97.23 | 33.77 | 97.01 |
| 5.00 | 93.84 | 36.21 | 96.38 | 30.56 | 97.17 |
| 6.00 | 93.19 | 40.39 | 95.52 | 28.40 | 97.33 |
| 7.00 | 92.53 | 44.33 | 94.65 | 26.73 | 97.48 |
| 8.00 | 91.85 | 48.15 | 93.77 | 25.39 | 97.62 |
| 9.00 | 91.09 | 51.11 | 92.85 | 23.95 | 97.73 |
| 10.00 | 90.31 | 53.69 | 91.93 | 22.65 | 97.83 |
| 15.00 | 86.44 | 67.00 | 87.29 | 18.84 | 98.36 |
| 20.00 | 81.95 | 73.15 | 82.34 | 15.42 | 98.58 |
| 25.00 | 77.41 | 78.57 | 77.36 | 13.25 | 98.80 |
Our final model’s confusion matrix when the cutoff threshold for conducting binary classification was set at the top 10.00% (1926/19,256) of asthmatic patients with the highest predicted risk.
| Class | Future hospital encounters for asthma, n | No future hospital encounter for asthma, n |
| Predicted future hospital encounters for asthma | 436 | 1489 |
| Predicted no future hospital encounter for asthma | 376 | 16,955 |
A comparison of our final model and multiple prior models for predicting inpatient stays and emergency department visits in asthmatic patients.
| Model | Prediction target | Classification algorithm | Features used in the model, n | Data instances, n | Area under the receiver operating characteristic curve | Sensitivity (%) | Specificity (%) | Positive predictive value (%) | Negative predictive value (%) |
| Our final model | Hospital encounters for asthma | Extreme gradient boosting | 142 | 334,564 | 0.859 | 53.69 | 91.93 | 22.65 | 97.83 |
| Loymans et al [ | Asthma exacerbation | Logistic regression | 7 | 611 | 0.8 | —a | — | — | — |
| Schatz et al [ | Inpatient stay for asthma in children | Logistic regression | 5 | 4197 | 0.781 | 43.9 | 89.8 | 5.6 | 99.1 |
| Schatz et al [ | Inpatient stay for asthma in adults | Logistic regression | 3 | 6904 | 0.712 | 44.9 | 87.0 | 3.9 | 99.3 |
| Eisner et al [ | Inpatient stay for asthma | Logistic regression | 1 | 2858 | 0.689 | — | — | — | — |
| Eisner et al [ | EDb visit for asthma | Logistic regression | 3 | 2415 | 0.751 | — | — | — | — |
| Sato et al [ | Severe asthma exacerbation | Classification and regression tree | 3 | 78 | 0.625 | — | — | — | — |
| Miller et al [ | Hospital encounters for asthma | Logistic regression | 17 | 2821 | 0.81 | — | — | — | — |
| Yurk et al [ | Hospital encounters or lost day for asthma | Logistic regression | 11 | 4888 | 0.78 | 77 | 63 | 82 | 56 |
| Lieu et al [ | Inpatient stay for asthma | Proportional hazards regression | 7 | 16,520 | 0.79 | — | — | — | — |
| Lieu et al [ | ED visit for asthma | Proportional hazards regression | 7 | 16,520 | 0.69 | — | — | — | — |
| Lieu et al [ | Hospital encounters for asthma | Classification and regression tree | 4 | 7141 | — | 49.0 | 83.6 | 18.5 | — |
| Schatz et al [ | Hospital encounters for asthma | Logistic regression | 4 | 14,893 | 0.614 | 25.4 | 92.0 | 22.0 | 93.2 |
| Forno et al [ | Severe asthma exacerbation | Scoring | 17 | 615 | 0.75 | — | — | — | — |
aThe performance measure is not reported in the original paper describing the model.
bED: emergency department.