| Literature DB >> 32620819 |
Yoshihiko Raita1, Carlos A Camargo2, Charles G Macias3, Jonathan M Mansbach4, Pedro A Piedra5, Stephen C Porter6,7, Stephen J Teach8, Kohei Hasegawa2.
Abstract
We aimed to develop machine learning models to accurately predict bronchiolitis severity, and to compare their predictive performance with a conventional scoring (reference) model. In a 17-center prospective study of infants (aged < 1 year) hospitalized for bronchiolitis, by using routinely-available pre-hospitalization data as predictors, we developed four machine learning models: Lasso regression, elastic net regression, random forest, and gradient boosted decision tree. We compared their predictive performance-e.g., area-under-the-curve (AUC), sensitivity, specificity, and net benefit (decision curves)-using a cross-validation method, with that of the reference model. The outcomes were positive pressure ventilation use and intensive treatment (admission to intensive care unit and/or positive pressure ventilation use). Of 1,016 infants, 5.4% underwent positive pressure ventilation and 16.0% had intensive treatment. For the positive pressure ventilation outcome, machine learning models outperformed reference model (e.g., AUC 0.88 [95% CI 0.84-0.93] in gradient boosted decision tree vs 0.62 [95% CI 0.53-0.70] in reference model), with higher sensitivity (0.89 [95% CI 0.80-0.96] vs. 0.62 [95% CI 0.49-0.75]) and specificity (0.77 [95% CI 0.75-0.80] vs. 0.57 [95% CI 0.54-0.60]). The machine learning models also achieved a greater net benefit over ranges of clinical thresholds. Machine learning models consistently demonstrated a superior ability to predict acute severity and achieved greater net benefit.Entities:
Mesh:
Year: 2020 PMID: 32620819 PMCID: PMC7335203 DOI: 10.1038/s41598-020-67629-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Patient characteristics and clinical outcomes in 1,016 infants hospitalized for bronchiolitis.
| Variables | n = 1,016 |
|---|---|
| Age (month), median (IQR) | 3.2 (1.6–6.0) |
| Female sex | 406 (40.0) |
| Race/ethnicity | |
| Non-Hispanic white | 430 (42.0) |
| Non-Hispanic black | 239 (23.5) |
| Others | 347 (34.2) |
| Prenatal maternal smoking | 147 (14.7) |
| Gestational age (week) | |
| 32–33 | 35 (3.4) |
| 34–36 | 151 (14.9) |
| 37–39 | 417 (41.0) |
| 40–41 | 391 (38.5) |
| ≥ 42 | 22 (2.2) |
| Birth weight (kg) | |
| 0–1.3 | 3 (0.3) |
| 1.4–2.2 | 61 (6.0) |
| 2.3–3.1 | 343 (33.9) |
| ≥ 3.2 | 604 (59.7) |
| Postnatal ICU admission | 167 (16.4) |
| Previous hospital admission | 162 (16.0) |
| Previous ICU admission | 17 (1.7) |
| Previous breathing problems (count) | 32 (3.2) |
| 0 | 810 (79.7) |
| 1 | 160 (15.7) |
| 2 | 46 (4.5) |
| History of eczema | 149 (14.7) |
| Poor feeding | 32 (3.2) |
| Cyanosis within 24 h | 92 (9.1) |
| Apnea | 131 (12.9) |
| Apnea within 24 h | 86 (8.5) |
| Duration of symptom ( | 53 (5.2) |
| Vital signs at presentation | |
| Temperature (F), median (IQR) | 99.4 (98.8–100) |
| Pulse rate (bpm), median (IQR) | 162 (150–176) |
| Respiratory rate (per min), median(IQR) | 48 (40–60) |
| Use of supplemental oxygen (%) | 51 (5) |
| Oxygen saturation level (%) at room air (IQR) | 96 (94–98) |
| Oxygen saturation level (%) with the use of supplemental oxygen (IQR) | 98 (95–100) |
| Wheeze | 602 (62.3) |
| Severity of retraction | |
| None | 192 (19.6) |
| Mild | 431 (43.9) |
| Moderate/severe | 358 (36.5) |
| Apnea | 56 (5.5) |
| Dehydration | 392 (39.5) |
| RSV | 821 (80.8) |
| 0–60 | |
| Positive pressure ventilation usea | 55 (5.4) |
| Intensive treatment useb | 163 (16.0) |
Data are no. (%) of infants unless otherwise indicated. Percentages may not equal 100, because of rounding and missingness.
bpm beats per minute, IQR interquartile range, ICU intensive care unit, RSV respiratory syncytial virus.
aInfants with bronchiolitis who underwent continuous positive airway ventilation and/or mechanical ventilation.
bInfants with bronchiolitis who were admitted to ICU and/or who underwent positive pressure ventilation.
Figure 1Prediction ability of the reference and machine learning models for positive pressure ventilation outcome in the overall cross-validation dataset. (A) Receiver-operating-characteristics (ROC) curves. The corresponding value of the area under the receiver-operating-characteristics curve (AUC) for each model are presented in Table 2. (B) Decision curve analysis. X-axis indicates the threshold probability for positive pressure ventilation outcome; Y-axis indicates the net benefit. Compared to the reference model, the net benefit of all machine learning models was larger over the range of clinical threshold.
Prediction performance of the reference, and machine learning models in infants hospitalized for bronchiolitis.
| Outcomes and models | AUC | P-valuea | NRIb | P-valueb | Sensitivity | Specificity | PPV | NPV |
|---|---|---|---|---|---|---|---|---|
| Reference model | 0.62 (0.53–0.70) | Reference | Reference | Reference | 0.62 (0.49–0.75) | 0.57 (0.54–0.60) | 0.075 (0.054–0.097) | 0.96 (0.95–0.97) |
| Logistic regression with Lasso regularization | 0.88 (0.84–0.93) | < 0.001 | 1.09 (0.87–1.32) | < 0.001 | 0.84 (0.73–0.93) | 0.79 (0.77–0.82) | 0.19 (0.14–0.24) | 0.99 (0.99–0.99) |
| Logistic regression with elastic net regularization | 0.89 (0.85–0.92) | < 0.001 | 1.05 (0.82–1.28) | < 0.001 | 0.89 (0.80–0.96) | 0.73 (0.70–0.75) | 0.15 (0.11–0.18) | 0.99 (0.99–0.99) |
| Random forest | 0.89 (0.85–0.92) | < 0.001 | 1.17 (0.96–1.38) | < 0.001 | 0.85 (0.75–0.95) | 0.74 (0.71–0.76) | 0.15 (0.12–0.21) | 0.99 (0.99–0.99) |
| Gradient boosted decision tree | 0.88 (0.84–0.93) | < 0.001 | 1.08 (0.84–1.33) | < 0.001 | 0.89 (0.80–0.96) | 0.77 (0.75–0.80) | 0.17 (0.08–0.21) | 0.99 (0.99–0.99) |
| Reference model | 0.62 (0.57–0.67) | Reference | Reference | Reference | 0.58 (0.55–0.62) | 0.58 (0.50–0.66) | 0.21 (0.18–0.24) | 0.88 (0.86–0.89) |
| Logistic regression with Lasso regularization | 0.79 (0.76–0.83) | < 0.001 | 0.68 (0.52–0.84) | < 0.001 | 0.75 (0.69–0.82) | 0.70 (0.66–0.73) | 0.31 (0.26–0.38) | 0.94 (0.93–0.94) |
| Logistic regression with elastic net regularization | 0.80 (0.76–0.83) | < 0.001 | 0.58 (0.42–0.74) | < 0.001 | 0.72 (0.64–0.79) | 0.74 (0.71–0.77) | 0.33 (0.28–0.41) | 0.93 (0.92–0.94) |
| Random forest | 0.79 (0.75–0.84) | < 0.001 | 0.70 (0.55–0.86) | < 0.001 | 0.70 (0.63–0.77) | 0.78 (0.76–0.81) | 0.37 (0.29–0.45) | 0.93 (0.92–0.94) |
| Gradient boosted decision tree | 0.79 (0.75–0.84) | < 0.001 | 0.72 (0.57–0.87) | < 0.001 | 0.74 (0.67–0.80) | 0.74 (0.71–0.77) | 0.33 (0.26–0.42) | 0.93 (0.92–0.94) |
AUC area under the receiver-operating-characteristic curve, NRI net reclassification improvement, PPV positive predictive value, NPV negative predictive value.
aP-value was calculated to compare area-under-the-curve of the reference model with that of each machine model.
bWe used continuous NRI and its P-value.
The number of actual and predicted outcomes of prediction models, according to the score of the reference model.
| Reference model (score) | Positive pressure ventilation use | Reference model | Lasso regression | Elastic net regression | Random forest | Gradient boosted tree | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Correctly identified outcome | Predicted outcome | Correctly identified outcome | Predicted outcome | Correctly identified outcome | Predicted outcome | Correctly identified outcome | Predicted outcome | Correctly identified outcome | Predicted outcome | ||
| 0: (n = 41) | 1 (2.4) | 0 | 6 | 0 | 7 | 0 | 8 | 1 | 9 | 0 | 8 |
| 1: (n = 64) | 3 (4.7) | 1 | 15 | 3 | 11 | 2 | 14 | 2 | 11 | 2 | 11 |
| 2: (n = 359) | 13 (3.6) | 9 | 156 | 12 | 80 | 11 | 106 | 11 | 78 | 12 | 79 |
| 3: (n = 270) | 12 (4.4) | 7 | 109 | 10 | 52 | 12 | 65 | 10 | 58 | 11 | 56 |
| 4: (n = 122) | 3 (0.8) | 2 | 46 | 1 | 20 | 1 | 30 | 2 | 41 | 1 | 24 |
| 5: (n = 58) | 8 (13.8) | 3 | 22 | 7 | 21 | 8 | 24 | 7 | 31 | 8 | 24 |
| 6: (n = 15) | 0 (0.0) | 0 | 3 | 0 | 6 | 0 | 6 | 0 | 8 | 0 | 6 |
| 7: (n = 41) | 5 (12.5) | 3 | 22 | 3 | 22 | 5 | 30 | 4 | 28 | 5 | 29 |
| 8: (n = 24) | 4 (16.7) | 2 | 8 | 4 | 12 | 4 | 14 | 4 | 17 | 4 | 15 |
| 9: (n = 11) | 0 (0.0) | 0 | 2 | 0 | 4 | 0 | 4 | 0 | 7 | 0 | 5 |
| 10: (n = 8) | 5 (62.5) | 3 | 3 | 5 | 7 | 5 | 7 | 5 | 8 | 5 | 7 |
| 11: (n = 3) | 1 (33.3) | 0 | 2 | 1 | 2 | 1 | 3 | 1 | 3 | 1 | 2 |
| 12: (n = 0) | 0 (0) | ||||||||||
| Overall (n = 1,016) | 55 (5.4) | 30 (55) | 394 | 46 (84) | 244 | 49 (89) | 311 | 47 (85) | 299 | 49 (89) | 266 |
Figure 2Prediction ability of the reference and machine learning models for intensive treatment outcome in the overall cross-validated dataset. (A) Receiver-operating-characteristics (ROC) curves. The corresponding values of the area under the receiver-operating-characteristics curve (AUC) for each model are presented in Table 2. (B) Decision curve analysis. X-axis indicates the threshold probability for intensive treatment outcome; Y-axis indicates the net benefit. Compared to the reference model, the net benefit of all machine learning models was larger over the range of clinical threshold.