| Literature DB >> 31328221 |
Alpha Forna1, Pierre Nouvellet1,2, Ilaria Dorigatti1, Christl A Donnelly1,3.
Abstract
BACKGROUND: The 2013-2016 West African Ebola epidemic has been the largest to date with >11 000 deaths in the affected countries. The data collected have provided more insight into the case fatality ratio (CFR) and how it varies with age and other characteristics. However, the accuracy and precision of the naive CFR remain limited because 44% of survival outcomes were unreported.Entities:
Keywords: imputation; infectious disease epidemiology; machine learning; survival; viral hemorrhagic disease
Mesh:
Year: 2020 PMID: 31328221 PMCID: PMC7286386 DOI: 10.1093/cid/ciz678
Source DB: PubMed Journal: Clin Infect Dis ISSN: 1058-4838 Impact factor: 9.079
Figure 1.Schematic summary of the analysis steps used in this study. Abbreviations: AUC, area under the receiver operating characteristic curve; BRT, boosted regression tree; PCC, percentage correctly classified.
Case Fatality Ratio Estimates Without Imputation, Unadjusted With Imputation, and Adjusted With Imputation for Confirmed, Probable, and Suspected Cases
| Cases in the Dataset Without Imputation, n | Cases in the Dataset Including Those With Imputation, n | CFR Without Imputation, % (95% CI) | Unadjusted CFR With Imputation, % (95% CI) | Adjusted CFR With Imputation, % (95% CI) | |
|---|---|---|---|---|---|
| Guinea | 3740 | 3757 | 65.8 (61.6–69.9) | 65.6 (61.3–69.6) | 65.6 (61.3–69.6) |
| Liberia | 4624 | 8130 | 71.7 (67.2–75.6) | 69.7 (55.6–78.7) | 79.2 (45.4–84.1) |
| Sierra Leone | 10 280 | 21 451 | 81.3 (79.3–83.3) | 74.6 (52.2–84.3) | 89.1 (40.8–91.6) |
| Overalla | 18 644 | 33 338 | 75.1 (73.5–76.6) | 71.9 (56.1–79.8) | 82.8 (45.6–85.6) |
All CFR estimates and corresponding CIs were calculated using a nonparametric bootstrap of the BRT model.Abbreviations: BRT, boosted regression tree; CI, confidence interval; CFR, case fatality ratio.
aAdjusted CFR estimated using tp function from the R package RSurveillance to correct for bias in BRT model performance.
Figure 2.Proportion of known survival outcomes (ie, dead and alive) and unknown survival outcomes (ie, missing) for “confirmed, probable, and suspected” cases. A, Proportion of deaths, survivals, and entries with unknown outcome by age group (in years). B, Proportion of deaths, survivals, and entries with unknown outcome by reporting delay (in days).
Relative Contributions of the Predictors in the Minimal BRT Model Using 10-fold Cross-validation; tc = 27, lr = 0.001, and bf = 0.75; and Trained on 1000 Training Sets Generated by Randomly Sampling 65% of Cases With Known Survival Outcomes Without Bootstrapping
| Predictors | Relative Contribution, % |
|---|---|
| District of origin | 41.7 |
| Hospitalization status | 28.9 |
| Age | 10.7 |
| Case Classification | 10.1 |
| Quarter | 2.2 |
| Delay | 1.8 |
| Anorexia | 1.6 |
| Difficult breathing | 1.6 |
| Fever | 1.0 |
| Fatigue | 0.5 |
The minimal model used these 10 predictors.Abbreviations: bf, bag fraction; BRT, boosted regression tree; lr, learning rate; tc, tree complexity.
Figure 3.CFRs by age, delay, country, and fever. Median and 95% confidence intervals are plotted (based on 1000 bootstrap realizations for “confirmed, probable and suspected cases”). Abbreviations: CFR, case fatality ratio; S/Leone, Sierra Leone.
Boosted Regression Tree Model Performance
| Model Performance | |||
|---|---|---|---|
| Performance Measures | Full Modela With Bootstrap Median, % (95% CI) | Simplified Modelb With Bootstrap Median, % (95% CI) | Minimal Modelc With Bootstrap Median, % (95% CI) |
| Sensitivity | 70.5 (56.0–76.5) | 69.7 (52.5–75.6) | 69.7 (51.7–75.7) |
| Specificity | 70.5 (56.8–75.9) | 69.8 (54.1–75.6) | 69.8 (51.2–75.6) |
| PCC | 70.5 (56.6–75.9) | 69.9 (53.7–75.5) | 69.7 (51.0–75.4) |
| AUC | 76.7 (61.0–82.7) | 76.0 (56.8–82.1) | 75.7 (56.1–82.1) |
Medians and 95% CIs are reported, based on 1000 bootstrap realizations using confirmed, probable, and suspected cases.Abbreviations: AUC, area under the receiver operating characteristic curve; BRT, boosted regression tree; CI, confidence interval; PCC, percentage correctly classified.
aThe full BRT model used all 43 candidate predictors.
bThe simplified BRT model used the 24 predictors retained after model simplification.
cThe minimal BRT model used the 10 most important predictors.