| Literature DB >> 30110326 |
Aukje L Kreuger1,2, Rutger A Middelburg1,2, Erik A M Beckers3, Karen M K de Vooght4, Jaap Jan Zwaginga1,5, Jean-Louis H Kerkhoffs1,6, Johanna G van der Bom1,2.
Abstract
INTRODUCTION: Electronic health care data offers the opportunity to study rare events, although detecting these events in large datasets remains difficult. We aimed to develop a model to identify leukemia patients with major hemorrhages within routinely recorded health records.Entities:
Mesh:
Year: 2018 PMID: 30110326 PMCID: PMC6093651 DOI: 10.1371/journal.pone.0200655
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Patient characteristics.
| Complete derivation cohort | Sample derivation cohort | |
|---|---|---|
| 255 | 149 | |
| 154 (60.4) | 87 (58.4) | |
| 56.9 (44.3–65.4) | 58.4 (44.9–67.2) | |
| AML (%) | 189 (74.1) | 113 (75.8) |
| RAEB (%) | 20 (7.8) | 11 (7.4) |
| ALL (%) | 46 (18.0) | 25 (16.8) |
| 1319 | 265 | |
| 1 (1–23) | 25 (2–35) | |
| 10,638 | 353 | |
| CT-scan (%) | 75 (0.7) | 75 (21.3) |
| Hemoglobin drop | ||
| >0.8 to 1.6g/dl (%) | 572 (5.4) | 42 (11.9) |
| >1.6 to 1.9 g/dl (%) | 29 (0.3) | 20 (5.7) |
| ≥1.9 to 2.2 g/dl (%) | 49 (0.5) | 22 (6.2) |
| ≥2.2 to 2.8 g/dl (%) | 18 (0.2) | 18 (5.1) |
| ≥2.8 g/dl (%) | 13 (0.1) | 13 (3.7) |
| Transfusion need | ||
| 2 products (%) | 1,270 (11.9) | 50 (14.2) |
| 3 products (%) | 1,126 (10.6) | 43 (12.2) |
| 4 products (%) | 418 (3.9) | 40 (11.3) |
| 5 products (%) | 156 (1.5) | 31 (8.8) |
| ≥ 6 products (%) | 136 (1.3) | 31 (8.8) |
| Control (%) | 7216 (67.8) | 90 (25.5) |
Univariable predictive capacity for major hemorrhage for CT-scan of the brain and several cut-off values of hemoglobin drop and transfusion need.
| Variables | Sensitivity in % (CI) | Specificity in % (CI) | Positive predictive value in % (CI) | Negative predictive value in % (CI) | C-statistic (CI) |
|---|---|---|---|---|---|
| CT-scan brain | 43.5 (23.2; 65.5) | 99.4 (99.2; 99.5) | 13.3 (6.6; 23.2) | 99.9 (99.8; 99.9) | 0.714 (0.611; 0.818) |
| Hemoglobin drop | |||||
| >0.8 g/dl | 73.9 (51.6; 89.8) | 94.5 (94.0; 94.9) | 2.9 (1.7; 4.5) | 99.9 (99.9; 100) | 0.842 (0.750; 0.934) |
| ≥1.6 g/dl | 47.8 (26.8; 69.4) | 99.2 (99.0; 99.4) | 11.8 (6.1; 20.2) | 99.9 (99.8; 99.9) | 0.735 (0.631; 0.840) |
| ≥2.0 g/dl | 34.8 (16.4; 57.3) | 99.4 (99.2; 99.5) | 11.1 (4.9; 20.7) | 99.9 (99.8; 99.9) | 0.671 (0.571; 0.770) |
| ≥2.4 g/dl | 26.1 (10.2; 48.4) | 99.8 (99.6; 99.8) | 19.4 (7.5; 37.5) | 99.8 (99.7; 99.9) | 0.629 (0.537; 0.721) |
| ≥2.8 g/dl | 21.7 (7.5; 43.7) | 99.9 (99.8; 100) | 38.5 (13.9; 68.4) | 99.8 (99.7; 99.9) | 0.608 (0.522;0.694) |
| Transfusion need | |||||
| 2 products | 13.0 (2.8; 33.6) | 88.0 (87.4; 88.7) | 0.2 (0.05; 0.7) | 99.8 (99.7; 99.9) | 0.505 (0.435; 0.576) |
| 3 products | 4.4 (0.1; 21.9) | 89.3 (88.7; 89.9) | 0.1 (0.0; 0.5) | 99.8 (99.6; 99.9) | 0.468 (0.426; 0.511) |
| 4 products | 26.1 (10.2; 48.4) | 96.6 (96.2; 96.9) | 1.7 (0.6; 3.6) | 99.8 (99.7; 99.9) | 0.613 (0.522; 0.705) |
| 5 products | 4.4 (0.1; 21.9) | 98.6 (98.4; 98.8) | 0.7 (0.0; 3.7) | 99.8 (99.7; 99.9) | 0.515 (0.472; 0.557) |
| ≥ 6 products | 43.5 (23.2; 65.5) | 98.9 (98.7; 99.1) | 8.1 (4.0; 14.4) | 99.9 (99.8; 99.9) | 0.712 (0.608; 0.816) |
Fig 1ROC curve of the model in the derivation cohort.
AUC for major hemorrhages was 0.988 (0.981: 0.995), for bleedings of all severity 0.545 (0.533: 0.557). The depicted results are derived from the sample and extrapolated to the entire cohort.
Characteristics and performance of the model in the derivation cohort.
| Predicted risk | CT | Hb | Tx | Sensitivity in % (CI) | Specificity in % (CI) | Positive predictive value in % (CI) | Negative predictive value in % (CI) | Days needed to screen | False negatives |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 0 | 100 (85.2; 100) | 0 (0; 0.04) | 0.2 (0.1; 0.3) | N/A | 454.5 | 0 | |
| 0 | + | 0 | 100 (85.2; 100) | 93.1 (92.6; 93.5) | 3.1 (2.0; 4.6) | 100 (100; 100) | 34.7 | 0 | |
| 0 | 0 | + | 78.3 (56.3; 92.5) | 98.3 (98.1; 98.6) | 9.3 (5.6; 14.3) | 99.9 (99.9; 100) | 11.0 | 5 | |
| + | 0 | 0 | 78.3 (56.3; 92.5) | 99.2 (99.1; 99.4) | 18.4 (11.3; 27.5) | 99.9 (99.9; 100) | 5.5 | 5 | |
| 0 | + | + | 52.2 (30.6; 73.2) | 99.7 (99.6; 99.8) | 27.9 (15.3; 43.7) | 99.9 (99.8; 99.9) | 3.6 | 11 | |
| + | + | 0 | 17.5 (5.0; 38.8) | 99.9 (99.8; 99.9) | 20.0 (5.7; 43.7) | 99.8 (99.7; 99.9) | 5.1 | 19 | |
| + | 0 | + | 8.7 (1.1; 28.0) | 100 (99.9; 100) | 50.0 (6.8; 93.2) | 99.8 (99.7; 99.9) | 2.0 | 21 | |
| + | + | + | 8.7 (1.1; 28.0) | 100 (100; 100) | 100 (15.8; 100) | 99.8 (99.7; 99.9) | 1.0 | 21 |
The sample was reweighted according to the distribution of the indicators in the complete cohort. The total number of events in reweighted dataset was 23.* The predicted risks include the risk for a given risk factor or larger risks (the lines below)
†CT: CT scan brain, Hb: hemoglobin, Tx: transfusion.
+ indicates presence and 0 indicates absence of the indicator
‡ Calculated with an incidence of 0.22 per 100 days, which was the incidence in the extrapolated cohort.
§ N/A not applicable, negative predicted value can’t be calculated when all days are screened.
Baseline characteristics validation cohort.
| Validation cohort | Sample validation cohort | |
|---|---|---|
| Patients | 436 | 294 |
| Male gender (%) | 256 (58.7) | 174 (59.2) |
| Age in years, median (IQR) | 57.7 (46.0–65.5) | 56.7 (40.5–65.4) |
| Diagnosis | ||
| AML (%) | 325 (74.5) | 216 (73.5) |
| RAEB (%) | 28 (6.4) | 21 (7.1) |
| ALL (%) | 83 (19.0) | 55 (18.7) |
| Hospital admissions (n) | 1,276 | 458 |
| Length of hospital stay, median (IQR) | 17 (2–32.5) | 27 (10–37) |
| Observation days | 19,188 | 599 |
| CT-scan (%) | 110 (0.57) | 110 (18.4) |
| Hemoglobin drop | ||
| >0.8 to 1.6g/dl (%) | 1,293 (6.7) | 203 (33.9) |
| >1.6 to 1.9 g/dl (%) | 103 (0.5) | 14 (2.3) |
| ≥1.9 to 2.2 g/dl (%) | 145 (0.8) | 25 (4.2) |
| ≥2.2 to 2.8 g/dl (%) | 89 (0.5) | 11 (1.8) |
| ≥2.8 g/dl (%) | 45 (0.2) | 11 (1.8) |
| Transfusion need | ||
| 2 products (%) | 1,159 (60) | 81 (13.5) |
| 3 products (%) | 1,040 (5.4) | 50 (8.4) |
| 4 products (%) | 656 (3.4) | 14 (2.3) |
| 5 products (%) | 147 (0.8) | 7 (1.2) |
| ≥ 6 products (%) | 51 (0.3) | 92 (15.4) |
| Control (%) | 56 (0.3) | 400 (66.8) |
Performance of the model in the external validation cohort.
| Predicted risk | Sensitivity in % (CI) | Specificity in % (CI) | Positive predictive value in % (CI) | Negative predictive value in % (CI) | Days needed to screen | False negatives |
|---|---|---|---|---|---|---|
| 100 (95.8; 100) | 0 (0; 0.02) | 0.5 (0.4; 0.6) | N/A | 217.4 | 0 | |
| 100 (95.8; 100) | 90.7 (90.2; 91.1) | 4.7 (3.8; 5.7) | 100 (100; 100) | 23.6 | 0 | |
| 54.0 (43.0; 64.8) | 99.2 (99.1; 99.3) | 24.4 (18.5; 31.0) | 99.8 (99.7; 99.8) | 4.2 | 40 | |
| 41.4 (30.9; 52.4) | 99.4 (99.3;99.5) | 24.3 (17.7; 32.1) | 99.7 (99.6; 99.8) | 4.2 | 51 | |
| 29.9 (20.5; 40.6) | 99.8 (99.7; 99.9) | 41.9 (29.5; 55.2) | 99.7 (99.6; 99.8) | 2.4 | 61 | |
| 8.1 (3.3; 15.9) | 99.9 (99.9; 99.9) | 29.2 (12.6; 51.1) | 99.6 (99.5; 99.7) | 3.5 | 80 | |
| 3.5 (0.7; 9.8) | 100 (100; 100) | 100 (29.2; 100) | 99.6 (99.5; 99.6) | 1 | 84 | |
| 2.3 (0.3; 8.1) | 100 (100; 100) | 100 (15.8; 100) | 99.6 (99.4; 99.6) | 1 | 85 |
The sample was reweighted according to the distribution of the indicators in the complete cohort. The total number of events in the reweighted dataset was 87.
* Calculated with an incidence of 0.46 per 100 days, which was the incidence in the extrapolated cohort.
† N/A not applicable, negative predicted value can’t be calculated when all days are screened.
Fig 2ROC curve for major hemorrhages and all bleedings in the external validation cohort.
AUC for major hemorrhages was 0.975 (0.970: 0.980), for bleedings of all severity 0.557 (0.544: 0.569). The depicted results are derived from the sample and extrapolated to the entire cohort.