| Literature DB >> 33084589 |
Feng Xie1, Bibhas Chakraborty1,2,3, Marcus Eng Hock Ong1,4,5, Benjamin Alan Goldstein1,3, Nan Liu1,5,6.
Abstract
BACKGROUND: Risk scores can be useful in clinical risk stratification and accurate allocations of medical resources, helping health providers improve patient care. Point-based scores are more understandable and explainable than other complex models and are now widely used in clinical decision making. However, the development of the risk scoring model is nontrivial and has not yet been systematically presented, with few studies investigating methods of clinical score generation using electronic health records.Entities:
Keywords: clinical decision making; clinical prediction rule; electronic health records; machine learning; prognosis
Year: 2020 PMID: 33084589 PMCID: PMC7641783 DOI: 10.2196/21798
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Flowchart of the AutoScore framework. ROC: receiver operating characteristic.
Figure 2Flowchart of the study cohort formation. BIDMC: Beth Israel Deaconess Medical Center.
Description of the study cohort (N=44,918).
| Variables | All episodes (N=44,918) | Live discharged (n=40,960) | Inpatient mortality (n=3958) | |||
| Age (years), mean (SD) | 62.5 (16.5) | 62.0 (16.6) | 68.5 (14.7) | <.001 | ||
|
| .04 | |||||
|
| Male | 25,788 (57.4) | 23,578 (57.6) | 2210 (55.8) |
| |
|
| Female | 19,130 (42.6) | 17,382 (42.4) | 1748 (44.2) |
| |
|
| <.001 | |||||
|
| Emergency | 38,138 (84.9) | 34,339 (83.8) | 3799 (96.0) |
| |
|
| Elective | 6780 (15.1) | 6621 (16.2) | 159 (4.0) |
| |
|
| <.001 | |||||
|
| White | 31,889 (71.0) | 29,148 (71.2) | 2741 (69.3) |
| |
|
| Hispanic | 1625 (3.6) | 1539 (3.8) | 86 (2.2) |
| |
|
| Asian | 1034 (2.3) | 933 (2.3) | 101 (2.6) |
| |
|
| African | 4399 (9.8) | 4110 (10.0) | 289 (7.3) |
| |
|
| Others or unknown | 5971 (13.3) | 5230 (12.8) | 741 (18.7) |
| |
|
| <.001 | |||||
|
| Government | 1326 (3.0) | 1258 (3.1) | 68 (1.7) |
| |
|
| Medicaid | 4176 (9.3) | 3896 (9.5) | 280 (7.1) |
| |
|
| Medicare | 23,878 (53.2) | 21,283 (52.0) | 2595 (65.6) |
| |
|
| Private | 15,031 (33.5) | 14,063 (34.3) | 968 (24.5) |
| |
|
| Self-pay | 507 (1.1) | 460 (1.1) | 47 (1.2) |
| |
|
| <.001 | |||||
|
| CCUb | 6445 (14.3) | 5907 (14.4) | 538 (13.6) |
| |
|
| CSRUc | 8284 (18.4) | 8031 (19.6) | 253 (6.4) |
| |
|
| MICUd | 17,490 (38.9) | 15,420 (37.6) | 2070 (52.3) |
| |
|
| SICUe | 7320 (16.3) | 6649 (16.2) | 671 (17.0) |
| |
|
| TSICUf | 5379 (12.0) | 4953 (12.1) | 426 (10.8) |
| |
| Length of stay (days), mean (SD) | 4.19 (6.11) | 3.87 (5.75) | 7.57 (8.36) | <.001 | ||
aICU: intensive care unit.
bCCU: coronary care unit.
cCSRU: cardiac surgery recovery unit.
dMICU: medical intensive care unit.
eSICU: surgical intensive care unit.
fTSICU: trauma surgical intensive care unit.
Distribution of clinical variables in the study cohort.
| Variables | Values, median (IQR) |
| Age (years) | 64.4 (51.9-75.9) |
| Heart rate (beats/min) | 84.4 (74.5- 95.2) |
| Systolic blood pressure (mm Hg) | 116.7 (107.1-129.5) |
| Diastolic blood pressure (mm Hg) | 60 (53.7-67.4) |
| Mean arterial pressure (mm Hg) | 76.9 (70.7-84.9) |
| Respiration rate (breaths/min) | 18.0 (15.9-20.6) |
| Temperature (°C) | 36.8 (36.5-37.2) |
| Peripheral capillary oxygen saturation (SpO2; %) | 97.6 (96.2-98.7) |
| Glucose (mg/dL) | 129.0 (111.3-154.3) |
| Anion gap (mEq/L) | 13.5 (12-16) |
| Bicarbonate (mmol/L) | 24.0 (21.5-26.5) |
| Creatinine (μmol/L) | 0.95 (0.7-1.4) |
| Chloride (mEq/L) | 105 (101.5-108) |
| Lactate (mmol/L) | 1.8 (1.7-2.0) |
| Hemoglobin (g/dL) | 10.9 (9.6-12.3) |
| Hematocrit (%) | 32.3 (28.8-36.4) |
| Platelet (thousand per microliter) | 208.5 (153.5-276.5) |
| Potassium (mmol/L) | 4.2 (3.8-4.5) |
| Blood urea nitrogen (mg/dL) | 18.0 (12.5-29.5) |
| Sodium (mmol/L) | 138.5 (136-140.5) |
| White blood cells (thousand per microliter) | 10.7 (8.0-14.3) |
Selected variables by AutoScore and other baseline models.
| Variables | Stepwise | LASSO | AutoScore ( | AutoScore ( |
| Age (years) | ✔b | ✔ | ✔ | ✔ |
| Ethnicity | ✔ | ✔ | —c | — |
| Insurance | ✔ | ✔ | — | — |
| Gender | — | — | — | — |
| Heart rate | ✔ | ✔ | ✔ | ✔ |
| Systolic blood pressure | ✔ | ✔ | ✔ | ✔ |
| Diastolic blood pressure | ✔ | — | — | — |
| Mean arterial pressure | ✔ | ✔ | — | — |
| Respiration rate | ✔ | ✔ | ✔ | ✔ |
| Temperature | ✔ | ✔ | ✔ | ✔ |
| SpO2d | ✔ | ✔ | ✔ | ✔ |
| Glucose | ✔ | ✔ | ✔ | — |
| Anion gap | ✔ | — | — | — |
| Bicarbonate | ✔ | ✔ | ✔ | — |
| Creatinine | ✔ | — | — | — |
| Chloride | ✔ | ✔ | — | — |
| Hematocrit | ✔ | ✔ | — | — |
| Hemoglobin | ✔ | — | — | — |
| Lactate | ✔ | ✔ | ✔ | ✔ |
| Platelet | ✔ | ✔ | ✔ | ✔ |
| Potassium | ✔ | ✔ | — | — |
| BUNe | ✔ | — | ✔ | ✔ |
| Sodium | — | ✔ | — | — |
| White blood cells | ✔ | — | ✔ | — |
aParameter m is the number of variables included in the AutoScore model.
bTick mark represents that this variable is included by the corresponding method.
cThis variable is not included by the corresponding method.
dSpO2: peripheral capillary oxygen saturation.
eBUN: blood urea nitrogen.
Figure 3Model performance versus complexity for the implementation of the AutoScore on (a) the validation set and (b) the test set. The area under the curve reflects the discrimination performance, whereas the number of variables represents the complexity of the model.
A nine-variable AutoScore-created scoring model for inpatient mortality.
| Variables and intervala | Point | |
|
| ||
|
| <30 | 0 |
|
| 30-48 | 5 |
|
| 48-78 | 14 |
|
| 78-85 | 22 |
|
| ≥85 | 24 |
|
| ||
|
| <62 | 1 |
|
| 62-72 | 0 |
|
| 72-98 | 1 |
|
| 98-112 | 8 |
|
| ≥112 | 13 |
|
| ||
|
| <12 | 3 |
|
| 12-16 | 0 |
|
| 16-22 | 4 |
|
| ≥22 | 12 |
|
| ||
|
| <90 | 15 |
|
| 90-100 | 8 |
|
| 100-130 | 0 |
|
| 130-150 | 1 |
|
| ≥150 | 3 |
|
| ||
|
| <36 | 12 |
|
| 36-36.5 | 3 |
|
| 36.5-37.5 | 0 |
|
| 37.5-38 | 5 |
|
| ≥38 | 9 |
|
| ||
|
| <85 | 25 |
|
| 85-90 | 13 |
|
| 90-95 | 4 |
|
| ≥95 | 0 |
|
| ||
|
| <80 | 17 |
|
| 80-150 | 3 |
|
| 150-300 | 0 |
|
| 300-450 | 3 |
|
| ≥450 | 5 |
|
| ||
|
| <7.5 | 0 |
|
| 7.5-12 | 2 |
|
| 12-35 | 9 |
|
| 35-70 | 19 |
|
| ≥70 | 23 |
|
| ||
|
| <1 | 0 |
|
| 1-2.5 | 2 |
|
| 2.5-4 | 8 |
|
| ≥4 | 21 |
aInterval (q1-q2) represents q1 ≤x
bSpO2: peripheral capillary oxygen saturation.
Figure 4(a) Number of cases and (b) observed mortality rate, versus different score intervals obtained by the nine-variable AutoScore model.
Performance of the AutoScore and other baseline models.
| Methods, AUCa (95% CI) |
| Threshold | Sensitivity (%), 95% CI | Specificity (%), 95% CI | PPVc (%), 95% CI | NPVd (%), 95% CI | |
|
| |||||||
|
| 0.780 (0.764-0.798) | 9 | 48e | 63.7 (60.3-67.1) | 77.2 (76.3-78.2) | 20.9 (19.8-22.0) | 95.8 (95.4-96.1) |
|
| N/Af | N/A | 30g | 95.7 (94.3-97.2) | 25.1 (24.2-26.0) | 10.8 (10.6-10.9) | 98.4 (97.9-98.9) |
|
| N/A | N/A | 64h | 28.8 (25.7-32.0) | 95.5 (95.0-95.9) | 37.6 (34.2-41.0) | 93.4 (93.2-93.7) |
|
| |||||||
|
| 0.789 (0.773-0.802) | 12 | 130e | 71.7 (68.5-74.7) | 71.7 (70.7-72.7) | 19.3 (18.4-20.1) | 96.4 (96.0-96.8) |
|
| N/A | N/A | 95g | 93.7 (92.0-95.3) | 34.5 (33.4-35.6) | 11.9 (11.6-12.1) | 98.3 (97.8-98.7) |
|
| N/A | N/A | 180h | 32.0 (28.8-35.3) | 94.8 (94.3-95.2) | 36.6 (33.4-39.9) | 93.7 (93.4-94.0) |
|
| |||||||
|
| 0.778 (0.760-0.795) | 24 | 0.085e | 68.6 (65.4-71.8) | 72.8 (71.8-73.7) | 19.2 (18.3-20.1) | 96.1 (95.7-96.5) |
|
| N/A | N/A | 0.028g | 95.2 (93.5-96.6) | 25.3 (24.4-26.3) | 10.7 (10.5-10.9) | 98.3 (97.7-98.8) |
|
| N/A | N/A | 0.24h | 27.9 (24.5-31.3) | 95.1 (94.7-95.6) | 35.0 (31.7-38.6) | 93.3 (93.0-93.6) |
|
| |||||||
|
| 0.778 (0.760-0.795) | 22 | 0.096e | 65.0 (61.6-68.5) | 76.9 (76.0-77.8) | 21.0 (19.9-22.0) | 95.9 (95.5-96.3) |
|
| N/A | N/A | 0.028g | 95.1 (93.5-96.5) | 25.0 (24.1-26.1) | 10.7 (10.5-10.9) | 98.2 (97.6-98.7) |
|
| N/A | N/A | 0.24h | 28.4 (25.1-31.7) | 95.2 (94.7-95.6) | 35.7 (32.2-39.1) | 93.4 (93.1-93.7) |
|
| |||||||
|
| 0.772 (0.755-0.790) | 17 | –2.47e | 73.4 (70.2-76.4) | 68.1 (67.1-69.2) | 17.8 (17.0-18.6) | 96.4 (96.0-96.8) |
|
| N/A | N/A | –3.34g | 95.2 (93.7-96.5) | 25.1 (24.1-26.1) | 10.7 (10.5-10.9) | 98.2 (97.7-98.7) |
| N/A | N/A | –1.27h | 28.4 (25.2-31.8) | 95.2 (94.7-95.7) | 36.0 (32.6-39.5) | 93.4 (93.1-93.7) | |
|
| |||||||
|
| 0.785 (0.768-0.801) | 9 | 0.085e | 74.2 (71.1-77.0) | 69.4 (68.4-70.4) | 18.6 (17.8-19.4) | 96.6 (96.2-97.0) |
|
| N/A | N/A | 0.015g | 94.2 (92.5-95.7) | 30.1 (29.1-31.1) | 11.3 (11.1-11.5) | 98.2 (97.7-98.7) |
|
| N/A | N/A | 0.3h | 30.5 (27.4-34.0) | 94.8 (94.4-95.3) | 35.7 (32.5-39.0) | 93.5 (93.3-93.8) |
|
| |||||||
|
| 0.809 (0.794-0.825) | 24 | 0.115e | 73.1 (69.9-76.2) | 75.4 (74.5-76.3) | 21.9 (20.9-22.9) | 96.8 (96.4-97.1) |
|
| N/A | N/A | 0.025g | 94.4 (92.8-95.9) | 37.9 (36.9-38.9) | 12.5 (12.3-12.8) | 98.6 (98.2-99.0) |
|
| N/A | N/A | 0.285h | 34.1 (30.6-37.5) | 95.1 (94.6-95.5) | 39.4 (36.2-42.9) | 93.9 (93.6-94.2) |
aAUC: the area under the ROC curve.
bNumber of variables in the model.
cPPV: positive predictive value.
dNPV: negative predictive value.
eOptimal cutoff values, defined as the points nearest to the upper-left corner of the ROC curves.
fN/A: not applicable.
gCutoff values by which the sensitivity could reach about 95%.
hCutoff values by which the specificity could reach about 95%.
iLASSO: least absolute shrinkage and selection operator.
jAutoScore-based variable selection was implemented beforehand, where the same set of variables were selected as the AutoScore (m=9).
Figure 5Calibration belts (at 80% and 95% confidence levels) for (a) a nine-variable AutoScore-created model, (b) a 12-variable AutoScore-created model, (c) a full logistic regression model, (d) a stepwise regression model, (e) the LASSO model, (f) a nine-variable random forest model, and (g) a full random forest model.