| Literature DB >> 33027237 |
Frank Lien1,2, Hsin-Yao Wang2,3, Jang-Jih Lu2, Ying-Hao Wen2,3, Tzong-Shi Chiueh2,3,4.
Abstract
BACKGROUND: Clinical laboratories have traditionally used a single critical value for thrombocytopenic events. This system, however, could lead to inaccuracies and inefficiencies, causing alarm fatigue and compromised patient safety.Entities:
Mesh:
Year: 2021 PMID: 33027237 PMCID: PMC7993911 DOI: 10.1097/MLR.0000000000001421
Source DB: PubMed Journal: Med Care ISSN: 0025-7079 Impact factor: 3.178
FIGURE 1Study population and research framework.
Patient Characteristics and 2-Day Mortality of Thrombocytopenic Events With Platelet Value ≤50,000/mL
| Group | A | B | C | D | E |
|---|---|---|---|---|---|
| No. laboratory tests | 1106 | 2219 | 2894 | 3767 | 4328 |
| No. patients | 519 | 1008 | 1291 | 1708 | 2149 |
| Age (mean±SD) | 58.4±15.7 | 58.0±14.4 | 58.6±14.2 | 59.3±13.8 | 59.6±14.0 |
| Sex | |||||
| Male | 612 | 1293 | 1754 | 2305 | 2660 |
| Female | 494 | 926 | 1140 | 1462 | 1668 |
| 2-d mortality (%) | 210 (18.99) | 335 (15.10) | 324 (11.20) | 309 (8.20) | 343 (7.93) |
The 5 groups by platelet count are: A=≤10,000, B=10,001–20,000, C=20,001–30,000, D=30,001–40,000, and E=40,001–50,000/mL.
A patient might take multiple tests.
Performance Metrics of Machine Learning Models
| Model | Dataset | Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) | TPP (n) | AUC |
|---|---|---|---|---|---|---|---|
| LR | Training* | 7.2±0.7 | 99.6±0.1 | 50.2±7.4 | 95.4±0.2 | 73.6±16.5 | 0.842±0.011 |
| Testing | 9.7 | 99.6 | 53.3 | 95.7 | 107 | 0.836 | |
| RF | Training | 15.0±0.8 | 99.2±0.1 | 50.3±2.6 | 95.7±0.2 | 149.6±13.0 | 0.859±0.007 |
| Testing | 15.1 | 99.1 | 46.1 | 96.0 | 193 | 0.848 | |
| ANN | Training | 18.8±10.2 | 98.9±0.8 | 50.0±7.3 | 95.9±0.6 | 202.8±121.6 | 0.867±0.005 |
| Testing | 3.6 | 99.9 | 53.8 | 95.5 | 39 | 0.845 | |
| SGD | Training | 4.2±2.5 | 99.8±0.2 | 50.0±7.2 | 95.2±0.4 | 43.0±26.3 | 0.826±0.010 |
| Testing | 2.2 | 100.0 | 72.2 | 95.4 | 18 | 0.819 | |
| NB | Training | 8.5±1.3 | 98.9±0.3 | 28.4±3.4 | 95.4±0.2 | 151.8±33.5 | 0.824±0.011 |
| Testing | 8.3 | 98.5 | 21.0 | 95.6 | 223 | 0.800 | |
| SVM | Training | 0.1±0.1 | 100±0.0 | 16.7±21.1 | 95.1±0.3 | 1.6±0.8 | 0.755±0.010 |
| Testing | 0.2 | 100 | 25 | 95.3 | 4 | 0.729 | |
| DT | Training | 24.2±1.5 | 95.0±0.3 | 20.1±0.9 | 96.0±0.3 | 600.4±37.0 | 0.596±0.008 |
| Testing | 24.9 | 94.6 | 18.5 | 96.3 | 793 | 0.598 |
Data are presented as mean±SD unless indicated otherwise.
Performance metrics are obtained from 5-fold cross-validation.
ANN indicates artificial neural network; AUC, area under the receiver operating characteristic curve; DT, decision tree; LR, logistic regression; NB, naive Bayes; NPV, negative predictive value; PPV, positive predictive value; RF, random forest; SGD, stochastic gradient descent; SVM, support vector machine; TPP (n), total positive prediction number.
FIGURE 2Performance of critical notification of machine learning methods (random forest model) and traditional notification system.
FIGURE 3Feature importance of the final random forest model. MCH indicates mean corpuscular hemoglobin; MCHC, mean corpuscular hemoglobin concentration; MCV, mean corpuscular volume; nRBC, nucleated red blood cell; RBC, red blood cell; RDW, red cell distribution width; WBC, white blood cell.