| Literature DB >> 35993617 |
Ian A Scott1,2, Nazanin Falconer3,4, Stephen Canaris5, Oscar Bonilla5, Sven Marxen6, Aaron Van Garderen5,6, Michael Barras3,7, Ahmad Abdel-Hafez5,8.
Abstract
BACKGROUND: Unfractionated heparin (UFH) is an anticoagulant drug that is considered a high-risk medication because an excessive dose can cause bleeding, whereas an insufficient dose can lead to a recurrent embolic event. Therapeutic response to the initiation of intravenous UFH is monitored using activated partial thromboplastin time (aPTT) as a measure of blood clotting time. Clinicians iteratively adjust the dose of UFH toward a target, indication-defined therapeutic aPTT range using nomograms, but this process can be imprecise and can take ≥36 hours to achieve the target range. Thus, a more efficient approach is required.Entities:
Keywords: aPTT; activated partial thromboplastin time; heparin; machine learning; personalized medicine; predictive modeling
Year: 2022 PMID: 35993617 PMCID: PMC9531006 DOI: 10.2196/34533
Source DB: PubMed Journal: Interact J Med Res ISSN: 1929-073X
Figure 1Experiment setup including training and validation processes. GCUH: Gold Coast University Hospital; ML: machine learning; CSV: comma separated values.
Figure 2Unfractionated heparin and activated partial thromboplastin time tables blending. aPTT: activated partial thromboplastin time; UFH: unfractionated heparin.
Missing data handling.
| Features | Imputation |
| Baseline aPTTa and baseline aPTT minutes | Missing values and values completed more than 24 hours before UFHb bolus administration were imputed to 30 seconds, whereas baseline aPTT minutes are imputed to 1440 (24 hours) minutes for those records. |
| Weight and height | Encounters with no measurements were imputed to the averages for their age, marital status, and sex (as recorded in the patient electronic health record). |
| Vital Signs features | If the results are missing or occurred more than 12 hours before the target aPTT, they are imputed using centroid values of k-means clustering with k=10. |
| Pathology results | If the results are missing or occurred more than 1 week before the target aPTT, they are imputed using k-means clustering using centroid value with k=10. |
| Waterlow score and ADLsc | Imputed to 0 where missing or older than 1 week before target aPTT. |
aaPPT: activated partial thromboplastin time.
bUFH: unfractionated heparin.
cADL: activity of daily living.
Baseline characteristics of training data set (all records, N=2783).
| Feature | Values | ||
|
| |||
|
| Male | 1898 (68.2) | |
|
| Female | 885 (31.8) | |
|
| |||
|
| ACSa | 818 (29.4) | |
|
| VTEb | 540 (19.4) | |
| Age (years), mean (SD) | 65.8 (14.6) | ||
| Weight (kg), mean (SD) | 87.8 (26.7) | ||
| Baseline aPTTc (seconds), mean (SD) | 36 (11.1) | ||
| UFHd bolus dose (units), mean (SD) | 4713 (1467) | ||
| UFH maintenance (units), mean (SD) | 6767 (4993) | ||
| Time between UFH bolus and aPTT (minutes), mean (SD) | 364.1 (149) | ||
aACS: acute coronary syndrome.
bVTE: venous thromboembolism.
caPTT: activated partial thromboplastin time.
dUFH: unfractionated heparin.
Figure 3Frequencies of target activated partial thromboplastin time results with bin size=20. aPTT: activated partial thromboplastin time.
Top 10 most important features with relative importance scores.
| Feature | Relative importance |
| Minutes between UFHa bolus and aPTTb | 1 |
| UFH bolus time | 0.58 |
| Baseline aPTT | 0.41 |
| Age | 0.37 |
| UFH bolus dose | 0.3 |
| Height | 0.25 |
| UFH maintinance | 0.23 |
| UFH bolus size calculated | 0.22 |
| Weight | 0.21 |
| Size calculated | 0.195 |
aUFH: unfractionated heparin.
baPTT: activated partial thromboplastin time.
Descriptions of contributing models in the final regression ensemble model.
| ID | Model type | Model weight | Fitted features | Feature fraction | Max leaves | Learning rate | Max bins | Lambda L1 | Lambda L2 |
| 0 | LightGBM | 0.3333 | 65 | 0.6 | 16 | 0.01 | 128 | 0 | 0.5 |
| 1 | LightGBM | 0.1042 | 69 | 0.6 | 16 | 0.01 | 128 | 0 | 0.5 |
| 2 | LightGBM | 0.1667 | 95 | 0.8 | 64 | 0.01 | 256 | 0 | 10 |
| 3 | LightGBM | 0.3958 | 134 | 0.4 | 16 | 0.01 | 64 | 0 | 2 |
Performance of regression models for predicting activated partial thromboplastin time results.
| Tool | Model | Mean absolute error | Root mean square error |
|
| H2O DAIb | Final ensemble model |
|
|
|
| H2O DAI | XGBoost | 25.51 | 32.33 | 0.31 |
| SKlearnd | Linear regression | 26.89 | 33.8 | 0.244 |
| SKlearn | Ridge regression | 26.93 | 33.79 | 0.244 |
| SKlearn | Lasso regression | 26.93 | 33.68 | 0.249 |
| SKlearn | Elasticnet regression | 27.15 | 33.72 | 0.247 |
aR2 coefficient of determination.
bDAI: Driverless AI.
cMinimum error rate.
dSKlearn: a machine learning library in Python.
Descriptions of contributing models in the final multiclassification ensemble model.
| ID | Model type | Model weight | Fitted features | Feature fraction | Max leaves | Learning rate | Max bins | Lambda L1 | Lambda L2 |
| 0 | XGBoost | 0.3067 | 1900 | 0.2 | 8 | 0.01 | 128 | 0 | 0.5 |
| 1 | XGBoost | 0.2 | 1914 | 0.5 | 8 | 0.01 | 256 | 0 | 5 |
| 2 | LightGBM | 0.4 | 78 | 0.6 | 16 | 0.01 | 64 | 0 | 0.5 |
| 3 | LightGBM | 0.0933 | 183 | 0.8 | 64 | 0.01 | 256 | 0 | 0 |
Performance of multiclassification models in predicting activated partial thromboplastin time class.
| Tool | Model | Accuracy | Macroprecision | Macrorecall | Macro– | Macro-AUCa |
| H2O DAIb | Final ensemble model |
|
|
|
|
|
| SKlearn | Logistic regression | 0.562 | 0.51 | 0.56 | 0.52 | 0.691 |
| SKlearn | Logistic regression with RFEd | 0.557 | 0.49 | 0.56 | 0.5 | 0.687 |
| SKlearn | SVMe—linear SVCf | 0.535 | 0.51 | 0.54 | 0.517 | 0.679 |
| SKlearn | SVM—polynomial SVC | 0.451 | 0.46 | 0.45 | 0.457 | 0.614 |
aAUC: area under the receiver operating characteristic curve.
bDAI: Driverless AI.
cBest calculated accuracy.
dRFE: recursive feature elimination.
eSVM: support vector machine.
fSVC: support vector classifier.
Figure 4Multiclassification confusion matrix. aPTT: activated partial thromboplastin time.
Figure 5Multiclassification confusion matrix for external validation. aPTT: activated partial thromboplastin time.