| Literature DB >> 33928796 |
Li Luo1, Ran Kou1, Yuquan Feng1, Jie Xiang1, Wei Zhu2.
Abstract
In order to overcome the shortage of the current costly DVT diagnosis and reduce the waste of valuable healthcare resources, we proposed a new diagnostic approach based on machine learning pre-test prediction models using EHRs. We examined the sociodemographic and clinical factors in the prediction of DVT with 518 NICU admitted patients, including 189 patients who eventually developed DVT. We used cross-validation on the training data to determine the optimal parameters, and finally, the applied ROC analysis is adopted to evaluate the predictive strength of each model. Two models (GLM and SVM) with the strongest ROC were selected for DVT prediction, based on which, we optimized the current intervention and diagnostic process of DVT and examined the performance of the proposed approach through simulations. The use of machine learning based pre-test prediction models can simplify and improve the intervention and diagnostic process of patients in NICU with suspected DVT, and reduce the valuable healthcare resource occupation/usage and medical costs.Entities:
Keywords: deep vein thrombosis; economic consideration; electronic health records; machine learning; neurological ICU; risk factors
Year: 2021 PMID: 33928796 PMCID: PMC8114755 DOI: 10.1177/10760296211008650
Source DB: PubMed Journal: Clin Appl Thromb Hemost ISSN: 1076-0296 Impact factor: 2.389
Figure 1.Research methodology framework.
Figure 2.Flowchart of the study subjects.
Patient Demographic Details and Other Factors for DVT.
| Overall | No DVT | DVT | ||
|---|---|---|---|---|
| Factors | (518) | (329) | (189) |
|
| Surgery times (mean (SD)) | 0.86 (0.35) | 0.89 (0.31) | 0.81 (0.39) | .01 |
| Age (mean (SD)) | 52.37 (17.35) | 51.77 (17.60) | 53.42 (16.90) | .297 |
| LOS (mean (SD)) | 22.55 (24.03) | 22.00 (26.50) | 23.50 (19.01) | .495 |
| Gender = M (%) | 275 (53.1) | 169 (51.4) | 106 (56.1) | .345 |
| Cost type (%) | .002 | |||
| Cash | 248 (47.9) | 177 (53.8) | 71 (37.6) | |
| Insurance | 217 (41.9) | 121 (36.8) | 96 (50.8) | |
| Others | 53 (10.2) | 31 (9.4) | 22 (11.6) | |
| Marriage status (%) | .014 | |||
| Divorced | 11 (2.1) | 6 (1.8) | 5 (2.6) | |
| Married | 427 (82.4) | 263 (79.9) | 164 (86.8) | |
| Single | 54 (10.4) | 45 (13.7) | 9 (4.8) | |
| Widowed | 26 (5.0) | 15 (4.6) | 11 (5.8) | |
| Job status (%) | .007 | |||
| Labor | 146 (28.2) | 100 (30.4) | 46 (24.3) | |
| Management | 17 (3.3) | 8 (2.4) | 9 (4.8) | |
| Office | 43 (8.3) | 29 (8.8) | 14 (7.4) | |
| Others | 213 (41.1) | 134 (40.7) | 79 (41.8) | |
| Retired | 55 (10.6) | 24 (7.3) | 31 (16.4) | |
| Student | 20 (3.9) | 17 (5.2) | 3 (1.6) | |
| Unemployed | 24 (4.6) | 17 (5.2) | 7 (3.7) | |
| Race ethnicity (%) | .037 | |||
| Han | 472 (91.1) | 308 (93.6) | 164 (86.8) | |
| Others | 10 (1.9) | 6 (1.8) | 4 (2.1) | |
| Yi | 10 (1.9) | 5 (1.5) | 5 (2.6) | |
| Zang | 26 (5.0) | 10 (3.0) | 16 (8.5) | |
| Pay type (%) | .001 | |||
| Medical insurance | 124 (23.9) | 60 (18.2) | 64 (33.9) | |
| Others | 349 (67.4) | 240 (72.9) | 109 (57.7) | |
| Self-paid | 35 (6.8) | 22 (6.7) | 13 (6.9) | |
| Social insurance | 10 (1.9) | 7 (2.1) | 3 (1.6) | |
| Admission type (%) | .012 | |||
| Emergency | 337 (65.1) | 202 (61.4) | 135 (71.4) | |
| Others | 18 (3.5) | 9 (2.7) | 9 (4.8) | |
| Outpatient | 163 (31.5) | 118 (35.9) | 45 (23.8) | |
| If transferred = T (%) | 99 (19.1) | 60 (18.2) | 39 (20.6) | .581 |
| Rehospitalization = T (%) | 54 (10.4) | 33 (10.0) | 21 (11.1) | .812 |
| Admission times (mean (SD)) | 1.39 (1.65) | 1.45 (1.88) | 1.30 (1.14) | .317 |
Laboratory Predictors of Deep Vein Thrombosis.
| Overall | No DVT | DVT | ||
|---|---|---|---|---|
| Category | (518) | (329) | (189) |
|
| Coagulation-Prothrombin time | 13.07 (2.20) | 12.97 (2.29) | 13.26 (2.04) | 0.155 |
| Coagulation-ISR | 1.12 (0.20) | 1.11 (0.20) | 1.13 (0.19) | 0.188 |
| Coagulation-Activated partial thromboplastin time | 32.50 (11.20) | 32.58 (11.67) | 32.35 (10.37) | 0.82 |
| Coagulation-Thrombin time | 20.04 (10.28) | 20.22 (9.57) | 19.73 (11.43) | 0.607 |
| Coagulation-Fibrinogen | 3.02 (1.49) | 2.80 (1.42) | 3.40 (1.54) | <0.001 |
| Coagulation-Thromboplastin time ratio | 1.17 (0.40) | 1.17 (0.42) | 1.16 (0.37) | 0.861 |
| Blood-Red blood cell count | 3.77 (0.76) | 3.84 (0.76) | 3.63 (0.75) | 0.002 |
| Blood-Hemoglobin | 112.26 (23.47) | 114.49 (23.46) | 108.40 (23.04) | 0.004 |
| Blood-Platelet count | 158.19 (75.27) | 158.77 (71.10) | 157.19 (82.22) | 0.818 |
| Blood-White cell count | 10.72 (4.97) | 10.21 (4.53) | 11.61 (5.54) | 0.002 |
| Blood-Percentage of neutrophils | 82.71 (10.78) | 81.33 (11.76) | 85.13 (8.32) | <0.001 |
| Blood-Percentage of Lymphocytes | 11.63 (8.77) | 13.09 (9.64) | 9.09 (6.28) | <0.001 |
| Blood-Percentage of eosinophils | 0.62 (1.22) | 0.63 (1.11) | 0.60 (1.40) | 0.764 |
| Blood-Percentage of basophils | 0.15 (0.21) | 0.15 (0.18) | 0.15 (0.25) | 0.975 |
| Blood-Hematocrit | 0.34 (0.07) | 0.35 (0.07) | 0.33 (0.07) | 0.035 |
| Blood-Average red blood cell volume | 91.04 (7.43) | 90.38 (7.36) | 92.18 (7.44) | 0.008 |
| Blood-Average red blood cell HGB | 29.91 (2.67) | 29.88 (2.72) | 29.97 (2.58) | 0.703 |
| Blood-Average red blood cell HGB_concentration | 328.62 (14.07) | 330.54 (14.19) | 325.26 (13.24) | <0.001 |
| Blood-Red blood cell distribution width CV | 14.05 (1.88) | 13.93 (1.74) | 14.27 (2.09) | 0.046 |
| Blood-Red blood cell distribution width SD | 45.47 (5.99) | 44.76 (5.37) | 46.69 (6.78) | <0.001 |
| Biochemical-Alanine aminotransferase | 27.15 (40.11) | 27.47 (46.40) | 26.60 (25.85) | 0.811 |
| Biochemical-Aspartate aminotransferase | 36.92 (137.11) | 40.30 (170.66) | 31.02 (28.77) | 0.459 |
| Biochemical-Urea | 5.73 (3.66) | 5.33 (3.06) | 6.43 (4.44) | 0.001 |
| Biochemical-Total bilirubin | 13.88 (7.81) | 13.58 (7.64) | 14.41 (8.08) | 0.245 |
| Biochemical-Direct bilirubin | 6.21 (3.99) | 6.09 (4.04) | 6.43 (3.90) | 0.354 |
| Biochemical-Indirect bilirubin | 7.72 (4.84) | 7.58 (4.84) | 7.96 (4.85) | 0.395 |
| Biochemical-Total protein | 58.68 (9.09) | 59.06 (9.53) | 58.01 (8.26) | 0.204 |
| Biochemical-Albumin | 34.20 (6.67) | 34.93 (6.79) | 32.92 (6.28) | 0.001 |
| Biochemical-Creatinine | 78.65 (83.95) | 74.48 (64.07) | 85.92 (110.16) | 0.135 |
| Biochemical-Glucose | 7.94 (3.15) | 7.70 (3.05) | 8.36 (3.30) | 0.022 |
| Biochemical-Alkaline phosphatase | 70.78 (36.39) | 71.36 (40.18) | 69.78 (28.72) | 0.636 |
| Biochemical-Glutamyl transpeptidase | 40.59 (50.61) | 40.12 (53.61) | 41.40 (45.04) | 0.783 |
| Biochemical-Sodium | 143.45 (7.48) | 143.08 (7.74) | 144.08 (6.98) | 0.144 |
| Biochemical-Potassium | 3.91 (0.54) | 3.94 (0.53) | 3.85 (0.55) | 0.089 |
| Biochemical-Chlorine | 106.57 (8.02) | 106.08 (8.26) | 107.41 (7.53) | 0.069 |
| Biochemical-Globulin | 24.48 (5.16) | 24.13 (5.18) | 25.09 (5.09) | 0.041 |
| Biochemical-White ball ratio | 1.45 (0.39) | 1.50 (0.40) | 1.36 (0.38) | <0.001 |
| Biochemical-Uric acid | 216.71 (119.58) | 221.61 (120.54) | 208.19 (117.72) | 0.219 |
| Biochemical-Triglycerides | 1.45 (1.19) | 1.41 (1.21) | 1.53 (1.16) | 0.272 |
Figure 3.ROC curves using FE-RF feature extraction methods for (A) GLM, (B) Xgboost, (C) RF, and (D) SVM.
Figure 4.Proposed new diagnosis and intervention process of suspected DVT.
Mathematical Notations Summary.
| Notation | Description | |
|---|---|---|
| Parameters |
| The total treatment cost pre-day |
|
| The number of days for treatment | |
|
| The cost of a single ultrasound screening | |
|
| The times of ultrasound screening | |
|
| The success rate for the intervention | |
|
| The cost of intervention pre-day | |
|
| The number of days for intervention | |
|
| The actual current prevalence of DVT | |
| Variables |
| The predictive probability of a patient |
|
| The 2 thresholds for the predictive model |
Estimated Effect of the Current Diagnostic Process and the Proposed One on DVT Screening and Interventions.
| Scenario |
|
| Expected cost pre-person |
|---|---|---|---|
| Actual (current) | ¥6456.2 | ||
| Optimized 0 (D-Dimer) | .51 | .8 | ¥3856.5 |
| Optimized 1 (GLM) | .532 | .626 | ¥3143.3 |
| Optimized 2 (SVM) | .583 | .723 | ¥3272.7 |