| Literature DB >> 36246101 |
Xingyu Zhou1,2, Xianying Li1, Zijun Zhang1, Qinrong Han1, Huijiao Deng1, Yi Jiang1, Chunxiao Tang1, Lin Yang2,1.
Abstract
Cardiovascular disease is currently one of the most important diseases causing death in China and the world, and acute myocardial infarction is a major cause of cardiovascular disease. This study provides an analytical technique for predicting the prognosis of patients with severe acute myocardial infarction using a support vector machine (SVM) technique based on information gleaned from electronic medical records in the Medical Information Marketplace for Intensive Care (MIMIC)-III database. The MIMIC-III database provided 4785 electronic medical records data for inclusion in the model development after screening 7070 electronic medical records of patients admitted to the intensive care unit for treatment of acute myocardial infarction. Adopting the APS-III score as the criterion for identifying anticipated risk, the dimensions of data information incorporated into the mathematical model design were found using correlation coefficient matrix heatmaps and ordered logistic analysis. An automated prognostic risk-prediction model was developed using SVM, and the fit was evaluated by 5× cross-validation. We used a grid search method to further optimize the parameters and improve the model fit. The excellent generalization ability of SVM was fully verified by calculating the 95% confidence interval of the area under the receiver operating characteristic curve (AUC) for six algorithms (linear discriminant, tree, Kernel Naive Bayes, RUSBoost, KNN, and SVM). Compared to the remaining five models, its confidence interval was the narrowest with higher fitting accuracy and better performance. The patient prognostic risk prediction model constructed using SVM had a relatively impressive accuracy (92.2%) and AUC value (0.98). In this study, a model was designed for fitting that can maximize the potential information to be gleaned in the electronic medical records data. It was demonstrated that SVM models based on electronic medical records data can offer an effective solution for clinical disease prognostic risk assessment and improved clinical outcomes and have great potential for clinical application in the clinical treatment of myocardial infarction.Entities:
Keywords: ICU; cardiology; electronic medical records; machine learning; support vector machine
Year: 2022 PMID: 36246101 PMCID: PMC9558165 DOI: 10.3389/fphys.2022.991990
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.755
FIGURE 1The process of acquiring data from a database and constructing a predictive model.
FIGURE 2APS-III scale scores for patients included in the study.
FIGURE 3Thermalmatrix diagram of correlation coefficients for each feature. Description of the abbreviations in Figure 3: K (blood potassium), GLU (blood glucose), TIME (length of hospital stay), PLT (platelets), TC (total cholesterol), TnI (troponin I), TnT (troponin T), LDL (low-density lipoprotein), Nt. proBNP (N-terminal prenatriuretic peptide), CRP (C-reactive protein), CK. MB (creatine kinase isoenzyme), CK (creatine kinase), ICU (patient’s time in ICU), HEIGHT (patient’s height), WEIGHT (patient’s weight), HDL (high-density lipoprotein), APC (total aspirin dose), TG (triglycerides), E.coli (number of Escherichia coli flora), Atorvastatin (total atorvastatin dose), NS (total bacteriocin does), TNG (total nitroglycerin dose), SP (Streptococcus pneumoniae).
FIGURE 4SVM schematic.
FIGURE 5Schematic diagram of the principle of structural risk minimization.
FIGURE 6ROC curves for the models.
FIGURE 7Visual overview of the AUC and 95% CI values for each model.
Conclusion of DeLong test of SVM with other five classifiers.
| Classifier | Z-value |
|
|---|---|---|
| KNN | 16.536 | <2.2e-16 |
| RUBoost | 34.198 | <2.2e-16 |
| Naïve Bayes | 10.448 | <2.2e-16 |
| Tree | 9.0918 | <2.2e-16 |
| Liner Discriminant | 28.143 | <2.2e-16 |