| Literature DB >> 35369326 |
Zhihao Chen1, Jixi Shi1,2, Thibaut Pommier3, Yves Cottin3, Michel Salomon1, Thomas Decourselle4, Alain Lalande5,6, Raphaël Couturier1.
Abstract
This study proposes machine learning-based models to automatically evaluate the severity of myocardial infarction (MI) from physiological, clinical, and paraclinical features. Two types of machine learning models are investigated for the MI assessment: the classification models classify the presence of the infarct and the persistent microvascular obstruction (PMO), and the regression models quantify the Percentage of Infarcted Myocardium (PIM) of patients suspected of having an acute MI during their reception in the emergency department. The ground truth labels for these supervised models are derived from the corresponding Delayed Enhancement MRI (DE-MRI) exams and manual annotations of the myocardium and scar tissues. Experiments were conducted on 150 cases and evaluated with cross-validation. Results showed that for the MI (PMO inclusive) and the PMO (infarct exclusive), the best models obtained respectively a mean error of 0.056 and 0.012 for the quantification, and 88.67 and 77.33% for the classification accuracy of the state of the myocardium. The study of the features' importance also revealed that the troponin value had the strongest correlation to the severity of the MI among the 12 selected features. For the proposal's translational perspective, in cardiac emergencies, qualitative and quantitative analysis can be obtained prior to the achievement of MRI by relying only on conventional tests and patient features, thus, providing an objective reference for further treatment by physicians.Entities:
Keywords: DE-MRI; acute myocardial infarction; automatic prediction; clinical characteristics; machine learning
Year: 2022 PMID: 35369326 PMCID: PMC8964399 DOI: 10.3389/fcvm.2022.754609
Source DB: PubMed Journal: Front Cardiovasc Med ISSN: 2297-055X
Figure 1A typical Delayed Enhancement MRI (DE-MRI) slice involving the myocardial infarction (MI) and the persistent microvascular obstruction (PMO). On the left the cropped short-axis DE-MRI covering the whole left ventricle, on the right the corresponding masks of normal myocardium (green), MI (blue, PMO exclusive) and PMO (yellow). The PMO is semi-wrapped by the MI and contacts the cavity.
Characteristics of pathological and non-pathological patients [according to the Delayed Enhancement MRI (DE-MRI)].
|
|
|
|
|
|---|---|---|---|
| Sex | 38 females and 12 males | 23 females and 77 males | 0.000 |
| Age | 66 ± 14 years | 59 ± 12 years | 0.004 |
| Tobacco (yes, no, former smoker) | 18%, 22%, 60% | 44%, 21%, 35% | 0.001 |
| Overweighta | 62% | 53% | 0.296 |
| Arterial hypertension | 58% | 31% | 0.002 |
| Diabetes | 20% | 10% | 0.126 |
| History of coronary artery disease | 4% | 12% | 0.065 |
| ECG (ST elevation) | 30% | 80% | 0.000 |
| Troponin (ng per mL) | 7.68 ± 12.91 | 101.04 ± 101.35 | 0.000 |
| Killip max (1,2,3,4) | 76%, 22%, 2%, 0% | 83%, 12%, 2%, 3% | 0.916 |
| LVEFb (percentage) | 49.62%±13.49% | 47.74%±13.17% | 0.423 |
| NTProBNPc | 2, 136 ± 3, 696 | 1, 314 ± 2, 109 | 0.154 |
.
The proportion of scar tissues among pathological subjects.
|
|
|
|
|---|---|---|
| MI (PMO inclusive) | 0.1825 ± 0.1152 | 100 |
| PMO | 0.0330 ± 0.0360 | 51 |
Figure 2Workflow of the proposal. This figure presents the inference stage for the quantification task and the classification task of the automatic MI prediction. On the left part, selected patient features are first preprocessed to 17 pieces of numerical or Boolean features. For the quantification task, the features are incorporated through a regression model so that the obtained value is the Percentage of Infarcted Myocardium (PIM) ranging from 0 to 1. In the case of the classification task, the prediction can be obtained by either a regression model followed by thresholding or a classification model. During the training stage, the regression model is supervised by the ground truth PIM, and the classification model is supervised by the ground truth state of the myocardium. Both ground truths are defined from the DE-MRI and manual annotations.
Prediction error of regression models for the PIM quantification.
|
|
| |
|---|---|---|
|
|
| |
| Linear Regression | 0.0639 ± 0.0677 | 0.0152 ± 0.0214 |
| Support Vector Regression | 0.0579 ± 0.0632 | |
| Decision Tree Regressor | 0.0679 ± 0.0741 | 0.0162 ± 0.0293 |
| Random Forest | 0.0587 ± 0.0597 | 0.0149 ± 0.0227 |
| Multilayer Perceptron | 0.0179 ± 0.0229 | |
| Gradient Boosting Regressor | 0.0602 ± 0.0584 | 0.0152 ± 0.0228 |
| XGBoost | 0.0646 ± 0.0572 | 0.0172 ± 0.0199 |
| Light Gradient Boosting | 0.0590 ± 0.0616 | 0.0161 ± 0.0227 |
| Ensemble | 0.0555 ± 0.0594 | 0.0141 ± 0.0210 |
| Mean predicted PIMb | 0.1070 ± 0.0693 | 0.0162 ± 0.0206 |
In bold is the best result of a single model. The ensemble is the average predictions of all the regression models.
.
Figure 3Receiver operating characteristic curves of classification results. The classification of different tissues was realized with Random Forest (RF) algorithms with different optimization functions.
Classification results under different metrics.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| RF Regressora | Infarction, θ = 0.064 | 85.00% | 96.00% | 97.70% | 88.67% |
| PMO, θ = 0.013 | 70.59% | 80.81% | 65.45% | 77.33% | |
| RF Classifier | Infarction | 89.00% | 84.00% | 91.75% | 87.33% |
| PMO | 50.98% | 84.85% | 63.41% | 73.33% | |
| RF Classifier with only ECG and troponin | Infarction | 77.00% | 62.00% | 80.21% | 72.00% |
| PMO | 58.82% | 79.80% | 60.00% | 72.67% | |
| GT with thresholding | Infarction, θ = 0.064 | 87.00% | 100.00% | 100.00% | 91.33% |
| PMO, θ = 0.013 | 56.86% | 100.00% | 100.00% | 85.33% |
The threshold value θ is derived when the best classification accuracy is achieved from the RF regressor.
.
Figure 4Impact of the training set volume on the mean and the SD of quantification error. The dataset of 150 cases was randomly split into multiple folds to have different amounts of training data. The training set volume ranged from 10 to 149 for each cross-validation.
Figure 5Prediction error on cases of different severity according to the infarction. Normal cases were not considered in this figure. SD, Standard Deviation; mean diff, mean difference.
Feature importance to linear and non-linear models for classification and quantification tasks.
|
|
|
|
| ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| Infarction | Quantification | RFR | 2.91 |
| 1.04 | 0.67 | 0.73 | 1.06 | 1.73 | 0.55 | 0.61 | 1.70 |
| 0.45 | 0.26 | 0.28 | 0.10 |
|
|
| LR | 6.50 | 3.64 | 0.00 | 0.00 | 0.00 | 1.17 | 5.68 | 1.77 |
|
|
| 0.00 | 0.00 | 0.00 | 0.00 |
| 4.08 | ||
| Classification | RFC |
|
| 2.44 | 1.65 | 2.18 | 2.02 | 3.21 | 1.37 | 2.70 |
|
| 1.18 | 1.22 | 0.17 | 0.10 | 9.29 | 9.71 | |
| SVCL |
| 0.93 | 4.57 | 5.71 | 1.14 | 2.13 | 1.14 | 5.22 |
|
|
| 0.87 | 0.72 | 0.50 | 0.35 | 0.99 | 1.50 | ||
| PMO | Quantification | RFR | 1.48 |
| 2.58 | 0.70 | 1.92 | 2.31 | 1.67 | 0.18 | 0.17 | 0.43 |
| 1.24 | 0.46 | 0.81 | 0.67 |
|
|
| LR | 1.30 |
| 0.00 | 0.00 | 0.00 | 2.30 |
| 2.02 | 2.61 | 10.63 |
| 0.00 | 0.00 | 0.00 | 0.00 | 3.55 |
| ||
| Classification | RFC | 4.04 |
| 2.57 | 1.79 | 2.37 | 3.42 | 2.73 | 1.64 | 0.88 | 6.14 |
| 1.21 | 0.99 | 0.57 | 0.47 |
|
| |
| SVCL | 4.90 | 5.88 | 1.17 | 2.27 | 3.44 | 0.99 | 2.13 |
| 3.81 |
|
| 5.71 | 7.55 |
| 5.19 | 5.16 | 1.02 | ||
| Mean | 5.91 |
| 1.80 | 1.60 | 1.47 | 1.93 | 3.90 | 2.75 | 5.36 | 7.94 |
| 1.33 | 1.40 | 1.30 | 0.86 |
|
| ||
Categorical features were converted to one-hot encoding. Classification models were trained with the Boolean values, quantification models were trained with PIM. The importance of each model and task was normalized, therefore, the sum of importance was 100%. Higher importance shows a closer relationship between the feature and the disease. Bold indicates the four most important features for each model.
Percentage of Infarcted Myocardium prediction error with selected important features using Random Forest Regressor.
|
|
| |
|---|---|---|
|
|
| |
| Troponin | 0.0634 ± 0.0615* | 0.0128 ± 0.0226* |
| Troponin, LVEF | 0.0585 ± 0.0620* | 0.0122 ± 0.02121* |
| Troponin, LVEF, NTp, Age | 0.0645 ± 0.0598* | 0.0145 ± 0.0230* |
| All 12 features | 0.0587 ± 0.0597 | 0.0149 ± 0.0227 |
*If the t-test between the selected features model and the all 12 features model obtains p > 0.05: the difference is not significant.
Cases with incorrect prediction.
|
|
|
|
| |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
|
| ||||||||||||||
| 1 | 0 | 32 | 0 | 0 | 0 | 0 | 0 | 1 | 130 | 1 | 35 | 447 | 51.64 | 23.25 |
| 7 | 0 | 66 | 0 | 0 | 0 | 0 | 0 | 0 | 200 | 1 | 45 | 532 | 9.33 | 25.64 |
| 19 | 0 | 52 | 0 | 1 | 0 | 0 | 0 | 0 | 87 | 3 | 20 | 7139 | 48.04 | 14.84 |
| 22 | 0 | 53 | 0 | 0 | 0 | 0 | 0 | 1 | 170 | 1 | 60 | 43 | 42.56 | 23.81 |
| 69 | 1 | 45 | 0 | 0 | 0 | 0 | 0 | 1 | 120 | 1 | 55 | 649 | 39.06 | 18.62 |
| 94 | 0 | 61 | 0 | 0 | 1 | 1 | 0 | 1 | 3.9 | 1 | 46 | 5810 | 29.75 | 12.94 |
| 105 | 0 | 54 | 2 | 1 | 0 | 0 | 1 | 1 | 25 | 1 | 21 | 4153 | 46.41 | 16.52 |
| 110 | 0 | 49 | 0 | 1 | 0 | 0 | 0 | 1 | 200 | 1 | 45 | 29 | 7.54 | 27.95 |
| 119 | 0 | 66 | 2 | 1 | 0 | 0 | 0 | 1 | 73 | 1 | 70 | 159 | 31.79 | 9.86 |
|
| ||||||||||||||
| 16 | 0 | 76 | 0 | 1 | 0 | 0 | 0 | 1 | 14 | 1 | 60 | 192 | 0.00 | 9.32 |
| 34 | 1 | 78 | 2 | 1 | 0 | 0 | 0 | 1 | 1.8 | 1 | 35 | 22577 | 0.00 | 11.20 |
| 65 | 0 | 57 | 1 | 1 | 0 | 0 | 0 | 1 | 19 | 1 | 60 | 71 | 0.00 | 8.06 |
| 68 | 0 | 39 | 0 | 1 | 0 | 0 | 0 | 1 | 9 | 1 | 60 | 23 | 0.00 | 10.40 |
| 114 | 1 | 54 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 45 | 68 | 0.00 | 7.98 |
| 117 | 0 | 53 | 1 | 1 | 0 | 0 | 0 | 1 | 83 | 1 | 60 | 94 | 0.00 | 18.83 |
| 145 | 1 | 66 | 2 | 1 | 0 | 0 | 0 | 0 | 2.5 | 2 | 45 | 6209 | 14.93 | 1.38 |
Incorrect predictions are presented as the important quantification error and the wrong classification (false positive and false negative). The ground truth (GT) and the prediction values are the PIM. Classification results in the table were obtained by regression model and thresholding.
.