| Literature DB >> 32401764 |
Fernando De la Garza-Salazar1,2, Maria Elena Romero-Ibarguengoitia1,3, Elias Abraham Rodriguez-Diaz4, Jose Ramón Azpiri-Lopez4, Arnulfo González-Cantu1,3.
Abstract
The electrocardiogram (ECG) is the most common tool used to predict left ventricular hypertrophy (LVH). However, it is limited by its low accuracy (<60%) and sensitivity (30%). We set forth the hypothesis that the Machine Learning (ML) C5.0 algorithm could optimize the ECG in the prediction of LVH by echocardiography (Echo) while also establishing ECG-LVH phenotypes. We used Echo as the standard diagnostic tool to detect LVH and measured the ECG abnormalities found in Echo-LVH. We included 432 patients (power = 99%). Of these, 202 patients (46.7%) had Echo-LVH and 240 (55.6%) were males. We included a wide range of ventricular masses and Echo-LVH severities which were classified as mild (n = 77, 38.1%), moderate (n = 50, 24.7%) and severe (n = 75, 37.1%). Data was divided into a training/testing set (80%/20%) and we applied logistic regression analysis on the ECG measurements. The logistic regression model with the best ability to identify Echo-LVH was introduced into the C5.0 ML algorithm. We created multiple decision trees and selected the tree with the highest performance. The resultant five-level binary decision tree used only six predictive variables and had an accuracy of 71.4% (95%CI, 65.5-80.2), a sensitivity of 79.6%, specificity of 53%, positive predictive value of 66.6% and a negative predictive value of 69.3%. Internal validation reached a mean accuracy of 71.4% (64.4-78.5). Our results were reproduced in a second validation group and a similar diagnostic accuracy was obtained, 73.3% (95%CI, 65.5-80.2), sensitivity (81.6%), specificity (69.3%), positive predictive value (56.3%) and negative predictive value (88.6%). We calculated the Romhilt-Estes multilevel score and compared it to our model. The accuracy of the Romhilt-Estes system had an accuracy of 61.3% (CI95%, 56.5-65.9), a sensitivity of 23.2% and a specificity of 94.8% with similar results in the external validation group. In conclusion, the C5.0 ML algorithm surpassed the accuracy of current ECG criteria in the detection of Echo-LVH. Our new criteria hinge on ECG abnormalities that identify high-risk patients and provide some insight on electrogenesis in Echo-LVH.Entities:
Year: 2020 PMID: 32401764 PMCID: PMC7219774 DOI: 10.1371/journal.pone.0232657
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 2New model and electrocardiographic left ventricular hypertrophy phenotypes.
Echo and ECG were obtained in 432 patients (46.7% LVH positive). Logistic regression modeling was performed for dimensionality reduction. This supervised classification model was created with 80% of the sample and the remaining 20% was used for internal validation. The C5.0 ML algorithm resulted in a simple, five-level, seven-node decision tree. Each leaf shows the probability of having LVH and if greater than 0.5 (50%), the patient will be classified as having LVH and vice-versa. External validation was conducted in 150 subjects (47.3% LVH positive). ECG: Electrocardiography, ECHO: echocardiography, LVH: left ventricular hypertrophy.
Demographic and echocardiographic measurements of the population.
| Mean (SD) | Total sample (n = 432) | UCG | p-value | |
|---|---|---|---|---|
| Negative LVH (n = 230) | Positive LVH (n = 202) | |||
| Age (years) | 67.3 (13.7) | 65.7 (14.8) | 69.3 (12.1) | 0.03 |
| Weight (kg) | 78.2 (16.7) | 78.2 (16) | 78.1 (17.4) | 0.93 |
| Height (cm) | 167.4 (9.7) | 168.3 (9.7) | 166.2 (9.6) | 0.04 |
| BMI (kg/m2) | 27.8 (4.9) | 27.4 (4.3) | 28.2 (5.4) | 0.13 |
| BSA (m2) | 1.9 (0.24) | 1.9 (0.23) | 1.9 (0.24) | 0.49 |
| IVSTd | 1.18 (0.27) | 1.04 (0.19) | 1.34 (0.26) | 0.001 |
| LVIDd | 4.57 (0.75) | 4.4 (0.64) | 4.76 (0.8) | 0.001 |
| LVPWTd | 1.18 (0.26) | 1.04 (0.19) | 1.32 (0.24) | 0.001 |
| RWT | 0.54 (0.19) | 0.49 (0.16) | 0.58 (0.2) | 0.001 |
| LVM (gr) | 201.3 (72.4) | 157.4 (37.5) | 253.3 (68.8) | 0.001 |
| LVMI (gr/m2) | 106.2 (34.2) | 82.3 (15.1) | 133.4 (29) | 0.001 |
Demographic parameters of patients with UCG-LVH vs controls. The models were two sided and significant p -value was <0.05
Abbreviations: SD: standard deviation LVH: left ventricular hypertrophy, BMI: body mass index, BSA: body surface area, IVSTd: interventricular septum thickness diastole, LVIDd: left ventricular internal diameter diastole, LVPWTd: left ventricular posterior wall thickness diastole, RWT: relative wall thickness, LVM: left ventricular mass, LVMI: left ventricular mass index.
Comorbidities of the population.
| Total sample | Negative ECO-LVH (n = 145, 45.1%) | Positive ECO-LVH (n = 176, 54.8%) | p-value | |
|---|---|---|---|---|
| AF | 63 (19.6) | 19 (13.1) | 44 (25) | |
| Aortic stenosis | 17 (5.3) | 1 (0.7) | 16 (9.1) | |
| IHD | 171 (52.3) | 84 (53.2) | 87 (51.5) | |
| CHF | 44 (13.7) | 13 (9) | 31 (17.6) | |
| CKD | 32 (10) | 9 (6.2) | 23 (13.1) | |
| COPD | 9 (2.8) | 1 (0.7) | 8 (4.5) | |
| Dyslipidemia | 60 (18.7) | 23 (15.9) | 37 (21) | |
| DM2 | 114 (35.5) | 43 (29.7) | 71 (40.3) | |
| Hypertension | 207 (64.4) | 81 (25.2) | 126 (39.2) | |
| Hypothyroidism | 33 (10.3) | 9 (6.2) | 24 (13.6) | |
| OSA | 3 (0.9) | 1 (0.7) | 2 (1.1) | |
| PAD | 15 (4.7) | 8 (5.5) | 7 (4) | |
| PH | 5 (1.6) | 2 (1.4) | 3 (1.7) | |
| PE | 7 (2.2) | 4 (2.8) | 3 (1.7) | |
| SSS | 3 (0.9) | 1 (0.7) | 2 (1.1) | |
| Stroke | 34 (10.6) | 13 (9) | 21 (11.9) | |
| SVT | 9 (2.8) | 5 (3.4) | 4 (2.3) |
*Missing completely of random values of comorbidities were 25.7% of the total sample. Complete case analyses were performed.
Abbreviations: AF: atrial fibrillation, IHD: ischemic heart disease, CHF: congestive heart failure, CKD: chronic kidney disease, COPD: chronic obstructive pulmonary disease, DM2: type 2 diabetes mellitus, OSA: obstructive sleep apnea, PAD: peripheral artery disease, PH: pulmonary hypertension, PE: pulmonary embolism, SSS: sick sinus syndrome, SVT: supraventricular tachycardia.
Logistic regression model.
| Estimate | Std Error | p-value | CI 95% | |
|---|---|---|---|---|
| Intecept | -2.7 | 0.86 | [0.01, 0.34] | |
| ST abnormalities | 1.28 | 0.30 | [1.98, 6.52] | |
| S V4 | 0.93 | 0.36 | [1.24, 5.26] | |
| Intrinsicoid deflection in V6 | .97 | .32 | [1.4, 5.04] | |
| Negative P-wave deflection in V1 | 0.96 | 0.24 | [1.61, 4.22] | |
| R aVR | 3.9 | 1.3 | [4.1, 685.1] | |
| S aVR | 0.48 | 0.34 | [0.83, 3.17] | |
| P-wave duration in V1 | -0.009 | 0.003 | [0.98, 0.99] | |
| S V6 | 1.91 | 0.95 | [1.05, 44] | |
| S I | -2.24 | 1.01 | [0.01, 0.77] | |
| QRS duration | 0.01 | 0.008 | [0.99, 1.03] | |
| R I | 0.53 | 0.37 | [0.81, 3.57] |
AIC value: 524.8. These variables were used in the decision tree model. The estimates are standardized. The model was two sided and significant p -value was <0.05
Abbreviations: Std error: standard error, S V4: S-wave voltage in V4 lead, V6 intrinsicoid deflection: defined as a qR duration ≥ 0.05, negative P-wave deflection in V1 lead: defined as having a negative component duration greater than the duration of the positive component, R aVR: R-wave voltage in aVR lead, S aVR: S-wave voltage in aVR lead, S V6: S-wave voltage in V6 lead, S I: S-wave voltage in I lead, QRS duration: duration of QRS complex in V1 lead, R_I: R-wave voltage in I lead.
Diagnostic performance of simplified decision trees.
| Accuracy | Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) | |
|---|---|---|---|---|---|
| SV4 AND ST abnormality | 60.4 | 82.9 | 57.6 | 19.3 | 96.5 |
| SV4 AND negative deflection in V1 | 60.2 | 75 | 57.8 | 22.3 | 93.5 |
| Negative deflection in V1 AND ST abnormality | 60.4 | 82.9 | 57.6 | 19.3 | 96.5 |
| Negative deflection in V1 AND ST abnormality AND SV4 | 55.3 | 80 | 54.4 | 5.9 | 98.7 |
Abbreviations: PPV: positive predictive value, NPV: negative predictive value, SV4: positive if S-wave equal or greater than 0.6mV, major ST abnormality: defined as Minnesota’s code 4–1, Negative deflection in V1: P wave negative’s component duration in V1 lead is greater than the initial positive component.