| Literature DB >> 31675367 |
Nathan R Hill1, Daniel Ayoubkhani2, Phil McEwan2, Daniel M Sugrue2, Usman Farooqui1, Steven Lister1, Matthew Lumley3, Ameet Bakhai4, Alexander T Cohen5, Mark O'Neill6, David Clifton7, Jason Gordon2.
Abstract
BACKGROUND: Atrial fibrillation (AF) is the most common sustained heart arrhythmia. However, as many cases are asymptomatic, a large proportion of patients remain undiagnosed until serious complications arise. Efficient, cost-effective detection of the undiagnosed may be supported by risk-prediction models relating patient factors to AF risk. However, there exists a need for an implementable risk model that is contemporaneous and informed by routinely collected patient data, reflecting the real-world pathology of AF.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31675367 PMCID: PMC6824570 DOI: 10.1371/journal.pone.0224582
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Patient disposition at study entry.
| Variable | Category | Full population | AF cohort | Non-AF cohort | P-value | Cohen’s |
|---|---|---|---|---|---|---|
| Age (years), mean (SD) | 55.98 (14.46) | 70.23 (11.07) | 55.51 (14.32) | <0.001 | 1.0343 | |
| Sex, n (%) | Male | 1,395,397 (46.6) | 51,738 (54.1) | 1,343,659 (46.3) | <0.001 | 0.1558 |
| Smoking status, n (%) | Current | 555,074 (18.5) | 10,571 (11.1) | 544,503 (18.8) | <0.001 | -0.1989 |
| Former | 701,966 (23.4) | 32,198 (33.7) | 669,768 (23.1) | 0.2499 | ||
| Non-smoker | 1,269,538 (42.4) | 37,384 (39.1) | 1,232,154 (42.5) | -0.0688 | ||
| Passive | 7,876 (0.3) | 279 (0.3) | 7,597 (0.3) | 0.0058 | ||
| Unknown | 460,383 (15.4) | 15,175 (15.9) | 445,208 (15.4) | 0.0143 | ||
| Race, n (%) | Black | 29,380 (1.0) | 207 (0.2) | 29,173 (1.0) | <0.001 | -0.0801 |
| White | 729,507 (24.4) | 25,459 (26.6) | 704,048 (24.3) | 0.0546 | ||
| Other | 75,142 (2.5) | 1,031 (1.1) | 74,111 (2.6) | -0.0945 | ||
| Unknown | 2,160,808 (72.2) | 68,910 (72.1) | 2,091,898 (72.2) | -0.0017 | ||
| Hypertension | 748,849 (25.0) | 50,501 (52.8) | 698,348 (24.1) | <0.001 | 0.6306 | |
| Heart failure | 22,054 (0.7) | 2,805 (2.9) | 19,249 (0.7) | <0.001 | 0.2658 | |
| Left ventricular hypertrophy | 4,727 (0.2) | 502 (0.5) | 4,225 (0.1) | <0.001 | 0.0956 | |
| Myocardial infarction | 42,830 (1.4) | 3,009 (3.1) | 39,821 (1.4) | <0.001 | 0.1494 | |
| Coronary heart disease | 154,029 (5.1) | 13,703 (14.3) | 140,326 (4.8) | <0.001 | 0.4310 | |
| Congenital heart disease | 501 (<0.1) | 58 (0.1) | 443 (<0.1) | <0.001 | 0.0351 | |
| Type 1 diabetes | 19,101 (0.6) | 831 (0.9) | 18,270 (0.6) | <0.001 | 0.0300 | |
| Type 2 diabetes | 187,733 (6.3) | 10,727 (11.2) | 177,006 (6.1) | <0.001 | 0.2111 | |
| Height (m) | 1.68 (0.10) | 1.69 (0.11) | 1.68 (0.10) | <0.001 | 0.0508 | |
| Weight (kg) | 78.32 (18.28) | 81.55 (19.45) | 78.21 (18.23) | <0.001 | 0.1825 | |
| BMI (kg/m2) | 27.59 (5.99) | 28.56 (6.16) | 27.56 (5.99) | <0.001 | 0.1662 | |
| SBP (mmHg) | 133.58 (18.87) | 140.97 (19.29) | 133.34 (18.81) | <0.001 | 0.4054 | |
| DBP (mmHg) | 79.40 (10.92) | 79.12 (11.01) | 79.41 (10.92) | <0.001 | -0.0266 | |
| Pulse pressure (mmHg) | 54.18 (14.93) | 61.85 (16.47) | 53.93 (14.81) | <0.001 | 0.4917 | |
AF: atrial fibrillation and atrial flutter; BMI: body mass index; DBP: diastolic blood pressure; SBP: systolic blood pressure; SD: standard deviation.
*clinical histories assessed five years prior to index.
Summary of key AF risk factors inferred from the final risk model.
| Risk factor |
|---|
| Patient demographics (age, sex, race, smoking status) at baseline |
| History of antihypertensive medication use at baseline |
| History of type 1 or type 2 diabetes at baseline |
| History of cardiovascular comorbidities at baseline |
| Presence of a cardiovascular event in the past year |
| BMI in each of the latest four quarters |
| Change in BMI between the latest two quarters |
| High pulse pressure in the latest quarter |
| Negative absolute change in DBP or positive absolute change in SBP between the latest two quarters |
| Increasing frequency of DBP, SBP and BMI recording in the latest quarter |
BMI: body mass index; DBP: diastolic blood pressure; SBP: systolic blood pressure. Risk factors were identified using model inferences from variable importance plots[23] and partial dependence plots[24] for the fitted baseline and time-varying neural networks. Smoking status was defined as: current smoker, former smoker, non-smoker, passive smoker and unknown. Cardiovascular comorbidities/events considered were: hypertension (diagnosed), hypertension (receiving antihypertensive medication), heart failure, coronary heart disease, congenital heart disease, myocardial infarction left ventricular hypertrophy, type 1 diabetes, type 2 diabetes. Pulse pressure is the difference between systolic and diastolic blood pressure, corresponding to the force generated by cardiac contraction. “High” pulse pressure refers to elevated (>120 mmHg) SBP combined with a low to normal (≤80 mmHg) DBP. For details, see S3 Fig. Frequency with which clinical characteristics were recorded was a continuous variable representing the number (count) of DBP, SBP, etc. recordings, rather than actual values recorded over a given quarter.
Fig 1Most important predictors of AF according to: (A) the CHARGE-AF risk model; (B) the fitted baseline neural network; and (C) the fitted time-varying neural network. Variable importance was determined on the training dataset according to the absolute size of the published regression coefficients for the CHARGE-AF risk model[11] and Garson’s algorithm[23] for the fitted neural networks. Importance is expressed as a percentage of the most important predictor within each model. Importance is shown for all 11 variables in the CHARGE-AF risk model and the top 20 most important variables in each of the fitted neural network models.
Assessment of model performance at 75% sensitivity.
| Model | Specificity | PPV | NNS | AUROC |
|---|---|---|---|---|
| Final machine learning risk model | 74.9% | 11.5% | 9 patients | 0.827 |
| CHARGE-AF risk model | 61.0% | 7.9% | 13 patients | 0.725 |
| Logistic regression | 52.0% | 6.5% | 15 patients | 0.695 |
AUROC: area under the receiver operating characteristic curve; LR: logistic regression; NNS: number needed to screen (number of patients needed to be screened to identify one AF case); PPV: positive predictive value (percentage of screened patients diagnosed with AF). Sensitivity: percentage of patients diagnosed with AF that would be identified for screening; specificity: percentage of patients not diagnosed with AF that would not be identified for screening.
Fig 2Illustration of the trade-off between sensitivity and positive predictive value (PPV) for the final machine learning risk model (A–C) compared with the baseline CHARGE-AF risk model (D–F).