| Literature DB >> 35869152 |
Uri Kartoun1, Shaan Khurshid2,3, Bum Chul Kwon1, Aniruddh P Patel2,4, Puneet Batra5, Anthony Philippakis2, Amit V Khera2,4, Patrick T Ellinor2,3, Steven A Lubitz2,3, Kenney Ng6.
Abstract
Prediction models are commonly used to estimate risk for cardiovascular diseases, to inform diagnosis and management. However, performance may vary substantially across relevant subgroups of the population. Here we investigated heterogeneity of accuracy and fairness metrics across a variety of subgroups for risk prediction of two common diseases: atrial fibrillation (AF) and atherosclerotic cardiovascular disease (ASCVD). We calculated the Cohorts for Heart and Aging in Genomic Epidemiology Atrial Fibrillation (CHARGE-AF) score for AF and the Pooled Cohort Equations (PCE) score for ASCVD in three large datasets: Explorys Life Sciences Dataset (Explorys, n = 21,809,334), Mass General Brigham (MGB, n = 520,868), and the UK Biobank (UKBB, n = 502,521). Our results demonstrate important performance heterogeneity across subpopulations defined by age, sex, and presence of preexisting disease, with fairly consistent patterns across both scores. For example, using CHARGE-AF, discrimination declined with increasing age, with a concordance index of 0.72 [95% CI 0.72-0.73] for the youngest (45-54 years) subgroup to 0.57 [0.56-0.58] for the oldest (85-90 years) subgroup in Explorys. Even though sex is not included in CHARGE-AF, the statistical parity difference (i.e., likelihood of being classified as high risk) was considerable between males and females within the 65-74 years subgroup with a value of - 0.33 [95% CI - 0.33 to - 0.33]. We also observed weak discrimination (i.e., < 0.7) and suboptimal calibration (i.e., calibration slope outside of 0.7-1.3) in large subsets of the population; for example, all individuals aged 75 years or older in Explorys (17.4%). Our findings highlight the need to characterize and quantify the behavior of clinical risk models within specific subpopulations so they can be used appropriately to facilitate more accurate, consistent, and equitable assessment of disease risk.Entities:
Mesh:
Year: 2022 PMID: 35869152 PMCID: PMC9307639 DOI: 10.1038/s41598-022-16615-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Baseline characteristics.
| Incident AF (5 years) | Incident ASCVD (10 years) | |||||
|---|---|---|---|---|---|---|
| Explorys (N = 4,750,660) | UKBB (N = 445,329) | MGB (N = 174,644) | Explorys (N = 3,656,680) | UKBB (N = 408,154) | MGB (N = 198,184) | |
| N events | 196,252 | 7404 | 7877 | 346,159 | 10,906 | 10,201 |
| Median follow-up, years (Q1, Q3) | 3.6 (1.6, 5.0) | 5.0 (5.0, 5.0) | 5.0 (2.3, 5.0) | 3.8 (1.8, 6.6) | 8.9 (8.2, 9.7) | 6.8 (2.6, 10.0) |
| Female (%) | 56.7 | 55.0 | 60.9 | 55.9 | 54.8 | 58.8 |
| Age (years) | 62.6 (10.8) | 58.4 (7.0) | 60.9 (10.0) | 59.0 (10.7) | 56.9 (8.1) | 57.0 (10.3) |
| White race (%) | 84.2 | 94.7 | 79.6 | 87.4 | 98.4 | 78.1 |
| Smoking (%) | 17.3 | 10.7 | 8.0 | 18.7 | 10.4 | 7.4 |
| SBP (mmHg) | 131 (18) | 139 (19) | 128 (17) | 129 (17) | 139 (20) | 126 (17) |
| DBP (mmHg | 77 (11) | 83 (10) | 76 (10) | DBP, Height, and Weight were not necessary to calculate PCE scores | ||
| Height (cm) | 168.5 (10.9) | 168.2 (9.2) | 166.6 (10.4) | |||
| Weight (kg) | 86.1 (22.1) | 77.9 (15.8) | 79.4 (19.5) | |||
| HDL (US: mg/dL; UK: mmol/L) | HDL and TC were not necessary to calculate CHARGE-AF scores | 51 (17) | 1.46 (0.4) | 57 (18) | ||
| TC (US: mg/dL; UK: mmol/L) | 189 (42) | 5.7 (1.1) | 195 (39) | |||
| Hypertensive therapy (%) | 50.1 | 30.5 | 44.8 | 52.8 | 27.9 | 39.3 |
| Diabetes (%) | 21.3 | 2.5 | 16.0 | 21.4 | 5.0 | 14.8 |
| Heart failure (%) | 3.7 | 0.4 | 1.9 | 3.5 | 0.3 | 1.6 |
Figure 1Incidence rates per 1 K PY and population sizes. All population and subpopulation sizes and exact incidence rates are provided in Supplementary Table IX.
Figure 2Performance measures for CHARGE-AF. Prev. = Prevalence; HF = Heart failure.
Figure 3Fairness analysis for CHARGE-AF. Note that data was not available in the UKBB for the 75–84 and 85–90 age subpopulations.
Figure 4Performance measures for PCE (Female-White). Prev. = Prevalence; HF = Heart failure. Refer to Supplementary Table VIII for additional PCE models.
Figure 5Fairness analysis for PCE.