| Literature DB >> 27832070 |
Alexander M Kulminski1, Liang He1, Irina Culminskaya1, Yury Loika1, Yelena Kernogitski1, Konstantin G Arbeev1, Elena Loiko1, Liubov Arbeeva1, Olivia Bagley1, Matt Duan1, Arseniy Yashkin1, Fang Fang1, Mikhail Kovtun1, Svetlana V Ukraintseva1, Deqing Wu1, Anatoliy I Yashin1.
Abstract
Gaining insights into genetic predisposition to age-related diseases and lifespan is a challenging task complicated by the elusive role of evolution in these phenotypes. To gain more insights, we combined methods of genome-wide and candidate-gene studies. Genome-wide scan in the Atherosclerosis Risk in Communities (ARIC) Study (N = 9,573) was used to pre-select promising loci. Candidate-gene methods were used to comprehensively analyze associations of novel uncommon variants in Caucasians (minor allele frequency~2.5%) located in band 2q22.3 with risks of coronary heart disease (CHD), heart failure (HF), stroke, diabetes, cancer, neurodegenerative diseases (ND), and mortality in the ARIC study, the Framingham Heart Study (N = 4,434), and the Health and Retirement Study (N = 9,676). We leveraged the analyses of pleiotropy, age-related heterogeneity, and causal inferences. Meta-analysis of the results from these comprehensive analyses shows that the minor allele increases risks of death by about 50% (p = 4.6×10-9), CHD by 35% (p = 8.9×10-6), HF by 55% (p = 9.7×10-5), stroke by 25% (p = 4.0×10-2), and ND by 100% (p = 1.3×10-3). This allele also significantly influences each of two diseases, diabetes and cancer, in antagonistic fashion in different populations. Combined significance of the pleiotropic effects was p = 6.6×10-21. Causal mediation analyses show that endophenotypes explained only small fractions of these effects. This locus harbors an evolutionary conserved gene-desert region with non-coding intergenic sequences likely involved in regulation of protein-coding flanking genes ZEB2 and ACVR2A. This region is intensively studied for mutations causing severe developmental/genetic disorders. Our analyses indicate a promising target region for interventions aimed to reduce risks of many major human diseases and mortality.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27832070 PMCID: PMC5104356 DOI: 10.1371/journal.pgen.1006314
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Basic characteristics of carriers and non-carriers of the minor allele of rs222826 or rs222827 SNPs in the selected studies.
| Factors | ARIC, rs222826 | FHS_C1, rs222826 | FHSO, rs222826 | HRS, rs222827 | ||||
|---|---|---|---|---|---|---|---|---|
| CC | CT+TT | CC | CT+TT | CC | CT+TT | GG | GA+AA | |
| N (% | 9154 (95.6) | 419 (4.4) | 982 (94.7) | 55 (5.3) | 3184 (93.7) | 213 (6.3) | 9308 (96.2) | 368 (3.8) |
| Age, mean (SD), years | 54.3 (5.7) | 54.6 (5.6) | 37.9 (6) | 37.3 (5.9) | 35.4 (10) | 35.1 (9.7) | 58.1 (9.1) | 58 (9) |
| LS, mean (SD), years | 69.8 (5.9) | 69.8 (5.8) | 88.2 (7.1) | 87.7 (6.5) | 72.1 (9.2) | 70.9 (9.1) | 75 (10.2) | 74.7 (10.1) |
| BC, mean (SD), years | 1933.3 (5.8) | 1933.1 (5.7) | 1909.7 (6.1) | 1910.1 (5.7) | 1935.1 (9.9) | 1935.5 (9.4) | 1940.2 (10.3) | 1938.5 (10.3) |
| BMI, mean (SD), kg/m2 | 27 (4.8) | 27.4 (5.1) | 24.7 (3.7) | 23.5 (3.2) | 25.2 (4.3) | 25.7 (4.3) | 26.9 (5) | 27 (5.2) |
| TC | 214.9 (40.8) | 217 (40.3) | 238.3 (44.2) | 239.3 (42.4) | 194.3 (38) | 194.5 (39) | 194.6 (44.3) | 192.1 (44.2) |
| HDL-C | 50.6 (16.7) | 48.6 (16.1) | 49.6 (14.5) | 49.4 (16.1) | 50.9 (14.6) | 50.1 (14.6) | 56.1 (15.7) | 55 (16.4) |
| TG | 136.9 (91.4) | 149.4 (104.9) | 151.2 (99.9) | 151 (86.8) | 91.9 (85.5) | 92.9 (71.1) | NA | NA |
| SBP, mean (SD), mmHg | 139 (15.5) | 139.2 (16.7) | 126.2 (15.1) | 123 (11.2) | 120.2 (14.5) | 121.3 (14.7) | NA | NA |
| DBP, mean (SD), mmHg | 92 (9.8) | 92.2 (9.4) | 79.5 (9.6) | 78.7 (8.1) | 77.7 (10) | 78.3 (10.3) | NA | NA |
| CHD, yes (% | 846 (9.2) | 62 (14.8) | 385 (39.2) | 23 (41.8) | 557 (17.5) | 36 (16.9) | 3579 (38.5) | 149 (40.5) |
| HF, yes (% | 766 (8.9) | 49 (12.6) | 272 (27.7) | 18 (32.7) | 237 (7.4) | 21 (9.9) | NA | NA |
| Stroke, yes (% | 507 (5.5) | 28 (6.7) | 191 (19.5) | 16 (29.1) | 191 (6) | 17 (8) | 1184 (12.7) | 46 (12.5) |
| Diabetes, yes (% | 1608 (18) | 94 (22.9) | 305 (31.1) | 13 (23.6) | 312 (9.9) | 29 (13.7) | 2375 (25.5) | 116 (31.5) |
| Cancer, yes (% | 1225 (13.4) | 66 (15.8) | 467 (47.6) | 28 (50.9) | 1117 (35.1) | 81 (38) | 2213 (23.8) | 71 (19.3) |
| ND, yes (% | NA | NA | 266 (27.7) | 24 (43.6) | NA | NA | NA | NA |
| Death, yes (% | 1344 (14.7) | 79 (18.9) | 880 (89.6) | 54 (98.2) | 656 (20.6) | 60 (28.2) | 1230 (13.2) | 59 (16) |
HRS SNP used is proxy SNP rs222827, which is in perfect linkage disequilibrium (LD), r = 1, with rs222826 in CEU population; these SNPs are 90 bp apart.
Percentage is within
a the genotyped sample
b each genotype.
N denotes sample size; NA = not available.
Mean age is given at baselines; LS denotes lifespan measured by age at death or the end of follow-up; BC denotes birth cohorts.
ARIC = the Atherosclerosis Risk in Communities Study, FHS_C1 = the Framingham Heart Study (FHS) original cohort; FHSO = the FHS Offspring cohort, and HRS = the Health and Retirement Study.
BMI = body mass index; TC = total cholesterol; HDL-C = high-density lipoprotein cholesterol; TG = triglycerides; SBP = systolic blood pressure; DBP = diastolic blood pressure; CHD = coronary heart disease; HF = heart failure; and ND = neurodegenerative diseases (dementias including Alzheimer’s type).
c Means of TC, HDL-C, and TG concentrations are given at 9th FHS_C1 examination because this visit included the largest sample size with non-missing information on these lipids. For FHSO and other studies means of lipid concentrations were representatively given at baselines.
Hazard ratios for major human diseases and death for the rs222826 minor allele carriers in ARIC, FHS, and HRS.
| Outcome | ARIC, N = 9,618 | FHS, N = 4,700 | HRS | ||||||
|---|---|---|---|---|---|---|---|---|---|
| HR | 95% CI | p-value | HR | 95% CI | p-value | HR | 95% CI | p-value | |
| CHD | 1.74 | 1.42–2.14 | 1.1E-7 | 1.07 | 0.82–1.41 | 6.1E-1 | 1.10 | 0.88–1.38 | 3.8E-1 |
| HF | 1.50 | 1.12–2.01 | 6.1E-3 | 1.52 | 1.09–2.12 | 1.5E-2 | NA | NA | NA |
| Stroke | 1.18 | 0.75–1.85 | 4.7E-1 | 1.70 | 1.19–2.43 | 3.6E-3 | 1.01 | 0.72–1.38 | 9.9E-1 |
| Diabetes | 1.35 | 1.01–1.81 | 4.2E-2 | 1.05 | 0.75–1.48 | 7.6E-1 | 1.35 | 1.07–1.69 | 9.9E-3 |
| Cancer | 1.22 | 0.89–1.68 | 2.2E-1 | 1.24 | 1.02–1.50 | 3.4E-2 | 0.76 | 0.59–1.00 | 4.8E-2 |
| Death | 1.32 | 1.05–1.65 | 1.7E-2 | 1.64 | 1.34–2.00 | 1.3E-6 | 1.31 | 1.01–1.70 | 4.4E-2 |
a HRS SNP used is proxy SNP rs222827
N denotes sample size; NA = not available or not estimated
ARIC = the Atherosclerosis Risk in Communities Study, FHS = the Framingham Heart Study (FHS); and HRS = the Health and Retirement Study.
CHD = coronary heart disease; HF = heart failure.
HR denotes hazard ratio from the Cox proportional hazards regression model.
b In HRS HR was evaluated for risk of death. For diseases in HRS we evaluated odds ratios (OR) using logistic regression model because no information on onsets of these diseases was available.
CI denotes confidence interval
Hazard ratios for major human diseases and death for the rs222826_T (minor) allele carriers in the Framingham cohorts.
| Outcome | FHS_C1, N = 1,105 | FHSO, N = 3,595 | ||||
|---|---|---|---|---|---|---|
| HR | 95% CI | p-value | HR | 95% CI | p-value | |
| CHD | 1.20 | 0.79–1.83 | 4.0E-1 | 0.99 | 0.70–1.41 | 9.7E-1 |
| CHD65+ | 1.50 | 0.95–2.37 | 7.9E-2 | NA | NA | NA |
| HF | 1.41 | 0.87–2.28 | 1.6E-1 | 1.70 | 1.07–2.70 | 2.5E-2 |
| Stroke | 1.91 | 1.10–3.31 | 2.1E-2 | 1.60 | 0.97–2.65 | 6.8E-2 |
| Diabetes | 0.71 | 0.40–1.24 | 2.3E-1 | 1.36 | 0.90–2.06 | 1.5E-1 |
| Cancer | 1.21 | 0.82–1.77 | 3.4E-1 | 1.25 | 1.00–1.58 | 5.2E-2 |
| ND | 1.89 | 1.24–2.88 | 2.9E-3 | NA | NA | NA |
| Death | 1.45 | 1.07–1.97 | 1.7E-2 | 1.90 | 1.44–2.50 | 4.7E-6 |
N denotes sample size; NA = not available or not estimated
FHS_C1 = the Framingham Heart Study (FHS) original cohort; FHSO = the FHS Offspring cohort.
CHD = coronary heart disease; CHD65+ = coronary heart disease with onset at 65 years and older; HF = heart failure; ND = neurodegenerative diseases (dementias including Alzheimer’s type).
HR denotes hazard ratio from the Cox proportional hazards regression model.
CI denotes confidence interval
Fig 1Probability of remaining free of coronary heart disease (CHD) for the rs222826_T allele carriers.
(A) The Atherosclerosis Risk in Communities Study (ARIC), (B, D) the Framingham Heart Study (FHS) original (FHS_C1) cohort, (C) the FHS Offspring (FHSO) cohort. HR denotes hazard ratio, HRall in panel (B) denotes the estimates for the entire FHS_C1 sample. HR65+ in (D) denotes the estimates for onsets of CHD at ages 65 years and older. N = n/m denotes the size of the entire sample (n) and the number of CHD cases (m) for major allele homozygotes (CC, blue color) and minor allele carriers (CT+TT, red color). P shows p-values.
Causal mediation analyses: the role of lipids and BMI in the effects of the rs222826_T allele on risks of diseases and death in the ARIC and the FHS original cohort.
| Outcome | ARIC, mediator HDL-C | ARIC, mediator TG | FHS_C1, mediator BMI | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Direct | Indirect | Direct | Indirect | Direct | Indirect | |||||||
| HR | p | HR | p | HR | p | HR | p | HR | p | HR | p | |
| CHD | 1.66 | <8.0E-3 | 1.05 | <8.0E-3 | 1.66 | <8.0E-3 | 1.04 | 1.6E-2 | 1.52 | 1.0E-1 | .989 | 4.9E-1 |
| HF | 1.48 | 2.4E-2 | 1.04 | <8.0E-3 | 1.47 | 2.4E-2 | 1.05 | 8.0E-3 | 1.48 | 8.8E-2 | .958 | 3.2E-2 |
| Stroke | 1.16 | 6.2E-1 | 1.03 | <8.0E-3 | 1.13 | 6.2E-1 | 1.03 | 8.0E-3 | 1.94 | 5.6E-2 | .968 | 1.9E-1 |
| Diabetes | 1.28 | 1.0E-1 | 1.07 | <8.0E-3 | 1.33 | 7.2E-2 | 1.02 | 4.0E-1 | .752 | 3.0E-1 | .953 | 4.8E-2 |
| Cancer | 1.19 | 1.7E-1 | 1.00 | 4.2E-1 | 1.18 | 2.1E-1 | 1.00 | 4.3E-1 | 1.23 | 3.8E-1 | .998 | 9.5E-1 |
| ND | NA | NA | NA | NA | NA | NA | NA | NA | 1.94 | <8.0E-3 | .958 | 1.6E-2 |
| Death | 1.28 | 3.2E-2 | 1.03 | <8.0E-3 | 1.25 | 5.6E-2 | 1.03 | 2.4E-2 | 1.47 | 8.0E-3 | .981 | 1.6E-1 |
HDL-C = high-density lipoprotein cholesterol; TG = triglycerides; CHD = coronary heart disease; HF = heart failure; ND = neurodegenerative diseases (dementias including Alzheimer’s type).
“Direct” (“Indirect”) implies the effect of the rs222826_T allele on an outcome not through (through) a given mediator, i.e., HDL-C or TG in ARIC and BMI in FHS original (FHS_C1) cohort.
HDL-C and BMI were tested for potential mediation effects given their significant associations with the rs222826_T allele in respective studies (S1 Table).
HR denotes hazard ratio from the Cox proportional hazards regression model.
a To correct for disproportionality of hazards, the estimate in the FHS_C1 (original) cohort is given for onset of CHD at ages 65 years and older.
NA = not available
Hazard ratios for HF, ND and death for the minor allele of rs222826 or rs222827 SNPs and potential endophenotypes in the FHS original cohort and HRS.
| Outcome | Study | Predictor | Sample | HR | 95% CI | p |
|---|---|---|---|---|---|---|
| HF | FHS_C1 | rs222826_T | All | 1.52 | 0.94–2.46 | 9.0E-2 |
| BMI | All | 1.05 | 1.02–1.09 | 2.0E-3 | ||
| ND | FHS_C1 | rs222826_T | All | 2.00 | 1.31–3.06 | 1.3E-3 |
| BMI | All | 1.04 | 1.01–1.08 | 1.7E-2 | ||
| Death | HRS | rs222827_A | All | 1.32 | 1.01–1.71 | 3.9E-2 |
| Cancer | All | 1.29 | 1.15–1.45 | 1.4E-5 | ||
| rs222827_A | Cancer free | 1.51 | 1.12–2.03 | 6.7E-3 | ||
| rs222827_A | With cancer | 0.92 | 0.53–1.61 | 7.8E-1 |
FHS_C1 = the Framingham Heart Study (FHS) original cohort and HRS = the Health and Retirement Study.
HF = heart failure; ND = neurodegenerative diseases (dementias including Alzheimer’s type).
HR denotes hazard ratio from the Cox proportional hazards regression model.
CI denotes confidence interval
All: the models were fitted in the entire samples to estimate additive effects of the minor allele and: (i) BMI measured in kg/m2 in FHS and (ii) cancer in HRS.
Rows “Cancer free” and “With cancer” show the results in the HRS samples stratified by cancer status.
Causal mediation analyses: the role of diseases-endophenotypes in the effect of the minor allele of rs222826 SNP on risk of death in ARIC, FHS cohorts, and HRS.
| Mediator | ARIC | FHS_C1 | FHSO | HRS | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Direct | Indirect | Direct | Indirect | Direct | Indirect | Direct | Indirect | |||||||||
| HR | p | HR | p | HR | p | HR | p | HR | p | HR | p | HR | p | HR | p | |
| CHD | 1.22 | 1.5E-1 | 1.09 | <8.0E-3 | 1.42 | <8.0E-3 | 1.00 | 7.7E-1 | 1.88 | <8.0E-3 | 1.00 | 9.4E-1 | 1.30 | 7.2E-2 | 1.01 | 4.2E-1 |
| CHD65+ | NA | NA | NA | NA | 1.44 | 4.8E-2 | 1.00 | 6.9E-1 | NA | NA | NA | NA | NA | NA | NA | NA |
| HF | 1.20 | 2.0E-1 | 1.09 | 2.4E-2 | 1.43 | 1.6E-2 | 1.00 | 8.9E-1 | 1.82 | <8.0E-3 | 1.03 | 1.3E-1 | NA | NA | NA | NA |
| Stroke | 1.30 | 2.4E-2 | 1.01 | 5.7E-1 | 1.39 | 4.0E-2 | 1.02 | 5.6E-2 | 1.83 | <8.0E-3 | 1.03 | 2.4E-1 | 1.31 | 4.8E-2 | 1.00 | 9.2E-1 |
| DM | 1.26 | 1.3E-1 | 1.04 | 4.0E-2 | 1.45 | 1.6E-2 | 1.00 | 8.6E-1 | 1.84 | <8.0E-3 | 1.02 | 5.6E-2 | 1.27 | 1.1E-1 | 1.02 | 8.0E-3 |
| Cancer | 1.30 | 3.2E-2 | 1.01 | 1.7E-1 | 1.41 | <8.0E-3 | 1.00 | 7.2E-1 | 1.83 | <8.0E-3 | 1.03 | 5.2E-1 | 1.31 | 2.4E-2 | 0.99 | 4.0E-2 |
| ND | NA | NA | NA | NA | 1.51 | 8.0E-3 | 1.00 | 8.7E-1 | NA | NA | NA | NA | NA | NA | NA | NA |
| CHD/HF/DM | 1.24 | 1.1E-1 | 1.11 | <8.0E-3 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
a HRS SNP used is proxy SNP rs222827
CHD = coronary heart disease; CHD65+ = coronary heart disease with onset at 65 years and older; HF = heart failure; DM = diabetes mellitus; ND = neurodegenerative diseases (dementias including Alzheimer’s type).
“Direct” (“Indirect”) implies the effect of the rs222826_T allele on risk of death not through (through) a mediator given in column “Mediator.”
HR denotes hazard ratio from the Cox proportional hazards regression model.
CHD/HF/DM denotes the analysis of mediating effect of either of these diseases, i.e., if a person has CHD, or HF, or DM.
b To correct for disproportionality in hazards, the estimate in the FHS_C1 (original) cohort is given for onset of CHD at ages 65 years and older.
NA = not available or not estimated
The results of meta-analyses of the effects of the minor allele of rs222826 SNP on risks of diseases and death.
| Outcome | With improvements | No improvements | ||||
|---|---|---|---|---|---|---|
| Exp(β) | 95% CI | p-value | Exp(β) | 95% CI | p-value | |
| CHD | 1.35 | 1.18–1.54 | 8.9E-06 | 1.32 | 1.16–1.50 | 3.1E-05 |
| HF | 1.55 | 1.24–1.93 | 9.7E-05 | 1.51 | 1.21–1.88 | 2.4E-04 |
| Stroke | 1.25 | 1.01–1.55 | 4.0E-02 | 1.25 | 1.01–1.55 | 4.0E-02 |
| Diabetes | 1.36 | 1.16–1.60 | 1.6E-04 | 1.22 | 1.04–1.43 | 1.5E-02 |
| Cancer | 1.25 | 1.09–1.44 | 1.8E-03 | 1.08 | 0.93–1.25 | 3.0E-01 |
| ND | 2.00 | 1.31–3.05 | 1.3E-03 | 1.89 | 1.24–2.87 | 2.9E-3 |
| Death | 1.49 | 1.30–1.70 | 4.6E-09 | 1.44 | 1.27–1.64 | 3.8E-08 |
| Combined pleiotropic effects p-value | 6.6E-21 | 8.3E-14 | ||||
a HRS SNP used is proxy SNP rs222827
CHD = coronary heart disease; HF = heart failure; ND = neurodegenerative diseases (dementias including Alzheimer’s type).
Columns “No improvements” (“With improvements”) present the results of meta-analyses without (with) leveraging information from the analyses of age-related heterogeneity and causal inferences.
“Combined pleiotropic effects p-value” is p-value from the Fisher’s combined probability test estimating the global null hypothesis that neither of genetic associations with risks of diseases or death is true.
The sources of information and numerical estimates of the effect estimates used in these meta-analyses are given in S2–S5 Tables.
Exp(β) is exponent of the effect size beta.
CI denotes confidence interval
Fig 2A flowchart of the analyses in this study.
All analyses were conducted for whites. Major focus of these analyses was on two uncommon SNPs with minor allele frequency (MAF) ~2.5%. Given this MAF and the available sample size in ARIC, FHS, and HRS, the analyses were conducted for men and women combined to increase the sample of the minor allele carriers.