| Literature DB >> 28824732 |
Xu Gao1, Hauke Thomsen2, Yan Zhang1, Lutz Philipp Breitling1, Hermann Brenner1,3,4.
Abstract
BACKGROUND: Methylation quantitative trait loci (mQTLs) are the genetic variants that may affect the DNA methylation patterns of CpG sites. However, their roles in influencing the disturbances of smoking-related epigenetic changes have not been well established. This study was conducted to address whether mQTLs exist in the vicinity of smoking-related CpG sites (± 50 kb) and to examine their associations with smoking exposure and all-cause mortality in older adults.Entities:
Keywords: Active smoking; DNA methylation; Epigenetic epidemiology; Methylation quantitative trait loci
Mesh:
Year: 2017 PMID: 28824732 PMCID: PMC5561570 DOI: 10.1186/s13148-017-0387-6
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Study population characteristics in discovery and validation panels (mean values (SD) for continuous variables and n (%) for categorical variables)
| Characteristics | Discovery panel | Validation panel |
|
|---|---|---|---|
|
| 581 | 368 | |
| Age (years) | 61.0 (6.3) | 61.1 (6.4) | 0.809 |
| Sex (male) | 241 (41.5%) | 117 (31.8%) | < 0.001 |
| Smoking status | 0.864 | ||
| Current smoker | 108 (18.6%) | 65 (17.7%) | |
| Former smoker | 173 (29.8%) | 119 (32.3%) | |
| Never smoker | 300 (51.6%) | 184 (50.0%) | |
| Pack-years of smokinga | |||
| Current smokers | 34.6 (18.2) | 33.1 (18.2) | 0.250 |
| Former smokers | 22.0 (17.5) | 19.4 (15.5) | 0.033 |
| Smoking cessation time (years)b | 16.5 (11.3) | 17.2 (10.2) | 0.742 |
| Body mass indexc | 0.248 | ||
| Underweight or normal weight (< 25.0) | 143 (24.7%) | 116 (31.5%) | |
| Overweight (25.0–< 30.0) | 290 (50.2%) | 151 (41.0%) | |
| vObese (≥ 30.0) | 145 (25.1%) | 101 (27.5%) | |
| Alcohol consumptiond | 0.509 | ||
| Abstainer | 194 (36.3%) | 128 (38.0%) | |
| Low | 301 (56.4%) | 188 (55.8%) | |
| Intermediate | 30 (5.6%) | 17 (5.0%) | |
| High | 9 (1.7%) | 4 (1.2%) | |
| Physical activitye | 0.058 | ||
| Inactive | 109 (18.8%) | 82 (22.3%) | |
| Low | 245 (42.2%) | 176 (47.8%) | |
| Medium or high | 227 (39.0%) | 110 (29.9%) | |
| Prevalence of CVD at baselinef | 0.621 | ||
| Prevalent | 86 (14.8%) | 58 (15.8%) | |
| Prevalence of diabetes at baselineg | 0.617 | ||
| Prevalent | 86 (14.9%) | 60 (16.6%) | |
| Prevalence of cancer at baseline | |||
| Prevalent | 33 (5.7%) | 22 (6.0%) | 0.744 |
aFor subgroups of former and current smokers; data missing for 38 and 24 participants, respectively, in discovery and validation panels; a pack-year was defined as having smoked 20 cigarettes per day for 1 year
bFormer smokers only, data missing for 5 and 2 participants, respectively, in discovery and validation panels; cessation time equals age at recruitment minus age at cessation
cData missing for 3 participants in discovery panel
dData missing for 47 and 31 participants, respectively, in discovery and validation panels. Categories defined as follows: abstainer, low [women, 0–< 20 g/d; men, 0–< 40 g/d], intermediate [20–< 40 g/d and 40–< 60 g/d, respectively], high [≥ 40 g/d and ≥ 60 g/d, respectively]
eCategories defined as follows: inactive [< 1 h of physical activity/week], medium or high [≥ 2 h of vigorous or ≥ 2 h of light physical activity/week], low (other)
fCVD cardiovascular disease. Data missing for 1 participant in discovery panel
gData missing for 5 and 7 participants, respectively, in discovery and validation panels
Fig. 1Flowchart of selection of SNP-CpG pairs
List of 246 significant SNP-CpG pairs (chromosomal and CpG sites positions were based on GRCh37/hg19)
| Chromosome | Gene | CpG site | Position | Number of SNP candidates | Number of mQTLs |
|---|---|---|---|---|---|
| 1 |
| cg09069072 | 15,482,754 | 13 | 5 |
|
| cg09662411 | 92,946,132 | 10 | 2 | |
| cg09935388 | 92,947,588 | 10 | 2 | ||
| cg10399789 | 92,945,668 | 9 | 6 | ||
| cg12876356 | 92,946,825 | 10 | 2 | ||
| cg18146737 | 92,946,701 | 10 | 2 | ||
| cg18316974 | 92,947,035 | 10 | 2 | ||
|
| cg25189904 | 68,299,493 | 5 | 2 | |
|
| cg11231349 | 162,050,657 | 4 | 2 | |
|
| cg21913886 | 15,485,346 | 14 | 9 | |
|
| cg03547355 | 227,003,061 | 8 | 2 | |
|
| cg12547807 | 9,473,751 | 9 | 1 | |
|
| cg21393163 | 12,217,630 | 8 | 2 | |
|
| cg26764244 | 68,299,511 | 3 | 3 | |
| 2 |
| cg05951221 | 233,284,402 | 5 | 1 |
| cg03329539 | 233,283,329 | 5 | 2 | ||
|
| cg23667432 | 233,244,439 | 5 | 2 | |
|
| cg26271591 | 178,125,956 | 8 | 6 | |
|
| cg26718213 | 241,976,081 | 9 | 4 | |
|
| cg27241845 | 233,250,371 | 5 | 2 | |
| 3 |
| cg18642234 | 49,394,623 | 10 | 5 |
| 5 |
| cg03604011 | 400,201 | 15 | 5 |
| cg03991871 | 368,448 | 9 | 1 | ||
| cg11902777 | 368,843 | 9 | 4 | ||
| cg12806681 | 368,395 | 9 | 2 | ||
| cg14817490 | 392,920 | 15 | 4 | ||
| cg17287155 | 393,347 | 15 | 1 | ||
| cg23576855 | 373,300 | 9 | 7 | ||
| cg23916896 | 368,805 | 9 | 4 | ||
| 6 |
| cg06126421 | 30,720,081 | 16 | 8 |
|
| cg15474579 | 36,645,813 | 22 | 8 | |
|
| cg15342087 | 30,720,210 | 16 | 2 | |
| cg24859433 | 30,720,204 | 16 | 3 | ||
|
| cg00931843 | 155,442,993 | 6 | 1 | |
|
| cg17619755 | 31,760,629 | 16 | 8 | |
|
| cg14753356 | 30,720,109 | 8 | 2 | |
| 7 |
| cg03440944 | 45,023,330 | 5 | 1 |
|
| cg11207515 | 146,904,206 | 14 | 7 | |
| cg25949550 | 145,814,306 | 11 | 8 | ||
|
| cg19717773 | 2,847,554 | 22 | 22 | |
|
| cg08396193 | 27,193,709 | 7 | 1 | |
|
| cg11556164 | 110,738,316 | 5 | 5 | |
|
| cg12803068 | 45,002,919 | 10 | 1 | |
| cg22132788 | 45,002,487 | 10 | 1 | ||
|
| cg09022230 | 5,457,226 | 24 | 3 | |
| 8 |
| cg26361535 | 144,576,604 | 8 | 1 |
|
| cg19589396 | 103,937,374 | 15 | 2 | |
| 9 |
| cg01692968 | 108,005,349 | 2 | 2 |
| 10 |
| cg25953130 | 63,753,550 | 6 | 3 |
| 11 |
| cg07123182 | 2,722,391 | 13 | 2 |
| cg26963277 | 2,722,408 | 13 | 3 | ||
| cg01744331 | 2,722,358 | 13 | 3 | ||
|
| cg16556677 | 2,722,402 | 13 | 2 | |
|
| cg11660018 | 86,510,915 | 9 | 2 | |
| cg23771366 | 86,510,999 | 9 | 2 | ||
|
| cg16611234 | 58,870,075 | 10 | 10 | |
| 14 |
| cg01731783 | 74,211,789 | 6 | 1 |
| cg10919522 | 74,227,441 | 5 | 1 | ||
| 15 |
| cg23161492 | 90,357,203 | 19 | 6 |
|
| cg00310412 | 74,724,919 | 13 | 9 | |
| 16 |
| cg09099830 | 30,485,486 | 3 | 2 |
|
| cg16794579 | 17,562,419 | 3 | 1 | |
| 17 |
| cg07251887 | 73,641,810 | 6 | 2 |
|
| cg07465627 | 53,167,407 | 8 | 4 | |
| 19 |
| cg15159987 | 17,003,890 | 15 | 4 |
|
| cg23973524 | 18,873,223 | 12 | 1 | |
|
| cg03636183 | 17,000,586 | 17 | 1 | |
|
| cg03707168 | 49,379,127 | 10 | 4 | |
| 21 |
| cg23110422 | 40,182,073 | 6 | 3 |
| 22 |
| cg02532700 | 37,257,404 | 8 | 2 |
| Total | 590 | 246 | |||
Fig. 2Manhattan plot of the results in validation panel. Red line, FDR-corrected p value = 0.05
Five frequently reported (≥ 6) CpG sites and corresponding mQTLs
| CpG site | Frequencya | Gene | Chr | SNP | SNP positionb | Minor allele | Distance (bp)c | FDRd | MAFe |
|---|---|---|---|---|---|---|---|---|---|
| cg03636183 | 12 |
| 19 | rs2227357 | 17,003,553 | A | 2967 | 0.048 | 0.125 |
| cg05951221 | 8 |
| 2 | rs790051 | 30,718,035 | A | − 1866 | 6.2 e − 4 | 0.226 |
| cg06126421 | 7 |
| 6 | rs2535324 | 30,727,983 | C | − 2046 | 1.8 e − 9 | 0.3 |
| rs3095339 | 30,728,290 | G | 7902 | 2.6 e − 4 | 0.252 | ||||
| rs3131036 | 30,728,360 | A | 8209 | 2.6 e − 4 | 0.253 | ||||
| rs3094122 | 30,737,552 | G | 8279 | 3.2 e − 3 | 0.206 | ||||
| rs13217914 | 30,739,657 | A | 17,471 | 2.4 e − 21 | 0.157 | ||||
| rs6911571 | 30,753,639 | T | 19,576 | 0.007 | 0.16 | ||||
| rs4713361 | 30,756,066 | A | 33,558 | 1.3 e − 21 | 0.159 | ||||
| rs13201769 | 30,718,035 | A | 35,985 | 6.9 e − 7 | 0.326 | ||||
| cg03329539 | 6 |
| 2 | rs790051 | 233,282,536 | A | − 793 | 0.031 | 0.226 |
| rs34547337 | 233,300,755 | T | 17,426 | 1.5 e − 5 | 0.314 | ||||
| cg14817490 | 6 |
| 5 | rs75509302 | 365,653 | C | − 27,267 | 0.002 | 0.144 |
| rs11746079 | 410,980 | C | 18,060 | 1.5 e − 3 | 0.154 | ||||
| rs72717419 | 431,996 | T | 39,076 | 0.021 | 0.207 | ||||
| rs2672725 | 434,981 | G | 42,061 | 0.042 | 0.117 |
aThe reported times of CpG in previous studies (based on systematic review [25])
bPositions of CpG sites and SNPs were based on GRCh37/hg19
cThe distance between SNP and CpG (SNP position–CpG position)
dThe FDR-corrected p values of SNPs in fully adjusted mixed linear regression models, which controlled for age (years), sex, smoking status, random batch effect of methylation measurement, leukocyte distribution (Houseman algorithm), alcohol consumption (abstainer/low/intermediate/high), body mass index (BMI, underweight or normal weight/overweight/obese), physical activity (inactive/low/medium or high), prevalence of cardiovascular diseases (yes/no), prevalence of diabetes (yes/no), and prevalence of cancer (yes/no)
eMAF minor allele frequency
Fig. 3Locations of cg06126421 and eight mQTLs (carrier/non-carrier) (a) and distributions of methylation levels based on smoking status (b) in validation panel. Red line, FDR-corrected p value = 0.05; red dot, mQTLs; yellow triangle, cg06126421; blue bar, non-carriers of minor allele; red bar, carriers of minor allele
Three most frequently identified mQTLs and corresponding CpG sites
| SNP | Chr | SNP positiona | Minor allele | MAFb | CpG | Distance (bp)c | FDRd |
|---|---|---|---|---|---|---|---|
| rs75509302 | 5 | 365,653 | C | 0.144 | cg23576855 | − 7647 | 3.4 e − 100 |
| cg11902777 | − 3190 | 1.4 e − 7 | |||||
| cg17287155 | − 27,694 | 7.8 e − 5 | |||||
| cg03991871 | − 2795 | 1.0 e − 4 | |||||
| cg12806681 | − 2742 | 1.2 e − 4 | |||||
| cg23916896 | − 3152 | 9.5 e − 4 | |||||
| cg03604011 | − 34,548 | 1.1 e − 3 | |||||
| cg14817490 | − 27,267 | 2.1 e − 3 | |||||
| rs34835481 | 1 | 92,991,624 | T | 0.210 | cg10399789 | 45,956 | 2.2 e − 5 |
| cg12876356 | 44,799 | 1.3 e − 3 | |||||
| cg09662411 | 45,492 | 1.9 e − 3 | |||||
| cg18146737 | 44,923 | 2.0 e − 3 | |||||
| cg18316974 | 44,589 | 3.0 e − 3 | |||||
| cg09935388 | 44,036 | 0.016 | |||||
| rs79050605 | 1 | 92,925,962 | G | 0.202 | cg12876356 | − 20,863 | 3.7 e − 4 |
| cg18146737 | − 20,739 | 1.1 e − 3 | |||||
| cg18316974 | − 21,073 | 1.7 e − 3 | |||||
| cg09662411 | − 20,170 | 1.9 e − 3 | |||||
| cg09935388 | − 21,626 | 2.2 e − 3 |
aSNPs positions were based on GRCh37/hg19
bMAF minor allele frequency
cThe distance between SNP and CpG (SNP position–CpG position)
dThe FDR-corrected p values of SNPs in fully adjusted mixed linear regression models, which controlled for age (years), sex, smoking status, random batch effect of methylation measurement, leukocyte distribution (Houseman algorithm), alcohol consumption (abstainer/low/intermediate/high), body mass index (BMI, underweight or normal weight/overweight/obese), physical activity (inactive/low/medium or high), prevalence of cardiovascular diseases (yes/no), prevalence of diabetes (yes/no), and prevalence of cancer (yes/no)
Fig. 4Percentage changes contributed by mQTLs to smoking-related DNA methylation changes based on SNP-CpG distance (a) and reported frequencies of CpG sites (b)
Impact of rs75509302-smoking interaction on the methylation level of cg23576855a
| Gene | CpG site | SNP | SNP-smoking interactionb | Smoking statusd | |||||
|---|---|---|---|---|---|---|---|---|---|
| Genotypec | Coefficient | SE |
| Coefficient | SE |
| |||
|
| cg23576855 | rs75509302 | TT | Ref | −0.182 | 7.1 e − 3 | 3.8 e − 95 | ||
| CT | 0.128 | 0.013 | 1.1 e − 20 | ||||||
| CC | 0.268 | 0.033 | 2.5 e − 15 | ||||||
aModel is fully adjusted for age, sex, BMI, smoking status (current and never smoking only), alcohol consumption, physical activity, prevalence of CVD, diabetes and cancer at baseline. The methylation levels of CpG sites were responses, the SNPs and SNP-smoking interactions were predictors;
bThe never smoking * genotype groups and current smoking * TT group were used as references;
cThe group of interaction between current smoking and listed genotype;
dNever smoking was used as reference