| Literature DB >> 29092026 |
Ivana Nedeljkovic1, Lies Lahousse1,2,3, Elena Carnero-Montoro1,4, Alen Faiz5, Judith M Vonk5,6, Kim de Jong5,6, Diana A van der Plaat5,6, Cleo C van Diemen7, Maarten van den Berge5,8, Ma'en Obeidat9, Yohan Bossé10, David C Nickle11, B I O S Consortium, Andre G Uitterlinden1,12, Joyce B J van Meurs12, Bruno H C Stricker1, Guy G Brusselle1,3,13, Dirkje S Postma5,8, H Marike Boezen5,6, Cornelia M van Duijn1, Najaf Amin1.
Abstract
Chronic obstructive pulmonary disease (COPD) is among the major health burdens in adults. While cigarette smoking is the leading risk factor, a growing number of genetic variations have been discovered to influence disease susceptibility. Epigenetic modifications may mediate the response of the genome to smoking and regulate gene expression. Chromosome 19q13.2 region is associated with both smoking and COPD, yet its functional role is unclear. Our study aimed to determine whether rs7937 (RAB4B, EGLN2), a top genetic variant in 19q13.2 region identified in genome-wide association studies of COPD, is associated with differential DNA methylation in blood (N = 1490) and gene expression in blood (N = 721) and lungs (N = 1087). We combined genetic and epigenetic data from the Rotterdam Study (RS) to perform the epigenome-wide association analysis of rs7937. Further, we used genetic and transcriptomic data from blood (RS) and from lung tissue (Lung expression quantitative trait loci mapping study), to perform the transcriptome-wide association study of rs7937. Rs7937 was significantly (FDR < 0.05) and consistently associated with differential DNA methylation in blood at 4 CpG sites in cis, independent of smoking. One methylation site (cg11298343-EGLN2) was also associated with COPD (P = 0.001). Additionally, rs7937 was associated with gene expression levels in blood in cis (EGLN2), 42% mediated through cg11298343, and in lung tissue, in cis and trans (NUMBL, EGLN2, DNMT3A, LOC101929709 and PAK2). Our results suggest that changes of DNA methylation and gene expression may be intermediate steps between genetic variants and COPD, but further causal studies in lung tissue should confirm this hypothesis.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29092026 PMCID: PMC5886099 DOI: 10.1093/hmg/ddx390
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Characteristics of the discovery and replication cohorts and per COPD status
| Discovery cohort | Replication cohort | |||||
|---|---|---|---|---|---|---|
| COPD | Controls | All | COPD | Controls | All | |
| N (% of all) | 114 (15.7) | 541 (74.7) | 724 | 93 (12.1) | 591 (77.2) | 766 |
| Age (years) | 61.9±8.6 | 59.3±7.9 | 59.9±8.2 | 68.2±5.7 | 67.6±5.9 | 67.7±5.9 |
| Males (%) | 68 (59.6) | 233 (43.1) | 331 (45.7) | 54 (58.1) | 249 (42.1) | 324 (42.3) |
| FEV1/FVC(% of all) | 0.63±0.07(71.9) | 0.78±0.04(66.0) | 0.76±0.08(67.0) | 0.63±0.07(95.7) | 0.79±0.05(91.0) | 0.76±0.08(91.8) |
| Current smokers, | 43 (37.7) | 107 (19.8) | 168 (23.2) | 20 (21.5) | 52 (8.8) | 80 (10.4) |
| Ex-smokers, | 55 (48.2) | 239 (44.2) | 322 (44.5) | 51 (54.8) | 326 (55.2) | 427 (55.7) |
| Never smokers, | 16 (14.0) | 195 (36.0) | 234 (32.3) | 22 (23.7) | 213 (36.0) | 259 (33.8) |
| Pack-years | 34.3±26.9 | 19.9±19.6 | 23.2±22.0 | 33.7±18.7 | 19.6±20.1 | 21.9±20.6 |
Data for quantitative measures presented as mean ± SD. COPD: Chronic Obstructive Pulmonary Disease cases; All: all participants included in EWAS. For traits that were not available for all participants (COPD status and FEV1/FVC), the valid percentage is denoted in brackets (% of all).
Significantly different between the discovery and replication cohort.
Pack-years data were available for all participants (mean and SD calculated in current and ex-smokers only).
Figure 1.Analysis pipeline and datasets overview.
Association of rs7937-T with epigenome-wide DNA methylation in discovery (N = 724) and replication (N = 766) cohorts
| Discovery | Replication | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CpG | Chr | Position | Gene | Model | β | SE | FDR | β | SE | FDR | ||
| cg21653913 | 19 | 41307778 | EGLN2 | 1 | −0.08925 | 0.00496 | 9.17 × 10−59 | 4.35 × 10−53 | −0.10030 | 0.00491 | 1.13 × 10−72 | 4.74 × 10−67 |
| 2 | −0.08956 | 0.00497 | 5.70 × 10−59 | 2.71 × 10−53 | −0.10059 | 0.00492 | 1.17 × 10−72 | 4.92 × 10−67 | ||||
| cg11298343 | 19 | 41306150 | EGLN2 | 1 | −0.02036 | 0.00201 | 1.94 × 10−22 | 3.53 × 10−17 | −0.01410 | 0.00141 | 5.42 × 10−22 | 6.36 × 10−17 |
| 2 | −0.02070 | 0.00199 | 1.37 × 10−23 | 3.25 × 10−18 | −0.01402 | 0.00140 | 5.30 × 10−22 | 7.42 × 10−17 | ||||
| cg10585486 | 19 | 41304133 | EGLN2 | 1 | −0.00901 | 0.00089 | 2.23 × 10−22 | 3.53 × 10−17 | −0.01349 | 0.00130 | 1.47 × 10−23 | 3.09 × 10−18 |
| 2 | −0.00902 | 0.00089 | 2.19 × 10−22 | 3.47 × 10−17 | −0.01347 | 0.00130 | 2.25 × 10−23 | 4.72 × 10−18 | ||||
| cg24958765 | 19 | 41283667 | RAB4B | 1 | 0.00235 | 0.00031 | 1.17 × 10−13 | 9.27 × 10−9 | −0.00427 | 0.00075 | 2.02 × 10−8 | 1.06 × 10−3 |
| 2 | 0.00237 | 0.00031 | 1.02 × 10−13 | 8.06 × 10−9 | −0.00433 | 0.00075 | 1.26 × 10−8 | 6.62 × 10−4 | ||||
| cg13791183 | 19 | 41316697 | CYP2A6- AK097370 | 1 | 0.01244 | 0.00209 | 4.23 × 10−9 | 2.51 × 10−4 | NA | |||
| 2 | 0.01255 | 0.00209 | 3.28 × 10−9 | 1.94 × 10−4 | ||||||||
| cg25923056 | 19 | 41306455 | EGLN2 | 1 | −0.00952 | 0.00177 | 1.00 × 10−7 | 5.30 × 10−3 | −0.01153 | 0.00141 | 1.50 × 10−15 | 1.05 × 10−10 |
| 2 | −0.00985 | 0.00175 | 2.54 × 10−8 | 1.34 × 10−3 | −0.01140 | 0.00141 | 2.73 × 10−15 | 1.91 × 10−10 | ||||
β: Regression coefficient estimates from linear regression model regressing DNA methylation levels on indicated SNPs. In Model 1 coefficients are corrected for sex, age, technical covariates and different white blood cellular proportions. In Model 2 coefficients are additionally adjusted for current smoking and pack-years smoked. SE: standard error of the effect, P: P-value of the significance, FDR: False discovery rate value. NA: Not available in the replication dataset.
Figure 2.Association of the rs7937 with DNA methylation across the genome. In circles are represented all CpGs throughout the genome. X-axis shows chromosome locations; Y-axis shows negative logarithm of the P-value of the associations of the SNP with each CpG site. Dotted line represents the significance threshold (FDR < 0.05). (A) Discovery analysis; (B) Replication analysis.
Figure 3.Region plot of chromosome 19q13.2 with significant SNP-CpG associations. The circles represent SNP-CpG associations; X-axis shows all genes in the region; Y-axis shows negative logarithm of the P-values of the associations of CpGs with the SNP. Crossed circles represent the non-replicated associations.
Association of DNA methylation at significant CpG sites in 19q13.2, with COPD and FEV1/FVC ratio – meta-analysis results
| Trait | CpG | Model | β | SE | ||
|---|---|---|---|---|---|---|
| COPD | cg21653913 | 1339 | 1 | −0.084 | 0.711 | 0.906 |
| 2 | 0.175 | 0.738 | 0.812 | |||
| 1339 | 1 | −7.080 | 2.163 | |||
| 2 | −4.924 | 2.254 | ||||
| cg10585486 | 1339 | 1 | −2.215 | 3.657 | 0.545 | |
| 2 | −3.110 | 3.811 | 0.415 | |||
| cg25923056 | 1339 | 1 | −0.368 | 2.153 | 0.864 | |
| 2 | 1.551 | 2.255 | 0.492 | |||
| FEV1/FVC | cg21653913 | 1188 | 1 | −0.008 | 0.020 | 0.676 |
| 2 | −0.017 | 0.020 | 0.382 | |||
| 1188 | 1 | 0.138 | 0.065 | |||
| 2 | 0.047 | 0.064 | 0.464 | |||
| cg10585486 | 1188 | 1 | 0.096 | 0.093 | 0.304 | |
| 2 | 0.094 | 0.090 | 0.299 | |||
| cg25923056 | 1188 | 1 | −0.021 | 0.064 | 0.743 | |
| 2 | −0.098 | 0.062 | 0.114 |
N: number of participants in the meta-analysis; Model 1 is adjusted for age, sex, technical covariates and different white blood cellular proportions, Model 2 is additionally adjusted for smoking; β: Regression coefficient estimates from logistic/linear regression models; SE: standard error of the effect; P: P-value of the significance. In bold: nominally significant associations.
Association of rs7937 with transcriptome-wide gene expression in blood (N = 721)
| SNP | A1 | A2 | Probe | Chr | Position (GRCh37/hg19) | Gene | Model | β | SE | FDR | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| rs7937 | T | C | NM_080732.1 | 19 | 46006086-46006135 | EGLN2 | 1 | 0.0635 | 0.0109 | 9.29 × 10−9 | 0.000197 |
| 2 | 0.0577 | 0.0120 | 1.88 × 10−6 | 0.039942 |
A1: effect allele (tested allele), A2: alternative allele, Chr: chromosome of the probe, Model 1 is adjusted for age, sex, current smoking, technical covariates and different white blood cellular proportions, Model 2 is additionally adjusted for cg11298343; β: Regression coefficient estimates from linear regression models regressing gene expression on indicated SNP, SE: standard error, P: P-value of the significance, FDR: False discovery rate value.
Mediation of the rs7937-EGLN2 expression association through cg11298343 (N = 721)
| Estimate | 95%CI Lower | 95%CI Upper | ||
|---|---|---|---|---|
| ACME | 0.02 | 0.005 | 0.032 | |
| ADE | 0.02 | −0.014 | 0.068 | 0.22 |
| Total Effect | 0.04 | 0.005 | 0.080 | |
| Proportion Mediated | 0.42 | 0.058 | 1.995 |
ACME: average causal mediation effect by DNA methylation at cg11298343; ADE: average direct effect of rs7937 on EGLN2 expression; Total: total effect rs7937 on EGLN2 expression; Proportion Mediated: proportion of the association between rs7937 and EGLN2 expression, explained by methylation at cg11298343; Regression models adjusted for sex, age, current smoking, pack-years, technical variance and estimated blood cell composition; In bold: significant results with P-value < 0.05.
Association of rs7937 with transcriptome-wide gene expression in lung tissue (N = 1087)
| SNP | A1 | A2 | Probe | Chr | Position (GRCh37/hg19) | Annotation | β | SE | FDR | |
|---|---|---|---|---|---|---|---|---|---|---|
| rs7937 | T | C | NM_004756 | 19 | 40665906-40690658 | −0.099 | 0.013 | 4.39 × 10−15 | 1.17 × 10−11 | |
| BC037804 | 19 | 40808443-40810818 | −0.077 | 0.012 | 3.77 × 10−10 | 1.01 × 10−6 | ||||
| BX330016 | 8 | 89720919-89724906 | 0.049 | 0.011 | 6.15 × 10−6 | 0.016 | ||||
| AK025230 | 2 | 25233434-25246179 | −0.028 | 0.006 | 8.00 × 10−6 | 0.021 | ||||
| BQ445924 | 3 | 196829093-196830253 | −0.028 | 0.006 | 1.36 × 10−5 | 0.036 |
A1: effect allele (tested allele), A2: alternative allele, β: Regression coefficient estimates from linear regression models regressing gene expression on indicated SNP, SE: standard error, P: P-value of the significance, FDR: False discovery rate value.