Literature DB >> 29092026

COPD GWAS variant at 19q13.2 in relation with DNA methylation and gene expression.

Ivana Nedeljkovic1, Lies Lahousse1,2,3, Elena Carnero-Montoro1,4, Alen Faiz5, Judith M Vonk5,6, Kim de Jong5,6, Diana A van der Plaat5,6, Cleo C van Diemen7, Maarten van den Berge5,8, Ma'en Obeidat9, Yohan Bossé10, David C Nickle11, B I O S Consortium, Andre G Uitterlinden1,12, Joyce B J van Meurs12, Bruno H C Stricker1, Guy G Brusselle1,3,13, Dirkje S Postma5,8, H Marike Boezen5,6, Cornelia M van Duijn1, Najaf Amin1.   

Abstract

Chronic obstructive pulmonary disease (COPD) is among the major health burdens in adults. While cigarette smoking is the leading risk factor, a growing number of genetic variations have been discovered to influence disease susceptibility. Epigenetic modifications may mediate the response of the genome to smoking and regulate gene expression. Chromosome 19q13.2 region is associated with both smoking and COPD, yet its functional role is unclear. Our study aimed to determine whether rs7937 (RAB4B, EGLN2), a top genetic variant in 19q13.2 region identified in genome-wide association studies of COPD, is associated with differential DNA methylation in blood (N = 1490) and gene expression in blood (N = 721) and lungs (N = 1087). We combined genetic and epigenetic data from the Rotterdam Study (RS) to perform the epigenome-wide association analysis of rs7937. Further, we used genetic and transcriptomic data from blood (RS) and from lung tissue (Lung expression quantitative trait loci mapping study), to perform the transcriptome-wide association study of rs7937. Rs7937 was significantly (FDR < 0.05) and consistently associated with differential DNA methylation in blood at 4 CpG sites in cis, independent of smoking. One methylation site (cg11298343-EGLN2) was also associated with COPD (P = 0.001). Additionally, rs7937 was associated with gene expression levels in blood in cis (EGLN2), 42% mediated through cg11298343, and in lung tissue, in cis and trans (NUMBL, EGLN2, DNMT3A, LOC101929709 and PAK2). Our results suggest that changes of DNA methylation and gene expression may be intermediate steps between genetic variants and COPD, but further causal studies in lung tissue should confirm this hypothesis.
© The Author 2017. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29092026      PMCID: PMC5886099          DOI: 10.1093/hmg/ddx390

Source DB:  PubMed          Journal:  Hum Mol Genet        ISSN: 0964-6906            Impact factor:   6.150


Introduction

Chronic obstructive pulmonary disease (COPD) is a common, systemic, lung disease, mainly characterized by airway obstruction and inflammation (1). COPD often develops as a response to chronic exposure to cigarette smoke, fumes and gases (2,3). There is significant inter-individual variability in the response to these environmental exposures (4,5) that has been attributed to genetic factors (6,7). Genome-wide association studies (GWAS) have identified genetic variants associated with COPD susceptibility on chromosomes 4q31, 4q22, 15q25 and 19q13 (8–11). However, the mechanism explaining how these variants are involved in the pathogenesis of COPD remains elusive (12). As is the case for many complex diseases, many single nucleotide polymorphisms (SNPs) associated with COPD and lung function by GWAS are located in non-protein coding intergenic and intronic regulatory regions (13,14). It has been hypothesized that these SNPs may modulate regulatory mechanisms, such as RNA expression, splicing, transcription factor binding and epigenetic modifications (e.g. DNA methylation). Changes in RNA expression as well as in DNA methylation regulating expression have recently been associated with COPD, suggesting that genetic and epigenetic factors are working in concert in the pathogenesis of COPD (15,16). Emerging evidence suggests that differential methylation sites (CpGs) are potentially important for COPD susceptibility (15–17), but their location was not linked to the GWAS loci. However, important associations in COPD genomic regions may have been missed as arrays with limited coverage (27K) were used in the studies conducted to date. The 19q13.2 region is associated with COPD and cigarette smoking (18,19), lung function (20) and emphysema patterns (21). Genes in this region include RAB4B (member RAS oncogene family), EGLN2 (Egl-nine homolog 2), MIA (melanoma inhibitory activity) and CYP2A6 (cytochrome P450 family 2 subfamily A member 6). The top variant in the region, rs7937:C > T, has been identified by Cho et al. (9). This SNP (RAB4B, EGLN2) was associated with COPD (OR = 1.37, P = 2.9 × 10−9), but not with smoking. Nevertheless, in a study of 10 healthy non-smokers and 7 healthy smokers, EGLN2 was found to be expressed at a higher level in airway epithelium of smokers compared with non-smokers (22). In this small, underpowered study of airway epithelial DNA, there was no significant evidence for differential DNA methylation of EGLN2 between smokers and non-smokers. In this study, we set out to determine whether rs7937 is involved in regulatory mechanisms like DNA methylation and gene expression and whether these mechanisms are also associated with COPD. We further evaluated the role of smoking in these regulatory mechanisms. For that purpose, we performed an epigenome-wide association study (EWAS) of rs7937 in blood using an array with high coverage (450K) and a transcriptome-wide association study of rs7937 in blood and lung tissues.

Results

Our discovery cohort comprised 724 participants with genotype and DNA methylation data, while the replication cohort comprised 766 participants from the Rotterdam Study (RS) (23). The summary statistics of the discovery and replication cohorts are shown in Table 1. As expected, the prevalence of males, smokers and the average pack-years of smoking were higher in cases, compared with controls. Compared with the replication cohort, participants in the discovery cohort were on average 8 years younger and included significantly more COPD cases and current smokers, although the pack-years of smoking were comparable. The overview of the analysis pipeline and sample sizes used is presented in Figure 1.
Table 1.

Characteristics of the discovery and replication cohorts and per COPD status

Discovery cohort
Replication cohort
COPDControlsAllCOPDControlsAll
N (% of all)114 (15.7)a541 (74.7)72493 (12.1)a591 (77.2)766
Age (years)a61.9±8.659.3±7.959.9±8.268.2±5.767.6±5.967.7±5.9
Males (%)68 (59.6)233 (43.1)331 (45.7)54 (58.1)249 (42.1)324 (42.3)
FEV1/FVC(% of all)0.63±0.07(71.9)0.78±0.04(66.0)0.76±0.08(67.0)0.63±0.07(95.7)0.79±0.05(91.0)0.76±0.08(91.8)
Current smokers, n (%)a43 (37.7)107 (19.8)168 (23.2)20 (21.5)52 (8.8)80 (10.4)
Ex-smokers, n (%)55 (48.2)239 (44.2)322 (44.5)51 (54.8)326 (55.2)427 (55.7)
Never smokers, n (%)16 (14.0)195 (36.0)234 (32.3)22 (23.7)213 (36.0)259 (33.8)
Pack-yearsb34.3±26.919.9±19.623.2±22.033.7±18.719.6±20.121.9±20.6

Data for quantitative measures presented as mean ± SD. COPD: Chronic Obstructive Pulmonary Disease cases; All: all participants included in EWAS. For traits that were not available for all participants (COPD status and FEV1/FVC), the valid percentage is denoted in brackets (% of all).

Significantly different between the discovery and replication cohort.

Pack-years data were available for all participants (mean and SD calculated in current and ex-smokers only).

Figure 1.

Analysis pipeline and datasets overview.

Characteristics of the discovery and replication cohorts and per COPD status Data for quantitative measures presented as mean ± SD. COPD: Chronic Obstructive Pulmonary Disease cases; All: all participants included in EWAS. For traits that were not available for all participants (COPD status and FEV1/FVC), the valid percentage is denoted in brackets (% of all). Significantly different between the discovery and replication cohort. Pack-years data were available for all participants (mean and SD calculated in current and ex-smokers only). Analysis pipeline and datasets overview.

Methylation quantitative trait locus analysis

In the genome-wide blood methylation quantitative trait locus (meQTL) analysis of rs7937 in the discovery cohort, rs7937 was significantly [False Discovery Rate (FDR) <0.05] associated with differential DNA methylation at 6 CpG sites in the genes ITPKC and EGLN2, located within the same 19q13.2 region (Model 1, Table 2, Fig. 2A). Five of the six methylation sites were available in the replication dataset and four were significantly replicated with the same direction as found in the discovery cohort (Table 2, Figs 2B and 3). Adding smoking as a confounder (Model 2, Table 2) and testing interaction with smoking (Model 3, data not shown) did not change the results, suggesting that the association between rs7937 and DNA methylation at these sites is independent of smoking. In an additional Model 4, we show that adding COPD to the model did not change the effect of rs7937 on DNA methylation (Supplementary Material, Table S1).
Table 2.

Association of rs7937-T with epigenome-wide DNA methylation in discovery (N = 724) and replication (N = 766) cohorts

Discovery
Replication
CpGChrPositionGeneModelβSEPFDRβSEPFDR
cg216539131941307778EGLN21−0.089250.004969.17 × 10−594.35 × 10−53−0.100300.004911.13 × 10−724.74 × 10−67
2−0.089560.004975.70 × 10−592.71 × 10−53−0.100590.004921.17 × 10−724.92 × 10−67
cg112983431941306150EGLN21−0.020360.002011.94 × 10−223.53 × 10−17−0.014100.001415.42 × 10−226.36 × 10−17
2−0.020700.001991.37 × 10−233.25 × 10−18−0.014020.001405.30 × 10−227.42 × 10−17
cg105854861941304133EGLN21−0.009010.000892.23 × 10−223.53 × 10−17−0.013490.001301.47 × 10−233.09 × 10−18
2−0.009020.000892.19 × 10−223.47 × 10−17−0.013470.001302.25 × 10−234.72 × 10−18
cg249587651941283667RAB4B10.002350.000311.17 × 10−139.27 × 10−9−0.004270.000752.02 × 10−81.06 × 10−3
20.002370.000311.02 × 10−138.06 × 10−9−0.004330.000751.26 × 10−86.62 × 10−4
cg137911831941316697CYP2A6- AK09737010.012440.002094.23 × 10−92.51 × 10−4NA
20.012550.002093.28 × 10−91.94 × 10−4
cg259230561941306455EGLN21−0.009520.001771.00 × 10−75.30 × 10−3−0.011530.001411.50 × 10−151.05 × 10−10
2−0.009850.001752.54 × 10−81.34 × 10−3−0.011400.001412.73 × 10−151.91 × 10−10

β: Regression coefficient estimates from linear regression model regressing DNA methylation levels on indicated SNPs. In Model 1 coefficients are corrected for sex, age, technical covariates and different white blood cellular proportions. In Model 2 coefficients are additionally adjusted for current smoking and pack-years smoked. SE: standard error of the effect, P: P-value of the significance, FDR: False discovery rate value. NA: Not available in the replication dataset.

Figure 2.

Association of the rs7937 with DNA methylation across the genome. In circles are represented all CpGs throughout the genome. X-axis shows chromosome locations; Y-axis shows negative logarithm of the P-value of the associations of the SNP with each CpG site. Dotted line represents the significance threshold (FDR < 0.05). (A) Discovery analysis; (B) Replication analysis.

Figure 3.

Region plot of chromosome 19q13.2 with significant SNP-CpG associations. The circles represent SNP-CpG associations; X-axis shows all genes in the region; Y-axis shows negative logarithm of the P-values of the associations of CpGs with the SNP. Crossed circles represent the non-replicated associations.

Association of rs7937-T with epigenome-wide DNA methylation in discovery (N = 724) and replication (N = 766) cohorts β: Regression coefficient estimates from linear regression model regressing DNA methylation levels on indicated SNPs. In Model 1 coefficients are corrected for sex, age, technical covariates and different white blood cellular proportions. In Model 2 coefficients are additionally adjusted for current smoking and pack-years smoked. SE: standard error of the effect, P: P-value of the significance, FDR: False discovery rate value. NA: Not available in the replication dataset. Association of the rs7937 with DNA methylation across the genome. In circles are represented all CpGs throughout the genome. X-axis shows chromosome locations; Y-axis shows negative logarithm of the P-value of the associations of the SNP with each CpG site. Dotted line represents the significance threshold (FDR < 0.05). (A) Discovery analysis; (B) Replication analysis. Region plot of chromosome 19q13.2 with significant SNP-CpG associations. The circles represent SNP-CpG associations; X-axis shows all genes in the region; Y-axis shows negative logarithm of the P-values of the associations of CpGs with the SNP. Crossed circles represent the non-replicated associations.

COPD and FEV1/FVC analyses

When testing for association of DNA methylation at the four replicated differentially methylated CpG sites with COPD, we observed a significant association with cg11298343 (EGLN2) in Model 1 [β (SE)=−7.080 (2.16), P = 0.001] (Table 3, Supplementary Material, Table S2), which remained nominally significant with diminished but still strong and concordant negative effect, after adjusting for smoking [Model 2; β (SE) = −4.924 (2.25), P = 0.029]. Further, we show that additionally adjusting for rs7937 slightly deteriorated the effect of cg11298343 on COPD (Model 4; Supplementary Material, Table S1).
Table 3.

Association of DNA methylation at significant CpG sites in 19q13.2, with COPD and FEV1/FVC ratio – meta-analysis results

TraitCpGNModelβSEP
COPDcg2165391313391−0.0840.7110.906
20.1750.7380.812
cg1129834313391−7.0802.1630.001
2−4.9242.2540.029
cg1058548613391−2.2153.6570.545
2−3.1103.8110.415
cg2592305613391−0.3682.1530.864
21.5512.2550.492
FEV1/FVCcg2165391311881−0.0080.0200.676
2−0.0170.0200.382
cg11298343118810.1380.0650.035
20.0470.0640.464
cg10585486118810.0960.0930.304
20.0940.0900.299
cg2592305611881−0.0210.0640.743
2−0.0980.0620.114

N: number of participants in the meta-analysis; Model 1 is adjusted for age, sex, technical covariates and different white blood cellular proportions, Model 2 is additionally adjusted for smoking; β: Regression coefficient estimates from logistic/linear regression models; SE: standard error of the effect; P: P-value of the significance. In bold: nominally significant associations.

Association of DNA methylation at significant CpG sites in 19q13.2, with COPD and FEV1/FVC ratio – meta-analysis results N: number of participants in the meta-analysis; Model 1 is adjusted for age, sex, technical covariates and different white blood cellular proportions, Model 2 is additionally adjusted for smoking; β: Regression coefficient estimates from logistic/linear regression models; SE: standard error of the effect; P: P-value of the significance. In bold: nominally significant associations. In the association with the quantitative determinant of COPD (Table 3, Supplementary Material, Table S2), the ratio of forced expiratory volume in 1 s (FEV1) over the forced vital capacity (FVC), we observed nominal significance for the same site [Model 1; β (SE)=0.138 (0.07), P = 0.04], which deteriorated with adjusting for smoking [Model 2; β (SE)=0.047 (0.06), P = 0.46].

Blood and lung expression quantitative trait loci analysis

In the genome-wide blood expression quantitative trait loci (eQTL) analysis in RS, rs7937 was significantly associated with differential expression of the ILMN_2354391 probe in the EGLN2 gene [β (SE)=0.064 (0.01), P = 9.3 × 10−9], (Table 4). The risk allele (T) was associated with increased expression of EGLN2 (Supplementary Material, Fig. S1). The association signal dropped albeit remained significant [β (SE)=0.058 (0.01), P = 1.9 × 10−6] after adjusting for the top CpG site cg11298343 in the same gene (Table 4). This suggests that differential DNA methylation at cg11298343 is partly responsible for the differential expression of ILMN_2354391 in EGLN2 in blood. In further investigation, we performed the formal mediation analysis where we show (Table 5) that 42% of the association between rs7937 and EGLN2 expression is indeed mediated through cg11298343 (P = 0.04).
Table 4.

Association of rs7937 with transcriptome-wide gene expression in blood (N = 721)

SNPA1A2ProbeChrPosition (GRCh37/hg19)GeneModelβSEPFDR
rs7937TCNM_080732.11946006086-46006135EGLN210.06350.01099.29 × 10−90.000197
20.05770.01201.88 × 10−60.039942

A1: effect allele (tested allele), A2: alternative allele, Chr: chromosome of the probe, Model 1 is adjusted for age, sex, current smoking, technical covariates and different white blood cellular proportions, Model 2 is additionally adjusted for cg11298343; β: Regression coefficient estimates from linear regression models regressing gene expression on indicated SNP, SE: standard error, P: P-value of the significance, FDR: False discovery rate value.

Table 5.

Mediation of the rs7937-EGLN2 expression association through cg11298343 (N = 721)

Estimate95%CI Lower95%CI UpperP-value
ACME0.020.0050.0320.01
ADE0.02−0.0140.0680.22
Total Effect0.040.0050.0800.03
Proportion Mediated0.420.0581.9950.04

ACME: average causal mediation effect by DNA methylation at cg11298343; ADE: average direct effect of rs7937 on EGLN2 expression; Total: total effect rs7937 on EGLN2 expression; Proportion Mediated: proportion of the association between rs7937 and EGLN2 expression, explained by methylation at cg11298343; Regression models adjusted for sex, age, current smoking, pack-years, technical variance and estimated blood cell composition; In bold: significant results with P-value < 0.05.

Association of rs7937 with transcriptome-wide gene expression in blood (N = 721) A1: effect allele (tested allele), A2: alternative allele, Chr: chromosome of the probe, Model 1 is adjusted for age, sex, current smoking, technical covariates and different white blood cellular proportions, Model 2 is additionally adjusted for cg11298343; β: Regression coefficient estimates from linear regression models regressing gene expression on indicated SNP, SE: standard error, P: P-value of the significance, FDR: False discovery rate value. Mediation of the rs7937-EGLN2 expression association through cg11298343 (N = 721) ACME: average causal mediation effect by DNA methylation at cg11298343; ADE: average direct effect of rs7937 on EGLN2 expression; Total: total effect rs7937 on EGLN2 expression; Proportion Mediated: proportion of the association between rs7937 and EGLN2 expression, explained by methylation at cg11298343; Regression models adjusted for sex, age, current smoking, pack-years, technical variance and estimated blood cell composition; In bold: significant results with P-value < 0.05. Moreover, genome-wide eQTL analysis of rs7937 in lung tissue of 1087 participants from the Lung expression quantitative loci mapping study (LES), showed significant associations (P < 1.36 × 10−6) with 5 probes in the same region [in cis; AK097370(EGLN2), NUMBL] and other chromosomes [in trans; LOC101929709, DNMT3A and PAK2] (Table 6). In all cases, except one (LOC101929709 at chromosome 8), the T allele of rs7937 was consistently associated with decreased expression of the genes in lung tissue (Table 6, Supplementary Material, Fig. S2).
Table 6.

Association of rs7937 with transcriptome-wide gene expression in lung tissue (N = 1087)

SNPA1A2ProbeChrPosition (GRCh37/hg19)AnnotationβSEPFDR
rs7937TCNM_0047561940665906-40690658NUMBL−0.0990.0134.39 × 10−151.17 × 10−11
BC0378041940808443-40810818AK097370 (EGLN2)−0.0770.0123.77 × 10−101.01 × 10−6
BX330016889720919-89724906LOC1019297090.0490.0116.15 × 10−60.016
AK025230225233434-25246179DNMT3A−0.0280.0068.00 × 10−60.021
BQ4459243196829093-196830253PAK2−0.0280.0061.36 × 10−50.036

A1: effect allele (tested allele), A2: alternative allele, β: Regression coefficient estimates from linear regression models regressing gene expression on indicated SNP, SE: standard error, P: P-value of the significance, FDR: False discovery rate value.

Association of rs7937 with transcriptome-wide gene expression in lung tissue (N = 1087) A1: effect allele (tested allele), A2: alternative allele, β: Regression coefficient estimates from linear regression models regressing gene expression on indicated SNP, SE: standard error, P: P-value of the significance, FDR: False discovery rate value.

Discussion

Our study shows that the rs7937 in 19q13.2 is associated with differential blood DNA methylation of 4 CpG sites located in the EGLN2 in the discovery and replication cohort. The COPD risk allele (T) is associated with lower DNA methylation at these sites. These relationships are independent of smoking and of COPD. We further show that DNA methylation in blood at cg11298343 (EGLN2) is associated with COPD, and remains nominally significant after adjusting for smoking. Finally, rs7937 is associated with differential expression in blood of EGLN2, 42% explained by EGLN2 DNA methylation at site cg11298343, and in lung tissue of NUMBL, AK097370 (EGLN2), LOC101929709, DNMT3A and PAK2. EGLN2 is coding prolyl hydroxylase domain-containing protein 1 (PHD1) which regulates posttranscriptional modifications of hypoxia induced factor (HIF), a transcriptional complex involved in oxygen homeostasis. At normal oxygen levels, the alpha subunit of HIF is targeted for degradation by PHD1, which is an essential component of the pathway through which cells sense oxygen (24), is also known to be involved in activation of inflammatory and immune genes, including those implicated in COPD (25). Furthermore, read-through transcription exists between this gene and the upstream RAB4B, and together they were shown to be involved in invasive lung cancer (26). Our study of DNA methylation in blood replicates the findings of a study on meQTLs in blood across the human life course: during pregnancy, at birth, childhood, adolescence and middle age (27) which reported three (cg10585486, cg11298343, cg25923056) out of our four replicated CpGs. We additionally report a novel finding in association with rs7937, our top hit, cg21653913. Interestingly, they report differential DNA methylation at cg11298343 to be associated with rs7937 at all five time points. In the present study, we now show that rs7937 is also associated with cg11298343 in our elderly sample, the age category at highest risk for developing COPD. This finding goes in line with our hypothesis that the life-long change in the DNA methylation is involved in the pathogenesis and onset of COPD in older age, rather than the other way around. However, further longitudinal studies are needed, testing this hypothesis in lung tissue. In line with our findings, rs7937 has previously been associated with differential expression of EGLN2 in blood (28). Having both DNA methylation and transcription data available, we could further test whether the relation of rs7937 and EGLN2 expression could be explained by DNA methylation levels of EGLN2 at cg11298343 site. We show for the first time that in blood there is indeed a mediation of 42% through the DNA methylation at cg11298343, confirming our hypothesis. We report two novel findings in this region in the lung tissue. We found that rs7937 is involved in expression of AK097370 in lung tissue, a DNA clone in the proximity of EGLN2, as well as with NUMBL. NUMBL is known as a negative regulator of NF-kappa-B signaling pathway in neurons (29) and was also found to be expressed in the lungs in the GTEx database (30). The association between rs7937 and gene expression in the same dataset has been tested earlier by Lamontagne et al. (31) but no significant results were reported. In the present study, using a more powerful meta-analysis approach, rs7937 was associated with two loci in the region (NUMBL and AK097370, close to EGLN2) and three more loci in other chromosomes (LOC101929709, DNMT3A and PAK2). Taken together, our findings raise the hypothesis that the genetic effect of rs7937 on COPD might be mediated by DNA methylation at cg11298343 and subsequent alteration of expression of EGLN2 and other genes in this region, such as NUMBL. We show this in blood but further formal mediation analyses in lung tissue are needed to confirm this hypothesis, requiring the assembly of a large dataset of lung tissue characterized for genetic, epigenetic and transcriptomic data. Furthermore, we have found significant associations in lungs of rs7937 in trans, i.e. with the expression of genes on other chromosomes. These effects include differential expression of PAK2 (chromosome 3), DNMT3A (chromosome 2) and long non-coding RNA on chromosome 8. The protein encoded by PAK2 gene is activated by proteolytic cleavage during caspase-mediated apoptosis, and may play a role in regulating the apoptotic events in the dying cell (32). DNMT3A is the gene encoding the DNA methyltransferase which plays a key role in de novo methylation. This may imply that rs7937 is involved in the pathogenesis of COPD through differential DNA methylation and regulation of expression throughout the genome, again asking for further research of DNA methylation. The strength of our analysis is the use of large and unique samples of patients whose genetic, epigenetic and transcriptomic characteristics were assessed in detail. However, a limitation of our study is the use of blood tissue for the assessment of DNA methylation and gene expression. Nevertheless, our findings regarding the role of the genetic variants in blood corroborate with the changes in the transcriptome in lung tissue. It has been shown that blood can be used to evaluate methylation changes related to COPD and smoking, as the disease induces systemic changes associated with elevated markers of systemic inflammation in blood (25). A second limitation of our study is that we cannot distinguish the expression in lung parenchymal tissue, which comprises multiple cell types. It may be speculated that expression of only distinct cells is affected by rs7939. If this is the case, the most likely effect is that the power of our study is reduced, but has not biased our findings in the sense of generating false positives. However, although eQTLs are frequently cell- and tissue-specific (33), many eQTLs are also shared across tissues (34). Finally, the number of patients with COPD and spirometry measures were limited, which may have compromised the power of the study. Nevertheless, we observed significant findings that are relevant for COPD. In conclusion, our findings suggest that genetic variations underlying EGLN2 methylation contribute to the risk of developing COPD. This finding adds insight into how genetic variants are involved in the pathogenesis of COPD, through differential DNA methylation and regulation of expression, irrespective of smoking. Future integrative studies involving genetics, epigenetics and transcriptomics in lung tissue are crucial to elucidate the molecular mechanisms behind COPD genetic susceptibility and to translate the findings to clinical care and prevention. This may lead to an increased specificity and sensitivity of diagnostic and prognostic tools. In addition, novel DNA methylation loci may be used as a target for future drug design in COPD. While smoking cessation is shown to be a useful prevention tool for disease risk and mortality reduction, DNA methylation loci independent of smoking may be used as a target for a more personalized and focused treatment approach.

Materials and Methods

Study population

For our analyses in blood we used two independent subsets of participants from the RS (23). The full discovery set for meQTL analysis was comprised of 724 participants with full genomic and epigenetic data, while replication set included 766 participants. The replication subset is part of the Biobanking and Biomolecular Resources Research Infrastructure for The Netherlands (BBMRI-NL), BIOS (Biobank-based Integrative Omics Studies) project (35). RS has been approved by the Medical Ethics Committee of the Erasmus MC and by the Ministry of Health, Welfare and Sport of the Netherlands, implementing the Population Studies Act: Rotterdam Study. All participants provided written informed consent to participate in the study and to obtain information from their treating physicians. The detailed information on our samples can be found in Supplementary Material.

Spirometry measures and COPD diagnosis

From the initial full datasets for meQTL analysis (ndiscovery=724, nreplication=766), after excluding participants with asthma, we used data from 655 participants in the discovery and 684 participants in the replication cohort for the association analyses with COPD. COPD diagnosis was defined as pre-bronchodilator FEV1/FVC < 0.7. More detailed information can be found in Supplementary Material.

COPD SNPs selection

Using the GWAS catalog (36) on 15 January 2017, we performed a search with the term ‘19q13.2’, additionally applying filters for the P-value ≤ 5 × 10−8 and for the trait to include term ‘Chronic obstructive pulmonary disease’. One GWAS study passed these filtering criteria, and reported the top SNP, rs7937-T (NC_000019.10: g.40796801C > T), to be associated with COPD (OR = 1.37) (9). We used this SNP to perform all analyses in our study.

Genotyping in RS

Genotyping was performed using 610K and 660K Illumina arrays for which whole blood genomic DNA was used. Detailed information can be found in Supplementary Material. Imputation was done using 1000 Genomes (1KG) phase I v3 reference panel, with measured genotypes that had minor allele frequencies (MAF)>1%, performed with MacH software and Minimac implementation. We extracted dosages of rs7937 [risk allele T, allele frequency = 0.54], from RS imputed data using DatABEL library of R-package (37).

DNA methylation array in RS

We used Illumina Infinium Human Methylation 450K array to quantify DNA methylation levels across the genome from whole blood in RS. The detailed QC and normalization criteria can be found in the Supplementary Material. Ultimately, after the QC and normalization steps our discovery set included 724 Caucasian participants and 463 456 probes, while the replication set included 766 Caucasian participants and 419 936 probes.

RNA array in blood in RS

In the discovery sample, we used the same blood samples at baseline to isolate RNA, which we hybridized to Illumina Whole-Genome Expression Beadchips Human HT-12 v4 array. Raw probe intensities were quantile-normalized and 2-log transformed and controlled for quality as described elsewhere (28). After all normalization and QC steps the sample consisted of 21 238 probes in 721 participants with available full data on SNP and RNA arrays and all covariates.

RNA array in lung tissue

Gene expression was quantified using lung tissue samples obtained from patients that underwent lung resection surgery at three facilities participating in the LES: University of Groningen (GRN), Laval University (Laval) and University of British Columbia (UBC) (38). Illumina Human1M-Duo BeadChip arrays were used for genotyping, and a custom Affymetrix microarray (GPL10379) for gene expression profiling. The final dataset for the eQTL analysis consisted of 1087 subjects. More detailed information can be found in Supplementary Material.

Statistical analyses

meQTL analysis

We performed EWAS in the discovery cohort using linear regression analysis with rs7937-T as independent variable and DNA methylation sites as dependent variable. We fitted two models; first adjusted for age, sex, technical covariates to correct for batch effects (array number and position on array) and the estimated white blood cell counts (39) (including monocytes, T-lymphocytes: CD4 and CD8, B-lymphocytes, natural killer cells, neutrophils and eosinophils) (Model 1); and second, for significant sites additionally adjusted for current smoking and pack-years smoked (Model 2). We used the FDR < 0.05 as an epigenome-wide significance threshold (40). Significant sites from Model 1 were then tested for association in the replication cohort using the same models as in the discovery. Since the 19q13.2 region was also implicated in smoking behavior, we also tested significant CpG sites in a third model including ‘rs7937 × smoking’ interaction term to assess possible interaction between rs7937 and smoking (for both current smoking and pack-years smoked), in the discovery and replication cohorts. In addition, for the significant sites we used another model additionally adjusted for COPD status which we compared with the Model 1 in attempt to further elucidate the direction of the effect between DNA methylation and COPD.

COPD and FEV1/FVC analysis

To test if the significantly associated methylation sites are also associated with the lung phenotypes, we performed logistic and linear regression analyses with COPD and FEV1/FVC ratio, respectively as dependent variables and DNA methylation as independent variable. In the first model we adjusted for age, sex, technical covariates and estimated white blood cell counts and additionally for current smoking and pack-years smoked in the second model, in both the discovery and the replication cohort. Results from the two cohorts were meta-analysed using fixed effect models with ‘rmeta’ package in R (41). Bonferroni correction was applied to adjust for multiple testing. In addition, for the significant sites we used another model additionally adjusted for rs7937 which we compared with the first model in attempt to further elucidate the direction of the effect between DNA methylation and COPD.

Blood eQTL analysis

In the discovery cohort, we tested whether rs7937 is associated with differential expression in the whole blood. We used linear regression analysis with rs7937 as independent variable and genome-wide normalized gene expression as dependent variable. For this analysis, we used model adjusted for age, sex, current smoking, technical batch effects (plate ID and RNA quality) and white blood cell counts (lymphocytes, monocytes and granulocytes). For significant (FDR < 0.05) probes, the second model was additionally adjusted for significant DNA methylation levels.

Mediation analysis

We have performed formal mediation analysis using the bootstrapping method in the ‘mediation’ package in R (42), to assess the potential mediator role of significant DNA methylation in the SNP-expression association. One thousand bootstraps were run to estimate the confidence intervals (43). We used models adjusted for age, sex, current smoking, pack-years, expression technical batch effects (plate ID and RNA quality), methylation technical batch effects (position on array and array number) and estimated blood cell composition.

Lung eQTL analysis

To test if the SNP rs7937 is associated (FDR < 0.05) with differential expression in lung tissue in LES, we performed a genome-wide linear regression analysis with the SNP as the independent variable and 2-log transformed gene expression levels as dependent variable. This analysis was performed for each of the three participating cohorts (GRN, Laval and UBC) separately, adjusted for lung disease status, age, sex, smoking status and cohort-specific principal components (PCs). The inverse-variance weighted fixed effect meta-analysis of the results obtained from the three cohorts was performed with ‘rmeta’ package in R software. The detailed overview of the fitted models can be found in the Supplementary Material.

Supplementary Material

Supplementary Material is available at HMG online. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  37 in total

1.  Variable DNA methylation is associated with chronic obstructive pulmonary disease and lung function.

Authors:  Weiliang Qiu; Andrea Baccarelli; Vincent J Carey; Nadia Boutaoui; Helene Bacherman; Barbara Klanderman; Stephen Rennard; Alvar Agusti; Wayne Anderson; David A Lomas; Dawn L DeMeo
Journal:  Am J Respir Crit Care Med       Date:  2011-12-08       Impact factor: 21.405

2.  Theory and Analysis of Total, Direct, and Indirect Causal Effects.

Authors:  Axel Mayer; Felix Thoemmes; Norman Rose; Rolf Steyer; Stephen G West
Journal:  Multivariate Behav Res       Date:  2014 Sep-Oct       Impact factor: 5.923

3.  Prolyl hydroxylase-1 negatively regulates IkappaB kinase-beta, giving insight into hypoxia-induced NFkappaB activity.

Authors:  Eoin P Cummins; Edurne Berra; Katrina M Comerford; Amandine Ginouves; Kathleen T Fitzgerald; Fergal Seeballuck; Catherine Godson; Jens E Nielsen; Paul Moynagh; Jacques Pouyssegur; Cormac T Taylor
Journal:  Proc Natl Acad Sci U S A       Date:  2006-11-17       Impact factor: 11.205

4.  NUMBL interacts with TAB2 and inhibits TNFalpha and IL-1beta-induced NF-kappaB activation.

Authors:  Qi Ma; Li Zhou; Huili Shi; Keke Huo
Journal:  Cell Signal       Date:  2008-01-31       Impact factor: 4.315

5.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

6.  A conserved family of prolyl-4-hydroxylases that modify HIF.

Authors:  R K Bruick; S L McKnight
Journal:  Science       Date:  2001-10-11       Impact factor: 47.728

7.  Genome-wide meta-analyses identify multiple loci associated with smoking behavior.

Authors: 
Journal:  Nat Genet       Date:  2010-04-25       Impact factor: 38.330

8.  Systematic identification of trans eQTLs as putative drivers of known disease associations.

Authors:  Harm-Jan Westra; Marjolein J Peters; Tõnu Esko; Hanieh Yaghootkar; Claudia Schurmann; Johannes Kettunen; Mark W Christiansen; Bruce M Psaty; Samuli Ripatti; Alexander Teumer; Timothy M Frayling; Andres Metspalu; Joyce B J van Meurs; Lude Franke; Benjamin P Fairfax; Katharina Schramm; Joseph E Powell; Alexandra Zhernakova; Daria V Zhernakova; Jan H Veldink; Leonard H Van den Berg; Juha Karjalainen; Sebo Withoff; André G Uitterlinden; Albert Hofman; Fernando Rivadeneira; Peter A C 't Hoen; Eva Reinmaa; Krista Fischer; Mari Nelis; Lili Milani; David Melzer; Luigi Ferrucci; Andrew B Singleton; Dena G Hernandez; Michael A Nalls; Georg Homuth; Matthias Nauck; Dörte Radke; Uwe Völker; Markus Perola; Veikko Salomaa; Jennifer Brody; Astrid Suchy-Dicey; Sina A Gharib; Daniel A Enquobahrie; Thomas Lumley; Grant W Montgomery; Seiko Makino; Holger Prokisch; Christian Herder; Michael Roden; Harald Grallert; Thomas Meitinger; Konstantin Strauch; Yang Li; Ritsert C Jansen; Peter M Visscher; Julian C Knight
Journal:  Nat Genet       Date:  2013-09-08       Impact factor: 38.330

9.  Mapping cis- and trans-regulatory effects across multiple tissues in twins.

Authors:  Elin Grundberg; Kerrin S Small; Åsa K Hedman; Alexandra C Nica; Alfonso Buil; Sarah Keildson; Jordana T Bell; Tsun-Po Yang; Eshwar Meduri; Amy Barrett; James Nisbett; Magdalena Sekowska; Alicja Wilk; So-Youn Shin; Daniel Glass; Mary Travers; Josine L Min; Sue Ring; Karen Ho; Gudmar Thorleifsson; Augustine Kong; Unnur Thorsteindottir; Chrysanthi Ainali; Antigone S Dimas; Neelam Hassanali; Catherine Ingle; David Knowles; Maria Krestyaninova; Christopher E Lowe; Paola Di Meglio; Stephen B Montgomery; Leopold Parts; Simon Potter; Gabriela Surdulescu; Loukia Tsaprouni; Sophia Tsoka; Veronique Bataille; Richard Durbin; Frank O Nestle; Stephen O'Rahilly; Nicole Soranzo; Cecilia M Lindgren; Krina T Zondervan; Kourosh R Ahmadi; Eric E Schadt; Kari Stefansson; George Davey Smith; Mark I McCarthy; Panos Deloukas; Emmanouil T Dermitzakis; Tim D Spector
Journal:  Nat Genet       Date:  2012-09-02       Impact factor: 38.330

10.  Systematic identification of genetic influences on methylation across the human life course.

Authors:  Tom R Gaunt; Hashem A Shihab; Gibran Hemani; Josine L Min; Geoff Woodward; Oliver Lyttleton; Jie Zheng; Aparna Duggirala; Wendy L McArdle; Karen Ho; Susan M Ring; David M Evans; George Davey Smith; Caroline L Relton
Journal:  Genome Biol       Date:  2016-03-31       Impact factor: 13.583

View more
  10 in total

1.  EGLN2 DNA methylation and expression interact with HIF1A to affect survival of early-stage NSCLC.

Authors:  Ruyang Zhang; Linjing Lai; Jieyu He; Chao Chen; Dongfang You; Weiwei Duan; Xuesi Dong; Ying Zhu; Lijuan Lin; Sipeng Shen; Yichen Guo; Li Su; Andrea Shafer; Sebastian Moran; Thomas Fleischer; Maria Moksnes Bjaanæs; Anna Karlsson; Maria Planck; Johan Staaf; Åslaug Helland; Manel Esteller; Yongyue Wei; Feng Chen; David C Christiani
Journal:  Epigenetics       Date:  2019-01-31       Impact factor: 4.528

Review 2.  Molecular Mechanisms of Vascular Damage During Lung Injury.

Authors:  Ramon Bossardi Ramos; Alejandro Pablo Adam
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 2.622

3.  Integrative genomics identifies new genes associated with severe COPD and emphysema.

Authors:  Phuwanat Sakornsakolpat; Jarrett D Morrow; Peter J Castaldi; Craig P Hersh; Yohan Bossé; Edwin K Silverman; Ani Manichaikul; Michael H Cho
Journal:  Respir Res       Date:  2018-03-22

4.  Upregulation of long noncoding RNA RAB11B-AS1 promotes tumor metastasis and predicts poor prognosis in lung cancer.

Authors:  Tiegang Li; Di Wu; Qun Liu; Dedong Wang; Jinbin Chen; Hongjun Zhao; Lan Zhang; Chenli Xie; Wei Zhu; Zhixu Chen; Yifeng Zhou; Soham Datta; Fuman Qiu; Lei Yang; Jiachun Lu
Journal:  Ann Transl Med       Date:  2020-05

5.  Objectives, design and main findings until 2020 from the Rotterdam Study.

Authors:  M Arfan Ikram; Guy Brusselle; Mohsen Ghanbari; André Goedegebure; M Kamran Ikram; Maryam Kavousi; Brenda C T Kieboom; Caroline C W Klaver; Robert J de Knegt; Annemarie I Luik; Tamar E C Nijsten; Robin P Peeters; Frank J A van Rooij; Bruno H Stricker; André G Uitterlinden; Meike W Vernooij; Trudy Voortman
Journal:  Eur J Epidemiol       Date:  2020-05-04       Impact factor: 8.082

6.  Integration of SNP Disease Association, eQTL, and Enrichment Analyses to Identify Risk SNPs and Susceptibility Genes in Chronic Obstructive Pulmonary Disease.

Authors:  Yang Liu; Kun Huang; Yahui Wang; Erqiang Hu; Benliang Wei; Zhaona Song; Yuqing Zou; Luanfeng Ge; Lina Chen; Wan Li
Journal:  Biomed Res Int       Date:  2020-12-29       Impact factor: 3.411

7.  Epigenome-wide association study of lung function in Latino children and youth with asthma.

Authors:  Esther Herrera-Luis; Annie Li; Esteban G Burchard; Maria Pino-Yanes; Angel C Y Mak; Javier Perez-Garcia; Jennifer R Elhawary; Sam S Oh; Donglei Hu; Celeste Eng; Kevin L Keys; Scott Huntsman; Kenneth B Beckman; Luisa N Borrell; Jose Rodriguez-Santana
Journal:  Clin Epigenetics       Date:  2022-01-15       Impact factor: 6.551

8.  LINC01414/LINC00824 genetic polymorphisms in association with the susceptibility of chronic obstructive pulmonary disease.

Authors:  Xiaoman Zhou; Yunjun Zhang; Yutian Zhang; Quanni Li; Mei Lin; Yixiu Yang; Yufei Xie; Yipeng Ding
Journal:  BMC Pulm Med       Date:  2021-07-07       Impact factor: 3.317

9.  Genome-wide association study of letrozole plasma concentrations identifies non-exonic variants that may affect CYP2A6 metabolic activity.

Authors:  Daniel L Hertz; Julie A Douglas; Kelley M Kidwell; Christina L Gersch; Zeruesenay Desta; Ana-Maria Storniolo; Vered Stearns; Todd C Skaar; Daniel F Hayes; N Lynn Henry; James M Rae
Journal:  Pharmacogenet Genomics       Date:  2021-07-01       Impact factor: 2.000

10.  Functional Gene Variants in Chronic Obstructive Pulmonary Disease: The Search Continues.

Authors:  Mario Acunzo; Patrick Nana-Sinkam
Journal:  Am J Respir Cell Mol Biol       Date:  2021-07       Impact factor: 6.914

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.