Literature DB >> 31549173

Appraising the causal relevance of DNA methylation for risk of lung cancer.

Thomas Battram1,2, Rebecca C Richmond1,2, Laura Baglietto3, Philip C Haycock1,2, Vittorio Perduca4, Stig E Bojesen5,6,7, Tom R Gaunt1,2, Gibran Hemani1,2, Florence Guida8, Robert Carreras-Torres8, Rayjean Hung9, Christopher I Amos10, Joshua R Freeman11, Torkjel M Sandanger12, Therese H Nøst12, Børge G Nordestgaard5,6,7, Andrew E Teschendorff13,14,15, Silvia Polidoro16, Paolo Vineis16,17, Gianluca Severi18,19,20, Allison M Hodge19,20, Graham G Giles19,20, Kjell Grankvist21, Mikael B Johansson22, Mattias Johansson8, George Davey Smith1,2, Caroline L Relton1,2.   

Abstract

BACKGROUND: DNA methylation changes in peripheral blood have recently been identified in relation to lung cancer risk. Some of these changes have been suggested to mediate part of the effect of smoking on lung cancer. However, limitations with conventional mediation analyses mean that the causal nature of these methylation changes has yet to be fully elucidated.
METHODS: We first performed a meta-analysis of four epigenome-wide association studies (EWAS) of lung cancer (918 cases, 918 controls). Next, we conducted a two-sample Mendelian randomization analysis, using genetic instruments for methylation at CpG sites identified in the EWAS meta-analysis, and 29 863 cases and 55 586 controls from the TRICL-ILCCO lung cancer consortium, to appraise the possible causal role of methylation at these sites on lung cancer.
RESULTS: Sixteen CpG sites were identified from the EWAS meta-analysis [false discovery rate (FDR) < 0.05], for 14 of which we could identify genetic instruments. Mendelian randomization provided little evidence that DNA methylation in peripheral blood at the 14 CpG sites plays a causal role in lung cancer development (FDR > 0.05), including for cg05575921-AHRR where methylation is strongly associated with both smoke exposure and lung cancer risk.
CONCLUSIONS: The results contrast with previous observational and mediation analysis, which have made strong claims regarding the causal role of DNA methylation. Thus, previous suggestions of a mediating role of methylation at sites identified in peripheral blood, such as cg05575921-AHRR, could be unfounded. However, this study does not preclude the possibility that differential DNA methylation at other sites is causally involved in lung cancer development, especially within lung tissue.
© The Author(s) 2019. Published by Oxford University Press on behalf of the International Epidemiological Association.

Entities:  

Keywords:  ALSPAC; ARIES; DNA methylation; Lung cancer; Mendelian randomization

Mesh:

Substances:

Year:  2019        PMID: 31549173      PMCID: PMC6857764          DOI: 10.1093/ije/dyz190

Source DB:  PubMed          Journal:  Int J Epidemiol        ISSN: 0300-5771            Impact factor:   7.196


Key Messages

DNA methylation is a modifiable biomarker, giving it the potential to be targeted for intervention in many diseases, including lung cancer that is the most common cause of cancer-related death. This Mendelian randomization study attempted to evaluate whether there was a causal relationship, and thus potential for intervention, between DNA methylation measured in peripheral blood and lung cancer, by assessing whether genetically altered DNA methylation levels impart differential lung cancer risks. Differential methylation at 14 CpG sites identified in epigenome-wide association analysis of lung cancer were assessed. Despite >99% power to detect the observational effect sizes, our Mendelian randomization analysis gave little evidence that any of the sites were causally linked to lung cancer. This is in stark contrast to previous analyses that suggested two CpG sites within the AHRR and F2RL3 loci, which were also observed in this analysis, mediate >30% of the effect of smoking on lung cancer. Overall findings suggest there is little or no role of differential methylation at the CpG sites identified within the blood in the development of lung cancer. Thus, targeting these sites for prevention of lung cancer is unlikely to yield effective treatments.

Background

Lung cancer is the most common cause of cancer-related death worldwide. Several DNA methylation changes have been recently identified in relation to lung cancer risk. Given the plasticity of epigenetic markers, any DNA methylation changes that are causally linked to lung cancer are potentially appealing targets for intervention., However, these epigenetic markers are sensitive to reverse causation, being affected by cancer processes, and are also prone to confounding, for example by socioeconomic and lifestyle factors., One CpG site, cg05575921 within the aryl hydrocarbon receptor repressor (AHRR) gene, has been consistently replicated in relation to both smoking and lung cancer,, and functional evidence suggests that this region could be causally involved in lung cancer. However, the observed association between methylation and lung cancer might simply reflect separate effects of smoking on lung cancer and DNA methylation, i.e. the association may be a result of confounding, including residual confounding after adjustment for self-reported smoking behaviour., Furthermore, recent epigenome-wide association studies (EWAS) for lung cancer have revealed additional CpG sites which may be causally implicated in development of the disease., Mendelian randomization (MR) uses genetic variants associated with modifiable factors as instruments to infer causality between the modifiable factor and outcome, overcoming most unmeasured or residual confounding and reverse causation., In order to infer causality, three core assumptions of MR should be met: (i) the instrument is associated with the exposure; (ii) the instrument is not associated with any confounders; and (iii) the instrument is associated with the outcome only through the exposure. MR may be adapted to the setting of DNA methylation with the use of single nucleotide polymorphisms (SNPs) that correlate with methylation of CpG sites, known as methylation quantitative trait loci (mQTLs). In this study, we performed a meta-analysis of four lung cancer EWAS (918 case-control pairs) from prospective cohort studies to identify CpG sites associated with lung cancer risk, and we applied MR to investigate whether the observed DNA methylation changes at these sites are causally linked to lung cancer.

Methods

EWAS meta-analysis

We conducted a meta-analysis of four lung cancer case-control EWAS that assessed DNA methylation using the Illumina Infinium® HumanMethylation450 BeadChip. All EWAS are nested within prospective cohorts that measured DNA methylation in peripheral blood samples before diagnosis: EPIC-Italy (185 case-control pairs), Melbourne Collaborative Cohort Study (MCCS) (367 case-control pairs), Norwegian Women and Cancer (NOWAC) (132 case-control pairs) and the Northern Sweden Health and Disease Study (NSHDS) (234 case-control pairs). Study populations, laboratory methods, data preprocessing and quality control methods have been described in detail elsewhere and are outlined in the Supplementary Methods, available as Supplementary data at IJE online. To quantify the association between the methylation level at each CpG and the risk of lung cancer, we fitted conditional logistic regression models for beta values of methylation [which ranges from 0 (no cytosines methylated) to 1 (all cytosines methylated)] on lung cancer status for the four studies. The cases and controls in each study were matched; details of this are in the Supplementary Methods, available as Supplementary data at IJE online. Surrogate variables were computed in the four studies using the SVA R package, and the proportion of CD8+ and CD4+ T cells, B cells, monocytes, natural killer cells and granulocytes within whole blood were derived from DNA methylation. The following EWAS models were included in the meta-analysis: Model 1—unadjusted; Model 2—adjusted for 10 surrogate variables (SVs); Model 3—adjusted for 10 SVs and derived cell proportions. Stratification of EWAS by smoking status was also conducted [never (N = 304), former (N = 648) and current smoking (N = 857)]. For Model 1, 2 and 3, the case-control studies not matched on smoking status (EPIC-Italy and NOWAC) were adjusted for smoking. We performed an inverse-variance weighted fixed effects meta-analysis of the EWAS (918 case-control pairs) using the METAL software [http://csg.sph.umich.edu/abecasis/metal/]. Direction of effect, effect estimates and the I2 statistic were used to assess heterogeneity across the studies in addition to effect estimates across smoking strata (never, former and current). All sites identified at a false discovery rate (FDR) <0.05 in Models 2 and 3 were also present in the sites identified in Model 1. The effect size differences between models for all sites identified in Model 1 were assessed by a Kruskal-Wallis test and a post hoc Dunn’s test. There was little evidence for a difference (P > 0.1), so to maximize inclusion into the MR analyses, we took forward the sites identified in the unadjusted model (Model 1).

Mendelian randomization

Two-sample MR was used to establish potential causal effects of differential methylation on lung cancer risk., In the first sample, we identified mQTL-methylation effect estimates (βGP) for each CpG site of interest in an mQTL database from the Accessible Resource for Integrated Epigenomic Studies (ARIES) [http://www.mqtldb.org]. Details on the methylation preprocessing, genotyping and quality control (QC) pipelines are outlined in the Supplementary Methods, available as Supplementary data at IJE online. In the second sample, we used summary data from a GWAS meta-analysis of lung cancer risk conducted by the Transdisciplinary Research in Cancer of the Lung and The International Lung Cancer Consortium (TRICL-ILCCO) (29 863 cases, 55 586 controls) to obtain mQTL-lung cancer estimates (βGD). For each independent mQTL (r2 <0.01), we calculated the log odds ratio (OR) per standard deviation (SD) unit increase in methylation by the formula βGD/βGP (Wald ratio). Standard errors were approximated by the delta method. Where multiple independent mQTLs were available for one CpG site, these were combined in a fixed effects meta-analysis after weighting each ratio estimate by the inverse variance of their associations with the outcome. Heterogeneity in Wald ratios across mQTLs was estimated using Cochran’s Q test, which can be used to indicate horizontal pleiotropy. Differences between the observational and MR estimates were assessed using a Z test for difference. If there was evidence for an mQTL-CpG site association in ARIES in at least one time point, we assessed whether the mQTL replicated across time points in ARIES (FDR < 0.05, same direction of effect). Further, we re-analysed this association using linear regression of methylation on each genotyped SNP available in an independent cohort (NSHDS), using rvtests (Supplementary Methods, available as Supplementary data at IJE online). Replicated mQTLs were included where possible to reduce the effect of winner’s curse using effect estimates from ARIES. We assessed the instrument strength of the mQTLs by investigating the variance explained in methylation by each mQTL (r2) as well as the F statistic in ARIES (Supplementary Table 1, available as Supplementary data at IJE online). The power to detect the observational effect estimates in the two-sample MR analysis was assessed a priori, based on an alpha of 0.05, sample size of 29 863 cases and 55 586 controls (from TRICL-ILCCO) and calculated variance explained (r2). MR analyses were also performed to investigate the impact of methylation on lung cancer subtypes in TRICL-ILCCO: adenocarcinoma (11 245 cases, 54 619 controls), small cell carcinoma (2791 cases, 20 580 controls) and squamous cell carcinoma (7704 cases, 54 763 controls). We also assessed the association in never smokers (2303 cases, 6995 controls) and ever smokers (23 848 cases, 16 605 controls). Differences between the smoking subgroups were assessed using a Z test for difference. We next investigated the extent to which the mQTLs at cancer-related CpGs were associated with four smoking behaviour traits which could confound the methylation-lung cancer association: number of cigarettes per day, smoking cessation rate, smoking initiation and age of smoking initiation, using GWAS data from the Tobacco and Genetics (TAG) consortium (N = 74 053).

Supplementary analyses

Assessing the potential causal effect of AHRR methylation: one-sample MR

Given previous findings implicating methylation at AHRR in relation to lung cancer,, we performed a one-sample MR analysis of AHRR methylation on lung cancer incidence, using individual-level data from the Copenhagen City Heart Study (CCHS) (357 incident cases, 8401 remaining free of lung cancer). Details of the phenotypic, methylation and genetic data, as well as the linked lung cancer data, are outlined in the Supplementary Methods, available as Supplementary data at IJE online. An allele score of mQTLs located with 1 Mb of cg05575921-AHRR was created and its association with AHRR methylation tested (Supplementary Methods, available as Supplementary data at IJE online). We investigated associations between the allele score and several potential confounding factors (sex, alcohol consumption, smoking status, occupational exposure to dust and/or welding fumes, passive smoking). We next performed MR analyses using two-stage Cox regression, with adjustment for age and sex, and further stratified by smoking status.

Tumour and adjacent normal methylation patterns

DNA methylation data from lung cancer tissue and matched normal adjacent tissue (N = 40 squamous cell carcinoma and N = 29 adenocarcinoma), profiled as part of The Cancer Genome Atlas (TCGA), were used to assess tissue-specific DNA methylation changes across sites identified in the meta-analysis of EWAS, as outlined previously.

mQTL association with gene expression

For the genes annotated to CpG sites identified in the lung cancer EWAS, we examined gene expression in whole blood and lung tissue, using data from the gene-tissue expression (GTEx) consortium. Analyses were conducted in Stata (version 14) and R (version 3.2.2). For the two-sample MR analysis we used the MR-Base R package TwoSampleMR. An adjusted P-value that limited the FDR was calculated using the Benjamini-Hochberg method. All statistical tests were two-sided.

Results

A flowchart representing our study design along with a summary of our results at each step is displayed in Figure 1.
Figure 1.

Study design with results summary. ARIES, Accessible Resource for Integrated Epigenomic Studies; TRICL-ILLCO, Transdisciplinary Research in Cancer of the Lung and The International Lung Cancer Consortium; MR, Mendelian randomization; CCHS, Copenhagen City Heart Study; TCGA, The Cancer Genome Atlas. *2 000 individuals with samples at multiple time points.

Study design with results summary. ARIES, Accessible Resource for Integrated Epigenomic Studies; TRICL-ILLCO, Transdisciplinary Research in Cancer of the Lung and The International Lung Cancer Consortium; MR, Mendelian randomization; CCHS, Copenhagen City Heart Study; TCGA, The Cancer Genome Atlas. *2 000 individuals with samples at multiple time points. The basic meta-analysis adjusted for study-specific covariates identified 16 CpG sites that were hypomethylated in relation to lung cancer (FDR < 0.05, Model 1, Figure 2). Adjusting for 10 surrogate variables (Model 2) and derived cell counts (Model 3) gave similar results (Table 1). The direction of effect at the 16 sites did not vary between studies (median I2 = 38.6) (Supplementary Table 2, available as Supplementary data at IJE online), but there was evidence for heterogeneity of effect estimates at some sites when stratifying individuals by smoking status (Table 1).
Figure 2.

Observational associations of DNA methylation and lung cancer: a fixed effects meta-analysis of lung cancer EWAS weighted on the inverse variance was performed to establish the observational association between differential DNA methylation and lung cancer. a) Manhattan plot, all points above the solid line are at P < 1 x 10-7 and all points above the dashed line (and triangular points) are at FDR <0.05. In total, 16 CpG sites are associated with lung cancer (FDR <0.05). b) Quantile-quantile plot of the EWAS results [same data as (a) Manhattan plot].

Table 1.

Meta-analyses of EWAS of lung cancer using four separate cohorts: 16 CpG sites associated with lung cancer at false-discovery rate < 0.05

Basic
SV adjusted
Cell count + SV adjusted
Never smokers
Former smokers
Current smokers
Smoker group comparison
CpGGeneChrPositionORSEPORSE P ORSE P ORSE P ORSE P ORSE P DirI2 P
cg05575921 AHRR 53733780.4740.0471.45E-160.4520.0536.27E-140.4520.0553.60E-130.9320.227.17E-010.4580.0846.10E-070.7080.0665.36E-05+--630.07
cg21566642 ALPPL2 22332846610.5350.0451.70E-150.5250.052.49E-130.5130.0513.12E-130.8920.1454.18E-010.5220.0811.42E-060.7460.0673.67E-04+--810.01
cg06126421 IER3 6307200800.5850.0462.08E-130.5440.0542.49E-110.5130.0543.92E-120.7830.1922.22E-010.5610.0871.88E-050.7270.1121.79E-02−--330.23
cg03636183 F2RL3 19170005850.6360.0457.99E-120.6150.0538.21E-100.610.0541.61E-090.9090.1725.53E-010.6240.0847.50E-050.7860.0692.92E-03−--710.03
cg05951221 ALPPL2 22332844020.660.0459.68E-110.6420.0511.77E-090.6290.0521.50E-090.8680.1764.09E-010.6340.0827.21E-050.8190.0667.42E-03−--440.17
cg01940273 ALPPL2 22332849340.6920.054.20E-080.6750.0587.32E-070.6850.0613.58E-061.1440.234.28E-010.5750.0862.57E-050.8760.0686.59E-02−--220.28
cg23771366 PRSS23 11865109980.7690.041.10E-070.7290.0511.45E-060.7090.0525.60E-071.0930.164.90E-010.6210.0761.40E-050.8560.0611.97E-02−--00.66
cg11660018 PRSS23 11865109150.7880.0371.18E-070.70.0511.97E-070.6780.0538.86E-080.9350.1315.86E-010.7530.0711.01E-030.8440.0534.15E-03−--00.53
cg26963277 KCNQ1 1127224070.6680.0551.21E-070.640.0683.79E-060.6230.0692.53E-060.5390.1751.40E-020.7240.111.54E-020.7070.0871.59E-03−--160.31
cg27241845 ALPPL2 22332503700.6690.0551.45E-070.6790.0671.67E-050.6730.0692.47E-050.750.2081.93E-010.6770.1085.01E-030.7260.0873.09E-03−--00.65
cg23387569 AGAP2 12581200110.7130.0491.53E-070.7020.0583.69E-060.6830.0591.89E-060.7860.1641.69E-010.7140.1071.02E-020.7490.0792.48E-03−--690.04
cg09935388 GFI1 1929475880.6760.0552.48E-070.6690.0669.67E-060.6740.073.00E-050.9610.2428.44E-010.740.1274.22E-020.6810.0751.06E-04−--00.89
cg01901332 ARRB1 11750310540.7250.0482.82E-070.6860.0641.12E-050.6580.0642.20E-061.0170.2149.22E-010.5990.0931.48E-040.7830.0723.92E-03+--810.01
cg25305703 CASC21 81283782180.7250.0494.46E-070.7170.0671.11E-040.7150.0691.48E-040.8010.1692.10E-010.7610.1062.58E-020.7690.0753.20E-03−--00.98
cg16823042 AGAP2 12581199920.7390.0491.14E-060.7260.0581.51E-050.7010.0595.90E-060.830.1833.09E-010.720.17.36E-030.7990.081.35E-02−--100.33
cg08709672 AVPR1B 12062243340.7490.0481.36E-060.7590.0581.14E-040.7390.065.33E-050.7290.1711.02E-010.7380.0853.47E-030.8160.0792.13E-02−--00.85

Meta-analyses of epigenome-wide association studies of lung cancer adjusted for study specific covariates: (basic, N = 1809), basic model + surrogate variables (SV adjusted, N = 1809), basic model + surrogate variables + derived cell counts (cell count + SV adjusted, N = 1809).

Meta-analyses were also conducted stratified by smoking status [never (N = 304), former (N = 648), current (N = 857)] using the basic model.

Smoker group comparison = heterogeneity across meta-analyses when stratifying by smoking status.

Dir, direction of effect; OR, odds ratio per SD increase in DNA methylation; SE, standard error; Chr, chromosome.

Observational associations of DNA methylation and lung cancer: a fixed effects meta-analysis of lung cancer EWAS weighted on the inverse variance was performed to establish the observational association between differential DNA methylation and lung cancer. a) Manhattan plot, all points above the solid line are at P < 1 x 10-7 and all points above the dashed line (and triangular points) are at FDR <0.05. In total, 16 CpG sites are associated with lung cancer (FDR <0.05). b) Quantile-quantile plot of the EWAS results [same data as (a) Manhattan plot]. Meta-analyses of EWAS of lung cancer using four separate cohorts: 16 CpG sites associated with lung cancer at false-discovery rate < 0.05 Meta-analyses of epigenome-wide association studies of lung cancer adjusted for study specific covariates: (basic, N = 1809), basic model + surrogate variables (SV adjusted, N = 1809), basic model + surrogate variables + derived cell counts (cell count + SV adjusted, N = 1809). Meta-analyses were also conducted stratified by smoking status [never (N = 304), former (N = 648), current (N = 857)] using the basic model. Smoker group comparison = heterogeneity across meta-analyses when stratifying by smoking status. Dir, direction of effect; OR, odds ratio per SD increase in DNA methylation; SE, standard error; Chr, chromosome. We identified 15 independent mQTLs (r2<0.01) associated with methylation at 14 of 16 CpGs. Ten mQTLs replicated at FDR < 0.05 in NSHDS (Supplementary Table 3, available as Supplementary data at IJE online). MR power analyses indicated >99% power to detect ORs for lung cancer of the same magnitude as those in the meta-analysis of EWAS. There was little evidence for an effect of methylation at these 14 sites on lung cancer (FDR > 0.05, Supplementary Table 4, available as Supplementary data at IJE online). For nine of 14 CpG sites, the point estimates from the MR analysis were in the same direction as in the EWAS, but of a much smaller magnitude (Z test for difference, P < 0.001) (Figure 3).
Figure 3.

Mendelian randomization (MR) vs observational analysis. Two-sample MR was carried out with methylation at 14/16 CpG sites identified in the EWAS meta-analysis as the exposure and lung cancer as the outcome. cg01901332 and cg05575921 had two instruments, so the estimate was calculated using the inverse variance weighted method; for the rest, the MR estimate was calculated using a Wald ratio. Only 14 of 16 sites could be instrumented using mQTLs from [mqtldb.org]. OR, odds ratio per SD increase in DNA methylation. *Instrumental variable not replicated in independent dataset (NSHDS). The sites for which instrumental variables have not been replicated are cg01901332, cg21566642, cg05575921 and cg08709672.

Mendelian randomization (MR) vs observational analysis. Two-sample MR was carried out with methylation at 14/16 CpG sites identified in the EWAS meta-analysis as the exposure and lung cancer as the outcome. cg01901332 and cg05575921 had two instruments, so the estimate was calculated using the inverse variance weighted method; for the rest, the MR estimate was calculated using a Wald ratio. Only 14 of 16 sites could be instrumented using mQTLs from [mqtldb.org]. OR, odds ratio per SD increase in DNA methylation. *Instrumental variable not replicated in independent dataset (NSHDS). The sites for which instrumental variables have not been replicated are cg01901332, cg21566642, cg05575921 and cg08709672. For nine of out the 16 mQTL-CpG associations, there was strong replication across time points (Supplementary Table 5, available as Supplementary data at IJE online) and 10 out of 16 mQTL-CpG associations replicated at FDR < 0.05 in an independent adult cohort (NSHDS). Using mQTL effect estimates from NSHDS for the 10 CpG sites that replicated (FDR < 0.05), findings were consistent with limited evidence for a causal effect of peripheral blood-derived DNA methylation on lung cancer (Supplementary Figure 1, available as Supplementary data at IJE online). There was little evidence of different effect estimates between ever and never smokers at individual CpG sites (Supplementary Figure 2, available as Supplementary data at IJE online, Z test for difference, P > 0.5). There was some evidence for a possible effect of methylation at cg21566642-ALPPL2 and cg23771366-PRSS23 on squamous cell lung cancer {OR = 0.85 [95% confidence interval (CI)=0.75, 0.97] and 0.91 (95% CI = 0.84, 1.00) per SD (14.4% and 5.8%) increase, respectively} as well as methylation at cg23387569-AGAP2, cg16823042-AGAP2, and cg01901332-ARRB1 on lung adenocarcinoma [OR = 0.86 (95% CI = 0.77, 0.96), 0.84 (95% CI = 0.74, 0.95), and 0.89 (95% CI = 0.80, 1.00) per SD (9.47%, 8.35%, and 8.91%) increase, respectively]. However, none of the results withstood multiple testing correction (FDR < 0.05) (Supplementary Figure 3, available as Supplementary data at IJE online). For those CpGs where multiple mQTLs were used as instruments (cg05575921-AHRR and cg01901332-ARRB1), there was limited evidence for heterogeneity in MR effect estimates (Q test, P > 0.05, Supplementary Table 6, available as Supplementary data at IJE online). Single mQTLs for cg05575921-AHRR, cg27241845-ALPPL2 and cg26963277-KCNQ1 showed some evidence of association with smoking cessation (former vs current smokers), although these associations were not below the FDR < 0.05 threshold (Supplementary Figure 4, available as Supplementary data at IJE online).

Potential causal effect of AHRR methylation on lung cancer risk: one-sample MR

In the CCHS, a per (average methylation-increasing) allele change in a four-mQTL allele score was associated with a 0.73% (95% CI = 0.56, 0.90) increase in methylation (P < 1 x 10–10) and explained 0.8% of the variance in cg05575921-AHRR methylation (F statistic = 74.2). Confounding factors were not strongly associated with the genotypes in this cohort (P ≥ 0.11) (Supplementary Table 7, available as Supplementary data at IJE online). Results provided some evidence for an effect of cg05575921 methylation on total lung cancer risk [hazard ratio (HR) = 0.30 (95% CI = 0.10, 1.00) per SD (9.2%) increase] (Supplementary Table 8, available as Supplementary data at IJE online). The effect estimate did not change substantively when stratified by smoking status (Supplementary Table 8, available as Supplementary data at IJE online). Given contrasting findings with the main MR analysis, where cg05575921-AHRR methylation was not causally implicated in lung cancer, and the lower power in the one-sample analysis to detect an effect of equivalent size to the observational results (power = 19% at alpha = 0.05), we performed further two-sample MR based on the four mQTLs using data from both CCHS (sample one) and the TRICL-ILCCO consortium (sample two). Results showed no strong evidence for a causal effect of DNA methylation on total lung cancer risk [OR = 1.00 (95% CI = 0.83, 1.10) per SD increase] (Supplementary Figure 5, available as Supplementary data at IJE online). There was also limited evidence for an effect of cg05575921-AHRR methylation when stratified by cancer subtype and smoking status (Supplementary Figure 5, available as Supplementary data at IJE online) and no strong evidence for heterogeneity of the mQTL effects (Supplementary Table 9, available as Supplementary data at IJE online). Conclusions were consistent when MR-Egger was applied (Supplementary Figure 5, available as Supplementary data at IJE online) and when accounting for correlation structure between the mQTLs (Supplementary Table 9, available as Supplementary data at IJE online).

Tumour and adjacent normal lung tissue methylation patterns

For cg05575921-AHRR, there was no strong evidence for differential methylation between adenocarcinoma tissue and adjacent healthy tissue (P = 0.963), and weak evidence for hypermethylation in squamous cell carcinoma tissue (P = 0.035) (Figure 4; Supplementary Table 10, available as Supplementary data at IJE online). For the other CpG sites there was evidence for a difference in DNA methylation between tumour and healthy adjacent tissue at several sites in both adenocarcinoma and squamous cell carcinoma, with consistent differences for CpG sites in ALPPL2 (cg2156642, cg05951221 and cg01940273), as well as cg23771366-PRSS23, cg26963277-KCNQ1, cg09935388-GFI1, cg0101332-ARRB1, cg08709672-AVPR1B and cg25305703-CASC21. However, hypermethylation in tumour tissue was found for the majority of these sites, which is opposite to what was observed in the EWAS analysis.
Figure 4.

Differential DNA methylation in lung cancer tissue: a comparison of methylation at each of the 16 CpG sites identified in our meta-analysis was made between lung cancer tissue and adjacent healthy lung tissue for patients with: a) lung adenocarcinoma; and b) squamous cell lung cancer. Publicly available Data from The Cancer Genome Atlas were used for this analysis.

Differential DNA methylation in lung cancer tissue: a comparison of methylation at each of the 16 CpG sites identified in our meta-analysis was made between lung cancer tissue and adjacent healthy lung tissue for patients with: a) lung adenocarcinoma; and b) squamous cell lung cancer. Publicly available Data from The Cancer Genome Atlas were used for this analysis.

Gene expression associated with mQTLs in blood and lung tissue

Of the 10 genes annotated to the 14 CpG sites, eight genes were expressed sufficiently to be detected in lung (AVPR1B and CASC21 were not) and seven in blood (AVPR1B, CASC21 and ALPPL2 were not). Of these, gene expression of ARRB1 could not be investigated as the mQTLs in that region were not present in the GTEx data. rs3748971 and rs878481, mQTLs for cg21566642 and cg05951221, respectively, were associated with increased expression of ALPPL2 (P = 0.002 and P = 0.0001). No other mQTLs were associated with expression of the annotated gene at a Bonferroni corrected P-value threshold (P < 0.05/19 = 0.0026) (Supplementary Table 11, available as Supplementary data at IJE online).

Discussion

In this study, we identified 16 CpG sites associated with lung cancer, of which 14 have been previously identified in relation to smoke exposure and six were highlighted in a previous study as being associated with lung cancer. This previous study used the same data from the four cohorts investigated here, but in a discovery and replication, rather than meta-analysis framework. Overall, using MR we found limited evidence supporting a potential causal effect of methylation at the CpG sites identified in peripheral blood on lung cancer. These findings are in contrast to previous analyses suggesting that methylation at two CpG sites investigated (in AHRR and F2RL3) mediated >30% of the effect of smoking on lung cancer risk. This previous study used methods which are sensitive to residual confounding and measurement error that may have biased results., These limitations are largely overcome using MR. Although there was some evidence for an effect of methylation at some of the other CpG sites on risk of subtypes of lung cancer, these effects were not robust to multiple testing correction and were not validated in the analysis of tumour and adjacent normal lung tissue methylation nor in gene expression analysis. A major strength of the study was the use of two-sample MR to integrate an extensive epigenetic resource and summary data from a large lung cancer GWAS, to appraise causality of observational associations with >99% power. Evidence against the observational findings was also acquired through tissue-specific DNA methylation and gene expression analyses. Limitations include potential ‘winner’s curse’ which may bias causal estimates in a two-sample MR analysis towards the null if the discovery sample for identifying genetic instruments is used as the first sample, as was done for our main MR analysis using data from ARIES. However, findings were similar when using replicated mQTLs in NSHDS, indicating that the potential impact of this bias was minimal (Supplementary Figure 1, available as Supplementary data at IJE online). Another limitation relates to the potential issue of consistency and validity of the instruments across the two samples. For a minority of the mQTL-CpG associations (four out of 16), there was limited replication across time points and in particular, six mQTLs were not strongly associated with DNA methylation in adults. Further, our primary data used for the first sample in the two-sample MR were ARIES, which contains no male adults. If the mQTLs identified vary by sex and time, then this could bias our results. However, our replication cohort NSHDS contains adult males. Therefore, the 10 mQTLs that replicated in NSHDS are unlikely to be biased by the sex discordance. Also, we replicated the findings for cg05575921 AHRR in CCHS, which contains both adult males and females, in a two-sample MR analysis, suggesting that these results also are not influenced by sex discordance. Caution is therefore warranted when interpreting the null results for the two-sample MR estimates for the CpG sites for which mQTLs were not replicated, which could be the result of weak-instrument bias. The lack of independent mQTLs for each CpG site did not allow us to properly appraise horizontal pleiotropy in our MR analyses. Where possible we only included cis-acting mQTLs to minimize pleiotropy, and investigated heterogeneity where there were multiple independent mQTLs. Three mQTLs were nominally associated with smoking phenotypes, but not to the extent that this would bias our MR results substantially. Some of the mQTLs used influence multiple CpGs in the same region, suggesting genomic control of methylation at a regional rather than single CpG level. This was untested, but methods to detect differentially methylated regions (DMRs) and identify genetic variants which proxy for them may be fruitful in probing the effect of methylation across gene regions. A further limitation relates to the inconsistency in effect estimates between the one- and two-sample MR analysis to appraise the causal role of AHRR methylation. Findings in CCHS were supportive of a causal effect of AHRR methylation on lung cancer [HR = 0.30 (95% CI = 0.10, 1.00) per SD], but in two-sample MR this site was not causally implicated [OR = 1.00 (95% CI = 0.83, 1.10) per SD increase]. We verified that this was not due to differences in the genetic instruments used, nor due to issues of weak instrument bias. Given that the CCHS one-sample MR had little power (19% at alpha = 0.05) to detect a causal effect with a size equivalent to that of the observational analysis, we have more confidence in the results from the two-sample approach. Peripheral blood may not be the ideal tissue to assess the association between DNA methylation and lung cancer. A high degree of concordance in mQTLs has been observed across lung tissue, skin and peripheral blood DNA, but we were unable to directly evaluate this here. A possible explanation for a lack of causal effect at AHRR is due to the limitation of tissue specificity, as we found that the mQTLs used to instrument cg05575921 were not strongly related to expression of AHRR in lung tissue. However, findings from MR analysis were corroborated by the lack of evidence for differential methylation at AHRR between lung adenocarcinoma tissue and adjacent healthy tissue, and weak evidence for hypermethylation (opposite to the expected direction) in squamous cell lung cancer tissue. This result may be interesting in itself, as smoking is hypothesized to influence squamous cell carcinoma more than adenocarcinoma. However, the result conflicts with that found in the MR analysis. Furthermore, another study investigating tumorous lung tissue (N = 511) found only weak evidence for an association between smoking and cg05575921 AHRR methylation, which did not survive multiple testing correction (P = 0.02). However, our results do not fully exclude AHRR from involvement in the disease process. AHRR and AHR form a regulatory feedback loop, which means that the actual effect of differential methylation or differential expression of AHR/AHRR on pathway activity is complex. In addition, some of the CpG sites identified in the EWAS were found to be differentially methylated in the tumour and adjacent normal lung tissue comparison. Whereas this could represent a false-negative result of the MR analysis, it is of interest that differential methylation in the tissue comparison analysis was typically in the opposite direction to that observed in the EWAS. Furthermore, although this method can be used to minimize confounding, it does not fully eliminate the possibility of bias due to reverse causation (whereby cancer induces changes in DNA methylation) or intra-individual confounding e.g. by gene expression. Therefore, it does not give conclusive evidence that DNA methylation changes at these sites are not relevant to the development of lung cancer. Whereas DNA methylation in peripheral blood may be predictive of lung cancer risk, according to the present analysis it is unlikely to play a causal role in lung carcinogenesis at the CpG sites investigated. Findings from this study issue caution over the use of traditional mediation analyses to implicate intermediate biomarkers (such as DNA methylation) in pathways linking an exposure with disease, given the potential for residual confounding in this context. However, the findings of this study do not preclude the possibility that other DNA methylation changes are causally related to lung cancer (or other smoking-associated disease).

Funding

This work was partly supported by a Wellcome Trust PhD studentship to T.B. (203746); and by Cancer Research UK (C18281/A19169, C57854/A22171 and C52724/A20138). This work was also supported by the UK Medical Research Council (MC_UU_00011/1 and MC_UU_00011/5), which funds a Unit at the University of Bristol where T.B., R.C.R., P.C.H., T.R.G., G.D.S. and C.L.R. work. Funding to pay the Open Access publication charges for this article was provided by the University of Bristol RCUK.  The UK Medical Research Council and Wellcome (Grant ref: 102215/2/13/2) and the University of Bristol provide core support for ALSPAC. Methylation data in the ALSPAC cohort were generated as part of the UK BBSRC-funded (BB/I025751/1 and BB/I025263/1) Accessible Resource for Integrated Epigenomic Studies (ARIES) [http://www.ariesepigenomics.org.uk]. Click here for additional data file.
  35 in total

1.  Two-step epigenetic Mendelian randomization: a strategy for establishing the causal role of epigenetic processes in pathways to disease.

Authors:  Caroline L Relton; George Davey Smith
Journal:  Int J Epidemiol       Date:  2012-02       Impact factor: 7.196

2.  Correlation of Smoking-Associated DNA Methylation Changes in Buccal Cells With DNA Methylation Changes in Epithelial Cancer.

Authors:  Andrew E Teschendorff; Zhen Yang; Andrew Wong; Christodoulos P Pipinikas; Yinming Jiao; Allison Jones; Shahzia Anjum; Rebecca Hardy; Helga B Salvesen; Christina Thirlwell; Samuel M Janes; Diana Kuh; Martin Widschwendter
Journal:  JAMA Oncol       Date:  2015-07       Impact factor: 31.777

Review 3.  The fundamental role of epigenetic events in cancer.

Authors:  Peter A Jones; Stephen B Baylin
Journal:  Nat Rev Genet       Date:  2002-06       Impact factor: 53.242

4.  The impact of residual and unmeasured confounding in epidemiologic studies: a simulation study.

Authors:  Zoe Fewell; George Davey Smith; Jonathan A C Sterne
Journal:  Am J Epidemiol       Date:  2007-07-05       Impact factor: 4.897

5.  Differences in smoking associated DNA methylation patterns in South Asians and Europeans.

Authors:  Hannah R Elliott; Therese Tillin; Wendy L McArdle; Karen Ho; Aparna Duggirala; Tim M Frayling; George Davey Smith; Alun D Hughes; Nish Chaturvedi; Caroline L Relton
Journal:  Clin Epigenetics       Date:  2014-02-03       Impact factor: 6.551

6.  DNA methylation changes measured in pre-diagnostic peripheral blood samples are associated with smoking and lung cancer risk.

Authors:  Laura Baglietto; Erica Ponzi; Philip Haycock; Allison Hodge; Manuela Bianca Assumma; Chol-Hee Jung; Jessica Chung; Francesca Fasanelli; Florence Guida; Gianluca Campanella; Marc Chadeau-Hyam; Kjell Grankvist; Mikael Johansson; Ugo Ala; Paolo Provero; Ee Ming Wong; Jihoon Joo; Dallas R English; Nabila Kazmi; Eiliv Lund; Christian Faltus; Rudolf Kaaks; Angela Risch; Myrto Barrdahl; Torkjel M Sandanger; Melissa C Southey; Graham G Giles; Mattias Johansson; Paolo Vineis; Silvia Polidoro; Caroline L Relton; Gianluca Severi
Journal:  Int J Cancer       Date:  2016-10-11       Impact factor: 7.396

7.  AHRR (cg05575921) hypomethylation marks smoking behaviour, morbidity and mortality.

Authors:  Stig E Bojesen; Nicholas Timpson; Caroline Relton; George Davey Smith; Børge G Nordestgaard
Journal:  Thorax       Date:  2017-01-18       Impact factor: 9.139

8.  A reference panel of 64,976 haplotypes for genotype imputation.

Authors:  Shane McCarthy; Sayantan Das; Warren Kretzschmar; Olivier Delaneau; Andrew R Wood; Alexander Teumer; Hyun Min Kang; Christian Fuchsberger; Petr Danecek; Kevin Sharp; Yang Luo; Carlo Sidore; Alan Kwong; Nicholas Timpson; Seppo Koskinen; Scott Vrieze; Laura J Scott; He Zhang; Anubha Mahajan; Jan Veldink; Ulrike Peters; Carlos Pato; Cornelia M van Duijn; Christopher E Gillies; Ilaria Gandin; Massimo Mezzavilla; Arthur Gilly; Massimiliano Cocca; Michela Traglia; Andrea Angius; Jeffrey C Barrett; Dorrett Boomsma; Kari Branham; Gerome Breen; Chad M Brummett; Fabio Busonero; Harry Campbell; Andrew Chan; Sai Chen; Emily Chew; Francis S Collins; Laura J Corbin; George Davey Smith; George Dedoussis; Marcus Dorr; Aliki-Eleni Farmaki; Luigi Ferrucci; Lukas Forer; Ross M Fraser; Stacey Gabriel; Shawn Levy; Leif Groop; Tabitha Harrison; Andrew Hattersley; Oddgeir L Holmen; Kristian Hveem; Matthias Kretzler; James C Lee; Matt McGue; Thomas Meitinger; David Melzer; Josine L Min; Karen L Mohlke; John B Vincent; Matthias Nauck; Deborah Nickerson; Aarno Palotie; Michele Pato; Nicola Pirastu; Melvin McInnis; J Brent Richards; Cinzia Sala; Veikko Salomaa; David Schlessinger; Sebastian Schoenherr; P Eline Slagboom; Kerrin Small; Timothy Spector; Dwight Stambolian; Marcus Tuke; Jaakko Tuomilehto; Leonard H Van den Berg; Wouter Van Rheenen; Uwe Volker; Cisca Wijmenga; Daniela Toniolo; Eleftheria Zeggini; Paolo Gasparini; Matthew G Sampson; James F Wilson; Timothy Frayling; Paul I W de Bakker; Morris A Swertz; Steven McCarroll; Charles Kooperberg; Annelot Dekker; David Altshuler; Cristen Willer; William Iacono; Samuli Ripatti; Nicole Soranzo; Klaudia Walter; Anand Swaroop; Francesco Cucca; Carl A Anderson; Richard M Myers; Michael Boehnke; Mark I McCarthy; Richard Durbin
Journal:  Nat Genet       Date:  2016-08-22       Impact factor: 38.330

9.  DNA methylation arrays as surrogate measures of cell mixture distribution.

Authors:  Eugene Andres Houseman; William P Accomando; Devin C Koestler; Brock C Christensen; Carmen J Marsit; Heather H Nelson; John K Wiencke; Karl T Kelsey
Journal:  BMC Bioinformatics       Date:  2012-05-08       Impact factor: 3.169

10.  Efficient design for Mendelian randomization studies: subsample and 2-sample instrumental variable estimators.

Authors:  Brandon L Pierce; Stephen Burgess
Journal:  Am J Epidemiol       Date:  2013-07-17       Impact factor: 4.897

View more
  22 in total

1.  Response to: Prenatal smoke exposure, DNA methylation and a link between DRD1 and lung cancer.

Authors:  Rebecca C Richmond; Matthew Suderman; Ryan Langdon; Caroline L Relton; George Davey Smith
Journal:  Int J Epidemiol       Date:  2019-08-01       Impact factor: 7.196

2.  DNA methylation patterns reflect individual's lifestyle independent of obesity.

Authors:  Ireen Klemp; Anne Hoffmann; Luise Müller; Tobias Hagemann; Kathrin Horn; Kerstin Rohde-Zimmermann; Anke Tönjes; Joachim Thiery; Markus Löffler; Ralph Burkhardt; Yvonne Böttcher; Michael Stumvoll; Matthias Blüher; Knut Krohn; Markus Scholz; Ronny Baber; Paul W Franks; Peter Kovacs; Maria Keller
Journal:  Clin Transl Med       Date:  2022-06

3.  Epigenetic mechanisms of lung carcinogenesis involve differentially methylated CpG sites beyond those associated with smoking.

Authors:  Dusan Petrovic; Barbara Bodinier; Florence Guida; Marc Chadeau-Hyam; Sonia Dagnino; Matthew Whitaker; Maryam Karimi; Gianluca Campanella; Therese Haugdahl Nøst; Silvia Polidoro; Domenico Palli; Vittorio Krogh; Rosario Tumino; Carlotta Sacerdote; Salvatore Panico; Eiliv Lund; Pierre-Antoine Dugué; Graham G Giles; Gianluca Severi; Melissa Southey; Paolo Vineis; Silvia Stringhini; Murielle Bochud; Torkjel M Sandanger; Roel C H Vermeulen
Journal:  Eur J Epidemiol       Date:  2022-05-20       Impact factor: 12.434

4.  Identification of novel susceptibility methylation loci for pancreatic cancer in a two-phase epigenome-wide association study.

Authors:  Ziqiao Wang; Yue Lu; Myriam Fornage; Li Jiao; Jianjun Shen; Donghui Li; Peng Wei
Journal:  Epigenetics       Date:  2022-01-14       Impact factor: 4.861

5.  Epigenome-wide scan identifies differentially methylated regions for lung cancer using pre-diagnostic peripheral blood.

Authors:  Naisi Zhao; Mengyuan Ruan; Devin C Koestler; Jiayun Lu; Carmen J Marsit; Karl T Kelsey; Elizabeth A Platz; Dominique S Michaud
Journal:  Epigenetics       Date:  2021-05-19       Impact factor: 4.528

6.  Methylation Regulation of TLR3 on Immune Parameters in Lung Adenocarcinoma.

Authors:  Ang Li; Hongjiao Wu; Qinqin Tian; Yi Zhang; Zhi Zhang; Xuemei Zhang
Journal:  Front Oncol       Date:  2021-05-20       Impact factor: 6.244

7.  A region-based method for causal mediation analysis of DNA methylation data.

Authors:  Qi Yan; Erick Forno; Juan Celedón; Wei Chen
Journal:  Epigenetics       Date:  2021-03-23       Impact factor: 4.861

8.  Methylome-wide association study of antidepressant use in Generation Scotland and the Netherlands Twin Register implicates the innate immune system.

Authors:  Miruna C Barbu; Floris Huider; Archie Campbell; Carmen Amador; Mark J Adams; Mary-Ellen Lynall; David M Howard; Rosie M Walker; Stewart W Morris; Jenny Van Dongen; David J Porteous; Kathryn L Evans; Edward Bullmore; Gonneke Willemsen; Dorret I Boomsma; Heather C Whalley; Andrew M McIntosh
Journal:  Mol Psychiatry       Date:  2021-12-08       Impact factor: 13.437

Review 9.  DNA Methylation Markers in Lung Cancer.

Authors:  Yoonki Hong; Woo Jin Kim
Journal:  Curr Genomics       Date:  2021-02       Impact factor: 2.236

10.  The increased expression and aberrant methylation of SHC1 in non-small cell lung cancer: Integrative analysis of clinical and bioinformatics databases.

Authors:  Yicheng Liang; Yangyang Lei; Minjun Du; Mei Liang; Zixu Liu; Xingkai Li; Yushun Gao
Journal:  J Cell Mol Med       Date:  2021-06-11       Impact factor: 5.310

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.