Literature DB >> 34415821

Disease-related blood-based differential methylation in cystic fibrosis and its representation in lung cancer revealed a regulatory locus in PKP3 in lung epithelial cells.

Esther Schamschula1, Angelika Lahnsteiner1, Yassen Assenov2, Wolfgang Hagmann2, Nadja Zaborsky3,4, Markus Wiederstein1, Anna Strobl1, Frauke Stanke5,6, Thomas Muley7,8, Christoph Plass2,8, Burkhard Tümmler5,6, Angela Risch1,2,4,8.   

Abstract

Cystic fibrosis (CF) is a monogenic disease, characterized by massive chronic lung inflammation. The observed variability in clinical phenotypes in monozygotic CF twins is likely associated with the extent of inflammation. This study sought to investigate inflammation-related aberrant DNA methylation in CF twins and to determine to what extent acquired methylation changes may be associated with lung cancer.Blood-based genome-wide DNA methylation analysis was performed to compare the DNA methylomes of monozygotic twins, from the European CF Twin and Sibling Study with various degrees of disease severity. Putatively inflammation-related and differentially methylated positions were selected from a large lung cancer case-control study and investigated in blood by targeted bisulphite next-generation-sequencing. An inflammation-related locus located in the Plakophilin-3 (PKP3) gene was functionally analysed regarding promoter and enhancer activity in presence and absence of methylation using luciferase reporter assays.We confirmed in a unique cohort that monozygotic twins, even if clinically discordant, have only minor differences in global DNA methylation patterns and blood cell composition. Further, we determined the most differentially methylated positions, a high proportion of which are blood cell-type-specific, whereas others may be acquired and thus have potential relevance in the context of inflammation as lung cancer risk factors. We identified a sequence in the gene body of PKP3 which is hypermethylated in blood from CF twins with severe phenotype and highly variably methylated in lung cancer patients and controls, independent of known clinical parameters, and showed that this region exhibits methylation-dependent promoter activity in lung epithelial cells.

Entities:  

Keywords:  Cystic fibrosis; DNA methylation; lung cancer; lung inflammation; monozygotic twins

Mesh:

Substances:

Year:  2021        PMID: 34415821      PMCID: PMC9423854          DOI: 10.1080/15592294.2021.1959976

Source DB:  PubMed          Journal:  Epigenetics        ISSN: 1559-2294            Impact factor:   4.861


Introduction

Cystic fibrosis (CF) is an autosomal recessive monogenic disease caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene which is located on chromosome 7q31.2 and was discovered in 1989 [1]. It affects approximately over 1:2000 new-borns in the Caucasian population [2]. More than 2000 different genetic variants in CFTR, including missense, frameshift, splicing and nonsense mutations as well as in-frame deletions and insertions, are known to be causative for various CF phenotypes [3]. CFTR protein deficiency causes excessive inflammatory processes in the affected organs, mainly the lung, pancreas and intestines [4,5]. Major contributors to the severity of this condition are respiratory malfunctions and malnutrition. These symptoms are reflected by the clinical parameters forced expiratory volume in 1 second percent (FEV1%) and weight expected for height percent (wfh%), both referring to the non-CF population and representative for pulmonary and nutritional status, respectively, which together are considered to define the disease severity and subsequently intra-pair discordance [6]. Both parameters are associated with the chronic inflammation of the affected organs [7,8]. Considering that CF is a monogenic disease, the severity of the lung condition is remarkably variable in different individuals. The most evident cause for this phenomenon is the high number of distinct CFTR mutation types, leading to different CFTR protein defects and impaired functions [3]. Additionally, a number of genetic modifiers affecting the CF phenotype have been identified in various genetic association studies and have been shown to account for more than 50% of variation in the lung disease phenotype [9-12]. Interestingly, clinical discordance has been observed even in monozygotic twins with CF. This implies a role of environment and epigenetics in determining disease severity in addition to the genetic factors [13,14]. Phenotypical discordance in monozygotic twins is plausibly associated with epigenetic effects, as the respective twins are genetically identical and typically share early-life environmental exposure. Due to the reduced confounders and cohort effects in twin studies, even small sample groups are appropriate for identification of phenotype-associated epigenetic markers and mechanisms [15,16]. Several twin studies have been successfully performed to investigate epigenetic patterns, such as DNA methylation, in the context of lung function, inflammatory diseases and cancer [13,17-21]. Differentially methylated loci associated with diseases such as cancer are considered promising as biomarkers for risk, development and progression of diseases [20,22,23, 24]. DNA methylation patterns, although cell-type and tissue specific can be reflected in clinically accessible surrogate tissues, such as whole blood [25-29]. In concert with other epigenetic mechanisms, DNA methylation decreases the accessibility of genomic regions and therefore has physiological functional relevance for genomic stability, chromatin structure, imprinting, transcription factor binding, splicing, silencing of retro-transposal, repetitive elements and gene expression (e.g., oncogenes) as well as for activation of gene expression (e.g., tumour suppressor genes) and transcription of antisense transcripts [30-33]. Consequently, aberrant DNA methylation or demethylation is also functionally associated with a variety of diseases, among them inflammatory diseases and cancer [31,34-37]. It has been shown that chronic inflammation in the lung and other tissues alters DNA methylation patterns via distinct mechanisms involving inflammatory mediators and infectious agents [38-41]. Chronic inflammation and infections are strongly connected with cancer risk, initiation and progression and have been described as causative and consequential events in cancer development [42,43]. On the one hand, infectious agents generate metabolites, on the other hand, the host´s immune system produces cytokines, chemokines, growth factors and free radicals. If unresolved, chronic inflammation causes a constant disordered environment, finally leading to genetic and epigenetic aberrations promoting neoplasia [44,45]. Hence, infection and inflammation can both increase cancer risk via multiple mechanisms of which many have yet to be clarified. A better understanding of these mechanisms is essential in order to improve cancer intervention strategies. Altered DNA methylation is one mechanism promoting cancer development in an inflammatory environment, as has been shown for infections with the Human Papilloma virus, Helicobacter pylori, Epstein–Barr Virus and Hepatitis B and C viruses [46, 47–52]. Chronic inflammatory lung diseases such as chronic obstructive pulmonary disease (COPD) are associated with an increased risk for lung cancer [53, 54, 55]. Furthermore, chronic inflammation of the lung has been shown to be implicated in lung cancer development also via epigenetic mechanisms [56,57]. Although CF has not been demonstrated to be associated with lung cancer risk, we consider CF an adequate model for investigating the impact of massive chronic inflammation of the lung and its implications in other inflammation-related lung diseases such as lung cancer [58]. It has been shown that the well-known modifier of CF lung disease, Mucin 4 (MUC4) and Mucin 5AC (MUC5AC) are associated with lung cancer risk and prognosis, respectively [11,59,60]. Further CF lung modifier genes, such as ETS homologous factor (EHF), (Epithelial splicing regulatory protein 2) ESRP2 and macrophage migration inhibitory factor (MIF) have functional roles in epithelial to mesenchymal transition, a process that is also epigenetically regulated and plays a major role in lung cancer development [61-65]. Glutathione S-transferase M 1 (GSTM1), involved in detoxification pathways, is related to lung cancer risk in an environment-dependent manner and modifies lung disease severity in CF as well [11,66,67]. Transforming growth factor beta 1 (TGFβ1), associated with inflammation and lung cancer risk and the smoking-related differentially methylated gene Aryl-hydrocarbon receptor repressor (AHRR) are likewise modifiers of CF lung disease [11,65,68,69]. In the present study we aim to identify novel mechanisms that may play a role in inflammation-related lung carcinogenesis. We investigated aberrant DNA methylation patterns putatively acquired by chronic inflammation of the lung, their appearance in the lung cancer context and their functional implications in lung diseases, particularly lung cancer. For this purpose, we examined blood-based genome-wide DNA methylation patterns in monozygotic CF twin pairs with distinct phenotypical discordance from the European CF Twin and Sibling Study [6]. In order to study inflammation-related epigenetic alterations, genome-wide DNA methylation data were obtained for 22 monozygotic twin pairs with CF. The methylomes were characterized with respect to the clinical parameters. Discordance-dependent differentially methylated loci were identified by an unbiased selection process. Differentially methylated loci, potentially interesting in the context of chronic lung inflammation and lung cancer, were tested in whole blood samples of a large lung cancer case-control cohort to discover epigenetic alterations with possible functional implications in inflammation-related lung cancer. For the most interesting locus, functional analysis has been conducted in lung epithelial cells.

Material and methods

Subjects and samples

Illumina 450 K methylation analysis was conducted for 22 monozygotic twin pairs with Cystic Fibrosis from the European CF Twin and Sibling Study. Spirometry-based lung function measurements were available and the clinical severity (DfO: distance from origin) and the phenotypical intra-pair discordance (Delta) had been calculated based on lung function, reflected by FEV1% (forced expiratory volume in 1 second percent) and wfh% (weight expected for height percent), representative for nutritional status as described previously [6]. With respect to their intra-pair discordance, the twin pairs were assigned to the categories concordant (n = 6), intermediate (n = 6), discordant (n = 6) and NA (unknown discordance) (n = 4) (Table 1).
Table 1.

Clinical characteristics of the CF twins.

  Twin pair category
 
  concordantintermediatediscordantunknown discordanceΣ
clinical parameter (n = 12 individuals)(n = 12 individuals)(n = 12 individuals)(n = 8 individuals)(n = 44 individuals)
age [y] 19.3 ± 9.614.6 ± 7.317.0 ± 10.23.0 ± 1.214.4 ± 10.1
weight [kg]    11.3 ± 3.6 
height [cm]    86.5 ± 14.6 
DfO 562.3 ± 235.8466.8 ± 133.6466.9 ± 203.4  
Delta 80.5 ± 36.7164.0 ± 31.5366.2 ± 36.5  
female/male ratio 2.00.22.03.01.2
CFTR genotypeF508d/F580d668424
 F508d/D579G2   2
 F508d/1525-2A-G2   2
 F508d/3849 + 10kbC-T2   2
 3601–2A-G/3601-2A-G 2  2
 F508d/2183 AA-G 2  2
 Q39X/Q552X 2  2
 2183AA-G/IVS8:TG15-5 T  2 2
 F508d/G542X  2 2
 F508d/4016insT   22
 F508d/1717-1 G-A   22

CF: cystic fibrosis; NA: unknown discordance; DfO: distance from origin, corresponds to severity; Delta: intra-pair discordance; CFTR: cystic fibrosis transmembrane conductance regulator

Clinical characteristics of the CF twins. CF: cystic fibrosis; NA: unknown discordance; DfO: distance from origin, corresponds to severity; Delta: intra-pair discordance; CFTR: cystic fibrosis transmembrane conductance regulator For targeted bisulphite sequencing-based methylation analysis 109 lung cancer patients and 109 healthy controls from the HD lung cancer case–control cohort [70,71] were pair-wise matched by sex, age (± 0 years), packyears (±0 years) and smoking status (current, ex and never smokers), in a nested case-control study design (Table 2).
Table 2.

Clinical characteristics of lung cancer cases and healthy controls individually matched by age, gender, smoking status and packyears.

variablescases (n = 109)controls (n = 109)
sexfemale5050
 male5959
mean age [y] 60.6 ± 10.260.6 ± 10.2
mean packyears [y] 15.2 ± 24.315.2 ± 24.3
smoking statuscurrent2525
 ex1717
 never6767
cancer typeno cancer0109
 SCC180
 LCC80
 adenocarcinoma590
 carcinoid60
 NSCLC20
 SCLC140
 mixed20

n: number of samples; SCC: squamous cell carcinoma; LCC: large cell lung cancer; NSCLC: non-small cell lung cancer; SCLC: small cell lung cancer .

Clinical characteristics of lung cancer cases and healthy controls individually matched by age, gender, smoking status and packyears. n: number of samples; SCC: squamous cell carcinoma; LCC: large cell lung cancer; NSCLC: non-small cell lung cancer; SCLC: small cell lung cancer .

Epigenome-wide DNA methylation analysis

From all subjects, frozen peripheral blood samples were used to extract genomic DNA (gDNA). The gDNA was bisulphite-converted with the EZ DNA Methylation™ Kit (Zymo Research). DNA methylation was quantified with the Illumina 450 K BeadChip Array (Illumina). Bioinformatical analysis and quality control of 450 K methylation data were performed with the R package RnBeads [72].

iDMP selection process

For selection of disease-related differentially methylated positions, probes with missing values were removed. Further, positions located in repetitive sequence according to Price et al. (n_bp_repetitive 1) were ignored [73]. Intra-pair differential methylation was calculated as DM = βsevere – βmild, where β is the methylation value corresponding to % methylation of the more or less severely affected twin. For discordance-dependent differentially methylated probes the following assumptions were made: (1) they are strongly differentially methylated, arbitrarily defined as (10%) only in discordant twin pairs and twin pairs with unknown discordance, (2) they are not differentially methylated, defined as (5%) in concordant twin pairs. Methylation differences below 5% at single CpG nucleotides may not be reliably detectable with different methods, such as pyrosequencing [74]. Probes were considered one-directionally differentially methylated if (1) the correlation between methylation values and individual severity (DfO), (2) the correlation between methylation differences and intra-pair discordance (Delta) and (3) the mean methylation differences () consistently indicate a hyper- or hypo-methylation (i.e., positive DfO value, negative Delta value and negative value or vice versa). Correlations were calculated according to: where is the severity or discordance matrix, the methylation value or methylation difference matrix and is one severity or discordance value corresponding to one methylation value or methylation difference, respectively. Identification of positions significantly differentially methylated between different blood-cell-types, was done with an F-test (described in statistical analyses), using a publicly available dataset. Briefly, Reinius et al. sorted and performed cell-type specific genome-wide DNA methylation analysis with different blood cell-types obtained from six healthy individuals [75]. Calculations in our study are based on granulocytes, monocytes, CD4+ and CD8+ T cells, B cells and natural killer cells.

TCGA data analysis

The cancer genome atlas (TCGA) research network DNA methylation data (https://www.cancer.gov/tcga) obtained with the Illumina 450K BeadChip, were downloaded from the TCGA Wanderer for lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) and corresponding normal adjacent tissues [76]. Probes with a methylation difference of ≥5% between LUAD (32 normal samples, 463 tumour samples) and LUSC (43 normal samples, 361 tumour samples) and normal lung tissue, in the same direction as in the CF twin cohort, were considered to be differentially methylated.

ENCODE data analysis

Chromatin immunoprecipitation DNA-sequencing (ChIP-seq) data from nine different cell lines (NHLF, NHEK, K562, HUVEC, HSMM, HMEC, HepG2, H1-hESC and GM12878) from the Broad institute were considered using the UCSC genome browser [77-80].

Assay design for targeted bisulphite next-generation sequencing (NGS)

Twenty-one targeted assays, corresponding to 20 genomic regions were designed using the PyroMark Assay Design Software 2.0. Primer sequences are provided in Supplementary Table 1. The assays were designed based on the UCSC Genome Browser Human Feb. 2009 (GRCh37/hg19) Assembly (http://genome.ucsc.edu/cgi-bin/hgGateway?db=hg19).

Targeted bisulphite next-generation sequencing

Whole blood was obtained from the study subjects and gDNA was isolated. Further processing involved multiple kits which were always used according to the manufacturer’s standard instructions, with exceptions detailed below. Concentrations were determined with the Qubit 2.0. (Thermo Fisher Scientific) using the Qubit dsDNA Broad Range Assay (Thermo Fisher Scientific). Five hundred ng gDNA from each donor was bisulphite-converted with the EZ DNA Methylation™ Kit (Zymo Research) using the alternative incubation conditions for the Illumina Infinium® Methylation assay and eluted in 2 × 30 µl instead of 1 × 10 µl. For the amplification of the target regions, the HotStarTaq DNA polymerase (Qiagen) was used. Ten multiplex PCR reactions were performed for 21 regions as shown in Supplementary Table 2. The amplification products were cleaned up with AMPureXP Beads (Agencourt) to remove primer dimers in a sample:beads ratio of 1:0.8 prior to determination of concentration with the Qubit dsDNA High Sensitivity Kit (Thermo Fisher Scientific). Amplicons were pooled so that each amplicon was represented in an equimolar amount within a given sample. Eight cycles of index PCRs were performed with the NexteraXT Index Kit v2 Set A and B (Illumina) at 55°C annealing temperature and 30 sec elongation time using the KAPA HiFi HotStart Mix (Roche). The resulting indexed amplicons were purified with AMPureXP Beads (Agencourt) to remove primer dimers in a sample:beads ratio of 1:0.8 prior to determination of concentration with the Qubit dsDNA Broad Range Kit (Thermo Fisher Scientific). All samples were equimolarly pooled and again cleaned up with AMPureXP Beads (Agencourt) to remove primer dimers in a sample:beads ratio of 1:0.8 prior to determination of concentration with the Qubit dsDNA Broad Range Kit (Thermo Fisher Scientific). The quality and size of the library was verified using the Fragment Analyser Standard Sensitivity NGS Fragment 1–6000bp Kit (Agilent) on the Fragment Analyser (Agilent). The final library was denatured and diluted according to the MiSeq System Denature and Dilute Libraries Guide (Illumina) and library concentration was determined with the Qubit dsDNA High Sensitivity Kit (Thermo Fisher Scientific). The final libraries were spiked with 25% PhiX Control v3 (Illumina), diluted to 8pM and 300bp paired-end sequenced on a MiSeqDx instrument (Illumina) with the MiSeq Reagent Kit V3 600 cycles (Illumina). Samples from cases and controls were sequenced on the same flow cell.

Bioinformatical analyses of targeted bisulphite NGS data

Demultiplexed FASTQ files from two runs on the MiSeq (Illumina) were unzipped and merged sample-wise. Prior to alignment, reads were quality trimmed with Trim Galore! (Babraham Bioinformatics), (https://github.com/FelixKrueger/TrimGalore) using Cutadapt version 1.12 for paired-end reads. The quality encoding type was set ASCII+33 (phred 33). Low-quality ends were removed with a threshold of phred score 20. The Nextera Transposase was assumed as adaptor sequence. The maximum trimming error rate was set to 0.1, the minimum required adapter overlap (stringency) was specified as 2 bp and the minimum required sequence length for both reads, before a sequence pair gets removed, was set to 20 bp. Trimmed reads were aligned with the methylation analysis pipeline ‘bicycle’ in the paired-end mode with a maximum valid read length of 1000 and phred33 as quality settings for bowtie2 [81]. The UCSC Genome Browser on Human Feb. 2009 (GRCh37/hg19) Assembly was used (http://hgdownload.soe.ucsc.edu/goldenPath/hg19/bigZips/hg19.2bit) as reference genome. The obtained file was converted to a FASTA file, which can be used by bicycle with twoBitToFa (http://hgdownload.soe.ucsc.edu/admin/exe/linux.x86_64/twoBitToFa). Methylation analysis with ‘bicycle’ was performed with a fixed error rate computation mode (Watson error rate = 0.01, Crick error rate = 0.01). Non-correctly bisulphite-converted reads, reads aligned to both Watson (+) and Crick (-) strands and reads with more than one possible alignment were ignored. Mean coverage represents the mean read depth across all cytosines in a region.

Statistical analyses

Calculation of 95% confidence interval was performed with the CI function from the ‘Rmisc’ R-package. All statistical analyses were performed with R and RStudio software if not stated differently. The cell-type contribution was estimated with the R package RnBeads applying the algorithm by Houseman et al. [72,82]. As a reference, data from Reinius et al. were used [75]. An F-test was done with GEO2R (https://www.ncbi.nlm.nih.gov/geo/geo2r/) using the dataset of Reinius et al. to identify cell-type specific differentially methylated positions. For this purpose, the methylation values of granulocytes, CD4+ T cells, CD 8+ T cells, CD14+ monocytes, CD19+ B cells and CD56+ natural killer cells were taken into account [75]. P-values were false discovery rate (FDR) corrected (Bonferroni) and values below 0.05 were considered statistically significant. The non-parametric Wilcoxon rank sum test was performed one-sided and unpaired with a confidence interval of 0.95. The exact p-value was computed and then rounded to the second digit. The function wilcox.test was used. Enrichment analysis was executed with one-sided Fisher´s exact test using the fisher.test function. Gene association, CGI relation and regulatory region were characterized according to the Illumina HumanMethylation450 v1.2 Manifest file. Gene association corresponds to functional regions with UCSC genes that a probe is associated with. CGI relation is equivalent to the location of the CpG relative to the CpG island. Enhancers were defined as predicted enhancer elements, DHS as experimentally determined DNase I Hypersensitivity Sites which correspond to open chromatin (ENCODE project). Promoters and cell-type specific promoters are promoter-associated and cell-type-dependent promoter-associated probes, respectively, provided by the Methylation Consortium. Telomeric and subtelomeric regions were defined as the distal 500 kb of each chromosome arm [83]. Imprinted regions were specified according to the Illumina HumanMethylation450 BeadChip (v1.2, extended annotation) file. Probes that overlap with the stretch of 1500 bp around the transcription start site of any of the transcripts of known imprinted genes were defined as imprinted. The power-analysis was done with the power.t.test function using standard deviations of the candidate positions in the whole blood of 796 healthy individuals from case-control pairs from the NOWAC, MCCS, NSHDS and EPIC HD studies [22] (Supplementary Table 3). The power to detect methylation differences ≥5% with a two-sided paired Student´s t-test with a number of 109 observations per group and a Type I error probability of 5% was calculated (Supplementary Table 4). The paired Student´s t-test was performed two-sided with a confidence interval of 0.95 with the function t.test. The two variances have been treated as being equal and NA values were omitted. P-values were corrected for false discovery rate with the R package q-value. The principal component analysis was executed with the methylation values obtained from 218 samples using the default settings of the prcomp function from the stats package. The Pearson correlation between numerical clinical parameters (age and packyears) and mean methylation values per region was calculated using the cor function. The NAs were removed. The Pearson correlation between the global mean methylation difference and discordance (Delta) and age was calculated using the cor function. The NAs were removed. The coefficient of determination, corresponding to in the mean methylation per region explained by the respective nominal parameters (sex, cancer histology, cancer occurrence, and smoking status) was calculated by performing a multiple regression using the lm function. The NAs were removed. Gene-ontology enrichment analysis was performed for genes with inflammation-related differentially methylated positions in their gene body or promoter for the terms biological process, molecular function and cellular component with a Fisher´s Exact test, FDR was calculated. Welch Two Sample t-test was performed with the relative luciferase activity fold-change values. Luciferase activity values were normalized for firefly activity. The fold-change was calculated and compared to the empty vector as a reference.

Luciferase reporter assay

To investigate the functional role of the putatively inflammation-related differentially methylated region (DMR) located in the Plakophilin 3 (PKP3) gene (chr11:396,135–397,671, 1537bp; hg19), it was cloned into the pCpGfree basic vector (Invivogen) for testing promoter activity and into the pCpGfree promoter vector (Invivogen) for testing enhancer activity (Supplementary Figure 8). To determine the effect of methylation on the activity of the region, in vitro methylation of the constructs was performed with M.SssI (NEB). In short, DNA isolated from the blood of a healthy donor was amplified using the Phusion HF polymerase (Thermo Scientific) with primers (Sigma, Supplementary Table 8) containing the restriction enzyme sites for NsiI and BamHI at the 5´ends of the forward and reverse primer, respectively. The PCR product clean-up was performed with the ReliaPrep DNA Clean-Up and Concentration kit (Promega). One thousand ng of the amplicon and the plasmid were double digested with 20 U of BamHI (NEB) and 20 U NsiI (NEB) at 37°C for 20 min. The digests were cleaned-up with 0.5x volumes SPRI beads. The ligation was performed with a 1:3 plasmid-to-insert ratio with T4 DNA ligase (NEB) at 16°C overnight. Chemically competent E. coli GT115 (pir strain, Invivogen) cells were transformed by performing a heat shock with 5 µl ligation reaction and were spread on LB-zeocin (50 µl/ml; Invivogen) plates. After successful growth in larger volumes, the presence of the insert was verified via colony PCR with Phusion HF polymerase (Thermo Scientific) and short primers targeting the insert region and Sanger sequencing (LGC genomics; Supplementary Table 8). The plasmids containing desired inserts were extracted with the PureYield Plasmid Midiprep System (Promega). In vitro methylation of 2 µg constructs was performed with 8 U M.SssI (NEB) at 37°C for 3 h. The enzyme was inactivated at 65°C for 20 min, followed by purification with the ReliaPrep DNA Clean-Up and Concentration kit (Promega). A549 and H1299 lung epithelial cells were seeded into 24 well plates (1.5×105 cells/well) with Ham F-12 K + 10% foetal bovine serum medium for A549 and RPMI + 10% FBS for H1299 without antibiotics. The next day, 500 ng plasmid and 25 ng pGL4 SV40 control plasmid were co-transfected with Lipofectamine 3000 (Thermo Scientific) and incubated overnight. After cell lysis, the Lucia and Firefly read-out was generated with a Tecan Spark plate reader (Tecan). The Lucia luciferase activity was normalized to the firefly activity. To calculate the fold increase in gene expression, the empty vector was used as a control. Each experiment was performed in triplicates and repeated at least three times. Cells were regularly tested for mycoplasma.

Results

The phenotype of CF exhibits extensive inflammation, particularly in the lung [9]. These disease-related processes may have an impact on DNA methylation patterns. In order to investigate CF phenotype- and discordance-related epigenetic alterations in whole blood, 22 clinically well-characterized monozygotic twin pairs with CF from the European CF Twin and Sibling Study were studied [6]. In total, the CF twins comprise eleven different homo- and heterozygous CFTR genotypes of which the most abundant is p.Phe508del/p.Phe508del (p.Phe508del homozygous) representing the expected distribution in Europe [84]. From the 44 individuals, peripheral blood DNA was obtained for epigenome-wide DNA methylation analysis of more than 400,000 selected loci in the human genome (Table 1). The overall methylation patterns in all twins, irrespective of clinical discordance, demonstrate a bimodal distribution, representing the normally observed patterns of DNA methylation values in healthy tissues, with the majority of CpG dinucleotides being either very highly (>85%) or very slightly (<10%) methylated [85]. These results suggest, that even strong inflammatory processes do not shift global DNA methylation patterns in whole blood. To determine the global effect of clinical severity and discordance on the DNA methylome, the total and pair-wise methylation differences across all loci covered by the array were assessed. Cumulated, all twin pairs revealed a very low mean global intra-pair methylation difference of −0.1% (standard deviation 2.7%) with an upper and lower 95% confidence interval of 0.029% and −0.256%, respectively. Thus, a significant difference cannot be assumed. The pair-wise mean global methylation differences were between −0.6% and 0.4%, the absolute differences were between 1.1% and 2.0% and their extent correlates weakly positively and significantly with clinical intra-pair discordance (Delta) (r2 = 0.378, p-value = 0.023) and age (r2 = 0.391, p-value = 0.009). This confirms other studies demonstrating, that monozygotic twins are epigenetically highly similar and reveals that this high similarity remains even in individuals clinically discordant for a severe inflammatory disease such as CF [21,86,87].

Differences in blood cell composition in CF twins are minor and occur mainly in discordant pairs

Blood cell-type composition is known to change upon inflammation [88]. Hence, the blood cell contribution of six different blood cells (granulocytes, CD4+ T cells, CD8+ T cells, B cells, monocytes and natural killer cells) has been estimated for all samples based on the obtained DNA methylation values using six reference methylomes and an algorithm developed by Houseman et al. [75,82]. Irrespective of the individual phenotypical severity and the intra-pair clinical discordance, blood cell compositions within one twin pair are noticeably similar. The minor intra-pair cell-type contribution differences occur mainly in discordant twin pairs and apparently display a shift from natural killer cells to granulocytes in the more severely affected individuals (Figure 1a, SupplementaryFigure 1a, b). This is in concordance with the fact, that a high abundance of neutrophils is frequently observed in individuals with cystic fibrosis and points out that despite clinical discordance, the blood cell-type composition differences in monozygotic CF twins are surprisingly small [88].
Figure 1.

Blood cell-type contribution in CF twins based on whole blood DNA methylation. (a) Estimated cell-type distribution (%) of six different cell-types in peripheral blood according to DNA methylation values in the 44 subjects representing 22 twin pairs. Heatmap of the contributions of six different cell-types in peripheral blood. Twin pairs are separated by black vertical lines. The twin with the milder phenotype is displayed on the left, whereas the one with the more severe phenotype is shown on the right. (b) Scatterplot of intra-pair cell-type contribution differences for six different cell-types. Clinical intra-pair discordance is indicated in colour.

Blood cell-type contribution in CF twins based on whole blood DNA methylation. (a) Estimated cell-type distribution (%) of six different cell-types in peripheral blood according to DNA methylation values in the 44 subjects representing 22 twin pairs. Heatmap of the contributions of six different cell-types in peripheral blood. Twin pairs are separated by black vertical lines. The twin with the milder phenotype is displayed on the left, whereas the one with the more severe phenotype is shown on the right. (b) Scatterplot of intra-pair cell-type contribution differences for six different cell-types. Clinical intra-pair discordance is indicated in colour. In order to investigate disease-related methylation alterations, a selection process was developed and applied to identify CpG sites which are differentially methylated in an intra-pair comparison, taking into account individual severity and intra-pair discordance. 75.2% of all investigated positions covered passed a strict quality control with removal of unreliable probes, probes with missing values and probes located in repetitive sequences. Only 0.86% of all positions investigated, present discordance-dependent differential methylation according to the defined requirements. Of these, 25.6% are consistently differentially methylated in one direction in a pair-wise comparison (e.g., either consistently hyper- or hypo-methylated in all of the more severely affected individuals of a twin pair) resulting in 802 disease-related differentially methylated positions (DMPs). Due to the strongly inflammatory component of CF, these positions are referred to as putative inflammation-related DMPs (iDMPs) (Figure 2, Supplementary table 5).
Figure 2.

Selection process applied for identification of acquired inflammation-related differentially methylated positions (iDMPs). 75.2% of all probes measured passed a strict quality control (QC) including removal of cross-reactive and unreliable probes, probes with missing values and probes located in repetitive sequences. 0.2% of all retained probes are considered disease-related differentially methylated according to the assumptions described in the methods. Given that 87% of iDMPs are significantly differentially methylated in different blood cell-types, 106 probes are presumed to be acquired iDMPs.

Selection process applied for identification of acquired inflammation-related differentially methylated positions (iDMPs). 75.2% of all probes measured passed a strict quality control (QC) including removal of cross-reactive and unreliable probes, probes with missing values and probes located in repetitive sequences. 0.2% of all retained probes are considered disease-related differentially methylated according to the assumptions described in the methods. Given that 87% of iDMPs are significantly differentially methylated in different blood cell-types, 106 probes are presumed to be acquired iDMPs. Since DNA methylation is known to be tissue- and cell-type specific, the list of candidate iDMPs was examined for known blood cell-type specific differentially methylated positions from a publicly available dataset [75,89]. Interestingly, 86.8% of iDMPs have been shown to be significantly differentially methylated between different blood cell-types and therefore could reflect differential blood cell counts between the twin pairs. This would be in concordance with CF being a disease characterized by massive chronic inflammation, especially in the lung with consequent alterations in the immune cell composition and underscores the applied selection process for inflammation-related differentially methylated positions [90]. The remaining 106 iDMPs are considered acquired in relation with the CF disease severity, which may be environmentally promoted and accordingly of particular interest for diseases [91] (Figure 2, Supplementary table 6).

Disease-related methylation differences in monozygotic twins are of a small magnitude

To validate the applied selection process, and thus the relation of the 106 selected candidate positions with the clinical discordance of the twin pairs, the extent of differential methylation was assessed. A trend from lower methylation differences to higher ones with increasing discordance can be observed for the selected iDMPs. Interestingly, twin pairs with unknown discordance exhibit the same mean of absolute sums of intra-pair methylation differences as twin pairs with intermediate discordance. Furthermore, concordant twin pairs display significantly (p = 0.004, one-sided Wilcoxon test) lower sums of intra-pair methylation differences than discordant twin pairs (Figure 3a). This confirms that the putative iDMPs are differentially methylated in a discordance-dependent manner as expected due to this type of selection process. However, when investigating the DNA methylation values in the twins, there are small differences between the individuals and the pairs clearly cluster together (Euclidean distance and complete hierarchical clustering), emphasizing the epigenetic similarity of twins even in the event of clinical discordance (Figure 3b).
Figure 3.

Methylation and intra-pair methylation differences in CF twins. (a) Absolute intra-pair methylation differences of putative acquired iDMPs. Clinical discordance is indicated in colour. Methylation differences are significantly higher (p = 0.004, one-sided Wilcoxon test) in discordant than in concordant twin pairs. The means and the medians of the absolute sums of the differences increase with the extent of clinical discordance. (b) Heatmap of methylation [%] in candidate iDMPs in 44 individuals. Rows and columns are clustered by complete hierarchical clustering and Euclidean distance.

Methylation and intra-pair methylation differences in CF twins. (a) Absolute intra-pair methylation differences of putative acquired iDMPs. Clinical discordance is indicated in colour. Methylation differences are significantly higher (p = 0.004, one-sided Wilcoxon test) in discordant than in concordant twin pairs. The means and the medians of the absolute sums of the differences increase with the extent of clinical discordance. (b) Heatmap of methylation [%] in candidate iDMPs in 44 individuals. Rows and columns are clustered by complete hierarchical clustering and Euclidean distance. In order to determine whether alterations in DNA methylation which are related to the disease phenotype of CF may accumulate in certain genomic or functional regions, enrichment analysis of the candidate iDMPs was performed. Acquired iDMPs occur to be relatively evenly distributed across the chromosomes. In comparison to all CG dinucleotides measured, they are not enriched in imprinted or (sub)telomeric regions. They do not accumulate in CG-rich or regulatory regions such as enhancers, DNA hypersensitivity sites or promoters. Also, no accumulation at gene bodies, transcription start sites or untranslated regions was observed. Of 106 iDMPs, 35 are located in a gene promoter and 47 in a gene body (including pseudogenes and antisense transcripts) and 21, 19 and 10 are located in a CpG island, CpG shore and CpG self, respectively (SupplementaryFigure 2). These 82 gene-associated iDMPs correspond to 81 inflammation-related differentially methylated regions (iDMRs) and 79 different genes. Gene ontology (GO) enrichment analysis was performed with gene-associated iDMPs, however it did not deliver significant (Fisher´s Exact test, FDR) enrichment for the biological processes, the molecular function or the cellular component of the respective genes. In summary, no positional or functional hotspots or patterns were observed for acquired iDMPs; however, due to the stringent selection process, only few observations could be included into the analyses, which makes it difficult to notice genome-wide patterns. It has been shown that environmental factors, such as cigarette smoking and inflammation can change DNA methylation patterns. These alterations can have an impact on the risk, the development and the prognosis of diseases such as cancer, which is known to be strongly associated with environmental factors (i.e., smoking) [22, 23, 92, 93, L. 24]. Since chronic inflammation is a major risk factor for cancer in general, putative inflammation-related variations in DNA methylation may be relevant for lung cancer. To identify positions which are of particular interest in the context of inflammation and lung cancer, the list of putative acquired iDMPs was inspected a) for the loci most prominently differentially methylated in the CF twin cohort and b) for loci for which methylation differences were also observed in TCGA in lung cancer tissues compared to normal adjacent tissues. Of 106 iDMPs, 11 were strongly (more than once ≥10%) or frequently (more than four times ≥5%) differentially methylated in the CF twin data and 21 were differentially methylated (≥5%) in LUAD and LUSC compared to the corresponding normal tissue in TCGA data, with an overlap of three iDMPs in these two groups. Two of the candidates are located in the same region (13) directly adjacent to each other in the gene body of Plakophilin 3 (PKP3) (SupplementaryFigure 3). DNA methylation of these selected acquired iDMPs and flanking regions was assessed in 218 whole blood samples from a lung cancer case-control study (Table 2). Of these 29 most interesting acquired iDMPs with regard to lung cancer, 21, corresponding to 20 genomic regions could be tested with bisulphite targeted-NGS in whole blood from lung cancer patients and healthy controls. The power to detect differences in mean methylation levels of at least 5% in the lung cancer case-control cohort of 109 pairs for all positions tested is above 99% (Supplementary table 4). To cover all samples, two flow cells (MiSeq, Illumina) were loaded. A mean cluster density of 776 K/mm2 could be achieved of which 94.66% (36.19 million reads of 38.27 million reads) passed the filter. On average, 80.62% of the bases hold a quality score above Q30. In order to obtain only high-quality sequences, on average 98.01% and 88.29% of all reads were trimmed from the forward and reverse reads, respectively. No reads were removed. Forward reads had a better quality than reverse reads which is generally observed in paired-end sequencing with Illumina (SupplementaryFigure 4). Of all reads, 62% could be assigned to the samples. We assume that approximately 25% of the library is PhiX spike-in. Of all reads obtained for the 218 samples, 94.65% passed the filter and could be processed with the methylation analysis pipeline ‘bicycle.’ 84.55% of all reads which passed the filter could be uniquely aligned to the human genome. 0.86% were mapped to both Watson (+) and Crick (-) strands, 2.74% were aligned to multiple regions in the genome and 5.57% of the reads were removed due to possibly failed bisulphite-conversion (SupplementaryFigure 4b). 98.55% of the uniquely aligned high-quality reads could be aligned on target (i.e., at the targeted regions) (SupplementaryFigure 4c). For all samples, a mean coverage of 1694x could be achieved across all regions of interest. No region had a mean coverage below 100x with an average of 1634x (range: 124x – 4841x) (SupplementaryFigure 4d).
Figure 4.

DNA methylation differences in lung cancer cases compared to healthy controls. (a) Heatmap of methylation differences in CG dinucleotides of 20 genomic regions (numbers), corresponding to 21 iDMPs (rows separated by black lines) in 209 pairs (columns). NAs are shown in black, clinical parameters are colour coded (b) Bar plot of the means of all absolute intra-pair methylation differences [%] per CG dinucleotide of the 20 numbered regions. Median values across the regions are indicated. The width of the boxes represents the number of observations (i.e., the number of CG dinucleotides per region).

DNA methylation differences in lung cancer cases compared to healthy controls. (a) Heatmap of methylation differences in CG dinucleotides of 20 genomic regions (numbers), corresponding to 21 iDMPs (rows separated by black lines) in 209 pairs (columns). NAs are shown in black, clinical parameters are colour coded (b) Bar plot of the means of all absolute intra-pair methylation differences [%] per CG dinucleotide of the 20 numbered regions. Median values across the regions are indicated. The width of the boxes represents the number of observations (i.e., the number of CG dinucleotides per region). Methylation differences between lung cancer case-control pairs are small in most regions. If differential methylation occurs, then it is region-specific with respect to extent and direction, implying that in one pair, the whole region is consistently differentially methylated (Figure 4a). Applying a two-sided, paired Student´s t-test, no significant differences of the tested iDMPs in lung cancer compared to controls could be observed after FDR correction of p-values. Principal component analysis did not reveal any clustering according to known clinical parameters (Supplementary Figure 5). Further, correlation and multiple regression analysis of mean methylation values per region and known clinical parameters of the samples did not disclose any relations (correlation and coefficient of determination values below 0.18) between methylation in the investigated regions and known clinical parameters (sex, cancer occurrence, cancer histology, age, packyears and smoking status). However, of 20 regions analysed, one region (#13) comprising two adjacent candidate iDMPs (cg17840408, cg25258098) appears to be highly variably methylated across the lung cancer-case control cohort in both directions (i.e., hyper- and hypo-methylated) irrespective of known clinical parameters (Figure 4b, Supplementary Figure 6). The iDMR of interest is located on a CpG island in the gene body of PKP3, encoding for the desmosomal protein PKP3, known to play a role in carcinogenesis [94]. It overlaps with a putative regulatory region (Supplementary Figure 7). CpG islands are often regulated via DNA methylation [83,95-98]. In contrast to promoter methylation, gene body methylation is more frequently, associated with active gene expression and may prevent cryptic transcription initiation [99,100]. Upregulation of gene expression by intragenic CpG island hypermethylation has been demonstrated to be associated with cancer [101]. Also, variably methylated regions have been shown to be enriched for enhancer histone modifications and transcription factor-binding sites and may exhibit important regulatory functions especially regarding cell- and tissue-type [102]. It is therefore suggested that this iDMR, hypermethylated in CF and highly variably methylated in lung cancer and healthy subjects, may have functional relevance in the context of lung inflammation and cancer.
Figure 5.

Promoter and enhancer activity of the methylated and unmethylated inflammation-related DMR in lung epithelial cells. Fold change relative luciferase activity in A549 and H1299 cells. The empty vector corresponds to a construct without promoter sequence. For the putative promoter vector, the sequence of interest has been cloned into the empty vector upstream of the luciferase gene. For the putative enhancer construct, the region of interest was cloned upstream of a minimal promoter for the luciferase gene. Both vectors were compared to their in silico methylated counterparts.

Promoter and enhancer activity of the methylated and unmethylated inflammation-related DMR in lung epithelial cells. Fold change relative luciferase activity in A549 and H1299 cells. The empty vector corresponds to a construct without promoter sequence. For the putative promoter vector, the sequence of interest has been cloned into the empty vector upstream of the luciferase gene. For the putative enhancer construct, the region of interest was cloned upstream of a minimal promoter for the luciferase gene. Both vectors were compared to their in silico methylated counterparts. PKP3 Evaluation of publicly available chromatin immunoprecipitation DNA-sequencing (ChIP-seq) data from the Broad institute and ENCODE revealed that 5´-adjacent to the candidate region in the PKP3 gene body is a CTCF binding site and that the iDMR may be located in an insulator in NHLF, HUVEC and H1-hESC cells; however, it could function also as a promoter for instance in HepG2 cells [77-80]. CTCF binding facilitates inter- and intra-chromosomal chromatin looping and thus enables physical proximity of regulatory regions. Further, CTCF binding sites colocalizes with promoters and enhancers and attracts or repels transcription factors and subsequently activates or represses transcription. CTCF preferentially binds to unmethylated CTCF binding sites and, in turn, binding of CTCF to insulator regions prevents its methylation [103-105]. DNA methylation of cis and trans regulatory functional elements, such as CTCF binding sites, impacts their function and subsequently results in activation or inactivation of gene expression in a context- and cell-type specific manner [32,106]. To evaluate the functional relevance of the putative regulatory region, associated with a CTCF binding site in the tissue of interest, luciferase reporter assays were performed in the lung epithelial cells A549 and H1299. Therefore, the region of interest was cloned into a CpG-free vector upstream of the luciferase gene with and without minimal promoter to determine functional activity as enhancer and promoter, respectively (Supplementary Figure 8). In both cell lines a highly significant increase in the relative luciferase activity (Welch Two Sample t-test, p-value = 0.0008513) could be shown for constructs containing the region of interest 5´ to the luciferase gene compared to vectors without the respective region. This indicates that the putative regulatory region functions as a promoter in lung epithelial cells. However, the insertion of the same regulatory element together with a minimal promoter did not result in enhanced luciferase activity in comparison to the empty plasmid, suggesting that it does not function as an enhancer in the tested cell lines. To determine the effect of DNA methylation of the regulatory region on luciferase expression, both constructs were in vitro methylated. Since the vectors are CpG free, only the inserted sequence is methylated. While no change in luciferase activity was shown for the methylated (A549: 1.20 ± 0.129; H1299 :1.25 ± 0.117) and unmethylated (A549: 1.15 ± 0.109; H1299 :1.14 ± 0.072) enhancer constructs, methylation of the promoter construct resulted in significantly decreased luciferase activity (A549: −3.66 ± 0.400; H1299: −3.78 ± 0.0644) compared to the unmethylated region (A549: −1.36 ± 0.0239; H1299: −0.94 ± 0.064) suggesting that the promoter function of the region of interest is depleted upon its methylation (Figure 5). Taken together, these results show that the iDMR located in the PKP3 gene body can function as a promoter in lung epithelial cells and that this function is eliminated via DNA methylation.

Discussion

In order to investigate inflammation-related aberrant DNA methylation, genome-wide methylation was analysed in twins with CF, as a chronic inflammatory lung disease. CF disease severity is highly associated with the degree and the duration of inflammatory processes, particularly in the lung [4]. In the lung, the neutrophilic inflammation leads to release of neutrophil elastases, matrix-metalloproteinases, nitric oxide and reactive oxygen species (ROS), which cannot be efficiently resolved and cause massive tissue injury. Therefore, the CF lung is flooded with proinflammatory mediators such as tumour necrosis factor alpha (TNF-α), interleukin (IL)-1β, IL-6, IL-8 and others. On the other hand, anti-inflammatory cytokines for instance, IL-10, are decreased, resulting in impaired resolution of inflammation and an extensive and chronic inflammatory state including defective apoptosis and phagocytosis [5,107]. Like CF, COPD is characterized by mucus hypersecretion, oxidative stress, neutrophilic inflammation and the involvement of macrophages. TNF‐α, IL‐1β, IL‐6 and IL‐8 are contributing to inflammation in COPD as well [108]. COPD is an inflammatory disease of the lung highly associated with lung cancer [55,109]. COPD and other chronic inflammatory disorders of the lung are associated with an increased risk for lung cancer development [53, J. 54, 55]. Also, the major risk factor for lung cancer, smoking, is known to trigger lung inflammation [110-112]. CF, although a disease characterized by massive chronic lung inflammation, is not associated with lung cancer risk [58]. This may be due to the fact that the early and strong impairment of the lung is fatal, or lung transplantation is indicated and performed prior to the development of lung cancer [113,114]. It has been even suggested that the p.Phe508del deletion has a protective role in lung cancer [115]. However, due to its severe phenotype, we consider CF an appropriate model for investigating excessive and persistent chronic inflammation of the lung with potential implications for lung cancer. Since the CFTR genotype, the genomic background, sex, age and socioeconomic factors can be neglected in this study exploring monozygotic twins, we assume that clinical intra-pair discordance is at least to some extent due to a different degree of inflammation [84,116,117]. Global alterations of DNA methylation patterns have been described in cancer and ageing [118,119]. Investigating the global methylome of monozygotic twins, no overall alterations in DNA methylation were observed in intra-pair comparisons. These observations were independent of clinical discordance of the respective twin pair and are in accordance with previous studies, showing that monozygotic twins are epigenetically highly similar and emphasize the importance of age, sex, genetic background and environment for the shape of the methylome [86,87,120-126]. Global and locus-specific DNA methylation differences as well as phenotypic discordance in monozygotic twins are known to increase with age [127,128]. In this study, no relation between the absolute global methylation difference and the age of the twin pairs could be observed. However, the twins evaluated in this study are relatively young with a mean age of 14.4 years. DNA methylation is known to be cell-type-specific. Therefore, DNA methylation values obtained from an epigenome-wide analysis can be applied to estimate the cell-type proportion in a sample with a mixed cell population [75,82]. Our data indicate high similarities in blood cell proportion in intra-pair comparisons. Cell-type composition differences were mainly observed in clinically discordant twins which is in accordance with the fact that inflammatory diseases alter the abundance of immunological cells in blood [129]. Relative differential cell-type counts are mainly attributed to granulocytes and natural killer cells in an inverse manner. A larger fraction of granulocytes in blood from more severely diseased individuals, compared to their less severely affected twin counterparts, may point to neutrophilia, the high abundance of neutrophils (granulocytes, basophils and eosinophils), in the blood and subsequently the lung which in turn is a well-known symptom in CF [88]. To discover whether chronic lung inflammation may cause locus-specific differential methylation in the blood, positions differentially methylated in a severity- and discordance-specific manner were selected. Thus, a subset of putative iDMPs could be identified. Our data indicate that a large proportion of these iDMPs have been shown to be differentially methylated in a blood cell-type-specific manner and hence may be attributed to cell-type composition effects [75,89]. Given that the abundance of immune cells in the blood is likely to be altered in an inflammatory setting, this emphasizes that the selected loci may be related to inflammation [129]. All other iDMPs are considered to be acquired due to exposure to the inflammatory environment. Acquired alteration of DNA methylation induced by environmental factors is of particular interest in the context of diseases [91,130,131]. The 106 iDMPs we observed in our study are not enriched in genomic location and in relation to particular genes or to functional regions. Seventy-seven percent of the iDMPs are considered gene-associated; however, their respective genes are not enriched in biological process, molecular function or cellular components. However, due to the stringent selection process, the number of observed iDMPs may have been too small for these analyses. Among the 21 tested loci, none was differentially methylated in blood from lung cancer cases compared to healthy controls. However, one DMR including two iDMPs, cg17840408 and cg25258098, directly adjacent to each other and hyper-methylated in blood of the CF twins, was found to be highly variably methylated in whole blood of lung cancer cases compared to healthy controls, irrespective of known clinical parameters (sex, age, packyears, current smoking status and lung cancer histology in cases) which are known to be related to DNA methylation [24,121,132,133]. This variable methylation is extended also to the 5´- and 3´-flanking regions. The iDMR is located on chromosome 11p15.5, a subtelomeric region, in a CpG island indicating a regulatory function of DNA methylation in this region since these CpG dense regions are often prone to regulation by DNA methylation [83,95-98]. In non-related individuals, one major cause of interindividual methylation variability are genetic differences, whereas in monozygotic twins it is determined either by stochastic effects, which result in an epigenetic drift or by environmental exposure, such as chronic inflammation [122,128]. It has been shown that DNA methylation at certain CpG sites differs between distinct tissues, whereas others are consistently methylated in any tissue of an individual but display interindividual differences. The latter may be particularly valuable in the context of risk biomarkers in peripheral blood independently of functional relevance as they may offer information regarding epigenetic regulation in less accessible tissues [134]. ChIP-seq data from the Broad institute and ENCODE revealed that in lung cells the putatively inflammation-related differentially methylated region may function as an insulator, whereas in other cell-types it may have promoter activity [77-80]. CTCF binding sites regulate inter- and intra-chromosomal interactions between functional elements, enable three-dimensional chromatin structure, separate topological regions and regulate transcription factor accessibility. Therefore, CTCF binding can dynamically control gene transcription and alternative splicing events and thus contributes to dynamic processes such as cell differentiation and generation of immune cell diversity [103,105]. Our results indicate, that the iDMR functions as a promoter in lung epithelial cells, which is inactivated upon methylation. The putatively regulatory iDMR, overlapping with a CTCF-binding site is located in the gene body of PKP3 (NM_007183). PKP3 encodes for the 81.081kD PKP3 protein which is expressed in the desmosomes and the nucleus of the simple and the most stratified epithelia and plays a role in linking cadherins and intermediate filaments in the cytoskeleton, in cell–cell communication by enhancing the desmosomal strength, in the RNA metabolism and as transcription factor associated with cancer and metastasis [135-139]. Desmosomal proteins such as PKP3 have implications in epithelial integrity and tissue remodelling, both hallmarks of inflammatory lung diseases, such as CF [140,141]. They further appear to have tumour-promoting as well as tumour-suppressing properties in distinct cancer entities. They are essential for epithelial homoeostasis, tissue integrity and regulation of cell proliferation, differentiation, migration, invasion and apoptosis and are regulated by growth factors [94,142]. Altered expression of PKP3 transcripts has already been described in connection with lung cancer and other cancer types. Depending on the type of cancer, over-expression of PKP3 protein and PKP3 mRNA have been shown to have oncogenic and tumour suppressing effects [94,143-146]. In non-small cell lung tumours, which include lung adenocarcinomas and squamous cell carcinomas, PKP3 is described as an oncogene, which has the potential as a prognostic marker but also in the therapeutic area [147]. Desmosomal loss and loss of haematopoietic PKP3 is associated with increased inflammation [136,148,149]. Overexpression of adhesion molecules such as PKP3 could be demonstrated to be associated with differential infiltration of various immune cells in human lung cancer tissue, making the demosome status a potential biomarker in the context of immunotherapies [150]. Also, in the airway epithelia of CF lungs, epithelial junctions and integrity may have major roles in permeability to ions and pathogen products and subsequently lung inflammation [151]. Interestingly, aberrant methylation and expression of PKP3 has been associated with endometriosis, which is an inflammatory disease of the endometrium and has been observed in the mucosa of ulcerative colitis patients with and without cancer [152-154]. Directly adjacent to PKP3 is the single immunoglobin interleukin-1 (IL-1)-Related Receptor (SIGIRR) gene, which is also known as IL-1 receptor 8 (IL-1R8). IL-1R8 is anti-inflammatory and shows its effect against inflammatory diseases, also in the lung [L. 155]. It is interesting to note that frequent polymorphisms in the region comprising PKP3, SIGIRR and another gene, TMEM16J, have been linked to susceptibility to tuberculosis, another inflammatory lung disease [156]. At present it is not yet possible to say for certain which genes in which cell-types are regulated by the methylation-dependent promoter iDMR. However, due to the functional studies performed, it can be hypothesized that this iDMR acts as an alternative promoter for the PKP3 gene. In summary, a unique cohort of monozygotic CF twins has been clinically and epigenetically characterized in this study. The remarkable epigenetic intra-pair similarity observed even in clinically discordant twins, indicates that besides genetic and epigenetic phenotype-contribution further factors, such as the microbiome, that is likewise related to CF disease severity and progression, have a major impact on the individual development of cystic fibrosis [157,158]. Selected positions could be shown to be differentially methylated in CF twins in a discordance-dependent manner. These could reflect a consequence of chronic lung inflammation for the DNA methylome in the blood or may have a functional effect on the phenotype. Additionally, we observed one inflammation-related differentially methylated region to be highly variably methylated in the whole blood of individuals with and without lung cancer. Functional assays revealed that this region functions as a methylation-dependent promoter in lung epithelial cells. This suggests that the region in the PKP3 gene body is a functional promoter in lung tissues which is hypermethylated upon inflammation and variably methylated in the context of lung cancer. Whether this regulatory region may act as an alternative promoter for PKP3 gene or has insulatory or chromatin remodelling function, thus influencing nearby genes such as SIGIRR and TMEM16J in situ, may be elucidated in further research. A better understanding of the effect of chronic inflammation on DNA methylation and the presentation of inflammation-related epigenetic changes in lung cancer may improve existing strategies in risk assessment of inflammatory lung diseases and lung cancer. Thus, it can provide further insights into the mechanisms of carcinogenesis promoted by inflammation, such as epithelial to mesenchymal transition and epithelial tissue integrity. These insights have the potential to pave the way for advances in risk stratification, prognosis and treatment of inflammation-related diseases and cancer. Click here for additional data file.
  158 in total

Review 1.  Twins. Novel uses to study complex traits and genetic diseases.

Authors:  A J MacGregor; H Snieder; N J Schork; T D Spector
Journal:  Trends Genet       Date:  2000-03       Impact factor: 11.639

2.  Correlation of Smoking-Associated DNA Methylation Changes in Buccal Cells With DNA Methylation Changes in Epithelial Cancer.

Authors:  Andrew E Teschendorff; Zhen Yang; Andrew Wong; Christodoulos P Pipinikas; Yinming Jiao; Allison Jones; Shahzia Anjum; Rebecca Hardy; Helga B Salvesen; Christina Thirlwell; Samuel M Janes; Diana Kuh; Martin Widschwendter
Journal:  JAMA Oncol       Date:  2015-07       Impact factor: 31.777

3.  Age-dependent DNA methylation of genes that are suppressed in stem cells is a hallmark of cancer.

Authors:  Andrew E Teschendorff; Usha Menon; Aleksandra Gentry-Maharaj; Susan J Ramus; Daniel J Weisenberger; Hui Shen; Mihaela Campan; Houtan Noushmehr; Christopher G Bell; A Peter Maxwell; David A Savage; Elisabeth Mueller-Holzner; Christian Marth; Gabrijela Kocjan; Simon A Gayther; Allison Jones; Stephan Beck; Wolfgang Wagner; Peter W Laird; Ian J Jacobs; Martin Widschwendter
Journal:  Genome Res       Date:  2010-03-10       Impact factor: 9.043

Review 4.  Developing in 3D: the role of CTCF in cell differentiation.

Authors:  Rodrigo G Arzate-Mejía; Félix Recillas-Targa; Victor G Corces
Journal:  Development       Date:  2018-03-22       Impact factor: 6.868

5.  Gene modifiers of cystic fibrosis lung disease: A systematic review.

Authors:  Shivanthan Shanthikumar; Melanie N Neeland; Richard Saffery; Sarath Ranganathan
Journal:  Pediatr Pulmonol       Date:  2019-05-29

6.  The incidence of cystic fibrosis in Caucasian populations.

Authors:  W J Warwick
Journal:  Helv Paediatr Acta       Date:  1978-06

7.  Chloride conductance and genetic background modulate the cystic fibrosis phenotype of Delta F508 homozygous twins and siblings.

Authors:  I Bronsveld; F Mekus; J Bijman; M Ballmann; H R de Jonge; U Laabs; D J Halley; H Ellemunter; G Mastella; S Thomas; H J Veeze; B Tümmler
Journal:  J Clin Invest       Date:  2001-12       Impact factor: 14.808

Review 8.  Dynamics and Context-Dependent Roles of DNA Methylation.

Authors:  Christina Ambrosi; Massimiliano Manzo; Tuncay Baubec
Journal:  J Mol Biol       Date:  2017-02-16       Impact factor: 5.469

9.  High-specificity bioinformatics framework for epigenomic profiling of discordant twins reveals specific and shared markers for ACPA and ACPA-positive rheumatoid arthritis.

Authors:  David Gomez-Cabrero; Malin Almgren; Louise K Sjöholm; Aase H Hensvold; Mikael V Ringh; Rakel Tryggvadottir; Juha Kere; Annika Scheynius; Nathalie Acevedo; Lovisa Reinius; Margaret A Taub; Carolina Montano; Martin J Aryee; Jason I Feinberg; Andrew P Feinberg; Jesper Tegnér; Lars Klareskog; Anca I Catrina; Tomas J Ekström
Journal:  Genome Med       Date:  2016-11-22       Impact factor: 11.117

10.  Hypomethylation of smoking-related genes is associated with future lung cancer in four prospective cohorts.

Authors:  Francesca Fasanelli; Laura Baglietto; Erica Ponzi; Florence Guida; Gianluca Campanella; Mattias Johansson; Kjell Grankvist; Mikael Johansson; Manuela Bianca Assumma; Alessio Naccarati; Marc Chadeau-Hyam; Ugo Ala; Christian Faltus; Rudolf Kaaks; Angela Risch; Bianca De Stavola; Allison Hodge; Graham G Giles; Melissa C Southey; Caroline L Relton; Philip C Haycock; Eiliv Lund; Silvia Polidoro; Torkjel M Sandanger; Gianluca Severi; Paolo Vineis
Journal:  Nat Commun       Date:  2015-12-15       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.