Literature DB >> 34447695

EGFR DNA Methylation Correlates With EGFR Expression, Immune Cell Infiltration, and Overall Survival in Lung Adenocarcinoma.

Zhanyu Xu1, Fanglu Qin2, Liqiang Yuan1, Jiangbo Wei1, Yu Sun1, Junqi Qin1, Kun Deng1, Tiaozhan Zheng1, Shikang Li1.   

Abstract

BACKGROUND: The epidermal growth factor receptor (EGFR) is a primary target of molecular targeted therapy for lung adenocarcinoma (LUAD). The mechanisms that lead to epigenetic abnormalities of EGFR in LUAD are still unclear. The purpose of our study was to evaluate the abnormal methylation of EGFR CpG sites as potential biomarkers for LUAD.
METHODS: To assess the differentially methylation CpG sites of EGFR in LUAD, we used an integrative study of Illumina HumanMethylation450K and RNA-seq data from The Cancer Genome Atlas (TCGA). We evaluated and compared EGFR multiple-omics data to explore the role of CpG sites located in EGFR promoter regions and gene body regions and the association with transcripts, protein expression levels, mutations, and somatic copy number variation. We calculated the correlation coefficients between CpG sites of EGFR and immune infiltration fraction (by MCPcounter and ESTIMATE) and immune-related pathways in LUAD. Finally, we validated the differential methylation of clinically and prognostically relevant CpG sites using quantitative methylation-specific PCR (qMSP).
RESULTS: We found that the methylation level of many EGFR CpGs in the promoter region was negatively correlated with the transcription level, protein expression, and SCNV, while the methylation at the gene body region was positively correlated with these features. The methylation level of EGFR CpGs in the promoter region was positively correlated with the level of immune infiltration and IFN-γ signature, while the opposite was found for methylation of the gene body region. The qMSP results showed that cg02316066 had a high methylation level, while cg02166842 had a low methylation level in LUAD. There was a high degree of co-methylation between cg02316066 and cg03046247.
CONCLUSION: Our data indicate that EGFR is an epigenetic regulator in LUAD acting through DNA methylation. Our research provides a theoretical basis for the further detection of EGFR DNA methylation as a predictive biomarker for LUAD survival and immunotherapy.
Copyright © 2021 Xu, Qin, Yuan, Wei, Sun, Qin, Deng, Zheng and Li.

Entities:  

Keywords:  DNA methylation; EGFR; lung adenocarcinoma; tumor biomarkers; tumor-infiltrating

Year:  2021        PMID: 34447695      PMCID: PMC8383738          DOI: 10.3389/fonc.2021.691915

Source DB:  PubMed          Journal:  Front Oncol        ISSN: 2234-943X            Impact factor:   6.244


Introduction

Lung cancer is the primary cause of cancer-related death worldwide (1). The most prominent pathological subtype of lung cancer is lung adenocarcinoma (LUAD), which accounts for about 45 percent of lung cancer cases (2). The five-years overall survival rate for patients with advanced lung cancer is less than 20% (3). Genetic analyses have revealed driver genes in LUAD and have changed the treatment paradigm (4). In Asia, epidermal growth factor receptor (EGFR) mutations account for 51.4% of advanced LUAD driver mutations, while it accounts for 15 to 22% of advanced LUAD driver mutations in non-Asian areas (5, 6). EGFR tyrosine kinase inhibitors (TKIs) are typically used to treat patients with EGFR-mutant LUAD (7, 8). The high heterogeneity of this type of cancer, on the other hand, restricts the survival advantage of patients undergoing EGFR-targeted treatment, indicating the need for more research into new prognosis-related molecular mechanisms. The molecular mechanism by which EGFR regulates LUAD through DNA methylation has yet to be completely elucidated. EGFR is a receptor tyrosine kinase (TK) that dimerizes in response to ligand stimulation, resulting in the activation of intracellular TKs and autophosphorylation of multiple tyrosine residues, which triggers a sequence of downstream signaling cascades (9, 10). The Ras/MAPK and PI3K/PKB signaling pathways are two of the most studied EGFR pathways, both of which have a well-established role in tumor development, survival, and progression (11). The most studied epigenetic mechanism is DNA methylation, which is linked to cell division, immune regulation, and X chromosome inactivation (12). Methylation of gene promoter regions is often linked to transcriptional silencing, while methylation of gene bodies has the opposite effect. Epigenetic dysregulation has been linked to the early stages of oncogenic transformation in a variety of solid tumors, and it can be used as a biomarker for early detection, systemic sampling, and prognosis in a variety of human cancers (13). Indeed, Haijing Liu et al. (14) reported the potential link between EGFR alterations at the multi-omics levels and clinical prognosis by pan-cancer analysis, but the relationship between DNA methylation of EGFR in LUAD and immune infiltration has not been reported. For patients with LUAD who do not benefit from targeted therapy, the discovery of DNA methylation-related immune landscapes has important implications for the molecular mechanisms of immunotherapy. Using the LUAD dataset from The Cancer Genome Atlas (TCGA), we performed a comprehensive multi-omics data assessment of EGFR-annotated CpGs. We investigated whether EGFR CpG methylation sites correlated with EGFR gene expression, protein levels and overall survival (OS) time. We further explored the relationship between somatic copy number variation (SCNV) and DNA methylation of EGFR, and the link between EGFR CpG methylation sites and LUAD immunological infiltration cells and immune-related pathways.

Materials and Methods

EGFR CpGs and TCGA Data Analysis

The Illumina 450K TCGA dataset was used to extract methylation and expression data for all EGFR CpGs. We investigated 49 CpGs in the promoter and gene body regions of EGFR. depicts the EGFR genome arrangement () and the relative locations of all EGFR CpGs genomes (). Clinical information is integrated with data from the GDC data portal (https://portal.gdc.cancer.gov) (15). These data included 535 LUAD patients, as well as information on EGFR status and somatic copy number variation (SCNV). Methylation data came in the form of beta-values, while expression data came in the form of TPM (Transcripts Per kilobase Million)-normalized read counts. cBioProtal (http://www.cbioportal.org/study?id=luad_tcga#summary) was used to retrieve RPPA (Reverse phase protein array)-based protein expression data (16).
Figure 1

EGFR Genomic structure, CpG site landscape, methylation level. (A) Schematic representation of the EGFR gene structure within the human hg19 genome sequence. (B) Overview of 49 analyzed methylation sites of EGFR. (C) Correlation analysis between DNA methylation and mRNA expression of EGFR in the TCGA LUAD cohort. (D) Correlation analysis between DNA methylation and protein expression of EGFR in the TCGA LUAD cohort. (E) Promoter methylation levels of EGFR on normal and LUAD tissues. (F) EGFR mRNA expression in normal and LUAD groups.

EGFR Genomic structure, CpG site landscape, methylation level. (A) Schematic representation of the EGFR gene structure within the human hg19 genome sequence. (B) Overview of 49 analyzed methylation sites of EGFR. (C) Correlation analysis between DNA methylation and mRNA expression of EGFR in the TCGA LUAD cohort. (D) Correlation analysis between DNA methylation and protein expression of EGFR in the TCGA LUAD cohort. (E) Promoter methylation levels of EGFR on normal and LUAD tissues. (F) EGFR mRNA expression in normal and LUAD groups.

Survival Analysis

For all available LUAD samples, Kaplan-Meier survival analysis curves (17) for the 49 EGFR CpG were plotted, with a P-value of 0.05 used as a statistical threshold, according to the group with high or low methylation. We did the above survival analysis curves in the EGFR wild-type group, EGFR mutation group and EGFR wild-type and PDL1 high expression group.

Assessment of Immune Cells Infiltration, ESTIMATE Scores

We employed the R package ESTIMATE (18) to investigate the immune invasion of LUAD samples. After that, to obtain a more detailed picture of immune cell-types and other stromal cells infiltration, R package MCPcounter was used (19). MCPcounter utilizes the scoring data for individual tumor specimens (20). We calculated the Pearson correlation between the β value of the CpG site on EGFR and the score of immune infiltration.

Sample Collection

20 paired LUAD and non-cancerous lung tissue samples were obtained from the First Affiliated Hospital of Guangxi Medical University from September 2019 to February 2020, and stored at -80°C. LUAD diagnosis was confirmed by two independent pathologists. This study was approved by the ethics committee of the First Affiliated Hospital of Guangxi Medical University.

DNA Extraction, DNA Sodium Bisulfite Conversion, and Quantitative Methylation Specific PCR

Genomic DNA was isolated from individual specimens using a CTAB DNA extraction (21). Nanodrop 2000 spectrophotometer (Thermofisher, USA) was used to detect the concentration and purity of the DNA extraction, and nucleic acid gel electrophoresis was used to detect DNA integrity (22). Samples were then bisulfite-converted using the Epitect Fast DNA Bisulfite Kit (Qiagen; 59824), according to the manufacturer protocol. The purified products were quantitated using a Qubit ssDNA Assay kit (Thermo, Q10212). Primer Premier 6.0 software (Premier, Canada) was used to design the primer sequences to target CpG sites. Fully methylated genomic DNA after bisulfite treatment and normal genomic DNA (not transformed with bisulfite) were used as templates. Uncalibrated methylation levels, roughly equivalent to percent methylation, were calculated using cycle threshold (CT) values obtained from probes that specifically bind to methylated (CTmethylated) and unmethylated (CTunmethylated) DNA, respectively (methylation [%] =100%/(1 + 2CTmethylated–CTunmethylated). The primers were utilized as shown in . QMSP was performed using an Applied Biosystems 7900HT Fast Real-Time PCR system (Waltham, Massachusetts, USA) with the following temperature profile: 5 minutes at 95°C, followed by 40 cycles of 15 seconds at 95°C, 30 seconds at 60°C, and 60 seconds at 60°C.
Table 1

Primer sequences used in this study.

NameSequence (5’-3’)TypesProducts (bp)
cg02316066-MTGTGGGGTTACGGGTAAGTTTCForward Primer170
cg02316066-MTCTACCAATTATAAATCTAATATCACATACReverse Primer
cg02316066-UTGTGGGGTTATGGGTAAGTTTTForward Primer170
cg02316066-UTCTACCAATTATAAATCTAATATCACATACReverse Primer
cg03046247-MTGGAAATAGTATAAATTGGAGGTGAForward Primer228
cg03046247-MAACTACGCTATTTTAAAAACCACGReverse Primer
cg03046247-UTGGAAATAGTATAAATTGGAGGTGAForward Primer228
cg03046247-UAAAAACTACACTATTTTAAAAACCACAReverse Primer
cg02166842-MGAGTGAGTGGGTTTAGTTAAGTGAGTForward Primer170
cg02166842-MACCCTCCTAAATATAATATTTACACGReverse Primer
cg02166842-UGAGTGAGTGGGTTTAGTTAAGTGAGTForward Primer170
cg02166842-UAACCCTCCTAAATATAATATTTACACAReverse Primer

M, methylated; U, unmethylated.

Primer sequences used in this study. M, methylated; U, unmethylated.

Statistical Analysis

Pearson correlation coefficients were used to assess correlations between EGFR mRNA expression, protein expression, immune score, and all individual beta values of EGFR in the TCGA dataset. Wilcoxon rank sum test with continuity correction was used to assess differential methylation. Results were deemed significant if p<0.05.

Results

Differential Methylation Analysis of EGFR and Its CpG Sites

We sought to explore whether variations in DNA methylation were linked to EGFR expression abnormalities. The cbioprotal official website analysis showed that EGFR hypermethylation is inversely correlated with mRNA (r2 = -0.38, P = 1.14e-16) () and protein (r2 = -0.39, P = 5.76e-13) () overexpression in LUAD of TCGA. We next analyzed the relationship between the methylation levels of the EGFR promoter and the clinicopathological parameters of LUAD patients by UALCAN. EGFR were significantly hypermethylated in LUAD tissues when compared with normal lung tissues (P = 3.38e-3) (), and the mRNA levels of EGFR in LUAD were remarkably lower than those in normal lung tissues (P = 1.41e-4) (). In the same cohort of LUAD patients from the TCGA database we collected methylation data from the Infinium HumanMethylation450 BeadChip for 49 CpG sites of EGFR ( and ). Six CpG sites were located in promoter regions (cg16751451, cg07311521, cg03860890, cg22396409, cg05064645, cg14094960) and 43 were in the gene body or in the 3’ UTR regions. There were 34 of 49 CpG sites that were differentially methylated between LUAD tissues and control groups (P < 0.05). Five CpG sites in promoter regions and 17 CpG sites in the gene body had a significantly higher percentage of methylation in LUAD when compared to normal lung tissues. Meanwhile, cg16751451 in promoter regions and 11 CpG sites in the gene body or 3’ UTR regions had a significantly lower percentage of methylation in LUAD compared to normal lung tissues.
Table 2

Differential methylation levels of EGFR CpG sites among different subgroups.

CpG sitePositionMean methylation levelp valueMean methylation levelp valueMean methylation levelp value
normalLUADEGFR-mutationEGFR-wild-2 -1 0 1 2
cg03860890 TSS15000.130.16 9.80E-04 0.140.16 3.40E-04 0.180.180.160.150.14 2.30E-02
cg05064645 5’UTR;1stExon0.050.08 4.00E-04 0.060.08 1.10E-02 0.200.090.090.070.06 3.90E-07
cg07311521 TSS15000.030.05 3.90E-03 0.040.05 6.60E-03 0.040.050.050.050.068.10E-01
cg14094960 5’UTR;1stExon0.080.11 4.70E-05 0.100.111.70E-010.240.120.120.100.09 1.50E-06
cg16751451 TSS15000.380.36 1.50E-02 0.300.36 1.00E-04 0.420.390.360.350.332.50E-01
cg22396409 TSS15000.130.16 1.20E-02 0.150.16 3.80E-02 0.150.170.160.160.167.50E-01
cg01461514Body0.590.39 2.20E-15 0.320.40 7.20E-07 0.490.470.400.390.26 3.70E-10
cg02003682Body0.810.76 1.00E-04 0.770.75 3.60E-02 0.720.710.750.770.74 4.00E-03
cg02166842Body0.600.53 2.60E-03 0.420.54 2.40E-10 0.590.580.530.530.42 7.30E-04
cg02316066Body0.630.67 3.50E-03 0.690.66 6.60E-03 0.580.650.650.680.71 1.40E-04
cg03046247Body0.730.701.30E-010.710.701.30E-010.620.700.690.710.71 2.50E-02
cg04116217Body0.790.76 3.20E-02 0.770.768.30E-020.670.720.750.770.74 2.50E-02
cg04625338Body0.310.29 1.30E-02 0.280.294.80E-010.250.300.280.310.263.00E-01
cg05207583Body0.740.76 1.60E-04 0.720.77 1.20E-04 0.660.770.770.770.72 2.70E-03
cg05530630Body0.790.80 2.90E-02 0.800.80 4.10E-02 0.720.800.790.810.80 1.60E-02
cg05537387Body0.810.808.10E-010.760.80 1.10E-02 0.720.790.790.810.76 2.00E-02
cg05898452Body0.590.611.20E-010.630.60 1.10E-02 0.460.580.590.620.64 2.40E-04
cg06052090Body0.670.74 5.80E-07 0.680.75 1.30E-03 0.680.740.720.750.746.10E-02
cg10002850Body0.700.80 1.60E-06 0.690.81 1.30E-05 0.830.820.810.800.68 4.70E-03
cg10550611Body0.760.722.80E-010.700.722.30E-010.730.710.720.720.709.10E-01
cg10690277Body0.860.851.20E-010.840.852.60E-010.790.850.850.850.82 7.40E-03
cg11849717Body0.120.124.40E-010.110.12 9.10E-03 0.160.130.130.110.11 2.00E-03
cg14344486Body0.700.708.80E-020.600.72 1.50E-10 0.750.730.720.700.60 1.80E-04
cg14688342Body0.480.43 3.30E-04 0.420.433.30E-010.420.420.430.430.429.30E-01
cg15692229Body0.730.86 2.20E-16 0.850.867.40E-010.860.860.850.860.841.30E-01
cg16488565Body0.540.62 4.90E-10 0.630.62 3.60E-02 0.510.600.610.630.61 5.70E-04
cg16589260Body0.760.79 1.70E-03 0.790.795.40E-010.740.810.790.790.788.40E-02
cg17319788Body0.850.813.60E-010.800.823.30E-010.790.810.830.820.68 2.70E-10
cg17389149Body0.860.858.70E-010.860.85 2.30E-02 0.830.820.840.860.86 2.40E-02
cg18071865Body0.580.61 3.30E-03 0.630.619.40E-020.510.590.600.630.64 8.80E-04
cg18452131Body0.830.79 2.70E-04 0.790.795.50E-010.710.780.780.800.76 2.10E-02
cg18809076Body0.790.795.10E-020.710.80 1.00E-09 0.780.820.800.780.65 3.70E-10
cg20041612Body0.700.642.60E-010.600.65 7.80E-03 0.550.620.660.640.601.50E-01
cg20062492Body0.790.73 1.30E-06 0.730.721.50E-010.640.720.720.740.72 2.30E-02
cg20706768Body0.750.78 5.20E-04 0.780.789.20E-010.670.780.770.790.81 1.20E-04
cg20773588Body0.740.78 5.60E-04 0.790.785.10E-020.610.770.770.800.73 9.00E-04
cg21681212Body0.600.74 4.90E-14 0.720.747.70E-020.680.730.730.740.718.70E-02
cg21808635Body0.770.83 6.00E-06 0.840.833.70E-010.710.830.820.830.86 2.70E-04
cg21901928Body0.730.75 5.60E-03 0.740.753.70E-010.660.720.740.760.74 3.80E-02
cg22427313Body0.680.691.00E-010.680.696.00E-010.650.640.660.710.69 9.90E-04
cg23757825Body0.880.841.70E-010.820.85 5.80E-03 0.760.810.850.850.80 7.80E-03
cg25311271Body0.090.10 3.00E-03 0.100.116.10E-020.140.110.110.100.09 1.30E-03
cg25815893Body0.820.802.00E-010.810.80 4.80E-02 0.730.780.800.810.77 1.80E-02
cg26055062Body0.680.76 1.70E-08 0.680.77 1.70E-05 0.730.780.760.760.724.90E-01
cg26277197Body0.410.391.10E-010.310.40 6.00E-09 0.510.430.410.370.28 3.30E-08
cg27598340Body0.950.85 4.20E-03 0.880.842.00E-010.650.770.830.870.84 2.30E-02
cg27637738Body0.480.53 2.70E-02 0.420.54 1.70E-07 0.510.600.490.550.48 4.30E-05
ch.7.1264585RBody0.120.11 2.50E-02 0.110.117.70E-010.100.110.110.110.108.20E-01
cg084282663’UTR0.820.78 1.30E-03 0.780.771.10E-010.670.770.760.790.78 3.20E-03

(Somatic copy number variation type: -2, shallow deletion; -1, diploid; 0, normal; 1, gain; 2, amplification). CpG sites in bold values indicate located in the EGFR promoter region. The p-values in bold values indicate statistical differences (p < 0.05).

Differential methylation levels of EGFR CpG sites among different subgroups. (Somatic copy number variation type: -2, shallow deletion; -1, diploid; 0, normal; 1, gain; 2, amplification). CpG sites in bold values indicate located in the EGFR promoter region. The p-values in bold values indicate statistical differences (p < 0.05). LUAD with EGFR mutations is a subtype of LUAD with a particular molecular mechanism and selective treatment (23). We analyzed EGFR methylation changes of LUAD in the EGFR mutations group and EGFR wild-type (non-mutated) group. We found that in the EGFR mutation group, EGFR showed a significant hypomethylation state. Interestingly, when performing differential methylation analysis of CpG sites we found that 19 of 26 CpGs were significantly hypomethylated while 7 CpGs were significantly hypermethylated in the EGFR mutation group compared to the EGFR wild-type group. We further explored the relationship between SCNV and DNA methylation of EGFR, and found that there was a negative correlation between SCNV and DNA methylation for some CpGs (cg05064645, cg14094960, cg25311271, cg11849717, cg03860890), while for the majority (26/31 CpGs) there was a positive correlation.

OS-Related CpG Sites

To explore the prognostic value of 49 CpGs of EGFR in LUAD, we constructed survival curves to evaluate the association between CpGs and OS with the Kaplan-Meier method. A total of 10 CpG sites were significantly associated with the OS of LUAD patients (). Except for cg05064645, the hypermethylation of cg27637738, cg16751451, cg02316066, cg22396409, cg03046247, cg02166842, cg21901928, cg07311521 and cg06052090 CpG sites revealed poor prognosis of LUAD patients (p<0.05).
Figure 2

KM curves of EGFR CpG sites. (A) Kaplan-Meier analysis regarding overall survival in patients with LUAD stratified according to EGFR methylation CpG sites. (B, C) Kaplan-Meier analysis of the overall survival rate of LUAD patients based on the EGFR mutation group and the EGFR wild-type PDL1 high expression group.

KM curves of EGFR CpG sites. (A) Kaplan-Meier analysis regarding overall survival in patients with LUAD stratified according to EGFR methylation CpG sites. (B, C) Kaplan-Meier analysis of the overall survival rate of LUAD patients based on the EGFR mutation group and the EGFR wild-type PDL1 high expression group. We next performed a survival analysis in the EGFR-mutant and EGFR-wild subsets separately. In the EGFR mutation group, the hypermethylation of cg01461514, cg26277197 and cg25311271 was associated with a good prognosis of LUAD (p<0.05), while the hypermethylation of cg21901928, cg22427313, cg02316066, cg26055062, cg10002850 and cg03046247 was associated with a poor prognosis (p<0.05) (). In patients with wild type EGFR, high levels of PDL1 expression affected the prognosis of immunotherapy (24). In the EGFR wild-type group, we divided LUAD patients into three equal parts according to the mRNA expression level of PDL1, and the group with the highest expression was identified as the EGFR wild-type and high PDL1 expression group. We found that the hypermethylation of cg02316066, cg16589260, and cg27637738 was associated with a poor prognosis in LUAD patients without EGFR mutations but with high PDL1 expression (p<0.05) (). Apart from EGFR mutations, KRAS mutations were the most common mutations in LUAD (25), and we discovered that hypermethylation of cg26055062 and cg04625338 predicted good prognosis in LUAD patients with KRSA mutations (). Hypermethylation of cg18809076 and cg25311271 in the KRAS wild-type group predicted good prognosis, but ch.7.1264585R had the opposite effect ().
Figure 3

KM curves of EGFR CpG sites. Kaplan-Meier analysis of the overall survival rate of LUAD patients based on the KRAS mutation group (A), KRAS wild-type group (B), smoking group (C) and the non-smoking group (D).

KM curves of EGFR CpG sites. Kaplan-Meier analysis of the overall survival rate of LUAD patients based on the KRAS mutation group (A), KRAS wild-type group (B), smoking group (C) and the non-smoking group (D). Patients with LUAD might have a variety of molecular features depending on their smoking history (26). We explored the relationship between EGFR methylation and prognosis in patients with LUAD in the smoking and non-smoking groups. Hypermethylation of cg16751451 and cg27637738 suggested a poor prognosis for LUAD in the smoking group (). In the nonsmoking group, hypermethylation of cg04625338 and cg05064645 indicated a favorable outcome, but cg02316066, cg05898452, cg06052090, cg21808635, cg16751451, and cg16589260 had the opposite effect ().

EGFR Expression Is Correlated With DNA Methylation

Individual CpG methylation was studied in relation to EGFR mRNA and protein expression (). Of the 49 CpG sites examined in LUAD tissue, 44 had a strong association with EGFR mRNA expression. The methylation levels of all CpG sites in promoter regions were shown to be inversely correlated to EGFR mRNA levels. The methylation level of CpG sites in 27 of 43 CpGs within the gene body region was positively correlated with the mRNA level of EGFR, and 11/43 showed a significant negative correlation. Cg03046247, cg08428266 and cg20062492 hypermethylation was significantly related to the high expression of EGFR mRNA, and cg10002850 hypomethylation was significantly related to the high expression of EGFR mRNA. Then we investigated the connection between CpG methylation and EGFR protein expression, and the findings matched the EGFR mRNA association described previously. Cg02316066 hypermethylation and cg01461514 hypomethylation were significantly related to the high expression of EGFR protein.
Figure 4

The correlation of EGFR CpG sites methylation and mRNA and protein expression of EGFR in LUAD. (A) Correlation analysis between DNA methylation levels of 49 EGFR CpG sites and mRNA and protein expression of EGFR in the TCGA LUAD cohort. (B, C) Correlation analysis between DNA methylation levels of 49 EGFR CpG sites and mRNA and protein expression of EGFR in EGFR and KRAS status groups of TCGA LUAD cohort. CpG sites labeled in red are located in the EGFR promoter region and in black in the gene body region.

The correlation of EGFR CpG sites methylation and mRNA and protein expression of EGFR in LUAD. (A) Correlation analysis between DNA methylation levels of 49 EGFR CpG sites and mRNA and protein expression of EGFR in the TCGA LUAD cohort. (B, C) Correlation analysis between DNA methylation levels of 49 EGFR CpG sites and mRNA and protein expression of EGFR in EGFR and KRAS status groups of TCGA LUAD cohort. CpG sites labeled in red are located in the EGFR promoter region and in black in the gene body region. Next, we explored the correlation between the mRNA and protein expression levels of EGFR and EGFR CpG sites in EGFR and KRAS mutation status or not (). Overall, similar to the above results, the β values of CpG sites located in the EGFR promoter region were negatively correlated with the mRNA and protein levels of EGFR in both the EGFR mutant and KRAS mutant groups, while the CpG sites of the gene body showed the opposite effect. Interestingly, we found a higher correlation, both positive and negative, in the EGFR mutant group compared to the EGFR wild-type group. In contrast, this correlation was lower in the KRAS mutation group than in the KRAS wild-type group.

EGFR Methylation and Expression Are Associated With Immune Cells Infiltration

Immune cells from a variety of species are known to infiltrate the tumor microenvironment (27). We explored the association between EGFR methylation levels and the infiltration levels of 8 immune cells and 2 stromal cells of MCPcounter. Based on the median EGFR integrated methylation level, we divided the LUAD samples into hypermethylated and hypomethylated EGFR groups. The results showed that the EGFR hypermethylation group was associated with increased infiltration of T cells, CD8 T cells, cytotoxic lymphocytes, B lineage, NK cells, monocytic lineage, and fibroblasts (). Moreover, the immune score, stromal score and estimate score of ESTIMATE were higher in the EGFR hypermethylated group than those of the hypomethylated group (). We found a positive association between EGFR promoter hypermethylation and the infiltration of T cells, CD8 T cells, cytotoxic lymphocytes, B lineage, NK cells, monocytic lineage, endothelial cells and fibroblasts, while EGFR body hypermethylation had lower infiltration of the above immune cells (). To further examine this scenario, we found that EGFR promoter hypermethylation had higher immune scores (p<0.001), a marker of total immune infiltration (). The association between EGFR methylation and the IFN-signature was also investigated. Increased mRNAs of the main IFN-signature genes (IFNG, STAT1, STAT2, JAK2) were found to be linked to extensive promoter hypermethylation and gene body hypomethylation ().
Figure 5

The correlation of EGFR CpG sites methylation and immune cells infiltrates and IFN-γ signature in LUAD. (A) The landscape of immune infiltration in LUAD according to the median EGFR methylation level by MCPcounter. (B) ESTIMATE scores in LUAD according to the median EGFR methylation level. (C–E)Correlation analysis between DNA methylation levels of 49 EGFR CpG sites and immune infiltration, ESTIMATE scores, IFN-γ signature in the TCGA LUAD cohort. CpG sites labeled in red are located in the EGFR promoter region and in black in the gene body region. ****P < 0.0001, ns, not significant.

The correlation of EGFR CpG sites methylation and immune cells infiltrates and IFN-γ signature in LUAD. (A) The landscape of immune infiltration in LUAD according to the median EGFR methylation level by MCPcounter. (B) ESTIMATE scores in LUAD according to the median EGFR methylation level. (C–E)Correlation analysis between DNA methylation levels of 49 EGFR CpG sites and immune infiltration, ESTIMATE scores, IFN-γ signature in the TCGA LUAD cohort. CpG sites labeled in red are located in the EGFR promoter region and in black in the gene body region. ****P < 0.0001, ns, not significant.

Validation of CpG Site Methylation by qMSP in External Cohorts

We evaluated the relationship between CpG sites and clinical information, such as T staging, N staging and cancer states. As shown in , the increase in the methylation β value of cg02166842 was significantly correlated with the T stage (T1 vs T2, T1 vs T3) (p=0.027, p=0.025 respectively) (), N stage (N0 vs N2) (), (p=0.035) and with tumor status (p=0.032) (). The methylation β value of cg02316066 was positively correlated with the T stage (T1 vs T3, T1 vs T4, T2 vs T3) (p=0.0049, p=0.019, p=0.032 respectively) (), while the methylation β value of cg03046247 was positively correlated with the T stage (T1 vs T2, T1 vs T3, T1 vs T4, T2 vs T3,T2 vs T4) (p=0.026, p=0.0011, p=0.0038, p=0.049, p=0.028 respectively) () and N stage (N0 vs N2) (p=0.0068) (). To determine whether there was a difference in the methylation levels of cg02316066, cg03046247 and cg02166842 between LUAD tissue and adjacent tissues, we collected 20 pairs of tissue specimens. Consistent with the TCGA database results, cg02316066 showed hypermethylation levels and cg02166842 showed hypomethylation in LUAD (). Ten OS-related CpG sites showed high correlation coefficients towards each other indicating a high degree of co-methylation (). We noted that cg02316066 and cg03046247 were strongly associated with multiple types of clinical profiles and LUAD prognosis, and there was also a high degree of co-methylation between cg02316066 and cg03046247. In the TCGA cohort, the cg02316066 and cg03046247 Pearson correlation coefficient was 0.76 (p<0.001) () and, consistent with this, in our validation cohort the correlation coefficient was 0.56 (p<0.001) ().
Figure 6

The relationship of EGFR CpG sites DNA methylation levels and clinicopathologic parameters. (A) Association of cg02166842 DNA methylation levels with T stage. (B) Association of cg02166842 DNA methylation levels with N stage. (C) Association of cg02166842 DNA methylation levels with cancer status. (D) Association of cg02316066 DNA methylation levels with T stage. (E) Association of cg03046247 DNA methylation levels with T stage. (F) Association of cg03046247 DNA methylation levels with N stage. (G) DNA methylation levels of cg02316066 in normal and LUAD samples. (H) DNA methylation levels of cg02166842 in normal and LUAD samples. (I) Correlation heat map of DNA methylation levels at 10 OS-associated CpG sites. (J) Association between the DNA methylation of cg02316066 and cg03046247 in the TCGA LUAD cohort. (K) Association between the DNA methylation of cg02316066 and cg03046247 in external validation cohort.

The relationship of EGFR CpG sites DNA methylation levels and clinicopathologic parameters. (A) Association of cg02166842 DNA methylation levels with T stage. (B) Association of cg02166842 DNA methylation levels with N stage. (C) Association of cg02166842 DNA methylation levels with cancer status. (D) Association of cg02316066 DNA methylation levels with T stage. (E) Association of cg03046247 DNA methylation levels with T stage. (F) Association of cg03046247 DNA methylation levels with N stage. (G) DNA methylation levels of cg02316066 in normal and LUAD samples. (H) DNA methylation levels of cg02166842 in normal and LUAD samples. (I) Correlation heat map of DNA methylation levels at 10 OS-associated CpG sites. (J) Association between the DNA methylation of cg02316066 and cg03046247 in the TCGA LUAD cohort. (K) Association between the DNA methylation of cg02316066 and cg03046247 in external validation cohort.

Discussion

So far, EGFR is a consensual factor that promotes cancer progression and the development of EGFR-TKIs has dramatically changed the therapeutic landscape for patients with non-small cell lung cancer (28). However, the rapid occurrence of clinical drug resistance hinders patient survival (8). TKIs or monoclonal antibodies targeting EGFR can block the infiltration of immunosuppressive cells and improve the antitumor response in NSCLCs, indicating that combining EGFR-targeted therapy with immune checkpoint inhibitors is a viable alternative for combination immunotherapy (29). Epigenetic variations are being gradually investigated, and they are now changing the idea that malignant lesions depend entirely on genetic expressions to develop (30). DNA methylation, which is largely responsible for gene silencing and chromatin formation, is by far the most studied epigenetic regulatory mechanism (31). Methyl groups are covalently bound to cytosine during DNA methylation to generate 5-methylcytosine (5mC) (32). Furthermore, DNA methylation is chemically stable, can be tested separately, and has strong biomarker potential (33). Methylation quantitative measurement of small samples (microanatomical cells, biopsies) is often performed in the clinic. These are the advantages of using methylation as a biomarker (34, 35). To establish the various levels of methylation sites of EGFR in LUAD, we collected RNA-seq results and Illumina HumanMethylation450K from TCGA. We meticulously investigated DNA methylation at the EGFR 49 CpG sites. We correlated EGFR methylation with transcription and protein expression in LUAD tissues. Our results indicate that DNA methylation in the promoter and gene body regions resulted in strong epigenetic regulation of EGFR. In the LUAD TCGA cohort, hypomethylation of the promoter region was negatively associated with increased mRNA and protein expression, while hypomethylation in the gene body was nearly always positively correlated with EGFR expression. This strong epigenetic regulation of EGFR is present not only in the different mutational states of EGFR but also in the KRAS mutant and wild-type groups, and interestingly, mutations in EGFR enhance this epigenetic regulation of EGFR, while mutations in KRAS attenuate this property. Some studies have revealed that EGFR and KRAS mutations are mutually exclusive in lung adenocarcinoma (36), the mechanisms of which need to be investigated in more depth. These methylation defects can eventually affect the clinical characteristics and prognosis of LUAD patients. Mutations in the Furin-like and Pkinase-Tyr domains were shown to be predictive indicators of effective TKI therapy for NSCLC (37–39), with slightly longer survival as compared to standard combination chemotherapy (40, 41). Different EGFR mutations have various benefits, and those that inhibit EGFR kinase activity can benefit from EGFR-targeted therapy (42, 43). In LUAD patients with an EGFR mutant phenotype, we found that most (19/26) CpG sites were hypomethylated and six of these were predictors for a good prognosis. There was a negative correlation between SCNV and DNA methylation at sites in CpG islands, and conversely, a positive correlation between sites in the CpG ocean of EGFR (26/31). T cells, CD8 T cells, cytotoxic lymphocytes, B lineage, NK cells, monocytic lineage, and fibroblasts were found to be infiltrated more frequently in tissues that presented EGFR hypermethylation. Also, the immune score, stromal score and estimate score were higher in the EGFR hypermethylated group than those in the hypermethylated group. The non-inflammatory tumor microenvironment (TME) in EGFR-mutated NSCLCs is abundant in Treg cells and macrophages, with the latter releasing chemokines that attract more Treg cells in the inflammatory TME (44). EGFR-TKI therapy facilitates CD8+ T cell recruitment and prevents Treg cell infiltration in the TME in EGFR-mutated tumors in vivo (45). Therefore, we evaluated the relationship between EGFR methylation and an IFN-γ signature. Wide-spread promoter hypermethylation and body hypomethylation were strongly associated with increased IFN-γ signature. In vitro studies have shown that blocking EGFR with antibodies or kinase inhibitors facilitate the secretion of chemokines (CCL2, CCL5, and CXCL10) in HNSCC cells and keratinocytes when IFN and tumor necrosis factor (TNF) are stimulated (46). In summary, our research shows that EGFR participates in the epigenetic regulation of LUAD through DNA methylation. DNA methylation of EGFR shows unique clinical characteristics and immunogenicity. Our research provides a theoretical basis for further assessment of EGFR DNA methylation, which can be used as a biomarker to predict the prognosis and immune mechanisms of LUAD.

Data Availability Statement

The original contributions presented in the study are included in the article/. Further inquiries can be directed to the corresponding author.

Ethics Statement

The studies involving human participants were reviewed and approved by The Ethics Committee of the First Affiliated Hospital of Guangxi Medical University. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author Contributions

ZX and FQ conceived and designed the experiments, performed the qMSP experiments, prepared figures and/or tables and approved the final draft. LY, KD, and TZ performed bioinformatics analysis, analyzed the data, authored or reviewed drafts of the paper and approved the final draft. YS and JQ prepared figures and/or tables and approved the final draft. SL conceived and designed the experiments, approved the final draft. All authors contributed to the article and approved the submitted version.

Funding

This study was supported by the National Natural Science Foundation of China (no. NSFC81660488) and the Guangxi Natural Science Foundation under grant no. 2017GXNSFAA198123.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  46 in total

1.  Analysis of DNA methylation of multiple genes in microdissected cells from formalin-fixed and paraffin-embedded tissues.

Authors:  Dimo Dietrich; Ralf Lesche; Reimo Tetzner; Manuel Krispin; Jörn Dietrich; Wolfgang Haedicke; Matthias Schuster; Glen Kristiansen
Journal:  J Histochem Cytochem       Date:  2009-01-19       Impact factor: 2.479

Review 2.  A comprehensive review of uncommon EGFR mutations in patients with non-small cell lung cancer.

Authors:  Hai-Yan Tu; E-E Ke; Jin-Ji Yang; Yue-Li Sun; Hong-Hong Yan; Ming-Ying Zheng; Xiao-Yan Bai; Zhen Wang; Jian Su; Zhi-Hong Chen; Xu-Chao Zhang; Zhong-Yi Dong; Si-Pei Wu; Ben-Yuan Jiang; Hua-Jun Chen; Bin-Chao Wang; Chong-Rui Xu; Qing Zhou; Ping Mei; Dong-Lan Luo; Wen-Zhao Zhong; Xue-Ning Yang; Yi-Long Wu
Journal:  Lung Cancer       Date:  2017-11-07       Impact factor: 5.705

Review 3.  The biology and management of non-small cell lung cancer.

Authors:  Roy S Herbst; Daniel Morgensztern; Chris Boshoff
Journal:  Nature       Date:  2018-01-24       Impact factor: 49.962

4.  Gefitinib or chemotherapy for non-small-cell lung cancer with mutated EGFR.

Authors:  Makoto Maemondo; Akira Inoue; Kunihiko Kobayashi; Shunichi Sugawara; Satoshi Oizumi; Hiroshi Isobe; Akihiko Gemma; Masao Harada; Hirohisa Yoshizawa; Ichiro Kinoshita; Yuka Fujita; Shoji Okinaga; Haruto Hirano; Kozo Yoshimori; Toshiyuki Harada; Takashi Ogura; Masahiro Ando; Hitoshi Miyazawa; Tomoaki Tanaka; Yasuo Saijo; Koichi Hagiwara; Satoshi Morita; Toshihiro Nukiwa
Journal:  N Engl J Med       Date:  2010-06-24       Impact factor: 91.245

Review 5.  Epigenetic alterations as a universal feature of cancer hallmarks and a promising target for personalized treatments.

Authors:  Michael Schnekenburger; Cristina Florean; Mario Dicato; Marc Diederich
Journal:  Curr Top Med Chem       Date:  2016       Impact factor: 3.295

Review 6.  DNA methylation, its mediators and genome integrity.

Authors:  Huan Meng; Ying Cao; Jinzhong Qin; Xiaoyu Song; Qing Zhang; Yun Shi; Liu Cao
Journal:  Int J Biol Sci       Date:  2015-04-08       Impact factor: 6.580

7.  A prospective, molecular epidemiology study of EGFR mutations in Asian patients with advanced non-small-cell lung cancer of adenocarcinoma histology (PIONEER).

Authors:  Yuankai Shi; Joseph Siu-Kie Au; Sumitra Thongprasert; Sankar Srinivasan; Chun-Ming Tsai; Mai Trong Khoa; Karin Heeroma; Yohji Itoh; Gerardo Cornelio; Pan-Chyr Yang
Journal:  J Thorac Oncol       Date:  2014-02       Impact factor: 15.609

8.  Estimating the population abundance of tissue-infiltrating immune and stromal cell populations using gene expression.

Authors:  Etienne Becht; Nicolas A Giraldo; Laetitia Lacroix; Bénédicte Buttard; Nabila Elarouci; Florent Petitprez; Janick Selves; Pierre Laurent-Puig; Catherine Sautès-Fridman; Wolf H Fridman; Aurélien de Reyniès
Journal:  Genome Biol       Date:  2016-10-20       Impact factor: 13.583

9.  Defining quantification methods and optimizing protocols for microarray hybridization of circulating microRNAs.

Authors:  Anna Garcia-Elias; Leonor Alloza; Eulàlia Puigdecanet; Lara Nonell; Marta Tajes; Joao Curado; Cristina Enjuanes; Oscar Díaz; Jordi Bruguera; Julio Martí-Almor; Josep Comín-Colet; Begoña Benito
Journal:  Sci Rep       Date:  2017-08-10       Impact factor: 4.379

10.  Investigation on the potential of circulating tumor DNA methylation patterns as prognostic biomarkers for lung squamous cell carcinoma.

Authors:  Yutao Liu; Yu Feng; Ting Hou; Analyn Lizaso; Feng Xu; Puyuan Xing; Hongyu Wang; Qiaolin Kang; Lu Zhang; Yuankai Shi; Xingsheng Hu
Journal:  Transl Lung Cancer Res       Date:  2020-12
View more
  2 in total

Review 1.  DNA Methylation in Lung Cancer: Mechanisms and Associations with Histological Subtypes, Molecular Alterations, and Major Epidemiological Factors.

Authors:  Phuc H Hoang; Maria Teresa Landi
Journal:  Cancers (Basel)       Date:  2022-02-15       Impact factor: 6.639

2.  BTG2 Serves as a Potential Prognostic Marker and Correlates with Immune Infiltration in Lung Adenocarcinoma.

Authors:  Xiao Zhen Zhang; Mao Jian Chen; Ping Ming Fan; Wei Jiang; Shi Xiong Liang
Journal:  Int J Gen Med       Date:  2022-03-08
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.