Literature DB >> 32037691

Comprehensive analysis of prognosis-related methylated sites in breast carcinoma.

Jia Li1, Xinzheng Li1.   

Abstract

BACKGROUND: Breast carcinoma has become a nonnegligible public health problem in China with its increasing incidence and mortality in woman. As a early event regulating tumorigenesis and development, DNA methylation became one of the focuses of current carcinoma researches on potential diagnostic and therapeutic targets.
METHODS: In this study, we comprehensively analyzed the gene expression data and DNA methylation data of breast carcinoma and adjacent normal tissues samples in the Gene Expression Omnibus database. Influences of tumor stage, adjuvant therapy, hormone therapy, and chemotherapy on CpG methylation level were explored by linear regression analysis. Correlations between methylation and gene expression levels were determined by spearman rank correlation analysis. Log-rank test was applied for determining significance of associations between CpG sites methylation level and breast cancer patients' Kaplan-Meier survival.
RESULTS: A total of 229 CpG sites were found to be significantly associated with tumor stage or treatment, and eight of which were potential markers that affect the survival of breast carcinoma and negatively correlated with their genes' expression levels.
CONCLUSIONS: We reported eight CpG sites as potential breast cancer prognosis signatures through comprehensively analyzed gene expression and DNA methylation datasets, and excluding influences of tumor stage and treatment. This should be helpful for breast cancer early diagnosis and treatment.
© 2020 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc.

Entities:  

Keywords:  CpG island; DNA methylation; breast carcinoma; prognosis

Mesh:

Substances:

Year:  2020        PMID: 32037691      PMCID: PMC7196449          DOI: 10.1002/mgg3.1161

Source DB:  PubMed          Journal:  Mol Genet Genomic Med        ISSN: 2324-9269            Impact factor:   2.183


INTRODUCTION

Breast carcinoma (BC) is one of the most common reproductive carcinomas. In China, BC is the leading cause of carcinoma death in women younger than 45 years, and also is expected to account for 15% of all new carcinomas in women per year (Chen et al., 2016) In recent decades, the promotion of mammographic screening in physical examination has contributed to improve the early diagnosis of BC (Howell et al., 2014). However, the widespread application of these screening and diagnostic interventions requires significant resources (Eccles et al., 2013). New biomarker development is the basis of BC detection and diagnosis, and personalized treatment (Wagner & Srivastava, 2012). Thus, integrative analysis on high‐throughput data for searching biomarkers and potential therapeutic targets has been extensively studied in recent years (Sethi, Ali, Philip, & Sarkar, 2013). And the good news is that, several promising drugs that target epigenetic alterations are currently available for clinical investigation in solid tumors, including BC (Connolly & Stearns, 2012). Epigenetic alterations have recently emerged as a common hallmark of multiple tumors (Abdel‐Hafiz & Horwitz, 2015; Chakravarthi, Nepal, & Varambally, 2016; Okugawa, 2015). The occurrence of tumor cells is able to be activated by epigenetic alterations. Furthermore, their cellular behaviors, including proliferation, invasion, metastasis and even escape from chemotherapy and host immune surveillance, are also regulated by epigenetic processes (Klymenko & Nephew, 2018; Marks, Olson, & Fernandez‐Zapico, 2016). The epigenetics refers to changes in gene expression without changes in the DNA, including DNA methylation, histone posttranslational modifications, recruitment of chromatin remodeling factors, and expressions of micro and long noncoding RNA (Baylin & Jones, 2011). Especially, the causal relationships between gene expressions and DNA methylation have received considerable attention, and the epigenetic modification of different gene regions may consequently lead to distinct biological and clinical implications (Dafni, Anna, & Francesc, 2018). In BC, DNA methylation has been proved to be associated with clinicopathological features, such as tumor stage, histological and grade (Fleischer et al., 2017; Holm et al., 2016). Moreover, it can also influence the progression and prognosis of BC patients (Rauscher et al., 2015). The potential is great for DNA methylation markers to improve carcinoma outcomes across the prevention continuum (Terry, Mcdonald, Wu, Eng, & Santella, 2016). To screen the key methylated sites related to BC prognosis, in this study, we comprehensively analyzed the gene expressions and DNA methylation in BC tumor tissues, and explored the mechanisms and biological processes affecting the occurrence and development. We found that eight methylated sites that can influence breast carcinoma survival independently of clinical factors such as clinical grade and treatment. Finally, we identified two potential gene markers (ESPL1 andPARPBP) that might affect breast carcinoma survival, which supplemented the existing system of DNA methylation in the regulation of breast carcinoma.

MATERIALS AND METHODS

Data source

Datasets on DNA methylation and mRNA expression profiles of BC were downloaded from the Gene Expression Omnibus (GEO) database (http://www.ncbi.nlm.nih.gov/geo/). The datasets of DNA methylation (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE37754; Platform: Illumina HumanMethylation450 BeadChip [HumanMethylation450 15017482 v.1.1]) and mRNA expression profile (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE37751; Platform: Affymetrix Human Gene 1.0 ST Array [HuGene‐1_0‐st]) were selected for further analysis. From http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE37754 dataset, 72 samples were obtained, including 10 normal (noncarcinomaous) tissues and 62 tumor (breast carcinoma) tissues. From http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE37751 dataset, 108 samples were obtained, including 47 normal tissues and 61 tumor tissues.

Preprocessing of DNA methylation and mRNA expression data

DNA methylation data were processed via the IMA R Bioconductor package. The following criteria were used: Remove the methylated site if its detection p value (DP) > .05 in >75% samples and the methylated sites on sex chromosomes; remove the sample if >75% methylated sites in it had DP > 10–5. Meanwhile, the mRNA expression data were normalized with Robust Multi‐Chip Averaging with the Affymetrix Expression Console software.

Screening of differentially methylated genes

The differentially methylated sites (DMGs) between tumors and normal tissues were identified by limma method in IMA package with the thresholds of false discovery rate ≤0.05 (Wang et al., 2012). The Benjamini–Hochberg method was used for statistical corrections. The correlation of the methylation level of the DMGs with the stage and treatment of the patients was analyzed using limma R Bioconductor package (Ritchie et al., 2015). The correlation was defined as follows:where a is the methylation level at baseline, and X 1, X 2, X 3, X represent for tumor‐node‐metastasis (TNM) carcinoma staging, adjuvant therapy, hormone therapy, and chemotherapy respectively. p ≤ .05 was considered to be significantly correlated.

Survival analysis

The significantly differentially methylated genes that correlated with prognosis were screened using survival R package. The survival curve analysis was performed by the Kaplan–Meier method and comparison between subgroups were examined by the log‐rank test. The influence of the methylation level of the DMGs on the overall survival of breast carcinoma patients was analyzed using the Cox model.

Correlation between methylation and mRNA expression

Aberrant DNA methylation is usually closely associated with altered gene expression. Thus, we selected the DMGs that significantly associated with carcinoma staging, treatment and prognosis, and used the spearman rank test to calculate the correlation between their methylation and gene expression levels with the cut‐off of the absolute value of the correlation coefficient >0.75.

RESULTS

Identification of DMGs in BC

The distribution of the mRNA expression profiles of different samples after normalization was suitable for subsequent analysis (Figure 1a). After the preprocessing of DNA methylation, 455,968 methylated sites were retained from the 485,577 methylated sites with a pass rate of 93.9%, and all the samples were retained. The methylation level distribution of remaining methylated sites in tumor and normal tissues after removing the low‐quality methylation site (Figure 1b) indicated that the overall methylation level in tumor was higher than that in normal tissues. After screened by limma method in IMA package, 6,043 significantly differentially methylated genes were identified (Table S1). The heatmap of significantly differentially methylated genes in normal and tumor tissues (Figure 1c), indicating that the gene expression patterns of most tumor samples are consistent.
Figure 1

Identification of differentially methylated genes in BC. (a) The distribution of the mRNA expression profiles of normal and tumor tissues after normalization. (b) The methylation level distribution of differentially methylated sites of normal and tumor tissues after preprocessing. (c) Heat map of significantly differentially methylated genes in normal and tumor tissues

Identification of differentially methylated genes in BC. (a) The distribution of the mRNA expression profiles of normal and tumor tissues after normalization. (b) The methylation level distribution of differentially methylated sites of normal and tumor tissues after preprocessing. (c) Heat map of significantly differentially methylated genes in normal and tumor tissues

DMGs associated with tumor staging and treatment

As a result, 229 sites were identified out of the 6,043 DMGs with a standard of p ≤ .05 (Table S2). Among the DMGs, 191 were significantly associated with tumor TNM stage, 10 were associated with adjuvant therapy, 13 were associated with hormonal therapy, and 24 were associated with chemotherapy. There were 83.4% of these 229 DMGs associated with tumor stages (Figure 2). Furthermore, the expression pattern of these four DMGs groups were different in their related subgroups. Based on these 229 DMGs, we used principal component analysis to analyze the differences between tumors of different stages, and showed the first three principal components in a three‐dimensional scatter plot (Figure 3).
Figure 2

Heat map of differentially methylated genes related to tumor staging and treatments. (a) Methylation level of differentially methylated sites (DMGs) in tissues from patients at different stages. (b) Methylation level of DMGs in tissues from patients with and without adjuvant therapy. (c) Methylation level of DMGs in tissues from patients with and without hormone therapy. (d) Methylation level of DMGs in tissues from patients with and without chemotherapy

Figure 3

3D scatter plot of principal component analysis

Heat map of differentially methylated genes related to tumor staging and treatments. (a) Methylation level of differentially methylated sites (DMGs) in tissues from patients at different stages. (b) Methylation level of DMGs in tissues from patients with and without adjuvant therapy. (c) Methylation level of DMGs in tissues from patients with and without hormone therapy. (d) Methylation level of DMGs in tissues from patients with and without chemotherapy 3D scatter plot of principal component analysis

Analysis of prognosis‐related DNA methylated sites in BC

We performed a survival analysis of the 229 methylated sites significantly associated with tumor staging and treatment. For each methylated site, we divided the tumor samples into two groups using the median of the methylation levels of these sites. The survival curve was plotted using the Kaplan–Meier method. The log‐rank test was used in the judgment of difference in the survival curves. p < .05 was set as the standard. The results showed that the survival curves of 55 methylated sites were significantly different (Table S3). We then performed a cox proportional regression to analyze of the effects of these methylated sites, the effects and relative risks of race, age, tumor stages, and treatments (adjuvant therapy, hormonal therapy and chemotherapy) on survival times (Table S4). The results indicated that among these factors, age, tumor stages, and adjuvant therapy were significantly associated with survival time. Of the 229 DMGs, 87 were significantly associated with carcinoma survival time. To find the methylated sites that associated with survival time independent of age, tumor stages and adjuvant therapy, we analyzed the methylation levels of these 229 DMGs and the factors above in a multivariate cox proportional regression risk model. The results showed that after correcting with these factors, 13 DMGs were significantly associated with survival time (Table S5). In addition, eight of the 13 DMGs were also significant in the single‐factor cox proportional regression, including cg04988216, cg00118989, cg01967564, cg21374754, cg26874872, cg04836851, cg26090534, and cg08783934. The effects of these eight DMGs on BC patients' survival were shown in Figure 4. The red survival curves were the sample with higher methylation level, and the blue survival curves were the sample with lower methylation levels.
Figure 4

Survival analysis for the most significant differentially methylated sites. (a) Survival analysis stratified by cg00118989. (b) Survival analysis stratified by cg01967564. (c) Survival analysis stratified by cg26874872. (d) Survival analysis stratified by cg21374754. (e) Survival analysis stratified by cg04836851. (f) Survival analysis stratified by cg26090534. (g) Survival analysis stratified by cg04988216. (h) Survival analysis stratified by cg08783934

Survival analysis for the most significant differentially methylated sites. (a) Survival analysis stratified by cg00118989. (b) Survival analysis stratified by cg01967564. (c) Survival analysis stratified by cg26874872. (d) Survival analysis stratified by cg21374754. (e) Survival analysis stratified by cg04836851. (f) Survival analysis stratified by cg26090534. (g) Survival analysis stratified by cg04988216. (h) Survival analysis stratified by cg08783934

Correlation between DNA methylation and gene expression in BC

The level of methylation often affects the level of gene expression. We used the Spearman method to calculate the correlation between the methylation levels of the above 229 methylated sites and their corresponding gene expression levels. Using the absolute value of the correlation coefficient >0.75 as the threshold, a total of 19 methylated sites in the above 229 methylated sites were significantly associated with the expression levels of at least one of the corresponding genes (Table S6).

DISCUSSION

DNA hypermethylation is conventionally negatively associated with gene expression, and the DNA methylation in the promoter region of tumor suppressor genes appears to be a key event at early stages of carcinogenesis (Baylin & Jones, 2016). CpG islands are CpG‐rich areas of 200 bp to several kilobases in length, usually located near the promoters of highly expressed genes (Baylin, 2000). The aberrant CpG methylation has been proven to be a nearly universal feature of human carcinoma (Yang & Park, 2012). Damir Herman et al. had shown that CHST11 played an important role in the development of breast carcinoma, and its expression was regulated by DNA methylation (Herman et al., 2015). Studies had also shown that the promoter region of Runx3 gene was methylated, inhibited cell proliferation, apoptosis and differentiation, and promotes the formation of breast carcinoma (Lotem et al., 2017; Rossi, Bagalà, Inzani, Leoncini, & Schinzari, 2017). Liu H et al. revealed that hypermethylation of the RUNX3 gene promoter might play an important role in ER‐positive breast tumor progression (Liu et al., 2018). However, as BC is a heterogeneous disease, which includes several subtypes with different molecular and clinical features, the prognostic value of these aberrantly methylated biomarkers and the complex role of DNA methylation in distinct gene regions are still controversial topics (Győrffy et al., 2016). In this study, we comprehensively analyzed the gene expressions and DNA methylation in BC tumor tissues, and explored the mechanisms and biological processes affecting the occurrence and development. Comparing with normal tissues, a total of 6,043 significantly differentially methylated genes were identified in BC tumor tissues. Of these, 229 DMGs were significantly associated with tumor staging (83.4%) and treatment (16.6%), and 19 DMGs in the these 229 DMGs were associated with the expression levels of at least one of the corresponding genes. The corresponding genes were CDCA8, KIF2C, ARF1, RGS5, CENPL, TMEM206, EPRS, DHTKD1, CEP55, COL17A1, STIP1, GRIA4, OR5P1P, TUBA1C, ESPL1, PARPBP, RHOJ, PLK1, HERC2P4, HLF, MYO15B, KRT14, L3MBTL4, 8,022,168, CALR, DDX49, MRPS12, SMYD1, C2orf40, TPX2, ADRM1, TP63, PATA18, CENPU, IRX1, CARMN, HIST1H3A, SAMD5, KIAA1456, MTDH, PRDX4, and NAA10. Among them, the value of 13/42 genes in affecting the tumor cell behavior and BC prognosis has been clarified. CDCA8 is a putative oncogene that is upregulated in multiple types of carcinomas (Ci et al., 2018). In BC, the expression of CDCA8 correlates closely with FOXM1, which might be might be involved in BC progression (Jiao et al., 2015). ARF1 regulates the adhesion and invasion of MDA‐MB‐231 cells (Schlienger, Campbell, & Claing, 2014). EP RS is a critical regulator of cell proliferation and estrogen signaling in ER + BC (Katsyv, Wang, Song, Zhou, & Irie, 2016). CEP55 is a downstream target of FOXM1, which is involved in the proliferation of BC cells (Ye, Tao, Xueming, & Junming, 2016). The aberrant COL17A1 promoter methylation predicts the misexpression and increased invasion in BC (Thangavelu, Krenács, Dray, & Duijf, 2016). ESPL1 is a candidate oncogene of luminal B BC (Finetti et al., 2014). PLK1 is related to the growth and metastasis of Her2+ BC cells (Yao et al., 2012). L3MBTL4 is a potential tumor suppressor gene of chromosome arm 18p regulating the aggressive phenotype of BC (Addou‐Klouche et al., 2010). C2ORF40 suppresses BC cell proliferation and invasion through modulating expression of M phase cell cycle genes (Lu et al., 2013). TPX2 promotes migration and invasion of BC cells (Yang, Li, Shen, et al., 2015). TP63 is involved in the regulation by estrogen receptor‐α and ERK2 that controls BC proliferation and invasiveness properties (Wang et al., 2017). The activation of MTDH can promote chemoresistance and metastasis of poor‐prognosis BC (Hu et al., 2009). Moreover, nine of the other 29 genes have also been named in studies of a variety of carcinomas (Cabagnols, Cayuela, & Vainchenker, 2015; Ho et al., 2012; Jang, Park, Kim, Seong, & Kim, 2014; Jiang et al., 2011; Wang et al., 2010; Wang et al., 2017; Yang, Li, Shen, et al., 2015; Yang, Li, Niu, Li, & Bai, 2015; Zhao et al., 2018). Breast carcinoma is a complex biological process caused by both hereditary and nonhereditary factors, such as patient age, family heredity, lifestyle habits, estrogen levels, growth factors, cytokines, kinases, and epigenetic regulation (Akram, Iqbal, Daniyal, & Khan, 2017). In this study, we found that age, tumor stage, and adjuvant therapy were significantly associated with the survival time of BC patients. Subsequently, we screened the DMGs that were able to influence survival time independently of these three factors. Eight DMGs showed significant associations, and only one of them (cg26090534) was shown to significantly affect gene expression. These DNA methylation modifications did not directly act on promoter region, but might act on enhancer element to regulate gene expression level to silence tumor suppressor genes. Even though DNA hypermethylation is conventionally negatively associated with gene expression, recently methylation has been demonstrated to positively correlate with gene expression (Fleischer et al., 2014). The genes with the highest positive and negative correlation with cg26090534 were PARPBP and ESPL1. PARPBP, PARP‐1 binding protein, is able to enhance poly (ADP‐ribose) polymerase‐1 (PARP‐1) activity. It has been reported that, PARP‐1 was highly expressed in NSCLC, reducing the effects of CBP in NSCLC. And PARP‐1 as oncogene was found to effect NSCLC cell migration through known oncogene in terms of gene interaction network (Chen, Li, Xu, Zhang, & Wang, 2017). Furthermore, PARP inhibition significantly decreased cell viability, migration, invasion, chromatin loop dimensions, and histone acetylation, and it could play a key role in the compartmentalization of chromatin and in the development of the more aggressive phenotype (Barboro et al., 2015). ESPL1 is located in the 12q13.13 chromosomal region, coding the separase protein (Zhang & Pati, 2017). The separase proteinan is an endopeptidase, which is activated at the onset of anaphase and cleaves the cohesin subunit RAD21, allowing the release of sister chromatid cohesion required for chromosomal disjunction (Xu et al., 2018). The epigenetic modification of functionally different gene regions may consequently lead to distinct biological and clinical implications. Moreover, specific methylation patterns have been proven to be associated with different BC subtypes. More evidences showed that abnormal methylation of enhancer element, same as gene mutation and abnormal expression played an important role in the occurrence and development of tumor. Vecchione et al. reported that hypermethylation of LZTS1 gene was found in 75% (6/8) of gastric carcinoma cell lines, and the deletion or decrease of this gene expression was more attributed to the enhancer methylation mechanism (Vecchione et al., 2002). In addition, hypermethylation of enhancers was also found in oral squamous cell carcinoma and non‐hodgkin lymphoma (Nakagawa et al., 2006; Rahmatpanah et al., 2006). Thus, the abnormal methylation in enhancer region found in this study might be closely related to the occurrence of BC, and we further study the actual function of these genes in the follow‐up work.

CONCLUSION

In this study, we comprehensively analyzed the gene expression data and DNA methylation data of BC and adjacent normal tissues samples in the GEO database, and identified two potential methylated gene markers that may affect the survival of BC patients. This study has reference significance for exploring the molecular mechanism of DNA methylation in the regulation of BC occurrence and development, and would contribute to screening of key methylation markers related to BC prognosis.

CONFLICT OF INTEREST

The authors declare that there are no conflicts of interest.

AUTHORS' CONTRIBUTIONS

Jia Li and Xinzheng Li conceived and designed the project, Jia Li acquired the data, analyzed and interpreted the data, and wrote the paper. Xinzheng Li approved the final version. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  57 in total

Review 1.  A decade of exploring the cancer epigenome - biological and translational implications.

Authors:  Stephen B Baylin; Peter A Jones
Journal:  Nat Rev Cancer       Date:  2011-09-23       Impact factor: 60.716

2.  Expression of nuclear matrix proteins binding matrix attachment regions in prostate cancer. PARP-1: New player in tumor progression.

Authors:  Paola Barboro; Nicoletta Ferrari; Matteo Capaia; Andrea Petretto; Sandra Salvi; Simona Boccardo; Cecilia Balbi
Journal:  Int J Cancer       Date:  2015-04-06       Impact factor: 7.396

3.  FEZ1/LZTS1 is down-regulated in high-grade bladder cancer, and its restoration suppresses tumorigenicity in transitional cell carcinoma cells.

Authors:  Andrea Vecchione; Hideshi Ishii; Gustavo Baldassarre; Pierfrancesco Bassi; Francesco Trapasso; Hansjuerg Alder; Francesco Pagano; Leonard G Gomella; Carlo M Croce; Raffaele Baffa
Journal:  Am J Pathol       Date:  2002-04       Impact factor: 4.307

Review 4.  Biology and insights into the role of cohesin protease separase in human malignancies.

Authors:  Nenggang Zhang; Debananda Pati
Journal:  Biol Rev Camb Philos Soc       Date:  2017-02-08

5.  ADRM1 gene amplification is a candidate driver for metastatic gastric cancers.

Authors:  Seok Hoon Jang; Jun Won Park; Hyo Rim Kim; Je Kyung Seong; Hark Kyun Kim
Journal:  Clin Exp Metastasis       Date:  2014-06-27       Impact factor: 5.150

Review 6.  Epigenetic Alterations in Colorectal Cancer: Emerging Biomarkers.

Authors:  Yoshinaga Okugawa; William M Grady; Ajay Goel
Journal:  Gastroenterology       Date:  2015-07-26       Impact factor: 22.682

7.  C2ORF40 suppresses breast cancer cell proliferation and invasion through modulating expression of M phase cell cycle genes.

Authors:  Jing Lu; Mingxin Wen; Yurong Huang; Xiuquan He; Yunshan Wang; Qi Wu; Zengchun Li; Andres Castellanos-Martin; Mar Abad; Juan J Cruz-Hernandez; Cesar A Rodriguez; Jesús Pérez-Losada; Jian-Hua Mao; Guangwei Wei
Journal:  Epigenetics       Date:  2013-04-26       Impact factor: 4.528

8.  CHST11 gene expression and DNA methylation in breast cancer.

Authors:  Damir Herman; Tatiana I Leakey; Alice Behrens; Aiwei Yao-Borengasser; Craig A Cooney; Fariba Jousheghany; Bounleut Phanavanh; Eric R Siegel; A Mazin Safar; Soheila Korourian; Thomas Kieber-Emmons; Behjatolah Monzavi-Karbassi
Journal:  Int J Oncol       Date:  2015-01-09       Impact factor: 5.650

9.  ARF1 regulates the Rho/MLC pathway to control EGF-dependent breast cancer cell invasion.

Authors:  Sabrina Schlienger; Shirley Campbell; Audrey Claing
Journal:  Mol Biol Cell       Date:  2013-11-06       Impact factor: 4.138

10.  EPRS is a critical regulator of cell proliferation and estrogen signaling in ER+ breast cancer.

Authors:  Igor Katsyv; Minghui Wang; Won Min Song; Xianxiao Zhou; Yongzhong Zhao; Sun Park; Jun Zhu; Bin Zhang; Hanna Y Irie
Journal:  Oncotarget       Date:  2016-10-25
View more
  1 in total

1.  Comprehensive analysis of prognosis-related methylated sites in breast carcinoma.

Authors:  Jia Li; Xinzheng Li
Journal:  Mol Genet Genomic Med       Date:  2020-02-10       Impact factor: 2.183

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.