Literature DB >> 31423189

Dysregulation of DNA methylation patterns may identify patients with breast cancer resistant to endocrine therapy: A predictive classifier based on differentially methylated regions.

Fan Zhang1, Yukun Cui1.   

Abstract

Endocrine therapy (ET) is one of a number of targeted therapies for estrogen receptor-positive breast cancer (BRCA); however, resistance to ET has become the primary issue affecting treatment outcome. In the present study, a predictive classifier was created using a DNA methylation dataset to identify patients susceptible to endocrine resistance. DNA methylation and RNA sequencing data, and the clinicopathological features of BRCA, were obtained from The Cancer Genome Atlas. Stringent criteria were set to select and classify patients into two groups, namely those resistant to ET (n=11) and sensitive to ET (n=21) groups. Bump hunting analysis revealed that 502 out of 135,418 genomic regions were differentially methylated between these two groups; these regions were differentially methylated regions (DMRs). The majority of the CpG sites contained in the DMRs mapped to the promoter region. Functional enrichment analyses indicated that a total of 562 specific genes encompassing these DMRs were primarily associated with 'biological progress of organ morphogenesis and development' and 'cell-cell adhesion' gene ontologies. Logistic regression and Pearson's correlation analysis were conducted to construct a predictive classifier for distinguishing patients resistant or sensitive to ET. The highest areas under the curve and relatively low Akaike information criterion values were associated with a total of 60 DMRs; a risk score retained from this classifier was revealed to be an unfavorable predictor of survival in two additional independent datasets. Furthermore, the majority of genes (55/63) exhibited a statistically significant association between DNA methylation and mRNA expression (P<0.05). The association between the mRNA expression of a number of genes (namely calcium release activated channel regulator 2A, Schlafen family member 12, chromosome 3 open reading frame 18, zinc finger protein 880, dual oxidase 1, major histocompatibility complex, class II, DP β1, C-terminal binding protein 1, ALG13 UDP-N-acetylglucosaminyltransferase subunit and RAS protein activator like 2) and the prognosis of patients with estrogen receptor-positive BRCA and ET resistance was determined using Kaplan-Meier Plotter. In summary, the predictive classifier proposed in the present study may aid the identification of patients sensitive or resistant to ET, and numerous genes maybe potential therapeutic targets to delay the development of resistance to ET.

Entities:  

Keywords:  DNA methylation; breast cancer; differentially methylated regions; predictive classifier; resistance to endocrine therapy

Year:  2019        PMID: 31423189      PMCID: PMC6607238          DOI: 10.3892/ol.2019.10405

Source DB:  PubMed          Journal:  Oncol Lett        ISSN: 1792-1074            Impact factor:   2.967


Introduction

Breast cancer (BRCA) is the most common type of cancer in women globally (1) and is characterized by notable heterogeneity (2). The expression levels of estrogen and progesterone receptors (ER/PR) and human epidermal growth factor receptor (HER2) have been investigated to further classify BRCA into numerous subtypes: Luminal (ER+/PR+), HER2-positive (ER−/PR−/HER2+), basal-like or triple negative (ER−/PR−/HER2−), claudin-low and normal-like BRCA (3). Based on these guidelines (4), ~70% of patients with BRCA may be classified as the luminal subtype (5). Endocrine therapy (ET), one of the crucial adjuvant treatments for luminal BRCA, suppresses tumor growth by targeting the ER signaling pathways. Unfortunately, >30% of ER-positive tumor types are intrinsically endocrine-resistant at diagnosis; ~40% of breast tumor types that initially respond to ET eventually acquire resistance (6). Additionally, the clinical characteristics of BRCA may be notably heterogeneous even when similar expression levels of ER are observed (7). ET resistance in ER-positive tumor cells may be ascribed to a variety of factors, including the post-transcriptional modifications of ERs (8,9) or activation of the ER-independent growth factor signaling pathways (10); however, these results have not been further investigated for the effective clinical treatment of BRCA (11,12). Upon metastasis, surgical intervention and present second-line therapeutic strategies have limited effectiveness. Thus, precisely predicting the prognosis of patients with BRCA and ET resistance is vital for generating the most appropriate individualized treatment. As ER status alone is inadequate for identifying patients responsive to ET, multi-gene signatures from gene transcripts have been obtained via analyses with Oncotype DX (13) and MammaPrint tests (14). The expression profiles of these biomarkers may substantially aid the prediction of therapeutic outcomes and the selection of adjuvant therapy (3); however, transcriptional expression may be regulated by a variety of factors and appears to be unstable. Additionally, gene transcripts may not reflect marked changes in regulatory mechanisms, including epigenetic alterations, which may result in disease susceptibility. This represents a limitation of current molecular diagnostic tools based on gene expression assays (15). DNA methylation is a chemical modification of DNA that does not result in alterations in its sequence and may be inherited during cell division. It is well established that notable alterations to the genome-wide DNA methylation landscape may occur in the early stages of cancer initiation and during cancer progression, and throughout the acquisition of drug resistance (16,17). The hypermethylation of tumor suppressor genes or the hypomethylation of oncogenes maybe associated with the development of BRCA (18,19). DNA methylation is an enzymatic process and may be reversed by epigenetic inhibitors (17). Compared with genetic transcription (mRNA), DNA is inherently stable and may be obtained from numerous sources, including tissue, plasma, saliva and urine (20). Therefore, the DNA methylation profile is promising for identifying patients susceptible to ET. In addition, specific epigenotypes have been identified for the characterization and molecular subtyping of BRCA (21–23); however, few studies focusing on the DNA methylome associated with endocrine-resistant BRCA have been conducted. Furthermore, previous studies have revealed that remodeling of the epigenome is associated with the endocrine-resistant cell phenotype (24,25). Thus, DNA methylation signatures may serve as predictive biomarkers to identify ET-responsive patients with BRCA. DNA methylation levels maybe simultaneously determined via microarray analyses of numerous CpGs. An increasing number of genome-wide DNA methylation profiles of various types of cancer are available from public databases (26). Previous studies have revealed the importance of methylation in genomic regions compared with that at a single CpG island (27,28). A genome-wide bump hunting approach, introduced by Jaffe et al (29), was originally designed to identify differentially methylated regions (DMRs) detected on numerous microarray platforms, including the Infinium HumanMethylation450 BeadArray (HM450 array). This approach was demonstrated to effectively model expression profiles without measurement errors, remove batch effects and detect regions of interest. The present study aimed to identify a novel predictive classifier of BRCA by applying the bump hunting method and logistic regression to The Cancer Genome Atlas (TCGA) BRCA datasets on the basis of the DNA methylation profile of BRCA. The results of the present study may aid the identification of patients susceptible to endocrine resistance.

Materials and methods

Data downloading and processing

The DNA methylation profiles associated with BRCA were determined using an HM450 array; the corresponding RNA sequencing data (IlluminaHiSeq_RNASeqV2 arrays; measured using RSEM software; version 1.2.31) (30) and detailed clinicopathological features, including ET information, were downloaded from TCGA (accessed: January 2016; known as the Genomic Data Common Data Portal; http://portal.gdc.cancer.gov/); there were a total of 885 and 1,213 tumor/adjacent tissues with DNA methylation and RNA sequencing data, respectively. Among them, 787 tumor/adjacent tissues possessed both DNA methylation and RNA sequencing data. These samples were used to examine the association between DNA methylation and mRNA expression for the DMRs included in the predictive classifier. Additionally, two DNA methylation datasets based on HM450 array analysis, namely GSE75067 (31) and GSE72251 (32) from the Gene Expression Omnibus database (33) (http://www.ncbi.nlm.nih.gov/geo/), were used as independent datasets to assess the predictive potential of DNA methylation as a classifier of endocrine resistance.

Patient enrollment

In order to investigate the DNA methylation patterns associated with sensitivity to ET, inclusion criteria for patient enrollment were set. In the present study, all patients were: i) Diagnosed with BRCA; ii) female and ≤75-year-old; iii) positive for tumor types of tumor, node and metastasis (TNM) stage <4 (34,35) and ERα expression; and iv) treated with ET. Consequently, of the 1,097 patients with BRCA, 404 patients were selected. Of these patients, those with disease free survival (DFS) ≤30 months were regarded to resist ET and were defined as the resistant to ET (RTE) group. Those with DFS >100 months were classified as the sensitive to ET (STE) group; patients without DNA methylation data were excluded. Furthermore, there were 11 and 21 patients in the RTE and STE groups, respectively; the data of these patients were included for the predictive classifier building. Either a Fisher's exact test or a Student's t-test were conducted to determine differences of clinicopathological features between these two groups.

Model construction and selection

The level 3 DNA methylation β-value of BRCA from TCGA was defined as the percentage of DNA methylation in the tissue samples at each CpG probe; the methylation ranged from 0.0 (unmethylated) to 1.0 (fully methylated). In the present study, the M-value [logit(β)] was used instead of the β-value to calculate the test statistics; however, for ease of interpretation, the β-value was employed to report differences in methylation levels between the groups. The association between M- and β-values were determined as follows: M-value=Log2[β-value/(1-β-value)]. For the identification of DMRs, CpG sites that did not target specific genes and sites without data in any patients were excluded. Additionally, differential expression analysis was conducted to reveal the DMRs between the RTE and SET groups with the R-package ‘bumphunter’ (version 1.10.0) (29). This package was used to determine regions of methylation that deviated from the baseline values. DMRs were defined as genomic regions of differential methylation between two populations with a P-value <0.001 and covering ≥3 CpG sites. The diagnostic potential of these DMRs was further investigated by producing receiver operating characteristic curves (ROCs) and calculating the area under the curve (AUC). The median β-value across the CpG sites in each DMR (m) was calculated and the difference of m between the two groups (d) was determined. Finally, DMRs with an AUC ≥0.6 and |d|>0.2 were included to build the predictive classifier, and were ranked in an ascending order of P-values obtained from the bumphunter analysis. In the first step of the classifier building, 3 DMRs per analysis were added into the predictive classifier. The mean m of each DMR across all patients in the STE group, representing relatively normal DNA methylation, was calculated as m. In the second step, a Pearson's correlation coefficient was calculated between the m and m of each patient, labeled as the r value (−1 to 1). A positive r value represented the expression profile, indicating an association with patients in the STE group; otherwise, the expression profile of patients was considered to be less associated with the STE group. In step three, the effect of r values on the prediction of patients with ER-positive BRCA resistant to ET was evaluated via the logistic regression analysis; the predictive classifier was then generated. Simultaneously, ROCs in addition to the AUC were used to assess the diagnostic potential of the predictive classifier. A 95% confidence interval (CI) of the AUC was calculated according to the order of the observed AUC values among 1,000 permutations. Finally, for the model selection, the Akaike information criterion (AIC) was used to determine the goodness of fit and the simplicity of the classifier. As aforementioned, 3 DMRs were included each time and the process (steps 1–3) was repeated until all 80 DMRs were included. A risk score (RS) was then able to be calculated based on the final classifier.

Model validation

As public datasets with both DNA methylation (determined via an HM450 array) and treatment information were unavailable, patients with ER-positive BRCA without explicit information regarding ET were used. The independent datasets, GSE75067 and GSE72251, which determined DNA methylation in 87 and 70 patients with ER-positive BRCA, respectively, were used as external validations of the predictive classifier. An RS was assigned to each patient; Kaplan-Meier (KM) survival analysis was then performed to investigate the association between RS and cumulative rates of overall survival (OS) and invasive disease-free survival (IDFS). As for GSE75067, univariate and multivariate analyses using Cox regressions were also performed to screen out the independent factors affecting OS.

Identification of the function of genes included in DMRs

Functional enrichment analysis was performed using the R-package ‘clusterProfiler’ (version 3.6.0) (36) to investigate the well-known database, Gene Ontology (GO; http://geneontology.org/) (37,38). The specific genes, which were mapped by DMR analysis, were annotated with GO ‘biological process’ (BP), ‘molecular function’ (MF) and ‘cellular component’ (CC) terms. GO terms of P<0.01 and Pa<0.05 obtained via the Benjamini and Hochberg method (39) were considered to be statistically enriched.

Correlation between DNA methylation and mRNA transcripts

Pearson's correlation coefficients were further calculated to reveal the correlation between DNA methylation and mRNA expression. Since the sample sizes of the groups were limited, the data of 787 tumor/adjacent tissues in the TCGA dataset were employed for the correlation analysis. Since one DMR could contain several CpG sites mapped to one specific gene, several Pearson's correlation analyses were conducted separately to examine the associations between these CpG sites and the mRNA level of the gene. The CpG site in a specific DMR with the lowest P-value was demonstrated to exhibit the strongest correlation with the mRNA expression, and presented.

Effects of numerous specific genes on relapse-free survival (RFS)

KM Plotter (http://kmplot.com/analysis/) (40,41), a tool containing the gene expression and survival data of >4,000 patients with BRCA, was used to perform KM survival analyses to further assess the association between mRNA expression and RFS. Patients with ER-positive BRCA and ET were selected, and divided into the high and low expression groups based on the median expression levels of each specific gene. Subsequently, survival curves were created and log-rank tests were conducted.

Results

Identification of DMRs associated with the response to ET in patients with ER-positive BRCA

As presented in Fig. 1A, patients with BRCA meeting the inclusion criteria were divided into two groups according to their DFS, namely the RTE and STE groups. Detailed clinicopathological and treatment information of these 32 patients were presented in Table I. Of them, five patients had received tamoxifen, while 14 patients had been treated with aromatase inhibitors (anastrozole, letrozole and aromasin). The remaining patients were treated with one type of these drugs for a period of time, and subsequently treated with another type. A Fisher's exact test and a Student's t-test were conducted to determine differences between these two groups. No statistically significant differences in TNM stage and receptors status were observed between the two groups. In the RTE group, the tumor types of eight patients (8/11, 72.7%) were TNM stage II and three (3/11, 27.3%) were TNM stage III; however, in the STE group, five (5/21, 23.8%), 12 (12/21, 57.1%) and four (4/21, 19.0%) tumor types were classified as TNM I, II and III stages (P=0.205), respectively. As for receptor status, nine tumor types (9/11, 81.8%) from the RTE group and 16 (16/21, 76.2%) tumor types from the STE group were PR-positive (P=1.000). In addition, only two tumor types (2/11, 18.2%) from the RTE group and two tumor types (2/21, 9.5%) from the STE group expressed HER-2 (P=0.738). The mean age of the patients in the RTE groups was slightly higher compared with that of the patients in the STE group; no statistical significance was observed (P=0.235). Based on the aforementioned results, the clinicopathological data of these two groups were comparable.
Figure 1.

Flow chart of patient selection and model construction. (A) Publicly available DNA methylation, RNA sequence and clinicopathological features of BRCA were downloaded from The Cancer Genome Atlas. Stringent criteria were set to select and classify patients into two groups, namely the RTE (n=11) group and the STE (n=21) group. (B) Bump hunting was used to identify DMRs, which was used to construct a predictive classifier. The final classifier had 60 DMRs and was validated in an independent dataset (GSE75067). BRCA, breast cancer; TNM, Tumor Node Metastasis; ERα, estrogen receptor α; RTE, resistant to endocrine therapy; STE, sensitive to endocrine therapy; ROC, receiver operating characteristic; AUC, area under the curve; DMR, differentially methylated regions; ET, endocrine therapy.

Table I.

Clinicopathological features of patients with breast cancer from The Cancer Genome Atlas included in the present study.

Patient IDAge (year)TNM stageDFS (month)Relapse/deathERPRHER2GroupDrug nameRegimen indicationRadiation therapyHistory of neoadjuvant treatment
TCGA-A2-A0YC59226.2YesPosPosNegRTEArimidexAdjuvantNo
TCGA-A7-A13E62218.3YesPosNegNegRTEArimidex, AnastrozoleAdjuvantNo
TCGA-A7-A13H61226.1YesPosPosNegRTEAnastrozoleNo
TCGA-A7-A26H72210.2YesPosNegPosRTEAnastrozoleAdjuvantNo
TCGA-A7-A42570314.6YesPosPosNegRTEArimidexYesNo
TCGA-AO-A0JA3634.7YesPosPosNegRTETamoxifenRecurrenceNo
TCGA-AO-A1263928.8YesPosPosNegRTETamoxifenRecurrenceNo
TCGA-E2-A10B6726.4YesPosPosNegRTEArimidexAdjuvantNo
TCGA-E9-A1N652222.1YesPosPosPosRTETamoxifen, LetrozoleNo
TCGA-E9-A1NF6023.0YesPosPosNegRTETamoxifenNo
TCGA-LQ-A4E473322.4YesPosPosNegRTEAnastrozoleYesNo
TCGA-A2-A04R361121.9NoPosPosNegSTETamoxifen, AnastrozoleAdjuvantNo
TCGA-A2-A0CR542107.9NoPosPosNegSTETamoxifen, AnastrazoleNo
TCGA-A2-A0EN702134.3NoPosPosNegSTEAnastrozole, TamoxifenAdjuvantNo
TCGA-A2-A0EP561118.4NoPosNegNegSTEArimidexNoNo
TCGA-A2-A25A442107.6NoPosPosNegSTEFemaraAdjuvantNo
TCGA-AO-A125722113.5NoPosPosNegSTETamoxifen, AromasinAdjuvantNo
TCGA-AQ-A04L482130.0NoPosNegPosSTETamoxifenAdjuvantNo
TCGA-AR-A0TT533108.9NoPosNegNegSTETamoxifen, AnastrozoleAdjuvantNo
TCGA-AR-A0U3592134.0NoPosPosNegSTEAnastrozoleAdjuvantNo
TCGA-AR-A1AK701103.8NoPosPosNegSTEAnastrozoleAdjuvantNo
TCGA-AR-A24H652160.8NoPosPosNegSTEAnastrozole, TamoxifenAdjuvantNo
TCGA-AR-A24M383120.2NoPosPosNegSTELetrozole, TamoxifenAdjuvantNo
TCGA-AR-A24Q492104.2NoPosNegNegSTEAnastrozoleAdjuvantNo
TCGA-AR-A24R453112.7NoPosPosNegSTELetrozole, TamoxifenAdjuvantNo
TCGA-AR-A24T463105.2NoPosPosNegSTELetrozole, TamoxifenAdjuvantNo
TCGA-AR-A24V522105.2NoPosPosNegSTETamoxifen, AnastrozoleNo
TCGA-AR-A2LE691138.1YesPosNegSTETamoxifenNo
TCGA-GM-A2DA462214.7YesPosPosPosSTETamoxifen, Aromasin, LetrozoleNo
TCGA-GM-A2DL501115.6NoPosPosNegSTETamoxifen, ArimidexNo
TCGA-GM-A2DM572106.0NoPosPosNegSTEArimidexNo
TCGA-GM-A2DN582101.5NoPosPosNegSTEArimidexNo

TNM stage was determined according to the staging system of American Joint Committee on Cancer, either version 5th, 6th or 7th (34,35). Pos, positive; neg, negative; RTE, resistant to endocrine therapy; STE, sensitive to endocrine therapy; DFS, disease free survival; ER, estrogen receptor status determined by immunohistochemistry; PR, progesterone receptor status determined by immunohistochemistry; HER2, human epidermal growth factor receptor 2 status determined by immunohistochemistry.

Aberrant methylation profiles of DMRs were identified between the RTE and STE groups (Fig. 1B). Genomic locations were grouped into clusters (regions) based on a maximum distance of 500 base pairs (bp); of the 135,418 genomic regions, 502 regions, including 5,252 CpG sites were significantly differently methylated (P<0.001) and had ≥3 individual CpG sites (Fig. 2A). Due to the limited sample sizes of these two groups, multiple testing correction was not conducted. Of the 502 DMRs, the median size was 803 bp, with a range of 41–3,509 bp; the median number of CpGs in each DMR was 10, with a range of 3–32. According to the annotation files, the majority of the 5,252 CpG sites were mapped to the promoter region (TSS1500 and TSS200), the gene body and the 5′-untranslated region (UTR) and 3′-UTR, in addition to the first exon (Fig. 2B). Of note, each CpG site may simultaneously be detected in the promoter and other regions due to the various transcripts of a specific gene. These 502 DMRs encompassed 562 specific genes, and genes in the top ranked 20 DMRs were calcium release activated channel regulator 2A (EFCAB4B), paraoxonase 3, homeobox C4 (HOXC4), podoplanin (PDPN), major histocompatibility complex, class II, DQ β2, helt bHLH transcription factor, major histocompatibility complex, class I, J (pseudogene)/zinc ribbon domain containing 1 antisense, pseudogene, proline rich transmembrane protein 1/palmitoyl-protein thioesterase 2, family with sequence similarity 24 member B/chromosome 10 open reading frame 88 pseudogene, achaete-scute family bHLH transcription factor 2, histone cluster 1 H4 family member L/histone cluster 1 H3 family member I, tenascin XB, EYA transcriptional coactivator and phosphatase 4, lymphotoxin α, epithelial stromal interaction 1 (EPSTI1), dimethylarginine dimethylaminohydrolase 2, GATA binding protein 5, heparan sulfate-glucosamine 3-sulfotransferase 1, major histocompatibility complex, class II, DOα and homeobox C8 (HOXC8) (genes in the top ranked 50 DMRs were listed in Table II). Functional enrichment analysis was performed on these genes. Consequently, 48 BP, 8 MF and only one CC GO terms were significantly enriched with a P-value <0.01 and a false discovery rate q-value <0.05 (Fig. 2C-E). The top-ranked representative 10 BP GO terms were ‘skeletal system development’ (P=6.24×10−7), ‘cell-cell adhesion via plasma-membrane adhesion molecules’ (P=1.64×10−15), ‘homophilic cell adhesion via plasma membrane adhesion molecules’ (P=9.51×10−21), ‘pattern specification process’ (P=7.88×10−7), ‘embryonic organ morphogenesis’ (P=1.86×10−6), ‘gland development’ (P=6.45×10−7), ‘forebrain development’ (P=9.95×10−8), ‘regulation of hormone levels’ (P=5.88×10−5), ‘cell fate commitment’ (P=4.35×10−11) and ‘muscle organ development’ (P=2.19×10−5). The associated genes are listed in Table III, including the HOX family of genes, such as HOXC4 and HOXC8, the protocadherin (PCDH) α gene cluster, such as PCDHA4, PCDHA7, PCDHA10, PCDH8 and PCDHGA1, and other genes known to affect the tumor growth and metastasis [including C-X-C motif chemokine ligand 12, SRY-box 2, E74 like ETS transcription factor 5 (ELF5), epidermal growth factor receptor, tumor necrosis factor and fibroblast growth factor receptor 1, etc.]. Additionally, these genes were significantly associated with the MF GO terms of ‘protein heterodimerization activity’ (P=1.34×10−4), ‘transcription factor activity, RNA polymerase II core promoter proximal region sequence-specific binding’ (P=1.10×10−6), ‘transcriptional activator activity, RNA polymerase II transcription regulatory region sequence-specific binding’ (P=5.79×10−7), ‘transcriptional activator activity, RNA polymerase II core promoter proximal region sequence-specific binding’ (P=3.85×10−5), ‘glucuronosyltransferase activity’ (P=1.38×10−7), ‘retinoid binding’ (P=1.21×10−4), ‘retinoic acid binding’ (P=2.77×10−6) and ‘oxidoreductase activity and acting on NAD(P)H, oxygen as acceptor’ (P=3.85×10−5). The only significant CC GO term was ‘transcription factor complex’ (P=1.31×10−5).
Figure 2.

Identification of DMRs by bump hunting. (A) DMRs were defined as the genomic regions differentially methylated between two populations at a P-value <0.001 (retained from bump hunting) and covering ≥3 CpG sites. In total, 502 out of 135,418 genomic regions were regarded as DMRs and represented as red points in the volcano plot, while other genomic regions were indicated as blue points. The x-axis represented the log-transformed FC of the β-value between the resistant to endocrine therapy group and the sensitive to endocrine therapy group, and the y-axis represented the log-transformed P-value retained from bump hunting. (B) Pie chart indicating the location of 5,252 CpG sites enclosed in these DMRs. As observed, the majority of the CpG sites mapped into the promoter region (TSS1500 and TSS200), and then the gene body, followed by the 5′UTR and 3′UTR and 1st Exon. Gene Ontology (C) Biological Process, (D) Molecular Function and (E) Cellular Component terms were enriched by 562 specific genes encompassed by DMRs. FC, fold change; DMR, differentially methylated regions; UTR, untranslated region.

Table II.

DMRs between patients resistant and sensitive to endocrine therapy.

Bumps (no.)ChrStart coordinate of DMREnd coordinate of DMRGene(s)Number of CpGs in DMRDMR P-value
234741238622213862597EFCAB4B121.28×10−5
9571979502519495026937PON3251.46×10−5
25363125444610054448913HOXC4262.21×10−5
201311390916113910796PDPN172.35×10−5
8806263272917432730299HLA-DQB2313.60×10−5
810424185938933185941625HELT234.27×10−5
8753862997402229975078HLA-J;NCRNA00171324.52×10−5
8795863211961632121249PRRT1;PPT2324.96×10−5
1561910124638756124639892FAM24B;LOC399815195.27×10−5
169831122913472292905ASCL2226.87×10−5
8735962784095727842098HIST1H4L;HIST1H3I168.00×10−5
8794663206377432064749TNXB318.77×10−5
907076133561614133562196EYA4199.02×10−5
8777363153960131540750LTA189.29×10−5
29316134356590143566902EPSTI1149.85×10−5
8783063169590331697276DDAH2329.90×10−5
66889206105056061051561GATA5151.00×10−4
7795241143002211431359HS3ST1121.03×10−4
8810063297587532978129HLA-DOA241.27×10−4
25345125440243154403314HOXC8121.42×10−4
12810105096999750970591OGDHL111.71×10−4
170991128900192891118KCNQ1DN301.74×10−4
844285140305713140306458PCDHAC1; PCDHA7; PCDHAC1; PCDHA12; PCDHA6; PCDHA10; PCDHA4; PCDHA11; PCDHA8; PCDHAC1; PCDHA6; PCDHA1; PCDHA2; PCDHA9; PCDHA1; PCDHAC1; PCDHA13; PCDHA5; PCDHA3; PCDHA10111.80×10−4
169891123972012397977CD81161.80×10−4
36576157266788372669149HEXA; C15orf34151.99×10−4
844345140345966140346403PCDHAC2; PCDHA7; PCDHA12; PCDHA6; PCDHA10; PCDHA4; PCDHA11; PCDHA8; PCDHA6; PCDHA1; PCDHA2; PCDHA1; PCDHA9; PCDHA13; PCDHA5; PCDHAC1; PCDHA3; PCDHAC2; PCDHA10102.01×10−4
233881228000552801584CACNA1C142.03×10−4
169851123217702323059C11orf21;TSPAN32262.09×10−4
46839174665516446656543HOXB4192.52×10−4
577819295135592952268GFI1122.56×10−4
835005112073348112073769APC152.70×10−4
2663912103351180103352454ASCL1132.76×10−4
109271X153236083153238579HCFC1; TMEM187192.81×10−4
7898547655554776556042CDKL2112.94×10−4
42578168871713488717989CYBA133.13×10−4
107070X1672956416731095CTPS2153.16×10−4
808254174449827174451468HAND2; NBLA00301143.34×10−4
5977827542583275428132TACR1213.35×10−4
611302127413363127414883GYPC123.47×10−4
862946291687293285DUSP22103.65×10−4
9373072728091427282444EVX1223.75×10−4
1531910118030848118034357GFRA1303.76×10−4
67499213444216034443672OLIG1143.93×10−4
10080985416362254164442OPRK1104.01×10−4
623242177052486177053496HOXD1124.35×10−4
8722662565238125652815SCGN104.38×10−4
110571248020350248021163TRIM58104.42×10−4
13473107511810375118887TTC18124.48×10−4
47489175947650559478068TBX2174.92×10−4
971727130125511130126871MEST164.98×10−4

Chr, chromosome; DMR, differentially methylated region.

Table III.

Top-ranked 10 biological process terms enriched by genes included in differentially methylated regions determined through the functional enrichment analysis.

DescriptionGene ratioGene IDP-value
Homophilic cell adhesion via plasma membrane adhesion molecules33/446PCDHA7/PCDHAC1/PCDHA12/PCDHA6/PCDHA10/PCDHA4/PCDHA11/PCDHA8/PCDHA1/PCDHA2/PCDHA9/PCDHA13/PCDHA5/PCDHA3/PCDHAC2/GYPC/FAT1/SDK1/IGSF9B/PCDH8/CDH7/PCDHGA4/PCDHGA6/PCDHGA1/PCDHGA5/PCDHGB1/PCDHGB4/PCDHGA3/PCDHGA8/PCDHGA2/PCDHGA7/PCDHGB2/PCDHGB39.51×10−21
Cell-cell adhesion via plasma-membrane adhesion molecules33/446PCDHA7/PCDHAC1/PCDHA12/PCDHA6/PCDHA10/PCDHA4/PCDHA11/PCDHA8/PCDHA1/PCDHA2/PCDHA9/PCDHA13/PCDHA5/PCDHA3/PCDHAC2/GYPC/FAT1/SDK1/IGSF9B/PCDH8/CDH7/PCDHGA4/PCDHGA6/PCDHGA1/PCDHGA5/PCDHGB1/PCDHGB4/PCDHGA3/PCDHGA8/PCDHGA2/PCDHGA7/PCDHGB2/PCDHGB31.64×10−15
Cell fate commitment28/446ASCL1/EVX1/OLIG1/TBX2/ELF5/TRIM15/SOX2/NKX6-2/NOTCH4/FGFR1/GDF7/FGF10/PROX1/WT1/SOX8/BCL11B/EBF2/PITX1/GSX1/GLI3/FGF13/PAX7/NKX2-5/LBX1/GATA3/NR2F2/TGFB1I1/GATA24.35×10−11
Forebrain development29/446ASCL1/CXCL12/SOX2/KCNA1/FGFR1/GDF7/FGF10/NPY/SRD5A2/PROX1/ALK/BCL11B/PITX1/GSX1/GLI3/FGF13/DLX5/AQP1/DAB2IP/EGFR/NR2F2/RARB/TACC2/DUOX2/TRAPPC9/GATA2/PITX2/HTR5A/INHBA9.95×10−8
Skeletal system development33/446HOXC4/HOXC8/HOXB4/HAND2/HOXD1/HOXB5/HOXD9/HAPLN3/HOXD4/FGFR1/TBX15/SRD5A2/COL11A2/PITX1/HOXC6/HOXC5/GLI3/DLX5/COL1A2/PAX7/GNAS/CDX1/TLL1/ALPL/RUNX3/RARB/CDKN1C/PITX2/SHOX2/BARX2/BMP8B/COL2A1/MEIS16.24×10−7
Gland development30/446ASCL1/HAND2/TBX2/ELF5/GPX1/HOXD9/SOX2/KALRN/TNF/NOTCH4/FGFR1/GDF7/FGF10/PROX1/WT1/BCL11B/PITX1/GSX1/GLI3/BSX/LIMS2/NKX2-5/GATA3/EGFR/CDKN1C/DUOX2/GATA2/PITX2/UGT1A1/IRS26.45×10−7
Embryonic organ development30/446HOXC4/ASCL2/HOXB4/HAND2/TBX2/HOXB5/HOXD9/TNF/HOXD4/FGFR1/CITED1/TBX15/FGF10/VANGL2/PROX1/EN2/GLI3/DLX5/GNAS/NKX2-5/LBX1/GATA3/EGFR/NR2F2/RARB/CDKN1C/GATA2/PITX2/SHOX2/COL2A17.50×10−7
Pattern specification process30/446HOXC4/HOXC8/HOXB4/ASCL1/HAND2/EVX1/TBX2/HOXB5/HOXD9/NKX6-2/HOXD4/FGFR1/CITED1/IRX4/FGF10/VANGL2/GDNF/WT1/HOXC6/HOXC5/GLI3/PAX7/CDX1/SYNGAP1/NKX2-5/LBX1/PCDH8/NR2F2/PITX2/BCOR7.88×10−7
Embryonic organ morphogenesis23/446HOXC4/HOXB4/HAND2/TBX2/HOXB5/HOXD9/HOXD4/FGFR1/TBX15/FGF10/VANGL2/PROX1/GLI3/DLX5/GNAS/NKX2-5/LBX1/GATA3/RARB/GATA2/PITX2/SHOX2/COL2A11.86×10−6
Regulation of hormone levels28/446CACNA1C/TACR1/OPRK1/TRH/KALRN/TNF/FGFR1/DUOX1/DUOXA1/SRD5A2/SOX8/GALR1/P2RY1/GNAS/KCNS3/GATA3/EGFR/DUOX2/DUOXA2/UCN/RAB11FIP3/UGT1A1/UGT1A8/UGT1A3/UGT1A9/UGT1A7/IRS2/INHBA5.88×10−5

Classifier building for predicting the response to ET in patients with ER-positive BRCA based on DMR patterns

Following the process presented in Fig. 1B, 502 DMRs were identified, of which 457 DMRs had an AUC ≥0.6. Finally, 80 DMRs remained to build the predictive classifier set (with a difference of m between the RTE and STE groups >0.2). As presented in Fig. 3A, the logistic regression, including 60 DMRs, had the highest AUC to distinguish the RTE group from the STE group, and a relatively low AIC to indicate the goodness of fit and the simplicity of the classifier. Therefore, these 60 DMRs listed in Table IV were included. Among them, 31 DMRs, including EFCAB4B, secretoglobin family 3A member 1 (SCGB3A1) and dual oxidase maturation factor 1 (DUOXA1), were hypermethylated in the RTE group, and the remaining DMRs, including tripartite motif containing 58 and ELF5, were hypomethylated in the RTE group. Unsupervised hierarchical clustering analysis of these patients in the RTE and STE groups was performed based on these 60 DMRs. As presented in Fig. 3B, patients were divided into two classes, as follows: Class 1 was mainly enriched for patients in the STE group (20/21), while class 2 comprised 10 patients in the RTE group and one in the STE group. In total, only two samples (2/32, 6.25%) were incorrectly sorted into the wrong group.
Figure 3.

Construction and validation of the predictive classifier. (A) When modeling, 3 DMRs were included into the prediction classifier each time, followed by an assessment of the m in the STE group and the calculation of the r-value. The effect of the r-values was evaluated using the logistic regression, and AUC and AIC were simultaneously created to assess its diagnostic capacity, the goodness of fit and the simplicity of the classifier. The 95% CI of the AUC were calculated following 1,000 permutations. The left y-axis represented the AIC values, and the right y-axis represented the AUC values. The x-axis indicated the number of DMRs included in the logistic regression. (B) Hierarchical clustering was conducted on the DMRs. Each row of the heat map represented one of the DMRs with each column representing a different sample belonging to the RTE or STE group. RS was retained from each patient in the independent dataset GSE75067 on the basis of the predictive classifier. Kaplan-Meier analyses with log-rank tests were used to assess the effect of RS on OS time. The cutoff value of RS was set to be either (C) 0.2 or (D) 0.5. Similarly, for GSE72251, RS was assigned to each patient and Kaplan-Meier analyses were also used to assess the association between RS and OS, when the cutoff value was set to be (E) 0.2 or (F) 0.5. In addition, survival curves were also created by Kaplan-Meier analyses for invasive disease-free survival time, and the cutoff of RS was set to (G) 0.2 or (H) 0.5. AIC, Akaike information criterion; AUC, area under the curve; CI, confidence interval; RS, risk score; DMR, differentially methylated regions; RTE, resistant to endocrine therapy; STE, sensitive to endocrine therapy; OS, overall survival.

Table IV.

Differentially methylated regions for predicting patients who are resistant to endocrine therapy.

Bumps (no.)Gene symbolr-value[a]P-value[b]DNA methylation (resistant/sensitive to endocrine therapy)ChrNo. of CpG sitesP-values[c]
23474EFCAB4B−0.3583.20×10−25Up12121.28×10−5
2013PDPN−0.2102.94×10−9Up1172.35×10−5
87359HIST1H4L, HIST1H3I−0.233, −0.2703.41×10−11, 1.26×10−14Up6168.00×10−5
29316EPSTI1−0.5812.89×10−72Up13149.85×10−5
23388CACNA1C0.3051.86×10−18Up12142.03×10−4
83500APC0.0753.57×10−2Up5152.70×10−4
109271HCFC1, TMEM1870.072, −0.0764.37×10−2, 3.27×10−2DownX192.81×10−4
87226SCGN−0.1943.83×10−8Down6104.38×10−4
11057TRIM58−0.6191.39×10−84Down1104.42×10−4
93160MIR589, FBXL18-, −0.255-, 4.23×10−13Down7125.07×10−4
18397ELF5−0.5942.53×10−76Down11135.32×10−4
45490SLFN12−0.5533.98×10−64Up1795.33×10−4
72514C3orf18−0.5673.93×10−68Up375.53×10−4
86221SCGB3A1−0.3097.02×10−19Up5145.56×10−4
19965SIPA1−0.2601.13×10−13Up11126.43×10−4
53071ACP5−0.2223.18×10−10Up19116.58×10−4
67311MIR155HG−0.4241.01×10−35Up2177.44×10−4
16291NKX6-2−0.0481.76×10−1Down10128.42×10−4
56468ZNF880−0.4643.35×10−43Up1999.37×10−4
78770IGFBP70.0841.82×10−2Down481.01×10−3
12633ALOX5−0.1898.55×10−8Up1071.06×10−3
20753PHOX2A−0.2251.79×10−10Down1171.09×10−3
33797PPP2R5C0.0685.51×10−2Up1451.13×10−3
51880NFIC0.1408.11×10−5Up1951.17×10−3
87280HIST1H4F−0.2281.14×10−10Up671.25×10−3
94632IKZF1−0.2654.47×10−14Down761.25×10−3
67598CBR1−0.6213.20×10−85Up2161.27×10−3
102219ZNF572−0.6451.22×10−93Up881.30×10−3
35402DUOX1, DUOXA1−0.456, −0.4851.08×10−41, 1.03×10−47Up1591.36×10−3
108507CHRDL10.1207.29×10−4DownX91.61×10−3
69212PLA2G3−0.5813.43×10−72Down2261.63×10−3
23113GLB1L3−0.1758.25×10−7Down1171.89×10−3
98905VIPR2−0.4332.76×10−37Down761.92×10−3
28896GSX1−0.0948.09×10−3Down1391.98×10−3
88104HLA-DPB1−0.3008.83×10−18Up692.26×10−3
86545CDYL−0.0734.05×10−2Up662.30×10−3
11209DIP2C−0.1943.95×10−8Down10152.50×10−3
97788FAM115A0.0618.98×10−2Up742.54×10−3
84265LOC389333−0.1653.14×10−6Up582.58×10−3
17233TRIM68−0.5735.39×10−70Up1182.63×10−3
57826KCNS3−0.2271.13×10−10Up282.65×10−3
30133DOCK9−0.1197.89×10−4Down1362.74×10−3
108522ALG13−0.1226.13×10−4DownX52.92×10−3
7843SLAMF1−0.2701.33×10−14Up163.20×10−3
99079ARHGEF10−0.0304.00×10−1Down843.47×10−3
40848MT1G−0.5367.32×10−60Down1663.70×10−3
26270LIN7A−0.2917.61×10−17Up1244.08×10−3
57453TTC150.1953.45×10−8Down254.13×10−3
77073CTBP1−0.1757.47×10−7Down464.15×10−3
42328FBXO31−0.0753.59×10−2Up1634.28×10−3
85239EBF10.0353.28×10−1Down544.31×10−3
19658FERMT30.1024.24×10−3Down1144.32×10−3
7847CD48−0.1435.38×10−5Up144.56×10−3
102571TRAPPC90.0609.15×10−2Down835.43×10−3
49506HEXDC0.0782.90×10−2Down1735.80×10−3
81036ACSL10.0609.37×10−2Down435.89×10−3
8503RASAL2−0.2791.67×10−15Down135.93×10−3
40647ZNF4230.3844.02×10−29Up1646.19×10−3
393PRKCZ−0.0656.77×10−2Down136.31×10−3
11176DIP2C−0.1943.95×10−8Down1046.67×10−3

Pearson's correlation coefficient between mRNA expression and DNA methylation

P-values were obtained from Pearson's correlation analyses

P-values represented the statistical significance of differentially methylated regions on identifying patients with estrogen receptor-positive breast cancer sensitive to endocrine therapy; Chr, chromosome.

Based on the aforementioned 60 DMRs, an equation based on the aforementioned logistic regression, ln(p/1-p)=7.207–12.610x, was generated to evaluate the probability of resistance to ET in patients with ER-positive BRCA. The denoting of letters in the equation is as follows: p, the probability of resistance to ET, and x, r values between m and m across 60 DMRs in each patient. Thus, the RS was generated and described as follows: RS=e7.207–12.610×/(1+ e7.207–12.610×).

Application of RS on the prognosis of patients with ER-positive BRCA in two additional independent test datasets

The GSE75067 dataset containing the data of 87 patients with ER-positive BRCA was used instead to externally analyze the prognostic power of the RS on BRCA. The RS of each ER-positive patient was calculated; the median RS among these patients was 0.161, with a range of 0.051–0.982. As observed in Fig. 3C and D, whether the cutoff value of RS was 0.2 or 0.5, patients with higher RS values tended to exhibit significantly shorter OS times compared with those with lower RS values (log-rank test, P=0.042 and P<0.001, respectively). Furthermore, Cox regression analysis indicated that, following the adjustment for age, tumor type, lymph node status and PR expression, RS was an independent predictor for OS in patients with ER-positive BRCA (hazard ratio: 2.551; 95% confidence interval, 1.048–6.206; P=0.039; Table V).
Table V.

Univariate and multivariate Cox regressions in the dataset GSE75067.

UnivariateMultivariate


Clinical featuresHR (95%CI)P-valueHR (95%CI)P-value
Age (years, >50/≤50)1.286 (0.677, 2.442)0.4421.688 (0.800, 3.563)0.170
Tumor types (non-ductal/ductal)0.613 (0.290, 1.294)0.1990.528 (0.224, 1.245)0.145
Lymph node status (positive/negative)4.199 (2.096, 8.411)<0.0015.319 (2.440, 11.596)<0.001
Progesterone receptor expression (positive/negative)0.472 (0.215, 1.036)0.0611.282 (0.429, 3.835)0.656
Risk Score (>0.5/≤0.5)3.463 (1.742, 6.887)<0.0012.551 (1.048, 6.206)0.039

HR, hazard ratio; CI, confidence interval.

Additionally, the GSE72251 dataset containing the data of 70 patients with tumor types expressing ER was used for further validation; the median RS among these patients was 0.227, with a range of 0.053–0.957. As presented in Fig. 3E, patients with RS≤0.2 tended to have a better OS times, but statistical significance was not observed (log-rank test, P=0.717); however, the OS times of patients with RS ≤0.5 were significantly longer compared with that of their counterparts with RS >0.5 (log-rank test, P=0.006; Fig. 3F). IDFS data were available in this dataset and were also analyzed. As presented in Fig. 3G, no significant differences in IDFS were observed between the two groups (patients with BRCA and RS >0.2 compared with those with RS ≤0.2); however, patients assigned a higher RS exhibited a significantly longer IDFS compared with those with a lower RS when the cutoff of RS was set as 0.5 (P=0.009; Fig. 3H).

Correlation between DNA methylation and mRNA expression in DMRs included in the predictive classifier

Of the 60 DMRs, 63 specific genes were encompassed, and Pearson's correlation coefficients were determined to reveal the effects of epigenetic regulation. The CpG site in one specific DMR exhibiting the strongest correlation with mRNA expression was identified. The majority of the genes (55/63) had a statistically significant correlation between DNA methylation and mRNA expression with a P-value <0.05; 17 genes had an r-value ≤-0.3 (Table IV). Numerous representative DMRs and their correlation with mRNA expression in specific genes were presented in Fig. 4. Of these genes, EFCAB4B (P=1.28×10−5), Schlafen family member 12 (SLFN12; P=5.33×10−4), chromosome 3 open reading frame 18 (C3orf18; P=5.53×10−4), zinc finger protein 880 (ZNF880; P=9.37×10−4), dual oxidase 1 (DUOX1; P=1.08×10−41) and major histocompatibility complex, class II, DPβ1 (HLA-DPB1; P=2.27×10−3) were hypermethylated in patients in the RTE group, while ELF5 (P=5.32×10−4), phospholipase A2 group III (PLA2G3; P=1.63×10−3), metallothionein 1G (MT1G; P=3.70×10−3), C-terminal binding protein 1 (CTBP1; P=4.15×10−3), ALG13 UDP-N-acetylglucosaminyltransferase subunit (ALG13; P=2.92×10−3) and RAS protein activator like 1 (RASAL1; P=5.93×10−3) were hypomethylated in patients in the RTE group compared with that in the STE group. Among them, ELF5 (r=−0.594, P=2.53×10−76) exhibited the strongest negative correlation with mRNA expression, followed by PLA2G3 (r=−0.581, P=3.43×10−72), C3orf18 (r=−0.567, P=3.93×10−68), MT1G (r=−0.536, P=7.32×10−60), SLFN12 (r=−0.533, P=3.98×10−64), ZNF880 (r=−0.464, P=3.35×10−43), DUOX1 (r=−0.456, P=1.08×10−41), EFCAB4B (r=−0.358, P=3.20×10−25), HLA-DPB1 (r=−0.300, P=8.83×10−18), RASAL2 (r=−0.279, P=1.67×10−15), CTBP1 (r=−0.175, P=7.47×10−7) and ALG13 (r=−0.122, P=6.13×10−4).
Figure 4.

DMRs of a number of specific genes between the RTE and STE groups and their association with mRNA expression levels. Left: The points exhibited methylation measurements against genomic location. A total of 11 RTE and 21 STE samples were presented and represented by red points and green points, respectively, at each genomic location. The curves represent the smooth estimate of the population-level methylation profiles for RTE (red) and STE (green) samples. Right: Methylation measurements (M values) of a number of DMRs were plotted against the log-transformed mRNA expression. Each point represented an individual sample from 787 tumor/adjacent tissues measured using DNA methylation and RNA sequencing data. The specific gene covering the DMRs is presented in the top left. DMR, differentially methylated regions; RTE, resistant to endocrine therapy; STE, sensitive to endocrine therapy; EFCAB4B, calcium release activated channel regulator 2A; ELF5, E74 like ETS transcription factor 5; SLFN12, Schlafen family member 12; PLA2G3, phospholipase A2 group III; C3orf18, chromosome 3 open reading frame 18; MT1G, metallothionein 1G; ZNF880, zinc finger protein 880; CTBP1, C-terminal binding protein 1; DUOX1, dual oxidase 1; ALG13, ALG13 UDP-N-acetylglucosaminyltransferase subunit; HLA-DPB1, major histocompatibility complex, class II, DPβ1; RASAL2, RAS protein activator like 2.

Effects of the mRNA expression levels of numerous genes on the prognosis of patients with ER-positive BRCA and ET

Considering the association between DNA methylation and mRNA expression levels in the majority of the DMRs included in the prediction model, the effects of the mRNA expression levels of numerous genes on the prognosis of patients with ER-positive BRCA and ET were investigated using independent datasets in the KM Plotter tool. The cutoff value of the mRNA expression level for each gene was set as the median. A number of representative genes are presented in Fig. 5. Patients with higher mRNA expression levels of C3orf18 (log-rank P=0.003), ZNF880 (P=0.035), DUOX1 (P=0.013) and HLA-DPB1 (P=0.033) exhibited significantly longer RFS times compared with those with lower mRNA expression levels. Conversely, patients with lower expression levels of RASAL2 exhibited a significantly longer RFS (P=0.006) compared with those with increased expression levels. Additionally, separate survival curves associated with EFCAB4B (P=0.170), SLFN12 (P=0.063), CTBP1 (P=0.170) and ALG13 (P=0.092) expression were generated; however, statistical significance was not observed.
Figure 5.

Prognostic significance of mRNA expression of a number of specific genes in estrogen receptor-positive patients with BRCA receiving ET. Effect of mRNA expression on the relapse-free survival time of patients with BRCA receiving ET was assessed using the Kaplan Meier plotter. P-values were obtained from Kaplan Meier analysis with a log-rank test. ET, endocrine therapy; BRCA, breast cancer; EFCAB4B, calcium release activated channel regulator 2A; SLFN12, Schlafen family member 12; C3orf18, chromosome 3 open reading frame 18; ZNF880, zinc finger protein 880; DUOX1, dual oxidase 1; HLA-DPB1, major histocompatibility complex, class II, DPβ1; CTBP1, C-terminal binding protein 1; ALG13, ALG13 UDP-N-acetylglucosaminyltransferase subunit; RASAL2, RAS protein activator like 2.

Discussion

Previous studies have indicated that dysregulated DNA methylation is associated with carcinogenesis and therapeutic effectiveness (16,17,42). In the present study, DMRs were identified via the bumphunting analysis, followed by the building of a predictive classifier to identify ET-responsive patients with ER-positive BRCA. The RS was then calculated, which served as an indicator to classify patients with ER-positive BRCA into two groups with distinct survival outcomes in two additional independent datasets. Functional enrichment analyses demonstrated that genes with DMRs associated with ET sensitivity were associated organ morphogenesis and development and cell-cell adhesion. The present study mainly reported on two groups of genes, namely the PCDH family and homeobox genes. PCDHs, as part of the cadherin superfamily, were originally identified in the rat brain via polymerase chain reaction analysis and were associated with certain types of neurological disease (43,44). Previously, aberrant PCDH expression was observed in a variety of human malignant tumor types, potentially due to post-translational regulatory mechanisms, including DNA methylation (45). In the present study, it was reported that the methylation status of numerous PCDHs, including PCDHA4, PCDHA7, PCDHA10, PCDH8 and PCDHGA1 may be associated with the resistance to ET. In addition, the tumor suppressor and oncogenic functions of PCDHs have been reported in BRCA (46,47). The effects of certain PCDHs, including PCDH10, have been associated with fulvestrant resistance in BRCA (48). The function of PCDHs is associated with numerous signaling pathways, including the Wnt/β-catenin (49) and receptor tyrosine kinase (50) pathways, which have been proposed to be associated with tamoxifen resistance. The results of the present study suggested a potential association between PCDHs and resistance to ET; however, further investigation is required. Additionally, the HOX genes encode a family of highly conserved homeodomain-containing transcription factors that serve crucial functions during embryogenesis (51). In BRCA, the expression of numerous HOX genes has been reported to be up- or downregulated, which may be associated with carcinogenesis, metastasis and tamoxifen resistance (52,53). It was proposed that HOX genes contribute to a major part of DNA methylation profiles in BRCA subtypes (54). In the present study, the methylation patterns of numerous HOX genes, including HOXB4, HOXB5, HOXC4, HOXC8, HOXD1 and HOXD4, were observed to be associated with resistance to ET. DNA methylation of gene promoters may downregulate transcriptional expression, affecting tumorigenesis or the progression of tumor types (52). Therefore, gene promoters have become the focus of research to investigate DNA methylation. Li et al (55) reported that 3.3% inter-tumor gene expression maybe attributed to DNA methylation in gene promoters. The present study revealed that the locations of DMRs were primarily in the gene promoter, namely TSS200 and TSS1500; however, other regions, including the gene body, additionally exhibited a large proportion of methylation, which indicated that other regions containing CpG sites may regulate gene expression, contributing to ET resistance. Li et al (55) revealed that in addition to the gene promoter, other regions may substantially affect inter-tumor gene expression. For instance, enhancer methylation was associated with 4.0% of inter-tumor gene expression variation (56). Compared with a single CpG site, the varied methylation of genomic regions containing a number of CpG sites was more stable. Integrating the bumphunting method and logistic regression, 60 DMRs were reported to have the potential to identify patients with ER-positive BRCA and ET resistance. Due to the inadequate treatment information of the data from TCGA and the stringent criteria set, sample sizes in the present study were limited. Therefore, 11 patients with BRCA possessing a DFS ≤30 months were regarded as exhibiting ET resistance, while 21 patients exhibiting a DFS >100 months were regarded as sensitive to ET (57,58). The limited sample sizes may reduce the comparative power of the identification of DMRs. Therefore, multiple test adjustment was not applied to retain potentially genuine biomarkers. Furthermore, the GSE75067 and GSE72251 datasets lacking treatment information were included to externally validate the model proposed in the present study. The survival curves, particularly when the cutoff value of RS was set as 0.5, demonstrated notable curve separation in patients with ER-positive BRCA. These survival analyses indicated the potential application of RS in the prediction of the prognosis of patients with ER-positive BRCA and suggested its potential for identifying patients resistant to ET; however, cohorts with a large sample size are required to further support this predictive classifier. Pearson's correlation analyses revealed that the majority of DMRs (46/60) in the predictive model of the present study exhibited a negative correlation with the transcript expression. Numerous genes in this model, including ELF5 (59), CTBP1 (60) and zinc finger protein 423 (ZNF423) (61), have previously been reported to be involved in anti-estrogen resistance. Elevated expression levels of ELF5 were detected in luminal BRCA cells that had acquired resistance to tamoxifen (59). In addition, ELF5 may be a key transcriptional determinant of BRCA molecular subtypes by suppressing estrogen sensitivity in luminal BRCA cells (59). Furthermore, as a corepressor, CTBP1 was reported to be associated with the silencing of ubiquitin-conjugating enzyme E2 D1 and simultaneously elevated cyclin D1 expression levels, which may underlie the mechanism of acquired resistance to 4-hydroxytamoxifen (60). It was demonstrated that ZNF423 may be an estrogen-inducible BRCA1 transcription factor, and may contribute to variations in selective ER modulators in the prevention of BRCA (61). Additionally, a number of other genes, namely APC regulator of WNT signaling pathway, PDPN, EPSTI1, SCGB3A1, signal-induced proliferation-associated 1, acid phosphatase 5, tartrate resistant, MIR155 host gene, insulin like growth factor binding protein 7, arachidonate 5-lipoxygenase, protein phosphatase 2 regulatory subunit B′γ, nuclear factor I C, IKAROS family zinc finger 1, carbonyl reductase 1, DUOXA1, chordin like 1, disco interacting protein 2 homolog C, signaling lymphocytic activation molecule family member 1, Rho guanine nucleotide exchange factor 10, MT1G, lin-7 homolog A, crumbs cell polarity complex component, F-box protein 31, EBF transcription factor 1, trafficking protein particle complex 9, acyl-CoA synthetase long chain family member 1, RASAL2, ZNF423, protein kinase CΖ, PLA2G3 and ALG13 have been reported to be involved in the development of cancer (62–65), including BRCA; however, their association with ET resistance remains unknown. For instance, previous studies have identified the increased methylation of SCGB3A1 in metastases compared with that in primary breast tumor types (62,63). In non-invasive MCF7 cells, DUOXA1 expression was upregulated compared with that in highly metastatic cells; DUOXA1 overexpression sensitized cells to doxorubicin (64). In the present study, the increased DNA methylation of SCGB3A1 and DUOXA1 were observed in the RTE group, indicating their potential function in resistance to anti-estrogenic treatment. Additionally, a significant difference in the methylation frequencies and expression levels of MT1G was reported between BRCA subtypes (65). MT1G hypomethylation in patients who were resistant to ET was detected in the present study, indicating its association with antiestrogen therapy. The present study also reported numerous genes that have been rarely investigated in cancer research, including EFCAB4B. EFCAB4B is a Ca2+-binding protein that serves a key function in store-operated calcium entry in T-cells (66). A previous study demonstrated EFCAB4B hypermethylation in a twin with rheumatoid arthritis compared with their healthy co-twin (67). The results of the present study indicate a putative function of EFCAB4B in ET or potential immune/inflammatory alterations in the tumor microenvironment; however, further investigation is required. Generally, the targeting of numerous genes has been reported to be superior to targeting an individual target, and DNA methylation patterns have become a promising diagnostic tool in addition to gene transcripts in BRCA. In the present study, a number of DMRs were detected between patients with ET resistance and those sensitive to ET; DMRs were used to build a predictive classifier. Furthermore, an RS was generated based on the classifier, which may determine the distinct outcomes of patients with ER-positive BRCA, suggesting a beneficial function in the identification of patients who are resistant to ET. Additionally, a potential function underlying the development of BRCA and resistance to ET was indicated for a number of genes (EFCAB4B and SLFN12); however, further investigation using a larger cohort is required. The present study primarily proposed a useful tool for assessing patient responses to ET and a number of potential therapeutic targets to promote the sensitivity of patients to ET with ER-positive BRCA.
  66 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

Review 2.  Cadherin superfamily genes: functions, genomic organization, and neurologic diversity.

Authors:  T Yagi; M Takeichi
Journal:  Genes Dev       Date:  2000-05-15       Impact factor: 11.361

3.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository.

Authors:  Ron Edgar; Michael Domrachev; Alex E Lash
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

Review 4.  Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals.

Authors:  Rudolf Jaenisch; Adrian Bird
Journal:  Nat Genet       Date:  2003-03       Impact factor: 38.330

5.  A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer.

Authors:  Soonmyung Paik; Steven Shak; Gong Tang; Chungyeul Kim; Joffre Baker; Maureen Cronin; Frederick L Baehner; Michael G Walker; Drew Watson; Taesung Park; William Hiller; Edwin R Fisher; D Lawrence Wickerham; John Bryant; Norman Wolmark
Journal:  N Engl J Med       Date:  2004-12-10       Impact factor: 91.245

6.  Very high frequency of hypermethylated genes in breast cancer metastasis to the bone, brain, and lung.

Authors:  Jyoti Mehrotra; Mustafa Vali; Megan McVeigh; Scott L Kominsky; Mary Jo Fackler; Jaana Lahti-Domenici; Kornelia Polyak; Nicoletta Sacchi; Elizabeth Garrett-Mayer; Pedram Argani; Saraswati Sukumar
Journal:  Clin Cancer Res       Date:  2004-05-01       Impact factor: 12.531

7.  Comparison of stage-specific outcome of breast cancer based on 5th and 6th AJCC staging system.

Authors:  Seung Il Kim; Byeong Woo Park; Kyong Sik Lee
Journal:  J Surg Oncol       Date:  2006-03-01       Impact factor: 3.454

Review 8.  Novel mechanisms of resistance to endocrine therapy: genomic and nongenomic considerations.

Authors:  Anupama E Gururaj; Suresh K Rayala; Ratna K Vadlamudi; Rakesh Kumar
Journal:  Clin Cancer Res       Date:  2006-02-01       Impact factor: 12.531

Review 9.  Endocrine-responsive breast cancer and strategies for combating resistance.

Authors:  Simak Ali; R Charles Coombes
Journal:  Nat Rev Cancer       Date:  2002-02       Impact factor: 60.716

Review 10.  Epigenetics as a mechanism driving polygenic clinical drug resistance.

Authors:  R M Glasspool; J M Teodoridis; R Brown
Journal:  Br J Cancer       Date:  2006-04-24       Impact factor: 7.640

View more
  5 in total

1.  Microarray expression profiles and bioinformatics analysis of mRNAs, lncRNAs, and circRNAs in the secondary temozolomide-resistant glioblastoma.

Authors:  Chengbin Zhao; Yuyuan Gao; Ruiming Guo; Hongwei Li; Bo Yang
Journal:  Invest New Drugs       Date:  2019-12-10       Impact factor: 3.850

2.  Targeting DNMTs to Overcome Enzalutamide Resistance in Prostate Cancer.

Authors:  Elia Farah; Zhuangzhuang Zhang; Sagar M Utturkar; Jinpeng Liu; Timothy L Ratliff; Xiaoqi Liu
Journal:  Mol Cancer Ther       Date:  2021-11-02       Impact factor: 6.009

3.  Candidate methylation sites associated with endocrine therapy resistance in ER+/HER2- breast cancer.

Authors:  Maryam Soleimani Dodaran; Simone Borgoni; Emre Sofyalı; Pernette J Verschure; Stefan Wiemann; Perry D Moerland; Antoine H C van Kampen
Journal:  BMC Cancer       Date:  2020-07-19       Impact factor: 4.430

4.  Comparison of a histone deacetylase inhibitor plus exemestane with exemestane alone in hormone receptor-positive advanced breast cancer that progressed on prior endocrine therapy: A meta-analysis.

Authors:  Liang Xu; Weifan Jiang; Wenwei Li; Chungen Guo; Lihua Luo; Yufeng Gao; Yali Cao
Journal:  Exp Ther Med       Date:  2022-07-18       Impact factor: 2.751

5.  Comprehensive methylome sequencing reveals prognostic epigenetic biomarkers for prostate cancer mortality.

Authors:  Ruth Pidsley; Dilys Lam; Wenjia Qu; Timothy J Peters; Phuc-Loi Luu; Darren Korbie; Clare Stirzaker; Roger J Daly; Phillip Stricker; James G Kench; Lisa G Horvath; Susan J Clark
Journal:  Clin Transl Med       Date:  2022-10
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.