Literature DB >> 30832724

Discovery and validation of DNA methylation markers for overall survival prognosis in patients with thymic epithelial tumors.

Songlin Li1, Yuan Yuan2, He Xiao1, Jiajia Dai1, Yunfei Ye1, Qin Zhang1, Zhimin Zhang1, Yuhan Jiang1, Jia Luo1, Jing Hu3, Chuan Chen1, Ge Wang4.   

Abstract

BACKGROUND: The current prognosis of thymic epithelial tumors (TETs) is according to the World Health Organization (WHO) histologic classification and the Masaoka staging system. These methods of prognosis have certain limitations in clinical application and there is a need to seek new method for determining the prognosis of patients with TETs. To date, there have been no studies done on the use of DNA methylation biomarkers for prognosis of TETs. The present study was therefore carried out to identify DNA methylation biomarkers that can determine the overall survival in patients with TETs.
METHODS: Bioinformatic analysis of TCGA 450 K methylation array data, transcriptome sequencing data, WHO histologic classification and Masaoka staging system was performed to identify differentially expressed methylation sites between thymoma and thymic carcinoma as well as the different DNA methylation sites associated with the overall survival in patients with TETs. Using pyrosequencing, 4 different methylation sites (cg05784862, cg07154254, cg02543462, and cg06288355) were sequenced from tumor tissues of 100 Chinese patients with TETs. A prognostic model for TETs was constructed using these four methylation sites.
RESULTS: The TCGA dataset showed 5155 and 6967 hyper- and hypomethylated CpG sites in type A-B3 group and type C group, respectively, of which 3600 were located within the gene promoter regions. One hundred thirty-four genes were silenced by promoter hypermethylation and 174 mRNAs were upregulated. Analysis of univariate and multivariate Cox regression showed significant association between the methylation levels of 187 sites and the overall survival in patients with TETs. cg05784862(KSR1), cg07154254(ELF3), cg02543462(ILRN), and cg06288355(RAG1) were identified as independent prognostic factors for overall survival in patients with TETs after adjusting for Masaoka staging in 100 Chinese patients. The prognostic model which consists of the four abovementioned genes had higher accuracy for predicting the 5-year overall survival in patients with TETs as compared to the Masaoka clinical staging. (Time-dependent ROC analysis AUC 1.000 vs 0.742, P = 2.7 × 10-6).
CONCLUSIONS: The methylation levels of cg05784862(KSR1), cg07154254(ELF3), cg02543462(ILRN), and cg06288355(RAG1) sites are associated with the progression of TETs and may serve as new biomarkers for predicting the overall survival in patients with TETs.

Entities:  

Keywords:  DNA methylation; Prognostic model; Thymic epithelial tumors

Mesh:

Substances:

Year:  2019        PMID: 30832724      PMCID: PMC6398263          DOI: 10.1186/s13148-019-0619-z

Source DB:  PubMed          Journal:  Clin Epigenetics        ISSN: 1868-7075            Impact factor:   6.551


Background

Thymic epithelial tumors (TETs) are rare neoplasms arising from the epithelial cells of the thymus with an incidence of 0.13 per 100,000 person/year in the USA [1]. According to the 2015 World Health Organization (WHO) classification, TETs are divided into thymomas (A, A/B, B1, B2, B3 subtypes) and thymic carcinomas (TCs) based on the tumor cell morphology, degree of atypia, and extent of the thymocyte component [2]. The diagnosis of TETs relies largely on histology supported by immunohistochemistry [3]. Most types A and AB thymomas have low malignant potential, whereas types B1, B2, and B3 thymomas are more aggressive, with B3 thymoma having the greatest tendency for intrathoracic spread. On the contrary, thymic carcinoma is a highly aggressive tumor with frequent lymphatic and hematogenous metastasis [4]. However, its prognostic significance in guiding further treatment is controversial [5]. Surgical resection is considered the potential curative treatment. However, local recurrence or distant metastasis may occur in some patients even after complete resection [6]. Masaoka staging system and WHO classification at diagnosis were reported to be the main prognostic factors for recurrence and survival [7]. Although some genetic profiles have recently been reported in TETs, little is known about their genetic variability and clinical value [8]. Therefore, it is necessary to identify novel molecular biomarkers that improve diagnosis, prognosis, and treatment planning. Epigenetic alterations such as DNA methylation, histone modification, and loss of genome imprinting play crucial roles in the formation and progression of cancer [9]. Over the past decade, many researchers have demonstrated the presence of aberrant DNA methylation in various types of tumor. As is known, aberrant DNA methylation includes global hypomethylation and regional hypermethylation of which regional hypermethylation is generally associated with gene silencing. However, few studies have investigated DNA methylation in TETs [10]. Furthermore, published data on tumor suppressor genes MGMT and RASSF1A was not closely related with clinical significance. It is therefore necessary to identify DNA methylation biomarkers that could be used for detection and prognosis in TETs. In this present study, in order to evaluate the potential of DNA methylation markers in the prognosis of TETs, we compared various methylation profiles of thymoma tissues and thymic carcinomas tissues by analyzing 485,000 CpG markers. We managed to identify a methylation marker panel, and this panel was further validated in 100 TETs tissue samples. The results suggest that DNA methylation sites may be potential biomarkers in the prognosis of TETs.

Results

Differentially methylated sites in WHO histological type C thymoma

To identify potential differentially methylated sites specific to WHO histological type C thymoma, DNA methylation profiles of 124 tumor tissues consisting of 113 type A–B3 and 11 type C were used. The clinicopathologic characteristics of these 124 cases are shown in Table 1. A total number of 12,122 CpG sites among 392,653 probes were identified as differentially methylated sites through site level analysis (Fig. 1 and Additional file 1: Table S2), of which, 5155 CpG sites were found to be hypomethylated and 6967 CpG sites hypermethylated. This corresponded to 2693 and 1734 genes respectively. Kyoto Encyclopedia of Genes and Genomes (KEGG) functional enrichment analysis was used to annotate the two set of genes (Additional file 2: Table S3). The genes associated with hypomethylated sites mainly participated in focal adhesion (hsa04510, adjusted P = 4.21 × 10−5) and apoptosis (hsa04210, adjusted P = 2.40 × 10−4). On the other hand, the genes regulated by hypermethylation sites manifested more diverse functions and were predominately involved in neuroactive ligand-receptor interactions (hsa04080, adjusted P = 4.53 × 10−24), calcium signaling pathways (hsa04020, adjusted P = 4.72 × 10−13), and cAMP signaling pathways (hsa04024, adjusted P = 8.33 × 10−8).
Table 1

Clinicopathologic characteristics of 124 cases from TCGA dataset THYM

Clinicopathologic characteristicsn (%)
Overall survivalNon-sensored114 (92.7)
Censored9 (7.3)
Recurrence-free survivalNon-sensored109 (92.4)
Censored9 (7.6)
GenderFemale60 (48.4)
Male64 (51.6)
WHO histological typesA–B3 type113 (91.1)
C type11 (8.9)
History myasthenia gravisNo87 (71.9)
Yes34 (28.1)
Masaoka stageI–IIB99 (81.1)
III–IV23 (18.9)
Tumor tissue siteThymus97 (78.2)
Anterior mediastinum27 (21.8)
History of neoadjuvant treatmentNo122 (98.4)
Yes2 (1.6)
Postoperative radiotherapy and chemotherapyNo114 (92.7)
Yes9 (7.3)
Radiation therapyNo80 (65.0)
Yes43 (35.0)
Fig. 1

A volcano plots showing significantly expressed methylation sites in 392,653 probes in HumanMethylation450K array between thymoma with WHO histological type C and type A to B3. The red dots represent significantly differential methylation probes among all 392,653 probes included into analysis according to criteria mentioned in the “Methods” section

Clinicopathologic characteristics of 124 cases from TCGA dataset THYM A volcano plots showing significantly expressed methylation sites in 392,653 probes in HumanMethylation450K array between thymoma with WHO histological type C and type A to B3. The red dots represent significantly differential methylation probes among all 392,653 probes included into analysis according to criteria mentioned in the “Methods” section

Identification of genes potentially affected by differentially methylated sites in promoter regions

Among 12,122 differentially methylated sites, 3600 CpG sites were localized within the promotor regions which corresponded to a total number of 2029 genes which could be transcriptionally regulated. In addition, 3490 genes were identified as differentially expressed genes (DEGs) between WHO histological type A–B3 and type C thymoma through linear regression embedded in limma. However, only 480 genes out of the 3490 DEGs were found to overlap with 2029 genes that had differentially methylated CpG sites in their promotor regions. Based on the criteria proposed under the “Methods” section, 134 genes and 174 genes which corresponded to 268 and 274 CpG sites in their promoter regions were considered epigenetically silenced and upregulated respectively (Additional file 3: Table S4). The heatmap constructed on beta values of these CpG sites across all 124 patients was shown in Fig. 2. This unsupervised cluster analysis revealed three distinct groups of thymoma tumors. It was noteworthy that not all WHO histological type C thymoma are clustered together, and there were some cases where type B3, type AB, and type C thymoma were categorized into one group. Furthermore, the same methylation probes provided similar results in the subset which consisted of type A to B3 thymoma tumors (Additional file 4: Figure S1). These results suggest that thymoma patients diagnosed as the same WHO histological type could exhibit heterogeneity according to methylation profiles.
Fig. 2

A heatmap showing methylation profiles of 542 significantly expressed methylation sites which localize within promotor regions in corresponding genes and could be involved in regulation of mRNA expression for genes across all 124 cases. The top is the list of patients’ identifiers provided by TCGA, and the terminal characters in each patient’s IDs indicate classification of WHO histological types

A heatmap showing methylation profiles of 542 significantly expressed methylation sites which localize within promotor regions in corresponding genes and could be involved in regulation of mRNA expression for genes across all 124 cases. The top is the list of patients’ identifiers provided by TCGA, and the terminal characters in each patient’s IDs indicate classification of WHO histological types

Candidate methylation sites for prognosis

Out of 542 methylation sites which could transcriptionally regulate expression of corresponding genes, 187 CpG sites were identified as potential DNA methylation biomarkers for overall survival in thymoma patients using univariate and multivariate Cox regression (Additional file 5: Table S5). Among these CpG sites, methylation status in four genes, KSR1, ELF3, ILRN, RAG1, had shown strong association with overall survival and corresponding mRNA expression (Additional file 6: Figure S2 and Additional file 7: Figure S3). Median beta values for the probes of cg05784862(KSR1), cg07154254(ELF3), cg02543462(ILRN), and cg06288355(RAG1) were chosen as cut-off values to categorize patients into low and high methylation subgroups. As shown in Fig. 3, patients with high methylation in the first three methylation sites exhibited excellent prognosis, whereas those with low methylation in the last methylation site were associated with significantly longer overall survival. Moreover, after adjustment for age, gender, WHO histological type, Masaoka stage, presence of myasthenia gravis, tumor site, and radiotherapy, cg07154254(ELF3) (HR = 1.091 × 10−6 95% CI 0.000–0.098, P = 0.018) and cg02543462(ILRN) (HR = 9.744 × 10−4 95% CI 0.000–0.669, P = 0.037) remained significantly associated with overall survival. Cg05784862(KSR1) (HR = 0.014 95% CI 0.000–1.297, P = 0.065) and cg06288355(RAG1) (HR = 62.037 95% CI 0.934–4122.5, P = 0.054) had borderline significance for overall survival after adjustment mainly because of extremely low number of deaths. These four methylation sites were thus selected as candidates for further validation.
Fig. 3

Kaplan-Meier curves showing methylation profile stratifies thymoma patients in whole population into survival subgroups in TCGA dataset. a–c High methylation in cg05784862(KSR1), cg07154254(ELF3), and cg02543462(ILRN) is associated with significantly longer overall survival. d Low methylation in cg06288355(RAG1) is associated with significantly longer overall survival

Kaplan-Meier curves showing methylation profile stratifies thymoma patients in whole population into survival subgroups in TCGA dataset. a–c High methylation in cg05784862(KSR1), cg07154254(ELF3), and cg02543462(ILRN) is associated with significantly longer overall survival. d Low methylation in cg06288355(RAG1) is associated with significantly longer overall survival

Validation of these four methylation sites for prognosis in our cohort

The clinicopathologic characteristics of 100 cases enrolled in validation set are shown in Table 2. Twenty-four deaths were observed during follow-up. In addition, the percentage of patients with WHO histological type C and Masaoka stage III–IV was 46% and 63% respectively, which was significantly higher than those in the TCGA dataset. The representative results of pyrosequencing for methylation status in four patients were shown in Fig. 4. In the whole population, the methylation status in these four candidate methylation sites was revealed to be consistent with the TCGA cohort. Beta values for cg05784862(KSR1), cg07154254(ELF3), and cg02543462(ILRN) were significantly lower in patients with WHO histological type C compared with those with type A–B3. Cg06288355(RAG1) exhibited opposite pattern (Fig. 5). Masaoka staging was an independent prognostic factor for overall survival, resulting from forward stepwise Cox regression using age, gender, WHO histological type, Masaoka stage, presence of myasthenia gravis, radiotherapy, and chemotherapy as candidate variables. After adjusting for Masaoka stage, methylation status in the four sites were significantly associated with overall survival (Table 3, Fig. 6), which was consistent with results from the TCGA dataset. Time-dependent ROC curve analysis revealed that risk score from combination of beta values in the four methylation sites was superior to Masaoka stage for prediction of 5-year overall survival (AUC 1.000 vs 0.742, P = 2.7 × 10− 6) (Table 4, Fig. 7).
Table 2

Clinicopathologic characteristics of 100 cases enrolled for validation

Clinicopathologic characteristics n
SurvivalSurvival76
Death24
GenderFemale43
Male57
HistologyTumor54
Carcinoma46
Myasthenia gravisNo69
Yes31
WHO histological typesA/AB/B1/B2/B354
C46
Masaoka stageI–II37
III–IV63
RadiotherapyNo66
Yes34
ChemotherapyNo69
Yes31
Beta valueWHO histological types A–B3 (median/range)WHO histological type C (median/range)
cg05784862 KSR161.5 (19–70)42 (0–49)
cg07154254 ELF330 (8–40)16 (0–20)
cg02543462 IL1RN67 (13–80)24.5 (0–40)
cg06288355 RAG140 (27–88)77.5 (60–92)
Fig. 4

Representative images of pyrosequencing for cg07154254 in ELF3 in four patients. Increased methylation shown in patients no. 141 and no. 135 and low methylation in patients no. 29 and No.33

Fig. 5

Box plots showing the distribution of beta values in cg05784862 in KSR1, cg07154254(ELF3), cg02543462(ILRN), and cg06288355(RAG1) between thymoma with WHO histological type C and type A to B3 in validation set. The boxes with shadow represent patients with WHO histological type C

Table 3

Results from Cox regression in different populations†

Univariable Cox regression in whole populationWhole population, adjusted for Masaoka stageWHO histological type C only, adjusted for age, gender, radiotherapy, and chemotherapy
HR (95% CI) P HR (95% CI) P HR (95% CI) P
cg05784862 KSR1†0.868 (0.814–0.926)< 0.0010.870 (0.814–0.931)< 0.0010.852 (0.768–0.945)0.003
cg07154254 ELF3†0.718 (0.611–0.844)< 0.0010.720 (0.611–0.848)< 0.0010.674 (0.519–0.874)0.003
cg02543462 IL1RN†0.868 (0.819–0.920)< 0.0010.870 (0.819–0.923)< 0.0010.859 (0.798–0.925)< 0.001
cg06288355 RAG1†1.202 (1.089–1.328)< 0.0011.201 (1.085–1.328)< 0.0011.201 (1.083–1.332)0.001

†All beta values in each methylation sites were entered into equation as continuous variables

Fig. 6

Kaplan-Meier curves showing methylation profile stratifies thymoma patients in whole population into survival subgroups in validation set. a–c High methylation in cg05784862(KSR1), cg07154254(ELF3), and cg02543462(ILRN) is associated with significantly longer overall survival. d Low methylation in cg06288355(RAG1) is associated with significantly longer overall survival

Table 4

Area under the curves and corresponding 95% confidential intervals predictive for 5-year overall survival†

AUC 95% CICrude P compared with Masaoka stageAdjusted P value
Masaoka stage0.742 (0.656–0.828)
cg05784862 KSR1†0.941 (0.881–1.000)7.63 × 10−61.526 × 10−5
cg07154254 ELF3†0.940 (0.880–0.999)7.50 × 10−62.249 × 10−5
cg02543462 IL1RN†0.966 (0.923–1.000)2.77 × 10−71.385 × 10−6
cg06288355 RAG1†0.933 (0.869–0.996)1.75 × 10−51.749 × 10−5
Risk score1.000 (0.998–1.000)6.75 × 10− 72.7 × 10−6

†Masaoka stage was tested as binary variable (III–IV and I–II) and methylation status and risk score continuous variables

Fig. 7

Time-dependent curves showing different capacities for predicting 5-year overall survival in validation set. Risk score is constructed from linear combination of each coefficient in univariate Cox regression for the four methylation sites in TCGA dataset and beta value in validation set as proposed in the “Methods” section

Clinicopathologic characteristics of 100 cases enrolled for validation Representative images of pyrosequencing for cg07154254 in ELF3 in four patients. Increased methylation shown in patients no. 141 and no. 135 and low methylation in patients no. 29 and No.33 Box plots showing the distribution of beta values in cg05784862 in KSR1, cg07154254(ELF3), cg02543462(ILRN), and cg06288355(RAG1) between thymoma with WHO histological type C and type A to B3 in validation set. The boxes with shadow represent patients with WHO histological type C Results from Cox regression in different populations† †All beta values in each methylation sites were entered into equation as continuous variables Kaplan-Meier curves showing methylation profile stratifies thymoma patients in whole population into survival subgroups in validation set. a–c High methylation in cg05784862(KSR1), cg07154254(ELF3), and cg02543462(ILRN) is associated with significantly longer overall survival. d Low methylation in cg06288355(RAG1) is associated with significantly longer overall survival Area under the curves and corresponding 95% confidential intervals predictive for 5-year overall survival† †Masaoka stage was tested as binary variable (III–IV and I–II) and methylation status and risk score continuous variables Time-dependent curves showing different capacities for predicting 5-year overall survival in validation set. Risk score is constructed from linear combination of each coefficient in univariate Cox regression for the four methylation sites in TCGA dataset and beta value in validation set as proposed in the “Methods” section More importantly, in the subset that comprised of only WHO histological type C, each of the individual CpG sites was an independent prognostic factor for overall survival after adjusting for age, gender, radiotherapy, and chemotherapy (Table 3). However, it was difficult to draw meaningful results from Cox regression with regards to prognosis of these four methylation sites in the subset of WHO histological type A to B3 because only six patients died.

Discussion

Thymic epithelial tumors are a group of rare thoracic cancers including thymomas and thymic carcinomas which originate from thymic epithelial cells. Due to the relatively low incidence rate, there is a lack of studies on TETs and therefore little is known about the pathogenesis of TETs [11]. The current prognosis for patients with TETs mainly depends on the WHO histologic classification and the Masaoka staging system which have certain limitations in clinical application. Thus, there is a need for new biomarkers to better improve patient prognosis [12, 13]. Several other studies had identified DNA methylation gene as prognostic biomarkers of nasopharyngeal carcinoma and acute myeloid leukemia [14, 15]. In the current field of TETs, tumor size, gene mutation, protein expression, and miRNA had been reported to affect the prognosis of patients with TETs and potentially be used as prognostic biomarkers for TETs [15, 16]. However, targeting DNA methylation sites as prognostic biomarkers for TETs have not been explored, and our current study is the first to report DNA methylation sites as biomarkers for the prognosis of patients with TETs. In addition, our study also has the highest number of patients enrolled for the prognostic study of patients with TETs, with 124 patient data from the TCGA data portal and100 patient data from our own study. In this study, potential prognostic biomarkers of TETs were found in sites of DNA methylation in tumor tissues of patients with TETs. Four DNA methylation sites (cg05784862, cg07154254, cg02543462, and cg06288355) that predict the overall survival in patients with TETs were selected, and a prognostic model was constructed which has a higher accuracy compared to the commonly used Masaoka staging (AUC 1.000 vs 0.742, P = 2.7 × 10−6). This prognostic model could serve as a new method in the clinical field for the prediction of overall survival in patients with TETs. Besides predicting the overall survival in patients with TETs, the DNA methylation sites were also found to differentiate type C TETs from other types of TETs. Currently, diagnosis of TETs requires pathological study of biopsies [17] and WHO classification by cell morphology which the precision of these diagnosis methods needs to be improved. The four DNA methylation sites (cg05784862, cg07154254, cg02543462, and cg06288355) were selected by comparing WHO’s type A–B3 and type C DNA methylation levels; therefore, they can be used to differentially diagnose for type C TETs. It is important to know that the treatment of type C TETs is very different from type A–B3 [18]; therefore, it is necessary to improve the precision for treatment in different types of TETs. KSR1 gene is an oncogene regulated by the cg05784862 site. In this study, patients with poor prognosis showed lower levels of cg05784862 site methylation in the tissue, resulting in higher expression of KSR1 gene. A recent study by Steven PD et al. [19] reported that suppression of KSR1 gene expression could prolong survival in patients with colorectal cancer, and McCall JL et al. [20] reported that the KSR1 gene affects Myc protein expression which affects the prognosis of patients with colon cancer. In addition, Neilsen BL [21] reported that KSR1 could be a therapeutic target for Ras-dependent cancers. With all the studies done on KSR1, this is the first time KSR1 has been reported to affect the prognosis of patients with TETs. ELF3, a transcription factor, is regulated by the cg07154254 site [22]. Our results showed that patients with poor prognosis had lower levels of cg07154254 site methylation in the tissue, which results in higher expression of ELF3 gene. ELF3 gene plays a major role in tumor proliferation, invasion, and metastasis. A recent study had reported that activation of ELF3 gene expression by CircHIPK3 promotes proliferation and invasion in nasopharyngeal carcinoma which affect the prognosis of patients [23]. Similarly, Zhao W et al. reported that ELF3 regulates cell metastasis and invasion in non-small cell lung cancer via the PI3K/Akt pathway, thereby affecting prognosis of patients [24]. In view of the studies mentioned above, this is the first time ELF3 has been reported to affect the prognosis of patients with TETs, which suggest differences in the pathogenesis of TETs compared to other tumors. ILRN gene is mainly involved in immune response and is regulated by the cg02543462 site [25]. ILRN involvement in immune response and TETs association with autoimmune disorders [26] may suggest a possible mechanism on how ILRN could affect the prognosis of patients with TETs. The RAG1 is mainly involved in immune regulation and is regulated by the cg06288355 site [27]. Results from this study showed that both ILRN and RAG1 genes affected the prognosis of patients with TETs, possibly through the immune regulation of TETs. KSR1, ELF3, ILRN, and RAG1 regulated by the DNA methylation sites cg05784862, cg07154254, cg02543462, and cg06288355, respectively, are known to be involved in tumorigenesis, progression, and death, which is one of the reasons for using them as prognostic biomarkers in patients with TETs. Currently, Masaoka staging is commonly used for the prognosis of TETs, however, the accuracy of Masaoka staging needs to be improved. Therefore, it is important to search for new biomarkers in order to understand the prognosis of patients with TETs. Previous study by Wei J et al. had collected and analyzed the miRNA expression of patients with TETs from the TCGA data portal. The results showed that seven miRNAs were associated with the overall survival in patients with TETs and could be used as a differential diagnosis for type C TETs [28]. Likewise, the differential DNA methylation sites found in this study were able to determine the prognosis of patients with TETs and has higher accuracy than the commonly used Masaoka staging system. In addition, these sites can also differentially diagnose type C TETs. The above results were further validated by sequencing our own specimen. Santoni G et al. reported that high CTLA-4 protein expression correlates with poor prognosis of patients with TETs, suggesting that CTLA-4 can be used as a prognostic factor in TETs [29]. However, the drawback is that there is no comparison with the current Masaoka staging system. Literature had also reported that overexpression of focal adhesion kinase (FAK) protein in TETs was associated with poor prognosis and may serve as an independent prognostic biomarker for TETs [30]. Xu RH et al. reported that DNA methylation sites can be used for tumor diagnosis and prognosis, which further support our current findings [31]. At present, tumor research has entered the era of omics; it is difficult for a single gene or protein to determine the prognosis of tumor patients. Therefore, several DNA methylation sites that affect the prognosis of patients with TETs were proposed, forming a prognostic model which further enhance the accuracy in determining the prognosis and is also more accurate than the current Masaoka staging system. Although a prognostic model for TETs was successfully constructed using DNA methylation sites, the number of cases can still be improved, and more experiments are needed to further validate these results. Currently, the mechanism of KSR1, ELF3, ILRN, and RAG1 in regulating the prognosis of patients with TETs is not fully understood. Therefore, more experiments are needed to understand these mechanisms.

Conclusions

Taken together, ELF3, KSR1, ILRN, and RAG1 methylation sites can be used to determine the prognosis of patients with TETs and can also differentially diagnose subtypes of TETs. This combination of methylation sites can guide clinical diagnosis in differentiating thymoma and thymic carcinoma and better determine the prognosis of patients.

Methods

TCGA thymoma datasets

The preprocessed methylation dataset of thymoma TCGA.THYM.sampleMap/Human Methylation450(v.2017-09-08) was downloaded from UCSC (https://xenabrowser.net/ datapages/). This dataset contains DNA methylation profiles of 124 tumor tissues and 2 matched adjacent normal tissues from 124 cases of thymoma as discovery set. DNA methylation profile was measured experimentally using the Illumina Infinium HumanMethylation450 platform. Microarray probes are mapped onto the human genome coordinates using xena probeMap derived from GEO GPL13534 record (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL13534). M values obtained from logarithm transformation from beta values were used for statistical analyses and beta-values were used for heatmap visualizations and clustering [32]. A raw counts matrix of gene-level RSEM values from 122 cases of thymoma was obtained from http://gdac.broadinstitute.org/runs/stddata__2015_11_01/data/THYM/20151101/. This dataset was used to identify differentially expressed genes between groups which consisted of thymoma type A to B3 as well as type C thymic carcinoma. Clinical information of this TCGA cohort such as WHO histological type and Masaoka stage was also obtained from UCSC (TCGA.THYM.sampleMap/THYM_clinicalMatrix, v.2016-04-27).

Differential methylation analysis, differential expression analysis, and identification of epigenetically regulated genes in discovery set

DNA methylation probes with at least one “NA” across all samples were removed, and 392,653 probes were left for further analysis. To explore the thymic carcinoma-specific DNA hypermethylation or hypomethylation events, two sample t test was used to perform site-level testing between thymoma type A to B3 (n = 113) and type C thymic carcinoma (n = 11) using transformed M values and Bonferroni procedure was used to adjust crude P values for multiple comparisons. All probes with adjusted P < 0.05 and estimated mean difference of methylation between thymoma type A to B3 and type C thymic carcinoma of at least 50% were considered as candidate methylation sites for further identification of epigenetically regulated genes. Probes that were located within a promotor region were identified using GEO GPL13534 annotation file. Differential expression analysis between thymoma type A to B3 and type C thymic carcinoma was carried out with R/Bioconductor limma package after filtering genes with very low counts and the voom transformation. FDR < 0.05 and |log2Ratio| ≥ 1 was set as the threshold for significantly differential expression. DNA methylation and gene expression data were merged by gene symbol. Genes that met the following criteria were considered epigenetically silenced or upregulated respectively: △β > 50% and log2Ratio < − 1 or △β < − 50% and log2Ratio > 1.

Unsupervised clustering analysis

A total of 694 methylation sites within the promotor region that could potentially regulate gene expression were identified through the procedure described above. The beta values of thymic carcinoma-specific DNA methylation sites were used to perform unsupervised hierarchical clustering on all 124 cases of thymoma using the Cluster 3.0 program in the Biopython package [33] and visualized using JAVA TreeView program [34]. All pairwise distance between the patients was measured with Euclidean distance and pairwise maximum linkage clustering was used to define the distance between clusters.

Survival analysis for identification of candidate methylation sites responsible for overall survival prognosis

Survival analysis was carried out using Cox proportional hazards model as implemented in R survival package against overall survival data in discovery set. Univariate and multiple variable Cox regression were separately used to evaluate the prognosis value of 694 individual probes identified as epigenetically regulated sites. Beta values were represented by continuous variables and Masaoka stage, whereas WHO histological type was represented by binary variables. During adjustment for two important clinical prognostic factors in multiple variables Cox regression, Masaoka stages I and II were categorized into one state and Masaoka stages III and IV into another state. WHO histological type A to B3 was grouped into one state and type C into another state. The likelihood ratio test was used to determine if beta value from one probe entered into the regression models that contains Masaoka stage and WHO histological type as covariates is significant. Only probes with crude P value <  0.05 in both univariate and multiple variable Cox regression were considered statistically significant and were identified as candidate probes for further validation for overall survival prognosis.

Patients for validation

To verify the association of the methylation status of these four candidate genes with overall survival, a total of 100 patients with histologically confirmed thymoma or thymic carcinoma who were admitted into the Department of Thoracic Surgery at Daping hospital of the Third Military Medical University, China between October 2007 and October 2017 were enrolled in the study. Patients with concomitant malignant neoplasms were excluded. Tumors were reviewed and reclassified according to the 2015 WHO criteria. Tumor staging was performed according to the revised Masaoka system. This was used as the validation set. This study was approved by the Medical Ethics Committee at our hospital. Written informed consent was obtained from all patients prior to their enrolment. Patient information including age, sex, histology, presence of myasthenia gravis, Masaoka stage, WHO histological type and follow-up information including the follow-up period, time of the last follow-up, and overall survival of patients were obtained from clinical records and questionnaires.

Pyrosequencing

Resected specimens were obtained via complete tumor resection, fixed with 10% formalin, embedded in paraffin, and divided into 10 μm sections. Genomic DNA was extracted from 10 sections using the QIAamp DNA FFPE Tissue kit (Qiagen, Hilden, Germany). The concentration and purity of these DNA samples were determined with a spectrophotometer (NanoDrop2000, Thermo Scientific, Waltham, MA, USA). Bisulfite conversion of total 500 ng purified DNA in each sample was performed with EZ DNA Methylation-GoldTM Kit according to manufacturer’s instructions (Cat. No. D5006, Zymo Research Corporation, Orange, CA, USA). The bisulfite conversed DNA was amplified with TaKaRa EpiTaqTM HS (Cat. No. R110A, Takara Biomedical Technology (Beijing) Co., Ltd. Beijing, China) with reaction setup: 10 ng bisulfite-treated DNA, 0.4 μM forward and reverse primers, 2.5 μL 10 × EpiTap PCR Buffer, 2.5 mM MgCl2, dNTP Mixture (0.264 mM each), EpiTap HS(0.025 U/μL) in total 25 μL each reaction and with following thermal cycle condition: denaturation at 98 °C for 10 s, annealing at 55 °C for 30 s, extension at 72 °C for 30 s executed for 35 cycles followed by extension at 72 °C for 1 min and hold at 4 °C. The amplicons were then subjected to pyrosequencing with PyroMark Q96 (Qiagen, Hilden, Germany). All primers used are presented in Additional file 8: Table S1.

Statistical analysis

All beta values and mRNA expression levels were represented with median value, range, and visualized with a box plot. The difference in four methylation sites between thymoma type A to B3 and type C thymic carcinoma and the difference in mRNA expression levels between low and high methylation subgroups was evaluated by Kruskal-Wallis test. Kaplan-Meier method and the log-rank test were used to compare the overall survival between low and high methylation subgroups. A weighted model was constructed for prognostic model [35]. The total sum of products with coefficient of four candidate methylation sites in univariate Cox regression and corresponding beta values were calculated as risk scores for each patient in the validation set. The predictive efficiency of risk score, Masaoka stage, and methylation in cg05784862(KSR1) for 5-year overall survival in the validation set was determined with time-dependent ROC curve analysis using function “timeROC.” Comparison between two time-dependent AUCs was performed with function “compare” embedded in R language package “timeROC” (version 0.3 published in 2015-03-25) [36]. All other statistical analyses were performed using SPSS 17.0 (IBM SPSS, Chicago, IL, USA). All tests were bilateral, and P < 0.05 was considered statistically significant. Table S2. List of significantly expressed methylation probes. In this table, the “mean_AB” and “mean_C” stands for the average values of beta values in cases with WHO histological type A to B3 and WHO histological type C, respectively. “FC” for fold changes which is calculated as (mean_C- mean_AB)/mean_AB. The “p” for crude probabilities resulting from t test for each probe. (XLSX 1490 kb) Table S3. Results from KEGG enrichment analysis for genes corresponding to the significantly expressed methylation probes. (XLSX 17 kb) Table S4. The list of genes transcriptionally affected by methylation of significantly expressed methylation probes within promotor regions. In this table, meaning of the “mean_AB”, “mean_C”, “FC” and “p” is the same asdescribed in “Additional file 1: Table S2”. (XLSX 119 kb) Figure S1. A heatmap showing methylation profiles of 542 significantly expressed methylation sites which localize within promotor regions in corresponding genes and could be involved in regulation of mRNA expression for genes across patients with WHO histological type A to B3. (PNG 93 kb) Table S5. Summary of results from univariate and multivariable Cox regression for overall survival of probes which potentially affect transcription mRNA. The “Pcg” in this table respectively stands for probabilities for methylation of each probe adjusted for WHO histological type, Masaoka stage based on likelihood ratio test. The “Pt” for probabilities of significance of main effect model including WHO histological type, Masaoka stage and methylation of each probe. The “coeff” for coefficients of each probe in multivariable Cox regression. (XLSX 63 kb) Figures S2. Box plots showing the distribution of beta values in cg05784862(KSR1), cg07154254(ELF3), cg02543462(ILRN) and cg06288355(RAG1) between thymoma patients with WHO histological type C and type A to B3 in TCGA dataset. (TIFF 403 kb) Figures S3. Scatter plots showing relationship between mRNA expression levels and methylation in four candidate methylation sites cg05784862(KSR1), cg07154254(ELF3), cg02543462(ILRN) and cg06288355(RAG1). The red dots in each square indicate cases with WHO histological type C. (TIFF 734 kb) Table S1. List of primers used for pyrosequencing and quantitative RT-PCR. (XLSX 10 kb)
  36 in total

1.  Java Treeview--extensible visualization of microarray data.

Authors:  Alok J Saldanha
Journal:  Bioinformatics       Date:  2004-06-04       Impact factor: 6.937

2.  Open source clustering software.

Authors:  M J L de Hoon; S Imoto; J Nolan; S Miyano
Journal:  Bioinformatics       Date:  2004-02-10       Impact factor: 6.937

Review 3.  KSR as a therapeutic target for Ras-dependent cancers.

Authors:  Beth K Neilsen; Danielle E Frodyma; Robert E Lewis; Kurt W Fisher
Journal:  Expert Opin Ther Targets       Date:  2017-04-07       Impact factor: 6.902

4.  Comparison of Beta-value and M-value methods for quantifying methylation levels by microarray analysis.

Authors:  Pan Du; Xiao Zhang; Chiang-Ching Huang; Nadereh Jafari; Warren A Kibbe; Lifang Hou; Simon M Lin
Journal:  BMC Bioinformatics       Date:  2010-11-30       Impact factor: 3.169

5.  WHO histologic classification is a prognostic indicator in thymoma.

Authors:  Kazuya Kondo; Kiyoshi Yoshizawa; Masaru Tsuyuguchi; Suguru Kimura; Masayuki Sumitomo; Junji Morita; Takanori Miyoshi; Shoji Sakiyama; Kiyoshi Mukai; Yasumasa Monden
Journal:  Ann Thorac Surg       Date:  2004-04       Impact factor: 4.330

6.  Prognostic Model for Stratification of Radioresistant Nasopharynx Carcinoma to Curative Salvage Radiotherapy.

Authors:  You Quan Li; Yun Ming Tian; Sze Huey Tan; Ming Zhu Liu; Grace Kusumawidjaja; Enya H W Ong; Chong Zhao; Terence W K Tan; Kam Weng Fong; Kiattisa Sommat; Yoke Lim Soong; Joseph T S Wee; Fei Han; Melvin L K Chua
Journal:  J Clin Oncol       Date:  2018-02-07       Impact factor: 44.544

7.  Long-Term Health Outcome and Quality of Life Post-HSCT for IL7Rα-, Artemis-, RAG1- and RAG2-Deficient Severe Combined Immunodeficiency: a Single Center Report.

Authors:  Intan Juliana Abd Hamid; Mary A Slatter; Fiona McKendrick; Mark S Pearce; Andrew R Gennery
Journal:  J Clin Immunol       Date:  2018-08-13       Impact factor: 8.317

8.  Correlation between the Expression of PD-L1 and Clinicopathological Features in Patients with Thymic Epithelial Tumors.

Authors:  Yanmei Chen; Yuping Zhang; Xiaoling Chai; Jianfang Gao; Guorong Chen; Weifen Zhang; Yunxiang Zhang
Journal:  Biomed Res Int       Date:  2018-04-23       Impact factor: 3.411

9.  Alu hypomethylation and MGMT hypermethylation in serum as biomarkers of glioma.

Authors:  Mingjie Gong; Wei Shi; Jing Qi; Guoping Shao; Zhenghua Shi; Junxiang Wang; Jian Chen; Rongtao Chu
Journal:  Oncotarget       Date:  2017-08-07

Review 10.  ELF3, ELF5, EHF and SPDEF Transcription Factors in Tissue Homeostasis and Cancer.

Authors:  Ian Y Luk; Camilla M Reehorst; John M Mariadason
Journal:  Molecules       Date:  2018-08-30       Impact factor: 4.411

View more
  7 in total

1.  Identification of the gene expression changes and gene regulatory aspects in ELF3 mutant bladder cancer.

Authors:  Perihan Yagmur Guneri-Sozeri; Serap Erkek-Ozhan
Journal:  Mol Biol Rep       Date:  2022-02-24       Impact factor: 2.316

2.  Promotor methylation status of MAPK4 is a novel epigenetic biomarker for prognosis of recurrence in patients with thymic epithelial tumors.

Authors:  Wei Guan; Songlin Li; Zhimin Zhang; He Xiao; Juan He; Jian Li; Xuan He; Jia Luo; Yun Liu; Lin Lei; Jungang Ma; Lizhao Chen; Chuan Chen
Journal:  Thorac Cancer       Date:  2022-09-08       Impact factor: 3.223

3.  Identification of Molecular Characteristics and New Prognostic Targets for Thymoma by Multiomics Analysis.

Authors:  Dazhong Liu; Pengfei Zhang; Jiaying Zhao; Lei Yang; Wei Wang
Journal:  Biomed Res Int       Date:  2021-05-19       Impact factor: 3.411

4.  Deciphering the biology of thymic epithelial tumors.

Authors:  Arun Rajan; Chen Zhao
Journal:  Mediastinum       Date:  2019-09-17

5.  Discovery and validation of the prognostic value of the lncRNAs encoding snoRNAs in patients with clear cell renal cell carcinoma.

Authors:  Wuping Yang; Kenan Zhang; Lei Li; Kaifang Ma; Baoan Hong; Yanqing Gong; Kan Gong
Journal:  Aging (Albany NY)       Date:  2020-03-03       Impact factor: 5.682

6.  Integrated analysis of DNA methylation and mRNA expression profiles to identify key genes involved in the regrowth of clinically non-functioning pituitary adenoma.

Authors:  Sen Cheng; Chuzhong Li; Weiyan Xie; Yazhou Miao; Jing Guo; Jichao Wang; Yazhuo Zhang
Journal:  Aging (Albany NY)       Date:  2020-02-03       Impact factor: 5.682

Review 7.  Thymic Epithelial Neoplasms: Focusing on the Epigenetic Alterations.

Authors:  Iason Psilopatis; Alexandros Pergaris; Kleio Vrettou; Stamatios Theocharis; Constantinos Troungos
Journal:  Int J Mol Sci       Date:  2022-04-06       Impact factor: 5.923

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.