Literature DB >> 28731140

Integrated analysis of genome‑wide gene expression and DNA methylation microarray of diffuse large B‑cell lymphoma with TET mutations.

Pengfei Liu1, Wenhua Jiang2, Jinkun Zhao3, Huilai Zhang1.   

Abstract

Diffuse large B-cell lymphoma (DLBCL), the most frequently occurring type of lymphoid malignancy, has been demonstrated to be associated with mutations of Ten‑Eleven Translocation (TET). In order to explore the association between DLBCL and TET mutations, the present study analyzed the gene expression and methylation profiles in human DLBCL biopsy tissues with wildtype and mutated TET2. The microarray dataset GSE37365, containing two subseries: the genome‑wide gene expression dataset GSE37362 and the DNA methylation microarray dataset GSE37363, was downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) were identified using the limma package of R. Furthermore, differentially methylated sites and differentially methylated regions were identified. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were performed via GO stats and GSEABase packages respectively. Finally, the Pathview package was used to construct the network of enriched pathways. A total of 198 DEGs (106 up‑ and 92 downregulated) were obtained. A total of 602 shared differentially methylated genes (DMGs) were identified according to differentially methylated levels. A total of 12 overlapping genes were identified in DEGs and DMGs. It was observed that 9 of the 12 overlapped genes were downregulated and hypermethylated, with 24 GO terms and one KEGG pathway significantly enriched. The results of the present study demonstrated that the genes cryptochrome circadian clock 1, zinc finger protein (ZNF) interacting with K protein 1, ZNF134, ZNF256 and ZNF615, which were hypermethylated and downregulated in DLBCL patients with TET2 mutations, were the key genes in the association between DLBCL and TET mutations. These genes may act as potential biomarkers for the diagnosis of DLBCL in the future.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28731140      PMCID: PMC5646955          DOI: 10.3892/mmr.2017.7058

Source DB:  PubMed          Journal:  Mol Med Rep        ISSN: 1791-2997            Impact factor:   2.952


Introduction

Diffuse large B-cell lymphoma (DLBCL) is the most frequently occurring type of lymphoid malignancy in adults worldwide. It arises from normal antigen-exposed B cells which have migrated to germinal centers in secondary lymphoid organs (1). DLBCL is a clinically heterogeneous disease and ~40% of patients may be cured by anthracycline-based chemotherapy (2). Therefore, the early diagnosis and treatment of DLBCL is critical. A previous study indicated that specific genetic and epigenetic cancer characteristics were detectable in the plasma, serum and tissue of patients, and these were able to be used for the early detection and diagnosis of cancer (3). Ten-Eleven Translocation (TET) hydroxylases are able to convert 5-methylcytosine to 5-hydroxymethylcytosine and induce DNA demethylation (4). The use of a mutation screening assay and the genome-wide profiling has verified that TET2 mutations may result in the aberrant methylation of various genes (5). Microarray technology has contributed to the development of life sciences, making it possible to obtain large quantities of genes, microRNAs, or DNA methylated sites for further analysis of cancer at the molecular level (6,7). In addition, combining microarray data and statistical methods may be used to detect influential genes in cancers (8). Although TET mutations were considered to be associated with DLBCL, the mechanism of DNA methylation in DLBCL remains to be elucidated. Novel diagnostic targets and treatment strategies for this disease are urgently required and of primary concern. The present study compared the data from the DNA methylation and gene expression profiles in TET2 mutated samples, with TET2 wild-type samples, and differentially expressed genes (DEGs) and differentially methylated regions (DMRs) were identified. Following this, functional and pathway analyses were conducted and various key genes and pathways associated with DLBCL and TET2 mutations were identified.

Materials and methods

Microarray data

The gene expression profile of GSE37365 was downloaded from the Gene Expression Omnibus (GEO) database (www.ncbi.nlm.nih.gov/geo/) and deposited by Asmar et al (5). It contained two subseries: the genome-wide gene expression dataset GSE37362 and the DNA methylation microarray dataset GSE37363. In GSE37362, there were 12 human DLBCL biopsy tissues with TET2 mutations and 19 TET2 wild-type DLBCL biopsy tissues, and the dataset was performed on the Illumina HumanMethylation450 BeadChip (GPL13534) platform (Illumina, San Diego, CA, USA). The gene expression dataset (GSE37363) contained 4 human DLBCL biopsy tissues with TET2 mutations and 5 TET2 wild-type DLBCL biopsy tissues and was detected with Affymetrix Human Gene 1.0 ST Array (GPL6244) platform (Affymetrix, Inc., Santa Clara, CA, USA).

Data preprocessing

The original CEL gene microarray data were imported into R and the Affy package (http://bioconductor.org/packages/release/bioc/html/affy.html) was used for the background correction and robust multi array normalization (9). The probe-level symbols were converted into gene-level symbols. If multiple probes corresponded to one gene symbol, the average expression value of all probes was defined as the gene expression value. For the DNA methylation dataset, the IMA package (https://www.rforge.net/IMA) of R was used to normalize the methylation level in each methylation site. The methylation sites located in the X and Y chromosomes or SNPs were removed. P-value <0.05 was set as the cut-off criterion for the DNA-methylation matrix.

Differential expression analysis

DEGs were identified by the limma package (http://bioconductor.org/packages/release/bioc/html/limma.html) (10) of R in TET2 mutated DLBCL samples compared with wild-type samples, according to the thresholds of Benjamini-Hochberg (BH) adjusted P-value <0.05 and |log2 (fold change)| >1. A total of 3 differentially methylated levels were identified. The first one was obtained by the P-value of a Student's t-test (P<0.05) in R (www.r-project.org) and the β value of the DNA methylation site in TET2 mutated samples compared with wild-type samples. The second one was determined by the gene regions, including the first exon area (EXON1), 1–200 bp upstream of the transcription start site (TSS200), 1–1500 bp upstream of the transcription start site (TSS1500), 3′-untranslated region (UTR) and 5′UTR. The third one was calculated by the locational data involving the CpG island, 1–200 bp upstream of the CpG island (N-CPGshore), 200–400 bp upstream of the CpG island (N-CPGshelf), 1–200 bp downstream of the CpG island (S-CPGshore) and 200–400 bp downstream of the CpG island (S-CPGshelf). Finally, with |∆β| >0.1 and P-value <0.05, the differentially methylated sites (DMSs) and DMRs were selected out in TET2 mutated samples and compared with wild-type samples. The DMSs and DMRs were transformed to target genes termed differentially methylated genes (DMGs) using the annotation information, and the overlapping genes between the identified DMGs and DEGs were obtained.

Functional enrichment analysis

The biological process (BP) of Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways for the aforementioned screened genes were identified using GOstats (http://www.bioconductor.org/packages/release/bioc/html/GOstats.html) (11) and GSEABase packages (http://www.bioconductor.org/packages/release/bioc/html/GSEABase.html) (12), respectively. The adjusted P-value <0.01 was selected as the cut-off criterion. The network of enriched pathways was constructed using the Pathview package (http://www.bioconductor.org/packages/release/bioc/html/pathview.html) (13).

Results

Screening of DEGs and DMGs

The raw data were normalized in all samples and are presented in Fig. 1. A total of 198 DEGs (106 up- and 92 downregulated) were identified in TET2 mutated DLBCL samples compared with wild-type samples and the top 30 DEGs are listed in Table I. A total of 10493 DMSs, distributed over 3768 genes with |∆β| >0.1 and P-value <0.05 (group 1) were screened. A total of 1411 non-redundant genes (group 2) were selected from the DMRs (Fig. 2). As presented, the majority of DMRs were located in the gene body accounting for 26.5%, followed by TSS1500 with 20.2%. A further 1670 non-redundant genes (group 3) were identified by their location, of which 51.9% genes were located in CpG islands, 15.5% in N-shore, 12.1% in S-shore, 1.63% in N-shelf and 1.49% in S-shelf (Fig. 3). The results revealed that 602 DMGs were shared among the three groups. In addition, 12 overlapping genes, cryptochrome circadian clock (CRY)1, cytochrome B5 reductase (CYB5R) 2, doublecortin like kinase (DCLK) 2, fibronectin 1, glutathione peroxidase 3, short chain dehydrogenase/reductase family 42E, member 1 (SDR42E1), secreted frizzled related protein 2, spi-B transcription factor (SPIB), zinc finger protein (ZNF) interacting with K protein (ZIK) 1, ZNF134, ZNF256 and ZNF615 were identified in the 198 DEGs and 602 DMGs.
Figure 1.

Boxplots for microarrays (A) prior to and (B) following application of normalization. Red boxes represent TET mutated diffuse large B-cell lymphoma samples. Blue boxes represent TET2 wild-type samples. TET, Ten-Eleven Translocation hydroxylases.

Table I.

The top 30 DEGs in TET2 mutated DLBCL samples compared with wild-type samples.

GeneLogFCP-value
ZIK1−1.002045.43×10−5
SLC9A7−2.124358.30×10−5
FAM129C−1.782540.000152
VCAN1.8884210.000272
ZNF256−2.008470.00029
ZNF542P−1.489070.000365
RNU2-6P−2.035560.00037
KIAA0226L−3.088680.000435
VDR1.2066150.000439
IFT57−1.363410.00056
APOBEC3B2.4770520.000594
ZNF667-AS1−2.007690.000808
SERPINB91.4055570.001202
SDR42E1−1.284850.001228
BCL2−1.238090.001266
KCNN31.9154990.001271
ZNF350−1.706970.001372
PRRG41.0800110.001556
SLC7A112.2764440.001604
CEP68−1.007050.001643
SPP13.1866190.001661
ZNF382−1.468860.00197
EMP11.0988110.001994
PLA2G2A1.1480080.002013
GPR183−2.035110.002126
PDGFRA2.2473030.002385
CHPF1.0781770.002536
CRY1−1.080590.00269
LZTFL1−1.777820.002797
AKT3−1.138530.002883

DEGs, differentially expressed genes; TET, Ten-Eleven Translocation; DLBCL, diffuse large B-cell lymphoma; LogFC, log (fold change).

Figure 2.

The distributions of differentially methylated genes based on the genetic regions. UTR, untranslated region; TSS, transcription start site.

Figure 3.

The distributions of differentially methylated genes based on the locations of CpG islands.

Enriched GO terms and KEGG pathways of key target genes

It was demonstrated that 9 of the 12 overlapped genes were downregulated and hypermethylated, including CRY1, CYB5R2, DCLK2, SDR42E1, SPIB, ZIK1, ZNF134, ZNF256 and ZNF615. The heatmap of these 9 genes is presented in Fig. 4. Furthermore, using the cut-off criterion of P-value <0.01, 24 BPs of GO terms were obtained, including organic cyclic compound biosynthetic process (P=0.000131518), blue light signaling pathway (P=0.001212284) and cellular response to blue light (P=0.001212284; Table II). A significant KEGG pathway associated with circadian rhythm was obtained (Fig. 5).
Figure 4.

Hierarchical clustering heatmap of CRY1, CYB5R2, DCLK2, SDR42E1, SPIB, ZIK1, ZNF134, ZNF256 and ZNF615. Each column corresponds to a single microarray and each row represents expression profile of a single gene. Red and blue represent the low and high values of the gene expression, respectively. Case-1, 2, 3, 4 represent human DLBCL biopsy tissue samples with TET mutations and control-1, 2, 3, 4, 5, represent TET2 wild-type DLBCL biopsy tissue samples. DLBCL, diffuse large B-cell lymphoma; TET, Ten-Eleven Translocation hydroxylases; CRY1 cryptochrome circadian clock 1; CYB5R2, cytochrome B5 reductase 2; DCLK2, doublecortin like kinase; SDR42E1, short chain dehydrogenase/reductase family 42E, member 1; SPIB, spi-B transcription factor; ZNF, zinc finger protein; ZIK1, ZNF interacting with K protein.

Table II.

Significant 24 GO terms for screened genes.

GO termDescriptionP-valueGenes
GO:1901362Organic cyclic compound biosynthetic process0.000131518CRY1, SPIB, ZNF134, ZNF256, CYB5R2, SDR42E1, ZIK1, ZNF615
GO:0009785Blue light signaling pathway0.001212284CRY1
GO:0071483Cellular response to blue light0.001212284CRY1
GO:1901576Organic substance biosynthetic process0.001486849CRY1, SPIB, ZNF134, ZNF256, CYB5R2, SDR42E1, ZIK1, ZNF615
GO:0009058Biosynthetic process0.00165506CRY1, SPIB, ZNF134, ZNF256, CYB5R2, SDR42E1, ZIK1, ZNF615
GO:0009637Response to blue light0.001817937CRY1
GO:1901360Organic cyclic compound metabolic process0.00276577CRY1, SPIB, ZNF134, ZNF256, CYB5R2, SDR42E1, ZIK1, ZNF615
GO:0006355Regulation of transcription, DNA-dependent0.002884379CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615
GO:2001141Regulation of RNA biosynthetic process0.002991969CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615
GO:0006694Steroid biosynthetic process0.003173823CYB5R2, SDR42E1
GO:0051252Regulation of RNA metabolic process0.003375671CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615
GO:2000112Regulation of cellular macromolecule biosynthetic process0.004585358CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615
GO:0006351Transcription, DNA-dependent0.004971651CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615
GO:0010556Regulation of macromolecule biosynthetic process0.005245834CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615
GO:0032774RNA biosynthetic process0.005738022CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615
GO:0031326Rregulation of cellular biosynthetic process0.006573337CRY1, SPIB, ZNF134, ZNF256, ZIK1,
GO:0009889Regulation of biosynthetic process0.006882303CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615
GO:0010468Regulation of gene expression0.006955993CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615
GO:0019219Regulation of nucleobase-containing compound metabolic process0.007355736CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615
GO:0018298Protein-chromophore linkage0.007856527CRY1
GO:0051171Regulation of nitrogen compound metabolic process0.008353762CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615
GO:0034654Nucleobase-containing compound biosynthetic process0.008928906CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615
GO:0018130Heterocycle biosynthetic process0.009820777CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615
GO:0019438Aromatic compound biosynthetic process0.009834588CRY1, SPIB, ZNF134, ZNF256, ZIK1, ZNF615

GO, Gene Ontology.

Figure 5.

Pathway of circadian rhythm. The red gene is the significant gene involved in the pathway. CRY1 cryptochrome circadian clock 1; KEGG, kyoto encyclopedia of genes and genomes; CREB1, CAMP responsive element binding protein 1; MAPK, mitogen activated protein kinase; Fbx 13, F-box 13; Ror, RAR related orphan receptor; PER, period circadian clock; Cklε, casein kinase 1 ε; Cklδ, casein kinase 1δ; SCF, Skp1-Cul1-F-box protein; β-TrCP, β-transducin repeat containing E3 ubiquitin protein ligase.

Discussion

TET2, one of the three proteins of the TET family, catalyzes the conversion of 5-methyl-cytosine to 5-hydroxymethyl-cytosine and promotes DNA demethylation (14). The present study, by using the data reported by Asmar et al (5), analyzed the DNA methylation and gene expression profiles in TET2 mutated samples compared with wild-type samples. DNA methylation is one of the most common epigenetic modifications which may regulate gene expression in mammalian cells (15). Abnormal DNA methylation may lead to tumor formation and development (16,17). DLBCL represents the most frequently occurring B-cell malignancy with a high mortality rate, with a rapid progression and fatal outcome (18). It has previously been demonstrated that abnormal DNA methylation is associated with the pathogenesis and development of DLBCL. Shaknovich et al (19) reported that methylation of Src homology region 2 domain-containing phosphatase (SHP) 1 is frequently detected in plasma and tissues of DLBCL patients and therefore methylation of SHP1 may serve as a candidate biomarker for DLBCL. Additionally, TET mutations are typically observed in early hematopoietic progenitors (20) and it was hypothesized that they resulted in gene aberrant methylation in hematopoietic development (5). In the present study, 198 DEGs and 602 DMGs were identified and 9 genes CRY1, CYB5R2, DCLK2, SDR42E1, SPIB, ZIK1, ZNF134, ZNF256 and ZNF615 were revealed to be downregulated and hypermethylated in the TET2 mutated DLBCL samples compared with wild-type samples. The results demonstrated that CRY1, a gene associated with the circadian system (21), was significantly enriched in the circadian rhythm pathway. CRY1 and CRY2, interact with the period circadian clock (PER) proteins, and have a role in the negative limbs of the clock feedback loop (22). Circadian rhythms control a variety of metabolic and physiological functions, and abnormal circadian rhythms may be directly associated with the development of cancer (23). A previous study demonstrated that the downregulated clock gene PER2 in DLBCL patients was targeted by the transcription factor (CCAAT/enhancer-binding protein α and PER2 may act as a tumor suppressor gene (24). In addition, the epigenetic inactivation of aryl hydrocarbon receptor nuclear translocator-like protein (BMAL) 1, another circadian clock gene, has been revealed to contribute to the development of hematologic malignancies (25). Furthermore, CRY1 may modulate the ATR-mediated DNA damage checkpoint response by interacting with the timeless gene (26). The role of CRY1 as a key gene in the association between DLBCL and TET mutations has previously been investigated. Furthermore, it was noted that ~50% of the aforementioned 9 genes encoded zinc finger proteins, including ZIK1, ZNF134, ZNF256 and ZNF615. Consistent with Asmar et al (5), ZIK1 was hypermethylated and downregulated in the results of the present study. Numerous studies have reported an association between zinc finger proteins and lymphoma or lymphatic disease. Bond et al (27) suggests that early hematopoietic zinc finger proteins are important in the regulation of human hematopoiesis and may be associated with the development of hematopoietic malignancies. Möröy et al (28) suggests that zinc finger protein growth factor independent 1 transcriptional repressor may maintain development and progression of lymphoid leukemia by suppressing the activation of the tumor suppressor p53. This suggests that different zinc finger proteins serve different roles in various types of cancer. Therefore, it was hypothesized that ZIK1, ZNF134, ZNF256 and ZNF615 may be involved in the development of DLBCL with TET2 mutations. Further validation of these genes is required. In conclusion, the results of the present study suggested that the genes which were hypermethylated and downregulated in DLBCL biopsy tissues with TET2 mutations, exhibited important roles in the development of DLBCL. These genes, including CRY1, ZIK1, ZNF134, ZNF256 and ZNF615, may be affected by TET mutations and may act as potential biomarkers for the early diagnosis, therapy and prognosis of DLBCL. However, further experiments are required in order to validate the specific function of these genes in DLBCL.
  27 in total

1.  DNA methylation signatures define molecular subtypes of diffuse large B-cell lymphoma.

Authors:  Rita Shaknovich; Huimin Geng; Nathalie A Johnson; Lucas Tsikitas; Leandro Cerchietti; John M Greally; Randy D Gascoyne; Olivier Elemento; Ari Melnick
Journal:  Blood       Date:  2010-07-07       Impact factor: 22.113

2.  DNA methylation in eukaryotes: kinetics of demethylation and de novo methylation during the life cycle.

Authors:  S P Otto; V Walbot
Journal:  Genetics       Date:  1990-02       Impact factor: 4.562

3.  TET2 inactivation results in pleiotropic hematopoietic abnormalities in mouse and is a recurrent event during human lymphomagenesis.

Authors:  Cyril Quivoron; Lucile Couronné; Véronique Della Valle; Cécile K Lopez; Isabelle Plo; Orianne Wagner-Ballon; Marcio Do Cruzeiro; Francois Delhommeau; Bertrand Arnulf; Marc-Henri Stern; Lucy Godley; Paule Opolon; Hervé Tilly; Eric Solary; Yannis Duffourd; Philippe Dessen; Hélène Merle-Beral; Florence Nguyen-Khac; Michaëla Fontenay; William Vainchenker; Christian Bastard; Thomas Mercher; Olivier A Bernard
Journal:  Cancer Cell       Date:  2011-06-30       Impact factor: 31.743

4.  mCRY1 and mCRY2 are essential components of the negative limb of the circadian clock feedback loop.

Authors:  K Kume; M J Zylka; S Sriram; L P Shearman; D R Weaver; X Jin; E S Maywood; M H Hastings; S M Reppert
Journal:  Cell       Date:  1999-07-23       Impact factor: 41.582

Review 5.  The fundamental role of epigenetic events in cancer.

Authors:  Peter A Jones; Stephen B Baylin
Journal:  Nat Rev Genet       Date:  2002-06       Impact factor: 53.242

6.  Molecular profiling of diffuse large B-cell lymphoma identifies robust subtypes including one characterized by host inflammatory response.

Authors:  Stefano Monti; Kerry J Savage; Jeffery L Kutok; Friedrich Feuerhake; Paul Kurtin; Martin Mihm; Bingyan Wu; Laura Pasqualucci; Donna Neuberg; Ricardo C T Aguiar; Paola Dal Cin; Christine Ladd; Geraldine S Pinkus; Gilles Salles; Nancy Lee Harris; Riccardo Dalla-Favera; Thomas M Habermann; Jon C Aster; Todd R Golub; Margaret A Shipp
Journal:  Blood       Date:  2004-11-18       Impact factor: 22.113

7.  Enrichment map: a network-based method for gene-set enrichment visualization and interpretation.

Authors:  Daniele Merico; Ruth Isserlin; Oliver Stueker; Andrew Emili; Gary D Bader
Journal:  PLoS One       Date:  2010-11-15       Impact factor: 3.240

Review 8.  Circulating nucleic acids in plasma/serum.

Authors:  Jason C H Tsang; Y M Dennis Lo
Journal:  Pathology       Date:  2007-04       Impact factor: 5.306

9.  Microarray-based DNA methylation profiling: technology and applications.

Authors:  Axel Schumacher; Philipp Kapranov; Zachary Kaminsky; James Flanagan; Abbas Assadzadeh; Patrick Yau; Carl Virtanen; Neil Winegarden; Jill Cheng; Thomas Gingeras; Arturas Petronis
Journal:  Nucleic Acids Res       Date:  2006-01-20       Impact factor: 16.971

10.  Pathview: an R/Bioconductor package for pathway-based data integration and visualization.

Authors:  Weijun Luo; Cory Brouwer
Journal:  Bioinformatics       Date:  2013-06-04       Impact factor: 6.937

View more
  3 in total

1.  Epigenetic Regulation of miR-92a and TET2 and Their Association in Non-Hodgkin Lymphoma.

Authors:  Esther K Elliott; Lloyd N Hopkins; Robert Hensen; Heidi G Sutherland; Larisa M Haupt; Lyn R Griffiths
Journal:  Front Genet       Date:  2021-11-26       Impact factor: 4.599

2.  Identification of DNA methylation biomarkers with potential to predict response to neoadjuvant chemotherapy in triple-negative breast cancer.

Authors:  Vinod Ganju; Susan J Clark; Ruth Pidsley; Clare Stirzaker; Braydon Meyer; Samuel Clifton; Warwick Locke; Phuc-Loi Luu; Qian Du; Dilys Lam; Nicola J Armstrong; Beena Kumar; Niantao Deng; Kate Harvey; Alex Swarbrick
Journal:  Clin Epigenetics       Date:  2021-12-18       Impact factor: 6.551

3.  LTBP2 inhibits prostate cancer progression and metastasis via the PI3K/AKT signaling pathway.

Authors:  Xiaowen Zhang; Chuanjie Tian; Jianbin Cheng; Weipu Mao; Menglan Li; Ming Chen
Journal:  Exp Ther Med       Date:  2022-07-08       Impact factor: 2.751

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.