Literature DB >> 28927102

Mining the glioma susceptibility genes in children from gene expression profiles and a methylation database.

Yongqiang Xi1, Wanzhong Tang2, Song Yang3, Maolei Li4, Yuchao He3, Xianhua Fu3.   

Abstract

Glioma is the most common type of primary brain tumor, which is associated with a poor prognosis due to its aggressive growth behavior and highly invasive nature. Research regarding glioma pathogenesis is expected to provide novel methods of adjuvant therapy for the treatment of glioma. The use of bioinformatics to identify candidate genes is commonly used to understand the genetic basis of disease. The present study used bioinformatics to mine the disease-related genes using gene expression profiles (GSE50021) and dual-channel DNA methylation data (GSE50022). The results identified 17 methylation sites located on 33 transcription factor binding sites, which may be responsible for downregulation of 17 target genes. glutamate metabotropic receptor 2 was one of the 17 downregulated target genes. Furthermore, inositol-trisphosphate 3-kinase A (ITPKA) was revealed to be the gene most associated with the risk of glioma in children. The protein coded by the ITPKA gene appeared in all risk sub-pathways, thus suggesting that ITPKA was the gene most associated with the risk of glioma, and inositol phosphate metabolism may be a key pathway associated with glioma in children. The identification of specific genes helps to determine the pathogenesis and possible therapeutic targets for the treatment of glioma in children.

Entities:  

Keywords:  children; gene expression profile; glioma; methylation database

Year:  2017        PMID: 28927102      PMCID: PMC5587983          DOI: 10.3892/ol.2017.6579

Source DB:  PubMed          Journal:  Oncol Lett        ISSN: 1792-1074            Impact factor:   2.967


Introduction

Glioma commonly occurs in the brain or spine, and is a particularly lethal solid tumor that arises from support cells in the central nervous system. Glioma can be classified into the following types: Astrocytic tumors, oligodendrogliomas and oligoastrocytomas (1). In addition, patients with glioma often suffer from fatigue, motor difficulties, drowsiness, communication difficulties, headaches and visual problems (2). A previous study reported that the median survival time of patients with the most malignant form of glioma is only 14.6 months, and the 5-year survival rate is only 9.8% (3). The high rates of morbidity and mortality associated with glioma are due to its chemoradiation resistance, and high proliferative and invasive capabilities (4). In addition to childhood leukemia, glioma is the most common type of tumor in children, and accounts for 80% of malignant brain tumors (5). There is a peak incidence of glioma between the ages of 0 and 8 years (6). Patients with glioma that have survived surgery are more affected by their tumor and its resection than previously appreciated (7). In addition, in patients with childhood brain tumors treated with radiotherapy, memory deficits reflect general cognitive dysfunction (8). At present, the standard approach for treating glioma in children is chemotherapy; however, the safety and effectiveness of chemotherapy in children remains a problem. Understanding the regulation of the pathological activities of glioma is important for its eradication. Research regarding glioma pathogenesis is expected to provide novel methods of adjuvant therapy for the treatment of glioma (9). With scientific improvements, the concept of epigenetics offers a potential method for cancer therapy. Epigenetics refers to stable alteration in gene expression with no underlying modifications in genetic sequence (10); DNA methylation is central to the aberrant epigentics associated with cancer (11). DNA methylation is a biochemical process where a methyl group is added to the CpG islands of DNA nucleotides by DNA methyltransferases in mammals. DNA methylation of CpG dinucleotides is considered a distinct inhibitory pathway that may mediate stable silencing (12). CpG islands are located on promoter regions, and the sequences that surround promoter regions. It has previously been reported that methylation at the CpG islands may induce tumorigenesis, since the abnormal expression of genes is involved in tumor development (13). Costello et al (14) reported that transcriptional repression of a tumor suppressor gene, cyclin-dependent kinase inhibitor 2A, was mediated by aberrant methylation of the CpG island in glioma. A previous study also demonstrated that promoter methylation of DNA repair genes is associated with the tumorigenesis and development of glioma (15). However, despite extensive research, the exact mechanisms underlying how DNA methylation induces glioma remain unclear. The present study compared DNA methylation and differentially expressed genes (DEGs) between glioma and normal tissues using Gene Expression Omnibus (GEO) data. Subsequently, the important genes and pathways in glioma were screened by enrichment analysis of transcriptional regulatory networks and sub-pathways. The present study aimed to determine the possible molecular mechanisms and the aberrant promoter methylation of tumor-associated genes in glioma, via identification of key genes and pathways. The results may provide a theoretical basis and identify potential molecular biological therapies for the treatment of glioma in children.

Materials and methods

Data sources

GSE50021 DNA microarray data (16) for gene expression profiling and GSE50022 dual-channel DNA methylation data (16) were downloaded from GEO (http://www.ncbi.nlm.nih.gov/geo/). GSE50021 contains 45 samples, including 35 samples from patients with glioma (average age 1.008±1.190491 years) and 10 autopsy samples of normal human brain tissue. Data from the chip (Illumina HumanHT-12 WG-DASL v4.0 Expression Beadchip; Illumina, San Diego, CA, USA) were annotated based on the GPL13938 platform. GSE50022 contains 28 samples from patients with glioma (average age, 0.9433333±0.781596 years). Data from the chip (Illumina HumanMethylation450 BeadChip [UBC enhanced annotation v1.0]; Illumina) were annotated based on the GPL16304 platform.

Preprocessing of gene expression profiles

The expression values for genes with various capture probes were used to determine a mean value for each gene. The K-nearest neighbour (KNN) impute missing values module was used to impute missing values by assigning gene expression values based on the nearest neighbors of the gene (k=10). All expression values were normalized using quantile normalization, as implemented in the normalize.quantiles function of the preprocessCore R package (17). Expression values were displayed using a box plot.

Preprocessing of DNA methylation data

For probes with missing values in the original methylation index matrix the KNN impute missing values module was used to impute the missing values by assigning methylation gene expression values based on the nearest neighbors of the gene (k=10).

Screening of differentially expressed genes (DEGs)

The probe identifiers were converted into gene names. The DEG analysis was conducted using limma package (http://www.bioconductor.org). |log2FC|>1 and P<0.05 were set as the cut-offs to screen for DEGs. DEGs were initially identified following an analysis using Student's t-test. The identified DEGs were considered glioma-related genes in children.

Screening of key methylated sites

According to the annotation information, probes were screened to determine which methylated sites were located near DEGs. Subsequently, the preliminarily reserved methylated sites were screened, and ≥80% of samples with a methylation index of ≥0.8 were reserved. Due to the lack of clear definition of the promoter regions of genes, 50 kb on either side of the transcription start site was considered the promoter region. Finally, the methylated sites were re-screened, and methylated sites were reserved that were located 50 kb on either side of the transcription start site. After screening three times, the remaining methylated sites were considered key methylated sites, which were located in the promoter region of genes and have a high methylation index, thus potentially influencing the expression of downstream genes.

Screening of significant transcription factors

Using the transcription factor binding sites (TFBS) prediction tool from University of California, Santa Cruz (UCSC; http://genome.ucsc.edu) the methylated sites were screened in TFBS. A transcription factor regulatory network was generated using Cytoscape (http://cytoscape.org/), which is an open source desktop tool for integrating, visualizing and analyzing data in the context of biological networks. Enrichment analysis of potential downregulated genes influenced by methylation was performed using the Database for Annotation, Visualization and Integrated Discovery (http://david.abcc.ncifcrf.gov/).

Analysis of risk sub-pathways

A hypergeometric test for risk sub-pathways was performed by application of the R package iSubpathwayMiner (http://cran.r-project.org/web/packages/iSubpathwayMiner/). All genes that were mapped to sub-pathways with a significant enrichment were reported. These pathways were used to identify glioma-related pathways.

Results

Preprocessing of gene expression profiles and DNA methylation data

To explore glioma-related genes in children, 19,252 genes were identified from 45 samples following preprocessing of gene expression profiles. As shown in Fig. 1 expression boxplots were generated before and after normalization. The original expression datasets from all conditions were processed into expression estimates using the median polish algorithm. As shown in a box plot (Fig. 1B), the median of different samples was almost on the same line after normalization, which shows an excellent degree of standardization.
Figure 1.

Expression boxplots for glioma-related genes. Box plots represent the mean ± interquartile range, with whiskers extending to the minimum and maximum value. (A) Data prior to normalization; (B) data after normalization. GSM1212511-GSM1212545 (green) represent gliomas samples; GSM1212546-GSM1212555 (yellow) represent normal samples.

Data located in the X/Y and mitochondrial chromosomes were removed. Subsequently, a methylation index array identified 371,920 sites from the 28 samples, following imputation of missing values.

Screening of DEGs

DEGs were analyzed using limma package. Using the selective conditions (|log2FC|>1 and P<0.05), 58 DEGs were identified, all of which were downregulated.

Identification of DNA methylation in the promoter regions of downregulated genes

TFBS methylation has a strong impact on the regulatory role of transcription factors and the normal expression of downstream genes, thus affecting body functioning. DNA methylation in the promoter region is an important aspect of DNA methylation in cancer (18,19). The 58 downregulated DEGs identified in the present study were overlapped with DNA methylation data; 1,123 methylation sites were revealed to be associated with the 58 downregulated genes. The sites were distributed evenly over the 22 autosomes. The methylation index of the 1,123 methylation sites was analyzed, and ≥80% of samples with a methylation index of ≥0.8 were reserved. This resulted in the reservation of 168 methylation sites. To further analyze the 168 methylation sites, methylated sites located 50 kb on either side of the transcription start sites were reserved. This resulted in the reservation of 119 methylation sites located on the transcription start sites of 43 genes. These 43 genes are considered potential glioma susceptibility genes in children. Finally, 17 methylation sites located on 33 TFBS were identifed, which may regulate 17 of the downregulated target genes. These results were determined using the TFBS prediction tool from UCSC (Table I).
Table I.

Key methylation sites.

IDChromosomeMapinfotfbs_starttfbs_endTranscription factorDistance_closest_TSSClosest_TSS_gene_name
cg06191091chr17305838553058384830583862USF−9339RHBDL3
cg02629157chr9138670609138670546138670568TCF1125013KCNT1
cg11709150chr1244043824404312440444TCF1110256PLCH2
cg04585209chr11629231162922576292272TAXCREB306CCKBR
cg10707626chr3517470985174702751747051STAT5A6018GRM2
cg06191091chr17305838553058384930583860SREBP1−9339RHBDL3
cg12603173chr11645084216450840964508423RREB1−66RASGRP2
cg11025960chr3517491885174917751749195RFX18108GRM2
cg10692302chr3517472275174722451747245PPARG6147GRM2
cg02629157chr9138670609138670558138670569POU6F125013KCNT1
cg11014582chr6763337277633367576333696PAX6−852LMO7
cg04341461chr1241000624099782410006PAX5−1616PLCH2
cg10692302chr3517472275174722251747252PAX46147GRM2
cg04625615chr15417883684178831041788330P532313ITPKA
cg07200386chr8220791692207911322079135OLF110682PHYHIP
cg11014582chr6763337277633367676333683NKX25−852LMO7
cg09864712chr16726786726720726749MYOGNF1712RHBDL1
cg06191091chr17305838553058384830583862MYCMAX−9339RHBDL3
cg00810908chr3136123191361230613612320MEIS1AHOXA92080FBLN2
cg11025960chr3517491885174918151749190LMO2COM8108GRM2
cg03358506chr8220587022205868822058703ISRE31149PHYHIP
cg07776629chr16579891225798911657989129IRF215898CNGB1
cg07776629chr16579891225798911657989129IRF115898CNGB1
cg10692302chr3517472275174722551747244HNF46147GRM2
cg06632557chr11613135486131349561313505HMX1−3678SYT7
cg00155846chr9138011566138011506138011522HAND1E4714081OLFM1
cg11025960chr3517491885174918151749190GATA38108GRM2
cg11025960chr3517491885174917951749193GATA18108GRM2
cg04625615chr15417883684178831041788321GATA32313ITPKA
cg05392169chr9138011814138011802138011816FOXO314329OLFM1

tfbs, transcription factor binding sites; TSS, transcription start site.

Analysis of transcriptional regulatory networks

To construct a transcription factor regulatory network, predictive models of transcriptional regulatory networks were generated using Cytoscape. A transcriptional regulatory network including 33 transcription factors and target genes is presented in Fig. 2. The eight methylation sites in the TFBS of glutamate metabotropic receptor 2 (GRM2) potentially influenced the binding of eight transcription factors to TFBS. It may therefore be hypothesized that the downregulation of GRM2 is caused by methylation of these TFBS. In addition, methylation sites in the TFBS of rhomboid, veinlet-like 3 (Drosophila) (RHBDL3) resulted in the failure of four transcription factors binding with TFBS, thus affecting transcriptional regulation. Furthermore, the transcription factor CP2 regulates rhomboid 5 homolog 2 (Drosophila) (RHBDF2) expression; in the present study, two methylation sites were detected in TFBS of CP2, which may influence transcriptional regulation of RHBDF2.
Figure 2.

Transcriptional regulatory networks. Yellow triangles represent transcription factors; purple circles represent target genes; arrows represent transcriptional regulation.

Functional analysis of key genes

To generate the downregulated target gene-related risk sub-pathways, a hypergeometric test was performed using R package iSubpathwayMiner. Four sub-pathways were generated, which were associated with inositol phosphate metabolism (hsa00562). As shown in Fig. 3 the inositol-trisphosphate 3-kinase protein (EC 2.7.1.127), which is coded by the ITPKA gene (marked with a red rim), appeared in all of the risk sub-pathways. In conclusion, ITPKA was considered to be the gene most associated with the risk of glioma in children.
Figure 3.

Risk sub-pathways in inositol phosphate metabolism. ITPKA gene is marked with a red rim. ITPKA, inositol-trisphosphate 3-kinase.

Discussion

Bioinformatics is an interdisciplinary field that develops methods and software tools for understanding biological data. The identification of candidate genes and single nucleotide polymorphisms are common uses of bioinformatics to understand the genetic basis of disease (20). The present study identified 17 methylation sites located on 33 TFBS, which may regulate the 17 downregulated target genes. GRM2 was identified as one of the 17 downregulated target genes. Furthermore, ITPKA was revealed to be the gene most associated with glioma risk in children. The protein coded by the ITPKA gene appeared in all risk sub-pathways, thus suggesting that ITPKA is the gene most associated with glioma risk, and inositol phosphate metabolism is the key pathway associated with glioma in children. Glioma is the most common type of primary brain tumor, which is associated with a poor prognosis for patients due to its aggressive growth behavior and highly invasive nature (21). Abnormal metabolic networks are closely associated with the pathogenesis of glioma. A metabolic pathway is a series of chemical reactions, which are catalyzed by enzymes, where the product of one enzyme acts as a substrate for the next. Abnormal gene expression of specific enzymes may affect other enzymatic genes in the network. Therefore, identification of disease-related genes in the metabolic pathway is of great importance. The correct targeting, localization, regulation and signaling of metabotropic glutamate receptor 2 (mGluR2), which is coded by the GRM2 gene, represents a major mechanism underlying the complex function of neuronal networks (22). Dysfunction of mGluR and associated proteins is associated with neurodegenerative and neuropsychiatric disorders. Several types of glioma have been reported to release high levels of glutamate, which promotes malignancy. A previous study reported that activation of mGluR2 supports the growth of human glioma cells in culture, and that antagonists of mGluR2 should be assessed for their ability to reduce tumour growth in vivo (23). Arcella et al (24) demonstrated that mGlu2/3 receptor antagonists are of potential use in the experimental treatment of malignant gliomas. Furthermore, ectopic expression of ITPKA in lung cancer cells has been shown to increase their metastatic potential, since the protein exhibits two actin-regulating activities (25). The ITPKA gene has also been reported to be hypermethylated in human patients with normal karyotype acute myeloid leukemia (26). Furthermore, the remaining 15 downregulated genes identified in the present study may be beneficial for the generation of a target therapy for the treatment of patients with glioma. In conclusion, the present study proposed a novel method for extracting DEGs from gene expression datasets combined with DNA methylation data, and applied it to the occurrence of glioma in children. Results of the functional analysis revealed that ITPKA was considered the gene most associated with the risk of glioma, and inositol phosphate metabolism may serve an important role in glioma in children. The identification of specific genes may improve understanding regarding the pathogenesis and possible therapeutic targets of glioma in children.
  25 in total

1.  Genes, genetics, and epigenetics: a correspondence.

Authors:  J R Morris
Journal:  Science       Date:  2001-08-10       Impact factor: 47.728

Review 2.  Epigenetic modifications and human disease.

Authors:  Anna Portela; Manel Esteller
Journal:  Nat Biotechnol       Date:  2010-10       Impact factor: 54.908

Review 3.  Eukaryotic cytosine methyltransferases.

Authors:  Mary Grace Goll; Timothy H Bestor
Journal:  Annu Rev Biochem       Date:  2005       Impact factor: 23.643

4.  Effect of disease burden on health-related quality of life in patients with malignant gliomas.

Authors:  D Osoba; M Brada; M D Prados; W K Yung
Journal:  Neuro Oncol       Date:  2000-10       Impact factor: 12.300

5.  Silencing of p16/CDKN2 expression in human gliomas by methylation and chromatin condensation.

Authors:  J F Costello; M S Berger; H S Huang; W K Cavenee
Journal:  Cancer Res       Date:  1996-05-15       Impact factor: 12.701

6.  Effects of radiotherapy with concomitant and adjuvant temozolomide versus radiotherapy alone on survival in glioblastoma in a randomised phase III study: 5-year analysis of the EORTC-NCIC trial.

Authors:  Roger Stupp; Monika E Hegi; Warren P Mason; Martin J van den Bent; Martin J B Taphoorn; Robert C Janzer; Samuel K Ludwin; Anouk Allgeier; Barbara Fisher; Karl Belanger; Peter Hau; Alba A Brandes; Johanna Gijtenbeek; Christine Marosi; Charles J Vecht; Karima Mokhtari; Pieter Wesseling; Salvador Villa; Elizabeth Eisenhauer; Thierry Gorlia; Michael Weller; Denis Lacombe; J Gregory Cairncross; René-Olivier Mirimanoff
Journal:  Lancet Oncol       Date:  2009-03-09       Impact factor: 41.316

7.  DNA methylation at promoter regions of interleukin 1B, interleukin 6, and interleukin 8 in non-small cell lung cancer.

Authors:  Xavier Tekpli; Nina E Landvik; Kristine H Anmarkud; Vidar Skaug; Aage Haugen; Shanbeh Zienolddiny
Journal:  Cancer Immunol Immunother       Date:  2012-08-26       Impact factor: 6.968

8.  Medical, psychological, cognitive and educational late-effects in pediatric low-grade glioma survivors treated with surgery only.

Authors:  Christopher D Turner; Christine A Chordas; Cori C Liptak; Celiane Rey-Casserly; Brian L Delaney; Nicole J Ullrich; Liliana C Goumnerova; R Michael Scott; Heather C Begley; William J Fletcher; Xiaopan Yao; Susan Chi; Mark W Kieran
Journal:  Pediatr Blood Cancer       Date:  2009-09       Impact factor: 3.167

Review 9.  Malignant astrocytic glioma: genetics, biology, and paths to treatment.

Authors:  Frank B Furnari; Tim Fenton; Robert M Bachoo; Akitake Mukasa; Jayne M Stommel; Alexander Stegh; William C Hahn; Keith L Ligon; David N Louis; Cameron Brennan; Lynda Chin; Ronald A DePinho; Webster K Cavenee
Journal:  Genes Dev       Date:  2007-11-01       Impact factor: 11.361

10.  MGMT, GATA6, CD81, DR4, and CASP8 gene promoter methylation in glioblastoma.

Authors:  Daina Skiriute; Paulina Vaitkiene; Viktoras Saferis; Virginija Asmoniene; Kestutis Skauminas; Vytenis Pranas Deltuva; Arimantas Tamasauskas
Journal:  BMC Cancer       Date:  2012-06-06       Impact factor: 4.430

View more
  2 in total

1.  Bioinformatics analysis of microarray data to reveal the pathogenesis of diffuse intrinsic pontine glioma.

Authors:  Li Wei; Fei He; Wen Zhang; Wenhua Chen; Bo Yu
Journal:  Biol Res       Date:  2018-08-20       Impact factor: 5.612

2.  Identification of hub genes and small-molecule compounds in medulloblastoma by integrated bioinformatic analyses.

Authors:  Zhendong Liu; Ruotian Zhang; Zhenying Sun; Jiawei Yao; Penglei Yao; Xin Chen; Xinzhuang Wang; Ming Gao; Jinzhao Wan; Yiming Du; Shiguang Zhao
Journal:  PeerJ       Date:  2020-04-14       Impact factor: 2.984

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.