| Literature DB >> 30675303 |
Ranran Wang1,2, Xiao Zhou1,3, Hui Wang2, Bo Zhou3, Shanshan Dong1,2, Qi Ding1,2, Mingjing Peng1, Xiaowu Sheng1, Jianfeng Yao4, Rongfu Huang5, Yong Zeng1,2, Ying Long1,2.
Abstract
Oral tongue squamous cell carcinoma (OTSCC) is the most common type of oral cancer. Despite advances in knowledge regarding the genome-scale gene expression pattern of oral cancer, the molecular portrait of OTSCC biology has remained unclear over the last few decades. Furthermore, studies concerning OTSCC gene-expression profiles are limited or inconsistent owing to tissue heterogeneity in single-cohort studies. Consequently, the present study integrated the profile datasets of three cohorts in order to screen for differentially expressed genes (DEGs), and subsequently identified the potential candidate genes and pathways in OTSCC through gene enrichment analysis and protein-protein interaction (PPI) network construction. Using the selected Gene Expression Omnibus datasets GSE13601, GSE31056 and GSE78060, 206 DEGs (125 upregulated and 81 downregulated) were identified in OTSCC, principally associated with extracellular matrix (ECM) organization and the phosphoinositide 3-kinase/protein kinase B signaling pathway. Furthermore, 146/206 DEGs were filtered into the PPI network and 20 hub genes were sorted. Further results indicated that the two most significant modules filtered from the PPI network were associated with ECM organization and human papillomavirus infection, which are important factors affecting OTSCC pathology. Overall, a set of OTSCC-associated DEGs has been identified, including certain key candidate genes that may be of vital importance for diagnosis, therapy and prevention of this disease.Entities:
Keywords: differentially expressed gene; integrative bioinformatics; microarray; oral tongue squamous cell carcinoma; protein-protein network
Year: 2018 PMID: 30675303 PMCID: PMC6341834 DOI: 10.3892/ol.2018.9866
Source DB: PubMed Journal: Oncol Lett ISSN: 1792-1074 Impact factor: 2.967
Figure 1.Identification of 206 common DEGs. Cohort profile datasets A, B and C were selected from datasets GSE13601, GSE31056 and GSE78060, respectively, and are indicated in different colors. The DEGs were identified using the edgeR package in R with cut-off criteria of [log2fold change] >1 and adjusted P<0.05. The overlapping areas represent the common genes. DEGs, differentially expressed genes.
List of 206 consistently aberrant genes identified from three Gene Expression Omnibus datasets.
| Gene expression | Differentially expressed genes |
|---|---|
| Upregulated (n=125) | IFI27, CDH3, PYGL, MYO1B, MMP1, SCO2, TYMP, BNC1, COL4A1, MMP3, PTHLH, IRF6, F2RL1, COL4A2, IFI6, ACTN1, THBS2, RAB31, SLC16A1, ISG15, PRNP, KRT16, TPBG, MDFI, OSMR, PLAU, SERPINE1, PROCR, PXDN, DUSP7, ITGA6, COL1A2, SOX15, LAMB3, SHC1, NDRG1, LAMC2, ADORA2B, PDLIM4, COL5A2, GJA1, LGALS3BP, MMP13, DFNA5, IL1RAP, PDPN, RGS20, FSCN1, TPST1, STK3, SLC7A5, CTSC, ADAM10, COL7A1, UPP1, PTK7, CA2, ITGA3, GJB3, APOL1, SCG5, EIF6, PLAUR, SOX11, MMP10, COL3A1, TGFBI, MMP12, COL17A1, IRF9, ZWINT, STAT1, BPGM, PCDH7, NUP155, GNA15, POSTN, OAS1, IGFBP3, FAP, COL4A5, TUBB3, DUSP14, FST, TK1, SNAI2, FOXM1, GINS1, TRIP13, HIST1H2AE, IFIT3, PLOD2, DSG2, TGIF1, MYO10, IFI44, IFIT1, CXCL11, PRSS23, RBP1, SQLE, YKT6, KRT10, SNAPC1, BST2, HOMER3, SPP1, ENO1, DLGAP5, KIF23, OASL, COL4A6, RSAD2, CDC20, TNC, F3, FOLR3, EFNA1, PLSCR1, FN1, HIST1H2BD, GNLY, S100A3, LY6E, CCNB1 |
| Downregulated (n=81) | ADH1B, GPRASP1, MEOX2, MYRIP, CBX7, ATP6V0E2, GPR64, C7, RNASE4, ITM2A, SLC25A20, CDO1, CLDN10, MAN2A2, GNG7, SATB1, TXNIP, SERPINA5, LPIN1, ABCA3, SELENBP1, LMO2, GYPC, CXCL12, KAT2B, ZNF529, RTN1, PRELP, ANG, CFD, SSBP2, CCDC69, ENPP4, BEX4, TSPYL5, MYOC, NCLN, SYNGR1, GDF10, P2RY14, CLU, PIP5K1B, ALDH1A1, CILP, MFAP4, FRZB, IGF1, TOX3, ZBTB20, RORC, NR3C2, PTGFR, CPEB3, LGI1, SUSD5, CLGN, GAS2, LCP1, SORBS2, HLF, DPT, CX3CR1, SERPINI1, ACOX2, ASPA, PCK1, MIA, LMOD1, NFIB, SLITRK5, CRISP3, DCLK1, ANGPT1, ABCA6, FAM149A, SPARCL1, NPY1R, PTGDS, AMPD1, FBLN5, STATH |
Genes significantly associated with overall survival in oral tongue squamous cell carcinoma.
| Gene | HR | 95% CI | P-value |
|---|---|---|---|
| NCLN | 38.678 | 1.117–6.192 | 0.0047 |
| THBS2 | 2.050 | 0.153–1.283 | 0.0127 |
| SPARCL1 | 3.333 | 0.097–2.310 | 0.0330 |
| YKT6 | 13.765 | 0.179–5.065 | 0.0354 |
HR, hazard ratio; CI, confidence interval.
Figure 2.Visualization of the GO enrichment analysis for 206 differentially expressed genes in oral tongue squamous cell carcinoma. The GO enrichment analysis was performed with the Database for Annotation, Visualization and Integrated Discovery online tool, and the detailed information is presented as a bubble chart. The y-axis represents the GO terms, the x-axis represents the BP, CC and MF functional group categorization, the size of bubbles represents the number of assigned genes, and the color of the bubbles indicates the -log10 (Q-value). The larger the number of genes associated with the term, the larger the bubble. The more significant the GO category, the higher on the color bar the bubble is. GO, Gene Ontology; BP, biological process; CC, cellular component; MF, molecular function; Q-value, Bonferroni-adjusted P-value.
Figure 3.Visualization of the pathway enrichment analysis for 206 differentially expressed genes in oral tongue squamous cell carcinoma. The pathway enrichment analysis was performed with Database for Annotation, Visualization and Integrated Discovery online tool, and the detailed information is presented as a bubble chart. The y-axis represents the significantly enriched pathways, the x-axis represents the richness factor, the size of the bubbles represents the number of assigned genes, and the color of bubbles represents the -log10 (Q-value). The larger number of genes classified into the pathway, the larger the node size is. The more significant the pathway, the higher on the color bar the bubble is. KEGG, Kyoto Encyclopedia of Genes and Genomes; PI3K, phosphoinositide 3-kinase; Akt, protein kinase B; ECM, extracellular matrix; PDGF, platelet-derived growth factor; Q-value, Bonferroni-adjusted P-value.
Figure 4.PPI network for DEGs in oral tongue squamous cell carcinoma. Using the STRING online database, a total of 142 DEG-encoded proteins were filtered into the PPI network, with the remaining 64 disconnected nodes hidden. Topology analysis was performed using the MCODE plugin and two significant sub-networks (modules), termed 1 and 2, were identified. Nodes in light salmon color represent the DEG-encoded proteins of module 1, and nodes in yellow-green color represent the DEG-encoded proteins of module 2. PPI, protein-protein interaction; DEGs, differentially expressed genes.
Figure 5.Functional annotation of the two significant modules from the protein-protein interaction network analysis. The function annotation of the two sub-networks was performed using the ClueGO and CluePedia plugins. (A) Module 1 consists of 20 proteins that are principally associated with extracellular matrix organization. (B) Module 2 consists of 14 proteins that are principally associated with human papillomavirus infection. Solid rounded rectangles represent enriched Gene Ontology terms, solid circles represent enriched Kyoto Encyclopedia of Genes and Genomes pathways.
Figure 6.Expression heat maps of DEGs in different samples of GSE31056. (A) In total, 206 DEGs were identified in oral tongue squamous cell carcinoma samples compared with normal tongue tissues. (B) A total of 206 DEGs in oral squamous cell carcinoma samples compared with normal oral tissues. The rows represent the genes, and the columns represent the samples. The red and blue colors indicate upregulated and downregulated genes, respectively. DEGs, differentially expressed genes.
Figure 7.Expression heat maps of the 206 identified DEGs in oral tongue squamous cell carcinoma samples compared with normal tongue tissues in datasets (A) GSE13601 and (B) GSE78060. The rows represent the genes, and the columns represent the samples. The red and blue colors indicate upregulated and downregulated genes, respectively. DEGs, differentially expressed genes; OTSCC, oral tongue squamous cell carcinoma.
Figure 8.Intersection of differentially expressed genes identified from the GEO microarray datasets and a TCGA RNA sequencing dataset of oral tongue squamous cell carcinoma samples. As indicated in the Venn diagram, the results from the two databases had a total of 119 genes (72 upregulated and 47 downregulated genes) in common. TCGA, The Cancer Genome Atlas; GEO, Gene Expression Omnibus.
List of 119 aberrant genes in oral tongue squamous cell carcinoma identified from the Gene Expression Omnibus and the cancer Genome Atlas databases.
| Gene expression | Differentially expressed genes |
|---|---|
| Upregulated (n=72) | COL4A5, CCNB1, SHC1, ITGA3, GINS1, FOXM1, PXDN, TPBG, FN1, IFI27, FST, COL5A2, SPP1, ITGA6, PLOD2, MMP1, MMP12, BNC1, KIF23, GNLY, CDH3, COL4A2, MMP3, POSTN, FSCN1, PLSCR1, DLGAP5, COL4A6, COL4A1, LAMC2, TPST1, ACTN1, COL1A2, PROCR, SLC16A1, FOLR3, IFIT3, MYO1B, PLAU, MMP13, HOMER3, PTHLH, CXCL11, MYO10, PTK7, ADAM10, CDC20, RAB31, OASL, PRNP, TRIP13, DFNA5, ISG15, PDPN, TK1, TNC, FAP, BST2, IFI6, PYGL, IFIT1, THBS2, PRSS23, SERPINE1, RSAD2, SOX11, RBP1, TGFBI, SNAI2, SCG5, IFI44, CTSC |
| Downregulated (n=47) | GDF10, CPEB3, TOX3, HLF, SORBS2, NPY1R, CLDN10, MIA, SSBP2, NR3C2, CBX7, MYOC, SLC25A20, GAS2, GNG7, RORC, PIP5K1B, LPIN1, CX3CR1, ATP6V0E2, SERPINA5, SYNGR1, CFD, RNASE4, SATB1, KAT2B, ENPP4, FAM149A, LMOD1, ASPA, AMPD1, ANG, BEX4, CRISP3, STATH, DPT, PTGDS, NFIB, SLITRK5, ALDH1A1, ITM2A, GPRASP1, ADH1B, MYRIP, FRZB, ACOX2, SELENBP1 |