| Literature DB >> 32887903 |
Syeda Zahra Abbas1, Muhammad Imran Qadir1, Syed Aun Muhammad2.
Abstract
Oral cancer (OC) ranked as eleventh malignancy worldwide, with the increasing incidence among young patients. Limited understanding of complications in cancer progression, its development system, and their interactions are major restrictions towards the progress of optimal and effective treatment strategies. The system-level approach has been designed to explore genetic complexity of the disease and to identify novel oral cancer related genes to detect genomic alterations at molecular level, through cDNA differential analysis. We analyzed 21 oral cancer-related cDNA datasets and listed 30 differentially expressed genes (DEGs). Among 30, we found 6 significant DEGs including CYP1A1, CYP1B1, ADCY2, C7, SERPINB5, and ANAPC13 and studied their functional role in OC. Our genomic and interactive analysis showed significant enrichment of xenobiotics metabolism, p53 signaling pathway and microRNA pathways, towards OC progression and development. We used human proteomic data for post-translational modifications to interpret disease mutations and inter-individual genetic variations. The mutational analysis revealed the sequence predicted disordered region of 14%, 12.5%, 10.5% for ADCY2, CYP1B1, and C7 respectively. The MiRNA target prediction showed functional molecular annotation including specific miRNA-targets hsa-miR-4282, hsa-miR-2052, hsa-miR-216a-3p, for CYP1B1, C7, and ADCY2 respectively associated with oral cancer. We constructed the system level network and found important gene signatures. The drug-gene interaction of OC source genes with seven FDA approved OC drugs help to design or identify new drug target or establishing novel biomedical linkages regarding disease pathophysiology. This investigation demonstrates the importance of system genetics for identifying 6 OC genes (CYP1A1, CYP1B1, ADCY2, C7, SERPINB5, and ANAPC13) as potential drugs targets. Our integrative network-based system-level approach would help to find the genetic variants of OC that can accelerate drug discovery outcomes to develop a better understanding regarding treatment strategies for many cancer types.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32887903 PMCID: PMC7473858 DOI: 10.1038/s41598-020-71346-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
List of cDNA datasets analyzed in this study.
| S. no | Geo ID | Sample count (case: control) | Platform used | Tissues |
|---|---|---|---|---|
| 1 | GSE2280 | 22:05 | GPL96[HG-U133A] Affymetrix Human Genome U133A Array | Oral lymph node |
| 2 | GSE3524 | 16:04 | GPL96[HG-U133A] Affymetrix Human Genome U133A Array | Oral squamous |
| 3 | GSE10063 | 4:30 | GPL570[HG-U133_Plus_2] Affymetrix Human Genome U133 | Keratinocyte cell |
| 4 | GSE13601 | 31:27 | GPL8300 [HG_U95Av2] Affymetrix Human Genome U95 Array | Tongue |
| 5 | GSE21866 | 2:03 | GPL201 [HG-Focus] Affymetrix Human HG-Focus Target Array | Tongue squamous |
| 6 | GSE32142 | 2:02 | GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 | Oral squamous |
| 7 | GSE36111 | 5:00 | GPL571 [HG-U133A_2] Affymetrix Human Genome U133A 2.0 Array | Oral squamous |
| 8 | GSE38058 | 4:00 | GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 | Oral squamous |
| 9 | GSE38517 | 9:11 | GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 | Fibroblasts |
| 10 | GSE39376 | 11:17 | GPL201 [HG-Focus] Affymetrix Human HG-Focus Target Array | Buccal carcinoma cell |
| 11 | GSE43862 | 1:03 | GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 | Oral squamous |
| 12 | GSE44458 | 4:00 | GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 | Tongue squamous |
| 13 | GSE49673 | 6:06 | GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 | Parotid adenocarcinoma |
| 14 | GSE52811 | 4:04 | GPL8786 [miRNA-1] Affymetrix Multispecies miRNA-1 Array | Oral keratinocyte |
| 15 | GSE52915 | 27:00 | GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 | Tongue squamous |
| 16 | GSE57022 | 2:02 | GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 | Parotid adenocarcinoma |
| 17 | GSE59795 | 10:10 | GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 | Parotid adenocarcinoma |
| 18 | GSE70301 | 3:03 | GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 | Tongue squamous |
| 19 | GSE73171 | 3:03 | GPL14613 [miRNA-2] Affymetrix Multispecies miRNA-2 Array | Laryngeal squamous |
| 20 | GSE75127 | 4:4 | GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array | Oral squamous |
| 21 | GSE81821 | 5:5 | GPL14613 [miRNA-2] Affymetrix Multispecies miRNA-2 Array | Metastatic tumor tissue |
Figure 1Normalization and differential analysis. Histogram (smoothed histograms) shows density estimate of the data. Typically, the distributions of the arrays have similar shapes and ranges. Arrays whose distributions are very different from the others are considered for possible problems. High levels of background shifted an array's distribution to the right. Lack of signal diminishes its right tail. A bulge at the upper end of the intensity range indicated the signal saturation.
K-fold Cross-validation using “Boot” package of bioconductor software based on Gaussian dispersion modules.
| (Intercept) | 0.021148 | 0.003402 | 6.17 | < 1.00E−14*** |
| x1 | 0.052155 | 0.003825 | 22.009 | < 1.00E−14*** |
| x2 | − 0.02651 | 0.005117 | − 7.027 | < 1.96E−09*** |
| x3 | 0.149112 | 0.003105 | 30.007 | < 1.00E−14*** |
| x4 | 0.216532 | 0.001828 | 21.510 | < 1.00E−14*** |
| x5 | 0.048403 | 0.002152 | 31.003 | < 1.00E−14*** |
| x6 | 0.132542 | 0.002071 | 25.001 | < 1.00E−14*** |
| x7 | − 0.07733 | 0.001672 | − 29.216 | < 1.00E−14*** |
| x8 | 0.121504 | 0.002002 | 20.124 | < 1.00E−14*** |
| x9 | 0.212811 | 0.002825 | 65.278 | < 1.00E−14*** |
| x10 | 0.010029 | 0.003708 | 6.335 | < 1.00E−14*** |
| x11 | 0.022541 | 0.007523 | 22.601 | < 1.00E−14*** |
| x12 | − 0.02215 | 0.001205 | − 5.011 | 0.0089* |
| x13 | − 0.12532 | 0.003051 | − 56.055 | < 1.00E−14*** |
| x14 | 0.030526 | 0.001052 | 3.415 | 5.28E−12*** |
| x15 | − 0.028691 | 0.001611 | − 20.502 | < 1.00E−14*** |
Deviance residuals: Min (− 1.5101), 1Q (− 0.0412), Median (− 0.0100), 3Q (0.0132), Max (3.1977).
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1.
Number of Fisher Scoring iterations: 2; $K: [1] 10; $delta: [1] 0.00516 = 0.00515.
Null deviance: 100,813.5 on 53,225 degrees of freedom.
Residual deviance: 2,817.3 on 53,209 degrees of freedom.
Figure 2RNA degradation plot. Side-by-side plot produced by plot AffyRNAdeg representing 5′–3′-trend indicating an assessment of the severity of RNA degradation and significance level.
Figure 3The OC-related DEGs were curated using CTD (comparative toxicogenomics database), PubMed, OMIM, and MeSH databases.
Gene Ontology and functional enrichment of differentially expressed genes.
| Term | Fold enrichment | FDR | |
|---|---|---|---|
| hsa04913: Ovarian steroidogenesis | 4.86E−04 | 70.5102 | 0.464991 |
| GO:0,097,267 ~ omega-hydroxylase P450 pathway | 2.68E−03 | 621.9259 | 2.926305 |
| GO:0,016,712 ~ oxidoreductase activity | 0.00355 | 450.16 | 2.812419 |
| GO:0,019,373 ~ epoxygenase P450 pathway | 0.005349 | 310.963 | 5.768479 |
| GO:0,070,330 ~ aromatase activity | 0.006383 | 250.0889 | 5.007033 |
| GO:0,019,825 ~ oxygen binding | 0.011091 | 143.6681 | 8.558457 |
| GO:0,008,202 ~ steroid metabolic process | 0.01274 | 130.1705 | 13.24158 |
| IPR002401: Cytochrome P450, E-class, group I | 0.0134 | 123.7267 | 9.757773 |
| GO:0,004,497 ~ monooxygenase activity | 0.013674 | 116.4207 | 10.45655 |
| IPR017972: Cytochrome P450, conserved site | 0.01553 | 106.6609 | 11.23041 |
| IPR001128: Cytochrome P450 | 0.017126 | 96.66146 | 12.31951 |
| GO:0,071,407 ~ cellular response to organic cyclic compound | 0.017447 | 94.87006 | 17.71563 |
| Monooxygenase | 0.021199 | 77.95833 | 17.98805 |
| GO:0,031,090 ~ organelle membrane | 0.023645 | 69.82375 | 15.90821 |
| Metal ion-binding site: Iron (heme axial ligand) | 0.024432 | 67.55219 | 18.69242 |
| hsa00380: Tryptophan metabolism | 0.028619 | 57.58333 | 24.30038 |
| Microsome | 0.030952 | 53.18088 | 25.24757 |
| Heme | 0.031663 | 51.97222 | 25.75335 |
| GO:0,020,037 ~ heme binding | 0.032072 | 49.28759 | 23.01009 |
| Secondary metabolites biosynthesis, transport, and catabolism | 0.032577 | 30.69697 | 3.257651 |
| GO:0,005,506 ~ iron ion binding | 0.035767 | 44.13333 | 25.3362 |
| hsa00140: Steroid hormone biosynthesis | 0.041281 | 39.71264 | 33.24928 |
| hsa00980: Metabolism of xenobiotics by cytochrome P450 | 0.052426 | 31.12613 | 40.32831 |
| hsa05204: Chemical carcinogenesis | 0.056578 | 28.79167 | 42.78879 |
| hsa04914: Progesterone-mediated oocyte maturation | 0.061404 | 26.4751 | 45.53397 |
| Glycoprotein | 0.075408 | 3.014869 | 51.59749 |
| Iron | 0.076281 | 21.17387 | 52.01866 |
| hsa04114: Oocyte meiosis | 0.076443 | 21.1315 | 53.34849 |
| hsa04114: Oocyte meiosis | 0.076443 | 21.1315 | 53.34849 |
Figure 4Clinical phenotypes for oral cancer related DEGs using FunRich annotation tool.
Figure 5Cluster analysis of 6 oral cancer-related DEGs with Euclidean distance (Binning method). Quantile lines indicate the boundaries of the clusters in the level of the tree.
Expression profiling of cDNA microarray datasets.
| AFFYMETRIX_ 3PRIME_IVT_ID | Gene name | logFC | AveExpr | t | adj. | B | Abberation | |
|---|---|---|---|---|---|---|---|---|
| 205749_at | 3.900696 | 8.398162 | 39.22931 | 2.99E−46 | 1.63E−41 | 87.06766 | Up regulated | |
| 202992_at | − 1.84918 | 4.492108 | − 6.64914 | 3.77E−07 | 0.008403 | 4.18572 | Down regulated | |
| 202437_s_at | 3.140615 | 8.48289 | 25.87402 | 1.89E−35 | 5.18E−31 | 67.29499 | Up regulated | |
| 213217_at | − 2.07208 | 5.693218 | − 7.54109 | 3.51E−07 | 0.019192 | 4.699974 | Down regulated | |
| 204855_at | − 6.46131 | 9.39441 | − 41.2474 | 2.06E−26 | 9.05E−23 | 49.19926 | Down regulated | |
| 209001_s_at | ANAPC13 | − 2.49883 | 10.52767 | − 31.6716 | 1.32E−10 | 7.23E−06 | 13.02867 | Down regulated |
Figure 6Analysis and exploration of mutations affecting post-translational modification (PTM) sites in human genes/proteins using online ActiveDriverDB database. Needle plots indicate the PTM site mutations in our genes/proteins. The graph shows the outcomes based on the specific type of PTM, cancer type, and mutation subset (presented in legend color codes). Height (y-axis) represents the number of occurrences of the mutation while Horizontal position (x-axis) indicates the position of protein’s amino acid sequence. Pinhead color signifies the mutation impact and X-axis coloring shows the type of PTM associated with the mutation location. Mutational Impacts: Rewiring: mutation-induced gains and losses of kinase-bound sequence motifs (predicted by MIMP software); Distal: mutation affects an amino acid located 4–7 amino acids away from the PTM site; Direct: mutation affects the post transcriptionally modified amino acid; Proximal: mutation affects an amino acid located 1–3 amino acids away from PTM site; Sites: Amino acid sites/ regions enriched for mutations affecting post-translational modifications (PTMs).
Figure 7Protein–protein interaction of OC genes. Red nodes represent DEGs interacting with Pink nodes (target genes/gene signatures). High-confidence interactions of HAPPI database were selected in this network (the five stars are equivalent to high score (0.90–1).
Figure 8Pathway modeling for genome signaling and metabolic reconstruction revealed the pathological mechanism of oral cancer using KEGG and Wiki pathway databases.
Figure 9Toxicogenomic analysis of differentially expressed genes carried out by a comparative toxicogenomic database (CTD) helps to study the chemical-genome to phenome relationships.
Over-representation of oral cancer-related DEGs using oPOSSUM with 80% matrix score.
| TF | Class | Family | Tax group | IC | Target gene hits | Target gene non-hits | Back-ground gene hits | Back-ground gene non-hits | Target TFBS hits | Target TFBS nucleotide rate | Back-ground TFBS hits | Back-ground TFBS nucleotide rate | Z-score | Fisher score |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ESR1 | Zinc-coordinating | Hormone-nuclear Receptor | Vertebrates | 13.563 | 1 | 5 | 345 | 24,407 | 1 | 0.00319 | 355 | 0.000197 | 16.419 | 2.514 |
| Arnt::Ahr | Zipper-Type | Helix-Loop-Helix | Vertebrates | 9.532 | 5 | 1 | 13,597 | 11,155 | 23 | 0.022 | 66,086 | 0.011 | 8.273 | 1.815 |
| CEBPA | Zipper-Type | Leucine Zipper | Vertebrates | 8.712 | 5 | 1 | 12,828 | 11,924 | 18 | 0.0258 | 55,606 | 0.0139 | 8.01 | 2.06 |
| CTCF | Zinc-coordinating | Beta Alpha-zinc finger | Vertebrates | 17.205 | 2 | 4 | 3,236 | 21,516 | 2 | 0.00606 | 3,982 | 0.0021 | 6.703 | 1.718 |
| HNF1B | Helix-Turn-Helix | Homeo | Vertebrates | 16.821 | 3 | 3 | 4,107 | 20,645 | 3 | 0.00574 | 6,910 | 0.0023 | 5.543 | 2.787 |
| ELK4 | Winged Helix-Turn-Helix | Ets | Vertebrates | 14.123 | 2 | 4 | 6,034 | 18,718 | 4 | 0.00574 | 9,342 | 0.00234 | 5.452 | 0.796 |
| TAL1::TCF3 | Zipper-Type | Helix-Loop-Helix | Vertebrates | 14.07 | 2 | 4 | 7,491 | 17,261 | 5 | 0.00957 | 14,681 | 0.0049 | 5.21 | 0.535 |
| Nr2e3 | Zinc-coordinating | Hormone-nuclear Receptor | Vertebrates | 12.028 | 4 | 2 | 6,354 | 18,398 | 5 | 0.00558 | 12,287 | 0.00239 | 5.045 | 3.187 |
| Lhx3 | Helix-Turn-Helix | Homeo | Vertebrates | 16.354 | 2 | 4 | 5,806 | 18,946 | 4 | 0.00829 | 12,758 | 0.00461 | 4.212 | 0.846 |
| Nkx2-5 | Helix-Turn-Helix | Homeo | Vertebrates | 8.27 | 6 | 0 | 16,973 | 7,779 | 43 | 0.048 | 197,210 | 0.0384 | 3.939 | 2.263 |
| Arnt | Zipper-Type | Helix-Loop-Helix | Vertebrates | 10.992 | 4 | 2 | 7,052 | 17,700 | 5 | 0.00478 | 13,904 | 0.00232 | 3.928 | 2.827 |
| NFE2L2 | Zipper-Type | Leucine Zipper | Vertebrates | 14.394 | 2 | 4 | 5,635 | 19,117 | 3 | 0.00526 | 8,970 | 0.00274 | 3.696 | 0.886 |
| ESR2 | Zinc-coordinating | Hormone-nuclear Receptor | Vertebrates | 13.618 | 1 | 5 | 2,219 | 22,533 | 1 | 0.00287 | 2,670 | 0.00134 | 3.155 | 0.842 |
| Tal1::Gata1 | Zipper-Type | Helix-Loop-Helix | Vertebrates | 11.297 | 2 | 4 | 6,212 | 18,540 | 3 | 0.00861 | 11,200 | 0.0056 | 3.107 | 0.758 |
| Evi1 | Zinc-coordinating | BetaBetaAlpha-zinc finger | Vertebrates | 17.909 | 1 | 5 | 1931 | 22,821 | 1 | 0.00223 | 2,457 | 0.000956 | 3.067 | 0.952 |
| PLAG1 | Zinc-coordinating | BetaBetaAlpha-zinc finger | Vertebrates | 19.352 | 1 | 5 | 1971 | 22,781 | 1 | 0.00223 | 2,462 | 0.000958 | 3.059 | 0.936 |
| MZF1_5-13 | Zinc-coordinating | BetaBetaAlpha-zinc finger | Vertebrates | 9.4 | 5 | 1 | 13,425 | 11,327 | 17 | 0.0271 | 77,649 | 0.0216 | 2.97 | 1.868 |
| STAT1 | Ig-fold | Stat | Vertebrates | 13.119 | 2 | 4 | 6,394 | 18,358 | 3 | 0.00717 | 10,990 | 0.00458 | 2.949 | 0.722 |
| Mycn | Zipper-Type | Helix-Loop-Helix | Vertebrates | 11.104 | 3 | 3 | 8,332 | 16,420 | 5 | 0.00797 | 18,928 | 0.00526 | 2.883 | 1.12 |
| Prrx2 | Helix-Turn-Helix | Homeo | Vertebrates | 9.063 | 4 | 2 | 15,063 | 9,689 | 27 | 0.0215 | 123,270 | 0.0171 | 2.636 | 0.576 |
| FOXD1 | Winged Helix-Turn-Helix | Forkhead | Vertebrates | 11.926 | 6 | 0 | 13,087 | 11,665 | 14 | 0.0179 | 62,921 | 0.014 | 2.556 | 3.823 |
| MEF2A | Other Alpha-Helix | MADS | Vertebrates | 15.709 | 2 | 4 | 7,006 | 17,746 | 4 | 0.00638 | 15,631 | 0.00434 | 2.354 | 0.612 |
| Gfi | Zinc-coordinating | BetaBetaAlpha-zinc finger | Vertebrates | 9.47 | 4 | 2 | 13,731 | 11,021 | 14 | 0.0223 | 65,844 | 0.0183 | 2.332 | 0.796 |
| FOXO3 | Winged Helix-Turn-Helix | Forkhead | Vertebrates | 11.734 | 5 | 1 | 13,611 | 11,141 | 15 | 0.0191 | 69,410 | 0.0154 | 2.328 | 1.811 |
Prediction of gene specific MiRNA-targets associated with oral cancer.
| Serial no | Gene symbol | Gene description | *Target score | microRNA name | Total hits | miRNA sequence | Seed location | 3′-UTR length |
|---|---|---|---|---|---|---|---|---|
| 1 | CYP1A1 | cytochrome P450 family 1 subfamily A member 1 | 90 | hsa-miR-4786-5p | 35 | UGAGACCAGGACUGGAUGCACC | 735 | 966 |
| 2 | CYP1B1 | cytochrome P450 family 1 subfamily B member 1 | 98 | hsa-miR-4282 | 141 | UAAAAUUUGCAUCCAGGA | 19, 704, 1,092, 1,235, 1,246, 1753, 2028, 2,137 | 3,125 |
| 3 | C7 | complement C7 | 97 | hsa-miR-2052 | 84 | UGUUUUGAUAACAGUAAUGU | 1,197, 1,203, 1,311 | 3,073 |
| 4 | ADCY2 | adenylate cyclase 2 | 96 | hsa-miR-216a-3p | 142 | UCACAGUGGUCUCUGGGAUUAU | 190, 1575, 1641, 2,853 | 3,210 |
| 5 | SERPINB5 | serpin family B member 5 | 98 | hsa-miR-3148 | 64 | UGGAAAAAACUGGUGUGUGCUU | 169, 196, 1,323 | 1,363 |
| 6 | ANAPC13 | anaphase promoting complex subunit 13 | 89 | hsa-let-7f-1-3p | 61 | CUAUACAAUCUAUUGCCUUCCC | 699 | 903 |
*Highly reliable score—> 80, least reliable score—< 50.
Figure 10The drug–gene network was constructed between the FDA approved drugs and target genes. A broken line indicates the interaction between known drugs while solid line represents the novel drug targets. Anticancer drugs were retrieved from drug B+ ank.
Figure 11The steps have been integrated in basic framework of our study.
Databases and tools used during this meta-analysis.
| Databases/oftware/tools | Accessibility | Utility | References |
|---|---|---|---|
| STRING database version 11 | For known and predicted protein/COGs interaction | 74 | |
| National Center for Biotechnology Information (NCBI) | Biomedical and genomic information source | – | |
| Cytoscape version 3.6.0 | For network analysis and visualization | 31 | |
| DAVID Bioinformatics tool 6.8 | Gene ontology/ Functional Annotation tool | 75 | |
| Uniprot | Resource of protein functional information | 76 | |
| Kyoto Encyclopedia of Genes and Genomes (KEGG) | Pathways analysis and comparison | 77 | |
| FunRich version 3 | Enrichment analysis | 24 | |
| R version 3.3.3 | Statistical computing/data mining | 14–16 | |
| Opossum Version 3.0 | Single site analysis | 78 | |
| Wiki-Pathways | Pathways analysis | 79 | |
| Path-Visio 3.3.0 | pathway analysis and drawing software | 34 | |
| Comparative Toxicogenomics Database CTD | gene–disease relationships | 80 | |
| CIMminer | Cluster analysis | 25–26 | |
| HAPPI version 2,0 | Protein–protein interaction | 30 |