| Literature DB >> 27149165 |
Zhandong Li1, Lifeng An2, Hao Li1, ShaoPeng Wang3, You Zhou4, Fei Yuan4, Lin Li2.
Abstract
Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27149165 PMCID: PMC4857740 DOI: 10.1038/srep25515
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Eight NPC-related chemicals retrieved from CTD.
| PubChem ID | Chemical name |
|---|---|
| CID000000264 | Butyric acid |
| CID000000712 | Formaldehyde |
| CID000003385 | 5-Fluorouracil |
| CID000006037 | Folic acid |
| CID000006468 | PHENCYCLIDINE |
| CID000148123 | Docetaxel trihydrate |
| CID000445154 | Resveratrol |
| CID009562060 | 1-Methyl-3-nitro-1-nitrosoguanidine |
Figure 1An example showing the structure of the heterogeneous network.
The triangles represent proteins, the dots represent chemicals. The sold lines represent protein-chemical interactions, the dashed lines represent protein-protein interactions, and the dashed dotted lines represent chemical-chemical interactions.
The pseudocode of the algorithm for NPC-RGCP.
| 1. Searching stage |
| 1.1 Search all shortest paths connecting any two NPC-related genes or chemicals |
| 1.2 Extract inner nodes in these paths; their corresponding genes and chemicals that are not in |
| 2. Screening stage |
| 2.1 For each shortest path gene or chemical, calculate its FDR using |
| 2.2 For each shortest path gene, calculate its MIS and MAS using |
| 2.3 For each shortest path chemical, calculate its MIS using |
| 2.4 Select shortest path genes with FDR smaller than 0.05, MIS no less than 900 and MAS no less than 90 |
| 2.5 Select shortest path chemicals with FDR smaller than 0.05 and MIS no less than 900 |
| 3. Output the remaining shortest path genes and chemicals as the putative NPC-related genes and chemicals |
Figure 2A figure illustrating the procedures of the NPC-RGCP method and its results.
The detailed information of five putative genes.
| Ensembl ID | Gene symbol | Betweenness | Permutation FDR | MIS | MAS |
|---|---|---|---|---|---|
| ENSP00000362795 | CXCR3 | 136 | 0.015 | 999 | 139 |
| ENSP00000245414 | IRF1 | 151 | 0.013 | 999 | 114 |
| ENSP00000306043 | CDK1 | 667 | 0.021 | 994 | 233 |
| ENSP00000381607 | GSTP1 | 131 | 0.019 | 985 | 100 |
| ENSP00000269141 | CDH2 | 421 | 0.03 | 965 | 1091 |
aMaximum interaction score.
bMaximum alignment score.
The detailed information of seven putative chemicals.
| PubChem ID | Chemical name | Betweenness | Permutation FDR | MIS |
|---|---|---|---|---|
| CID000023925 | Iron | 658 | 0.002 | 994 |
| CID000001032 | Propionic acid | 139 | <0.001 | 987 |
| CID000000679 | Dimethyl sulfoxide | 14 | 0.048 | 970 |
| CID000003776 | Isopropanol | 108 | 0.013 | 965 |
| CID000122357 | Erythrose 4-phosphate | 2 | 0.024 | 906 |
| CID000440641 | β-D-Fructose 6-phosphate | 139 | 0.001 | 905 |
| CID000643975 | Flavin adenine dinucleotide | 158 | 0.022 | 900 |
aMaximum interaction score.
Figure 3A bar chart illustrating the number of genes among the shortest path genes with FDRs less than 0.05 enriching each KEGG pathway and the FDR obtained by DAVID.
Three enriched GO terms of the shortest path genes.
| GO term | Description | Number of sharing genes | FDR |
|---|---|---|---|
| GO:0002474 | Antigen processing and presentation of peptide antigen via MHC class I | 5 | 0.014 |
| GO:0032269 | Negative regulation of cellular protein metabolic process | 10 | 0.022 |
| GO:0051248 | Negative regulation of protein metabolic process | 10 | 0.030 |