| Literature DB >> 20137077 |
Xue Gong1, Ruihong Wu, Yuannv Zhang, Wenyuan Zhao, Lixin Cheng, Yunyan Gu, Lin Zhang, Jing Wang, Jing Zhu, Zheng Guo.
Abstract
BACKGROUND: Hundreds of genes that are causally implicated in oncogenesis have been found and collected in various databases. For efficient application of these abundant but diverse data sources, it is of fundamental importance to evaluate their consistency.Entities:
Mesh:
Year: 2010 PMID: 20137077 PMCID: PMC2832783 DOI: 10.1186/1471-2105-11-76
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The eight cancer gene lists analyzed in this paper
| Short names | Full names and URLs | References | No. of genes |
|---|---|---|---|
| CGC | Cancer Gene Census | [ | 377*(328**) |
| OMIMa | Online Mendelian Inheritance in Man | [ | 244 (217) |
| Reviewsb | CancerGenes database | [ | 289(261) |
| AGCOH | Atlas of Genetics and Cytogenetics in Oncology and Haematology | [ | 727(619) |
| TGDBs | The Tumor Gene Family of Databases | [ | 314(295) |
| TSGDB | Tumor suppressor gene database | [ | 148(109) |
| H-list | Candidate cancer genes provided by genome mutation scans | [ | 491(316) |
| R-list | Candidate cancer genes identified by retroviral insertional mutagenesis screens | [ | 646(496) |
| Total | 2105 (1594) |
Note: *the numbers of genes compiled in the original datasets; **the numbers of genes with PPI data. a cancer genes were extracted from OMIM as did in [53]. b cancer genes from two Reviews which are collected in the CancerGenes database [28,50].
Figure 1The consistency scores between different cancer gene lists. A. The bars above the x-axis depict POG (nPOG) and POGF (nPOGF) scores from other lists to CGC, and the bars below the x-axis depict the POG (nPOG) and POGF (nPOGF) scores from CGC to other lists. The red lines within the bars depict the POGF scores expected by random chance. B. The POG (nPOG) and POGF (nPOGF) scores between the H-list, L-list and R-List. The bars above the x-axis depict POG (nPOG) and POGF (nPOGF) scores from H-list to L-list, from L-list to R-list and from H-list to R-list respectively. The bars below the x-axis depict scores from L-list to H-list, from R-list to L-list and from R-list to H-list respectively. The red lines within the bars depict the POGF scores expected by random chance.
Figure 2Distribution of cancer genes in the Wnt and EGFR1 pathways. The x axis depicts the eight cancer gene lists and the y axis depicts the overlapping genes between the genes in the eight lists and all the genes in the pathways.
The scores between sub-lists of cancer genes for each cancer type
| Tumor | Gene No. | From L to H | From H to L | ||||||
|---|---|---|---|---|---|---|---|---|---|
| POG | nPOG | POGF | nPOGF | POG | nPOG | POGF | nPOGF | ||
| Breast | 67:122 a | 0.11 | 0.10 | 0.62 | 0.60 | 0.04 | 0.04 | 0.58 | 0.56 |
| Colon | 44:118 | 0.24 | 0.23 | 0.82 | 0.81 | 0.06 | 0.06 | 0.64 | 0.63 |
| Pancreatic | 13:44 | 0.36 | 0.35 | 0.62 | 0.60 | 0.10 | 0.10 | 0.34 | 0.32 |
| Glioblastoma | 18:50 | 0.37 | 0.37 | 0.83 | 0.83 | 0.10 | 0.10 | 0.64 | 0.63 |
Note: a the number of cancer genes with PPI data for the low- and high-throughput data. All the scores are statistically significant (p values < 1.0 E-04).
The enrichment of the selected genes in cancer pathways (FDR < 0.01)
| Signal pathway names | Ratios | ||
|---|---|---|---|
| Alpha6Beta4Integrin_pathway(54a) | 4.75E-12 | 0.89 | 24 |
| AndrogenReceptor_pathway(103) | 2.76E-11 | 1 | Inf*** |
| EGFR1_pathway(179) | 0 | 0.64 | 10.7 |
| Hedgehog_pathway(23) | 3.41E-05 | 0.01 | 1.75 |
| ID_pathway(25) | 1.30E-10 | 1 | Inf |
| KitReceptor_pathway(69) | 0 | 0.72 | 16.5 |
| NOTCH_pathway(80) | 9.40E-11 | 0.33 | 5 |
| TGFBR_pathway(159) | 8.29E-13 | 0.69 | 9.8 |
| TNFAlphaNFkb_pathway(189) | 0 | 0.82 | 7.2 |
| Wnt_pathway(105) | 0 | 0.01 | 2.6 |
Note: a the number of genes in the pathway;* the p value of the enrichment of the selected genes; ** the p value of the enrichment of the rest cancer genes; *** Inf means none of the rest genes are annotated in the pathway.