| Literature DB >> 16405732 |
Simon J Furney1, Desmond G Higgins, Christos A Ouzounis, Núria López-Bigas.
Abstract
BACKGROUND: One of the main goals of cancer genetics is to identify the causative elements at the molecular level leading to cancer.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16405732 PMCID: PMC1373651 DOI: 10.1186/1471-2164-7-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Mean values and statistical analysis for degree of conservation and paralogy. Kolmogorov-Smirnov (KS) test of the conservation score between cancer proteins and the rest of human proteins. The KS test analyses show how different two distributions are, and computes a probability (P-value) that the two distributions are equal as well as the maximum distance (D) between them.
| M. musculus | 0.79 | 0.73 | 19.5 | 1.57e-09 |
| 0.77 | 0.71 | 16.7 | 4.69e-07 | |
| 0.62 | 0.56 | 14.1 | 5.36e-05 | |
| 0.52 | 0.48 | 10.6 | 5.4e-03 | |
| paralogues | 0.36 | 0.40 | 9.8 | 9.1e-03 |
Figure 1(a) Distribution of conservation score of proteins involved in cancer (red line) and all human proteins (blue line) against their closest homologue in M. musculus, R. norvegicus, G. gallus and between Paralogues. The conservation score gives an estimation of the mutation rate that the protein has been subjected to during evolution that is independent of the length of the protein. (b) Protein length, calculated as number of amino acids, and gene length distribution of cancer proteins (red) and all human proteins (blue).
Mean values and statistical analysis for gene length, protein length and the gene protein length ratio. The P-value for the KS test of the values distribution between each of the groups and the non-cancer group is shown in parenthesis.
| Cancer genes | 721 (<2.2e-16) | 87426 (5e-14) | 157 (4.1e-08) |
| Cancer genes with point mutations | 817 (1.2e-8) | 82615 (7.4e-07) | 121 (3.9e-03) |
| Translocated cancer genes | 690 (7.7e-08) | 92494 (8.7e-15) | 176 (7.7e-08) |
| Non-cancer genes | 491 | 49437 | 114 |
Figure 2Number of genes involved in cancer with each Molecular function (a) or Biological process (b) GO assignments (red) and number of genes expected in a same size random group of genes from the human genome (blue) (the P-value for the χ2 test is 1.5e-30 for the Molecular function and 3.5e-36 for the Biological process GO assignments). Note that one gene can have multiple GO assignments. χ2 values for each cell are represented with a colour-coded scale. Colours towards red signify over-representation and those towards blue signify under-representation of cancer genes with a particular GO assignment. Green signifies equal representation of both sets in a category.
Selected GO annotations of genes involved in cancer compared to all human genes. The sign in the χ2 value indicates over-representation (positive values) or under-representation (negative values) of the GO term in the group of cancer proteins.
| GO:0045786 | negative regulation of cell cycle | 68 | 22 | 225.91 |
| GO:0003684 | damaged DNA binding | 35 | 10 | 88.55 |
| GO:0030528 | transcription regulator activity | 1034 | 76 | 85.86 |
| GO:0006355 | regulation of transcription, DNA-dependent | 1281 | 84 | 73.49 |
| GO:0003700 | transcription factor activity | 770 | 59 | 72.73 |
| GO:0007049 | cell cycle | 601 | 48 | 64.36 |
| GO:0005634 | nucleus | 2492 | 122 | 47.13 |
| GO:0006366 | transcription from Pol II promoter | 181 | 19 | 41.93 |
| GO:0008151 | cell growth and/or maintenance | 3014 | 137 | 40.58 |
| GO:0003713 | transcription coactivator activity | 109 | 13 | 35.30 |
| GO:0006281 | DNA repair | 94 | 11 | 28.98 |
| GO:0003676 | nucleic acid binding | 2546 | 111 | 27.88 |
| GO:0003824 | catalytic activity | 3768 | 62 | -14.46 |
| GO:0006810 | transport | 1529 | 12 | -20.14 |
| GO:0016021 | integral to membrane | 1986 | 20 | -20.31 |
Figure 3ROC curve for the prediction of cancer genes. The 45° diagonal of the ROC space represents a random guess situation. The performance of the model at 0.5 and 0.7 cut-off probability scores are shown with dashed lines.