| Literature DB >> 21575214 |
Henning Stehr1, Seon-Hi J Jang, José M Duarte, Christoph Wierling, Hans Lehrach, Michael Lappe, Bodo M H Lange.
Abstract
BACKGROUND: Current large-scale cancer sequencing projects have identified large numbers of somatic mutations covering an increasing number of different cancer tissues and patients. However, the characterization of these mutations at the structural and functional level remains a challenge.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21575214 PMCID: PMC3123651 DOI: 10.1186/1476-4598-10-54
Source DB: PubMed Journal: Mol Cancer ISSN: 1476-4598 Impact factor: 27.401
Overview of genes
| Gene name | Length (AA) | Mut | SNP | PDB codes (sequence range) |
|---|---|---|---|---|
| 478 | 6 | 5 | ||
| 766 | 46 | 3 | ||
| 1210 | 224 | 9 | ||
| 394 | 12 | 9 | ||
| 189 | 19 | 0 | ||
| 976 | 9 | 9 | ||
| 188 | 85 | 1 | ||
| 1408 | 24 | 30 | ||
| 189 | 9 | 1 | ||
| 1068 | 148 | 17 | ||
| 593 | 7 | 6 | ||
| 1114 | 24 | 3 | ||
| 882 | 17 | 3 | ||
| 156 | 76 | 10 | ||
| 707 | 34 | 4 | ||
| 756 | 8 | 3 | ||
| 934 | 12 | 17 | ||
| 403 | 93 | 2 | ||
| 928 | 7 | 9 | ||
| 552 | 51 | 3 | ||
| 433 | 30 | 1 | ||
| 393 | 826 | 17 | ||
| 213 | 216 | 16 | ||
| 449 | 9 | 3 | ||
Abbreviations: AA, amino acid, Mut, number of mutations, SNP, number of SNPs, PDB, Protein Data Bank
Figure 1Structural impact of mutations. The columns show the structural properties of random mutations (Rnd), natural variations (Snp) and cancer mutations (Mut). Cancer mutations are further analyzed separately as mutations of oncogenes (Onc) and mutations of tumor suppressor genes (Sup). The error bars indicate the variability of the data under the jackknife test. The reported values are the odds ratios averaged over the genes in the dataset. The p-values are calculated over all mutations within a dataset. A, observed over expected fraction of mutations occurring at the protein surface. Onc show significantly more and Sup significantly less solvent accessible mutations. B, observed over expected fraction of destabilizing mutations. Onc mutations are less often destabilizing, while Sup mutations disrupt stability far more often than the controls. C, observed over expected functional site mutations. Functional sites are more frequently mutated in Onc than in Sup. D, observed over expected spatial clustering of mutations. Mutations particularly in Onc are significantly more clustered than expected by chance.
Figure 2Distribution of functional site mutations. Distribution of mutations affecting functional sites in oncogenes (Onc) and tumor suppressors (Sup) compared to distribution of random mutations. A and B, distribution obtained by random sampling of positions in Onc and Sup, respectively. C, distribution of functional site mutations in Onc. ATP and GTP binding sites in Onc are significantly more often mutated than expected by chance. D, distribution of functional site mutations in Sup. Observed distribution does not differ significantly from expected random distribution.
Figure 3Linear classification of cancer genes. The different pairs of structural features are shown as scatter plots in A-F. Oncogenes are depicted as blue dots, tumor suppressors as red diamonds. The separating linear functions have been calculated using Fisher's linear discriminant method. The classifiers in A, D and E show the best training performance.
Performance of linear classifiers
| Training Performance | Cross validation | ||||||
|---|---|---|---|---|---|---|---|
| Feature combination | true | false | ratio | true | false | ratio | |
| Solv | Stab | 20 | 4 | 83.33% | 20 | 4 | 83.33% |
| Solv | Func | 16 | 8 | 66.67% | 16 | 8 | 66.67% |
| Solv | Clust | 16 | 8 | 66.67% | 16 | 8 | 66.67% |
| Stab | Func | 23 | 1 | 95.83% | 23 | 1 | 95.83% |
| Stab | Clust | 20 | 4 | 83.33% | 19 | 5 | 79.17% |
| Func | Clust | 18 | 6 | 75.00% | 17 | 7 | 70.83% |
Abbreviations: Solv, Solvent Accessibility, Stab, Stability, Func, Functional Sites, Clust, Clustering
Prediction of cancer gene classes
| Prediction | ||||||
|---|---|---|---|---|---|---|
| Feature combination | DCLK3 | MMP2 | PIK3C3 | TGM3 | EPHA3 | |
| Solv | Stab | O | S | S | O | S |
| Solv | Func | S | O | S | O | S |
| Solv | Clust | O | O | S | O | S |
| Stab | Func | O | O | S | O | S |
| Stab | Clust | O | S | S | O | S |
| Func | Clust | S | O | S | S | S |
Abbreviations: Solv, Solvent Accessibility, Stab, Stability, Func, Functional Sites, Clust, Clustering, O, Oncogene, S, Tumor Suppressor