| Literature DB >> 19115005 |
Pablo Freire1, Marco Vilela, Helena Deus, Yong-Wan Kim, Dimpy Koul, Howard Colman, Kenneth D Aldape, Oliver Bogler, W K Alfred Yung, Kevin Coombes, Gordon B Mills, Ana T Vasconcelos, Jonas S Almeida.
Abstract
BACKGROUND: The Cancer Genome Atlas project (TCGA) has initiated the analysis of multiple samples of a variety of tumor types, starting with glioblastoma multiforme. The analytical methods encompass genomic and transcriptomic information, as well as demographic and clinical data about the sample donors. The data create the opportunity for a systematic screening of the components of the molecular machinery for features that may be associated with tumor formation. The wealth of existing mechanistic information about cancer cell biology provides a natural reference for the exploratory exercise. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2008 PMID: 19115005 PMCID: PMC2605252 DOI: 10.1371/journal.pone.0004076
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Evaluation of the entropy method performance by the area under the ROC curve from simulated datasets, accessing different amplitudes and prevalences of copy number aberrations.
An area under the ROC curve of 1 means a perfect separation between mutated and normal regions, while a value of 0.5 means a random classification.
Amplifications and deletions in the GBM data.
| Type | Region # | Known genes in GBM | Candidates | Ch | Start | End | Entropy | # of genes | Amplitude | Prevalence |
|
| 82 | EGFR | 7 | 54027966 | 55983910 | −2.5865 | 7 | 0.000275 | 0.234739 | |
| 55 | PDFGRA | 4 | 53634367 | 57090231 | −0.94003 | 22 | 0.000708 | 0.139728 | ||
| 117 | CDK4 | 12 | 56097914 | 56851736 | −0.69274 | 26 | 0.001321 | 0.122698 | ||
| 118 | MDM2 | 12 | 67315903 | 67981156 | −0.49955 | 7 | 0.000446 | 0.103178 | ||
| 79 | 7 | 32978614 | 32991778 | −0.48806 | 1 | 0.002777 | 0.101796 | |||
| 22 | MDM4 | 1 | 201942811 | 203355851 | −0.48298 | 19 | 0.000311 | 0.084066 | ||
| 34 | 2 | 113106316 | 113112980 | −0.4263 | 0 | 0.000597 | 0.083832 | |||
| 38 | 2 | 202864197 | 203906650 | −0.26864 | 9 | 0.014994 | 0.038269 | |||
| 51 | 3 | 181382003 | 181447344 | −0.22919 | 0 | 0.000371 | 0.071856 | |||
| 44 | 3 | 106776838 | 106776898 | −0.21799 | 1 | 0.003143 | 0.047904 | |||
| 91 | 7 | 152135252 | 152147233 | −0.20795 | 1 | 0.005562 | 0.05988 | |||
| 96 | SGK3 | 8 | 67783070 | 68102356 | −0.18707 | 3 | 0.017116 | 0.041916 | ||
| 27 | NCOA1 | 2 | 24616100 | 24712348 | −0.18293 | 1 | 0.008564 | 0.053892 | ||
| 26 | 2 | 24426149 | 24551934 | −0.16724 | 1 | 0.008564 | 0.05988 | |||
| 31 | ATOH8 | 2 | 85806436 | 85899917 | −0.15732 | 1 | 0.012083 | 0.047904 | ||
| 28 | 2 | 24819254 | 24866786 | −0.15088 | 2 | 0.008564 | 0.047904 | |||
|
| 100 | CDKN2A | 9 | 20336123 | 24769734 | −1.89499 | 25 | 0.001192 | 0.258973 | |
| 112 | 11 | 70513145 | 70559392 | −0.67419 | 0 | 0.018841 | 0.052181 | |||
| 3 | CHD5 | AJAP1 | 1 | 4028404 | 6333694 | −0.36827 | 10 | 0.004095 | 0.137835 | |
| 9 | CDKN2C | 1 | 50961735 | 51283220 | −0.33261 | 2 | 0.000366 | 0.066068 | ||
| 99 | 9 | 20240063 | 20280240 | −0.31992 | 0 | 0.002421 | 0.147705 | |||
| 83 | 7 | 86779323 | 86785351 | −0.26366 | 0 | 0.000305 | 0.065868 | |||
| 106 | PTEN | 10 | 89545841 | 89680005 | −0.24913 | 3 | 0.000401 | 0.06373 | ||
| 5 | TNFRSF9 | 1 | 7889543 | 7926197 | −0.23605 | 1 | 0.000965 | 0.155689 | ||
| 2 | 1 | 3928417 | 3978407 | −0.21835 | 0 | 0.006145 | 0.113772 | |||
| 4 | CAMTA1 | 1 | 7728761 | 7742417 | −0.21773 | 1 | 0.005078 | 0.149701 | ||
| 146 | 17 | 28797037 | 29337463 | −0.2158 | 1 | 0.01771 | 0.054943 | |||
| 45 | LSAMP | 3 | 117520637 | 117525922 | −0.2012 | 1 | 0.015102 | 0.05988 | ||
| 145 | NF1 | 17 | 26438606 | 26453055 | −0.19121 | 1 | 0.004396 | 0.053892 | ||
| 67 | 5 | 164370828 | 164522125 | −0.19096 | 0 | 0.013057 | 0.05988 | |||
| 147 | ACCN1 | 17 | 29404279 | 29546036 | −0.17745 | 1 | 0.021171 | 0.053892 |
Aberrant regions determined by the entropy method. These tables do not include regions for which more then half of the DNA probes were annotated as normal CNV. Candidate oncogenes / suppressors were selected by their potential association with cancer based on their molecular function or for being the only gene in the region. The genomic mapping was based on Build 18. The calling for amplifications and deletions was made by observing which amplitude measure, QQ0.025 (deletion) or 1-QQ0.975 (amplification), was closer to 0.
Not described or validated as oncogene or tumor suppressor in previous studies.
Figure 2Entropy analysis for the GBM data entire genome (A), chromosome 1(B) and chromosome 7 (C).
The segmented copy number values per sample are displayed as a heatmap on the left, with the tumor samples as columns; the entropy values are shown on the right. The gaps on the heatmap indicate genomic regions that lack coverage, such as the p arms of acrocentric chromosomes. The number in A corresponds to the region number in Table 1. Beside each region label, a plus sign indicates an amplification and a minus indicates a deletion. Note that a low-entropy region can be either a amplification or a deletion. The red line in the entropy plot shows the threshold for defining an aberrant region, which is the 0.05 quantile of the bootstrap distribution of the entropy. Peaks that are below the threshold but have no region assigned are normal CNV. The cytoband annotation was retrieved using the UCSC Table Browser.
Figure 3Interplay between the amplitude, prevalence and entropy in deletions (A) and amplifications (B).
The prevalence measure adopted is the proportion of DNA probes on determined regions that have a log2 ratio above or below the 0.025 and 0.975 quantiles.
Figure 4Log2 ratio of copy number of the 58 tumor-normal sample pairs.
The values in the diagonal line correspond to variations similarly observed in tumor and normal tissue. The values in the horizontal line correspond to amplifications (right) and deletions (left) observed only in the tumor samples.