| Literature DB >> 19775444 |
Oscar M Rueda1, Ramon Diaz-Uriarte.
Abstract
BACKGROUND: Alterations in the number of copies of genomic DNA that are common or recurrent among diseased individuals are likely to contain disease-critical genes. Unfortunately, defining common or recurrent copy number alteration (CNA) regions remains a challenge. Moreover, the heterogeneous nature of many diseases requires that we search for common or recurrent CNA regions that affect only some subsets of the samples (without knowledge of the regions and subsets affected), but this is neglected by most methods.Entities:
Mesh:
Year: 2009 PMID: 19775444 PMCID: PMC2760535 DOI: 10.1186/1471-2105-10-308
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
pREC-A algorithm
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
pREC-S algorithm
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
Simulated data example.
| A1 | 0.17 | 0.17 | 0.97 | 0.97 | 0.97 | 0.17 |
| A2 | 0.16 | 1.00 | 1.00 | 1.00 | 0.15 | 1.00 |
| A3 | 0.08 | 0.07 | 0.93 | 0.07 | 0.06 | 0.92 |
| A4 | 0.16 | 0.16 | 0.99 | 1.00 | 1.00 | 1.00 |
Marginal probabilities of being gained.
Figure 1pREC-S, simple numerical example. Subsets of at least 2 arrays that share common regions of gain of at least 0.90 probability: freq.arrays = 2, p= 0.90. Boxes of the same color represent the same region. In circles, the marginal probabilities of gain. In boxes, the joint probabilities.
Figure 2Clustering based upon pREC-S. Number of common regions shared by pairs of arrays. In parenthesis, the average length in probes of the regions. On the left, a dendrogram using hierarchical clustering (complete linkage) with number of common regions shared by pairs of arrays as similarity measure.
Figure 3Chromosome 8 from the Pollack et al. example. Number of regions of gain with at least 0.50 probability shared by at least two arrays (i.e., pREC-S, freq.arrays = 2, p= 0.50). The arrays are ordered according to tumor grade. Arrays with grade III share many more alterations between them than the other arrays. Four arrays with grade II share the same gains in copy number with tumors of higher grade, so they are probably related. There is one array unidentified.
Alterations in Pollack et al. [45], genomewide.
| ER | Positive | 0.75 |
| Negative | 1.12 | |
| p53 | Wild Type | 0.67 |
| Mutant | 1.23 | |
| Grade | I | 1.21 |
| II | 0.56 | |
| III | 1.40 | |
The homogeneity index, , is computed over the whole genome, not chromosome by chromosome, as done in previous tables.