| Literature DB >> 24646301 |
Shaylan K Govind, Amin Zia, Pablo H Hennings-Yeomans, John D Watson, Michael Fraser, Catalina Anghel, Alexander W Wyatt, Theodorus van der Kwast, Colin C Collins, John D McPherson, Robert G Bristow, Paul C Boutros1.
Abstract
BACKGROUND: Chromothripsis, a newly discovered type of complex genomic rearrangement, has been implicated in the evolution of several types of cancers. To date, it has been described in bone cancer, SHH-medulloblastoma and acute myeloid leukemia, amongst others, however there are still no formal or automated methods for detecting or annotating it in high throughput sequencing data. As such, findings of chromothripsis are difficult to compare and many cases likely escape detection altogether.Entities:
Mesh:
Year: 2014 PMID: 24646301 PMCID: PMC3999944 DOI: 10.1186/1471-2105-15-78
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1ShatterProof input parsing workflow. Original output from existing tools is converted to the ShatterProof input file formats using scripts. ShatterProof then reads and parses the data in these input files into efficient data structures.
MCA calculated hallmark weights
| Genome localization | 0.1145 |
| Chromosome localization | 0.1697 |
| Copy-number aberrations | 0.2724 |
| Translocation localization | 0.2724 |
| Retention of heterozgosity | 0.0648 |
| Presences of insertions at translocation breakpoints | 0.0657 |
| Presences of TP53 mutations | 0.0406 |
Figure 2Suspect region identification and score calculation. Once ShatterProof has parsed all input data, a sliding window analysis identifies genomic regions which are heavily mutated. Note that the window size is user definable. ShatterProof then produces a chromothripsis score for each of these regions, by analyzing all SV data corresponding to it. The analysis produces hallmark scores which represent how significantly the region exhibits each hallmark,which are then scaled by their respective MCA weighting and summed to produce a chromothripsis score.
Figure 3Sliding window analysis. Graphing the data from the tab delimited intermediate output files produces plots that visualize the clustering of SVs along a chromosome. The region highlighted in red was identified by ShatterProof as highly chromothriptic. Note that chromothriptic regions are not always those with the highest SV density. ShatterProof identifies regions as chromothriptic primarily based on the organization and clustering of SVs.
Summary of clinical data
| Normal blood | CPCG network | 10 | 30 × | hg19 |
| Prostate cancer | CPCG network | 7 | 50 × | hg19 |
| Prostate cancer | Living tumor lab | 2 | 3 × | hg18 |
| Colorectal cancer | [ | 1 | 50 × | hg19 |
| SCLC | [ | 1 | 50 × | hg19 |
Figure 4CNV and translocation count vs average chromothripsis score. These plots illustrates how a high total SV count does not necessarily produce high chromothripsis scores. Samples producing the highest chromothripsis scores had some of the lowest translocation and CNV counts, showing that ShatterProof scores are dominated by the clustering of SVs, not the absolute counts.
Figure 5Hallmark scores of highest scoring region per sample. The bottom row indicates the relative weightings assigned to each hallmark via MCA and thus the maximum possible score that can be achieved. All rows above the bottom one depict the contribution of each hallmark score to the highest final score for a sample. Dark purple indicates high scores; white indicates low scores. The two samples expressing the TP53 hallmark correspond to the highest scores from the 2 LTL samples.
Figure 6Final chromothripsis scores. All scores produced by ShatterProof for each sample. The gold symbols represent calls made by ShatterProof that correspond to regions of the genomes that were previously identified in other studies as being chromothriptic. Samples highlighted in red produced numerous calls with scores over 0.37 and as such are believed to have experienced a chromothriptic event.