| Literature DB >> 29072143 |
Matthew Hayes1, Jeremy S Pearson2.
Abstract
BACKGROUND: Genomic structural variants (SV) play a significant role in the onset and progression of cancer. Genomic deletions can create oncogenic fusion genes or cause the loss of tumor suppressing gene function which can lead to tumorigenesis by downregulating these genes. Detecting these variants has clinical importance in the treatment of diseases. Furthermore, it is also clinically important to detect their breakpoint boundaries at high resolution. We have generalized the framework of a previously-published algorithm that located translocations, and we have applied that framework to develop a method to locate deletions at base pair level using next-generation sequencing data. Our method uses abnormally mapped read pairs, and then subsequently maps split reads to identify precise breakpoints.Entities:
Keywords: Deletions; Sequencing; Structural variant
Mesh:
Year: 2017 PMID: 29072143 PMCID: PMC5657039 DOI: 10.1186/s12859-017-1829-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Visual depiction of Pegasus algorithm. The images show read pairs that are mapped to a reference genome, where the blue read pairs span a deletion event. (Top panel) Soft-clipped reads (pink and green subreads) are initially unaligned since they fall within the deleted region. Pegasus first clusters discordant read pairs that may indicate a deletion. It then maps the clipped subreads back to the reference using BLAT (middle panel). The subreads align to each side of the deletion event. The coordinate locations with the most aligned subreads are the predicted deletion breakpoints (bottom panel)
Fig. 2Sensitivity of predictions on 2500 simulated deletions: 20X coverage dataset. The x-axis gives the size of the deletions in base pairs (bp). There were 500 deletions per size category
Fig. 3Sensitivity of predictions on 2500 simulated deletions: 5X coverage dataset
Fig. 4Average breakpoint error on 2500 simulated deletions: 20X coverage dataset
Fig. 5Average breakpoint error on 2500 simulated deletions: 5X coverage dataset