| Literature DB >> 25849937 |
Biao Liu1, Jeffrey M Conroy1, Carl D Morrison1, Adekunle O Odunsi2, Maochun Qin3, Lei Wei3, Donald L Trump4, Candace S Johnson5, Song Liu3, Jianmin Wang3.
Abstract
Somatic Structural Variations (SVs) are a complex collection of chromosomal mutations that could directly contribute to carcinogenesis. Next Generation Sequencing (NGS) technology has emerged as the primary means of interrogating the SVs of the cancer genome in recent investigations. Sophisticated computational methods are required to accurately identify the SV events and delineate their breakpoints from the massive amounts of reads generated by a NGS experiment. In this review, we provide an overview of current analytic tools used for SV detection in NGS-based cancer studies. We summarize the features of common SV groups and the primary types of NGS signatures that can be used in SV detection methods. We discuss the principles and key similarities and differences of existing computational programs and comment on unresolved issues related to this research field. The aim of this article is to provide a practical guide of relevant concepts, computational methods, software tools and important factors for analyzing and interpreting NGS data for the detection of SVs in the cancer genome.Entities:
Keywords: cancer genome analysis; next generation sequencing; somatic mutation; structural variation
Mesh:
Year: 2015 PMID: 25849937 PMCID: PMC4467381 DOI: 10.18632/oncotarget.3491
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Breakpoint signatures of SVs
(a) In each diagram, the up strands are from sample genome, and the lower strand are from reference genome. (b) Depending on the mapping of the inserted strand B, other relationships of coordinates in reference genome can be determined (details not shown). (c) Tandem duplication creates one or multiple breakpoints. NGS is able to detect either 1 (novel tandem duplication) or 0 (non-novel tandem duplication) breakpoint.
Figure 2Diagram of SV types and NGS signatures, before and after mapping
A) Deletion; B) Insertion; C) Inversion; D) Tandem duplication; E Intra-chromosomal translocation (ITX); F) Inter-chromosomal translocation (CTX).
A list of selected programs for SV detection using NGS data
| Signature reads/read pairs | Method | Publishing Month-Year | Detectable SV type | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| small deletion | large deletion | small insertion | large insertion | small inversion | large inversion | tandem duplication | ITX | CTX | |||
| Discordant read pairs | PEMer[ | Feb-2009 | √ | √ | √ | ||||||
| GASV[ | Jun-2009 | √ | √ | √ | √ | √ | |||||
| BreakDancer[ | Sep-2009 | √ | √ | √ | √ | √ | √ | √ | |||
| HYDRA[ | Mar-2010 | √ | √ | √ | √ | √ | √ | √ | √ | √ | |
| SVDetect[ | Jun-2010 | √ | √ | √ | √ | √ | |||||
| Splitting reads | CREST[ | Aug-2011 | √ | √ | √ | √ | √ | √ | √ | √ | |
| Discordant read pairs and splitting reads | DELLY[ | Sep-2012 | √ | √ | √ | √ | √ | √ | √ | ||
| PRISM[ | Oct-2012 | √ | √ | √ | √ | √ | √ | √ | |||
| LUMPY[ | Jun-2014 | √ | √ | √ | √ | √ | √ | √ | |||
Figure 3An exemplary illustration of the impact of SV event sizes and library insert sizes on the NGS signatures
I: length of insertion event (purple strand); r: read length; s: length of un-sequenced part in a read-pair; insert size equals 2r+s, assuming reads are in same length.