Literature DB >> 25998133

Robust and exact structural variation detection with paired-end and soft-clipped alignments: SoftSV compared with eight algorithms.

Christoph Bartenhagen, Martin Dugas.   

Abstract

Structural variation (SV) plays an important role in genetic diversity among the population in general and specifically in diseases such as cancer. Modern next-generation sequencing (NGS) technologies provide paired-end sequencing data at high depth with increasing read lengths. This development enabled the analysis of split-reads to detect SV breakpoints with single-nucleotide resolution. But ambiguous mappings and breakpoint sequences with further co-occurring mutations hamper split-read alignments against a reference sequence. The trade-off between high sensitivity and low false-positive rate is problematic and often requires a lot of fine-tuning of the analysis method based on knowledge about its algorithm and the characteristics of the data set. We present SoftSV, a method for exact breakpoint detection for small and large deletions, inversions, tandem duplications and inter-chromosomal translocations, which relies solely on the mutual alignment of soft-clipped reads within the neighborhood of discordantly mapped paired-end reads. Unlike other SV detection algorithms, our approach does not require thresholds regarding sequencing coverage or mapping quality. We evaluate SoftSV together with eight approaches (Breakdancer, Clever, CREST, Delly, GASVPro, Pindel, Socrates and SoftSearch) on simulated and real data sets. Our results show that sensitive and reliable SV detection is subject to many different factors like read length, sequence coverage and SV type. While most programs have their individual drawbacks, our greedy approach turns out to be the most robust and sensitive on many experimental setups. Sensitivities above 85% and positive predictive values between 80 and 100% could be achieved consistently for all SV types on simulated data sets starting at relatively short 75 bp reads and low 10-15× sequence coverage.
© The Author 2015. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  structural variation; paired-end sequencing; split-reads; simulation

Mesh:

Year:  2015        PMID: 25998133     DOI: 10.1093/bib/bbv028

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  20 in total

1.  Rare De Novo Missense Variants in RNA Helicase DDX6 Cause Intellectual Disability and Dysmorphic Features and Lead to P-Body Defects and RNA Dysregulation.

Authors:  Chris Balak; Marianne Benard; Elise Schaefer; Sumaiya Iqbal; Keri Ramsey; Michèle Ernoult-Lange; Francesca Mattioli; Lorida Llaci; Véronique Geoffroy; Maité Courel; Marcus Naymik; Kristine K Bachman; Rolph Pfundt; Patrick Rump; Johanna Ter Beest; Ingrid M Wentzensen; Kristin G Monaghan; Kirsty McWalter; Ryan Richholt; Antony Le Béchec; Wayne Jepsen; Matt De Both; Newell Belnap; Anne Boland; Ignazio S Piras; Jean-François Deleuze; Szabolcs Szelinger; Hélène Dollfus; Jamel Chelly; Jean Muller; Arthur Campbell; Dennis Lal; Sampathkumar Rangasamy; Jean-Louis Mandel; Vinodh Narayanan; Matt Huentelman; Dominique Weil; Amélie Piton
Journal:  Am J Hum Genet       Date:  2019-08-15       Impact factor: 11.025

2.  Discovery of tandem and interspersed segmental duplications using high-throughput sequencing.

Authors:  Arda Soylev; Thong Minh Le; Hajar Amini; Can Alkan; Fereydoun Hormozdiari
Journal:  Bioinformatics       Date:  2019-10-15       Impact factor: 6.937

3.  Next-Generation Sequencing Reveals Novel Mutations in X-linked Intellectual Disability.

Authors:  Babylakshmi Muthusamy; Lakshmi Dhevi N Selvan; Thong T Nguyen; Jesna Manoj; Eric W Stawiski; Bijay S Jaiswal; Weiru Wang; Remya Raja; Vedam Laxmi Ramprasad; Ravi Gupta; Sakthivel Murugan; Jayarama S Kadandale; T S Keshava Prasad; Kavita Reddy; Andrew Peterson; Akhilesh Pandey; Somasekar Seshagiri; Satish Chandra Girimaji; Harsha Gowda
Journal:  OMICS       Date:  2017-05

Review 4.  Implications of germline copy-number variations in psychiatric disorders: review of large-scale genetic studies.

Authors:  Masahiro Nakatochi; Itaru Kushima; Norio Ozaki
Journal:  J Hum Genet       Date:  2020-09-21       Impact factor: 3.172

5.  ScanITD: Detecting internal tandem duplication with robust variant allele frequency estimation.

Authors:  Ting-You Wang; Rendong Yang
Journal:  Gigascience       Date:  2020-08-01       Impact factor: 6.524

6.  Matrilineal analysis of mutations in the DMD gene in a multigenerational South Indian cohort using DMD gene panel sequencing.

Authors:  Arun Shastry; Sankaramoorthy Aravind; Meeta Sunil; Keerthi Ramesh; Berty Ashley; Nithyanandan T; Vedam L Ramprasad; Ravi Gupta; Somasekar Seshagiri; Upendra Nongthomba; Sameer Phalke
Journal:  Mol Genet Genomic Med       Date:  2021-05-07       Impact factor: 2.183

7.  The genome of pest Rhynchophorus ferrugineus reveals gene families important at the plant-beetle interface.

Authors:  Khaled Michel Hazzouri; Naganeeswaran Sudalaimuthuasari; Biduth Kundu; David Nelson; Mohammad Ali Al-Deeb; Alain Le Mansour; Johnston J Spencer; Claude Desplan; Khaled M A Amiri
Journal:  Commun Biol       Date:  2020-06-24

8.  Detecting large deletions at base pair level by combining split read and paired read data.

Authors:  Matthew Hayes; Jeremy S Pearson
Journal:  BMC Bioinformatics       Date:  2017-10-16       Impact factor: 3.169

9.  Detection and quantification of mitochondrial DNA deletions from next-generation sequence data.

Authors:  Colleen M Bosworth; Sneha Grandhi; Meetha P Gould; Thomas LaFramboise
Journal:  BMC Bioinformatics       Date:  2017-10-16       Impact factor: 3.169

10.  Detection of genomic structural variations in Guizhou indigenous pigs and the comparison with other breeds.

Authors:  Chang Liu; Xueqin Ran; Jiafu Wang; Sheng Li; Jianfeng Liu
Journal:  PLoS One       Date:  2018-03-20       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.