Literature DB >> 30937433

Discovery of tandem and interspersed segmental duplications using high-throughput sequencing.

Arda Soylev1,2, Thong Minh Le3,4, Hajar Amini5, Can Alkan1,6,7, Fereydoun Hormozdiari3,8,9.   

Abstract

MOTIVATION: Several algorithms have been developed that use high-throughput sequencing technology to characterize structural variations (SVs). Most of the existing approaches focus on detecting relatively simple types of SVs such as insertions, deletions and short inversions. In fact, complex SVs are of crucial importance and several have been associated with genomic disorders. To better understand the contribution of complex SVs to human disease, we need new algorithms to accurately discover and genotype such variants. Additionally, due to similar sequencing signatures, inverted duplications or gene conversion events that include inverted segmental duplications are often characterized as simple inversions, likewise, duplications and gene conversions in direct orientation may be called as simple deletions. Therefore, there is still a need for accurate algorithms to fully characterize complex SVs and thus improve calling accuracy of more simple variants.
RESULTS: We developed novel algorithms to accurately characterize tandem, direct and inverted interspersed segmental duplications using short read whole genome sequencing datasets. We integrated these methods to our TARDIS tool, which is now capable of detecting various types of SVs using multiple sequence signatures such as read pair, read depth and split read. We evaluated the prediction performance of our algorithms through several experiments using both simulated and real datasets. In the simulation experiments, using a 30× coverage TARDIS achieved 96% sensitivity with only 4% false discovery rate. For experiments that involve real data, we used two haploid genomes (CHM1 and CHM13) and one human genome (NA12878) from the Illumina Platinum Genomes set. Comparison of our results with orthogonal PacBio call sets from the same genomes revealed higher accuracy for TARDIS than state-of-the-art methods. Furthermore, we showed a surprisingly low false discovery rate of our approach for discovery of tandem, direct and inverted interspersed segmental duplications prediction on CHM1 (<5% for the top 50 predictions).
AVAILABILITY AND IMPLEMENTATION: TARDIS source code is available at https://github.com/BilkentCompGen/tardis, and a corresponding Docker image is available at https://hub.docker.com/r/alkanlab/tardis/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2019        PMID: 30937433      PMCID: PMC6792081          DOI: 10.1093/bioinformatics/btz237

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  43 in total

1.  Simultaneous structural variation discovery among multiple paired-end sequenced genomes.

Authors:  Fereydoun Hormozdiari; Iman Hajirasouliha; Andrew McPherson; Evan E Eichler; S Cenk Sahinalp
Journal:  Genome Res       Date:  2011-11-02       Impact factor: 9.043

2.  Fine-scale structural variation of the human genome.

Authors:  Eray Tuzun; Andrew J Sharp; Jeffrey A Bailey; Rajinder Kaul; V Anne Morrison; Lisa M Pertz; Eric Haugen; Hillary Hayden; Donna Albertson; Daniel Pinkel; Maynard V Olson; Evan E Eichler
Journal:  Nat Genet       Date:  2005-05-15       Impact factor: 38.330

3.  MoDIL: detecting small indels from clone-end sequencing with mixtures of distributions.

Authors:  Seunghak Lee; Fereydoun Hormozdiari; Can Alkan; Michael Brudno
Journal:  Nat Methods       Date:  2009-05-31       Impact factor: 28.547

4.  Combinatorial algorithms for structural variation detection in high-throughput sequenced genomes.

Authors:  Fereydoun Hormozdiari; Can Alkan; Evan E Eichler; S Cenk Sahinalp
Journal:  Genome Res       Date:  2009-05-15       Impact factor: 9.043

5.  Origins and functional impact of copy number variation in the human genome.

Authors:  Donald F Conrad; Dalila Pinto; Richard Redon; Lars Feuk; Omer Gokcumen; Yujun Zhang; Jan Aerts; T Daniel Andrews; Chris Barnes; Peter Campbell; Tomas Fitzgerald; Min Hu; Chun Hwa Ihm; Kati Kristiansson; Daniel G Macarthur; Jeffrey R Macdonald; Ifejinelo Onyiah; Andy Wing Chun Pang; Sam Robson; Kathy Stirrups; Armand Valsesia; Klaudia Walter; John Wei; Chris Tyler-Smith; Nigel P Carter; Charles Lee; Stephen W Scherer; Matthew E Hurles
Journal:  Nature       Date:  2009-10-07       Impact factor: 49.962

6.  Reconstructing complex regions of genomes using long-read sequencing technology.

Authors:  John Huddleston; Swati Ranade; Maika Malig; Francesca Antonacci; Mark Chaisson; Lawrence Hon; Peter H Sudmant; Tina A Graves; Can Alkan; Megan Y Dennis; Richard K Wilson; Stephen W Turner; Jonas Korlach; Evan E Eichler
Journal:  Genome Res       Date:  2014-01-13       Impact factor: 9.043

7.  TIDDIT, an efficient and comprehensive structural variant caller for massive parallel sequencing data.

Authors:  Jesper Eisfeldt; Francesco Vezzi; Pall Olason; Daniel Nilsson; Anna Lindstrand
Journal:  F1000Res       Date:  2017-05-10

8.  Discovery and genotyping of structural variation from long-read haploid genome sequence data.

Authors:  John Huddleston; Mark J P Chaisson; Karyn Meltz Steinberg; Wes Warren; Kendra Hoekzema; David Gordon; Tina A Graves-Lindsay; Katherine M Munson; Zev N Kronenberg; Laura Vives; Paul Peluso; Matthew Boitano; Chen-Shin Chin; Jonas Korlach; Richard K Wilson; Evan E Eichler
Journal:  Genome Res       Date:  2016-11-28       Impact factor: 9.043

9.  Systematic assessment of copy number variant detection via genome-wide SNP genotyping.

Authors:  Gregory M Cooper; Troy Zerr; Jeffrey M Kidd; Evan E Eichler; Deborah A Nickerson
Journal:  Nat Genet       Date:  2008-09-07       Impact factor: 38.330

10.  DELLY: structural variant discovery by integrated paired-end and split-read analysis.

Authors:  Tobias Rausch; Thomas Zichner; Andreas Schlattl; Adrian M Stütz; Vladimir Benes; Jan O Korbel
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937

View more
  8 in total

1.  Nebula: ultra-efficient mapping-free structural variant genotyper.

Authors:  Parsoa Khorsand; Fereydoun Hormozdiari
Journal:  Nucleic Acids Res       Date:  2021-05-07       Impact factor: 16.971

2.  Systematic analysis of CCCH zinc finger family in Brassica napus showed that BnRR-TZFs are involved in stress resistance.

Authors:  Boyi Pi; Jiao Pan; Mu Xiao; Xinchang Hu; Lei Zhang; Min Chen; Boyu Liu; Ying Ruan; Yong Huang
Journal:  BMC Plant Biol       Date:  2021-11-23       Impact factor: 4.215

3.  svBreak: A New Approach for the Detection of Structural Variant Breakpoints Based on Convolutional Neural Network.

Authors:  Shaoqiang Wang; Jie Li; A K Alvi Haque; Haiyong Zhao; Liying Yang; Xiguo Yuan
Journal:  Biomed Res Int       Date:  2022-03-19       Impact factor: 3.411

4.  Systematic analysis of CNGCs in cotton and the positive role of GhCNGC32 and GhCNGC35 in salt tolerance.

Authors:  Zhengying Lu; Guo Yin; Mao Chai; Lu Sun; Hengling Wei; Jie Chen; Yufeng Yang; Xiaokang Fu; Shiyun Li
Journal:  BMC Genomics       Date:  2022-08-05       Impact factor: 4.547

5.  SVXplorer: Three-tier approach to identification of structural variants via sequential recombination of discordant cluster signatures.

Authors:  Kunal Kathuria; Aakrosh Ratan
Journal:  PLoS Comput Biol       Date:  2020-03-17       Impact factor: 4.475

6.  VALOR2: characterization of large-scale structural variants using linked-reads.

Authors:  Fatih Karaoğlanoğlu; Camir Ricketts; Ezgi Ebren; Marzieh Eslami Rasekh; Iman Hajirasouliha; Can Alkan
Journal:  Genome Biol       Date:  2020-03-19       Impact factor: 13.583

7.  Towards a better understanding of the low recall of insertion variants with short-read based variant callers.

Authors:  Wesley J Delage; Julien Thevenon; Claire Lemaitre
Journal:  BMC Genomics       Date:  2020-11-04       Impact factor: 3.969

Review 8.  An Overview of Duplicated Gene Detection Methods: Why the Duplication Mechanism Has to Be Accounted for in Their Choice.

Authors:  Tanguy Lallemand; Martin Leduc; Claudine Landès; Carène Rizzon; Emmanuelle Lerat
Journal:  Genes (Basel)       Date:  2020-09-04       Impact factor: 4.096

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.