Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Discovery of tandem and interspersed segmental duplications using high-throughput sequencing.

Literature DB >> 30937433

Discovery of tandem and interspersed segmental duplications using high-throughput sequencing.

Arda Soylev^1,2, Thong Minh Le^3,4, Hajar Amini⁵, Can Alkan^1,6,7, Fereydoun Hormozdiari^3,8,9.

Abstract

MOTIVATION: Several algorithms have been developed that use high-throughput sequencing technology to characterize structural variations (SVs). Most of the existing approaches focus on detecting relatively simple types of SVs such as insertions, deletions and short inversions. In fact, complex SVs are of crucial importance and several have been associated with genomic disorders. To better understand the contribution of complex SVs to human disease, we need new algorithms to accurately discover and genotype such variants. Additionally, due to similar sequencing signatures, inverted duplications or gene conversion events that include inverted segmental duplications are often characterized as simple inversions, likewise, duplications and gene conversions in direct orientation may be called as simple deletions. Therefore, there is still a need for accurate algorithms to fully characterize complex SVs and thus improve calling accuracy of more simple variants.
RESULTS: We developed novel algorithms to accurately characterize tandem, direct and inverted interspersed segmental duplications using short read whole genome sequencing datasets. We integrated these methods to our TARDIS tool, which is now capable of detecting various types of SVs using multiple sequence signatures such as read pair, read depth and split read. We evaluated the prediction performance of our algorithms through several experiments using both simulated and real datasets. In the simulation experiments, using a 30× coverage TARDIS achieved 96% sensitivity with only 4% false discovery rate. For experiments that involve real data, we used two haploid genomes (CHM1 and CHM13) and one human genome (NA12878) from the Illumina Platinum Genomes set. Comparison of our results with orthogonal PacBio call sets from the same genomes revealed higher accuracy for TARDIS than state-of-the-art methods. Furthermore, we showed a surprisingly low false discovery rate of our approach for discovery of tandem, direct and inverted interspersed segmental duplications prediction on CHM1 (<5% for the top 50 predictions).
AVAILABILITY AND IMPLEMENTATION: TARDIS source code is available at https://github.com/BilkentCompGen/tardis, and a corresponding Docker image is available at https://hub.docker.com/r/alkanlab/tardis/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities: Gene Species

Mesh：

Year: 2019 PMID： 30937433 PMCID： PMC6792081 DOI： 10.1093/bioinformatics/btz237

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

43 in total

1. Simultaneous structural variation discovery among multiple paired-end sequenced genomes.

Authors: Fereydoun Hormozdiari; Iman Hajirasouliha; Andrew McPherson; Evan E Eichler; S Cenk Sahinalp
Journal: Genome Res Date: 2011-11-02 Impact factor: 9.043

2. Fine-scale structural variation of the human genome.

Authors: Eray Tuzun; Andrew J Sharp; Jeffrey A Bailey; Rajinder Kaul; V Anne Morrison; Lisa M Pertz; Eric Haugen; Hillary Hayden; Donna Albertson; Daniel Pinkel; Maynard V Olson; Evan E Eichler
Journal: Nat Genet Date: 2005-05-15 Impact factor: 38.330

3. MoDIL: detecting small indels from clone-end sequencing with mixtures of distributions.

Authors: Seunghak Lee; Fereydoun Hormozdiari; Can Alkan; Michael Brudno
Journal: Nat Methods Date: 2009-05-31 Impact factor: 28.547

4. Combinatorial algorithms for structural variation detection in high-throughput sequenced genomes.

Authors: Fereydoun Hormozdiari; Can Alkan; Evan E Eichler; S Cenk Sahinalp
Journal: Genome Res Date: 2009-05-15 Impact factor: 9.043

5. Origins and functional impact of copy number variation in the human genome.

Authors: Donald F Conrad; Dalila Pinto; Richard Redon; Lars Feuk; Omer Gokcumen; Yujun Zhang; Jan Aerts; T Daniel Andrews; Chris Barnes; Peter Campbell; Tomas Fitzgerald; Min Hu; Chun Hwa Ihm; Kati Kristiansson; Daniel G Macarthur; Jeffrey R Macdonald; Ifejinelo Onyiah; Andy Wing Chun Pang; Sam Robson; Kathy Stirrups; Armand Valsesia; Klaudia Walter; John Wei; Chris Tyler-Smith; Nigel P Carter; Charles Lee; Stephen W Scherer; Matthew E Hurles
Journal: Nature Date: 2009-10-07 Impact factor: 49.962

6. Reconstructing complex regions of genomes using long-read sequencing technology.

Authors: John Huddleston; Swati Ranade; Maika Malig; Francesca Antonacci; Mark Chaisson; Lawrence Hon; Peter H Sudmant; Tina A Graves; Can Alkan; Megan Y Dennis; Richard K Wilson; Stephen W Turner; Jonas Korlach; Evan E Eichler
Journal: Genome Res Date: 2014-01-13 Impact factor: 9.043

7. TIDDIT, an efficient and comprehensive structural variant caller for massive parallel sequencing data.

Authors: Jesper Eisfeldt; Francesco Vezzi; Pall Olason; Daniel Nilsson; Anna Lindstrand
Journal: F1000Res Date: 2017-05-10

8. Discovery and genotyping of structural variation from long-read haploid genome sequence data.

Authors: John Huddleston; Mark J P Chaisson; Karyn Meltz Steinberg; Wes Warren; Kendra Hoekzema; David Gordon; Tina A Graves-Lindsay; Katherine M Munson; Zev N Kronenberg; Laura Vives; Paul Peluso; Matthew Boitano; Chen-Shin Chin; Jonas Korlach; Richard K Wilson; Evan E Eichler
Journal: Genome Res Date: 2016-11-28 Impact factor: 9.043

9. Systematic assessment of copy number variant detection via genome-wide SNP genotyping.

Authors: Gregory M Cooper; Troy Zerr; Jeffrey M Kidd; Evan E Eichler; Deborah A Nickerson
Journal: Nat Genet Date: 2008-09-07 Impact factor: 38.330

10. DELLY: structural variant discovery by integrated paired-end and split-read analysis.

Authors: Tobias Rausch; Thomas Zichner; Andreas Schlattl; Adrian M Stütz; Vladimir Benes; Jan O Korbel
Journal: Bioinformatics Date: 2012-09-15 Impact factor: 6.937

8 in total

1. Nebula: ultra-efficient mapping-free structural variant genotyper.

Authors: Parsoa Khorsand; Fereydoun Hormozdiari
Journal: Nucleic Acids Res Date: 2021-05-07 Impact factor: 16.971

2. Systematic analysis of CCCH zinc finger family in Brassica napus showed that BnRR-TZFs are involved in stress resistance.

Authors: Boyi Pi; Jiao Pan; Mu Xiao; Xinchang Hu; Lei Zhang; Min Chen; Boyu Liu; Ying Ruan; Yong Huang
Journal: BMC Plant Biol Date: 2021-11-23 Impact factor: 4.215

3. svBreak: A New Approach for the Detection of Structural Variant Breakpoints Based on Convolutional Neural Network.

Authors: Shaoqiang Wang; Jie Li; A K Alvi Haque; Haiyong Zhao; Liying Yang; Xiguo Yuan
Journal: Biomed Res Int Date: 2022-03-19 Impact factor: 3.411

4. Systematic analysis of CNGCs in cotton and the positive role of GhCNGC32 and GhCNGC35 in salt tolerance.

Authors: Zhengying Lu; Guo Yin; Mao Chai; Lu Sun; Hengling Wei; Jie Chen; Yufeng Yang; Xiaokang Fu; Shiyun Li
Journal: BMC Genomics Date: 2022-08-05 Impact factor: 4.547

5. SVXplorer: Three-tier approach to identification of structural variants via sequential recombination of discordant cluster signatures.

Authors: Kunal Kathuria; Aakrosh Ratan
Journal: PLoS Comput Biol Date: 2020-03-17 Impact factor: 4.475

6. VALOR2: characterization of large-scale structural variants using linked-reads.

Authors: Fatih Karaoğlanoğlu; Camir Ricketts; Ezgi Ebren; Marzieh Eslami Rasekh; Iman Hajirasouliha; Can Alkan
Journal: Genome Biol Date: 2020-03-19 Impact factor: 13.583

7. Towards a better understanding of the low recall of insertion variants with short-read based variant callers.

Authors: Wesley J Delage; Julien Thevenon; Claire Lemaitre
Journal: BMC Genomics Date: 2020-11-04 Impact factor: 3.969

Review 8. An Overview of Duplicated Gene Detection Methods: Why the Duplication Mechanism Has to Be Accounted for in Their Choice.

Authors: Tanguy Lallemand; Martin Leduc; Claudine Landès; Carène Rizzon; Emmanuelle Lerat
Journal: Genes (Basel) Date: 2020-09-04 Impact factor: 4.096

8 in total