TranSurVeyor: an improved database-free algorithm for finding non-reference transpositions in high-throughput sequencing data.

Ramesh Rajaby, Wing-Kin Sung.

Abstract

Transpositions transfer DNA segments between different loci within a genome. When a transposition is found in a sample but not in the reference genome, it is called a non-reference transposition. Non-reference transpositions are important structural variations that have clinical impact. Transpositions can be called by analyzing second-generation high-throughput sequencing datasets. Current methods follow either a database-based or a database-free approach. Database-based methods require a database of transposable elements. Some of them have good specificity; however, this approach cannot detect novel transpositions, and it requires a good database of transposable elements, which is not yet available for many species. Database-free methods perform de novo calling of transpositions, but their accuracy is low. We observe that this is due to the misalignment of reads: since reads are short and the human genome has many repeats, false alignments create false positive predictions, while missing alignments reduce the true positive rate. This paper proposes new techniques to improve database-free non-reference transposition calling: first, we propose a realignment strategy called one-end remapping that corrects the alignments of reads in interspersed repeats; second, we propose an SNV-aware filter that removes some incorrectly aligned reads. By combining these two techniques with others, such as clustering and a positive-to-negative ratio filter, our proposed transposition caller, TranSurVeyor, shows at least a 3.1-fold improvement in F1-score over existing database-free methods. More importantly, even though TranSurVeyor does not use databases of prior information, its performance is at least as good as that of existing database-based methods such as MELT, Mobster and RetroSeq. We also illustrate that TranSurVeyor can discover transpositions that are not known in the current database.
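
The abstract reports its headline result as a fold improvement in F1-score. As a reminder of what that metric measures, the following minimal Python sketch computes the standard F1-score (the harmonic mean of precision and recall) from counts of true positive, false positive and false negative calls, and then a fold improvement between two callers. The function names and all counts are hypothetical and chosen only for illustration; they are not taken from the paper, and this is not the authors' evaluation code.

```python
def f1_score(true_pos: int, false_pos: int, false_neg: int) -> float:
    """Standard F1-score: harmonic mean of precision and recall.

    Returns 0.0 when there are no true positives, since precision
    and recall are then either zero or undefined.
    """
    if true_pos == 0:
        return 0.0
    precision = true_pos / (true_pos + false_pos)
    recall = true_pos / (true_pos + false_neg)
    return 2 * precision * recall / (precision + recall)


def fold_improvement(new_f1: float, baseline_f1: float) -> float:
    """Ratio of a new caller's F1-score to a baseline caller's F1-score."""
    if baseline_f1 <= 0:
        raise ValueError("baseline F1-score must be positive")
    return new_f1 / baseline_f1


if __name__ == "__main__":
    # Hypothetical counts for two callers on the same benchmark
    # (illustrative only, not results from the paper).
    baseline = f1_score(true_pos=20, false_pos=80, false_neg=80)
    improved = f1_score(true_pos=80, false_pos=20, false_neg=20)
    print(f"baseline F1 = {baseline:.3f}")
    print(f"improved F1 = {improved:.3f}")
    print(f"fold improvement = {fold_improvement(improved, baseline):.2f}")
```

Because F1 penalizes both false alignments (which lower precision) and missing alignments (which lower recall), it suits the two failure modes the abstract attributes to read misalignment.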

Year:  2018        PMID: 30137425      PMCID: PMC6237741          DOI: 10.1093/nar/gky685

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  8 in total

1.  Whole Genome Analysis of Dizygotic Twins With Autism Reveals Prevalent Transposon Insertion Within Neuronal Regulatory Elements: Potential Implications for Disease Etiology and Clinical Assessment.

Authors:  Kaan Okay; Pelin Ünal Varış; Süha Miral; Athanasia Pavlopoulou; Yavuz Oktay; Gökhan Karakülah
Journal:  J Autism Dev Disord       Date:  2022-06-27

2.  Retrotransposon insertion as a novel mutational cause of spinal muscular atrophy.

Authors:  Myriam Vezain; Christel Thauvin-Robinet; Yoann Vial; Sophie Coutant; Séverine Drunat; Jon Andoni Urtizberea; Anne Rolland; Agnès Jacquin-Piques; Séverine Fehrenbach; Gaël Nicolas; François Lecoquierre; Pascale Saugier-Veber
Journal:  Hum Genet       Date:  2022-09-23       Impact factor: 5.881

3.  TypeTE: a tool to genotype mobile element insertions from whole genome resequencing data.

Authors:  Clément Goubert; Jainy Thomas; Lindsay M Payer; Jeffrey M Kidd; Julie Feusier; W Scott Watkins; Kathleen H Burns; Lynn B Jorde; Cédric Feschotte
Journal:  Nucleic Acids Res       Date:  2020-04-06       Impact factor: 16.971

Review 4.  Identification and Genotyping of Transposable Element Insertions From Genome Sequencing Data.

Authors:  Chong Chu; Boxun Zhao; Peter J Park; Eunjung Alice Lee
Journal:  Curr Protoc Hum Genet       Date:  2020-09

5.  Pedigree-based estimation of human mobile element retrotransposition rates.

Authors:  Julie Feusier; W Scott Watkins; Jainy Thomas; Andrew Farrell; David J Witherspoon; Lisa Baird; Hongseok Ha; Jinchuan Xing; Lynn B Jorde
Journal:  Genome Res       Date:  2019-10       Impact factor: 9.043

6.  Accurate Tracking of the Mutational Landscape of Diploid Hybrid Genomes.

Authors:  Lorenzo Tattini; Nicolò Tellini; Simone Mozzachiodi; Melania D'Angiolo; Sophie Loeillet; Alain Nicolas; Gianni Liti
Journal:  Mol Biol Evol       Date:  2019-12-01       Impact factor: 16.240

7.  Calling large indels in 1047 Arabidopsis with IndelEnsembler.

Authors:  Dong-Xu Liu; Ramesh Rajaby; Lu-Lu Wei; Lei Zhang; Zhi-Quan Yang; Qing-Yong Yang; Wing-Kin Sung
Journal:  Nucleic Acids Res       Date:  2021-11-08       Impact factor: 16.971

8.  Mobile element insertions and associated structural variants in longitudinal breast cancer samples.

Authors:  Cody J Steely; Kristi L Russell; Julie E Feusier; Yi Qiao; Sean V Tavtigian; Gabor Marth; Lynn B Jorde
Journal:  Sci Rep       Date:  2021-06-22       Impact factor: 4.379
