Literature DB >> 31152127

Generic Repeat Finder: A High-Sensitivity Tool for Genome-Wide De Novo Repeat Detection.

Jieming Shi1, Chun Liang2,3.   

Abstract

Comprehensive and accurate annotation of the repeatome, including transposons, is critical for deepening our understanding of repeat origins, biogenesis, regulatory mechanisms, and roles. Here, we developed Generic Repeat Finder (GRF), a tool for genome-wide repeat detection based on fast, exhaustive numerical calculation algorithms integrated with optimized dynamic programming strategies. GRF sensitively identifies terminal inverted repeats (TIRs), terminal direct repeats (TDRs), and interspersed repeats that bear both inverted and direct repeats. GRF also detects DNA or RNA transposable elements characterized by these repeats in plant and animal genomes. For TIRs and TDRs, GRF identifies spacers in the middle and mismatches/insertions or deletions in terminal repeats, showing their alignment or base-pairing information. GRF helps improve the annotation for various DNA transposons and retrotransposons, such as miniature inverted-repeat transposable elements (MITEs), long terminal repeat (LTR) retrotransposons, and non-LTR retrotransposons, including long interspersed nuclear elements and short interspersed nuclear elements in plants. We used GRF to perform TIR/TDR, interspersed-repeat, and MITE detection in several species, including Arabidopsis (Arabidopsis thaliana), rice (Oryza sativa), and mouse (Mus musculus). As a generic bioinformatics tool in repeat finding implemented as a parallelized C++ program, GRF was faster and more sensitive than the existing inverted repeat/MITE detection tools based on numerical approaches (i.e. detectIR and detectMITE) in Arabidopsis and mouse. GRF is more sensitive than Inverted Repeat Finder in TIR detection, LTR_FINDER in short TDR detection (≤1,000 nt), and phRAIDER in interspersed repeat detection in Arabidopsis and rice. GRF is an open source available from Github.
© 2019 American Society of Plant Biologists. All Rights Reserved.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31152127      PMCID: PMC6670090          DOI: 10.1104/pp.19.00386

Source DB:  PubMed          Journal:  Plant Physiol        ISSN: 0032-0889            Impact factor:   8.340


  67 in total

1.  Computational complexity of multiple sequence alignment with SP-score.

Authors:  W Just
Journal:  J Comput Biol       Date:  2001       Impact factor: 1.479

2.  Inverted repeat structure of the human genome: the X-chromosome contains a preponderance of large, highly homologous inverted repeats that contain testes genes.

Authors:  Peter E Warburton; Joti Giordano; Fanny Cheung; Yefgeniy Gelfand; Gary Benson
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

3.  A tool for multiple sequence alignment.

Authors:  D J Lipman; S F Altschul; J D Kececioglu
Journal:  Proc Natl Acad Sci U S A       Date:  1989-06       Impact factor: 11.205

Review 4.  DIRS-1 and the other tyrosine recombinase retrotransposons.

Authors:  R T M Poulter; T J D Goodwin
Journal:  Cytogenet Genome Res       Date:  2005       Impact factor: 1.636

5.  Analysis of the t(3;8) of hereditary renal cell carcinoma: a palindrome-mediated translocation.

Authors:  Takema Kato; Colleen P Franconi; Molly B Sheridan; April M Hacker; Hidehito Inagakai; Thomas W Glover; Martin F Arlt; Harry A Drabkin; Robert M Gemmill; Hiroki Kurahashi; Beverly S Emanuel
Journal:  Cancer Genet       Date:  2014-03-18

6.  Meeting DNA palindromes head-to-head.

Authors:  Gerald R Smith
Journal:  Genes Dev       Date:  2008-10-01       Impact factor: 11.361

7.  Rice Annotation Project Database (RAP-DB): an integrative and interactive database for rice genomics.

Authors:  Hiroaki Sakai; Sung Shin Lee; Tsuyoshi Tanaka; Hisataka Numa; Jungsok Kim; Yoshihiro Kawahara; Hironobu Wakimoto; Ching-chia Yang; Masao Iwamoto; Takashi Abe; Yuko Yamada; Akira Muto; Hachiro Inokuchi; Toshimichi Ikemura; Takashi Matsumoto; Takuji Sasaki; Takeshi Itoh
Journal:  Plant Cell Physiol       Date:  2013-01-07       Impact factor: 4.927

8.  The distribution of inverted repeat sequences in the Saccharomyces cerevisiae genome.

Authors:  Eva M Strawbridge; Gary Benson; Yevgeniy Gelfand; Craig J Benham
Journal:  Curr Genet       Date:  2010-05-06       Impact factor: 3.886

Review 9.  Cruciform structures are a common DNA feature important for regulating biological processes.

Authors:  Václav Brázda; Rob C Laister; Eva B Jagelská; Cheryl Arrowsmith
Journal:  BMC Mol Biol       Date:  2011-08-05       Impact factor: 2.946

10.  A novel method for identifying polymorphic transposable elements via scanning of high-throughput short reads.

Authors:  Houxiang Kang; Dan Zhu; Runmao Lin; Stephen Obol Opiyo; Ning Jiang; Shin-Han Shiu; Guo-Liang Wang
Journal:  DNA Res       Date:  2016-04-20       Impact factor: 4.458

View more
  11 in total

1.  Finding and Characterizing Repeats in Plant Genomes.

Authors:  Jacques Nicolas; Sébastien Tempel; Anna-Sophie Fiston-Lavier; Emira Cherif
Journal:  Methods Mol Biol       Date:  2022

2.  The clove (Syzygium aromaticum) genome provides insights into the eugenol biosynthesis pathway.

Authors:  Sonia Ouadi; Nicolas Sierro; Simon Goepfert; Lucien Bovet; Gaetan Glauser; Armelle Vallat; Manuel C Peitsch; Felix Kessler; Nikolai V Ivanov
Journal:  Commun Biol       Date:  2022-07-09

3.  Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline.

Authors:  Shujun Ou; Weija Su; Yi Liao; Kapeel Chougule; Jireh R A Agda; Adam J Hellinga; Carlos Santiago Blanco Lugo; Tyler A Elliott; Doreen Ware; Thomas Peterson; Ning Jiang; Candice N Hirsch; Matthew B Hufford
Journal:  Genome Biol       Date:  2019-12-16       Impact factor: 13.583

4.  Comparative Population Genomics of Cryptic Speciation and Adaptive Divergence in Bicknell's and Gray-Cheeked Thrushes (Aves: Catharus bicknelli and Catharus minimus).

Authors:  Flavia Termignoni-Garcia; Jeremy J Kirchman; Johnathan Clark; Scott V Edwards
Journal:  Genome Biol Evol       Date:  2022-01-04       Impact factor: 3.416

5.  CicerSpTEdb: A web-based database for high-resolution genome-wide identification of transposable elements in Cicer species.

Authors:  Morad M Mokhtar; Alsamman M Alsamman; Haytham M Abd-Elhalim; Achraf El Allali
Journal:  PLoS One       Date:  2021-11-11       Impact factor: 3.240

6.  Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology.

Authors:  Marc-André Lemay; Jonas A Sibbesen; Davoud Torkamaneh; Jérémie Hamel; Roger C Levesque; François Belzile
Journal:  BMC Biol       Date:  2022-02-23       Impact factor: 7.431

7.  A sensitive repeat identification framework based on short and long reads.

Authors:  Xingyu Liao; Min Li; Kang Hu; Fang-Xiang Wu; Xin Gao; Jianxin Wang
Journal:  Nucleic Acids Res       Date:  2021-09-27       Impact factor: 16.971

Review 8.  Population Genomic Approaches for Weed Science.

Authors:  Sara L Martin; Jean-Sebastien Parent; Martin Laforest; Eric Page; Julia M Kreiner; Tracey James
Journal:  Plants (Basel)       Date:  2019-09-19

9.  Improved High-Quality Genome Assembly and Annotation of Pineapple (Ananas comosus) Cultivar MD2 Revealed Extensive Haplotype Diversity and Diversified FRS/FRF Gene Family.

Authors:  Ashley G Yow; Hamed Bostan; Raúl Castanera; Valentino Ruggieri; Molla F Mengist; Julien Curaba; Roberto Young; Nicholas Gillitt; Massimo Iorizzo
Journal:  Genes (Basel)       Date:  2021-12-24       Impact factor: 4.096

10.  The Transposable Elements of the Drosophila serrata Reference Panel.

Authors:  Zachery Tiedeman; Sarah Signor
Journal:  Genome Biol Evol       Date:  2021-09-01       Impact factor: 3.416

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.