Literature DB >> 35037215

Finding and Characterizing Repeats in Plant Genomes.

Jacques Nicolas1, Sébastien Tempel2, Anna-Sophie Fiston-Lavier3,4, Emira Cherif3.   

Abstract

Plant genomes contain a particularly high proportion of repeated structures of various types. This chapter proposes a guided tour of the available software that can help biologists to scan automatically for these repeats in sequence data or check hypothetical models intended to characterize their structures. Since transposable elements (TEs) are a major source of repeats in plants, many methods have been used or developed for this broad class of sequences. They are representative of the range of tools available for other classes of repeats and we have provided two sections on this topic (for the analysis of genomes or directly of sequenced reads), as well as a selection of the main existing software. It may be hard to keep up with the profusion of proposals in this dynamic field and the rest of the chapter is devoted to the foundations of an efficient search for repeats and more complex patterns. We first introduce the key concepts of the art of indexing and mapping or querying sequences. We end the chapter with the more prospective issue of building models of repeat families. We present the Machine Learning approach first, seeking to build predictors automatically for some families of ET, from a set of sequences known to belong to this family. A second approach, the linguistic (or syntactic) approach, allows biologists to describe themselves and check the validity of models of their favorite repeat family.
© 2022. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.

Entities:  

Keywords:  Algorithmics on words; Homology-based; Indexing; Machine Learning; Mapping; Pattern matching; Repeats; Structure-based methods; Transposon

Mesh:

Substances:

Year:  2022        PMID: 35037215     DOI: 10.1007/978-1-0716-2067-0_18

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  143 in total

1.  Inverted repeat structure of the human genome: the X-chromosome contains a preponderance of large, highly homologous inverted repeats that contain testes genes.

Authors:  Peter E Warburton; Joti Giordano; Fanny Cheung; Yefgeniy Gelfand; Gary Benson
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

2.  Review of tandem repeat search tools: a systematic approach to evaluating algorithmic performance.

Authors:  Kian Guan Lim; Chee Keong Kwoh; Li Yang Hsu; Adrianto Wirawan
Journal:  Brief Bioinform       Date:  2012-05-29       Impact factor: 11.622

Review 3.  Repbase Update, a database of eukaryotic repetitive elements.

Authors:  J Jurka; V V Kapitonov; A Pavlicek; P Klonowski; O Kohany; J Walichiewicz
Journal:  Cytogenet Genome Res       Date:  2005       Impact factor: 1.636

4.  Sequence-specific error profile of Illumina sequencers.

Authors:  Kensuke Nakamura; Taku Oshima; Takuya Morimoto; Shun Ikeda; Hirofumi Yoshikawa; Yuh Shiwa; Shu Ishikawa; Margaret C Linak; Aki Hirai; Hiroki Takahashi; Md Altaf-Ul-Amin; Naotake Ogasawara; Shigehiko Kanaya
Journal:  Nucleic Acids Res       Date:  2011-05-16       Impact factor: 16.971

5.  The peculiar landscape of repetitive sequences in the olive (Olea europaea L.) genome.

Authors:  Elena Barghini; Lucia Natali; Rosa Maria Cossu; Tommaso Giordani; Massimo Pindo; Federica Cattonaro; Simone Scalabrin; Riccardo Velasco; Michele Morgante; Andrea Cavallini
Journal:  Genome Biol Evol       Date:  2014-04       Impact factor: 3.416

6.  detectIR: a novel program for detecting perfect and imperfect inverted repeats using complex numbers and vector calculation.

Authors:  Congting Ye; Guoli Ji; Lei Li; Chun Liang
Journal:  PLoS One       Date:  2014-11-19       Impact factor: 3.240

7.  Lirex: A Package for Identification of Long Inverted Repeats in Genomes.

Authors:  Yong Wang; Jiao-Mei Huang
Journal:  Genomics Proteomics Bioinformatics       Date:  2017-04-07       Impact factor: 7.691

8.  Noise-cancelling repeat finder: uncovering tandem repeats in error-prone long-read sequencing data.

Authors:  Robert S Harris; Monika Cechova; Kateryna D Makova
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

9.  PGSB PlantsDB: updates to the database framework for comparative plant genome research.

Authors:  Manuel Spannagl; Thomas Nussbaumer; Kai C Bader; Mihaela M Martis; Michael Seidel; Karl G Kugler; Heidrun Gundlach; Klaus F X Mayer
Journal:  Nucleic Acids Res       Date:  2015-11-02       Impact factor: 16.971

10.  Finding and extending ancient simple sequence repeat-derived regions in the human genome.

Authors:  Jonathan A Shortt; Robert P Ruggiero; Corey Cox; Aaron C Wacholder; David D Pollock
Journal:  Mob DNA       Date:  2020-02-17
View more
  1 in total

Review 1.  Methodologies for the De novo Discovery of Transposable Element Families.

Authors:  Jessica M Storer; Robert Hubley; Jeb Rosen; Arian F A Smit
Journal:  Genes (Basel)       Date:  2022-04-17       Impact factor: 4.141

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.