Literature DB >> 32300014

RepeatModeler2 for automated genomic discovery of transposable element families.

Jullien M Flynn1, Robert Hubley2, Clément Goubert1, Jeb Rosen2, Andrew G Clark3, Cédric Feschotte3, Arian F Smit4.   

Abstract

The accelerating pace of genome sequencing throughout the tree of life is driving the need for improved unsupervised annotation of genome components such as transposable elements (TEs). Because the types and sequences of TEs are highly variable across species, automated TE discovery and annotation are challenging and time-consuming tasks. A critical first step is the de novo identification and accurate compilation of sequence models representing all of the unique TE families dispersed in the genome. Here we introduce RepeatModeler2, a pipeline that greatly facilitates this process. This program brings substantial improvements over the original version of RepeatModeler, one of the most widely used tools for TE discovery. In particular, this version incorporates a module for structural discovery of complete long terminal repeat (LTR) retroelements, which are widespread in eukaryotic genomes but recalcitrant to automated identification because of their size and sequence complexity. We benchmarked RepeatModeler2 on three model species with diverse TE landscapes and high-quality, manually curated TE libraries: Drosophila melanogaster (fruit fly), Danio rerio (zebrafish), and Oryza sativa (rice). In these three species, RepeatModeler2 identified approximately 3 times more consensus sequences matching with >95% sequence identity and sequence coverage to the manually curated sequences than the original RepeatModeler. As expected, the greatest improvement is for LTR retroelements. Thus, RepeatModeler2 represents a valuable addition to the genome annotation toolkit that will enhance the identification and study of TEs in eukaryotic genome sequences. RepeatModeler2 is available as source code or a containerized package under an open license (https://github.com/Dfam-consortium/RepeatModeler, http://www.repeatmasker.org/RepeatModeler/).

Entities:  

Keywords:  genome annotation; mobile genetic elements; transposon families

Year:  2020        PMID: 32300014     DOI: 10.1073/pnas.1921046117

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  264 in total

1.  A bird-like genome from a frog: Mechanisms of genome size reduction in the ornate burrowing frog, Platyplectrum ornatum.

Authors:  Sangeet Lamichhaney; Renee Catullo; J Scott Keogh; Simon Clulow; Scott V Edwards; Tariq Ezaz
Journal:  Proc Natl Acad Sci U S A       Date:  2021-03-16       Impact factor: 11.205

2.  Evolutionary Dynamics of the Pericentromeric Heterochromatin in Drosophila virilis and Related Species.

Authors:  Alexander P Rezvykh; Sergei Yu Funikov; Lyudmila A Protsenko; Dina A Kulikova; Elena S Zelentsova; Lyubov N Chuvakova; Justin P Blumenstiel; Michael B Evgen'ev
Journal:  Genes (Basel)       Date:  2021-01-27       Impact factor: 4.096

3.  Genome sequences and in silico effector mining of Corynespora cassiicola CC_29 and Corynespora olivacea CBS 114450.

Authors:  Thaís Carolina da Silva Dal'Sasso; Hugo Vianna Silva Rody; Pablo Enrique Grijalba; Luiz Orlando de Oliveira
Journal:  Arch Microbiol       Date:  2021-07-02       Impact factor: 2.552

4.  Evolutionary dynamics of transposable elements in bdelloid rotifers.

Authors:  Reuben W Nowell; Christopher G Wilson; Pedro Almeida; Philipp H Schiffer; Diego Fontaneto; Lutz Becks; Fernando Rodriguez; Irina R Arkhipova; Timothy G Barraclough
Journal:  Elife       Date:  2021-02-05       Impact factor: 8.140

5.  A mosquito small RNA genomics resource reveals dynamic evolution and host responses to viruses and transposons.

Authors:  Qicheng Ma; Satyam P Srivastav; Stephanie Gamez; Gargi Dayama; Fabiana Feitosa-Suntheimer; Edward I Patterson; Rebecca M Johnson; Erik M Matson; Alexander S Gold; Douglas E Brackney; John H Connor; Tonya M Colpitts; Grant L Hughes; Jason L Rasgon; Tony Nolan; Omar S Akbari; Nelson C Lau
Journal:  Genome Res       Date:  2021-01-08       Impact factor: 9.043

6.  Transcription initiation of distant core promoters in a large-sized genome of an insect.

Authors:  Qing Liu; Feng Jiang; Jie Zhang; Xiao Li; Le Kang
Journal:  BMC Biol       Date:  2021-03-30       Impact factor: 7.431

7.  Evidence from oyster suggests an ancient role for Pdx in regulating insulin gene expression in animals.

Authors:  Fei Xu; Ferdinand Marlétaz; Daria Gavriouchkina; Xiao Liu; Tatjana Sauka-Spengler; Guofan Zhang; Peter W H Holland
Journal:  Nat Commun       Date:  2021-05-25       Impact factor: 14.919

8.  Chromosome-level genome assembly and manually-curated proteome of model necrotroph Parastagonospora nodorum Sn15 reveals a genome-wide trove of candidate effector homologs, and redundancy of virulence-related functions within an accessory chromosome.

Authors:  Stefania Bertazzoni; Darcy A B Jones; Huyen T Phan; Kar-Chun Tan; James K Hane
Journal:  BMC Genomics       Date:  2021-05-25       Impact factor: 3.969

9.  A sensitive repeat identification framework based on short and long reads.

Authors:  Xingyu Liao; Min Li; Kang Hu; Fang-Xiang Wu; Xin Gao; Jianxin Wang
Journal:  Nucleic Acids Res       Date:  2021-09-27       Impact factor: 16.971

10.  Amplified Fragments of an Autosome-Borne Gene Constitute a Significant Component of the W Sex Chromosome of Eremias velox (Reptilia, Lacertidae).

Authors:  Artem Lisachov; Daria Andreyushkova; Guzel Davletshina; Dmitry Prokopov; Svetlana Romanenko; Svetlana Galkina; Alsu Saifitdinova; Evgeniy Simonov; Pavel Borodin; Vladimir Trifonov
Journal:  Genes (Basel)       Date:  2021-05-20       Impact factor: 4.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.