BRAKER1: Unsupervised RNA-Seq-Based Genome Annotation with GeneMark-ET and AUGUSTUS.

Katharina J Hoff, Simone Lange, Alexandre Lomsadze, Mark Borodovsky, Mario Stanke.

Abstract

MOTIVATION: Gene finding in eukaryotic genomes is notoriously difficult to automate. The task is to design a workflow with a minimal set of tools that would reach state-of-the-art performance across a wide range of species. GeneMark-ET is a gene prediction tool that incorporates RNA-Seq data into unsupervised training and subsequently generates ab initio gene predictions. AUGUSTUS is a gene finder that usually requires supervised training and uses information from RNA-Seq reads in the prediction step. The complementary strengths of GeneMark-ET and AUGUSTUS provided the motivation for designing a new combined tool for automatic gene prediction.
RESULTS: We present BRAKER1, a pipeline for unsupervised RNA-Seq-based genome annotation that combines the advantages of GeneMark-ET and AUGUSTUS. As input, BRAKER1 requires a genome assembly file and a file in BAM format with spliced alignments of RNA-Seq reads to the genome. First, GeneMark-ET performs iterative training and generates initial gene structures. Second, AUGUSTUS uses the predicted genes for training and then integrates RNA-Seq read information into the final gene predictions. In our experiments, we observed that BRAKER1 was more accurate than MAKER2 when RNA-Seq was the sole source for training and prediction. BRAKER1 does not require pre-trained parameters or a separate expert-prepared training step.
AVAILABILITY AND IMPLEMENTATION: BRAKER1 is available for download at http://bioinf.uni-greifswald.de/bioinf/braker/ and http://exon.gatech.edu/GeneMark/
CONTACT: katharina.hoff@uni-greifswald.de or borodovsky@gatech.edu
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
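
The two inputs named under RESULTS (a genome assembly and a BAM file of spliced RNA-Seq alignments) can be illustrated with a minimal sketch that only assembles a BRAKER command line and does not run it. The helper name build_braker_command, the file names, and the species label are hypothetical. The options --genome, --bam and --species are the ones used by the braker.pl script in BRAKER releases, but they should be checked against braker.pl --help for the installed version. This is an illustration under those assumptions, not the authors' documented invocation.

```python
from __future__ import annotations

from pathlib import Path


def build_braker_command(genome_fasta: str, rnaseq_bam: str, species: str) -> list[str]:
    """Assemble (but do not execute) a braker.pl command line.

    Hypothetical helper for illustration only. It mirrors the two inputs
    named in the abstract: a genome assembly and a BAM file of spliced
    RNA-Seq alignments. The species label names the parameter set that
    the unsupervised training writes.
    """
    if not genome_fasta:
        raise ValueError("a genome assembly file is required")
    # The abstract specifies BAM format for the RNA-Seq alignments.
    if Path(rnaseq_bam).suffix.lower() != ".bam":
        raise ValueError(f"expected a .bam file, got: {rnaseq_bam}")
    if not species:
        raise ValueError("a species label is required")

    # Assumption: option names follow braker.pl; verify with `braker.pl --help`.
    return [
        "braker.pl",
        f"--genome={genome_fasta}",
        f"--bam={rnaseq_bam}",
        f"--species={species}",
    ]


if __name__ == "__main__":
    # File names and species label below are placeholders.
    print(" ".join(build_braker_command("genome.fa", "rnaseq.bam", "my_species")))
```

Returning the argument list instead of running it keeps the sketch free of side effects; the list could later be handed to subprocess.run once GeneMark-ET and AUGUSTUS are installed and configured.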

Substances: RNA

Year: 2015 PMID: 26559507 PMCID: PMC6078167 DOI: 10.1093/bioinformatics/btv661

Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937

8 in total

1. Using native and syntenically mapped cDNA alignments to improve de novo gene finding.

Authors: Mario Stanke; Mark Diekhans; Robert Baertsch; David Haussler
Journal: Bioinformatics Date: 2008-01-24 Impact factor: 6.937

2. Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training.

Authors: Vardges Ter-Hovhannisyan; Alexandre Lomsadze; Yury O Chernoff; Mark Borodovsky
Journal: Genome Res Date: 2008-08-29 Impact factor: 9.043

3. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects.

Authors: Carson Holt; Mark Yandell
Journal: BMC Bioinformatics Date: 2011-12-22 Impact factor: 3.307

4. Eval: a software package for analysis of genome annotations.

Authors: Evan Keibler; Michael R Brent
Journal: BMC Bioinformatics Date: 2003-10-17 Impact factor: 3.169

5. CodingQuarry: highly accurate hidden Markov model gene prediction in fungal genomes using RNA-seq transcripts.

Authors: Alison C Testa; James K Hane; Simon R Ellwood; Richard P Oliver
Journal: BMC Genomics Date: 2015-03-11 Impact factor: 3.969

6. SnowyOwl: accurate prediction of fungal genes by using RNA-Seq and homology information to select among ab initio models.

Authors: Ian Reid; Nicholas O'Toole; Omar Zabaneh; Reza Nourzadeh; Mahmoud Dahdouli; Mostafa Abdellateef; Paul M K Gordon; Jung Soh; Gregory Butler; Christoph W Sensen; Adrian Tsang
Journal: BMC Bioinformatics Date: 2014-07-01 Impact factor: 3.169

7. Assessment of transcript reconstruction methods for RNA-seq.

Authors: Josep F Abril; Pär G Engström; Felix Kokocinski; Tamara Steijger; Tim J Hubbard; Roderic Guigó; Jennifer Harrow; Paul Bertone
Journal: Nat Methods Date: 2013-11-03 Impact factor: 28.547

8. Integration of mapped RNA-Seq reads into automatic training of eukaryotic gene finding algorithm.

Authors: Alexandre Lomsadze; Paul D Burns; Mark Borodovsky
Journal: Nucleic Acids Res Date: 2014-07-02 Impact factor: 16.971

318 in total

1. Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome.

Authors: Derek M Bickhart; Benjamin D Rosen; Sergey Koren; Brian L Sayre; Alex R Hastie; Saki Chan; Joyce Lee; Ernest T Lam; Ivan Liachko; Shawn T Sullivan; Joshua N Burton; Heather J Huson; John C Nystrom; Christy M Kelley; Jana L Hutchison; Yang Zhou; Jiajie Sun; Alessandra Crisà; F Abel Ponce de León; John C Schwartz; John A Hammond; Geoffrey C Waldbieser; Steven G Schroeder; George E Liu; Maitreya J Dunham; Jay Shendure; Tad S Sonstegard; Adam M Phillippy; Curtis P Van Tassell; Timothy P L Smith
Journal: Nat Genet Date: 2017-03-06 Impact factor: 38.330

2. Long-Read Annotation: Automated Eukaryotic Genome Annotation Based on Long-Read cDNA Sequencing.

Authors: David E Cook; Jose Espejo Valle-Inclan; Alice Pajoro; Hanna Rovenich; Bart P H J Thomma; Luigi Faino
Journal: Plant Physiol Date: 2018-11-06 Impact factor: 8.340

3. RNA-Seq in Nonmodel Organisms.

Authors: Vered Chalifa-Caspi
Journal: Methods Mol Biol Date: 2021

4. Methods, Tools and Current Perspectives in Proteogenomics. (Review)

Authors: Kelly V Ruggles; Karsten Krug; Xiaojing Wang; Karl R Clauser; Jing Wang; Samuel H Payne; David Fenyö; Bing Zhang; D R Mani
Journal: Mol Cell Proteomics Date: 2017-04-29 Impact factor: 5.911

5. Fusarium virguliforme Transcriptional Plasticity Is Revealed by Host Colonization of Maize versus Soybean.

Authors: Amy Baetsen-Young; Ching Man Wai; Robert VanBuren; Brad Day
Journal: Plant Cell Date: 2019-12-18 Impact factor: 11.277

6. Are We There Yet? Reliably Estimating the Completeness of Plant Genome Sequences.

Authors: Elisabeth Veeckman; Tom Ruttink; Klaas Vandepoele
Journal: Plant Cell Date: 2016-08-10 Impact factor: 11.277

7. xGDBvm: A Web GUI-Driven Workflow for Annotating Eukaryotic Genomes in the Cloud.

Authors: Jon Duvick; Daniel S Standage; Nirav Merchant; Volker P Brendel
Journal: Plant Cell Date: 2016-03-28 Impact factor: 11.277

8. GeneMark-EP+: eukaryotic gene prediction with self-training in the space of genes and proteins.

Authors: Tomáš Brůna; Alexandre Lomsadze; Mark Borodovsky
Journal: NAR Genom Bioinform Date: 2020-05-13

9. Time-resolved transcriptome analysis and lipid pathway reconstruction of the oleaginous green microalga Monoraphidium neglectum reveal a model for triacylglycerol and lipid hyperaccumulation.

Authors: Daniel Jaeger; Anika Winkler; Jan H Mussgnug; Jörn Kalinowski; Alexander Goesmann; Olaf Kruse
Journal: Biotechnol Biofuels Date: 2017-08-14 Impact factor: 6.040

10. Sterol regulatory element-binding protein Sre1 regulates carotenogenesis in the red yeast Xanthophyllomyces dendrorhous.

Authors: Melissa Gómez; Sebastián Campusano; María Soledad Gutiérrez; Dionisia Sepúlveda; Salvador Barahona; Marcelo Baeza; Víctor Cifuentes; Jennifer Alcaíno
Journal: J Lipid Res Date: 2020-09-15 Impact factor: 5.922