Literature DB >> 30052957

Leveraging multiple transcriptome assembly methods for improved gene structure annotation.

Luca Venturini1, Shabhonam Caim1,2, Gemy George Kaithakottil1, Daniel Lee Mapleson1, David Swarbreck1.   

Abstract

Background: The performance of RNA sequencing (RNA-seq) aligners and assemblers varies greatly across different organisms and experiments, and often the optimal approach is not known beforehand.
Results: Here, we show that the accuracy of transcript reconstruction can be boosted by combining multiple methods, and we present a novel algorithm to integrate multiple RNA-seq assemblies into a coherent transcript annotation. Our algorithm can remove redundancies and select the best transcript models according to user-specified metrics, while solving common artifacts such as erroneous transcript chimerisms. Conclusions: We have implemented this method in an open-source Python3 and Cython program, Mikado, available on GitHub.

Entities:  

Mesh:

Year:  2018        PMID: 30052957      PMCID: PMC6105091          DOI: 10.1093/gigascience/giy093

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


  40 in total

1.  Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies.

Authors:  Brian J Haas; Arthur L Delcher; Stephen M Mount; Jennifer R Wortman; Roger K Smith; Linda I Hannick; Rama Maiti; Catherine M Ronning; Douglas B Rusch; Christopher D Town; Steven L Salzberg; Owen White
Journal:  Nucleic Acids Res       Date:  2003-10-01       Impact factor: 16.971

2.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

Review 3.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

4.  Accurate assembly of transcripts through phase-preserving graph decomposition.

Authors:  Mingfu Shao; Carl Kingsford
Journal:  Nat Biotechnol       Date:  2017-11-13       Impact factor: 54.908

5.  BEDTools: a flexible suite of utilities for comparing genomic features.

Authors:  Aaron R Quinlan; Ira M Hall
Journal:  Bioinformatics       Date:  2010-01-28       Impact factor: 6.937

6.  MAKER-P: a tool kit for the rapid creation, management, and quality control of plant genome annotations.

Authors:  Michael S Campbell; MeiYee Law; Carson Holt; Joshua C Stein; Gaurav D Moghe; David E Hufnagel; Jikai Lei; Rujira Achawanantakun; Dian Jiao; Carolyn J Lawrence; Doreen Ware; Shin-Han Shiu; Kevin L Childs; Yanni Sun; Ning Jiang; Mark Yandell
Journal:  Plant Physiol       Date:  2013-12-04       Impact factor: 8.340

7.  Combining transcriptome assemblies from multiple de novo assemblers in the allo-tetraploid plant Nicotiana benthamiana.

Authors:  Kenlee Nakasugi; Ross Crowhurst; Julia Bally; Peter Waterhouse
Journal:  PLoS One       Date:  2014-03-10       Impact factor: 3.240

8.  Streaming fragment assignment for real-time analysis of sequencing experiments.

Authors:  Adam Roberts; Lior Pachter
Journal:  Nat Methods       Date:  2012-11-18       Impact factor: 28.547

9.  Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki).

Authors:  David Sturgill; John H Malone; Xia Sun; Harold E Smith; Leonard Rabinow; Marie-Laure Samson; Brian Oliver
Journal:  BMC Bioinformatics       Date:  2013-11-09       Impact factor: 3.169

10.  Evaluation of de novo transcriptome assemblies from RNA-Seq data.

Authors:  Bo Li; Nathanael Fillmore; Yongsheng Bai; Mike Collins; James A Thomson; Ron Stewart; Colin N Dewey
Journal:  Genome Biol       Date:  2014-12-21       Impact factor: 13.583

View more
  26 in total

1.  Independent evolution of ancestral and novel defenses in a genus of toxic plants (Erysimum, Brassicaceae).

Authors:  Tobias Züst; Susan R Strickler; Adrian F Powell; Makenzie E Mabry; Hong An; Mahdieh Mirzaei; Thomas York; Cynthia K Holland; Pavan Kumar; Matthias Erb; Georg Petschenka; José-María Gómez; Francisco Perfectti; Caroline Müller; J Chris Pires; Lukas A Mueller; Georg Jander
Journal:  Elife       Date:  2020-04-07       Impact factor: 8.140

2.  Chromosome-scale assembly and annotation of the perennial ryegrass genome.

Authors:  Istvan Nagy; Elisabeth Veeckman; Chang Liu; Michiel Van Bel; Klaas Vandepoele; Christian Sig Jensen; Tom Ruttink; Torben Asp
Journal:  BMC Genomics       Date:  2022-07-12       Impact factor: 4.547

3.  Genome assembly of two nematode-resistant cotton lines (Gossypium hirsutum L.).

Authors:  Lindsey C Perkin; Al Bell; Lori L Hinze; Charles P-C Suh; Mark A Arick; Daniel G Peterson; Joshua A Udall
Journal:  G3 (Bethesda)       Date:  2021-10-19       Impact factor: 3.542

4.  Foster thy young: enhanced prediction of orphan genes in assembled genomes.

Authors:  Jing Li; Urminder Singh; Priyanka Bhandary; Jacqueline Campbell; Zebulun Arendsee; Arun S Seetharam; Eve Syrkin Wurtele
Journal:  Nucleic Acids Res       Date:  2022-04-22       Impact factor: 19.160

5.  The Gossypium stocksii genome as a novel resource for cotton improvement.

Authors:  Corrinne E Grover; Daojun Yuan; Mark A Arick; Emma R Miller; Guanjing Hu; Daniel G Peterson; Jonathan F Wendel; Joshua A Udall
Journal:  G3 (Bethesda)       Date:  2021-04-19       Impact factor: 3.154

6.  pyrpipe: a Python package for RNA-Seq workflows.

Authors:  Urminder Singh; Jing Li; Arun Seetharam; Eve Syrkin Wurtele
Journal:  NAR Genom Bioinform       Date:  2021-06-01

7.  Impact of transposable elements on genome structure and evolution in bread wheat.

Authors:  Thomas Wicker; Heidrun Gundlach; Manuel Spannagl; Cristobal Uauy; Philippa Borrill; Ricardo H Ramírez-González; Romain De Oliveira; Klaus F X Mayer; Etienne Paux; Frédéric Choulet
Journal:  Genome Biol       Date:  2018-08-17       Impact factor: 13.583

8.  Leveraging multiple transcriptome assembly methods for improved gene structure annotation.

Authors:  Luca Venturini; Shabhonam Caim; Gemy George Kaithakottil; Daniel Lee Mapleson; David Swarbreck
Journal:  Gigascience       Date:  2018-08-01       Impact factor: 6.524

9.  De novo assembly, annotation, and comparative analysis of 26 diverse maize genomes.

Authors:  Matthew B Hufford; Arun S Seetharam; Margaret R Woodhouse; Kapeel M Chougule; Shujun Ou; Jianing Liu; William A Ricci; Tingting Guo; Andrew Olson; Yinjie Qiu; Rafael Della Coletta; Silas Tittes; Asher I Hudson; Alexandre P Marand; Sharon Wei; Zhenyuan Lu; Bo Wang; Marcela K Tello-Ruiz; Rebecca D Piri; Na Wang; Dong Won Kim; Yibing Zeng; Christine H O'Connor; Xianran Li; Amanda M Gilbert; Erin Baggs; Ksenia V Krasileva; John L Portwood; Ethalinda K S Cannon; Carson M Andorf; Nancy Manchanda; Samantha J Snodgrass; David E Hufnagel; Qiuhan Jiang; Sarah Pedersen; Michael L Syring; David A Kudrna; Victor Llaca; Kevin Fengler; Robert J Schmitz; Jeffrey Ross-Ibarra; Jianming Yu; Jonathan I Gent; Candice N Hirsch; Doreen Ware; R Kelly Dawe
Journal:  Science       Date:  2021-08-06       Impact factor: 47.728

10.  Large-Scale Multiplexing Permits Full-Length Transcriptome Annotation of 32 Bovine Tissues From a Single Nanopore Flow Cell.

Authors:  Michelle M Halstead; Alma Islas-Trejo; Daniel E Goszczynski; Juan F Medrano; Huaijun Zhou; Pablo J Ross
Journal:  Front Genet       Date:  2021-05-20       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.