De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers.

Martin Hölzer, Manja Marz.

Abstract

BACKGROUND: In recent years, massively parallel complementary DNA sequencing (RNA sequencing [RNA-Seq]) has emerged as a fast, cost-effective, and robust technology to study entire transcriptomes in various ways. In particular, for non-model organisms and in the absence of an appropriate reference genome, RNA-Seq is used to reconstruct the transcriptome de novo. Although de novo transcriptome assembly of non-model organisms has been on the rise recently and new tools are frequently developed, there is still a knowledge gap about which assembly software should be used to build a comprehensive de novo assembly.
RESULTS: Here, we present a large-scale comparative study in which 10 de novo assembly tools are applied to 9 RNA-Seq data sets spanning different kingdoms of life. Overall, we built >200 single assemblies and evaluated their performance on a combination of 20 biology-based and reference-free metrics. Our study is accompanied by a comprehensive and extensible Electronic Supplement that summarizes all data sets, assembly execution instructions, and evaluation results. Trinity, SPAdes, and Trans-ABySS, followed by Bridger and SOAPdenovo-Trans, generally outperformed the other tools compared. Moreover, we observed species-specific differences in the performance of each assembler. No tool delivered the best results for all data sets.
CONCLUSIONS: We recommend a careful choice and normalization of evaluation metrics to select the best assembly results as a critical step in the reconstruction of a comprehensive de novo transcriptome assembly.
© The Author(s) 2019. Published by Oxford University Press.
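
The recommendation to normalize evaluation metrics before selecting an assembly can be illustrated with a minimal sketch. It assumes min-max scaling of each metric to the range 0 to 1 across assemblers, followed by a plain sum; the abstract does not specify the normalization used, so this scheme is an assumption. The assembler names, metric names, and values are hypothetical toy data, not results from the study.

```python
from typing import Dict


def min_max_normalize(values: Dict[str, float],
                      higher_is_better: bool = True) -> Dict[str, float]:
    """Scale one metric to [0, 1] across assemblers (assumed scheme)."""
    lo, hi = min(values.values()), max(values.values())
    if hi == lo:
        # The metric does not discriminate between assemblers.
        return {name: 0.0 for name in values}
    scaled = {name: (v - lo) / (hi - lo) for name, v in values.items()}
    if not higher_is_better:
        # Invert so that 1.0 always denotes the best assembler.
        scaled = {name: 1.0 - s for name, s in scaled.items()}
    return scaled


def rank_assemblies(metrics: Dict[str, Dict[str, float]],
                    higher_is_better: Dict[str, bool]) -> Dict[str, float]:
    """Sum the normalized metrics per assembler, best total first."""
    totals: Dict[str, float] = {}
    for metric, per_assembler in metrics.items():
        normalized = min_max_normalize(per_assembler, higher_is_better[metric])
        for name, score in normalized.items():
            totals[name] = totals.get(name, 0.0) + score
    return dict(sorted(totals.items(), key=lambda kv: kv[1], reverse=True))


# Hypothetical toy values for illustration only.
toy_metrics = {
    "ortholog_completeness_pct": {"assembler_A": 80.0, "assembler_B": 90.0, "assembler_C": 70.0},
    "read_mapping_rate_pct": {"assembler_A": 95.0, "assembler_B": 85.0, "assembler_C": 90.0},
    "misassembly_count": {"assembler_A": 100.0, "assembler_B": 300.0, "assembler_C": 200.0},
}
toy_direction = {
    "ortholog_completeness_pct": True,
    "read_mapping_rate_pct": True,
    "misassembly_count": False,  # fewer misassemblies is better
}

ranking = rank_assemblies(toy_metrics, toy_direction)
for name, total in ranking.items():
    print(f"{name}: {total:.2f}")
```

Without normalization, a metric on a large scale (such as a raw count) would dominate a summed score; scaling every metric to the same range lets each contribute equally. Weighting metrics differently is a design choice left out of this sketch.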

Keywords:  RNA-Seq; assembly; comparison; de novo; transcriptomics

Year:  2019        PMID: 31077315      PMCID: PMC6511074          DOI: 10.1093/gigascience/giz039

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524

