BACKGROUND: The possibility of generating large RNA-sequencing datasets has led to the development of various reference-based and de novo transcriptome assemblers, each with its own strengths and limitations. While reference-based tools are widely used in transcriptomic studies, their application is limited to organisms with finished and well-annotated genomes. De novo transcriptome reconstruction from short reads remains an open and challenging problem, complicated by varying expression levels across genes, alternative splicing, and paralogous genes.

RESULTS: Here we describe rnaSPAdes, a novel transcriptome assembler that has been developed on top of the SPAdes genome assembler and explores computational parallels between the assembly of transcriptomes and single-cell genomes. We also present quality assessment reports for rnaSPAdes assemblies, compare rnaSPAdes with modern transcriptome assembly tools using several evaluation approaches on various RNA-sequencing datasets, and briefly highlight the strong and weak points of the different assemblers.

CONCLUSIONS: Based on our comparison of the assembly methods, we conclude that no assembler is the absolute leader across all quality metrics and all datasets used. However, rnaSPAdes typically outperforms the other assemblers in the number of assembled genes and isoforms, an important property, while on average achieving higher accuracy statistics than its closest competitors.