Yinlong Xie1, Gengxiong Wu2, Jingbo Tang1, Ruibang Luo1, Jordan Patterson2, Shanlin Liu2, Weihua Huang2, Guangzhu He2, Shengchang Gu1, Shengkang Li2, Xin Zhou2, Tak-Wah Lam2, Yingrui Li2, Xun Xu2, Gane Ka-Shu Wong1, Jun Wang1. 1. School of Bioscience and Bioengineering, South China University of Technology 510006, Guangzhou, China, BGI-Shenzhen, Shenzhen 518083, China, HKU-BGI Bioinformatics Algorithms and Core Technology Research Laboratory and Department of Computer Science, University of Hong Kong, Pokfulam, Hong Kong, Institute of Biomedical Engineering, XiangYa School of Medicine, Central South University, Changsha 410008, China, BGI-tech, BGI-Shenzhen, Shenzhen 518083, China, Department of Medicine, University of Alberta, Edmonton, AB T6G 2E1, Canada, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China, Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada, The Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, Copenhagen DK-2200, Denmark, Department of Biology, University of Copenhagen, Copenhagen DK-2200, Denmark and Princess Al Jawhara Center of Excellence in the Research of Hereditary Disorders, King Abdulaziz University, Jeddah 21589, Saudi ArabiaSchool of Bioscience and Bioengineering, South China University of Technology 510006, Guangzhou, China, BGI-Shenzhen, Shenzhen 518083, China, HKU-BGI Bioinformatics Algorithms and Core Technology Research Laboratory and Department of Computer Science, University of Hong Kong, Pokfulam, Hong Kong, Institute of Biomedical Engineering, XiangYa School of Medicine, Central South University, Changsha 410008, China, BGI-tech, BGI-Shenzhen, Shenzhen 518083, China, Department of Medicine, University of Alberta, Edmonton, AB T6G 2E1, Canada, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China, Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada, The Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, Copenhagen DK-2200, Denmark, Department of Biology, University of Copenhagen, Copenhagen DK-2200, Denmark and Princess Al Jawha 2. School of Bioscience and Bioengineering, South China University of Technology 510006, Guangzhou, China, BGI-Shenzhen, Shenzhen 518083, China, HKU-BGI Bioinformatics Algorithms and Core Technology Research Laboratory and Department of Computer Science, University of Hong Kong, Pokfulam, Hong Kong, Institute of Biomedical Engineering, XiangYa School of Medicine, Central South University, Changsha 410008, China, BGI-tech, BGI-Shenzhen, Shenzhen 518083, China, Department of Medicine, University of Alberta, Edmonton, AB T6G 2E1, Canada, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China, Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada, The Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, Copenhagen DK-2200, Denmark, Department of Biology, University of Copenhagen, Copenhagen DK-2200, Denmark and Princess Al Jawhara Center of Excellence in the Research of Hereditary Disorders, King Abdulaziz University, Jeddah 21589, Saudi Arabia.
Abstract
MOTIVATION: Transcriptome sequencing has long been the favored method for quickly and inexpensively obtaining a large number of gene sequences from an organism with no reference genome. Owing to the rapid increase in throughputs and decrease in costs of next- generation sequencing, RNA-Seq in particular has become the method of choice. However, the very short reads (e.g. 2 × 90 bp paired ends) from next generation sequencing makes de novo assembly to recover complete or full-length transcript sequences an algorithmic challenge. RESULTS: Here, we present SOAPdenovo-Trans, a de novo transcriptome assembler designed specifically for RNA-Seq. We evaluated its performance on transcriptome datasets from rice and mouse. Using as our benchmarks the known transcripts from these well-annotated genomes (sequenced a decade ago), we assessed how SOAPdenovo-Trans and two other popular transcriptome assemblers handled such practical issues as alternative splicing and variable expression levels. Our conclusion is that SOAPdenovo-Trans provides higher contiguity, lower redundancy and faster execution. AVAILABILITY AND IMPLEMENTATION: Source code and user manual are available at http://sourceforge.net/projects/soapdenovotrans/.
MOTIVATION: Transcriptome sequencing has long been the favored method for quickly and inexpensively obtaining a large number of gene sequences from an organism with no reference genome. Owing to the rapid increase in throughputs and decrease in costs of next- generation sequencing, RNA-Seq in particular has become the method of choice. However, the very short reads (e.g. 2 × 90 bp paired ends) from next generation sequencing makes de novo assembly to recover complete or full-length transcript sequences an algorithmic challenge. RESULTS: Here, we present SOAPdenovo-Trans, a de novo transcriptome assembler designed specifically for RNA-Seq. We evaluated its performance on transcriptome datasets from rice and mouse. Using as our benchmarks the known transcripts from these well-annotated genomes (sequenced a decade ago), we assessed how SOAPdenovo-Trans and two other popular transcriptome assemblers handled such practical issues as alternative splicing and variable expression levels. Our conclusion is that SOAPdenovo-Trans provides higher contiguity, lower redundancy and faster execution. AVAILABILITY AND IMPLEMENTATION: Source code and user manual are available at http://sourceforge.net/projects/soapdenovotrans/.
Authors: Michela L Mitchell; Gerry Q Tonkin-Hill; Rodrigo A V Morales; Anthony W Purcell; Anthony T Papenfuss; Raymond S Norton Journal: Mar Biotechnol (NY) Date: 2020-01-24 Impact factor: 3.619
Authors: Tessa B Moyer; Lilian R Heil; Christine L Kirkpatrick; Dennis Goldfarb; William A Lefever; Nicole C Parsley; Andrew J Wommack; Leslie M Hicks Journal: J Nat Prod Date: 2019-09-26 Impact factor: 4.050
Authors: Kylie G Bendele; Felix D Guerrero; Connor Cameron; Deanna M Bodine; Robert J Miller Journal: Exp Appl Acarol Date: 2019-09-24 Impact factor: 2.132
Authors: Vincent Lavergne; Ivon Harliwong; Alun Jones; David Miller; Ryan J Taft; Paul F Alewood Journal: Proc Natl Acad Sci U S A Date: 2015-07-06 Impact factor: 11.205
Authors: Kim L Johnson; Andrew M Cassin; Andrew Lonsdale; Antony Bacic; Monika S Doblin; Carolyn J Schultz Journal: Plant Physiol Date: 2017-04-26 Impact factor: 8.340
Authors: Kate Rockenbach; Justin C Havird; J Grey Monroe; Deborah A Triant; Douglas R Taylor; Daniel B Sloan Journal: Genetics Date: 2016-10-05 Impact factor: 4.562