Literature DB >> 29746620

Grouper: graph-based clustering and annotation for improved de novo transcriptome analysis.

Laraib Malik1, Fatemeh Almodaresi1, Rob Patro1.   

Abstract

Motivation: De novo transcriptome analysis using RNA-seq offers a promising means to study gene expression in non-model organisms. Yet, the difficulty of transcriptome assembly means that the contigs provided by the assembler often represent a fractured and incomplete view of the transcriptome, complicating downstream analysis. We introduce Grouper, a new method for clustering contigs from de novo assemblies that are likely to belong to the same transcripts and genes; these groups can subsequently be analyzed more robustly. When provided with access to the genome of a related organism, Grouper can transfer annotations to the de novo assembly, further improving the clustering.
Results: On de novo assemblies from four different species, we show that Grouper is able to accurately cluster a larger number of contigs than the existing state-of-the-art method. The Grouper pipeline is able to map greater than 10% more reads against the contigs, leading to accurate downstream differential expression analyses. The labeling module, in the presence of a closely related annotated genome, can efficiently transfer annotations to the contigs and use this information to further improve clustering. Overall, Grouper provides a complete and efficient pipeline for processing de novo transcriptomic assemblies. Availability and implementation: The Grouper software is freely available at https://github.com/COMBINE-lab/grouper under the 2-clause BSD license. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2018        PMID: 29746620     DOI: 10.1093/bioinformatics/bty378

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

Review 1.  A simple guide to de novo transcriptome assembly and annotation.

Authors:  Venket Raghavan; Louis Kraft; Fantin Mesny; Linda Rigerte
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

2.  De Novo Clustering of Long-Read Transcriptome Data Using a Greedy, Quality Value-Based Algorithm.

Authors:  Kristoffer Sahlin; Paul Medvedev
Journal:  J Comput Biol       Date:  2020-03-16       Impact factor: 1.479

3.  CB2 improves power of cell detection in droplet-based single-cell RNA sequencing data.

Authors:  Zijian Ni; Shuyang Chen; Jared Brown; Christina Kendziorski
Journal:  Genome Biol       Date:  2020-06-08       Impact factor: 13.583

4.  Compacta: a fast contig clustering tool for de novo assembled transcriptomes.

Authors:  Fernando G Razo-Mendivil; Octavio Martínez; Corina Hayano-Kanashiro
Journal:  BMC Genomics       Date:  2020-02-11       Impact factor: 3.969

5.  Regulatory context drives conservation of glycine riboswitch aptamers.

Authors:  Matt Crum; Nikhil Ram-Mohan; Michelle M Meyer
Journal:  PLoS Comput Biol       Date:  2019-12-20       Impact factor: 4.475

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.