| Literature DB >> 25084827 |
Shorash Amin, Peter J Prentis, Edward K Gilding, Ana Pavasovic1.
Abstract
BACKGROUND: The sequencing, de novo assembly and annotation of transcriptome datasets generated with next generation sequencing (NGS) has enabled biologists to answer genomic questions in non-model species with unprecedented ease. Reliable and accurate de novo assembly and annotation of transcriptomes, however, is a critically important step for transcriptome assemblies generated from short read sequences. Typical benchmarks for assembly and annotation reliability have been performed with model species. To address the reliability and accuracy of de novo transcriptome assembly in non-model species, we generated an RNAseq dataset for an intertidal gastropod mollusc species, Nerita melanotragus, and compared the assembly produced by four different de novo transcriptome assemblers; Velvet, Oases, Geneious and Trinity, for a number of quality metrics and redundancy.Entities:
Mesh:
Year: 2014 PMID: 25084827 PMCID: PMC4124492 DOI: 10.1186/1756-0500-7-488
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Figure 1Black nerite . Black nerite displaying external morphology (A) and black nerite displaying tan/brown colouration of its operculum (B).
Assembly quality metrics
| Number of contigs | 3 090 | 10 886 | 78 306 | 112 762 |
| Average contig length | 175 | 293 | 111 | 140 |
| Longest contig | 1 700 | 1 618 | 458 | 711 |
| N50 | 149 | 258 | 107 | 124 |
Assembly statistics for the transcriptomes produced by the four different short read de novo assemblers.
Annotation results
| Without blast result | 0 (0%) | 0 (0%) |
| Without blast hits | 8823 (81%) | 2615 (84.6%) |
| With blast result | 301 (2.7%) | 66 (2.1%) |
| With mapping result | 177 (1.6%) | 28 (0.9%) |
| Annotated sequences | 1585 (14.5%) | 381 (12.3%) |
| Total sequences | 10886 | 3090 |
The number of contigs allocated to different annotation categories for the Trinity and Oases assemblies.
Figure 2GO category assignment. Comparative analysis and functional classification of the top 20 GO terms for the Trinity and Oases assembly.
Figure 3BLAST top hit species distribution. The 20 species most commonly represented in BLAST hits for Trinity and Oases assemblies.