| Literature DB >> 21720557 |
Rui Hou1, Zhenmin Bao, Shan Wang, Hailin Su, Yan Li, Huixia Du, Jingjie Hu, Shi Wang, Xiaoli Hu.
Abstract
BACKGROUND: Bivalves comprise 30,000 extant species, constituting the second largest group of mollusks. However, limited genetic research has focused on this group of animals so far, which is, in part, due to the lack of genomic resources. The advent of high-throughput sequencing technologies enables generation of genomic resources in a short time and at a minimal cost, and therefore provides a turning point for bivalve research. In the present study, we performed de novo transcriptome sequencing to first produce a comprehensive expressed sequence tag (EST) dataset for the Yesso scallop (Patinopecten yessoensis).Entities:
Mesh:
Year: 2011 PMID: 21720557 PMCID: PMC3123371 DOI: 10.1371/journal.pone.0021560
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of 454 transcriptome sequencing and assembly for P. yessoensis.
| Reads (n) | Bases (Mb) | Average length (bp) | |
| Raw sequencing reads | 970,422 | 303.8 | 313.1 |
| Clean reads | 805,330 | 230.7 | 286.5 |
| Contigs | 32,590 | 20.2 | 618.4 |
| Singletons | 106,807 | 28.1 | 262.9 |
| Total | 139,397 | 48.3 | 346.0 |
Figure 1Overview of the P. yessoensis transcriptome sequencing and assembly.
(A) Size distribution of 454 sequencing reads after removal of adaptor and short sequences (<60 bases). (B) Size distribution of contigs. (C) Log-log plot showing the dependence of contig lengths on the number of reads assembled into each contig.
Functional annotation of the P. yessoensis transcriptome.
| ESTs (unique genes) | |||
| All sequences | ≥300 bp | ≥1000 bp | |
| Total number of sequences | 139,397 | 72,077 | 3,978 |
| ESTs with BLAST matches against Nr | 38,536 (28,864) | 27,971 (22,067) | 2,783 (2,549) |
| ESTs with BLAST matches against Swiss-Prot | 29,195 (17,647) | 21,932 (14,472) | 2,455 (2,259) |
| ESTs assigned with GO terms | 15,530(9,290) | 17,027 (7,921) | 1,873 (1,607) |
| ESTs assigned with EC numbers | 4,846 (3,209) | 4,454 (3,078) | 990 (894) |
Figure 2Functional annotation of assembled sequences based on gene ontology (GO) categorization.
GO analysis was performed at the level 2 for three main categories (cellular component, molecular function and biological process).
Summary of simple sequence repeat (SSR) types in the P. yessoensis transcriptome.
| SSR Type | No. of SSR-containing ESTs | No. of SSRs | % of total SSRs |
| Di-nucleotides | 557 | 580 | 21.1% |
| Tri-nucleotides | 936 | 1,084 | 39.4% |
| Tetra-nucleotides | 404 | 426 | 15.5% |
| Penta-nucleotides | 387 | 401 | 14.6% |
| Hexa-nucleotides | 245 | 257 | 9.4% |
| Total | 2,480 | 2,748 | 100% |
Figure 3Classification of single nucleotide polymorphisms (SNPs) identified in the P. yessoensis transcriptome.
The overall frequency of these SNP types in P. yessoensis transcriptome is one per 156 bp.