Wenji Wang, Qilin Yi, Liman Ma, Xiaosu Zhou, Haitao Zhao, Xubo Wang, Jie Qi, Haiyang Yu, Zhigang Wang, Quanqi Zhang.
Abstract
BACKGROUND: Half-smooth tongue sole (Cynoglossus semilaevis) is a valuable aquaculture fish in China. The species exhibits sexual dimorphism, most notably in growth rate and body size between the sexes. C. semilaevis is therefore a good model for investigating the mechanisms underlying such dimorphism, and it can also be used to address fundamental questions in evolution and in applied aquaculture. Advances in second-generation sequencing technology, such as 454 pyrosequencing, could provide a robust tool for studying the genome characteristics of non-model species.
Year: 2014 PMID: 24924151 PMCID: PMC4072885 DOI: 10.1186/1471-2164-15-470
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of 454 transcriptome sequencing and assembly for C. semilaevis
| | Sequencing number | Bases (Mb) | Average length (bp) |
|---|---|---|---|
| Raw sequencing reads | 749,954 | 176.3 | 235.1 |
| Clean reads | 584,419 | 120.5 | 206.2 |
| Contigs | 62,632 | 17 | 272 |
| Singletons | 98,262 | 17 | 173 |
| Unigenes | 150,039 | 32.5 | 216.3 |
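The three numeric columns of the summary table are linked by simple arithmetic: total bases is approximately the sequence count multiplied by the average length. The following Python sketch checks that relationship for each row. The function names and the 0.1 Mb tolerance are illustrative assumptions, not part of the original study.

```python
# Rows copied from the summary table above:
# (label, sequence count, average length in bp, reported total in Mb)
ROWS = [
    ("Raw sequencing reads", 749_954, 235.1, 176.3),
    ("Clean reads", 584_419, 206.2, 120.5),
    ("Contigs", 62_632, 272.0, 17.0),
    ("Singletons", 98_262, 173.0, 17.0),
    ("Unigenes", 150_039, 216.3, 32.5),
]


def estimated_megabases(count, mean_length_bp):
    """Estimate total bases (Mb) as count x average length."""
    return count * mean_length_bp / 1e6


def check_rows(rows, tolerance_mb=0.1):
    """Return {label: True/False} for whether the reported Mb matches the estimate.

    tolerance_mb is an assumed allowance for rounding in the published table.
    """
    return {
        label: abs(estimated_megabases(count, mean_len) - reported_mb) <= tolerance_mb
        for label, count, mean_len, reported_mb in rows
    }


if __name__ == "__main__":
    for label, ok in check_rows(ROWS).items():
        print(f"{label}: {'consistent' if ok else 'inconsistent'}")
```

The same check can be applied to any row of a sequencing summary in which a count, an average length, and a total are all reported.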
Figure 1. Overview of the transcriptome sequencing and assembly. (A) Size distribution for raw reads. (B) Size distribution for contigs. (C) Log-log plot showing the dependence of contig length on the number of reads assembled into each contig.
Comparison with other fish transcriptomes sequenced by 454 pyrosequencing
| Species | Average raw-read length (bp) | Number of raw reads | Total raw-read bases (Mb) | Average contig length (bp) | Number of contigs | Total contig bases (Mb) |
|---|---|---|---|---|---|---|
| | 266 | 310,079 | 82.5 | 530.6 | 19,631 | 10.4 |
| | 202.3 | 1,665,609 | 336.9 | 464.8 | 54,921 | 25.5 |
| | 344 | 1,416,404 | 447 | 662 | 151,847 | 100.5 |
| C. semilaevis | 235.1 | 749,954 | 176.3 | 272 | 62,632 | 17 |
Figure 2. Functional annotation of assembled sequences based on gene ontology (GO) categorization. (A) Cellular component. (B) Biological process. (C) Molecular function.
Figure 3. Abundance distribution of transposable elements in the unigenes of C. semilaevis. The blue bars represent retroelements, while the red bars represent DNA transposons.
Relationship between the number of putative SNPs and indels and the number of minor-allele reads
| Reads number | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
|---|---|---|---|---|---|---|---|
| SNP number | 21,234 | 8,284 | 4,631 | 3,013 | 2,076 | 1,535 | 1,222 |
| Indel number | 13,370 | 5,072 | 2,715 | 1,698 | 1,147 | 847 | 629 |
Reads number is the minimum number of reads supporting the minor allele.
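Read this way, each column of the table counts the candidate variants whose minor allele is supported by at least the stated number of reads, so the counts can only fall as the threshold rises. The Python sketch below illustrates that cumulative filtering. The function name and the example read counts are hypothetical, and the sketch does not reproduce the variant-calling procedure used in the study.

```python
def count_passing(minor_allele_reads, thresholds):
    """Count candidates whose minor allele has at least `t` supporting reads.

    minor_allele_reads: one integer per candidate variant (hypothetical input).
    thresholds: iterable of minimum read counts to evaluate.
    Returns a dict mapping each threshold to the number of passing candidates.
    """
    reads = list(minor_allele_reads)
    return {t: sum(1 for n in reads if n >= t) for t in thresholds}


if __name__ == "__main__":
    # Made-up minor-allele read counts for seven candidate variants.
    example_reads = [2, 2, 3, 5, 8, 2, 4]
    for threshold, n_pass in count_passing(example_reads, range(2, 9)).items():
        print(f"min reads {threshold}: {n_pass} candidates")
```

A stricter threshold trades sensitivity for confidence: fewer candidates survive, but each is backed by more independent reads.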