| Literature DB >> 29989595 |
Yin Zhang1,2, Guidong Miao1,2, Qingyang Wu1,2, Fan Lin1,2, Cuihong You1,2, Shuqi Wang1,2, Jude Juventus Aweya1,2, Hongyu Ma1,2.
Abstract
Crab culture has gained prominence in the last decade due to the large global market demand for live crabs and crab products. Portunus sanguinolentus is one of the economically important crab species in the Indo-Pacific region, with distinct differences in growth and size between male and female crabs, thus, leading to huge difference in their market values. The culture of P. sanguinolentus is still in its infancy, with crab supplies heavily dependent on wild catch. In order to unravel the molecular differences between male and female crabs, we generated a comprehensive transcriptomic dataset for P. sanguinolentus by sequencing the gonads of both sexes using the Illumina Hiseq 2500 system. Transcriptomes were assembled using Trinity de novo assembly followed by annotation. This transcriptomic data set for P. sanguinolentus would serve as an important reference data for genomic and genetic studies in this crab and related species.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29989595 PMCID: PMC6038849 DOI: 10.1038/sdata.2018.131
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Characteristics of transcriptome sequencing project of the Portunus sanguinolentus.
| Item | Description |
|---|---|
| Investigation type | Eukaryote transcriptome |
| Sampling date | 13 Mar 2017 |
| Geographic location | 23°21′21.47″N, 116°40′38.61″E |
| Temperature | 18–20 ºC |
| Tissue type | Testes and ovaries |
| Developmental stage | Adult crabs (ovary stage III-IV; testis stage II-III) |
| Size | Females: 178.60±60.42 g, Males: 145.05±16.58 g |
| Sequencing technology | Illumina Hiseq 2500 |
| Assembly | Trinity |
| Finishing strategy | Contigs |
| Data accessibility | Bioproject PRJNA415670 |
Sequencing, assembly and annotation summary statistics for male (HXX) and female (HXC) Portunus sanguinolentus.
| Sample | HXC | HXX | All |
|---|---|---|---|
| Total Raw Reads | 80,685,590 | 94,249,998 | 174,935,588 |
| Total Clean Reads | 77,875,810 | 89,125,386 | 167,001,196 |
| Total Clean Nucleotides (nt) | 11,681,371,500 | 13,368,807,900 | 25,050,179,400 |
| Q20 percentage | 94.09% | 94.80% | |
| Q30 percentage | 86.75 | 87.95 | |
| N percentage | 0.00% | 0.00% | |
| GC percentage | 48.97% | 50.60% | |
| contig | |||
| Total Number | 138,461 | 226,875 | |
| Total Length(nt) | 59,236,035 | 76,708,954 | |
| Mean Length(nt) | 428 | 338 | |
| N50 | 976 | 547 | |
| unigene | |||
| Total Number | 96,434 | 149,359 | 119,718 |
| Total Length(nt) | 78,041,034 | 92,887,016 | 108,264,060 |
| Mean Length(nt) | 809 | 622 | 904 |
| N50 | 1830 | 1318 | 1712 |
| Total Consensus Sequences | 96,434 | 149,359 | 119,718 |
| Distinct Clusters | 20,187 | 22,667 | 30,415 |
| Distinct Singletons | 76,247 | 126,692 | 89,303 |
| annotation | |||
| NR | 38,909 | ||
| NT | 24,641 | ||
| Swiss-Prot | 31,849 | ||
| KEGG | 29,103 | ||
| COG | 14,937 | ||
| GO | 18,406 | ||
| All annotation unigenes | 47,536 | ||
| SSR | 93,196 | ||
| SNP | 151,626 | 97,364 | |
Figure 1COG functional classification of the Portunus sanguinolentus transcriptome.
Figure 2Gene ontology (GO) assignment of assembled unigenes of Portunus sanguinolentus.
GO classification analysis of Unigenes in All-Unigene. GO functions is showed in X-axis. The right Y-axis shows the number of genes which have the GO function.
Summary of KEGG pathway and GO annotation for all unigenes.
| All unigenes | |
|---|---|
| Number of unigenes | |
| Top KEGG pathway | |
| metabolic pathways | 3,609 (12.4%) |
| regulation of actin cytoskeleton | 1,622 (5.57%) |
| amoebiasis | 1,356 (4.66%) |
| Vibrio cholerae infection | 1,222 (4.2%) |
| focal adhesion | 1,205 (4.14%) |
| Top GO annotation | |
| biological process | |
| cellular process | 12,963 |
| single-organism process | 10,591 |
| metabolic process | 9,889 |
| biological regulation | 7,265 |
| regulation of biological process | 6,647 |
| cellular_component | |
| cell | 10,673 |
| cell part | 10,660 |
| organelle | 7,413 |
| membrane | 5,837 |
| organelle part | 4,415 |
| molecular function | |
| binding | 9,392 |
| catalytic activity | 7,736 |
| transporter activity | 1,358 |
| molecular transducer activity | 655 |
| receptor activity | 550 |
| Top GO annotation in different levels | |
| membrane (Level 1) | 1,888 |
| protein binding (Level 2) | 1,732 |
| cytoplasm (Level 5) | 1,497 |
| nucleus (Level 7) | 1,480 |
| integral component of membrane (Level 4) | 1,456 |
| binding (Level 1) | 1,056 |
| ATP binding (Level 8) | 972 |
| plasma membrane (Level 4) | 935 |
| metabolic process (Level 1) | 905 |
| cytosol (Level 7) | 876 |
Figure 3Species that match to the annotated sequences of Portunus sanguinolentus.
a: The E-value distribution of the result of NR annotation. b: The similarity distribution of the result of NR annotation. c: The species distribution of the result of NR annotation.
Figure 4Quantity statistics of SSR classification.
The X-axis is the repeat times of repeat units. The Y-axis is the number of SSRs.
Figure 5Statistics of SNP number.
The X-axis is SNP types, the Y-axis is the number of SNP. HXX represents the male Portunus sanguinolentus and HXC represents female Portunus sanguinolentus.