| Literature DB >> 28135340 |
Khor Waiho1, Hanafiah Fazhan1, Md Sheriff Shahreza1,2, Julia Hwei Zhong Moh1, Shaibani Noorbaiduri1, Li Lian Wong1, Saranya Sinnasamy3, Mhd Ikhwanuddin1.
Abstract
Adequate genetic information is essential for sustainable crustacean fisheries and aquaculture management. The commercially important orange mud crab, Scylla olivacea, is prevalent in Southeast Asia region and is highly sought after. Although it is a suitable aquaculture candidate, full domestication of this species is hampered by the lack of knowledge about the sexual maturation process and the molecular mechanisms behind it, especially in males. To date, data on its whole genome is yet to be reported for S. olivacea. The available transcriptome data published previously on this species focus primarily on females and the role of central nervous system in reproductive development. De novo transcriptome sequencing for the testes of S. olivacea from immature, maturing and mature stages were performed. A total of approximately 144 million high-quality reads were generated and de novo assembled into 160,569 transcripts with a total length of 142.2 Mb. Approximately 15-23% of the total assembled transcripts were annotated when compared to public protein sequence databases (i.e. UniProt database, Interpro database, Pfam database and Drosophila melanogaster protein database), and GO-categorised with GO Ontology terms. A total of 156,181 high-quality Single-Nucleotide Polymorphisms (SNPs) were mined from the transcriptome data of present study. Transcriptome comparison among the testes of different maturation stages revealed one gene (beta crystallin like gene) with the most significant differential expression-up-regulated in immature stage and down-regulated in maturing and mature stages. This was further validated by qRT-PCR. In conclusion, a comprehensive transcriptome of the testis of orange mud crabs from different maturation stages were obtained. This report provides an invaluable resource for enhancing our understanding of this species' genome structure and biology, as expressed and controlled by their gonads.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28135340 PMCID: PMC5279790 DOI: 10.1371/journal.pone.0171095
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Primers used in quantitative real-time polymerase chain reaction.
| Gene name | Primers | Sequence | Target size | Reference |
|---|---|---|---|---|
| Beta crystallin like gene | BCG-F | 103 bp | - | |
| BCG-R | ||||
| 18S rRNA | qRT-F | - | Wang et al. [ | |
| qRT-R |
Summary of assembly statistics.
| Immature crabs | Maturing crabs | Mature crabs | |
|---|---|---|---|
| Total reads | 76,337,338 | 64,928,802 | 30,841,304 |
| Total bases (bp) | 7,710,071,138 | 6,557,809,002 | 3,114,971,704 |
| Total reads | 65,859,364 | 55,873,266 | 22,828,074 |
| Total bases (bp) | 6,578,177,998 | 5,579,213,298 | 2,261,133,698 |
| Percentage of clean reads (%) | 86.27 | 86.05 | 74.02 |
| Percentage of clean bases (%) | 85.32 | 85.08 | 72.59 |
| Total Reads | 144,560,704 | ||
| Reads Aligned | 125,050,327 | ||
| % Reads Aligned | 86.50 | ||
| Assembled Transcripts Length (in bases) | 142,192,028 | ||
| Total Assembled Transcripts Covered (in bases) | 140,431,613 | ||
| % Total Assembled Transcripts Covered | 98.76 | ||
| Average Read Depth | 73.39 | ||
| Number of assembled transcripts | 160,569 | ||
| Total size of assembled transcripts (bp) | 142,192,028 | ||
| Longest assembled transcripts (bp) | 16,041 | ||
| Shortest assembled transcripts (bp) | 300 | ||
| Number of assembled transcripts > 1K nt | 39,060 | ||
| Number of transcripts > 10K nt | 49 | ||
| Mean assembled transcripts size | 886 | ||
| N50 assembled transcripts length | 1,225 | ||
| Assembled transcripts %A | 26.87 | ||
| Assembled transcripts %C | 22.96 | ||
| Assembled transcripts %G | 23.86 | ||
| Assembled transcripts %T | 26.31 | ||
| Assembled transcripts %N | 0 | ||
| Assembled transcripts %non-ACGTN | 0 | ||
Fig 1Graphical length distribution summary of transcripts identified in S. olivacea transcriptome data sets.
Fig 2Similarity distribution of BLAST hits.
Top 10 organism hits of assembled transcripts in S. olivacea transcriptome against UniProt database.
| Organism scientific name (common name) | Taxonomy | Matched assembled transcripts (n) | ||
|---|---|---|---|---|
| Phylum | Subphylum | Class | ||
| Arthropoda | Hexapoda | Insecta | 4,170 | |
| Arthropoda | Crustacea | Branchiopoda | 2,444 | |
| Arthropoda | Myriapoda | Chilopoda | 1,817 | |
| Arthropoda | Chelicerata | Arachnida | 1,284 | |
| Echinodermata | Echinozoa | Echinoidea | 1,123 | |
| Arthropoda | Hexapoda | Insecta | 927 | |
| Arthropoda | Hexapoda | Insecta | 816 | |
| Arthropoda | Hexapoda | Insecta | 774 | |
| Chordata | Cephalochordata | Leptocardii | 731 | |
| Annelida | - | Polychaeta | 593 | |
Fig 3Species distribution of top ten BLAST hits against UniProt database and D. melanogaster protein database, with a 1e-5 cut-off threshold.
Top 20 annotations of S. olivacea transcriptome with the highest bit score.
| Description | Accession ID | Organism Scientific name (common name) | Alignment length (amino acids) | E value | Bit Score | Type |
|---|---|---|---|---|---|---|
| Cj-cadherin | Q5CCS4 | 3007 | 0 | 5472 | Full length | |
| Dynein heavy chain, cytoplasmic | A0A067RE92 | 3298 | 0 | 5434 | Full length | |
| Uncharacterized protein | K7J7S2 | 4156 | 0 | 5202 | Partial | |
| Projectin | Q86GD6 | 2694 | 0 | 4756 | Partial | |
| Pre-mRNA-processing-splicing factor, putative | E0VM49 | 2373 | 0 | 4354 | Full length | |
| Uncharacterized protein | T1JAK1 | 3404 | 0 | 4342 | Partial | |
| Spectrin alpha chain | A0A067RUI8 | 2422 | 0 | 4045 | Full length | |
| Laminin subunit alpha | A0A067R415 | 3616 | 0 | 3516 | Full length | |
| Talin-1 | A0A067R9F3 | 2522 | 0 | 3336 | Partial | |
| Spectrin beta chain | A0A067R2J7 | 2151 | 0 | 3322 | Full length | |
| Ciliary dynein heavy chain, putative | E0VLA6 | 2193 | 0 | 3212 | Partial | |
| Myosin Va | F6K356 | 1778 | 0 | 3203 | Full length | |
| Putative uncharacterized protein | E9G1C9 | 2247 | 0 | 3094 | Full length | |
| Putative uncharacterized protein | D6X207 | 2254 | 0 | 3049 | Full length | |
| Dynein beta chain, ciliary | P39057 | 2780 | 0 | 3025 | Partial | |
| Clathrin heavy chain | A0A067RP81 | 1683 | 0 | 2978 | Full length | |
| Target of rapamycin | B5M076 | 2495 | 0 | 2963 | Full length | |
| Fatty acid synthase | F8RHR0 | 2445 | 0 | 2928 | Partial | |
| DNA-directed_RNA_polymerase | V5YTD8 | 1552 | 0 | 2916 | Full length | |
| Putative U5 small nuclear ribonucleoprotein 200 kDa helicase | A0A067RJY2 | 1891 | 0 | 2883 | Partial |
Fig 4Gene ontologies.
Transcript counts (log10) for Gene Ontology (GO) classification of the S. olivacea transcriptome for biological process (black), cellular component (white) and molecular function (grey) categories.
Number of hits of selected GO child terms and co-occurring terms based on multicellular organismal development (GO:0007275).
| GO ID | GO term | Hits | Example Sequence | E value | Accession ID |
|---|---|---|---|---|---|
| GO:0007349 | Cellularization | 2 | Vielfältig, isoform D | 9e-10 | M9NES1 |
| GO:0007566 | Embryo implantation | 2 | Uncharacterized protein | 1e-73 | F6SMD1 |
| GO:0009790 | Embryo development | 2 | Trip12 protein | 1e-139 | Q3KR60 |
| GO:0009791 | post-embryonic development | 1 | Uncharacterized protein | 1e-21 | ALDH5A1 |
| GO:0030237 | Female sex determination | 1 | Protein Wnt | 1e-27 | WNT4 |
| GO:0030238 | Male sex determination | 1 | Tyrosine-protein kinase receptor | 4e-139 | IGF1R |
| GO:0007283 | Spermatogenesis | 11 | Gilgamesh isoform E | 0 | Q59DW8 |
| GO:0010468 | Regulation of gene expression | 2 | Putative uncharacterized protein | 1e-43 | Q8CDC6 |
| GO:1900194 | Negative regulation of oocyte maturation | 2 | Uncharacterized protein | 1e-10 | F6Q3S2 |
| GO:1902436 | Negative regulation of male mating behaviour | 1 | Putative uncharacterized protein | 1e-43 | Q8CDC6 |
| GO:0061369 | Negative regulation of testicular blood vessel morphogenesis | 1 | Protein Wnt | 1e-27 | F6UNR8 |
| GO:0051781 | Positive regulation of cell division | 10 | PDGF-and VEGF-related factor | 6e-10 | V9IG02 |
| GO:0050793 | Regulation of developmental process | 5 | Notch protein | 0 | A0MK40 |
| GO:0040014 | Regulation of multicellular organism growth | 3 | Amyloid beta protein | 8e-63 | M1EDY4 |
| GO:0040034 | Regulation of development, heterochronic | 2 | Hunchback transcription factor | 2e-114 | C4PGG7 |
| GO:0048047 | Mating behaviour, sex discrimination | 1 | Putative uncharacterized protein | 1e-43 | Q8CDC6 |
| GO:0007617 | Mating behaviour | 1 | Amyloid beta protein | 8e-63 | M1EDY4 |
Selected regulators (i.e. proteins) of growth, sexual differentiation and reproduction expressed in the testis of S. olivacea.
| Identity | Accession ID | Hit organism | Similarity (%) | E value | Example Query ID |
|---|---|---|---|---|---|
| Vitellogenesis-inhibiting hormone (VIH) | V9ZBG5 | 99.2 | 5e-85 | Ref_Crab_Transcript_38360_732 | |
| Crustacean hyperglycemic hormone (CHH) | A5A599 | 99.28 | 2e-91 | Ref_Crab_Transcript_59696_1668 | |
| Neuropeptide | B4IC30 | 51.43 | 2e-11 | Ref_Crab_Transcript_54220_1201 | |
| Neurotrophin | G5CJW4 | 68.57 | 2e-53 | Ref_Crab_Transcript_58755_445 | |
| Neuroparsin | A0A023PY98 | 47.13 | 8e-12 | Ref_Crab_Transcript_70043_2913 | |
| SOX14 protein | B9VWK7 | 99.48 | 0 | Ref_Crab_Transcript_45440_2864 | |
| VASA-like protein variant | E5FQX4 | 99.05 | 0 | Ref_Crab_Transcript_27201_2369 | |
| Sex-lethal | V9PP85 | 98.71 | 1e-97 | Ref_Crab_Transcript_49023_2008 | |
| Piwi-like protein | X2CS90 | 96.65 | 0 | Ref_Crab_Transcript_53740_1185 | |
| Doublesex and mab-3 related transcription-like protein | D7REN5 | 95.7 | 3e-56 | Ref_Crab_Transcript_69520_1864 | |
| Doublesex and mab-3 related transcription factor 11E | X2D7J9 | 86.67 | 2e-12 | Ref_Crab_Transcript_31999_445 | |
| Male-specific lethal 3-like protein | A0A067R1H2 | 82.98 | 3e-41 | Ref_Crab_Transcript_88052_3530 | |
| VASA | A0A023JMC5 | 80 | 3e-18 | Ref_Crab_Transcript_23777_903 | |
| Krueppel-like factor 10 | Q13118 | 100 | 2e-79 | Ref_Crab_Transcript_160024_379 | |
| Growth factor receptor-bound protein 2 | U6DGZ9 | 100 | 4e-83 | Ref_Crab_Transcript_151979_370 | |
| Transforming growth factor-beta regulator I | H9B3Y8 | 99.27 | 1e-91 | Ref_Crab_Transcript_149715_1563 | |
| Sex combs reduced | A0A059PB91 | 93.58 | 3e-48 | Ref_Crab_Transcript_44496_829 | |
| Early growth response protein 3 | A0A067R8D8 | 90.71 | 8e-87 | Ref_Crab_Transcript_152664_761 | |
| Male reproductive-related LIM protein | B8LG57 | 89.13 | 1e-33 | Ref_Crab_Transcript_29874_1367 | |
| Prostaglandin E synthase 2 | M1F4P3 | 78.92 | 0 | Ref_Crab_Transcript_59927_1959 | |
| Prostaglandin F synthase | M1F418 | 77.92 | 1e-170 | Ref_Crab_Transcript_36654_1601 | |
| Up-regulated during skeletal muscle growth protein 5 | A0A067QSY5 | 69.23 | 2e-14 | Ref_Crab_Transcript_21795_733 | |
| Fibroblast growth factor receptor substrate 2 | A0A067RCV1 | 66.67 | 3e-40 | Ref_Crab_Transcript_50850_3332 | |
| Putative transforming growth factor beta receptor 1 | L7MGX7 | 63.67 | 0 | Ref_Crab_Transcript_59348_1873 | |
| Inhibitor of growth protein | A0A067QYK9 | 66.91 | 1e-52 | Ref_Crab_Transcript_62117_1271 | |
| Epidermal growth factor receptor | A0A067R240 | 68.7 | 0 | Ref_Crab_Transcript_84884_4296 | |
| Vascular endothelial growth factor receptor 2 | A0A067QWZ0 | 69.81 | 8e-16 | Ref_Crab_Transcript_127316_361 | |
| Estrogen-related receptor | D2Y1A7 | 100 | 6e-22 | Ref_Crab_Transcript_7743_1209 | |
| Insulin-like androgenic gland hormone | A0A075INW9 | 98.51 | 7e-39 | Ref_Crab_Transcript_35719_619 | |
| Red-pigment concentrating hormone | U3PE66 | 97.8 | 1e-18 | Ref_Crab_Transcript_16600_356 | |
| Bursicon hormone alpha subunit | C3S7D8 | 96.88 | 6e-19 | Ref_Crab_Transcript_2620_1085 | |
| E75 nuclear receptor | Q3I5Q8 | 95.55 | 0 | Ref_Crab_Transcript_69307_4236 | |
| Prohormone convertase | D0UJV3 | 93.55 | 2e-9 | Ref_Crab_Transcript_15118_512 | |
| Pigment dispersing hormone receptor | C6L2K2 | 80.85 | 1e-16 | Ref_Crab_Transcript_149701_305 | |
| Growth hormone secretagogue receptor type | A0A026WUM1 | 78.79 | 5e-7 | Ref_Crab_Transcript_136683_553 | |
| Lutropin-choriogonadotropic hormone receptor | A0A067QJD2 | 76.32 | 3e-32 | Ref_Crab_Transcript_5206_458 | |
| Gonadotropin-releasing hormone receptor | A0A087TGG2 | 62.14 | 5e-55 | Ref_Crab_Transcript_35602_1201 | |
| Juvenile hormone epoxide hydrolase 2 | V9IEI5 | 60.42 | 2e-11 | Ref_Crab_Transcript_96004_451 | |
| Retinoid-X receptor-2 | S4TH64 | 99.49 | 0 | Ref_Crab_Transcript_60375_1703 | |
| Ecdysteroid receptor | O76246 | 95.81 | 2e-47 | Ref_Crab_Transcript_141583_1159 | |
| Ecdysteroid receptor 3 | I6UZ31 | 92.94 | 2e-99 | Ref_Crab_Transcript_46583_2256 | |
| Ecdysteroid receptor 2 | I6V8K3 | 87.65 | 0 | Ref_Crab_Transcript_46581_2790 | |
| Putative ecdysteroids/dopamine receptor | D6WWZ1 | 59.09 | 4e-79 | Ref_Crab_Transcript_110825_1460 | |
Fig 5Distribution of putative single nucleotide polymorphisms (SNPs) in S. olivacea sequences.
Differential expression analysis of S. olivacea at gene level.
| All genes | Significant differentially expressed genes (Padj < 0.05) | |||||
|---|---|---|---|---|---|---|
| Cond. 1 | Cond. 2 | Total | Up-regulation (Cond. 2 > Cond. 1) | Down-regulation (Cond. 2 < Cond. 1) | Up-regulation (Cond. 2 > Cond. 1) | Down-regulation (Cond. 2 < Cond. 1) |
| I | M | 121,288 | 53,598 | 67,690 | 26 | 41 |
| I | R | 125,107 | 62,787 | 62,320 | 65 | 41 |
| M | R | 115,207 | 62,183 | 53,024 | 15 | 12 |
Note: Cond.: condition; I: immature; M: mature; R: maturing.
Fig 6Heatmap of gene expression values depicting clustering of genes between immature (UMT_I, left panel) and mature stages (UMT_M, right panel) based on the expression of mRNAs for a set of significant genes (Padj < 0.05).
Sample names are represented in columns and significant genes are represented in rows. Genes are clustered together based on expression similarity. Low to high expression is represented by a change of colour from red to green, respectively. The colour key scale bar at upper left shows Z-score values for the heatmap.
Fig 7Heatmap of gene expression values depicting clustering of genes between immature (UMT_I, left panel) and mature stages (UMT_R, right panel) based on the expression of mRNAs for a set of significant genes (Padj < 0.05).
Sample names are represented in columns and significant genes are represented in rows. Genes are clustered together based on expression similarity. Low to high expression is represented by a change of colour from red to green, respectively. The colour key scale bar at upper left shows Z-score values for the heatmap.
Fig 8Heatmap of gene expression values depicting clustering of genes between immature (UMT_M, left panel) and mature stages (UMT_R, right panel) based on the expression of mRNAs for a set of significant genes (Padj < 0.05).
Sample names are represented in columns and significant genes are represented in rows. Genes are clustered together based on expression similarity. Low to high expression is represented by a change of colour from red to green, respectively. The colour key scale bar at upper left shows Z-score values for the heatmap.
Selected differentially expressed genes between maturation stages in the S. olivacea testis expression profile.
| Cluster on heatmap | Description | Log2FC | Padj | Regulation | Accession ID of Hit |
|---|---|---|---|---|---|
| I | beta crystallin like gene | -6.78 | 7.74E-11 | − | D3PHS5 |
| III | Aminopeptidase | -5.00 | 3.46E-02 | − | A0A067QSR5 |
| III | Beta-2-microglobulin | -8.18 | 2.22E-03 | − | P16213 |
| I | beta crystallin like gene | -6.78 | 7.74E-11 | − | D3PHS5 |
| III | Aminopeptidase | -5.00 | 3.46E-02 | − | A0A067QSR5 |
| III | Beta-2-microglobulin | -8.18 | 2.22E-03 | − | P16213 |
| III | C-type lysozyme | -6.17 | 1.63E-03 | − | B2R4C5 |
| III | Elongation factor 1-alpha | -9.13 | 2.71E-05 | − | W5PHA3 |
| III | MHC class I antigen | -7.59 | 2.60E-02 | − | R4ZGR1 |
| III | Mobile element protein | -5.36 | 8.51E-03 | − | L7VVN2 |
| III | Prosaposin | -8.65 | 2.00E-04 | − | A0A024QZQ2 |
| III | Transposase | -5.53 | 4.51E-03 | − | G8UKJ2 |
| III | Transposase | -5.28 | 1.21E-02 | − | A3JAS2 |
| III | Transposase | -4.87 | 6.51E-03 | − | A0A037X5S6 |
| III | Transposase mutator type | -5.66 | 2.22E-03 | − | R9CHU0 |
| IV | Actin 1 | -4.08 | 1.27E-02 | − | C5HF65 |
| IV | Gamma-crystallin A | -4.88 | 1.90E-05 | − | D3PIA3 |
| V | Vitellogenin | 3.00 | 4.13E-02 | + | Q9UAR3 |
| VI | Amyloid beta A4 protein | 3.57 | 4.88E-02 | + | A0A067QWW4 |
| VI | Capsid protein | 4.03 | 4.71E-03 | + | D9ZD21 |
| VI | C-type-lectin-like-4 protein | 5.72 | 1.47E-04 | + | W6MNG5 |
| I | Aminopeptidase N | -5.98 | 2.78E-04 | − | A0A067QSR5 |
| I | Gamma-crystallin A | -4.37 | 5.88E-05 | − | D3PIA3 |
| I | TnpC | -4.18 | 4.96E-02 | − | G9HZ26 |
| I | Transposase | -4.15 | 6.26E-03 | − | G8UKJ2 |
| III | Beta-crystallin A1 | -10.18 | 1.35E-19 | − | D3PHS5 |
| IV | Farnesoic acid O-methyltransferase | 3.39 | 1.29E-02 | + | B8X2Z4 |
| IV | FreD | 4.91 | 1.34E-04 | + | A0A068LKH5 |
| IV | Leukocyte elastase inhibitor | 8.95 | 2.67E-06 | + | G7Y5W4 |
| I | C-type lysozyme | 5.85 | 3.23E-03 | + | B2R4C5 |
| I | Putative nuclease HARBI1 | 3.44 | 2.96E-02 | + | A0A067RIF9 |
| II | Beta-2-microglobulin | 7.57 | 4.28E-02 | + | P16213 |
| II | Elongation factor 1-alpha | 8.27 | 3.23E-03 | + | W5PHA3 |
| II | Prosaposin | 8.45 | 1.67E-03 | + | A0A024QZQ2 |
| V | Vitellogenin | -3.27 | 2.96E-02 | − | Q9UAR3 |
| V | Capsid protein | -3.77 | 1.25E-02 | − | E1CI71 |
| V | C-type-lectin-like-4 protein | -6.31 | 9.43E-04 | − | W6MNG5 |
| VI | RNA-dependent RNA polymerase | -6.05 | 2.79E-03 | − | A0A023VRY1 |
Note: Cond.: condition; I: immature; M: mature; R: maturing; Log2FC: Log2FoldChange; Padj: adjusted P-value;
−: down-regulated;
+: up-regulated.
Fig 9Fold difference of beta crystallin like gene in different maturation stages of testis of male S. olivacea. 18S rRNA was used as reference gene.
Different superscript letters within the same row indicate significant differences (P < 0.0001) between different stages of maturation.