Shubo Jin, Hongtuo Fu, Qiao Zhou, Shengming Sun, Sufei Jiang, Yiwei Xiong, Yongsheng Gong, Hui Qiao, Wenyi Zhang.
Abstract
BACKGROUND: The oriental river prawn, Macrobrachium nipponense, is an important aquaculture species in China and across Asia more broadly. The androgenic gland produces hormones that play crucial roles in male sexual differentiation. This study is the first de novo M. nipponense transcriptome analysis using cDNA prepared from mRNA isolated from the androgenic gland. Sequencing was performed on the Illumina/Solexa platform.
METHODOLOGY AND PRINCIPAL FINDINGS: The total amount of RNA in the sample was more than 5 µg. We generated 70,853,361 high-quality reads after removing adapter sequences and filtering out low-quality reads. Clustering and assembly of the clean reads yielded 78,408 isosequences, corresponding to 57,619 non-redundant transcripts with an average length of 1244.19 bp. In total, 70,702 isosequences were matched to the Nr database; additional annotation was performed by GO (33,203), KEGG (17,868), and COG (13,817) analyses to identify potential genes and their functions. A total of 47 sex-determination-related gene families were identified from the M. nipponense androgenic gland transcriptome, based on the functional annotation of non-redundant transcripts and comparison with the published literature. Furthermore, 40 candidate novel genes were found that may contribute to sex determination, based on their extremely high expression levels in the androgenic gland compared with other sex glands. In addition, 437 SSRs and 65,535 high-confidence SNPs were identified in this EST dataset, from which 14 EST-SSR markers have been isolated.
Year: 2013 PMID: 24204682 PMCID: PMC3810145 DOI: 10.1371/journal.pone.0076840
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of Illumina HiSeq 2000 assembly and analysis of M. nipponense transcriptomic sequences.
| | Number |
| Total genes | 57619 |
| Total isogenes | 78408 |
| Total residues (bp) | 97554839 |
| Average length (bp) | 1244.19 |
| Largest isogene (bp) | 23217 |
| Smallest isogene (bp) | 351 |
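The average length in the table can be checked against the other rows. The sketch below assumes that "average length" means total residues divided by a sequence count; the source does not state which count was used, so both are computed. The variable names are illustrative.

```python
# Values copied from the assembly summary table.
total_residues = 97_554_839
total_isogenes = 78_408
total_genes = 57_619

# Assumption: average length = total residues / number of sequences.
mean_per_isogene = total_residues / total_isogenes
mean_per_gene = total_residues / total_genes

print(f"mean length per isogene: {mean_per_isogene:.2f} bp")
print(f"mean length per gene:    {mean_per_gene:.2f} bp")
```

Under this assumption, the reported 1244.19 bp agrees with the per-isogene figure (78,408 sequences) rather than the per-gene figure (57,619 sequences).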
Figure 1. Contig length distribution of M. nipponense transcriptomic ESTs.
Figure 2. Gene ontology classification of non-redundant transcripts.
By alignment to GO terms, 33203 isogenes were assigned to three main categories comprising 62 functional groups: biological process (25 functional groups), cellular component (19 functional groups), and molecular function (18 functional groups). The left y-axis indicates the percentage of genes in a specific functional group within its main category, whereas the right y-axis indicates the number of genes in that group.
Figure 3. Cluster of orthologous groups (COG) classification of putative proteins.
A total of 13817 putative proteins were classified functionally into 25 molecular families in the COG database.
Sex- or reproduction-related ESTs identified in the androgenic gland transcriptome of M. nipponense.
| Transcripts | Length (bp) | E-value | Accession number | Hits |
| Insulin-like androgenic gland specific factor (IAG) | 854–2541 | 0 | ref|NP_563742.1| | 4 |
| DEAD-box ATP-dependent RNA helicase | 397–4215 | 0 | ref|NP_187299.1| | 42 |
| Sex-lethal | 1981–2058 | 0 | ref|XP_003250344.1| | 2 |
| Transformer-2 protein | 452–4421 | 0–1.00E-05 | ref|XP_001513669.2| | 8 |
| Extra sex combs | 1291–1534 | 0 | gb|AAC05332.1| | 2 |
| FTZ-F1 | 382–2037 | 0 | gb|ADK46871.1| | 5 |
| FOXL2 | 3196 | 8.00E-42 | gb|ABP63571.1| | 1 |
| ECM | 437–5934 | 0 | ref|NM_179755.3| | 5 |
| FEM1 | 2066–2741 | 0 | ref|NP_001153369.1| | 4 |
| DHH | 377–2647 | 0–3.00E-28 | ref|NP_193244.2| | 13 |
| START | 430–2598 | 0–2.00E-16 | ref|NM_124797.4| | 8 |
| GATA | 382–2598 | 0–7.00E-14 | ref|NM_001036712.1| | 31 |
| Pumilio | 363–2504 | 0–6.00E-07 | ref|NM_001202701.1| | 9 |
| Argonaute | 380–3427 | 0–2.00E-39 | ref|NM_179453.2| | 13 |
| Chromobox protein | 2954–3086 | 0 | gb|BT060302.1| | 2 |
| Akt | 1520–2755 | 0 | ref|NM_128222.5| | 3 |
| Wnt | 854–2802 | 0–4.00E-40 | gb|EFX66479.1| | 8 |
| Heat Shock Protein | 275–2541 | 0–9.00E-33 | gb|AAM67147.1| | 86 |
| Male reproductive-related protein | 691–6177 | 0–8.00E-42 | gb|EF364539.1| | 12 |
| DNM | 638–2896 | 4.00E-20 | gb|EF208559.1| | 8 |
| Tsunagi | 798 | 0 | gb|EFX81381.1| | 1 |
| Gustavus | 3301 | 0 | gb|GU462157.1| | 1 |
| Cytochrome P450 | 352–2369 | 0–9.00E-14 | ref|NP_189262.1| | 175 |
| Cathepsin A | 2384 | 0 | gb|ADO65982.1| | 1 |
| Cathepsin B | 370–1470 | 0 | gb|EFA01289.1| | 5 |
| Cathepsin D | 731–923 | 0 | sp|Q05744| | 2 |
| Cathepsin L | 1406–1717 | 0 | emb|X99730.1| | 2 |
| PIGS | 443–3186 | 0 | ref|NP_187374.2| | 4 |
| Cyclin | 125–4125 | 0–7.00E-43 | ref|NP_180363.1| | 93 |
| CDC | 456–287 | 0–9.00E-41 | ref|NP_566911.1| | 24 |
| BCS | 364–1566 | 0–1.00E-12 | emb|CAB52469.1| | 7 |
| Cyclophilin | 728–1819 | 0–1.00E-05 | ref|NP_194968.2| | 15 |
| WWP | 3123–4408 | 0 | ref|NP_197706.1| | 2 |
| Dynactin | 796–1677 | 0 | gb|ACD13590.1| | 3 |
| Flotillin | 370–5757 | 0–7.00E-30 | ref|NP_197908.1| | 9 |
| PSMB | 871–1721 | 0 | ref|NP_565156.1| | 15 |
| TCP | 363–2504 | 0–7.00E-33 | ref|NP_172520.1| | 25 |
| Ubiquitin-conjugating enzyme E2 | 365–4010 | 0–6.00E-44 | ref|NP_565440.1| | 67 |
| E3 ubiquitin-protein ligase | 352–10699 | 0–8.00E-29 | ref|NP_192209.2| | 177 |
| Ubiquitin carboxyl-terminal esterase L3 | 1204 | 0 | gb|ACO36738.1| | 1 |
| Ubiquitin carboxyl-terminal hydrolase isozyme L5 | 1757 | 0 | gb|ACM43511.1| | 1 |
| Ubiquitin carboxyl-terminal hydrolase | 354–3818 | 0–4.00E-39 | ref|NP_563719.1| | 79 |
| Ubiquitin-binding protein | 1349 | 0 | ref|XM_001180530.1| | 1 |
| Ferritin | 172–1116 | 0–3.00E-22 | gb|EU371046.1| | 4 |
| Ferritin heavy-chain subunit | 524–1100 | 0 | ref|NP_990417.1| | 3 |
| Ferritin light-chain subunit | 4589 | 0 | gb|FJ446525.1| | 1 |
| Ferritin peptide | 972 | 2.00E-42 | gb|DQ205422.1| | 1 |
Highly expressed genes in the group expressed specifically in the androgenic gland.
| Gene | AG |
| Slow-tonic S2 tropomyosin | 35859.99 |
| Slow-tonic S2 tropomyosin | 35322.07 |
| Slow tropomyosin isoform | 6497.87 |
| Beta-glucosidase 23 | 5577 |
| Slow tropomyosin isoform | 4756.74 |
| Transketolase | 4295 |
| Nitric oxide synthase | 3159.14 |
| Uncharacterized protein | 2652.28 |
| Conserved hypothetical protein | 2213.42 |
| Calmin-like protein | 1896.84 |
| Similar to ankyrin 2,3/unc44 | 1866.91 |
| Glycogen debranching enzyme-like | 1855.11 |
| Dihydropteroate synthase | 1838 |
| Glutamine synthetase cytosolic isozyme 1–2 | 1655.61 |
| Aquaporin PIP2-1 | 1384.64 |
| Endoplasmin-like protein | 1352 |
| Hypothetical protein CaO19.7238 | 1231.24 |
| Glycine cleavage system H protein | 1171 |
| Tubulin binding cofactor C domain-containing protein | 1163 |
| Uncharacterized protein | 1141 |
| Uncharacterized protein | 1041 |
| Uncharacterized protein | 1023.71 |
| ATP synthase subunit gamma | 1015 |
| 40S ribosomal protein S6-1 | 1012 |
Note: AG, androgenic gland. Genes in this table were expressed only in the androgenic gland and were not detected in the vasa deferentia, ovary, or testis. Values are gene expression levels in the androgenic gland.
Genes with an expression pattern similar to that of IAG in the generally expressed gene group.
| Gene | AG | VD | O | T |
| Troponin I | 64234.01 | 1301.32 | 9 | 97.62 |
| Uncharacterized protein | 60606.33 | 3145.35 | 4.07 | 123.7 |
| Uncharacterized protein | 50406 | 3535.28 | 11 | 60.13 |
| Uncharacterized protein | 35327.41 | 2353.87 | 3.93 | 72.92 |
| Troponin T-like isoform 4 | 34579.06 | 202.28 | 33.54 | 35.08 |
| SERCA | 21300.36 | 261.57 | 9.35 | 63.52 |
|
|
|
|
|
|
| Lit v 3 allergen myosin light chain | 20123 | 946.23 | 1 | 29 |
| Actin 1 | 19808.65 | 138.52 | 3 | 3 |
| Chlorophyll a–b binding protein 4 | 16184 | 32 | 14 | 15 |
| Troponin C2 | 15186.76 | 1136.6 | 1 | 36.2 |
| Beta-adaptin-like protein C | 14389 | 24 | 22 | 18 |
| Chlorophyll a–b binding protein CP26 | 14270 | 10 | 11 | 7 |
| Sarcoplasmic calcium-binding protein | 13419 | 37 | 7 | 10 |
| Uncharacterized protein | 13217 | 597 | 24 | 22 |
| Muscle LIM protein isoform 1 | 11554.41 | 1946.92 | 6.67 | 18 |
Note: AG, androgenic gland; VD, vasa deferentia; O, ovary; T, testis. Values are gene expression levels in each tissue.
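One way to read the table above is to compare each gene's androgenic gland value with its highest value in any other tissue. The sketch below does this for three rows copied from the table. The ratio and the helper name `ag_enrichment` are illustrative assumptions, not a metric defined in the source.

```python
# Expression values (AG, VD, O, T) copied from three rows of the table.
rows = {
    "Troponin I": (64234.01, 1301.32, 9.0, 97.62),
    "SERCA": (21300.36, 261.57, 9.35, 63.52),
    "Actin 1": (19808.65, 138.52, 3.0, 3.0),
}

def ag_enrichment(ag, *others):
    """Ratio of the AG value to the largest value in the other tissues."""
    return ag / max(others)

for gene, (ag, vd, o, t) in rows.items():
    print(f"{gene}: AG is {ag_enrichment(ag, vd, o, t):.1f}x the next-highest tissue")
```

For all three rows, the next-highest tissue is the vasa deferentia.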
Figure 4. Distribution of simple sequence repeat (SSR) nucleotide classes among different nucleotide types found in the transcriptome of M. nipponense.
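The source does not say which tool or thresholds were used to detect SSRs. As a minimal sketch of how perfect repeats can be located and grouped by motif length, the following uses a regular expression with a backreference. The minimum repeat counts in `MIN_REPEATS` and the function name `find_ssrs` are hypothetical choices for illustration, not the authors' settings.

```python
import re

# Hypothetical thresholds: motif length -> minimum number of tandem copies.
MIN_REPEATS = {2: 6, 3: 5, 4: 4}

def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, copies) for perfect SSRs in a DNA sequence."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # One k-mer followed by at least n-1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves repeats of a shorter unit
            # (e.g. "AA" or "ATAT"), so each repeat is reported once.
            if motif in (motif + motif)[1:-1]:
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return hits

example = "GGC" + "AT" * 7 + "CCT" + "AAG" * 5 + "T"
for start, motif, copies in find_ssrs(example):
    print(start, motif, copies)
```

Counting the hits by `len(motif)` gives the kind of per-class tally that the figure summarizes.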
Characterization of 14 polymorphic EST-SSR markers in M. nipponense.
| Locus | Primer sequence (5′–3′) | Size (bp) | Ta (°C) | Na | HO | HE | PIC |
| E-WXM3 | F:AGTTGCTGTGCCACCTGC | 213–278 | 58 | 5 | 0.7135 | 0.8235 | 0.802 |
| | R:AAGCCACCACTGCCCTGT | | | | | | |
| E-WXM7 | F: | 295–336 | 54 | 8 | 0.6625 | 0.8388 | 0.821 |
| | R: | | | | | | |
| E-WXM9 | F:AACATTAAACCGTCTGAA | 338–402 | 54 | 5 | 0.5000 | 0.8297 | 0.899 |
| | R:ACCCTATGCGTCCTAACT | | | | | | |
| E-WXM10 | F: | 241–254 | 54 | 4 | 0.5312 | 0.7178 | 0.791 |
| | R:TGAGCATCAGCAGCATTA | | | | | | |
| E-WXM11 | F:GTCCGAGCCTCCTTCTTC | 234–248 | 56 | 4 | 0.4125 | 0.6786 | 0.613 |
| | R:TCCACCTCCTTTGCCACT | | | | | | |
| E-WXM14 | F:CCCTCGTGAGATGATGTG | 348–416 | 55 | 7 | 0.6875 | 0.8333 | 0.799 |
| | R:CAGGACTGAGTGGCAAAA | | | | | | |
| E-WXM16 | F:GCAGTGAATTATTGTGCTCCTA | 303–335 | 56 | 7 | 0.6700 | 0.8145 | 0.776 |
| | R:TCCTGTGGCTCTGCTTTG | | | | | | |
| E-WXM24 | F:AAGGTTCGTTCATGCGTTAG | 289–358 | 56 | 6 | 0.7923 | 0.8520 | 0.725 |
| | R:CGGATATTATTTCTGTTGGGTT | | | | | | |
| E-WXM29 | F: | 182–194 | 54 | 4 | 0.7500 | 0.8436 | 0.874 |
| | R: | | | | | | |
| E-WXM33 | F: | 188–236 | 54 | 13 | 0.8938 | 0.9216 | 0.890 |
| | R: | | | | | | |
| E-WXM62 | F:GCTTGTAGAAACCCGTAG | 134–189 | 52 | 13 | 0.7188 | 0.9132 | 0.672 |
| | R:CTCTGACCTGCTTAGAAAA | | | | | | |
| E-WXM89 | F:GTTACCCAACCAGGCATT | 282–333 | 56 | 10 | 0.7688 | 0.8249 | 0.682 |
| | R:GCATTTTCAGACGCACATAA | | | | | | |
| E-WXM93 | F:GCCAAGAAGCCGAAGACT | 235–298 | 56 | 8 | 0.7138 | 0.8652 | 0.798 |
| | R:TTTTGACAGCAAGGGGAT | | | | | | |
| E-WXM147 | F:ATTGTCGTAGGCTCACGT | 243–280 | 50 | 9 | 0.8688 | 0.9332 | 0.657 |
| | R:AAAATTGGTCTTGCTCCC | | | | | | |
Note: Ta, annealing temperature; Na, number of alleles; HO, observed heterozygosity; HE, expected heterozygosity; PIC, polymorphic information content.
indicates significant deviation from HWE (P<0.05).
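The note names expected heterozygosity and PIC without giving formulas, and the source does not say which estimators or software were used. The sketch below assumes the commonly used definitions: expected heterozygosity as one minus the sum of squared allele frequencies, and PIC in the form given by Botstein et al. (1980). The function names and the allele frequencies in the usage lines are made up for illustration and are not data from the study.

```python
def expected_heterozygosity(freqs):
    """HE = 1 - sum(p_i^2) for allele frequencies p_i that sum to 1."""
    return 1.0 - sum(p * p for p in freqs)

def pic(freqs):
    """PIC = 1 - sum(p_i^2) - sum over pairs i<j of 2 * p_i^2 * p_j^2."""
    cross = 0.0
    for i in range(len(freqs)):
        for j in range(i + 1, len(freqs)):
            cross += 2.0 * freqs[i] ** 2 * freqs[j] ** 2
    return expected_heterozygosity(freqs) - cross

# Illustrative frequencies only (not taken from the table).
two_alleles = [0.5, 0.5]
four_alleles = [0.25, 0.25, 0.25, 0.25]
print(expected_heterozygosity(two_alleles), pic(two_alleles))
print(expected_heterozygosity(four_alleles), pic(four_alleles))
```

With two equally frequent alleles these definitions give HE = 0.5 and PIC = 0.375. Both values rise as the number of equally frequent alleles increases.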
Figure 5. Distribution of putative single nucleotide polymorphisms (SNP) in the transcriptome of M. nipponense.