| Literature DB >> 22745820 |
Keyi Ma1, Gaofeng Qiu, Jianbin Feng, Jiale Li.
Abstract
BACKGROUND: The oriental river prawn, Macrobrachium nipponense, is an economically and nutritionally important species of the Palaemonidae family of decapod crustaceans. To date, the sequencing of its whole genome is unavailable as a non-model organism. Transcriptomic information is also scarce for this species. In this study, we performed de novo transcriptome sequencing to produce the first comprehensive expressed sequence tag (EST) dataset for M. nipponense using high-throughput sequencing technologies. METHODOLOGY AND PRINCIPALEntities:
Mesh:
Substances:
Year: 2012 PMID: 22745820 PMCID: PMC3380025 DOI: 10.1371/journal.pone.0039727
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Length distribution of M. nipponense transcriptomic ESTs.
(A) total transcriptomic reads, (B) contigs, (C) singletons.
Summary of Roche 454 GS FLX assembly and analysis of M. nipponense transcriptomic sequences.
| Dataset name | All |
| transcriptomic reads | 984,204 |
| total number of bases | 338,592,850 bp |
| average read length | 344 bp |
| No. of contigs | 42,551 |
| longest contig | 14,768 bp |
| No. of singletons | 38,860 |
Figure 2Gene Ontology (GO) terms for the transcriptomic sequences of M. nipponense.
Figure 3Clusters of orthologous groups (COG) classifications of unique M. nipponense sequences.
1: RNA processing and modification; 2: chromatin structure and dynamics; 3: energy production and conversion; 4: cell cycle control, cell division, chromosome partitioning; 5: amino acid transport and metabolism; 6: nucleotide transport and metabolism; 7: carbohydrate transport and metabolism; 8: coenzyme transport and metabolism; 9: lipid transport and metabolism; 10: translation, ribosomal structure and biogenesis; 11: transcription; 12: replication, recombination and repair; 13: cell wall/membrane/envelope biogenesis; 14: cell motility; 15: posttranslational modification, protein turnover, chaperones; 16: inorganic ion transport and metabolism; 17: secondary metabolites biosynthesis, transport and catabolism; 18: general function prediction only; 19: function unknown; 20: signal transduction mechanisms; 21: intracellular trafficking, secretion, and vesicular transport; 22: defense mechanisms; 23: nuclear structure; and 24: cytoskeleton.
Selected genes of interest for sex determination/differentiation in the M. nipponense transcriptome, including the contigs and singletons.
| Candidate genes | Hit(s) | Similarity (%) | E-value | Length (bp) |
|
| 1 | 45 | 3.23E-08 | 377 |
|
| 1 | 33 | 3.00E-07 | 807 |
|
| 4 | 45 | 1.84E-08∼4.01E- 08 | 799∼1330 |
|
| 2 | 60∼88 | 1.00E-14∼1.77E- 06 | 318∼404 |
|
| 31 | 56∼90 | 0∼6.04E-06 | 157∼1672 |
|
| 9 | 50∼92 | 4.01E-99∼4.90E- 09 | 136∼1521 |
|
| 1 | 50 | 8.03E-07 | 619 |
|
| 1 | 58 | 0 | 6049 |
|
| 8 | 59∼88 | 0∼1.12E-08 | 109∼3831 |
|
| 1 | 50 | 5.87E-14 | 751 |
|
| 1 | 82 | 8.80E-134 | 1164 |
|
| 4 | 58∼78 | 5.94E-81∼2.03E- 10 | 160∼1976 |
|
| 4 | 43∼64 | 1.00E-16∼1.00E- 11 | 288∼1127 |
|
| 6 | 65∼87 | 4.38E-61∼5.80E- 10 | 179∼560 |
|
| 2 | 53∼86 | 2.16E-08∼6.63E- 06 | 136∼329 |
|
| 6 | 60∼98 | 2.65E-156∼3.70E- 13 | 266∼1968 |
|
| 2 | 58∼60 | 0∼4.18E-133 | 2625∼3790 |
|
| 1 | 74 | 9.21E-16 | 391 |
|
| 11 | 41∼89 | 0∼2.70E-07 | 576∼5692 |
|
| 6 | 52∼83 | 0∼2.29E-06 | 127∼6975 |
|
| 1 | 100 | 0 | 2357 |
|
| 1 | 98 | 0 | 5421 |
| testis-specific | 11 | 40∼77 | 0∼6.48E-07 | 369∼2524 |
Figure 4Distribution of simple sequence repeat (SSR) nucleotide classes among different nucleotide types found in the transcriptome of M. nipponense.
Both contig and singleton sequences are used to predict the SSR loci.
Figure 5Distribution of putative single nucleotide polymorphisms (SNP) in the transcriptome of M. nipponense.
Primers used and verified SNPs in the ESTs of M. nipponense.
| Contig ID | Primers sequence (5′–3′) | Products length (bp) | No. of SNPs |
| 06300 | AACTTAGGGCTTGCGATTC CTCTTCCGAGTGCGTGAT | 262 | 1/1/0 |
| 06559 | GTTCTTTGGATGTTAGGTG AAGTTTGTTGATGTGGTTT | 221 | 1/1/0 |
| 06567 | GACATGACCATGCTCCCA AGGAACTGTCTCGCCAAC | 551 | 0/1/0 |
| 07723 | ACACTATTTAGGCATTTAT GGAAGTTCAGTAGGAGGT | 208 | 2/2/0 |
| 01235 | AGCAGACGCAGGTAGAAC AGTGAAGGCTTGAGGAAC | 432 | 1/2/0 |
| 01295 | GAGCCTCAAATAAGAAAG TATGAAGCCCATCAACCC | 431 | 4/4/0 |
| 01191 | ACCAGGAATGTTGGAGCA GCCTAGCATACCCGATGA | 351 | 0/5/2 |
| 01399 | TATCGCATACACTAAGGC ATTTAATGGTCGCTACTC | 396 | 4/5/0 |
| 06092 | GGAGGAGTGGGTTAGGTC TTACTGTCGCTGTGGTGA | 319 | 2/2/1 |
| 08341 | ACAGCCATGAAGTCATAAG AGAAGGTCCAGCGATACG | 454 | 3/3/3 |
| 08357 | GTTATTGCCGTGTTCTTA ATGGTGCTCCCTTATCTT | 373 | 2/3/0 |
| CAGAAGCACATAGGCACA CAGGTCATCAAGAGGAGG | 316 | 0/2/0 | |
| 08359 | ACTTGCTCCTGATAACCC CCACTTCTTTGACACACC | 302 | 1/1/0 |
| 08391 | CTCTTCCGCATTTCTACG TAAACCCAGGGATTCAGT | 200 | 1/1/0 |
| 08461 | TGAATGGGTCTGGTTGGT CATTTGTGGCGGAGGAAG | 370 | 1/1/0 |
| 08476 | AGTTCACAGCAGTGGGATT TGGTGAGGGAGACGAAGA | 200 | 1/1/0 |
| 08491 | GGAGAAACGCAAGAACGC TCCTGATGGTGGAGACCC | 345 | 1/1/0 |
| 15044 | TATTGAAGCTGCACAACC TAGAGCCATAAGCGAGGA | n/a | |
| 27660 | TCCGTATCTTTCGAGTTT GTCCCGCATTGTATTTTT | 166 | 1/2/0 |
| 08344 | GGAAGACAACAACGACAC CCAGCTAACACAAACAAT | 496 | 0/1/0 |
For each contig, the following are indicated: the number of SNPs verified/the number of SNPs predicted in the fragment/the number of additional SNPs using Sanger technology; n/a indicates that sequencing was not possible because the sequencing signal was interrupted.