| Literature DB >> 25420880 |
Noushin Ghaffari1, Alejandro Sanchez-Flores2, Ryan Doan3, Karina D Garcia-Orozco4, Patricia L Chen5, Adrian Ochoa-Leyva6, Alonso A Lopez-Zavala4, J Salvador Carrasco4, Chris Hong5, Luis G Brieba7, Enrique Rudiño-Piñera8, Philip D Blood9, Jason E Sawyer10, Charles D Johnson1, Scott V Dindot3, Rogerio R Sotelo-Mundo4, Michael F Criscitiello11.
Abstract
We present a new transcriptome assembly of the Pacific whiteleg shrimp (Litopenaeus vannamei), the species most farmed for human consumption. Its functional annotation, a substantial improvement over previous ones, is provided freely. RNA-Seq with Illumina HiSeq technology was used to analyze samples extracted from shrimp abdominal muscle, hepatopancreas, gills and pleopods. We used the Trinity and Trinotate software suites for transcriptome assembly and annotation, respectively. The quality of this assembly and the affiliated targeted homology searches greatly enrich the curated transcripts currently available in public databases for this species. Comparison with the model arthropod Daphnia allows some insights into defining characteristics of decapod crustaceans. This large-scale gene discovery gives the broadest depth yet to the annotated transcriptome of this important species and should be of value to ongoing genomics and immunogenetic resistance studies in this shrimp of paramount global economic importance.Entities:
Mesh:
Year: 2014 PMID: 25420880 PMCID: PMC4243063 DOI: 10.1038/srep07081
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
RNA-Seq reads before and after adapter filtering
| Gill | Hepatopancreas | Pleopods | Tail Muscle | Total | |
|---|---|---|---|---|---|
| 111,119,234 | 59,419,932 | 122,556,454 | 107,132,420 | 400,228,040 | |
| 110,846,074 | 59,190,538 | 122,146,844 | 106,873,256 | 399,056,712 | |
| 11.112 | 5.942 | 12.256 | 10.713 | 40.023 |
Summary statistics of L. vannamei transcriptome assembly
| Trinity Outputs | |
|---|---|
| 87,307 | |
| 110,474 | |
| 44.12 | |
| 31,344 | |
| 201 | |
| 595 | |
| 4,281 | |
| 17,726 | |
| 8,571 | |
| 6,003 | |
| 4,551 | |
| 3,542 | |
| 2,701 | |
| 429 | |
| 1137.44 | |
| 125,657,935 |
Figure 1Immune gene relative representation.
Of immune genes previously identified in resistance to Taura virus in L. vannamei, these twenty had the most hits from the transcriptome described here. Others outside of the top 20 accounted for the remaining 22% of hits.
Figure 2Species percentages in BLASTX hits.
Sorting BLASTX results against our contigs, these 59 database hits appeared most frequently. These proteins were hit from 20 to 107 times.
Blast of L. vannamei contigs against D. pulex
| Lowest E-value < 1E-4 | Lowest E-value < 1E-10 | |
|---|---|---|
| 30,534 | 26,224 | |
| 5,668 | 2,610 |
Top BlastN search results for Trinity assembled contigs against D. pulex transcripts and CDS database
| Total Appearance | GeneID: Frozen Gene Catalog | ID: wFleaBase final annotation | wFleaBase: Daphnia Water Flea Genome Database Link | UniProt Link | |
|---|---|---|---|---|---|
| 112 | NCBI_GNO_0800085 | NCBI_GNO_292084 | |||
| 54 | e_gw1.107.25.1 | NCBI_GNO_254664 | |||
| 44 | NCBI_GNO_14500072 | NCBI_GNO_398774 | |||
| 33 | PASA_GEN_8300084 | NCBI_GNO_174584 | |||
| 32 | e_gw1.21.116.1 | NCBI_GNO_386214 | |||
| 27 | NCBI_GNO_4000059 | NCBI_GNO_426364 | |||
| 26 | estExt_Genewise1Plus.C_1610007 | NCBI_GNO_232814 | |||
| 24 | estExt_fgenesh1_pm.C_110071 | NCBI_GNO_1014113 | |||
| 24 | estExt_fgenesh1_pm.C_470008 | NCBI_GNO_108404 | |||
| 24 | e_gw1.28.118.1 | NCBI_GNO_322284 | |||
| 23 | estExt_Genewise1.C_120222 | NCBI_GNO_352124 | |||
| 23 | PASA_GEN_16200014 | NCBI_GNO_368814 | |||
| 23 | e_gw1.155.10.1 | NCBI_GNO_112804 | |||
| 20 | estExt_Genewise1Plus.C_1070005 | NCBI_GNO_212664 | |||
| 20 | gw1.8.471.1 | NCBI_GNO_532084 | |||
| 18 | SNAP_00035721 | NCBI_GNO_81074 | |||
| 17 | estExt_Genewise1.C_80108 | NCBI_GNO_274084 | |||
| 16 | gw1.5.209.1 | NCBI_GNO_252054 | |||
| 15 | estExt_Genewise1Plus.C_110113 | NCBI_GNO_226114 | |||
| 15 | SNAP_00011758 | NCBI_GNO_644253 |
Top BlastX search results for Trinity assembled contigs against D. pulex protein database
| Total Appearance | Accession | UniProt Link | |
|---|---|---|---|
| 113 | jgi|Dappu1|313136|NCBI_GNO_0800085 | ||
| 87 | jgi|Dappu1|190169|estExt_Genewise1Plus.C_10589 | ||
| 75 | jgi|Dappu1|49585|e_gw1.19.281.1 | ||
| 72 | jgi|Dappu1|30981|gw1.291.11.1 | ||
| 70 | jgi|Dappu1|232498|SNAP_00000556 | ||
| 64 | jgi|Dappu1|305108|PASA_GEN_3300053 | ||
| 63 | jgi|Dappu1|307561|PASA_GEN_6500041 | ||
| 60 | jgi|Dappu1|260966|SNAP_00029024 | ||
| 55 | jgi|Dappu1|61826|e_gw1.107.25.1 | ||
| 54 | jgi|Dappu1|308707|PASA_GEN_8300084 | ||
| 53 | jgi|Dappu1|330675|NCBI_GNO_14500072 | ||
| 50 | jgi|Dappu1|233798|SNAP_00001856 | ||
| 49 | jgi|Dappu1|197424|estExt_Genewise1Plus.C_290215 | ||
| 49 | jgi|Dappu1|213970est|Ext_Genewise1.C_460168 | ||
| 49 | jgi|Dappu1|327214|NCBI_GNO_8800026 | ||
| 47 | jgi|Dappu1|228567|estExt_fgenesh1_pg.C_1090053 | ||
| 47 | jgi|Dappu1|304667|PASA_GEN_0300133 | ||
| 43 | jgi|Dappu1|232691|SNAP_00000749 | ||
| 42 | jgi|Dappu1|190748|estExt_Genewise1Plus.C_20519 | ||
| 42 | jgi|Dappu1|20391|gw1.30.80.1 |
Figure 3DNA replication pathway in L. vannamei.
KEGG pathway analysis20 shows orthologs in shrimp involved in DNA replication, as indicated by green highlighting. This is one example of hundreds of such pathway analyses available in Supplemental Data 5.
Figure 4BLAST2GO identifies D. pulex homologues and gene ontology profile.
(A) Protein BLAST top hit species distribution for the shrimp transcriptome. For identifying homologous proteins, the transcriptome shrimp proteins were BLASTed (BLASTP) against the non-redundant (nr) database using the Blast2GO package. The E-value cut-off was set at 1.0E−3. (B) Distribution of Gene Ontology terms related to Molecular function at level 2 for the shrimp transcriptome. Supported by Blast2GO trascriptome proteins were functionally mapped to GO terms and annotated by setting the following parameters: E-Value-Hit-Filter: 1.0E−3; Annotation cut-off: 55; GO weight: 5; Hsp-Hit Coverage cut-off: 0.
Figure 5Overview of workflow.
Schematic flowchart shows the molecular biology and bioinformatic methods and software employed in this study. Blue steps are wet-lab procedures, red and purple steps are in silico.