| Literature DB >> 24952385 |
Parpakron Korshkari, Sirintra Vaiwsri, Timothy W Flegel, Sudsanguan Ngamsuriyaroj, Burachai Sonthayanon1, Anuphap Prachumwat.
Abstract
BACKGROUND: Although captured and cultivated marine shrimp constitute highly important seafood in terms of both economic value and production quantity, biologists have little knowledge of the shrimp genome and this partly hinders their ability to improve shrimp aquaculture. To help improve this situation, the Shrimp Gene and Protein Annotation Tool (ShrimpGPAT) was conceived as a community-based annotation platform for the acquisition and updating of full-length complementary DNAs (cDNAs), Expressed Sequence Tags (ESTs), transcript contigs and protein sequences of penaeid shrimp and their decapod relatives and for in-silico functional annotation and sequence analysis. DESCRIPTION: ShrimpGPAT currently holds quality-filtered, molecular sequences of 14 decapod species (~500,000 records for six penaeid shrimp and eight other decapods). The database predominantly comprises transcript sequences derived by both traditional EST Sanger sequencing and more recently by massive-parallel sequencing technologies. The analysis pipeline provides putative functions in terms of sequence homologs, gene ontologies and protein-protein interactions. Data retrieval can be conducted easily either by a keyword text search or by a sequence query via BLAST, and users can save records of interest for later investigation using tools such as multiple sequence alignment and BLAST searches against pre-defined databases. In addition, ShrimpGPAT provides space for community insights by allowing functional annotation with tags and comments on sequences. Community-contributed information will allow for continuous database enrichment, for improvement of functions and for other aspects of sequence analysis.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24952385 PMCID: PMC4094775 DOI: 10.1186/1471-2164-15-506
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1ShrimpGPAT database schema. This entity–relationship diagram shows relations among tables of four groups: sequence record tables (blue), in-silico annotation tables (green), users’ data tables (yellow) and shared information tables (pink). Primary keys are underlined. Boldface indicates non-null field columns. Connections between tables are represented by lines, and relations between entities are indicated above the connection lines.
The number of molecular sequence records in ShrimpGPAT
| Species | # of records | ||||
|---|---|---|---|---|---|
| Scientific name | Common name | EST | Transcript contigs a | cDNA | Protein |
|
| Black tiger shrimp | 86,327 | 18,410 | 1,976 | 602 |
|
| Pacific whiteleg shrimp | 176,592 | 47,058 | 74,828 | 574 |
|
| White shrimp | 1,042 | 126 | 135 | 27 |
|
| Fleshy prawn | 10,446 | 2,714 | 478 | 257 |
|
| Indian prawn | 714 | 155 | 348 | 127 |
|
| Kuruma prawn | 3,156 | 662 | 989 | 743 |
|
| Giant freshwater prawn | 4,427 | 8,550b | 635 | 389 |
|
| Cray fish | 120 | 90 | 239 | 226 |
|
| Signal crayfish | 802 | 199 | 914 | 88 |
|
| American lobster | 29,957 | 12,709 | 186 | 227 |
|
| Orange mud crab | 203 | 80 | 121 | 0 |
|
| Green mud crab | 3,972 | 56 | 720 | 698 |
|
| Blue crab | 10,563 | 2,104 | 173 | 161 |
|
| Green crab | 15,559 | 7,672 | 273 | 275 |
aThe number of transcript contigs in each species is the summation of all contig sequences constructed by a set of ESTs and by a set of SRA reads with CAP3 (with default or 97%-similarity parameters) and Newbler (with default parameters).
bIncluding SRA transcript contigs produced by Newbler.
Figure 2A screenshot of ShrimpGPAT record display page. Its layout is divided into I) the title, II) the menu bar, III) the content and IV) the footer. See text for description.