| Literature DB >> 20374649 |
Sandeep J Joseph1, José A Fernández-Robledo, Malcolm J Gardner, Najib M El-Sayed, Chih-Horng Kuo, Eric J Schott, Haiming Wang, Jessica C Kissinger, Gerardo R Vasta.
Abstract
BACKGROUND: Perkinsus marinus, a protozoan parasite of the eastern oyster Crassostrea virginica, has devastated natural and farmed oyster populations along the Atlantic and Gulf coasts of the United States. It is classified as a member of the Perkinsozoa, a recently established phylum considered close to the ancestor of ciliates, dinoflagellates, and apicomplexans, and a key taxon for understanding unique adaptations (e.g. parasitism) within the Alveolata. Despite intense parasite pressure, no disease-resistant oysters have been identified and no effective therapies have been developed to date.Entities:
Mesh:
Year: 2010 PMID: 20374649 PMCID: PMC2868825 DOI: 10.1186/1471-2164-11-228
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
EST sequence numbers by library, Perkinsus strain and sequence source.
| No. of ESTs | |
|---|---|
| 28,882 | |
| 1,243 | |
| 1,602 | |
| Total | 31,727 |
*Note: strain CB5D4 was used to construct two libraries, long- and short-insert. The numbers listed represent the total of the two.
Figure 1Distribution of nr and dbEST best BLAST hits in GenBank. The comparative cumulative distribution of the best BLASTX and best non-self TBLASTX hits in the NCBI non-redundant (nr) protein database (E-value ≤ 1e-5) and NCBI dbEST database (Evalue ≤ 1e-5), respectively for P. marinus consensus sequences among the major taxonomic groups.
Figure 2Distribution of nr and dbEST best BLAST hits among Chromalveolates. The comparative cumulative distribution of the best BLASTX and best non-self TBLASTX hits in the NCBI non-redundant (nr) protein database (E-value ≤ 1e-5) and NCBI dbEST database (Evalue ≤ 1e-5), respectively for P. marinus consensus sequences among the Chromalveolata.
Figure 3Gene Ontology annotation of . The top BLASTX hit provided annotation and functional categorization (gene ontology assignment) for each P. marinus assembled consensus sequences. The total numbers of sequences annotated for each main category are (a) 8,670 for Biological Process, (b) 3,618 for Molecular Function and (c) 5,452 for Cellular Component. A single gene product may be associated with multiple GO annotations with in a single category, giving rise to more GO annotations than sequences.
The 20 most common Pfam domains
| Description of Pfam domain | Pfam accession | No. of |
|---|---|---|
| Transmembrane amino acid transporter protein | PF01490 | 27 |
| ABC transporter | PF00005 | 26 |
| ADP-ribosylation factor family | PF00025 | 30 |
| Helicase conserved C-terminal domain | PF00271 | 23 |
| Mitochondrial carrier protein | PF00153 | 22 |
| Papain family cysteine peptidase | PF00112 | 11 |
| Protein kinase domain | PF00069 | 51 |
| Proteasome A-type and B-type | PF00227 | 20 |
| RNA recognition motif 1 | PF00076 | 78 |
| RNA recognition motif 2 | PF04059 | 18 |
| Sugar (and other) transporter | PF00083 | 15 |
| Tetra tricopeptide repeat | PF00515 | 20 |
| tRNA synthetase | PF00133 | 11 |
| WD domain, G-beta repeat | PF00400 | 36 |
| Zinc finger C3HC4 type | PF00097 | 77 |
| Zinc finger C8C5C3H type | PF00642 | 43 |
| ATPase family associated with various cellular activities | PF00004 | 16 |
| DEAD/DEAH box helicase | PF00270 | 12 |
| Dna J domain: associated with hsp 70 heat-shock protein | PF00226 | 35 |
| Cytochrome b5-like Heme/Steroid binding domain | PF00173 | 12 |
Pfam domains of interest identified in P. marinus EST cluster open reading frames.
| Pfam family domain | Description of the domain | Pfam accession | |
|---|---|---|---|
| Peptidase_C1 | Papain family cysteine protease | PF00112 | Pm02598, Pm05550, Pm02598, Pm01528, Pm00748, Pm00174, Pm00175, Pm00188, Pm01093, Pm02540, Pm00187 |
| Peptidase_C13 | Asparaginyl peptidase | PF01650 | Pm03639 |
| Peptidase_C14 | Caspase | PF00656 | Pm00713 |
| Peptidase_C15 | Pyroglutamyl peptidase | PF01470 | Pm03835 |
| Peptidase_C26 | Gamma-glutamyl hydrolase | PF07722 | Pm03998, Pm00516, Pm01387 |
| Peptidase_C48 | UIp1 protease family, C-terminal catalytic domain | PF02902 | Pm03070 |
| Peptidase_C54 | Peptidase family C54 | PF03416 | Pm01837 |
| Peptidase_C69 | Peptidase family C69 | PF03577 | Pm00327, Pm01089, Pm00100, Pm00136 |
| Peptidase_M1 | Peptidase family M1 | PF01433 | Pm04612 |
| Peptidase_M14 | Zinc carboxy peptidase | PF00246 | Pm04493 |
| Peptidase_M16 | Insulinase | PF00675 | Pm05750, Pm04910, Pm01740, Pm00611, Pm01740, Pm07103 |
| Peptidase_M17 | Cytosol amino peptidase | PF00883 | Pm05788, Pm04421 |
| Peptidase_M18 | Aminopeptidase I zinc metalloprotease | PF02127 | Pm00540 |
| Peptidase_M20 | Peptidase family M20 | PF01546 | Pm05166 |
| Peptidase_M24 | Metallopeptidase family M24 | PF00557 | Pm00721, Pm05108, Pm01923, Pm01484, Pm03328, Pm05608, Pm01369 |
| Peptidase_M3 | Peptidase family M3 | PF01432 | Pm06832 |
| Peptidase_M41 | Peptidase family M41 | PF01432 | Pm06918 |
| Peptidase_M48 | Peptidase family M48 | PF01435 | Pm02186 |
| Peptidase_S10 | Serine carboxy peptidase | PF00450 | Pm04952, Pm03877, Pm07791, Pm01720, Pm07092, Pm05386, Pm02482, Pm07066, Pm02294, Pm00207, Pm00346, Pm06037 |
| Peptidase_S15 | X-Pro dipeptidyl peptidase | PF02129 | Pm01961 |
| Peptidase_S24 | Peptidase S24 like | PF00717 | Pm02936, Pm07617 |
| Peptidase_S8 | Subtilase family | PF00082 | Pm05197, Pm00155, Pm01368, Pm00344, Pm00265, Pm02193, Pm03660, Pm00235, Pm00154 |
| Peptidase_S9 | Prolyl oligo peptidase family | PF00326 | Pm01915 |
| Rhomboid | Rhomboid-like family | PF01694 | Pm05706, Pm00872 |
| Sod_Fe_N | Iron/manganese superoxide dismutase, alpha-hairpin | PF00081 | Pm00420, Pm05057, Pm02278 |
| Sod_Fe_C | Iron/manganese superoxide dismutase, C-terminal domain | PF02777 | Pm00420, Pm05057, Pm02278 |
| Glutaredoxin | Glutaredoxin | PF00462 | Pm07679, Pm07604, Pm04790, Pm04765 |
| Thioredoxin | Thioredoxin | PF00085 | Pm01390, Pm02808, Pm00345, Pm02770, Pm07686, Pm03543, Pm01102, Pm00367, Pm03381, Pm01163, Pm03631, Pm07411, Pm02203, Pm03850, Pm05817 |
| FA_desaturase | Fatty acid desaturase | PF00487 | Pm02182, Pm03006, Pm06024, Pm07340, Pm02313, Pm02648, Pm03710, Pm03609 |
| IspD | MEP cytidylyl transferase | PF01128 | Pm05015 |
| Glyco_transf_20 | Trehalose-6-phosphate transferase | PF00982 | Pm01181 |
| Glyco_transf_22 | Alg9-like mannosyl transferase | PF03901 | Pm03101 |
| Glyco_transf_28 | UDP-N-acetylglucosamine transferase | PF03033 | Pm01717 |
| Glyco_transf_8 | Lipopolysaccharide galactosyl tranferase | PF01501 | Pm01991 |
| Alg14 | Oligosaccharide biosynthesis protein Alg14-like | PF08660 | Pm04341 |
| HSP90 | Heat shock protein 90 | PF00183 | Pm00142, Pm00170, Pm00836, Pm03154 |
| HSP70 | Heat shock protein 70 | PF00012 | Pm00172, Pm05287, Pm00172, Pm03153, Pm05233, Pm06933, Pm00065, Pm00137, Pm01460, Pm00363, Pm01884 |
| Cpn60_TCP1 | HSp60 chaperonin family and the TCP-1 family | PF00118 | Pm03177, Pm02627, Pm01137, Pm02465, Pm00825, Pm03422, Pm04033, Pm02165, Pm01657, Pm05905 |
| Ricin_B_lectin | Ricin-type beta-trefoil domain | PF00652 | Pm01439, Pm02264, Pm03133 |
| Lectin_leg_like | Legume -like lectin family | PF03388 | Pm03024, Pm03327 |
| Pribosyltran Adenylsucc_synt | Phosphoribosyl transferase adenylosuccinate synthetase | PF00156 | Pm04986, Pm07718, Pm03005 |
| PF00709 | Pm02370 | ||
P. marinus consensus sequences with similarity to known proteases.
| Protein | GenBank best hit accession | E-value | Species with best hit | |
|---|---|---|---|---|
| Subtilisin-like serine protease | Pm00235 | 0.00 | ||
| Pm02193 | 1e-119 | |||
| Pm00154 | 7e-92 | |||
| Pm01368 | 5e-86 | |||
| Pm02998 | 5e-41 | |||
| Pm00155 | 2e-37 | |||
| Pm05826 | 3e-35 | |||
| Pm06080 | 2e-32 | |||
| Pm07074 | 5e-18 | |||
| Cathepsin-like cysteine protease | Pm05181 | 1e-55 | ||
| Papain cysteine protease | Pm02690 | 6e-50 | ||
| Pm00748 | 2e-44 | |||
| Pm02540 | 7e-36 | |||
| Pm03675 | 2e-20 | |||
| Rhomboid-like protease 1 | Pm05706 | 3e-25 | ||
| Cysteine protease | Pm05710 | 2e-30 | ||
| Calcium-dependent cysteine protease, putative | Pm03124 | 2e-19 | ||
| ATP-dependent protease La | Pm01082 | 1e-108 | ||
| Serine protease, putative | Pm03660 | 6e-28 | ||
| Pm00265 | 2e-22 | |||
| Ubiquitin-specific protease | Pm04052 | 5e-12 | ||
| Metacaspase 1 precursor, putative | Pm00713 | 6e-46 | ||
| Pm00714 | 2e-16 | |||
| Putative LON protease | Pm07114 | 3e-14 | ||
| Peptidase family M48 domain | Pm02186 | 1e-69 | ||
P. marinus sequences with similarity to known oxidative enzymes.
| Protein | GenBank best hit accession | E-value | Species with best blast hit | |
|---|---|---|---|---|
| Superoxide dismutase 1 | Pm00420 | 5e-79 | ||
| Pm05057 | 1e-78 | |||
| Superoxide dismutase 2 | Pm05920 | 1e-154 | ||
| Thioredoxin | Pm03850 | 1e-17 | ||
| Pm07686 | 7e-12 | |||
| Pm02203 | 1e-120 | |||
| Pm01102 | 2e-06 | |||
| Pm04526 | 1e-08 | |||
| Peroxiredoxin 6 | Pm01619 | 2e-67 | ||
| Peroxiredoxin V | Pm01453 | 3e-75 | ||
| Glutaredoxin | Pm06883 | 9e-16 | ||
| Pm07604 | 2e-27 | |||
| Pm04444 | 4e-28 | |||
| Pm07679 | 2e-20 | |||
| Glutathione reductase | Pm04212 | 1e-112 | ||
| Thioredoxin reductase | Pm07484 | 7e-54 | ||
| Pm02868 | 1e-78 | |||
| Pm04212 | 1e-112 | |||
P. marinus consensus sequences with similarity to enzymes involved in plastid isoprenoid metabolism and other plant-type plastid proteins.
| Protein | GenBank best hit accession | E-value | Species with best hit | |
|---|---|---|---|---|
| 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase | Pm01490 | 1e-123 | ||
| 4-hydroxy-3-methyl-2-butenyl 1-diphosphate synthase | Pm02434 | 1e-167 | ||
| Pm02620 | 1e-147 | |||
| Pm07308 | 1e-145 | |||
| Triose phosphate/phosphate translocator, non-green plastid, chloroplast precursor, putative | Pm05421 | 1e-18 | ||
| 1-deoxy-D-xylulose-5-phosphate synthase | Pm01411 | 0.00 | ||
| Pm01258 | 1e-160 | |||
| Pm01490 | 1e-154 | |||
| Cytidylyltransferase family protein | Pm07765 | 1e-24 | ||
| Pm03879 | 2e-24 | |||
| 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase | Pm05015 | 9e-26 | ||
| Ferredoxin, putative | Pm01110 | 9e-53 | ||
| Ferredoxin-NADP reductase | Pm04555 | 1e-62 | ||
Figure 4Biological role(s) of . Several aspects of Perkinsus biology are highlighted, including the identification of pathways indicative of a relict/cryptic plastid, expression of numerous transporters and proteases likely be involved in the uptake of nutrients and the degradation of host components, enzymes and lectins involved in glycan assembly, protein folding and secretion, and parasite-host interactions.