| Literature DB >> 20113487 |
Yogeswaran Lokanathan1, Adura Mohd-Adnan, Kiew-Lian Wan, Sheila Nathan.
Abstract
BACKGROUND: Cryptocaryon irritans is a parasitic ciliate that causes cryptocaryonosis (white spot disease) in marine fish. Diagnosis of cryptocaryonosis often depends on the appearance of white spots on the surface of the fish, which are usually visible only during later stages of the disease. Identifying suitable biomarkers of this parasite would aid the development of diagnostic tools and control strategies for C. irritans. The C. irritans genome is virtually unexplored; therefore, we generated and analyzed expressed sequence tags (ESTs) of the parasite to identify genes that encode for surface proteins, excretory/secretory proteins and repeat-containing proteins.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20113487 PMCID: PMC2828411 DOI: 10.1186/1471-2164-11-76
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of C. irritans EST analysis
| Number (percentage) | |
|---|---|
| Total number of clones sequenced | 5760 |
| Number of high quality sequences | 5356 (93) |
| Number of consensi | 670 |
| Singletons | 1989 |
| Unique transcripts (UTs) | 2659 |
| Number of known genes | 1966 (74) |
| Number of unknown genes | 692 (26) |
The 20 most abundantly encountered genes at the tomont stage
| UT ID | No. ESTs | Putative Identity | Organism | % Identity | E Value |
|---|---|---|---|---|---|
| cn48 | 136 | Outer membrane adhesin like protein | 30.95 | 4E-09 | |
| cn52 | 98 | 28S ribosomal RNA | 84.00 | 0 | |
| cn10 | 79 | RTX toxins and related Ca2+-binding proteins | 34.78 | 3E-09 | |
| cn42 | 68 | Agglutination/immobilization antigen isoform 1 | 71.82 | 1E-118 | |
| cn24 | 59 | Insect antifreeze protein | 22.67 | 1E-38 | |
| cn1 | 54 | Granule tip protein 2 | 23.24 | 7E-08 | |
| cn21 | 50 | No significant hit | |||
| cn57 | 48 | Agglutination/immobilization antigen isoform 1 | 41.04 | 7E-58 | |
| cn107 | 40 | Polyubiquitin | 97.21 | 0 | |
| cn123 | 37 | Hypothetical 18K mitochondrion protein | 49.05 | 3E-15 | |
| cn41 | 35 | Agglutination/immobilization antigen isoform 1 | 66.87 | 1E-119 | |
| cn14 | 34 | Beta-glucanase/Beta-glucan synthetase | 37.95 | 5E-20 | |
| cn25 | 32 | MCM2/3/5 family protein | 51.06 | 0 | |
| cn11 | 29 | Tubulin beta chain | 95.11 | 0 | |
| cn174 | 28 | ER-type hsp70 | 74.88 | 0 | |
| cn81 | 23 | Chitinase | 32.03 | 3E-47 | |
| cn66 | 22 | Insect antifreeze protein | 23.49 | 2E-24 | |
| cn110 | 22 | Calpain-like protein | 32.95 | 8E-29 | |
| cn56 | 20 | Agglutination/immobilization antigen isoform 4 | 42.67 | 4E-65 | |
| cn80 | 20 | Tubulin/FtsZ family, GTPase domain containing protein | 93.63 | 0 |
Figure 1Length of unique transcripts versus BLAST hit. The relationship between the length of the unique transcripts (UTs) and the number of UTs with BLAST hits.
Figure 2Distribution of best BLASTX matches according to organisms. A diagrammatic representation of organism distribution according to UTs with the best match to the NCBI non-redundant protein database.
Figure 3Phylogenetic analysis of . A maximum parsimony tree inferred from the complete β-tubulin amino acid sequence of C. irritans and other alveolates, with Mus musculus as an outgroup. The I. multifiliis β-tubulin amino acid sequence was inferred from the EST dataset. The numbers below branches are the bootstrap values of 1000 iterations of the data file.
Figure 4Proteins shared among . Venn diagram summary of C. irritans translated UTs comparison with T. thermophila and P. falciparum peptide sequences. The numbers at the overlapping area represent matching peptides (BLASTX of E < 10-5) in the relevant organism with the query 2659 translated UTs of C. irritans.
The summary of SUPERFAMILY and Pfam domains of C . irritan s, T. thermophila and P. falciparum
| Total peptide sequences | 2 659 | 24 725 | 5460 |
| Sequences with SUPERFAMILY domain assignments | 1 025(39%) | 11268(46%) | 2468(45%) |
| Total SUPERFAMILY hits | 1 163 | 17379 | 3553 |
| Unique SUPERFAMILY domains | 323 | 715 | 1101 |
| Sequences with Pfam domain assignments | 1 043(39%) | 13 896 (56%) | 2492 (45%) |
| Total Pfam hits | 1 203 | 39507 | 5542 |
| Unique Pfam domains | 608 | 4 168 | 1 101 |
Figure 5GO annotation. GO annotation was done with the Blast2GO tool. The level 2 GO terms which are (a) biological process, (b) molecular function, and (c) cellular component are shown.
Top 30 metabolic pathways in C. irritans mapped by KEGG
| No. | KEGG pathway | No. UTs | Background genes distributiona |
|---|---|---|---|
| 1 | Ribosome | 47 | 69 |
| 2 | Chaperones and folding catalysts | 28 | 57 |
| 3 | Cell cycle | 26 | 24 |
| 4 | Oxidative phosphorylation | 22 | 49 |
| 5 | Translation factors | 19 | 41 |
| 6 | Pyruvate metabolism | 18 | 36 |
| 7 | Transcription factors | 15 | 3 |
| 8 | Cell cycle - yeast | 14 | 24 |
| 9 | Insulin signaling pathway | 14 | 27 |
| 10 | Proteasome | 13 | 27 |
| 11 | Apoptosis | 12 | 10 |
| 12 | Wnt signaling pathway | 11 | 14 |
| 13 | Focal adhesion | 10 | 5 |
| 14 | Prostate cancer | 10 | 9 |
| 15 | Regulation of actin cytoskeleton | 10 | 13 |
| 16 | Beta-alanine metabolism | 9 | 12 |
| 17 | Epithelial cell signaling in | 9 | 14 |
| 18 | Receptors and channels | 8 | 2 |
| 19 | Long-term potentiation | 8 | 8 |
| 20 | Tight junction | 8 | 9 |
| 21 | Melanogenesis | 7 | 5 |
| 22 | p53 signaling pathway | 7 | 10 |
| 23 | Huntington's disease | 6 | 4 |
| 24 | TGF-beta signaling pathway | 6 | 8 |
| 25 | Notch signaling pathway | 5 | 2 |
| 26 | Adherence junction | 5 | 4 |
| 27 | Thyroid cancer | 4 | 2 |
| 28 | Renal cell carcinoma | 4 | 3 |
| 29 | Jak-STAT signaling pathway | 3 | 0 |
| 30 | Neuroactive ligand-receptor interaction | 3 | 1 |
Number of genes present for the particular pathway in whole gene data set of Tetrahymena thermophila.
Repetitive motifs containing transmembrane and extracellular proteins
| ID | BLASTX | Pfam/Interpro | L | Repeat | ESTs | |
|---|---|---|---|---|---|---|
| cn349 | Multicystatin | No significant hit | EC | 2X | DKIPKSVLEFGINKLELSNVFAHKDFSKIENAQMKVVSGYIYKFTLVYQFSEQEHKFEIQVWSKADQTLELISMKEIT | 7 |
| cn66 | Unknown protein | EGF-like domain | EC | 4X | YVNNGSCSSNSTLFNFTSKNCEKSCG | 22 |
| cn65 | Insect antifreeze protein | EGF-like domain | EC | 5X | LFNFTSKNCEKSCGESGY | 10 |
| cn131 | Protein disulfide-isomerase domain containing protein | Protein disulphide isomerase family | EC | 3X | EEKEEK | 9 |
| cn85 | Predicted protein | Ricin-type beta-trefoil lectin domain | EC | 4X | VLDVYG | 8 |
| cn81 | Chitinase | Glycoside hydrolase, family 18 domain, chitinase active site | EC | 2X | YARGYELCKTPGDKLDKIFYAFLNPTTG | |
| CiTo 53E07 | Membrane associated protein | No significant hit | TM | 9X | FSFLLFFFFFSFWS | 1 |
| CiTo 12F11 | No significant hit | EGF-like domain, Metridin-like ShK toxin | EC | 3X | AETGST | 1 |
| CiTo 13H10 | Tenascin | EGF domain | TM | 3X | PNNCSG | |
| CiTo 26C11 | No significant hit | No significant hit | TM | 3X | EKCRCL | 1 |
| CiTo 2H11 | No significant hit | No significant hit | EC | 3X | AKTAAE | 1 |
| CiTo 6G09 | Cation diffusion facilitator family transporter | Cation efflux protein | TM | 3X | GHGHSH | 1 |
L, Localization; TM, transmembrane domain; EC, extracellular