| Literature DB >> 17587442 |
Thewarach Laha1, Porntip Pinlaor, Jason Mulvenna, Banchob Sripa, Manop Sripa, Michael J Smout, Robin B Gasser, Paul J Brindley, Alex Loukas.
Abstract
BACKGROUND: Cholangiocarcinoma (CCA)--cancer of the bile ducts--is associated with chronic infection with the liver fluke, Opisthorchis viverrini. Despite being the only eukaryote that is designated as a 'class I carcinogen' by the International Agency for Research on Cancer, little is known about its genome.Entities:
Mesh:
Year: 2007 PMID: 17587442 PMCID: PMC1913519 DOI: 10.1186/1471-2164-8-189
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Features of the Opisthorchis viverrini EST catalogue.
| Initial Sequences | 5159 |
| Usable sequencesa | 4241 |
| Contigs | 1995 (1632 singletons; 363 clusters) |
| Contigs after clean-upb | 1932 |
| Contigs identical to known proteinsc | 68 |
| Contigs similar to other proteinsd | 794 |
| Contigs with gene ontology assignments | 383 |
| Novel contigs | 1070 |
| Novel contigs with signal sequences | 75 (29 signal peptides; 46 signal anchors) |
| Average insert size | 548 bp (ESTs); 660 bp (contigs) |
| Percentage of recombinant clones | 95% |
| Number of ribosomal seqs | 1184 ESTs; 136 clusters |
aUsable sequences were determined using seqclean – sequences that were removed were either non-recombinant, of low complexity and/or quality and those of < 100 nt in length.
bclean-up refers to removal of sequences from contaminating sources; eg. Mycoplasma
cidentity determined by ≥ 95% identity over ≥ 50 amino acids.
dbased on BLASTx and tBLASTx searchers of GenBank nr and dbEST respectively.
The 10 most abundant contigsa from the Opisthorchis viverrini EST dataset.
| OvAE1587 | 100 | vitelline B precursor, | 99% (230) | 493 | 92% (225) | 524 | |
| OvAE1588 | 77 | 17 kDa myoglobin, | 81% (149) | 244 | 77% (188) | 344 | |
| OvAE1585 | 77 | hypothetical protein, | 84% (90) | 156 | 84% (100) | 199 | |
| OvAE1593 | 41 | egg protein, | 89% (253) | 389 | 82% (237) | 477 | |
| OvAE1584 | 37 | hypothetical protein, | 67% (59) | 82 | SJA_AAF_D11.T3 SJA | 85% (94) | 183 |
| OvAE1602 | 21 | histone H1, | 74% (70) | 112 | 85% (177) | 302 | |
| OvAE1607 | 17 | egg protein, | 59% (252) | 288 | 82% (252) | 493 | |
| OvAE1595 | 16 | retrotransposon gag region, | 31% (57)b | 33 | NA | NA | NA |
| OvAE1608 | 15 | translationally controlled tumor protein, | 98% (169) | 306 | 92% (122) | 228 | |
| OvAE1601 | 11 | glutathione- | 98%(213) | 429 | 86% (232) | 484 |
aNon-ribosomal sequences only were used in this analysis.
bSequence identity was low but diagnostic motifs of gag were detected over 57 amino acids.
Figure 1Summary of predicted gene product function and location using gene ontology terms. Gene ontology (GO) terms for annotated Opisthorchis viverrini assembled ESTs were extracted, if present, from the GO database and sorted into the immediate subcategories for molecular function, cellular component and biological process. The GO subcategory and percentage relative to the total number of extracted terms is indicated in the legend. Although cellular and physiological processes, structural proteins and catalytic activity were strongly represented other categories of interest include the caspases and transporter activity that may represent proteins important for a parasitic lifestyle. The large number of unknowns in each of the three categories highlights the lack of knowledge regarding many of the proteins found in these parasites.
Figure 2Comparison of the gene ontology molecular function terms for expressed sequence tags from . Expressed sequence tags from C. sinensis and S. japonicum were downloaded from NCBI and subjected to the same analyses used for O. viverrini sequences. A comparison of the percentage of terms correlating to the molecular function subcategory for each organism shows a broad similarity, although in some cases, such as categories for structural or motor proteins, categories are over- or under-represented in certain species.
Figure 3Evolutionary relationships between . Similarity of O. viverrini ORFs (1,932 ESTs) to those from the liver fluke Clonorchis sinensis (2,679 ESTs), the blood fluke Schistosoma japonicum (107, 427 ESTs) and the free-living turbellarian Schmidtea mediterranea (171,472 ESTs). SimiTri [20] was used to plot 1,932 O. viverrini contigs against related species database entries (A). Each spot represents a unique contig and its sequence similarity to each of the three selected databases as determined by tBLASTx scores. Sequences showing similarity to only one database are not shown. Sequences showing sequence similarity to only two databases appear on the lines joining the two databases. Spots are coloured by their highest tBLASTx score to each of the databases. O. viverrini sequences with homologues in the parasitic flukes only (not in Schmidtea) are highlighted in the dotted region and the identities of selected examples are shown in the table (B). The entire list (105) of these putative parasite-specific proteins is shown in Table S1.
Figure 4Distribution of . OvAEs that had BLAST hits greater than 1 × 10-5 were sorted into conserved (those matching entries for species other than platyhelminths), phylum Platyhelminthes-specific (only matching platyhelminth entries) and novel (no significant homology to any database entry). The sequences in each category were then analysed for the presence of a signal sequence using SignalP. The relative percentages of each category are indicated along with the sub-category of signal sequence positive contigs.
Selected Opsithorchis viverrini contigs that encode families of secreted/membrane proteins that potentially interact with or are exposed to host tissues. Genera of the closest homologues from BLAST × (nr) searches are shown. Where the closest homologue was from a vertebrate (bold font), a tBLASTx search against dbEST was conducted.
| TGF-β receptor | bone morphogenic protein receptor type I ( | OvAE22 | 44 |
| Seven transmembrane receptor | DC-STAMP ( | OvAE92, OvAE1722 | 51 |
| Tetraspanin | stabilize cell membranes – expressed in the tegument of schistosomes ( | OvAE953 | 34 |
| C1 family papain-like cysteine protease | cathepsin L ( | OvAE1795, OvAE813, OvAE1171, OvAE532, OvAE1070, OvAE1613, OvAE1711, OvAE615, OvAE398 | > 80 |
| C13 family asparaginyl endopeptidase | legumain ( | OvAE1624, OvAE1824 | 94 |
| S1 family serine protease | HtrA-like ( | OvAE622, OvAE1918 | 47–53 |
| A1 family aspartic protease | cathepsin D-like; digestive enzyme in helminths ( | OvAE1300 | 80 |
| M41 family metalloprotease | mitochondrial membrane proteinase ( | OvAE65 | 91 |
| Granulin | mitogen associated with cancer ( | OvAE1732 | 45 |
| Aquaporin | water channel protein ( | OvAE6 | 48 |
| Tyrosinase | critical for | OvAE1900, OvAE1854 | 63 |
| Phospholipase A2 | similar to vertebrate venom proteins; ( | OvAE1644 | 55 |
| Thioredoxin peroxidase | immunomodulatory in fasciolosis ( | OvAE54 | 74 |
| EF-hand secreted Ca2+-binding protein | calumenin ( | OvAE61 | 47 |
| Saposin-like protein | pore forming; similar to fluke cytolysins ( | OvAE1692 | 64 |
| Pathogenesis related protein | similar to helminth venom allergen homologues ( | OvAE534, OvAE1862 | 38 |
| Glutathione-S-transferase | detoxification of heme and free radicals ( | OvAE1057, OvAE1892, OvAE1601, OvAE1729 | 86 |
| Synaptobrevin | neurotransmission/vesicular docking – vesicle associated ( | OvAE1001 | 73 |
| Innexin | integral membrane protein forming gap junctions ( | OvAE631 | 78 |
| Fibroblast growth factor (FGF) receptor substrate 2 | host FGF is essential for growth of schistosomes ( | OvAE1563 | 32 |
| Ly6c | Immune cell differentiation antigen | OvAE82 | 26 |
Figure 5An . Multiple sequence alignment comparing the ORF of OvAE953 with Sm-TSP-2 from S. mansoni (GenBank AF521091) and human CD63 (NM_001780). Both Sm-TSP-2 and CD63 sequences shown here are truncated at the C-terminus (fourth transmembrane domain and C-terminal tail are not shown) for comparative purposes because OvAE953 is a partial sequence. Black boxes denote identical residues shared by two or more of the sequences. Grey boxes denote conservative substitutions. Dashed lines denote the predicted transmembrane (TM) domains of Sm-TSP-2; the solid line represents the extracellular (EC) loop 2 region of Sm-TSP-2 [33].
Figure 6A TGF-β receptor type I from . Multiple sequence alignment of the ORFs of OvAE22 with homologues from Schistosoma mansoni (SmRK-I – GenBank AF031557), the hydatid tapeworm Echinococcus multilocularis (TR1 – AJ841786) and human (TGF-β receptor type I – L11695) (A). The overlined region denotes the putative serine-threonine kinase active site in SmRK-I [44]. Residues highlighted in red font in OvAE22 are putative sites of serine/threonine phosphorylation. Both SmRK-I and human TGF-β receptor type I sequences shown here are truncated at the N-terminus and SmRK-I is truncated at the C-terminus for comparative purposes with the partial sequence from O. viverrini. Black boxes denote identical residues shared by two or more of the sequences. Grey boxes denote conservative substitutions. Neighbour joining phylogenetic tree showing the relationship between the ORF of OvAE22 and other members of the TGF-β receptor type I family (B). Numbers on branches denote bootstrap values from 100 samplings. The nominated outgroup was the type 2 receptor, SmRK-2. GenBank accession numbers not already provided above are as follows: pig bone morphogenic protein (BMP) receptor type I (AY065994); dog hookworm Ancylostoma caninum S/T kinase (AY053388); Caenorhabditis briggsae CBG02627 (CAAC01000012); filarial nematode Brugia pahangi trk-1 (AF013991); S. mansoni SmRK-2 (AY550912).
Opisthorchis viverrini ESTs with sequence identity to mRNAs encoding proteins efficacious as vaccines against other flatworm and nematode parasites.
| Aspartic protease | APR-1 for hookworm; cathepsin D for | [59, 60] |
| Glutathione- | [79, 80] | |
| Cysteine protease | TSBP for | [61–63] |
| Tetraspanin | TSP-1, TSP-2 and Sm23 for | [33, 81] |
| Pathogenesis related protein | ASP-2 for hookworm; ASP-1 for | [82, 83] |
| calpain | Smp80 for schistosomes | [64] |
| Fatty acid binding protein | Sm14 for | [66, 67] |
| Saposin-like protein | FhSAP-2 for | [65] |
| 14-3-3 | Sm14-3-3 for schistosomes | [84] |
| 22.6 (unknown function) | Sm22.6 for | [58] |