| Literature DB >> 22717097 |
Iain W Chalmers1, Karl F Hoffmann.
Abstract
During platyhelminth infection, a cocktail of proteins is released by the parasite to aid invasion, initiate feeding, facilitate adaptation and mediate modulation of the host immune response. Included amongst these proteins is the Venom Allergen-Like (VAL) family, part of the larger sperm coating protein/Tpx-1/Ag5/PR-1/Sc7 (SCP/TAPS) superfamily. To explore the significance of this protein family during Platyhelminthes development and host interactions, we systematically summarize all published proteomic, genomic and immunological investigations of the VAL protein family to date. By conducting new genomic and transcriptomic interrogations to identify over 200 VAL proteins (228) from species in all 4 traditional taxonomic classes (Trematoda, Cestoda, Monogenea and Turbellaria), we further expand our knowledge related to platyhelminth VAL diversity across the phylum. Subsequent phylogenetic and tertiary structural analyses reveal several class-specific VAL features, which likely indicate a range of roles mediated by this protein family. Our comprehensive analysis of platyhelminth VALs represents a unifying synopsis for understanding diversity within this protein family and a firm context in which to initiate future functional characterization of these enigmatic members.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22717097 PMCID: PMC3435950 DOI: 10.1017/S0031182012000704
Source DB: PubMed Journal: Parasitology ISSN: 0031-1820 Impact factor: 3.234
Published findings on platyhelminth venom allergen-like proteins
Names of VAL proteins listed in ‘Studies on platyhelminth VALs’ section are as listed in the original publication. For the ‘Platyhelminth VAL proteins identified in global proteomic studies’ section, names are derived from this review's platyhelminth VAL analysis and are listed Supplementary File 1, online version only. Names used in the original publications are present in parentheses.
Venom allergen-like family distribution across the phylum Platyhelminthes
VAL members from platyhelminth species were identified by tBLASTn searches of NCBI (http://blast.ncbi.nlm.nih.gov/Blast.cgi), Wellcome Trust Sanger Institute (http://www.sanger.ac.uk/cgi-bin/blast/submitblast/) and Gasser Laboratory (http://gasser-research.vet.unimelb.edu.au/) EST databases and genome gene predictions for S. mansoni, (http://www.genedb.org/Homepage/Smansoni), S. japonicum (http://www.genedb.org/Homepage/Sjaponicum) and S. mediterranea (http://smedgd.neuro.utah.edu) using SmVAL1-29 protein sequences. All sequences with a tBLASTn e-value of <1 e-04 were then clustered to create a non-redundant dataset using a CAP3 clustering and additional pair-wise alignment interrogation (98% match over 150 bp minimum). Database searches were preformed on the 11 November 2011 (a) Number of VAL members refers to the number of unique sequences encoding a protein sequence containing at least 90% of a SCP/TAPS domain as defined by Pfam (PF00188). (b) Number of Group 1 and Group 2 members were defined by phylogenetic clustering (Fig. 2) with known SmVAL group 1 and group 2 members.
Fig. 1Comparison of Schistosoma mansoni and Echinococcus multilocularis VAL11 gene structure. (A) SmVAL11 gene structure over the SCP/TAPS domain-encoding exons. Structure obtained from S. mansoni genome v. 4 (http://www.genedb.org/Homepage/Smansoni). (B) EmVAL11 gene structure over the SCP/TAPS domain-encoding exons. The genomic region (scaffold_008000) was identified by a tBLASTn search of the E. multilocularis genome (http://www.sanger.ac.uk/cgi-bin/blast/submitblast/Echinococcus) using SmVAL11. EmVAL11 gene structure was manually predicted with all exon/intron junctions conforming to the consensus (GT/AG) splice donor/acceptor sequences for eukaryotes. Exons are represented by boxes with the length shown in base pairs above. Introns are represented by lines with the length shown in base pairs below. Exon regions coloured red represent regions encoding the SCP/TAPS domain.
Fig. 2Phylogenetic analysis of platyhelminth VAL proteins. In total, 237 platyhelminth SCP/TAPS domain amino acid sequences were aligned using ClustalW (Larkin et al. 2007) with Bayesian inference phylogenetic analysis performed using MrBayes software (version 3.1.2, WAG protein substitution model used, 3×106 generations run). The resulting unrooted consensus phylogenetic tree was visualized using Mesquite software. Branches are coloured to indicate the taxonomic class each sequence derives from: Trematoda (red), Cestoda (yellow), Turbellaria (blue) or Monogenea (green). Group 1 (dashed black line) and Group 2 proteins (solid black line) are indicated, as are the 2 major group 2 clades – Clade 2a (light grey line) and 2b (dark grey line). Examples of class-specific group 1 clades are highlighted red (trematode-specific), yellow (cestode-specific), green (monogenean-specific) or blue (turbellarian-specific) depending on the taxonomic class. Bayesian posterior probability support values greater than 0·6 are indicated. Species identifiers are as follows; Schistosoma mansoni (Sm), Schistosoma japonicum (Sj), Schistosoma haematobium (Sh), Opisthorchis viverrini (Ov), Fasciola hepatica (Fh), Fasciola gigantica (Fg), Clonorchis sinensis (Cs), Mesocestoides corti (Mc), Taenia asiatica (Ta), Taenia solium (Ts), Taenia saginata (Tsg), Moniezia expansa (Me), Echinococcus multilocularis (Em), Neobenedenia melleni (Nm), Dugesia japonica (Dj), Dugesia ryukyuensis (Dr), Schmidtea mediterranea (Smd) and Macrostomum lignano (Ml).
Fig. 3Diversity of domain architectures across platyhelminth VALs. (A) Cartoon representation of different domain architectures within platyhelminth group 1 VALs across different taxonomic classes. Signal peptides (represented by a yellow box) were identified by SignalP searches. A question mark indicates when the incomplete nature of the sequences did not allow for presence/absence of a signal peptide to be determined. Protein domains were identified by Pfam searches (red boxes represent SCP/TAPS domains (PF00188), white boxes represent low-density lipoprotein receptor domains (PF00057) and blue boxes represent fibronectin 2 domains (PF00040)). The M sequence subdomain (represented by green boxes) was identified by manual inspection of the alignment using the following amino acid convention derived from Gibbs et al. (2008) – C-X(2)-C-X(5-10)-C-X(5-15)-C (where C indicates a cysteine residue and X indicates any amino acid). SCP/TAPS domains containing the additional disulphide bond are represented by 2 circled ‘C’ letters. (B) Homology model of M. corti Crisp2 protein. The McCrisp2 M sequence subdomain is coloured white. Potential disulphide bonds are coloured yellow, with the cysteines involved in the formation of each disulphide bond labelled C1-C6. (C) Homology model of S. mansoni VAL4 (SmVAL4) protein. The SmVAL4 C-terminal region is coloured white, potential disulfide bonds are coloured yellow and the additional disulphide bond between Cysteine 26 and Cysteine 195 (where the starting Methionine is the first amino acid) indicated by an arrow. Homology models were produced, optimized and verified as described by Chalmers et al. (2008) using MODELLER version 9.1 (Eswar, 2006). Specific constraints employed to model the SmVAL4 Cys26-Cys195 disulphide bond did not adversely affect model quality by PROSA-web analysis (Wiederstein and Sippl, 2007). Models were visualized using MacPyMOL (DeLano Scientific LLC).