| Literature DB >> 19401778 |
Karen Luisa Haag1, Bruno Gottstein, Francisco Jose Ayala.
Abstract
Echinococcosis is a worldwide zoonotic parasitic disease of humans and various herbivorous domestic animals (intermediate hosts) transmitted by the contact with wild and domestic carnivores (definitive hosts), mainly foxes and dogs. Recently, a vaccine was developed showing high levels of protection against one parasite haplotype (G1) of Echinococcus granulosus, and its potential efficacy against distinct parasite variants or species is still unclear. Interestingly, the EG95 vaccine antigen is a secreted glycosylphosphatydilinositol (GPI)-anchored protein containing a fibronectin type III domain, which is ubiquitous in modular proteins involved in cell adhesion. EG95 is highly expressed in oncospheres, the parasite life cycle stage which actively invades the intermediate hosts. After amplifying and sequencing the complete CDS of 57 Echinococcus isolates belonging to 7 distinct species, we uncovered a large amount of genetic variability, which may influence protein folding. Two positively selected sites are outside the vaccine epitopes, but are predicted to alter protein conformation. Moreover, phylogenetic analyses indicate that EG95 isoform evolution is convergent with regard to the number of beta-sheets and alpha-helices. We conclude that having a variety of EG95 isoforms is adaptive for Echinococcus parasites, in terms of their ability to invade different hosts, and we propose that a mixture of isoforms could possibly maximize vaccine efficacy.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19401778 PMCID: PMC2671473 DOI: 10.1371/journal.pone.0005362
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Number of nucleotide substitutions in the 59 contigs obtained from the direct PCR sequencing, in different regions of the eg95 gene.
| Region | Sites | Invariant | Transitions | Transversions | Ambiguous | Indels |
| 5′ UTR | All | 4 | 0 | 0 | 0 | no |
| Exon 1 | All | 67 | 1 | 1 | 4 | no |
| 1st | 23 | 1 | 0 | 2 | ||
| 2nd | 22 | 0 | 0 | 1 | ||
| 3rd | 22 | 0 | 0 | 1 | ||
| Intron 1 | All | 542 | 14 | 10 | 64 | yes |
| Exon 2 | All | 284 | 14 | 5 | 14 | no |
| 1st | 96 | 4 | 1 | 2 | ||
| 2nd | 95 | 5 | 2 | 7 | ||
| 3rd | 93 | 5 | 2 | 5 | ||
| Intron 2 | All | 179 | 10 | 4 | 30 | yes |
| Exon 3 | All | 91 | 3 | 0 | 3 | no |
| 1st | 30 | 1 | 0 | 1 | ||
| 2nd | 32 | 0 | 0 | 1 | ||
| 3rd | 29 | 2 | 0 | 1 | ||
| 3′ UTR | All | 2 | 0 | 0 | 0 | no |
| Total | 1169 | 42 | 20 | 115 |
Invariant = sites showing no changes; transitions = purines changed to purines or pyrimidines to pyrimidines; transversions = purines changed to pyrimidines or vice versa; ambiguous = sites showing double-peaks; indels = deleted sites; UTR = untranslated region.
1st, 2nd, and 3rd refer to codon positions.
Figure 1Phylogenetic analyses of eg95 and em95 sequences.
a) Neighbor Joining tree based on the Kimura 2-parameters distance. Different E. granulosus, E. canadensis and E. ortleppi isolates are identified by a number, followed by the cox1 haplotype, and abbreviations referring to the geographic origin (Alg = Algeria, Arg = Argentina, Bra = Brazil, Eth = Ethiopia, Rom = Romainia, Spa = Spain) and to the respective host (Bv = cattle, Dr = dromedary, Hu = human, Ov = sheep, Pi = pig); for E. multilocularis, only the geographic origin (Aus = Austria, Fra = France; Jap = Japan, SLI = St Laurence Island, Swi = Switzerland) and the host (Mon = monkey, Ro = rodent) are indicated. b) Bayesian phyologeny constructed with model GTR+G+I. A subset of sequences obtained in the present study is compared to GenBank eg95-related sequences. ToW4 and ToW5/7 are Taenia ovis sequences published by Waterkeyn et al [9]; eg95-XJ, eg95-QH-2, eg95-QH-1, eg95-QH-3 are unpublished sequences deposited by Lin et al (2002) with accessions AF465599, AY421717, AY421716 and AY421718, respectively; eg95-2, eg95-2′, eg95-1, eg95-3, eg95-4, eg95-7-pseudogene, eg95-6 and eg95-5 were obtained by Chow et al [3]; AF359277 and AF414455 were submitted directly to GenBank by Lu et al (2001); and em95 is the E. multilocularis sequence published by Gauci et al [4]. Numbers on each node represent the bootstrap support (a) or the posterior probability (b). Arrows indicate outlier sequences with respect to the mitochondrial cox1 haplotype (see text).
Figure 2Tracking the unambiguous amino acid replacements along the phylogeny of EG95 isoforms.
Particular amino acid changes, and their positions, are indicated on each branch. The predicted secondary structure for each isoform is represented schematically to illustrate evolutionary convergence. Red bars are alpha-helix regions; blue arrows are conserved beta-sheets and green arrows are variant beta sheets.
Figure 3Positive selection analysis of secreted EG95 proteins deduced from nucleotide sequences isolated with the cloning experiments.
Proteins derived from distinct mRNAs are identified by the isolate number (see Table S1), cox1 haplotype, and a letter. The peptides corresponding to vaccine epitopes as described by Woollard et al [12], [15] are shown above the alignment. Below the consensus sequence, a graph shows the posterior probability (y-axis) of each site (x-axis) showing one of three rates of non synonymous to synonymous substitutions (ω), as estimated by model M3 [24].
Figure 4Positive selection analysis of eg95 and em95 mRNA sequences encoding the secreted protein.
a) Predicted secondary structure of EG95 isoforms. Amino acid differences in relation to E. granulosus A are shown in red; blue bars are alpha-helix regions; red arrows are conserved beta-sheets and green arrows are variant beta sheets. The regions corresponding to the fibronectin type III (Fn3) domain and the GPI anchor are indicated. b) Graphic representation of the estimated ratio of non synonymous to synonymous polymorphism within E. granulosus sensu lato (pi(a)/pi(s)) and divergence to E. multilocularis (K(a)/K(s)) calculated within a sliding window of 50 nucleotides moving in steps of 10 nucleotides along the eg95 CDS. The nucleotide positions corresponding to exons 1 and 2 are indicated. The downstream region of Exon2, encoding the C-terminal portion of the EG95 and EM95 Fn3 domain, and showing polymorphism in the number of beta-sheets, contains the highest level of non synonymous to synonymous polymorphism and divergence.