| Literature DB >> 21245911 |
Russell M Morphew1, Hazel A Wright, E James Lacourse, Joanne Porter, John Barrett, Debra J Woods, Peter M Brophy.
Abstract
BACKGROUND: fasciola hepatica, along with Fasciola gigantica, is the causative agent of fasciolosis, a foodborne zoonotic disease affecting grazing animals and humans worldwide. Pathology is directly related to the release of parasite proteins that facilitate establishment within the host. The dominant components of these excretory-secretory (ES) products are also the most promising vaccine candidates, the cathepsin L (Cat L) protease family. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2011 PMID: 21245911 PMCID: PMC3014944 DOI: 10.1371/journal.pntd.0000937
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Figure 1Representative 2-DE protein arrays of in vitro and in vivo ES Cat L proteases.
Proteins were separated across a linear pH range of 4–7 using IEF in the first dimension and 11% SDS-PAGE in the second dimension and Coomassie blue stained. A) & C) 100 µg of F. hepatica ES Cat L proteases from in vitro culture. B) & D) 250 µg of Fasciola ES Cat L proteases from in vivo host bile analysis. In both C and D numbered and circled protein spots correspond to putative identifications located in Table 1.
Figure 2Further resolving the Cat L protease sub-proteome using micro range IPG strips.
Representative micro range 2-DE protein arrays of in vitro ES Cat L proteases. All 2-DE maps were run with 11% SDS-PAGE in the second dimension and (A) pH 4–7 (B) pH 4.9–5.7 (C) pH 5.5–6.7 in the first dimension. Numbered and circled protein spots correspond to putative identifications located in Table 1.
Identification of Cat L proteases from ES preparations by MSMS.
| ES Preparation | ||||||
| Spot Identifier | MS/MS Derived Peptides | Putitative Identity | Genbank Accession Number |
|
| Clade |
| 1 | - | Not Identified | Not Identified | − | + | N/A |
| 2 | 1- | Cathepsin L-like | CAA80445/6 | • (5.20) | • (6.29) | CL2 |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 5- | ||||||
| 3 | 1- | Secreted Cathepsin L2 | ACC47721 | • (9.24) | • (8.78) | CL2 |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 5- | ||||||
| 6- | ||||||
| 4 | - | Not Identified | Not Identified | • | N/A | |
| 5 | 1- | Cathepsin L-like | CAA80445/6 | • (4.70) | CL2 | |
| 2- | ||||||
| 3- | ||||||
| 6 | 1- | Cathepsin L-like/Cathepsin L | CAA80447 or AAF76330 | • (2.25) | • (3.77) | CL5 |
| 2- | ||||||
| 7 | 1- | Cathepsin L-like | CAA80445/6 | • (1.46) | CL2 | |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 5- | ||||||
| 8 | 1- | Cathepsin L-like/Cathepsin L | CAA80447 or AAF76330 | • (2.36) | • (1.84) | CL5 |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 9 | - | Not Identified | Not Identified | • | N/A | |
| 10 | 1- | Cathepsin L-like/Cathepsin L | CAA80447 or AAF76330 | • (0.91) | CL5 | |
| 2- | ||||||
| 1- | Cathepsin L/Cathepsin L-like | NFD | CL1 | |||
| 11 | 1- | Cathepsin L and Cathepsin L/Cathepsin L-like | NFD | • (2.87) | • (3.61) | CL1A |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 5- | ||||||
| 6- | ||||||
| 7- | ||||||
| 12 | 1- | Cathepsin L (Numerous Types) | NFD | − (0.32) | + (1.00) | CL1 |
| 13 | 1- | Cathepsin L (Numerous Types) | NFD | • (0.23) | CL1 | |
| 14 | 1- | Cathepsin L (Numerous Types) | NFD | • (1.55) | • (4.07) | CL1 |
| 15 | 1- | Cathepsin L | AAR99518 | • (3.03) | • (5.25) | CL1A |
| 2- | ||||||
| 3- | ||||||
| 16 | 1- | Cathepsin L | AAM11647 | • (6.61) | • (6.46) | CL1A |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 5- | ||||||
| 17 | 1- | Cathepsin L/Cathepsin L-like/Cathepsin L1 | NFD | • (1.85) | • (2.18) | CL1A/B |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 18A | 1- | Cathepsin L1 | CAC12806 | • (9.21) | • (14.81) | CL1B |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 5- | ||||||
| 18B | 1- | Cathepsin L | AAR99518 OR AAA29136 | • (3.23) | • (5.21) | CL1A |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 18C | 1- | Cathepsin L (Numerous types) | NFD | • (1.57) | • (2.17) | CL1A/B |
| 2- | ||||||
| 3- | ||||||
| 19 | 1- | Cathepsin L | AAM11647 | • (4.20) | CL1A | |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 5- | ||||||
| 6- | ||||||
| 20 | 1- | Cathepsin L/Cathepsin L-like/Cathepsin L1 | NFD | • (1.27) | CL1A/B | |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 21 | 1- | Cathepsin L (Numerous Types) | NFD | − (1.24) | + (3.46) | CL1A/1B |
| 22 | 1- | Cathepsin L-like | ACJ12893/4 | • (1.72) | • (2.55) | CL1D |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 5- | ||||||
| 6- | ||||||
| 23 | 1- | Cathepsin L/Cysteine Protease | AAM11647, AAR99518 or AAB30089 | • (13.85) | • (12.51) | CL1A |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 5- | ||||||
| 24 | 1- | Cathepsin L (Numerous types) | NFD | • (0.47) | CL1 | |
| 2- | ||||||
| 25 | 1- | Cathepsin L (Numerous types) | NFD | • (0.28) | CL1 | |
| 26 | - | Not Identified | Not Identified | • | N/A | |
| 27 | 1- | Cathepsin L-like | ACJ12893/4 | • (6.35) | • (6.01) | CL1D |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 5- | ||||||
| 28 | 1- | Cathepsin (Precursor) | AAA29136 | + (8.08) | − (2.94) | CL1A |
| 2- | ||||||
| 3- | ||||||
| 29 | 1- | Cathepsin L/Cathepsin L-like | NFD | • (1.14) | CL1A/B | |
| 30 | 1- | Cathepsin L-like | ACJ12893/4 | + (1.32) | − (0.55) | CL1D |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 5- | ||||||
| 31 | 1- | Cathepsin L-like | ACJ12893/4 | • (0.95) | CL1D | |
| 2- | ||||||
| 3- | ||||||
| 4- | ||||||
| 5- | ||||||
| 32 | 1- | Secreted Cathepsin L1/Cathepsin L/Cathepsin | AAB41670, AAP49831, AAM11647, AAA29136 | • (0.34) | CL1A | |
| 33 | - | Not Identified | Not Identified | • | N/A | |
| 34 | - | Not Identified | Not Identified | • | N/A | |
Peptide sequences were used to search against Genbank or a translated EST library for the identification of specific Cat L proteases. Single amino acids in bold type in MSMS sequences indicate proteomic identification of single amino acid polymorphisms (SAAP) deviating from published sequences and revealed though a translated EST database. Spots with accession numbers as NFD relate to spots where too few peptides were sequenced preventing isoform and sub-clade identification and were consequently not fully designated (NFD), although the clade could be defined. All data for protein identification, such as percentage coverage and search scores, can be seen in Tables S2 and S3.
Sequences derived from MSMS analysis were interpreted either, automated or manually (where manually interpreted using Masslynx version 3.5 sequences are denoted by a *). Sequenced amino acids that match exactly with those found in the Genbank database or translated EST database are underlined.
For MSMS spectra from peptides specific to each Cat L isoform see Figures S6, S7, S8, S9, S10, S11.
Protein accession numbers correspond to those from Genbank.
If protein spots were identified in vitro or in vivo they are denoted by •, if they are up or down regulated when compared with the other they are denoted with + or − respectively. The percentage contribution of each identified Cat L protease spot compared to the total Cat L proteases calculated using densitometry are in parentheses below the appropriate symbol (•,+ or −).
Figure 3MSMS evidence of novel sequence and SAAP.
MSMS spectra from (A) the analysis of a peptide (VTGYYTLHSGNEAGLK) sequenced from spots 22, 27, 30 and 31, belonging to a novel CL1D isoform (B) the analysis of a SAAP variant peptide (QFGLETESSYPYR) sequenced from many protein spots including the novel CL1D isoform. Sequencing was performed both automated and manually (in these cases automated) using MassLynx v 3.5.
Figure 4Evidence for SAAPs and non-synonymous single nucleotide polymorphisms (nsSNPs).
Amino acid residues and their corresponding nucleotide codons outlined in grey locate the nsSNPs seen between two polymorph sequences, between A and B and between C and D. Nucleotides outlined in bold are those responsible for the amino acid substitution. Fhep numbers seen to the left of the nucleotide sequences correspond to individual qlk numbers used to distinguish between ESTs in the F. hepatica EST database. The three chosen qlk numbers for each sequence presented here are from the top three hits identified when locally BLASTing the F. hepatica EST database yet are representative of all sequences matching each peptide amino acid sequence.
Figure 5Novel CL1D protease sequences.
Boxed and shaded in light grey are the predicted signal peptides using SignalP 3.0. Boxed and shaded in dark grey is the conserved GXNXFXD motif for autoactivation. Arrowed are the five amino acid substitutions varying between both contigs with the associated nucleotide substitutions above each. Individual boxed amino acids correspond to the active site residues with the exception of one, labelled with a *. This corresponds to the leucine at position 69 (papain numbering) dictating substrate specificity [61], [62]. The dashed line indicates the start of the N-terminus of the mature enzyme. A primary consensus sequence (Prim.cons.) is also included. Alignment was performed using ClustalW [63].
Figure 6Phylogenetic analysis of the Fasciola Cat L family.
Phylogenetic trees constructed using F. hepatica and F. gigantica Cat L protease nucleotide and amino acid sequences. All reported accession numbers are from Genbank with the suffix Fhep for F. hepatica and Fgig for F. gigantica. The origin of each Cat L sequence are reported in parentheses (ARG – Argentina, AUS – Australia, CHN – China, IDO – Indonesia, IRE – Ireland, JPN – Japan, NED – The Netherlands, PER – Peru, POL – Poland, POR – Portugal, SWI – Switzerland, THA – Thailand, TUK – Turkey and U.K. – United Kingdom). A) A neighbour-joining tree using nucleotide data constructed through MEGA v 4.0 with 1000 trial bootstrapped support using a Nei-Gojobori/Jukes-Cantor calculation. B) Neighbour-joining phylogenetic tree constructed using amino acid sequences through MEGA v 4.0 with 1000 bootstrapped support and a Poisson correction.