Literature DB >> 19357640

GPIomics: global analysis of glycosylphosphatidylinositol-anchored molecules of Trypanosoma cruzi.

Ernesto S Nakayasu1, Dmitry V Yashunsky, Lilian L Nohara, Ana Claudia T Torrecilhas, Andrei V Nikolaev, Igor C Almeida.   

Abstract

Glycosylphosphatidylinositol (GPI) anchoring is a common, relevant posttranslational modification of eukaryotic surface proteins. Here, we developed a fast, simple, and highly sensitive (high attomole-low femtomole range) method that uses liquid chromatography-tandem mass spectrometry (LC-MS(n)) for the first large-scale analysis of GPI-anchored molecules (i.e., the GPIome) of a eukaryote, Trypanosoma cruzi, the etiologic agent of Chagas disease. Our genome-wise prediction analysis revealed that approximately 12% of T. cruzi genes possibly encode GPI-anchored proteins. By analyzing the GPIome of T. cruzi insect-dwelling epimastigote stage using LC-MS(n), we identified 90 GPI species, of which 79 were novel. Moreover, we determined that mucins coded by the T. cruzi small mucin-like gene (TcSMUG S) family are the major GPI-anchored proteins expressed on the epimastigote cell surface. TcSMUG S mucin mature sequences are short (56-85 amino acids) and highly O-glycosylated, and contain few proteolytic sites, therefore, less likely susceptible to proteases of the midgut of the insect vector. We propose that our approach could be used for the high throughput GPIomic analysis of other lower and higher eukaryotes.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19357640      PMCID: PMC2683718          DOI: 10.1038/msb.2009.13

Source DB:  PubMed          Journal:  Mol Syst Biol        ISSN: 1744-4292            Impact factor:   11.429


Introduction

Glycosylphosphatidylinositol (GPI) anchoring is a ubiquitous posttranslational modification (PTM) of proteins in lower and higher eukaryotes (McConville and Ferguson, 1993). In mammals, GPI biosynthesis is vital for embryonic development, and GPI-anchored proteins participate in important biological processes such as cell–cell interactions, signal transduction, endocytosis, complement regulation, and antigenic presentation (McConville and Ferguson, 1993; Orlean and Menon, 2007; Paulick and Bertozzi, 2008). In lower eukaryotes, particularly yeast and protozoa, GPI-anchored molecules have also been shown to have important biological functions (Ferguson, 1999; McConville and Menon, 2000; Orlean and Menon, 2007). In pathogenic protozoan parasites (e.g., Trypanosoma cruzi, Trypanosoma brucei, Leishmania major, and Plasmodium falciparum), for instance, GPI-anchored glycoconjugates may extensively coat the plasma membrane and are involved in many aspects of host–parasite interactions, such as adhesion and invasion of host cells, modulation and evasion from host immune response, and pathogenesis (McConville and Ferguson, 1993; Ferguson, 1999; McConville and Menon, 2000; Buscaglia ; Gazzinelli and Denkers, 2006; Acosta-Serrano ). In T. brucei, the GPI biosynthesis has already been validated as a molecular target for development of new drugs against African sleeping sickness (Smith ). To exploit GPIs as targets for the development of new therapies against other endemic protozoa (e.g., T. cruzi, P. falciparum, Leishmania spp.), a detailed, large-scale analysis of the GPI-anchored molecules expressed on the cell surface of these parasites (i.e., the GPIome) is of paramount importance. However, one of the major hurdles to study the large-scale temporal and spatial expression of GPI-anchored molecules in eukaryotic cells is the dearth of a universal and straightforward approach for GPI analysis (Ferguson, 1992; Hooper, 2001). The difficulty to develop a universal method to analyze GPI and GPI-anchored proteins may be due to their complex structure. The general structure of a GPI anchor comprises a lipid tail containing either a phosphatidylinositol (PI) or an inositolphosphorylceramide (IPC) moiety, attached to a glycan core consisting of a glucosamine (GlcN) residue followed by three mannose (Man) residues. In the third Man distal from the GlcN residue, usually an ethanolaminephosphate (EtNP) group attaches the GPI to the C-terminus of the protein. Further modifications in the anchor may occur, such as extra glycan, EtNP and/or aminoethylphosphonate (AEP) residues substituting the glycan core, and/or an extra fatty acid (acyl) group attached to the myo-inositol ring, increasing the complexity of the GPI structure (Ferguson, 1999; McConville and Ferguson, 1993). Owing to their complex structure and amphiphilic nature, GPIs and GPI-anchored proteins are somewhat difficult to be extracted, purified, and fully characterized. Elortza applied an approach to release GPI-anchored proteins from the plasma membrane by treatment with PI-specific phospholipases. Enzyme-treated and control samples were then separated by sodium dodecyl sulfatepolyacrylamide gel electrophoresis (SDS–PAGE) and the differential bands were excised and analyzed by mass spectrometry (MS) (Elortza ). Although this approach proved to be valuable in identifying GPI-anchored proteins, it did not provide any information on the GPI anchor structure itself. Another useful but somewhat limited approach is the extraction of GPI-anchored proteins with neutral detergents or 9% aqueous (aq.) n-butanol and further purification by hydrophobic interaction (HIC) or reverse phase chromatography (RPC) (Ferguson, 1992; Hooper, 2001). Both HIC and RPC methods separate biomolecules according to differences in their hydrophobicity. However, in contrast to RPC, HIC uses less denaturing buffers and/or organic solvents, and higher initial salt concentration to enhance the hydrophobic interactions between the biomolecule and the chromatography medium (stationary phase). On the other hand, RPC medium is generally more hydrophobic than that of a HIC medium, leading to stronger, more selective interactions between the biomolecule and the medium, and successful elution is achieved by the use of increasing concentration of organic solvent. A major drawback is that most stationary phases used for HIC and RPC have amphiphilic nature (i.e., hydrophobic groups attached to hydrophilic beads) and, therefore, their resolution is rather low. This might be due to the interaction of the resin with both hydrophobic (lipid) and hydrophilic (glycan) moieties of the GPI. Thus, here we hypothesize that the use of a more hydrophobic, polystyrene-divinylbenzene-based resin for RPC (e.g., POROS R1) (DePhillips ; Whitney ) could result in a much stronger, more selective interaction between the GPI lipid moiety and the covalently attached functional group (e.g., C4) of the stationary phase. This could considerably enhance the resolution of the process of purification of GPI-anchored molecules and free GPIs. T. cruzi is the etiologic agent of Chagas disease or American trypanosomiasis, a neglected tropical disease that affects over 11 million people and causes an estimated 50 000 annual deaths in Latin America (Dias ; Barrett ; Moncayo and Ortiz Yanine, 2006). Lately, Chagas disease has become a public health threat for the US and European countries like Spain, where an increasing number of chronically T. cruzi-infected migrants from endemic countries are residing in (Bern ; Piron ). There are only two commercial drugs (Benznidazole and Nifurtimox) available for the treatment of Chagas disease, and both are partially effective and highly toxic. In addition, an increasing number of drug-resistant T. cruzi strains have been reported (Urbina and Docampo, 2003; Wilkinson ). Moreover, thus far no human vaccine is available for treating or preventing Chagas disease (Garg and Bhatia, 2005; Dumonteil, 2007; Hotez ). Therefore, there is an urgent need for new therapeutic targets against T. cruzi. T. cruzi has four developmental stages or forms, two (i.e., epimastigote and metacyclic trypomastigote) dwelling in the triatomine insect vector, and two (i.e., amastigote and trypomastigote) in the mammalian host. The parasite can be transmitted by the bloodsucking insect vector (a Reduviidae, popularly known as the kissing bug), blood transfusion, organ transplantation, or congenitally. The vector-mediated transmission begins when metacyclic trypomastigotes present in the vector's excrement enter the host bloodstream through the insect's bite wound or exposed mucosal tissues. Metacyclic forms immediately invade a wide range of nucleated cells and transform into proliferative amastigote forms, which divide by binary fission and, 4–5 days later, transform into intracellular trypomastigote forms. These forms rupture the cell membrane and invade other surrounding cells, or enter the bloodstream to infect remote organs or tissues, or eventually another insect vector (Barrett ). T. cruzi GPI-anchored proteins are expressed in all developmental stages and encoded by thousands of members of multigene families, such as trans-sialidase (TS)/gp85 glycoprotein, mucin, mucin-associated surface protein (MASP), and metalloproteinase gp63. Some of these proteins (e.g., TS/gp85, mucins) have been shown to be essential for the infectivity of the parasite or its escape from the host immune response (Frasch, 2000; Acosta-Serrano , 2007; Buscaglia ; Yoshida, 2006; Alves and Colli, 2008). In addition, GPI anchors from T. cruzi were shown to be strong proinflammatory molecules, being critical in the modulation of the host immune response against this parasite (Almeida and Gazzinelli, 2001; Gazzinelli and Denkers, 2006). Thus, GPI-anchored proteins and GPI anchors themselves seem to be very attractive targets for new therapies against Chagas disease. Now we report the first large-scale analysis of the GPIome of a eukaryote, T. cruzi. Initially, we carried out a GPI-anchoring prediction analysis for all protein-encoding genes of the T. cruzi genome. Next, we developed a polystyrene-based (POROS R1) RPC to purify protein-free GPIs (glycoinositolphospholipids, GIPLs) and GPI-anchored proteins. This chromatographic step was then coupled to tandem MS, and used for the analysis of GIPLs and proteinase-released GPIs. Our results show that the GPIome of T. cruzi is much more complex than was known earlier, and provide new insights into the biosynthesis of T. cruzi GPI anchors, which could be eventually exploited as potential therapeutic target in Chagas disease.

Results

Genome-wise prediction of GPI-anchored proteins from T. cruzi

To estimate the extent of T. cruzi proteins that might be GPI anchored, we performed a prediction analysis using the FragAnchor algorithm (Poisson ). Proteins that will undergo GPI PTM contain a short hydrophobic C-terminal sequence that is removed before the en bloc addition of the complete GPI anchor by the transamidase complex (McConville and Ferguson, 1993; Ferguson, 1999). The newly formed C-terminal amino acid receiving the GPI is known as the ω (omega) site. The FragAnchor algorithm, thus, predicts the GPI-anchoring sites based on the short hydrophobic C-terminal amino acid sequence and the ω site. To test the sensitivity and accuracy of the prediction analysis, we used a database set containing all sequences of the T. cruzi mucin II (TcMUC II) family. This family of glycoproteins had their GPI-anchoring sites experimentally validated (Buscaglia ) and, thus, could be used as the training set for our analysis. Out of 624 TcMUC II sequences, 598 (95.8%) were highly probable to be GPI anchored (Table I), showing that the sensitivity of the prediction is likely high. Next, to test the accuracy of the prediction, we removed a short hydrophobic sequence comprising the last 20 amino-acid residues at the C-terminus of the sequences and analyzed them with the FragAnchor algorithm. With this approach only two sequences (0.3%) were wrongly predicted to be GPI anchored (Table I), so the prediction could be considered very accurate.
Table 1

Genome-wise prediction of putative GPI-anchored protein sequences in T. cruzi

DatabaseNumber of sequencesHighly probable GPI-anchored sequences% Total
TcruziDB v5.019 613230411.9
GenBank T. cruzi sequences41 478494111.7
GeneDB TcMUC II sequences62459895.8
GeneDB TcMUC II−20 AA sequencesa62420.3
    
Classification of the predicted T. cruzi GPI-anchored proteins from GenBank
Protein familyHighly probable GPI-anchored sequences% Total potentially GPI-anchored proteins
MASP (mucin-associated proteins)173035.0
    
Mucins126225.5
 TcMUC I1002.0
 TcMUC II112622.8
 TcMUC III20.0
 TcSMUG S200.4
 TcSMUG L140.3
    
TS/gp85 glycoproteins116023.5
trans-Sialidase110222.3
 gp90220.4
 gp82150.3
 Complement regulatory protein90.2
 Tc85/gp8580.2
 Flagellum-associated protein30.1
 SAPA10.0
    
Other families78916.0
 Amastin2164.4
 gp631432.9
 Mucin-like861.7
 TolT140.3
 Hypothetical proteins2074.2
 Others1232.5
    
 Total4941100

aThese sequences had 20 amino-acid residues from the C-terminal removed before the GPI prediction analysis, and served as negative controls for the prediction.

The prediction analysis using the genome sequences (TcruziDB v5.0) lead to the estimation that 11.9% of the sequences have potential GPI-anchoring site (Table I). By performing the analysis using the T. cruzi sequences deposited on the GenBank, 11.7% of the sequences were predicted to be GPI anchored (Table I). Of those predicted GPI-anchored proteins, the members of MASP (1730 sequences), mucin (1262 sequences), and TS/gp85 (1160 sequences) multigene families were the most abundant among the sequences deposited in the GenBank. Amastin, gp63, mucin-like, and TolT were also represented by multiple sequences (Table I). TolT is an immunogenic surface protein found on T. cruzi trypomastigotes (Quanquin ) with protein sequence identity and biochemical and biological traits in common with the bacterial TolA proteins, which are part of the Tol system of most gram-negative bacteria and are involved in the outer membrane stability (Lazzaroni ). Taken together, our GPI-anchoring prediction data indicates that a high proportion of the T. cruzi genes possibly encode GPI-anchored proteins.

Implementing a polystyrene-based RPC to purify free and protein-linked GPIs

Next, we developed a polystyrene-based RPC and tested it using a synthetic GPI (Yashunsky ) containing several by-products resulting from unsaturated fatty acid oxidation or removal during its prolonged storage as nuclear magnetic resonance (NMR) sample in [2H]6-DMSO solution. The GPI sample was loaded onto a ziptip manufactured with POROS R1 resin (C4 groups linked to polystyrene-divinylbenzene beads) and eluted step-wisely with increasing (40, 50, 60, and 70%) 2-propanol concentrations. Each fraction was then analyzed by negative-ion mode electrospray ionization (ESI)-MS (Materials and methods). The results clearly showed an efficient separation between the bona fide GPI species and lyso-GPIs, oxidized GPIs, and other by-products (Figure 1).
Figure 1

Fractionation of the synthetic trypomastigote mucin GPI and by-products using POROS R1 ziptip. The synthetic mucin GPI (Man-[EtNP]Man-Man2-[AEP]GlcN-InsP-1-O-alkyl-C16:0-2-O-acyl -C18:2-Gro) was purified through POROS R1 50 resin as described in Materials and methods. The top panel represents the MS spectrum of the starting material, whereas the bottom four panels represent the MS spectra of the ziptip fractions eluted with 40, 50, 60, and 70% 2-propanol, respectively. All observed GPI species are doubly charged, plus or minus sodium adduct. lyso-GPI, GPI species missing the C18:2-fatty acid at C-2 of the glycerol backbone; m/z, mass-to-charge ratio.

Then, we tested the suitability and efficiency of POROS R1 for the purification of GPI-anchored proteins. We analyzed an organic (9% n-butanol) extract enriched with GPI-anchored proteins from epimastigote forms of T. cruzi (Almeida ) (Figure 2A). This extract was spiked with small amount of biotin-labeled epimastigote mucins, and fractionated in a solid-phase extraction cartridge prepared with POROS R1 resin. The fractions were then monitored by SDS–PAGE and chemiluminescent assay (CLA) (Figure 2A). By CLA, the epimastigote mucins were shown to be eluted mainly with 22.5% aq. n-propanol (Figure 2B). Consistently with this result, the silver-stained gel showed two broad bands in the 34–67 kDa range with typical behavior of epimastigote mucins (Acosta-Serrano ; Buscaglia ) (bands 9 and 10, Supplementary Figure 1). These bands were excised, digested with trypsin, and sequenced by liquid chromatography (LC) coupled to MS2. As expected, here we were unable to detect any peptide sequence, most probably because (1) epimastigote mucins are highly glycosylated (Acosta-Serrano ; Buscaglia ), and (2) their protein core seems almost completely resistant to different proteases (e.g., trypsin and proteinase K) (Almeida et al, unpublished observation). We have observed earlier that on trypsin treatment, epimastigote mucins only slightly shift their relative migration on SDS–PAGE, suggesting that small peptide fragment(s) is(are) removed from the protein backbone (Almeida et al, unpublished observation). We have also reported that epimastigote mucins are only partially degraded by proteinase K, releasing the intact GPI anchor previously linked to their C-terminal end (Almeida ).
Figure 2

Purification of GPI-anchored proteins from T. cruzi epimastigotes using POROS R1 50 cartridge. (A) Schematic representation of the methodology used for the purification and analysis of GPI-anchored proteins. Briefly, epimastigote cells were delipidated with chloroform/methanol (1:1, v/v), followed by chloroform/methanol/water (1:2:0.8, v/v/v). Then, GPIs were extracted with 9% n-butanol and partitioned with n-butanol/water solution. The aqueous phase, rich in GPI-anchored proteins, was spiked with biotinylated epimastigote mucins (biotin-eMUC), purified in an octyl-Sepharose column. The GPI-rich extract was fractionated in a POROS R1 50 solid-phase extraction cartridge and the eluted fractions were analyzed by chemiluminescent assay (CLA) and SDS–PAGE, or digested with proteinase K and analyzed by electrospray ionization-tandem mass spectrometry (ESI-MS/MS). (B) CLA of POROS R1 50 fractions. Biotin-eMUC, biotinylated epimastigote mucins. (C) The fraction eluted with 22.5% n-propanol (equivalent to 2.7 × 107 epimastigote cells) was digested with proteinase K, purified in a POROS R1 50 ziptip, and analyzed in negative-ion mode using an ESI-QTOF-MS (Micromass Qtof-1, Waters). The top panel represents the MS spectrum of the purified epimastigote mucin (p-eMUC-GPI) fraction, whereas the bottom panel is the MS spectrum of the synthetic T. cruzi epimastigote mucin GPI (s-eMUC-GPI) (Man-[EtNP]Man-Man2-GlcN[AEP]-InsP-1-O-C16:0-alkyl-2-O -C16:0-acyl-Gro) (Yashunsky ). The asterisk indicates another species of GPI, previously characterized as [AEP]Man4-GlcN[AEP]-InsP-1-O-alkyl-C16:0-2-O-acyl-C16:0-Gro (Almeida ). (D) ESI-QTOF-MS2 spectrum of parent ion at m/z 917.0. The top panel represents the MS2 spectrum of the purified T. cruzi epimastigote mucin GPIs (p-eMUC-GPIs), whereas the bottom panel represents the MS2 spectrum of the synthetic eMUC-GPI (s-eMUC-GPI) purified by POROS R1. Noteworthily, the same ion species are observed in both spectra, confirming the assignment of the major parent ion at m/z 917.03 in (C). AAG, alkylacylglycerol; AEP, aminoethylphosphonate; Alk, alkyl; Gro, glycerol; GroP, 3-O-phosphoglycerol; Hex, hexose; HexN, hexosamine; Ins, myo-inositol; InsP, myo-inositolphosphate; M, molecular mass; m/z, mass-to-charge ratio; PI, phosphatidylinositol.

To further corroborate the presence of GPI-anchored proteins, the fraction eluted from the POROS R1 column with 22.5% aq. n-propanol was digested with proteinase K and analyzed by negative ion-mode ESI-MS. We could detect at least four ion species with m/z identical to purified eMUC-derived GPIs (eMUC-GPIs) (Previato ; Serrano ; Almeida ) that were further confirmed by fragmentation analysis (Figure 2C and D). Taken together, our results validate the use of POROS R1 for the purification of free GPIs and GPI-anchored proteins.

Analysis of protein-free GPIs by LC-MSn

To ultimately increase the sensitivity and the speed of our analysis, we packed a nanocapillary column with POROS R1 resin and coupled it to an LC-MS system. Then, we performed LC-MS2 and LC-MS3 analysis of organic (91% n-butanol, and combined chloroform/methanol and chloroform/methanol/water) extracts enriched for GIPLs (Figure 3A). GPIs are frequently analyzed in negative-ion mode ESI-MS, dissolved in solutions/buffers with neutral pH. However, under these conditions, the silica tubing has negative charges, which interact with amine (GlcN, AEP, and/or EtNP) groups present in the GPI. To neutralize the negative charge of the silica tubing, we analyzed our GPI samples in positive-ion mode ESI-MS, in the presence of 0.2% formic acid (FA), used as the ion pair for the LC-MS analysis. With the LC-MS approach, we could detect and characterize at least 78 doubly charged ion species of GIPLs (Table II; Supplementary Table I; Supplementary Figure 2A–C). Of those, 70 were found to be novel species (Supplementary Table I). The extracted-ion chromatograms for GIPLs obtained by either extraction procedure (9% n-butanol or combined chloroform/methanol and chloroform/methanol/water) (Figure 3A) clearly indicate that the majority (92–98%) of GIPLs contain ceramide (Cer) and only a small fraction (2–8%) have alkylacylglycerol (AAG) in the lipid moiety (Table II). In MS2, typically the fragmentation (collision-induced dissociation, CID) of the doubly charged GIPL parent-ion gave rise to fragment- or daughter-ions corresponding to the loss of Cer ([M−Cer+H]+), and to a series of ions containing AEP or EtNP residue attached to multiple (n=1–5) hexose (Hex) residues (e.g., Supplementary Figure 2C3, 2C5, 2C7, and 2C9). On the other hand, fragment-ions corresponding to the neutral loss of Hex residue(s) and AEP or EtNP were also highly abundant. We could also find a fragment corresponding to the inositolphosphate (InsP) attached to AEP-HexN (AEP-HexN-InsP) at m/z 529.3 (e.g., Supplementary Figure 2C3, 2C5, 2C7, and 2C9). Interestingly, some species had only one AEP residue and no EtNP. In these species, the AEP residue could be attached to either a Hex or HexN residue. For instance, the fragmentation of the GIPL species AEP-Hex6-HexN-InsP-C16:0/d18:0-Cer observed at m/z 1012.49 showed diagnostic fragment ions corresponding to the AEP-HexN attached to IPC (AEP-HexN-IPC) (m/z 1050.48) and Hex2-AEP (m/z 432.22) (Supplementary Figure 2C22).
Figure 3

(A) General GPIomic approach. Organic extracts rich in GIPLs and GPI-anchored proteins were obtained as described in Materials and methods. The fraction rich in GIPLs was directly analyzed by LC-MS2 and LC-MS3, whereas the fraction rich in GPI-anchored proteins were digested with proteinase K or trypsin before LC-MS2 and LC-MS3 analyses. In both cases, the LC step was carried out using a POROS R1 10 column (75 μm × 10 cm) and the MS analysis was performed using a LTQXL ESI-linear ion-trap-MS. The resulting spectra were analyzed manually for annotation and assignment of the GPI species. (B) Data-dependent acquisition (DDA) (no dynamic exclusion enabled) LC-MS2 analysis of epimastigote GPIs. The extracted-ion chromatogram was plotted for four major GPI species. The plotted ion species correspond to the dehydrated alkylacylglycerol (AAG—H2O) and the loss of alkylacylglycerol moiety (M—AAG). The insert shows details about the m/z and the area of each peak. (C) Annotated MS2 spectrum from GPI structure at m/z 975.18 corresponding to EtNP-Hex4-[AEP]HexN-InsP-1-O-alkyl-C16:0-2-O-acyl-C24:0-Gro. (D) MS3 spectrum for the dehydrated AAG fragment at m/z 649.56. (E) Proposed fragmentation and structure of the novel GPI species (EtNP-Hex4-[AEP]HexN-InsP-1-O-alkyl-C16:0-2-O-acyl-C24:0-Gro) observed at m/z 975.18. Ac, acyl; Gro, glycerol.

Table 2

Summary of glycan, lipid, and peptide composition of the GPIome of T. cruzi epimastigotes characterized by LC-MS/MS

Overall dataGPI-anchored molecule (origin or treatment)a
 GIPLs (CM and CMW)GIPLs (BuOH)Proteins (Proteinase K)Proteins (Trypsin)
Inositol content (picomoles per 1e7 cells)889.95±49.02b9.17±1.510.43±0.0020.43±0.002
Number of MS2 spectra11621965425
Number of identified species7044115
     
 Relative quantification by MS2 TIC (% total)c
Glycan moiety
 [AEP]Hex4-[AEP]HexN-InsP14.3111.43
 [AEP]Hex5-[AEP]HexN-InsP17.6146.243.82
 [AEP]Hex6-[AEP]HexN-InsP0.161.16
 [EtNP]Hex4-[AEP]HexN-InsP0.120.0183.9756.50
 [EtNP]Hex5-[AEP]HexN-InsP26.6835.471.3619.93
 [EtNP]Hex6-[AEP]HexN-InsP0.090.170.18
 [EtNP]Hex4-HexN-InsP0.198.31
 [AEP]Hex4-HexN-InsP or Hex4-[AEP]HexN-InsP1.310.16
 [AEP]Hex5-HexN-InsP or Hex5-[AEP]HexN-InsP0.03
 [AEP]Hex6-HexN-InsP or Hex6-[AEP]HexN-InsP52.0314.72
 Hex7-[AEP]HexN-InsP0.810.59
 Hex8-[AEP]HexN-InsP1.131.48
 Hex7-HexN-InsP0.03
 NANA-Hex7-[AEP]HexN-InsP0.01
     
 Total100.00100.00100.00100.00
     
Lipid moiety
 Alkyl-C16:0-acyl-C16:0-Gro7.482.1296.2491.69
 Alkylacyl-C33:0-Gro0.18
 Alkylacyl-C34-Gro0.150.080.80
 Alkylacyl-C34:1-Gro0.66
 Alkylacyl-C34:2-Gro0.220.258.31
 Alkyl-C16:0-acyl-C24:0-Gro0.132.49
 Alkylacyl-C42:1-Gro0.04
 C16:0/d18:0-Cer21.674.80
 C16:0/d18:1-Cer26.0011.29
 C16:0/t18:0-Cer0.30
 C22:0/d18:0-Cer0.64
 C22:0/d18:1-Cer0.010.08
 C22:0/t18:1-Cer0.23
 C23:0/d18:0-Cer0.10
 C23:0/d18:1-Cer0.210.75
 C24:0/d18:0-Cer19.2126.11
 C24:0/d18:1-Cer21.0948.84
 C24:0/d18:2-Cer0.120.16
 C24:0/t18:0-Cer0.050.14
 C24:0/t18:1-Cer1.14
 C25:0/d18:0-Cer0.420.39
 C25:0/d18:1-Cer0.521.31
 C26:0/d18:0-Cer0.640.92
 C26:0/d18:1-Cer0.791.22
     
 Total100.00100.00100.00100.00
     
Peptide    
 APTPGD100.00

aCM and CMW, CHCl3:MeOH (1:1, v/v) and CHCl3:MeOH:water (1:2:0.8, v/v/v), respectively; BuOH, 1-butanol.

bThe amount of GIPL-derived myo-inositol is overestimated because of the presence of large quantities of free phosphatidylinositol (PI) in this fraction.

cThe most abundant (>10%) glycan or lipid structures are shown in bold italics.

Still in the case of ceramide-containing GIPLs, the two most abundant daughter-ions observed by MS2 analysis were the one resulting from the neutral loss of ceramide ([M−Cer+H]+) and the dehydrated ceramide moiety itself ([CerH2O+H]+) (Supplementary Figure 2C3). Thus, to obtain more detail of the ceramide moiety, we performed MS3 analysis of the two most abundant ions resulting from the loss of H2O (at m/z 520.56 and 502.49). The ion species at m/z 520.56 gave rise to a C16:0-1-azirine derivative at m/z 280.24 and a dehydrated sphingoid (d18:1)-base derivative at m/z 264.41 (Hsu and Turk, 2000) (Supplementary Figure 2C4). 1-Azirine fatty acid derivatives were also frequent in MS2 spectra (e.g., Supplementary Figure 2C3, 2C7, 2C9, and 2C12), thus, the structure of many ceramides could be determined without the need of MS3 analysis. Most of the ceramides were dihydroxylated, but some of them had three hydroxyl groups instead (Table II; Supplementary Table I). The sphingoid base was invariably d18:0, d18:1, or d18:2, whereas the fatty acid varied in length from C16:0 to C26:0. Surprisingly, we found a few ceramide species containing odd-carbon number fatty acids (OCFAs), such as C23:0 and C25:0 (e.g., species 31, 33, 36, 40, 47, 60, 62–64, 66, and 67; Supplementary Table I). Interestingly, OCFAs have been described earlier in T. cruzi neutral glycosphingolipids (Barreto-Bergter ). Other trypanosomatids, such as Phytomonas and Herpetomonas, have also been reported to biosynthesize de novo a range of iso-branched C18, C20, and C22 polyunsaturated fatty acids (Fish ). In our case, we could not determine whether the observed OCFAs were linear or methyl-branched, or both. Furthermore, considering the calculated monoisotopic molecular mass, we cannot exclude the possibility that these fatty acids are actually oxidized C22:1 and C24:1 species. However, owing to their late retention time in the RPC (Supplementary Table I), more likely these fatty acids are OCFAs. Further studies are necessary to confirm the existence and precise structure of these unusual fatty acid species in T. cruzi. A few GIPL species containing AAG as the lipid tail were also found (Table II; Supplementary Table I; Supplementary Figure 2A and B). Unlike the ceramide-containing GIPLs, MS2 analysis of AAG-containing GIPL species revealed that fragments derived from the lipid tail were not always the most abundant ones (Supplementary Figure 2C25, 27–35). Therefore, we could not carry out data-dependent MS3 analysis of all AAG-containing GIPL species. Trying to circumvent this issue, we performed data-independent MS3 analysis by selecting the most abundant and frequent lipid moiety-derived ion species observed at m/z 537.6 (Supplementary Figure 2C27–30). Again, we observed that fragment ions derived from the AAG moiety had very low abundance and we could not obtain enough information to determine the fine composition of the lipid tail (data not shown). Thus, we annotated the less abundant AAG-containing GIPL species (e.g., C34:0-AAG, C34:1-AAG, C34:2-AAG, and C42:1-AAG) with the total number of carbons instead (Table II; Supplementary Table I). However, as in T. cruzi GPIs all 1-O-alkyl chains thus far described are C16:0 (Serrano ; Carreira ; Almeida ; Previato ), it is most likely that most if not all AAG-containing GIPL species observed in this study would have the same C16:0-alkyl chain composition. To relatively quantify the GIPL species, we exploited the MS2 total-ion chromatogram (MS2 TIC) approach (Asara ) (Materials and methods, and below). The quantification revealed that GIPL species containing ceramide (91.6–97.9%) are much more abundant than those having AAG (2.1–8.4%) as lipid tail (Table II; Supplementary Table I; Supplementary Figure 2).

Analysis of proteinase K-released GPIs by LC-MSn

Next, the fraction rich in GPI-anchored proteins was digested with proteinase K and analyzed by LC-MS2 and LC-MS3 (Figure 3A). Proteinase K has a broad range of specificity and cleaves at the carboxyl terminus of the polypeptide, thus ensuing GPIs that are free of amino-acid residues (Almeida ). The fragmentation pattern of proteinase K-released GPIs was very similar to that of the GIPLs (Figure 3C–E; Supplementary Figure 3). In fact, some of the AAG-containing GPIs had the same glycan and lipid structure as some of the GIPLs (Table II). Thus, we cannot discard the possibility that these AAG-containing GIPLs could be actually direct precursors of GPI anchors to be attached to proteins. It is also worth pointing out that all GPI structures derived from GPI-anchored proteins had AAG as the lipid moiety. Conversely, the majority (91.4–97.8%) of GIPLs had ceramide as the lipid tail (Table II; Supplementary Table I). It still remains to be determined why epimastigotes, in contrast to metacyclic trypomastigotes (Serrano ), do not express ceramide-containing GPI anchors in their major surface glycoproteins (e.g., mucins). Furthermore, we could not detect any major GIPL species in the proteinase K-released GPI analysis, indicating that the sample was free of these glycolipids. The major fragments from MS3 analysis of AAG were the ones resulting from the loss of either alkyl or acyl group. Noteworthily, all AAG structures we could determine had alkyl-C16:0 at the sn-1 position and variable-size fatty acyl group at the sn-2 position (Figure 3C–E; Supplementary Figure 3). In total, 11 species of GPI anchors were identified. Of those, only three had been described earlier (Previato ; Serrano ; Almeida ) and nine were found to be novel but much less abundant species (Figure 3B; Table II; Supplementary Table I; Supplementary Figure 3). The MS2 TIC quantification showed that the two major species [EtNP]Hex4-[AEP]HexN-InsP-alkyl-C16:0-acyl-C16:0-Gro; [AEP]Hex4-[AEP]HexN-InsP-alkyl-C16:0-acyl-C16:0-Gro) comprise about 95% of total protein-derived GPIs, whereas the least abundant species accounts for only 0.04% ([EtNP]Hex4-[AEP]HexN-InsP-alkylacyl-C42:1-Gro) (Supplementary Table I). The MS2 and MS3 spectra, and schematic fragmentation for one of the novel species ([EtNP]Hex4-[AEP]HexN-InsP-alkyl-C16:0-acyl-C24:0-Gro), representing ∼2.5% of the total protein-derived GPI anchors, is shown in Figure 3C–E.

Analysis of trypsin-released GPI-peptides by LC-MSn

In T. cruzi, mucins are one of the major GPI-anchored antigens expressed on the parasite cell surface, and they are involved in the escape from host immune response as well as adhesion and invasion of host cells (Acosta-Serrano ; Buscaglia ; Acosta-Serrano ). T. cruzi mucins are encoded by at least 863 genes, grouped in two major families (i.e., TcMUC and TcSMUG) (Buscaglia ; Acosta-Serrano ). We had shown earlier that we could identify the mucins expressed by the infective trypomastigote stage through the analysis of a short (3–4-mer) peptide containing the ω site still attached to the GPI anchor (GPI-peptide) (Buscaglia ). Here, we exploited this approach to sequence the GPI-peptide derived from mucins of noninfective epimastigotes and, therefore, to determine which gene family(ies) is(are) expressed on this parasite stage. Although epimastigote mucins are somewhat resistant to different proteases (e.g., trypsin, proteinase K) as observed by SDS–PAGE (not shown), the C-terminal region seems to be less glycosylated, thus, susceptible to proteolytic digestion. Thus, the mucin-rich (9% n-butanol) extract from T. cruzi epimastigotes was digested with trypsin and analyzed by LC-MS2 and LC-MS3 (Figure 3A). On the basis of the structures identified from the proteinase K-treated samples, we could rapidly match several fragments, such as AAG (m/z 537.6), AEP-HexN-PI (m/z 1065.5), Hex(AEP)HexN-PI (m/z 1227.6), and Hex2(AEP)HexN-PI (m/z 1389.7), thus partially determining the GPI-peptide structure (Figure 4A). By combining the partially determined GPI-structure and the possible peptides that could be generated by tryptic digestion and being attached to a GPI anchor (using the GPI-anchoring prediction), one peptide candidate could be the APTPGD sequence from mucin TcSMUG S. Indeed, the fragmentation of GPI-peptide species showed abundant fragments corresponding to the mass of this peptide attached to dehydrated EtN, EtNP, or EtNP-Hex1–4 (Figure 4A), similar to the fragmentation pattern described elsewhere (Redman ). However, the fragmentation of the peptide moiety in the MS2 was very poor impairing the sequence confirmation. To ultimately sequence the peptide and determine the ω site, fragments corresponding to the peptide attached to dehydrated EtN or AEP were subjected to data-independent MS3 analysis (Figure 4B; Supplementary Figure 4). The quality of MS3 spectra were good enough to enable the de novo sequencing of the peptide. With this approach, we could sequence peptides attached to five different GPI structures, four of them also detected after proteinase K digestion (Figure 4A–C; Table II; Supplementary Table I; Supplementary Figure 4). Although five GPI species were found, all these species were attached to the same peptide sequence (APTPGD), which corresponds to the carboxyl terminus of the TcSMUG S subfamily of mucins from T. cruzi (El-Sayed ; Buscaglia ; Acosta-Serrano ). This result was corroborated by amino-acid compositional analysis (not shown). The TcSMUG S subfamily comprises eight unique sequences (GenBank accession numbers XP_804663.1, XP_807370.1, XP_807371.1, XP_821038.1, XP_821039.1, XP_821040.1, XP_821041.1, and XP_821042.1), with predicted mature protein sequences varying from 56 to 85 amino acids. TcSMUG S sequences contain approximately 40% threonine in their composition (Buscaglia ), suggesting that they could be heavily O-glycosylated and, therefore, quite resistant to protease digestion as we have already observed (data not shown).
Figure 4

Analysis by LC-MS2 and LC-MS3 of GPI-peptides released by trypsin treatment. (A) MS2 spectrum of the GPI-peptide species at m/z 1188.5, which corresponds to APTPGD-EtNP-Hex4-[AEP]HexN-InsP-1-O-alkyl-C16:0-2-O -acyl-C16:0-Gro. (B) MS3 spectrum of the fragment corresponding to the peptide attached to dehydrated EtN at m/z 582.4. The peptide fragments are indicated. (C) Proposed fragmentation and structure of the GPI-peptide APTPGD-EtNP-Hex4-[AEP]HexN-InsP-1-O-alkyl-C16:0-2-O -acyl-C16:0-Gro. APTPGD, Ala-Pro-Thr-Pro-Gly-Asp.

Quantitative analysis of GPIs

The absolute quantification was done by analyzing the myo-inositol content of the samples (Materials and methods). As most of the identified GPIs have closely related structures and conserved ionizable groups (AEP, EtNP, GlcN, and phosphate), we speculated that their ionization efficiency by electrospray would be comparable. Thus, we would be able to relatively quantify the different species by the signal intensity of each GPI or GIPL species. First, we injected the synthetic GPI sGPI-C16:0 mixed with an equal amount of sGPI-C18:1 or sGPI-C18:2 (see Materials and methods for details). Although these GPI species have different lipid tails, their ionization efficiencies were almost identical (Supplementary Figure 5A and B). Next, we analyzed sample mixtures containing constant concentration of sGPI-C18:1 (5 μM), but variable concentration of sGPI-C16:0 (0.005–5 μM). The response in the MS was evaluated by comparing the signal intensities of the parent-ion or by MS2 TIC approach (Asara ). The quantification by comparing parent-ion signal intensities showed small errors (1.3–33.6%), but the response was linear only from 1:1 to 1:20 ratio (Supplementary Table II). On the other hand, we observed higher errors (9.5–48.2%) by MS2 TIC approach, similar to the ones described by Asara , but the linear response was shown to be much broader (from 1:1 to 1:500 ratio) (Supplementary Table II). As most of the identified GPI and GIPL species are poorly abundant, we decided to relatively quantify them by MS2 TIC approach. Combining both quantifications, we were also able to estimate absolute amounts of individual protein-derived GPI and GIPL species. To determine the sensitivity of our method, we performed LC-MS2 analysis with different amounts of GPIs. First, we performed the analysis in a data-dependent acquisition (DDA) mode, fragmenting the five most abundant ions with dynamic exclusion. We detected GPIs from as low as 5 × 105 parasites, equivalent to 21.5 femtomoles of total GPIs and 3.1 femtomoles for the least abundant species, as quantified by myo-inositol analysis by GC-MS (data not shown). Furthermore, MS spectra were also collected by multiple-reaction monitoring (MRM) mode, targeting the two major species of proteinase K-released GPIs at m/z 911.42 and 919.23 (Supplementary Table I). With this approach, we detected and characterized GPIs from as low as 1 × 105 parasites, equivalent to 4.3 femtomoles of total GPIs and 611 attomoles for the least abundant species observed (Figure 5A and B). It is also worth to point out that epimastigote is much smaller compared with higher eukaryote cells. Thus, our method would require less number of mammalian cells for doing the same analysis. A recent report characterized the GPI-anchor structure from T. brucei transferrin-receptor using 19.7 picomoles (Mehlert and Ferguson, 2008). Thus, our LC-MS approach is from ∼6000- (for DDA) to ∼30 000-fold (for MRM) more sensitive than current nonradioactive methods.
Figure 5

Analysis by LC-MRM-MS of proteinase K-released GPIs from T. cruzi epimastigotes. The two major GPI species originally observed at m/z 918.93 and 910.93 (protein-derived GPI species 1 and 2, respectively; Supplementary Table I) were subjected to MRM analysis (A) Extracted-ion chromatogram of the diagnostic ions corresponding to the dehydrated alkylacylglycerol (AAG—H20) ion-species at m/z 537.5, and the loss of alkylacylglycerol (M—AAG) at m/z 1300.1 and 1284.1. The peak detected at the retention time 30.93 min includes the two GPI species. For this experiment, the equivalent of 105 cells, corresponding to 4.3 femtomoles of total GPIs, was used. (B) MS2 spectrum of the major GPI species ([EtNP]Hex4-[AEP]HexN-InsP-1-O-alkyl-C16:0-2-O-acyl-C16:0-Gro) at m/z 919.

Retention times and column separation

The retention time of GPI species was determined as described in Materials and methods. Notably, the retention times of protein-derived GPIs and GIPLs containing the same lipid tail were very similar (Supplementary Table I), and apparently had no correlation with the size and different substituent groups (e.g., EtNP, AEP, Hex) attached to the glycan core. This finding indicates that the separation is likely given by the exclusive interaction of the lipid tail with the hydrophobic polystyrene-divinylbenzene-based C4 (POROS R1) resin. As expected, we also noticed that protein-derived GPIs and GIPLs with shorter, hydroxylated, and/or unsaturated lipid tails were eluted first from the column, compared with those with longer, nonhydroxylated, and/or saturated lipid tails (Supplementary Table I).

Discussion

Here, we present the first global analysis of GPI-anchored molecules from a human pathogenic eukaryote, T. cruzi. Our analysis included the genome-wise prediction of GPI-anchored proteins and the large-scale structural determination of protein-free and protein-linked GPIs. The prediction analysis showed that about 12% of T. cruzi genes possibly encode GPI-anchored proteins. This number is much higher compared with the African sleep sickness parasite, T. brucei (strain TREU927), which has only 132 (1.5%) of the 8750 sequences possibly GPI anchored (data not shown). Poisson performed GPI-anchoring prediction for several organisms, including other early divergent eukaryotes Plasmodium falciparum (0.19%) and Leishmania major (1.02%); yeasts, Saccharomyces cerevisiae (0.95%) and Cryptococcus neoformans (0.90%); plants, Arabidopsis thaliana (0.83%) and Gibberella zeae (1.13%); and mammals, human (0.74%), and mouse (0.82%). Our study clearly shows that the number of potential GPI-anchored sequences is much higher in T. cruzi than in any other studied organisms. The GPI-anchoring sequences in T. cruzi are concentrated in few large multigene families, such as MASP, mucin, TS/gp85, mucin-like, and gp63. Of these families, mucins, TS/gp85 glycoproteins, and gp63 metalloproteinases were shown to have important functions in the virulence and evasion/modulation of host immune response (Acosta-Serrano ). Although plenty of information has been obtained with the completion of the genome project, little information is available about how many and which GPI-anchored molecules are expressed on the parasite surface. There is also a limitation of microarray (transcriptome) and real-time PCR data for T. cruzi gene expression, as in this organism and other trypanosomes, protein synthesis is mostly controlled by posttranscriptional regulation processes, like mRNA stabilization and degradation (Jager ). In addition, microarray and real-time PCR analyses do not provide any information regarding absolute or relative expression, and structure of GPI-anchored proteins and GIPLs in different parasite stages and strains. Large-scale GPIomic analysis in eukaryotes is limited due to the lack of a fast, simple, and sensitive method to characterize GPI-anchored molecules (Ferguson, 1992; Hooper, 2001). One of the limiting factors has been the absence of a more efficient method to purify GPIs. The current methods are based on HIC or RPC resins and have a restricted efficiency, which might be due to the amphiphilic character of these resins. Thus, we hypothesized that using a polystyrene-divinylbenzene-based C4 resin (e.g., POROS R1) (DePhillips ; Whitney ), which is essentially hydrophobic, would increase the chromatographic resolution of the RPC in the purification of GPI-anchored protein-linked and free GPIs (GIPLs). Indeed, our results proved that our hypothesis was correct. The resolution achieved with the POROS R1 RPC in our study was exceptional, and we could resolve and identify a remarkable number (90 species) of GIPLs and GPI-anchored protein-derived GPIs. Then, to ultimate gain sensitivity and speed, we packed capillary columns with POROS R1 and analyzed T. cruzi epimastigote GPIs by LC-MS. First, we analyzed the protein-free GPI (GIPL) fraction. In total, we were able to identify 78 complete species of GIPLs. Of those, 70 were novel (Supplementary Table I). Up to now, only a few complete species of GIPLs had been detected and structurally characterized in T. cruzi (de Lederkremer , 1991; Previato ; Almeida ). Consistently, these species were also the most abundant found in this study in the GIPL sample extracted with 9% n-butanol. In an earlier report, the glycan cores of GIPLs from Y strain epimastigote forms of T. cruzi were analyzed by NMR (Carreira ). With that approach seven different structures for the glycan core were described. In this study, we found all those seven glycan cores and additional five novel ones. Three distinct series differing only in the number of hexose residues were found ([AEP]Hex5–6-[AEP]HexN-InsP, [EtNP]Hex46-[AEP]HexN-InsP, and AEP-Hex4–8-HexN-InsP) (Table I). In addition, one minor species (comprising only 0.03% of the total GIPLs) without either EtNP or AEP (Hex7-HexN-InsP) was observed (Table I; Supplementary Figure 2C63). Another minor species (0.01% of the total GIPLs) containing sialic acid (N-acetylneuraminic acid, NANA) (NANA-Hex7-[AEP]HexN-InsP) was also detected and characterized by MS2 analysis (Table I; Supplementary Figure 2C18). The presence of sialic acid-containing glycolipids in T. cruzi was proposed earlier, through in vitro radiolabeling using 3H-NANA-labeled fetuin in epimastigote and trypomastigote cultures, but no structural data had been provided (Confalonieri ; Zingales ). Further structural and biosynthetic studies need to be carried out to establish the detailed structure and expression of the sialic acid-containing GIPLs in different parasite stages. Taken together, our results clearly show a greater diversity of GIPL species than was known earlier, therefore, indicating a higher complexity of the GPI biosynthesis in T. cruzi. The analysis of proteinase K-released GPIs lead to the characterization of 11 different species. Of those, only three have been characterized earlier (Previato ; Serrano ; Almeida ). Most of the glycan structures had either two AEP residues ([AEP]Hex45-[AEP]HexN-InsP) or one AEP and one EtNP ([EtNP]Hex46-[AEP]HexN-InsP). One species, however, had only one EtNP ([EtNP]Hex4-HexN-InsP). Interestingly, the glycan cores containing one AEP residue (AEP-Hex4–8-HexN-InsP) were shown to be highly abundant in GIPLs, but they were not detected in protein-derived GPI anchors, suggesting some differential steps during the biosynthesis of GIPLs and GPI anchors to be attached to proteins. By analyzing the trypsin-released GPI-peptides, we could determine that the mucins expressed by the epimastigote forms of T. cruzi belong to the TcSMUG S subfamily. This subfamily of mucins comprises few genes that encode short polypeptides rich in Thr residues and, therefore, most likely to be highly O-glycosylated. The only other GPI-anchored polypeptide from epimastigotes with known amino-acid sequence is the NETNES (Macrae ). This molecule has four glycosylation sites, but only 13 amino-acid residues. We were unable to detect NETNES in our analysis, probably due to the fact that this glycoconjugate does not have potential sites for trypsin digestion (Macrae ). Furthermore, we cannot exclude that, akin to T. brucei variant surface glycoproteins (Ferguson ), some T. cruzi GPI-anchored glycoproteins could not be extracted from cell membranes with 9% n-butanol. In this case, a stronger (more hydrophobic) extraction procedure using detergent (e.g., Triton X-100) should be used. This is currently under investigation in our laboratory. Taken together, our results and others from the literature (Acosta-Serrano , 2007; Buscaglia ) suggest that the epimastigote cell surface is heavily coated by a variety of GIPL structures, and to a less extent by short and highly glycosylated GPI-anchored polypeptides. GIPLs were recently proposed to participate in the interaction of the parasite with the insect midgut (Nogueira ). The absence of protease-cleavable polypeptide chains may also help protecting the parasite against insect digestive enzymes. On the other hand, bloodstream trypomastigotes were shown to express mucins from the TcMUC II subfamily that are composed by highly diverse sequences (Buscaglia ). In agreement with these earlier data, our preliminary large-scale proteomic analysis of trypomastigotes suggests that this mammal-dwelling forms express hundreds, perhaps thousands, of GPI-anchored glycoproteins (including members of TS/gp85, MASP, and TcMUC II families) (Nakayasu et al, unpublished data). This high diversity of GPI-anchored proteins in trypomastigote forms may result in the lack of major immunodominant epitope(s) on the parasite surface, therefore, contributing for the immunoevasion of the parasite and its perpetuation within the mammalian host (Buscaglia ). In sum, here we showed that the GPIome of epimastigote forms of T. cruzi is highly complex. There were only few major GIPL (Carreira ; Almeida ) and eMUC-GPI (Previato ; Serrano ; Almeida ) species characterized earlier, and there were no ω-sites described for T. cruzi epimastigote mucins. This high complexity of GIPLs and GPI-anchored proteins might be important for the interaction between the parasite and the insect host. This is the first comprehensive study of the characterization of free GPIs (GIPLs), protein-bound GPIs, and ω sites of a eukaryotic organism. This new methodology was shown to be simple and highly sensitive. Our method could be used for the high throughput screening (HTS) of potential GPI biosynthesis-targeting drugs against pathogenic parasites (Smith ), as well as HTS of GPI-metabolism mutant cell lines. We also propose its use for the global analysis of the GPIome of other pathogenic eukaryotes and mammalian cells, including healthy and modified (cancer) cells, whose GPI anchor structures and roles are still mostly elusive (Paulick and Bertozzi, 2008).

Materials and methods

Prediction analysis of GPI-anchored proteins in the T. cruzi genome

The prediction analysis was carried out using the FragAnchor algorithm (Poisson ) (http://navet.ics.hawaii.edu/~fraganchor/NNHMM/NNHMM.html) against the TcruziDB v5.0 (http://tcruzidb.org/tcruzidb/, downloaded on August 1, 2005), T. cruzi (downloaded on March 17, 2008) and T. brucei TREU927 (downloaded on August 13, 2008) sequences from the GenBank (http://www.ncbi.nlm.nih.gov/sites/entrez?db=Protein&itool=toolbar), and mucin TcMUC II sequences from GeneDB (http://www.genedb.org/, downloaded on August 20, 2005). Only sequences with high probability were accepted as potentially GPI-anchored proteins.

Solvents and salts

Otherwise indicated, all solvents and salts used in this study were of the highest quality available (HPLC or molecular biology grade), from Sigma-Aldrich, EMD Chemicals, or JT Baker.

T. cruzi culture

Noninfective epimastigote forms of T. cruzi (Y strain) were grown in liver infusion tryptose medium at 28°C for 3–4 days, as described earlier (Camargo, 1964). Cells were harvested and washed twice in phosphate-buffered saline (PBS), pH 7.4, and centrifuged at 2000 g at 4°C for 10 min. The parasite pellet containing 3.6 × 1010 was then store at −20°C until use.

Extraction of free GPIs and GPI-anchored proteins

The parasite pellet (3.6 × 1010 cells) was placed in an ice-bath, resuspended in 2 ml HPLC-grade water, and immediately transferred to a 15-ml PTFE-lined, screw capped glass tube (Supelco). Then, methanol, chloroform, and water were added to give 15 ml of the mixture chloroform:methanol:water (1:2:0.8, v:v:v). After homogenization in vortex (1 min), the suspension was centrifuged for 20 min at 2000 g. The organic phase was removed and transferred to a 100-ml glass flask, and the pellet was extracted three times with 10 ml chloroform:methanol (C:M, 2:1, v/v) solution, and twice with 10 ml chloroform:methanol:water (C:M:W, 1:2:0.8, v/v/v) solution. Between extractions, the organic phase was separated from the pellet by centrifugation at 2000 g for 20 min, at room temperature, and saved. All organic phases were combined, dried under N2 stream, and the resulting organic extract was subjected to Folch's partition (Folch ). The upper (aq.) phase of the partition, rich in free GPIs (GIPLs), was dried under N2 stream and analyzed by LC-MS2 as described below. The resulting delipidated parasite pellet was dried under N2 stream and extracted 3 × with 4 ml 9% n-butanol, essentially as described (Almeida ). The GPI-anchored proteins were separated from residual GIPLs by n-butanol:water (1:1, v/v) partition. The aq. phase, enriched with GPI-anchored proteins, was dried in a vacuum centrifuge (Vacufuge, Eppendorf) and stored at −20°C until use.

Biotinylation of epimastigote mucins and CLA

Epimastigote mucins were extracted and purified using octyl-Sepharose column (GE Healthcare), as described (Almeida ). For biotinylation, the mucin sample was dried in a vacuum centrifuge, redissolved in 1 ml PBS containing 1 mM NHS-biotin (Pierce), and incubated for 2 h in an ice bath. The reaction was terminated by the addition of TrisHCl pH 8.8 to a 50-mM final concentration, and the sample was dialyzed for 24 h against MilliQ ultrapure water (Millipore), at 4°C. The labeling efficiency was monitored by CLA. Briefly, biotinylated mucins were diluted 10-fold in 50 mM carbonate-bicarbonate buffer pH 9.6, immobilized on polystyrene 96-well chemiluminescent ELISA microplate (Nunc, Invitrogen) and incubated overnight at 4°C. After blocking for 2 h at 37°C with PBS-0.1% bovine serum albumin (PBS-A), 50 μl horseradish-conjugated streptavidin (1:1000 dilution) (Invitrogen) in PBS-A was added. The plate was washed 3 × with PBS containing 0.1% Tween-20 (PBS-T) and developed with 50 μl chemiluminescent reagent (SuperSignal, Pierce) diluted five-fold in 200 mM carbonate/bicarbonate buffer, pH 9.6. Chemiluminescent readings were measured in a luminometer (Luminoskan Ascent, Labsystems, Finland).

Purification of synthetic GPIs

T. cruzi epimastigote and trypomastigote GPIs were synthesized as described earlier (Yashunsky ) and stored as NMR samples in [2H]6-DMSO solution. Synthetic GPI samples were purified (from by-products appeared during storage) in reverse phase ziptips, built in 200-μl micropipette tips with a small piece of glass-fiber filter and 50 μl 40 mg/ml POROS R1 (C4) 50 resin (50-μm average particle size, Applied Biosystems) (Whitney ) in 2-propanol. The ziptips were washed with 100 μl methanol and equilibrated with 5% 2-propanol/10 mM ammonium acetate (AA). About 1 nmol of synthetic GPI sample, containing several by-products, was resuspended in 100 μl of 5% 2-propanol/10 mM AA and after loading, the column was washed with the same buffer, followed by step-wise elution of increasing concentration (10–90%) of 100 μl 2-propanol containing 10 mM AA. Alternatively, we performed the ziptips by equilibrating the sample with 5% n-propanol and eluting with 80% n-propanol. The fractions were dried in a vacuum centrifuge and stored at −20°C until use.

Purification of GPI-anchored proteins from T. cruzi epimastigote extracts

Extracted GPI-anchored proteins from epimastigote forms were purified in POROS R1 solid-phase cartridges. The cartridges were assembled in solid-phase extraction supports with 1 ml POROS resin (40 mg/ml in 2-propanol). A glass fiber filter was laid over the resin bed to avoid disturbance during the sample loading and elution steps. The cartridge was washed with 4 ml 2-propanol and 4 ml methanol before being equilibrated with 4 ml 5% n-propanol. The purified mucin sample was redissolved in 5% n-propanol and spiked with biotinylated epimastigote mucins. After loading onto the cartridges, samples were eluted in increasing concentration (5, 10, 12.5, 15, 17.5, 20, 22.5, 25, 27.5, 30, 50, and 90%) of n-propanol. The fractions were dried in a vacuum centrifuge and analyzed by SDS–PAGE (Laemmli, 1970) stained with silver (Mortz ), and by CLA as above. The bands from the gel were excised, digested with trypsin, and sequenced by LC-MS2 (see below).

Protease digestion of GPI-anchored proteins

GPI-anchored proteins extracted with 9% butanol from the equivalent to 1.8 × 109 parasites (approximately 200 μg of total protein) were digested overnight at 37°C with 50 μg proteinase K (from Tritirachium album, lyophilized powder, ⩾30 units/mg protein, Sigma) or 2 μg sequencing-grade trypsin (Promega) in 100 mM ammonium bicarbonate, pH 8.0. The digested GPI samples were repurified in POROS R1 ziptips, as described above, and analyzed directly by ESI-MS2 using a Qtof-1 ESI-MS, or LC-MS using a LTQXL ESI-linear ion trap-MS.

ESI-MS2 analysis of GPI samples

Synthetic and purified GPI samples were resuspended in 50% n-propanol/10 mM AA and injected by infusion (500 nl/min) into an ESI-QTOF-MS (Qtof-1, Micromass, Waters). Spectra were collected in the negative-ion mode at 50–2000 mass-to-charge (m/z) range. The nanospray source was set at 1.5–2.5 kV, cone voltage 35 V, and the source and desolvation temperatures at 110 and 150°C, respectively. Each GPI species was subjected to fragmentation with collision energy from 30 to 50 eV.

LC-MSn analysis of GPI samples

Capillary columns (75-μm internal diameter by 10-cm length) were packed with 10 mg/ml POROS R1 10 resin (10-μm particle size; Applied Biosystems) (suspension in 100% acetonitrile (ACN)) with a high-pressure apparatus at 500 psi (Gatlin ). The quality of packing was monitored in a stereomicroscope. GPI samples were redissolved in 20% 2-propanol/0.2% FA and loaded onto the capillary column connected to a nanoHPLC system (1D-Plus, Eksigent). The samples were eluted in a gradient of 20–80% solvent B over 60 min (solvent A: 5% 2-propanol/0.2% FA; solvent B: 90% 2-propanol/0.2% FA) and directly analyzed in the linear ion-trap mass spectrometer (LTQ XL, Thermo Fisher Scientific). To compensate eventual decrease in the flow rate at high concentrations of organic solvent, the flow rate was set as a gradient from 350 to 450 nl/min within the same period of time. The ionization source was set to 1.9 kV and 200°C. The full MS scan was collected in positive mode from 400 to 2000 m/z range with a maximum injection time of 100 ms. The five most intense ions were submitted to fragmentation with 40% normalized collision energy and a maximum injection time of 150 ms. When the dynamic exclusion function was enabled, it was set to collect only twice and then to exclude for 30 s. To test the sensitivity of the method, the samples were also analyzed by the MRM approach. For this analysis, parent ions at m/z 911 and 919, which correspond to the major species of T. cruzi epimastigote mucin GPIs (Almeida ), were constantly selected and fragmented. In addition, to determine the structure of lipid tail or the small peptide sequence attached to the GPI, these fragments were analyzed by MRM/MS3. All the spectra were analyzed manually to assign the GPI structures.

Determination of retention times and relative quantification of GPI species

To determine the retention times, individual parent ions were plotted as extracted-ion chromatograms. The peaks were smoothed 13 times with the Gaussian method. The retention times were annotated as the top of individual peaks. To standardize the quantitative analysis, synthetic GPIs [Man-[EtNP]Man-Man2-[AEP]GlcN-InsP-1-O-alkyl-C16:0-2-O-acyl-C16:0-Gro (sGPI-C16:0), Man-[EtNP]Man-Man2-[AEP]GlcN-InsP-1-O-alkyl-C16:0-2-O-acyl-C18:2-Gro (sGPI-C18:2), Man-[EtNP]Man-Man2-[AEP]GlcN-InsP-1-O-alkyl-C16:0-2-O-acyl-C18:1-Gro (sGPI-C18:1)] were mixed in different ratios in 50% 2-propanol, 0.2% FA and analyzed in a linear ion-trap mass spectrometer (LTQXL, Thermo Fisher Scientific). The samples were injected with an automated system (Triversa Nanomate, Advion), set at 1.5 kV and 0.25 p.s.i. N2 pressure, to avoid cross-contamination between samples. Full (enhanced scan rate, 100 ms maximum injection time) or MS2 (normal scan rate, 3 a.m.u. isolation width, 35% normalized collision energy) scans were collected for 20 s. Each sample was run in triplicates. The relative quantification of the GPI species was done by the MS2 TIC (Asara ).

Identification of proteins separated by SDS–PAGE

The protein bands from the SDS–PAGE were excised and digested as described earlier (Shevchenko ). The resulting peptides were purified in POROS R2 ziptips, as described elsewhere (Jurado ). Resulting peptides were loaded into a capillary reverse phase column (PepMap, C18 3 μm, 15 cm × 75 μm, LC Packings, Dionex) connected to an Ultimate nanoHPLC (LC Packings, Dionex) in tandem to an ESI-QTOF-MS. The elution was performed with a flow rate of 300 nl/min in a gradient of 5–35% ACN/0.1% FA in 25 min, 35–72% ACN in 1 min, and hold at 72% ACN for 5 min. MS spectra in positive-ion mode were collected in the 400–1800 m/z range, and each peptide was fragmented (MS2) for 3 s in the 50–2050 m/z range. MS data were converted into peak lists (PKL format) and searched against TcruziDB v5.0 (Aguero ) (http://tcruzidb.org) using Mascot algorithm (Perkins ) (Matrix Science). Database search parameters were the following: trypsin as the digesting enzyme (one missed cleavage site allowed); 500 ppm for peptide mass tolerance for monoisotopic ion; 0.8 Da for fragment mass tolerance; and carbamidomethylation of cysteine residues and oxidation of methionine residues as fixed and variable modifications, respectively. We validated only proteins with P<0.05, according to the software. An ion-score cutoff of 20 was set to ensure the quality of valid peptides and to remove redundant protein hits. Spectra with no peptide matching were subjected to de novo sequencing using PepSeq tool from the MassLynx v4.0 software (Waters).

Quantification of myo-inositol content by GC-MS

The myo-inositol content in the sample was determined as described (Ferguson, 1992). Briefly, 20 pmol of scyllo-inositol (Sigma) internal standard was added to each sample in a heat-cleaned (500°C, 2 h), one-end flame-sealed borosilicate 250-μl capillary microtube. An external standard containing 20 pmol of myo-inositol (Sigma) and 20 pmol of scyllo-inositol was also prepared. Samples were dried and subjected to strong acid hydrolysis with 50 μl of 6 M constant boiling HCl at 110°C for 24 h. The products were dried, redried on 20-μl methanol addition, and derivatized for 30 min with 15 μl freshly made trimethylsilyl (TMS) reagent (hexamethyldisilazane:trimethylchlorosilane:pyridine, 15:5:100, v:v:v). The inositol-TMS derivatives were analyzed on a TRACE GC Ultra gas chromatography system linked to a Polaris Q mass spectrometer (GC–MS, Thermo Fisher Scientific) using a Supelco SP2380™ column (30 m × 0.25 mm ID × 0.25 μm). The initial oven temperature was set to 100°C for 3 min and then a gradient of 15°C/min up to 250°C was applied. The final temperature of 250°C was kept for 5 min. The splitless injector was held at 200°C, the MS transfer line at 260°C, and the carrier gas flow at 0.7 ml/min. Extracted-ion chromatogram for myo-inositol was plotted using the diagnostic fragment ions at m/z 305 and 318. For the myo-inositol quantification, the chromatogram peaks were integrated and the following formula was applied: [area sample myo peak/area scyllo internal standard peak] × [amount of internal standard/(area standard myo peak/area standard scyllo peak)]. Supplementary Tables 1 - 2 and Supplementary Figures 1 - 5
  62 in total

Review 1.  The Tol proteins of Escherichia coli and their involvement in the translocation of group A colicins.

Authors:  Jean-Claude Lazzaroni; Jean-François Dubuisson; Anne Vianney
Journal:  Biochimie       Date:  2002 May-Jun       Impact factor: 4.079

2.  GROWTH AND DIFFERENTIATION IN TRYPANOSOMA CRUZI. I. ORIGIN OF METACYCLIC TRYPANOSOMES IN LIQUID MEDIA.

Authors:  E P CAMARGO
Journal:  Rev Inst Med Trop Sao Paulo       Date:  1964 May-Jun       Impact factor: 1.846

Review 3.  An update on Chagas disease (human American trypanosomiasis).

Authors:  A Moncayo; M I Ortiz Yanine
Journal:  Ann Trop Med Parasitol       Date:  2006-12

4.  Structural characterization of neutral glycosphingolipids from Trypanosoma cruzi.

Authors:  E Barreto-Bergter; A B Vermelho; R Hartmann; G Pohlentz; R A Klein; H Egge
Journal:  Mol Biochem Parasitol       Date:  1992-04       Impact factor: 1.759

5.  Cleavage of structural proteins during the assembly of the head of bacteriophage T4.

Authors:  U K Laemmli
Journal:  Nature       Date:  1970-08-15       Impact factor: 49.962

6.  Chemical validation of GPI biosynthesis as a drug target against African sleeping sickness.

Authors:  Terry K Smith; Arthur Crossman; John S Brimacombe; Michael A J Ferguson
Journal:  EMBO J       Date:  2004-11-04       Impact factor: 11.598

7.  Structural characterization of NETNES, a novel glycoconjugate in Trypanosoma cruzi epimastigotes.

Authors:  James I Macrae; Alvaro Acosta-Serrano; Nicholas A Morrice; Angela Mehlert; Michael A J Ferguson
Journal:  J Biol Chem       Date:  2005-01-13       Impact factor: 5.157

8.  Analysis of glycosylphosphatidylinositol membrane anchors by electrospray ionization-mass spectrometry and collision induced dissociation.

Authors:  C A Redman; B N Green; J E Thomas-Oates; V N Reinhold; M A Ferguson
Journal:  Glycoconj J       Date:  1994-06       Impact factor: 2.916

Review 9.  The structure, biosynthesis and functions of glycosylphosphatidylinositol anchors, and the contributions of trypanosome research.

Authors:  M A Ferguson
Journal:  J Cell Sci       Date:  1999-09       Impact factor: 5.285

10.  FragAnchor: a large-scale predictor of glycosylphosphatidylinositol anchors in eukaryote protein sequences by qualitative scoring.

Authors:  Guylaine Poisson; Cedric Chauve; Xin Chen; Anne Bergeron
Journal:  Genomics Proteomics Bioinformatics       Date:  2007-05       Impact factor: 7.691

View more
  36 in total

1.  Vesicle and vesicle-free extracellular proteome of Paracoccidioides brasiliensis: comparative analysis with other pathogenic fungi.

Authors:  Milene C Vallejo; Ernesto S Nakayasu; Alisson L Matsuo; Tiago J P Sobreira; Larissa V G Longo; Luciane Ganiko; Igor C Almeida; Rosana Puccia
Journal:  J Proteome Res       Date:  2012-02-14       Impact factor: 4.466

Review 2.  Chemical synthesis of glycosylphosphatidylinositol anchors.

Authors:  Benjamin M Swarts; Zhongwu Guo
Journal:  Adv Carbohydr Chem Biochem       Date:  2012       Impact factor: 12.200

3.  Chemoenzymatic synthesis of glycosylphosphatidylinositol-anchored glycopeptides.

Authors:  Zhimeng Wu; Xueqing Guo; Zhongwu Guo
Journal:  Chem Commun (Camb)       Date:  2010-06-24       Impact factor: 6.222

4.  Improved proteomic approach for the discovery of potential vaccine targets in Trypanosoma cruzi.

Authors:  Ernesto S Nakayasu; Tiago J P Sobreira; Rafael Torres; Luciane Ganiko; Paulo S L Oliveira; Alexandre F Marques; Igor C Almeida
Journal:  J Proteome Res       Date:  2011-12-08       Impact factor: 4.466

5.  Structural features affecting trafficking, processing, and secretion of Trypanosoma cruzi mucins.

Authors:  Gaspar E Cánepa; Andrea C Mesías; Hai Yu; Xi Chen; Carlos A Buscaglia
Journal:  J Biol Chem       Date:  2012-06-15       Impact factor: 5.157

Review 6.  The Trypanosoma cruzi Surface, a Nanoscale Patchwork Quilt.

Authors:  Juan Mucci; Andrés B Lantos; Carlos A Buscaglia; María Susana Leguizamón; Oscar Campetella
Journal:  Trends Parasitol       Date:  2016-11-11

Review 7.  Lipid synthesis in protozoan parasites: a comparison between kinetoplastids and apicomplexans.

Authors:  Srinivasan Ramakrishnan; Mauro Serricchio; Boris Striepen; Peter Bütikofer
Journal:  Prog Lipid Res       Date:  2013-07-01       Impact factor: 16.195

8.  Proteomics of Trypanosoma evansi infection in rodents.

Authors:  Nainita Roy; Rishi Kumar Nageshan; Rani Pallavi; Harshini Chakravarthy; Syama Chandran; Rajender Kumar; Ashok Kumar Gupta; Raj Kumar Singh; Suresh Chandra Yadav; Utpal Tatu
Journal:  PLoS One       Date:  2010-03-22       Impact factor: 3.240

9.  Characterization of yeast extracellular vesicles: evidence for the participation of different pathways of cellular traffic in vesicle biogenesis.

Authors:  Débora L Oliveira; Ernesto S Nakayasu; Luna S Joffe; Allan J Guimarães; Tiago J P Sobreira; Joshua D Nosanchuk; Radames J B Cordero; Susana Frases; Arturo Casadevall; Igor C Almeida; Leonardo Nimrichter; Marcio L Rodrigues
Journal:  PLoS One       Date:  2010-06-14       Impact factor: 3.240

10.  Proteomic analysis of detergent-solubilized membrane proteins from insect-developmental forms of Trypanosoma cruzi.

Authors:  Esteban M Cordero; Ernesto S Nakayasu; Luciana G Gentil; Nobuko Yoshida; Igor C Almeida; José Franco da Silveira
Journal:  J Proteome Res       Date:  2009-07       Impact factor: 4.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.