| Literature DB >> 24229052 |
Yang Yang1, Arjan Barendregt, Johannis P Kamerling, Albert J R Heck.
Abstract
Taking chicken Ovalbumin as a prototypical example of a eukaryotic protein we use high-resolution native electrospray ionization mass spectrometry on a modified Exactive Orbitrap mass analyzer to qualitatively and semiquantitatively dissect 59 proteoforms in the natural protein. This variety is largely induced by the presence of multiple phosphorylation sites and a glycosylation site that we find to be occupied by at least 45 different glycan structures. Mass analysis of the intact protein in its native state is straightforward and fast, requires very little sample preparation, and provides a direct view on the stoichiometry of all different coappearing modifications that are distinguishable in mass. As such, this proof-of-principal analysis shows that native electrospray ionization mass spectrometry in combination with an Orbitrap mass analyzer offers a means to characterize proteins in a manner highly complementary to standard bottom-up shot-gun proteome analysis.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24229052 PMCID: PMC3880056 DOI: 10.1021/ac403057y
Source DB: PubMed Journal: Anal Chem ISSN: 0003-2700 Impact factor: 6.986
Figure 1Full native ESI-MS spectra acquired on a modified Orbitrap Exactive of (a) unprocessed, (b) deglycosylated, (c) dephosphorylated and (d) deglycosylated and dephosphorylated ovalbumin from m/z 2000 to 6000. There are two series of proteoforms present in the spectra displayed in (b) and (d) because the deglycosylation is incomplete due to enzyme specificity of Endo F1. The inset shows the crystal structure of chicken ovalbumin with the two reported phosphorylation sites, the N-acetylated terminus and the glycosylation site annotated.
Figure 2Zoom in on the [M + 13H]13+ charge state, native ESI-MS spectra of (a) unprocessed, (b) deglycosylated, (c) dephosphorylated, and (d) deglycosylated and dephosphorylated ovalbumin. From a comparison of the spectra in (b) and (d), the two phosphorylation sites could be confirmed. The average mass of the “naked” ovalbumin polypeptide backbone (with N-acetylation, and one GlcNAc) determined from the spectrum (d) is 42995.35 Da, within 1.23 ppm of the expected mass (42995.29 Da). The abundance ratio between the maximum and minimum detectable and assigned proteoforms is ∼800.
Quantitative Glycan Profiling of Chicken Ovalbumina
| no. | mass (Da) | relative abundance % | composition | glycan (Da) | ref (Da) | deviation (Da) | structure | ref |
|---|---|---|---|---|---|---|---|---|
| 1 | 43848.35 | 3.99 | Hex4HexNAc2 | 1074.32 | 1072.95 | 1.38 | + | ( |
| 2 | 44010.59 | 91.83 | Hex5HexNAc2 | 1236.56 | 1235.09 | 1.47 | + | ( |
| 3 | 44047.12 | 24.76 | Hex4HexNAc3 | 1273.09 | 1276.14 | 3.05 | + | ( |
| 4 | 44171.01 | 74.12 | Hex6HexNAc2 | 1396.98 | 1397.23 | 0.25 | + | ( |
| 5 | 44211.57 | 35.37 | Hex5HexNAc3 | 1437.54 | 1438.28 | 0.74 | + | ( |
| 6 | 44252.13 | 18.89 | Hex4HexNAc4 | 1478.10 | 1479.34 | 1.23 | + | ( |
| 7 | 44292.56 | 8.13 | Hex3HexNAc5 | 1518.53 | 1520.40 | 1.87 | + | ( |
| 8 | 44333.64 | 6.02 | Hex7HexNAc2 | 1559.61 | 1559.37 | 0.24 | + | ( |
| 9 | 44374.07 | 8.51 | Hex6HexNAc3 | 1600.04 | 1600.42 | 0.38 | + | ( |
| 10 | 44415.46 | 100.00 | Hex5HexNAc4 | 1641.43 | 1641.48 | 0.05 | + | ( |
| 11 | 44454.94 | 46.13 | Hex4HexNAc5 | 1680.91 | 1682.53 | 1.62 | + | ( |
| 12 | 44494.04 | 8.81 | Hex3HexNAc6 | 1720.01 | 1723.60 | 3.59 | + | ( |
| 13 | 44577.60 | 9.87 | Hex6HexNAc4 | 1803.58 | 1803.62 | 0.04 | – | |
| 14 | 44618.60 | 33.22 | Hex5HexNAc5 | 1844.57 | 1844.67 | 0.10 | + | ( |
| 15 | 44656.56 | 8.32 | Hex4HexNAc6 | 1882.53 | 1885.73 | 3.19 | + | ( |
| 16 | 44698.16 | 2.89 | Hex3HexNAc7 | 1924.13 | 1926.79 | 2.66 | + | ( |
| 17 | 44740.27 | 1.34 | Hex7HexNAc4 | 1966.24 | 1965.76 | 0.48 | – | |
| 18 | 44780.97 | 9.61 | Hex6HexNAc5 | 2006.94 | 2006.81 | 0.13 | + | ( |
| 19 | 44816.14 | 1.81 | HeX5HexNAc6 | 2042.11 | 2047.88 | 5.77 | + | ( |
| 20 | 44909.27 | 4.38 | Sia1Hex5HexNAc5 | 2135.24 | 2135.77 | 0.53 | – | |
| 21 | 44947.26 | 2.04 | Sia1Hex4HexNAc6 | 2173.23 | 2176.82 | 3.59 | – | |
| 22 | 44981.91 | 0.47 | Hex6HexNAc6 | 2207.88 | 2210.01 | 2.13 | + | ( |
| 23 | 45071.09 | 2.62 | Sia1Hex6HexNAc5 | 2297.06 | 2297.91 | 0.85 | + | ( |
| 24 | 45105.92 | 2.52 | Sia1Hex5HexNAc6 | 2331.89 | 2338.96 | 7.07 | – | |
| 25 | 45143.66 | 0.74 | Hex7HexNAc6 | 2369.63 | 2372.15 | 2.52 | +, G | ( |
| 26 | 45199.09 | 0.20 | Sia2Hex5HexNAc5 | 2425.06 | 2426.86 | 1.80 | – | |
| 27 | 45267.54 | 1.89 | Sia1Hex6HexNAc6 | 2493.51 | 2501.11 | 7.59 | +, G | ( |
| 28 | 45307.79 | 2.04 | Hex8HexNAc6 | 2533.76 | 2534.29 | 0.53 | + | ( |
| 29 | 45348.27 | 0.94 | Hex7HexNAc7 | 2574.24 | 2575.35 | 1.11 | – | |
| 30 | 45470.87 | 1.28 | Hex9HexNAc6 | 2696.85 | 2696.44 | 0.41 | +, G | ( |
| 31 | 45511.56 | 2.42 | Hex8HexNAc7 | 2737.53 | 2737.49 | 0.04 | – | |
| 32 | 45551.56 | 1.52 | Hex7HexNAc8 | 2777.53 | 2778.54 | 1.01 | – | |
| 33 | 45632.73 | 0.32 | Hex10HexNAc6 | 2858.70 | 2858.58 | 0.12 | +, G | – |
| 34 | 45673.76 | 0.64 | Hex9HexNAc7 | 2899.74 | 2899.63 | 0.10 | +, G | ( |
| 35 | 45714.78 | 2.27 | Hex8HexNAc8 | 2940.75 | 2940.68 | 0.06 | – | |
| 36 | 45753.97 | 1.00 | Hex7HexNAc9 | 2979.94 | 2981.74 | 1.80 | – | |
| 37 | 45835.80 | 0.28 | Hex10HexNAc7 | 3061.78 | 3061.77 | 0.00 | +, G | ( |
| 38 | 45877.05 | 0.80 | Hex9HexhNAc8 | 3103.03 | 3102.83 | 0.20 | – | |
| 39 | 45917.96 | 1.00 | Hex8HexNAc9 | 3143.93 | 3143.88 | 0.05 | – | |
| 40 | 46001.66 | 0.23 | Sia1Hex8HexNAc8 | 3227.64 | 3231.78 | 4.14 | – | |
| 41 | 46039.44 | 0.27 | Hex10HexNAc8 | 3265.41 | 3264.97 | 0.44 | – | |
| 42 | 46080.34 | 0.50 | Hex9HexNAc9 | 3306.31 | 3306.02 | 0.29 | – | |
| 43 | 46120.66 | 0.62 | Hex8HexNAc10 | 3346.64 | 3347.07 | 0.44 | – | |
| 44 | 46203.77 | 0.16 | Sia1Hex8HexNAc9 | 3429.74 | 3434.98 | 5.23 | +, G | ( |
| 45 | 46241.18 | 0.13 | Hex10HexNAc9 | 3467.16 | 3468.16 | 1.01 | – |
Mass and relative abundance of each detected and annotated proteoform are listed. The table also includes the proposed glycan compositions in terms of Hex (hexose; galactose/Gal + mannose/Man), HexNAc (N-acetylhexosamine; N-acetylglucosamine/GlcNAc), and Sia (sialic acid; N-acetylneuraminic acid/Neu5Ac), calculated glycan mass, its deviation from the expected mass, and its relative abundance, all extracted from a single native ESI-MS spectrum. Additionally, a list of references, corresponding to earlier established ovalbumin glycan structures (see Figure S-3 of the Supporting Information), is presented.
For the interpretation from composition to glycan structures, see Figures S-2, 3, and 4 of the Supporting Information.
In this column, “+” refers to all cases wherein at least one structure was reported. Meanwhile, the possible “unusual” moieties in chicken ovalbumin glycan structures are highlighted: “G” refers to the Gal(αl-4)Gal unit.
Asterisked ref is for pigeon ovalbumin, the rest are for chicken ovalbumin. Compositions with no previous annotation are marked with “–’’.
Figure 3Characterization of 59 proteoforms in chicken ovalbumin. (a) Variations caused by the widespread glycosylation could be most easily identified and assigned using the spectrum of dephosphorylated ovalbumin. By comparing this spectrum in (a) to unprocessed ovalbumin in (b), proteoforms with either one or two phosphorylation sites occupied could be detected (with the single phosphorylated peaks annotated in purple). The proteoform lacking N-terminal acetylation is marked as N* in pink. The signals in the gray box are multiplied by a factor of 10 and highly enriched in the less reported glycan structures. We hypothesize that several of the complex-type glycans in the composition range 24–45 are in fact extensions of the well-established complex-type glycans from the composition range 1–23, namely, with terminal Gal(α1–4)Gal and polylactosamine units, in separated or mixed form.