| Literature DB >> 25406038 |
Sjoerd van der Post1, Kristina A Thomsson, Gunnar C Hansson.
Abstract
The polymeric mucin MUC2 constitutes the main structural component of the mucus that covers the colon epithelium. The protein's central mucin domain is highly O-glycosylated and binds water to provide lubrication and prevent dehydration, binds bacteria, and separates the bacteria from the epithelial cells. Glycosylation outside the mucin domain is suggested to be important for proper protein folding and protection against intestinal proteases. However, glycosylation of these regions of the MUC2 has not been extensively studied. A purified 250 kDa recombinant protein containing the last 981 amino acids of human MUC2 was produced in CHO-K1 cells. The protein was analyzed before and after PNGase F treatment, followed by in-gel digestion with trypsin, chymotrypsin, subtilisin, or Asp-N. Peptides were analyzed by nLC/MS/MS using a combination of CID, ETD, and HCD fragmentation. The multiple enzyme approach increased peptide coverage from 36% when only using trypsin, to 86%. Seventeen of the 18 N-glycan consensus sites were identified as glycosylated. Fifty-six N-glycopeptides covering 10 N-glycan sites, and 14 O-glycopeptides were sequenced and characterized. The presented method of protein digestion can be used to gain better insights into the density and complexity of glycosylation of complex glycoproteins such as mucins.Entities:
Keywords: ETD; MUC2; N-glycosylation; O-glycosylation; chymotrypsin; mucus; subtilisin
Mesh:
Substances:
Year: 2014 PMID: 25406038 PMCID: PMC4261943 DOI: 10.1021/pr500874f
Source DB: PubMed Journal: J Proteome Res ISSN: 1535-3893 Impact factor: 4.466
Figure 1Overview of the MUC2-C sequence and the obtained sequence coverage. (A) The full MUC2 protein with the C-terminal highlighted. The recombinant protein used in this study is composed of the last 981 amino acids with the addition of a Myc-tag and GFP protein at the N-terminal. (B) Total peptide coverage (shaded in gray) from the combined MS analyses of tryptic, chymotryptic, subtilisin, or Asp-N digests. The GFP and Myc-tags are excluded from the figure. N-Glycosylation consensus sites (N-X-S/T) are underlined and in bold and named N1 to N18. Identified O-glycosylation sites (S/T) are highlighted in bold. Regions covered by peptides identified in the individual enzymatic digests are depicted below and shaded in color.
N-Glycopeptides Identified in the Different Enzymatic Digests of MUC2-C Analyzed by LC/MS/MS
| Δ | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 239 | 247 | Y | 240N6 | TRY | 2352.85 | 1078.56 | 1144.81 | 1.9 | 0 | 0 | 7 | 6 | ++ | |
| TRY | 2643.94 | 1078.56 | 1241.84 | 0.8 | 1 | 0 | 7 | 6 | ++ | |||||
| TRY | 2790 | 1078.56 | 1290.53 | 0.7 | 1 | 1 | 7 | 6 | + | |||||
| TRY | 2935.04 | 1078.56 | 1338.87 | 0.6 | 2 | 0 | 7 | 6 | ++ | |||||
| TRY | 3226.13 | 1078.56 | 1435.91 | 0.4 | 3 | 0 | 7 | 6 | ++ | |||||
| TRY | 1768.64 | 679.33 | 1224.99 | 0.3 | 0 | 1 | 5 | 4 | + | |||||
| 416 | 421 | FG | 418N8 | TRY | 2133.77 | 679.33 | 1408.06 | 938.71 | –1.7 | 0 | 1 | 6 | 5 | + |
| TRY | 2424.87 | 679.33 | 1035.74 | 0.2 | 1 | 1 | 6 | 5 | ++ | |||||
| TRY | 2498.9 | 679.33 | 1060.42 | –0.8 | 0 | 1 | 7 | 6 | + | |||||
| TRY | 2715.96 | 679.33 | 1132.77 | 0 | 2 | 1 | 6 | 5 | ++ | |||||
| TRY | 2790 | 679.33 | 1157.45 | –0.9 | 1 | 1 | 7 | 6 | ++ | |||||
| TRY | 3081.09 | 679.33 | 1254.48 | –0.2 | 2 | 1 | 7 | 6 | ++ | |||||
| 428 | 442 | T | 429N9 | SUB | 1622.58 | 1621.71 | 1082.44 | –2.7 | 0 | 0 | 5 | 4 | + | |
| SUB | 1913.68 | 1621.71 | 1179.47 | –1 | 1 | 0 | 5 | 4 | ++ | |||||
| SUB | 2204.77 | 1621.71 | 1276.51 | –2.6 | 2 | 0 | 5 | 4 | ++ | |||||
| 679 | 685 | DTCC | 683N12 | SUB | 1216.42 | 882.32 | 1050.38 | –0.9 | 0 | 0 | 5 | 2 | + | |
| SUB | 1257.45 | 882.32 | 1070.89 | –0.6 | 0 | 0 | 4 | 3 | + | |||||
| SUB | 1403.51 | 882.32 | 1143.92 | –1.5 | 0 | 1 | 4 | 3 | ++ | |||||
| SUB | 1606.59 | 882.32 | 1245.46 | 830.65 | 0 | 0 | 1 | 4 | 4 | +++ | ||||
| SUB | 1768.64 | 882.32 | 1326.49 | 884.66 | –2 | 0 | 1 | 5 | 4 | +++ | ||||
| 689 | 695 | C | 690N13 | TRY | 1257.45 | 881.37 | 1070.42 | –0.1 | 0 | 0 | 4 | 3 | + | |
| TRY | 1622.58 | 881.37 | 1252.98 | 835.66 | 1.5 | 0 | 0 | 5 | 4 | + | ||||
| TRY | 1768.64 | 881.37 | 1326.01 | 884.35 | 1.4 | 0 | 1 | 5 | 4 | ++ | ||||
| TRY | 1913.68 | 881.37 | 932.69 | –0.2 | 1 | 0 | 5 | 4 | + | |||||
| TRY | 2059.73 | 881.37 | 981.38 | –0.2 | 1 | 1 | 5 | 4 | + | |||||
| 688 | 693 | KC | 690N13 | CHY | 1216.42 | 721.34 | 969.89 | –1 | 0 | 0 | 5 | 2 | ++ | |
| CHY | 1622.58 | 721.34 | 1172.97 | –1.3 | 0 | 0 | 5 | 4 | ++ | |||||
| CHY | 1768.64 | 721.34 | 1246 | –0.5 | 0 | 1 | 5 | 4 | ++ | |||||
| 754 | 761 | KVD | 757N14 | SUB | 1768.64 | 915.50 | 1343.08 | 895.72 | –0.7 | 0 | 1 | 5 | 4 | ++ |
| SUB | 2059.73 | 915.50 | 1489 | 992.75 | 0.2 | 1 | 1 | 5 | 4 | ++ | ||||
| SUB | 2424.87 | 915.50 | 1114.47 | –1 | 1 | 1 | 6 | 5 | ++ | |||||
| SUB | 2790 | 915.50 | 1236.18 | –1.1 | 1 | 1 | 7 | 6 | + | |||||
| 743 | 761 | SSKCQDCVCTDKVD | 757N14 | CHY | 1768.64 | 2255.98 | 1342.55 | –0.2 | 0 | 1 | 5 | 4 | ++ | |
| CHY | 2059.73 | 2255.98 | 1439.58 | 0.3 | 1 | 1 | 5 | 4 | ++ | |||||
| CHY | 2350.83 | 2255.98 | 1536.61 | 0.2 | 2 | 1 | 5 | 4 | ++ | |||||
| 767 | 774 | THVPCNTS | 772N15 | SUB | 1403.51 | 914.39 | 1159.96 | –0.2 | 0 | 1 | 4 | 3 | + | |
| SUB | 1768.64 | 914.39 | 1342.52 | 895.35 | –1.1 | 0 | 1 | 5 | 4 | ++ | ||||
| SUB | 2059.73 | 914.39 | 1488.07 | 992.38 | –1.2 | 1 | 1 | 5 | 4 | +++ | ||||
| SUB | 2350.83 | 914.39 | 1089.42 | –0.3 | 2 | 1 | 5 | 4 | ++ | |||||
| SUB | 2424.87 | 914.39 | 1114.09 | –0.4 | 1 | 1 | 6 | 5 | + | |||||
| 820 | 829 | 821N16 | TRY | 1768.64 | 1275.54 | 1015.73 | –1 | 0 | 1 | 5 | 4 | ++ | ||
| TRY | 2059.73 | 1275.54 | 1112.77 | –1.1 | 1 | 1 | 5 | 4 | ++ | |||||
| TRY | 2350.83 | 1275.54 | 1209.8 | –0.3 | 2 | 1 | 5 | 4 | + | |||||
| 816 | 829 | SDPK | 821N16 | TRY | 1768.64 | 1702.74 | 1158.14 | –2.3 | 0 | 1 | 5 | 4 | ++ | |
| TRY | 2059.73 | 1702.74 | 1255.17 | 0.1 | 1 | 1 | 5 | 4 | ++ | |||||
| 813 | 825 | DFKSDPK | 821N16 | SUB | 2059.73 | 1618.71 | 1227.16 | –0.7 | 1 | 1 | 5 | 4 | +++ | |
| SUB | 2350.83 | 1618.71 | 1324.19 | –2.3 | 2 | 1 | 5 | 4 | + | |||||
| 814 | 824 | FKSDPK | 821N16 | CHY | 1768.64 | 1356.61 | 1042.76 | –1.9 | 0 | 1 | 5 | 4 | ++ | |
| or KSDPK | CHY | 2059.73 | 1357.62 | 1356.61 | 1139.79 | –1 | 1 | 1 | 5 | 4 | +++ | |||
| CHY | 2350.83 | 1356.61 | 1236.82 | –1.1 | 2 | 1 | 5 | 4 | + | |||||
| 838 | 845 | VS | 840N17 | SUB | 1216.42 | 903.41 | 1060.93 | –1.4 | 0 | 0 | 5 | 2 | ||
| 837 | 845 | SVS | 840N17 | SUB | 1216.42 | 990.44 | 1104.44 | –1.3 | 0 | 0 | 5 | 2 | ||
| 866 | 878 | TCTPR | 871N18 | SUB | 2059.73 | 1576.71 | 1213.16 | 0.1 | 1 | 1 | 5 | 4 | ++ | |
| or CTPR | SUB | 2350.83 | 1577.72 | 1576.71 | 1310.19 | –0.8 | 2 | 1 | 5 | 4 | ++ | |||
| SUB | 2424.87 | 1576.71 | 1334.87 | –1.6 | 1 | 1 | 6 | 5 | + | |||||
Theoretical peptide and glycan mass.
Glycopeptide mass measured in the orbitrap (details are described in Methods section).
NeuAc = N-acetylneuraminic acid; Hex = (Man and Gal); HexNAc = N-acetylhexosamine (GlcNAc); Fuc = Fucose.
Relative amounts are estimated by comparing peak intensities within the group of closely eluting glycoforms of the same peptide.
Coeluting precursor ions with composition GlcNAc2, GlcNAc2Man1–5 were also detected, but may be caused by in-source fragmentation, since they are not biologically relevant.
Figure 2HCD fragmentation spectrum of the N-glycopeptide from a chymotryptic digest of MUC2-C detected at m/z 1042.76 [M + 3H]3+, interpreted to have the sequence 815KSDPKNNCTFF (site 821N16, Figure 1B) and the N-glycan with the composition (Hex5HexNAc4Fuc). The mass chromatograms from LC/MS of the three N-glycopeptides with the same sequence are inserted.
Figure 3Pooled mass chromatograms of five N-glycopeptides labeled Gp 1–5 interpreted as glycoforms of the peptide 767THVPCNTS (site 772N15, Figure 1) from a subtilisin digest of MUC2-C (A) and pooled full scans collected between 14.5 and 15 min (B). Detected precursor ions and retention times of Gp1–5 are inserted in B. HCD fragmentation spectra of Gp 2 detected at m/z 1342.52 [M + 2H]2+ and Gp 3 detected at m/z 992.38 [M + 3H]3+ are shown in C and D, respectively.
Figure 4HCD fragmentation spectra of N-glycopeptides from a subtilisin digest of MUC2-C interpreted to have the sequence 679DTCCNIT (site 683N12, Figure 1B) and the monosaccharide compositions [Hex5HexNAc2] detected at m/z 1050.38 [M + 2H]2+ (A) and (Hex5 HexNAc4Fuc) at m/z 884.66 [M + 3H]3+ (B).
Figure 5HCD fragmentation spectrum of a O-glycopeptide at m/z 872.92 [M + 2H]2+ from a chymotryptic digest of MUC2-C interpreted to have the sequence 126GLRPYPSSVL (site Ser132, Figure 1B) and the monosaccharide composition (HexHexNAcNeuAc).