| Literature DB >> 28900186 |
Alexander Pralow1, Marcus Hoffmann1, Terry Nguyen-Khuong1, Erdmann Rapp2,3, Udo Reichl1,4.
Abstract
To determine all potential N-glycosylation sites of a glycoprotein, one central aspect of every bottom-up N-glycoproteomic strategy is to generate suitable N-glycopeptides that can be detected and analyzed by mass spectrometry. Specific proteases, such as trypsin, bear the potential of generating N-glycopeptides that either carry more than one N-glycosylation site or are too long to be readily analyzed by mass spectrometry- both due to the lack of tryptic cleavage sites near the N-glycosylation site. Here, we present a newly identified cleavage specificity of flavastacin, a protease from Flavobacterium menigosepticum, which - up to now - was only reported to cleave peptide bonds N-terminal to aspartic acid residues. In contrast to literature, we could not confirm this N-terminal specificity of flavastacin for aspartic acid. However, for the first time, we show a unique cleavage specificity of flavastacin towards the C-terminus of N-glycosylated asparagine residues. Implemented in an N-glycoproteomic workflow the use of flavastacin can thus not only render data analysis much easier, it can also significantly increase the confidence of MS-based N-glycoproteomic analyses. We demonstrate this newly discovered specificity of flavastacin by in-depth LC-MS(/MS) analysis of complex-type glycosylated human lactotransferrin and bovine serum albumin peptides and N-glycopeptides that were generated by trypsin and flavastacin digestion. Following to this work, further elucidation of the efficiency, specificity and mode of action of flavastacin is needed, but we believe that our discovery has great potential to facilitate and improve the characterization of N-glycoproteomes.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28900186 PMCID: PMC5595805 DOI: 10.1038/s41598-017-11668-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Comparison of flavastacin and peptidyl-Asp metalloendopeptidase. The protein sequences of flavastacin (Q47899) and peptidyl-Asp metalloendopeptidase (Q9R4J4) are from Uniprot. Both sequences where compared using the protein BLAST tool from the national center for biotechnology information (free online tool: https://blast.ncbi.nlm.nih.gov/Blast.cgi?-PROGRAM=blastp&PAGE_TYPE=BlastSearch&BLAST_SPEC=blast2seq&LINK_LOC=blasttab). Sequences with similarities are illustrated in the lower box. The first sequence has a similarity of 11 out of 43 amino acids (23%). The second sequence has a similarity of 4 out of 9 amino acids (44%). Possible conservative amino acid changes are indicated with “+”.
Manually annotated N-glycopeptide sequences of human lactotransferrin (hLTF) from MS(/MS) spectra of nanoRP-LC-ESI-OT-MS2 (HCD) measurements after sequential digestion of hLTF with trypsin and flavastacin.
| N-Glycosylation Site | Experimental/Observed mass [Da] | Theoretical mass [Da] | Δ Mass [Da] | Peptide | Position | Modifications | RT [min] |
|---|---|---|---|---|---|---|---|
| 156 | 490.2640 | 490.2660 | 0.002 | (R)PFLN(W) | 153–156 | 24.73 | |
| 860.4960 | 860.4990 | 0.003 | (G)TLRPFLN(W) | 150–156 | 25.07 | ||
| 1127.6620 | 1127.6570 | −0.005 | (V)PIGTLRPFLN(W) | 147–156 | 40.09 | ||
| 1226.7230 | 1226.7260 | 0.003 | (N)VPIGTLRPFLN(W) | 146–156 | 41.54 | ||
| 1755.9520 | 1755.9540 | 0.002 | (R)TAGWNVPIGTLRPFLN(W) | 141–156 | 42.17 | ||
| 1340.7680 | 1340.7680 | 0.000 | (W)NVPIGTLRPFLN(W) | 145–156 | 42.93 | ||
| 497 | 856.4950 | 856.4961 | −0.001 | (N)IPM(−48)GLLFN(Q) | 490–497 | M(CAM) (−105) | 30.11 |
| 506.2990 | 506.2970 | −0.002 | (G)LLFN(Q) | 494–497 | 32.10 | ||
| 563.3210 | 563.3190 | −0.002 | (M)GLLFN(Q) | 493–497 | 36.07 | ||
| 1385.7223 | 1385.7246 | −0.002 | (R)TAGWNIPM(−48)GLLFN(Q) | 485–497 | M(CAM) (−105) | 39.58 | |
| 920.4930 | 920.4910 | −0.002 | (N)IP | 490–497 | MSO | 39.95 | |
| 904.4950 | 904.4960 | 0.001 | (N)IPMGLLFN(Q) | 490–497 | 45.56 | ||
| 791.4110 | 791.4120 | 0.001 | (I)PMGLLFN(Q) | 491–497 | 47.45 | ||
| 694.3590 | 694.3590 | 0.000 | (P)MGLLFN(Q) | 492–497 | 53.33 |
CAM - carbamidomethylation; MSO - methionine S-oxidation.
Figure 2Base peak ion-chromatogram (BPC) and oxonium ion related extracted ion-chromatograms (EIC) of MS(/MS) spectra of nanoRP-LC-ESI-OT-MS2 (HCD) measured hLTF after sequential digestion with trypsin and flavastacin. Starting from the top: BPC of MS spectra (grey), EIC of MS/MS spectra of HexNAc within the m/z range 204.086–204.088 [M + H]+ (red), EIC of MS/MS spectra of NeuAc-H2O within the m/z range 274.092–274.094 (green), EIC of MS/MS spectra of HexHexNAc within the m/z range 366.139–366.143 (blue). The ion-chromatograms are illustrated in the time range 10.64–60.95 min. The accepted mass error of the EIC of the specific oxonium ions is 5 ppm. Signals listed in Table 1 are marked using asterisks.
Figure 3Fragment ion spectra of nanoRP-LC-ESI-OT-MS2 (HCD) measured hLTF N-glycopeptides after sequential digestion with trypsin and flavastacin. (A) For site N497 the fragment ion spectrum of the N-glycopeptide sequence TAGWNIPM*GLLFN 497 with the N-linked glycan Hex5HexNAc4dHex1NeuAc1 and the corresponding precursor ion m/z 1184.1659 [M + 3 H]3+ is shown. The “*” indicates the carbamidomethylation of methionine. (B) For site N156 the fragment ion spectrum of the N-glycopeptide sequence TAGWNVPIGTLRPFLN 156 with the N-linked glycan Hex5HexNAc4dHex2NeuAc1 and the corresponding precursor ion m/z 1321.2479 [M + 3 H]3+ is shown. N-glycan structures and oxonium ions are illustrated according to CFG nomenclature[22]. The b-ion series are highlighted in green, the y-ion series in red and the B-ions in blue. The isotopic patterns of the precursors are shown at the upper right corner.
Proteome Discoverer results of a nanoRP-LC-ESI-OT-MS2 (HCD) measurement after sequential digestion of BSA with trypsin and flavastacin.
| Sequence | # PSM | Modifications | MH + [Da] | IonScore | ΔM [Da] |
|---|---|---|---|---|---|
| (K)TVMENFVAFVDK(C) | 2 | N5(Deamidated) | 1400.67510 | 107 | −0.002 |
| (R)MPCTEDYLSLILNR(L) | 2 | C3(Carbamidomethyl) | 1724.82927 | 98 | −0.005 |
| (K)LGEYGFQNALIVR(Y) | 42 | N8(Deamidated) | 1480.77873 | 98 | 0.004 |
| (K)LGEYGFQNALIVR(Y) | 13 | 1479.79204 | 97 | 0.000 | |
| (K)TVMENFVAFVDK(C) | 9 | 1399.69121 | 93 | −0.001 | |
| (K)TVMENFVAFVDK(C) | 1 | M3(Oxidation) | 1415.68498 | 93 | −0.003 |
| (K)VPQVSTPTLVEVSR(S) | 4 | 1511.84062 | 86 | −0.002 | |
| (P)CTEDYLSLILNR(L) | 3 | C1(Carbamidomethyl) | 1496.73870 | 83 | −0.002 |
| (R)RHPEYAVSVLLR(L) | 6 | 1439.81028 | 82 | −0.002 | |
| (M)ENFVAFVDK(C) | 7 | 1068.53703 | 77 | 0.001 | |
| (R)KVPQVSTPTLVEVSR(S) | 2 | 1639.93723 | 76 | −0.001 | |
| (K)DAFLGSFLYEYSR(R) | 2 | 1567.73613 | 76 | −0.007 | |
| (E)YGFQNALIVR(Y) | 6 | N5(Deamidated) | 1181.63017 | 75 | −0.001 |
| (G)EYGFQNALIVR(Y) | 5 | N6(Deamidated) | 1310.67229 | 74 | 0.000 |
| (M)ENFVAFVDK(C) | 2 | N2(Deamidated) | 1069.51872 | 71 | −0.001 |
| (K)LGEYGFQNAL(I) | 2 | 1111.54180 | 68 | 0.000 | |
| (K)KQTALVELLK(H) | 13 | 1142.71416 | 68 | −0.000 | |
| (Y)FYAPELLYYANK(Y) | 1 | 1491.75029 | 68 | −0.002 | |
| (K)LVNELTEFAK(T) | 1 | N3(Deamidated) | 1164.61382 | 65 | −0.001 |
|
| 4 | 1106.62212 | 63 | 0.002 | |
| (K)DAIPENLPPLTADFAEDK(D) | 2 | 1955.95671 | 60 | −0.003 | |
| (S)TPTLVEVSR(S) | 1 | 1001.56133 | 57 | −0.001 | |
| (K)HLVDEPQNLIK(Q) | 2 | 1305.71526 | 57 | −0.001 | |
| (R)HPEYAVSVLLR(L) | 2 | 1283.70691 | 56 | −0.004 | |
| (P)EYAVSVLLR(L) | 3 | 1049.59648 | 56 | −0.003 | |
| (E)YAPELLYYANK(Y) | 1 | 1344.68120 | 55 | −0.002 | |
| (Q)VSTPTLVEVSR(S) | 1 | 1187.66106 | 55 | −0.002 | |
| (F)LGSFLYEYSR(R) | 3 | 1234.60918 | 55 | −0.001 | |
| (N)FVAFVDK(C) | 2 | 825.44951 | 55 | −0.001 | |
| (N)LPPLTADFA(E) | 7 | 944.50633 | 54 | −0.001 | |
|
| 5 | 1064.59661 | 54 | −0.002 | |
| (Y)APELLYYANK(Y) | 1 | 1181.61870 | 54 | −0.001 | |
| (T)ALVELLK(H) | 1 | 785.51219 | 54 | −0.001 | |
| (E)YAVSVLLR(L) | 2 | 920.55394 | 52 | −0.002 | |
| (K)LVNELTEFAK(T) | 1 | 1163.62810 | 52 | −0.003 | |
| (Q)TALVELLK(H) | 3 | 886.55974 | 48 | −0.001 | |
| (G)FQNALIVR(Y) | 1 | 960.56017 | 47 | −0.002 |
MASCOT search against unspecific in-silico digestion of mammalian taxonomy (UniProt-KB/SwissProt database). Peptide sequences with the N-terminal cleavage of aspartic acid or deamidated asparagine (as indicated from the supplier of flavastacin) are underlined and bold.
Figure 4Specificity of flavastacin for the C-terminus of N-glycosylated asparagine. The amino acid sequence of the tryptic N-glycopeptide of hLTF with the N-glycan Hex5HexNAc4dHex1NeuAc1 linked to N-glycosylation site N156 is shown. The scissors symbolize trypsin; the shield stands for cleavage inhibition of trypsin due to proline (P). Flavastacin is symbolized by a pick, and its specific C-terminal cleavage of N-glycosylated asparagine, as well as its unspecific N-terminal cleavages are shown. The N-glycan structure is illustrated according to CFG nomenclature[22].