| Literature DB >> 33555887 |
Qian Dong1, Xinjian Yan1, Yuxue Liang1, Sanford P Markey1, Sergey L Sheetlin1, Concepcion A Remoroza1, William E Wallace1, Stephen E Stein1.
Abstract
This work presents methods for identifying and then creating a mass spectral library for disulfide-linked peptides originating from the NISTmAb, a reference material of the humanized IgG1k monoclonal antibody (RM 8671). Analyses involved both partially reduced and non-reduced samples under neutral and weakly basic conditions followed by nanoflow liquid chromatography tandem mass spectrometry (LC-MS/MS). Spectra of peptides containing disulfide bonds are identified by both MS1 ion and MS2 fragment ion data in order to completely map all the disulfide linkages in the NISTmAb. This led to the detection of 383 distinct disulfide-linked peptide ions, arising from fully tryptic cleavage, missed cleavage, irregular cleavage, complex Met/Trp oxidation mixtures, and metal adducts. Fragmentation features of disulfide bonds under low-energy collision dissociation were examined. These include (1) peptide bond cleavage leaving disulfide bonds intact; (2) disulfide bond cleavage, often leading to extensive fragmentation; and (3) double cleavage products resulting from breakages of two peptide bonds or both peptide and disulfide bonds. Automated annotation of various complex MS/MS fragments enabled the identification of disulfide-linked peptides with high confidence. Peptides containing each of the nine native disulfide bonds were identified along with 86 additional disulfide linkages arising from disulfide bond shuffling. The presence of shuffled disulfides was nearly completely abrogated by refining digest conditions. A curated spectral library of 702 disulfide-linked peptide spectra was created from this analysis and is publicly available for free download. Since all IgG1 antibodies have the same constant regions, the resulting library can be used as a tool for facile identification of "hard-to-find" disulfide-bonded peptides. Moreover, we show that one may identify such peptides originating from IgG1 proteins in human serum, thereby serving as a means of monitoring the completeness of protein reduction in proteomics studies. Data are available via ProteomeXchange with identifier PXD023358.Entities:
Keywords: IgG1; MS spectral library; NISTmAb; cysteine; disulfide; disulfide bond fragmentation; disulfide-linked peptides; full MS scan
Mesh:
Substances:
Year: 2021 PMID: 33555887 PMCID: PMC9278810 DOI: 10.1021/acs.jproteome.0c00823
Source DB: PubMed Journal: J Proteome Res ISSN: 1535-3893 Impact factor: 5.370
Classification of the Three Major Groups of Product Ions Observed in the Spectra of Disulfide Peptide Discussed in This Section, SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSSPVTKa
Symbols |, -H, and +S, -2H and +0 denote cleavage of amide, C–S, and S–S bonds, respectively.
Figure 1Building library of reference mass spectra for disulfide analysis.
Disulfide-Linked Peptides Detected for Each of the Nine Native Disulfide Linkages Obtained from the Triplicate LC–MS/MS Analyses of the 18 h NISTmAb Digest Partially Reduced with TCEP and Alkylated with IAMg,h
| bond name | position in protein | class | missed cleaved sites | peptide sequence | mass | charge | absolute abund. |
|---|---|---|---|---|---|---|---|
| 1 | HC22-HC97 | ||||||
| missed cleav. | 1 | 8213.882 | 5,6,7 | 9.3 × 1010 | |||
| MIFNFYFDVWGQGTTVTVSSASTK | |||||||
| 2 | ESGPALVKPTQTLTLTCTFSGFSLSTAGMSVGWIR | 9381.511 | 6,7 | 8.6 × 1009 | |||
| FNFYFDVWGQGTTVTVSSASTKGPSVFPLAPSSK | |||||||
| 2 | ESGPALVKPTQTLTLTCTFSGFSLSTAGMSVGWIRQPPGK | 8721.163 | 5,6,7,8 | 3.2 × 1009 | |||
| ARDMIFNFYFDVWGQGTTVTVSSASTK | |||||||
| 3 | ESGPALVKPTQTLTLTCTFSGFSLSTAGMSVGWIR | 9326.501 | 6,7,8 | 2.3 × 1009 | |||
| ATYYCARDMIFNFYFDVWGQGTTVTVSSASTK | |||||||
| 3 | ESGPALVKPTQTLTLTCTFSGFSLSTAGMSVGWIRQPPGK | 9888.792 | 7,8 | 8.3 × 1008 | |||
| ARDMIFNFYFDVWGQGTTVTVSSASTKGPSVFPLAPSSK | |||||||
| semi- tryptic | 0 | ESGPALVKPTQTLTLTCTFSGF | 4085.906 | 4,5 | 2.4 × 1009 | ||
| 0 | ESGPALVKPTQTLTLTCTFSGFSL | 4286.022 | 3 | 3.4 × 1008 | |||
| 2 | HC147-HC203 | ||||||
| missed cleav. | 1 | 7898.909 | 5,6 | 4.4 × 1009 | |||
| LGTQTYI | |||||||
| 1 | STSGGTAALGCLVK | 8259.11 | 6,7 | 4.8 × 1008 | |||
| PSSSLGTQTYICNVNHKPSNTKVDK | |||||||
| semi- tryptic | 0 | STSGGTAALGCLVK | 6869.428 | 5,6,7 | 4.5 × 1009 | ||
| YICNVNHKPSNTK | |||||||
| 0 | STSGGTAALGCLVK | 7069.544 | 6 | 1.8 × 1009 | |||
| TYICNVNHKPSNTK | |||||||
| 0 | STSGGTAALGCLVK | 4409.221 | 5,6 | 3.3 × 1008 | |||
| 0 | STSGGTAALGCLVK | 4122.073 | 4,5,6 | 2.2 × 1008 | |||
| 0 | STSGGTAALGCLVK | 4209.105 | 4,5 | 1.4 × 1008 | |||
| 0 | STSGGTAALGCLVK | 3836.904 | 4,5 | 9.3 × 1007 | |||
| 3 | HC264-HC324 | ||||||
| missed cleav. | 2 | TPEVTCVVVDVSHEDPEVK | 4537.287 | 4,5,6,7 | 6.6 × 1009 | ||
| 3 | TPEVTCVVVDVSHEDPEVK | 4965.526 | 5,6 | 1.5 × 1009 | |||
| 1 | TPEVTCVVVDVSHEDPEVK | 2748.299 | 4,5 | 9.4 × 1008 | |||
| 1 | TPEVTCVVVDVSHEDPEVK | 2756.336 | 4,5 | 2.8 × 1008 | |||
| semi- tryptic | 0 | TPEVTCVVVDVSHED | 1874.839 | 4 | 5.1 × 1008 | ||
| 0 | TPEVTCVVVDVSH | 1630.77 | 2 | 4.7 × 1008 | |||
| 0 | TPEVTCVVVD | 1307.61 | 2,3 | 3.4 × 1008 | |||
| 4 | HC370-HC428 | ||||||
| semi- tryptic | 0 | NQVSLTCLVK | 3231.563 | 4,5 | 7.6 × 1007 | ||
| 5 | LC23- LC87 | ||||||
| missed cleav. | 1 | DIQMTQSPSTLSASVGDRVTITCSASSR | 7320.344 | 5 | 1.0 × 1009 | ||
| GSGYPFTFGGGTK | |||||||
| 2 | VTITCSASSR | 6071.851 | 5,6 | 2.0 × 1008 | |||
| 6 | LC133-LC193 | ||||||
| missed cleav. | 1 | SGTASVVCLLNNFYPR | 3820.903 | 3,4,5,6,7 | 6.2 × 1010 | ||
| 2 | SGTASVVCLLNNFYPR | 4427.168 | 5,6 | 6.0 × 1008 | |||
| semi- tryptic | 0 | SGTASVVCLLNNFYPR | 2658.257 | 4 | 4.5 × 1007 | ||
| 7 | HC223-LC213 | missed cleav. | 1 | SCDKTHTC | 3467.639 | 3,4,5,6,7 | 2.6 × 1010 |
| 2 | SCDKTHTC | 3971.883 | 4,5,6,7 | 3.3 × 1009 | |||
| 3 | SCDKTHTC | 4788.299 | 4,5,6 | 2.7 × 1009 | |||
| 1 | SCDK | 1260.486 | 2,3,4 | 7.8 × 1008 | |||
| 8 and 9 | HC229-HC229 and HC232-HC232 | ||||||
| missed cleav. | 1 | THTCPPCPAPELLGGPSVFLFPPKPK | 5889.962 | 5,6,7,8,9 | 1.2 × 1010 | ||
| 2 | SC | 6323.125 | 4,5,6,7,8,9 | 2.2 × 1009 | |||
| semi- tryptic | 0 | THTCPPCPAPELLGGPSVFLFPPKPK | 4649.298 | 5 | 5.4 × 1008 |
VH, CH1, CH2, and CH3 refer to the variable and constant domains 1 to 3 in H chain, respectively.
VL and CL represent the variable and constant domains of the L chain.
H–L is the interchain bond linking the H and L chains.
Hinge region contains the two disulfide bonds connecting the two H chains.
Oxidized and/or metallated forms were also detected.
Intrapeptide form.
Notes: Bond: disulfide bond named by domains. Position: Cys position in each protein chain. Class: peptide class. Miss cleav.: total number of missed cleavage sites. Peptide sequence: sequence of two component peptides linked by a disulfide bond. Mass: mass of disulfide-linked peptide. z: charge state of disulfide-linked peptide. Absolute abund.: summed absolute abundance over all charge states of a peptide.
In Column 3, the fully tryptic peptide is shown in bold, and the relative abundance is calculated as the percentage of the tryptic peptide abundance over the total peptide abundance of the linkage site. Note that several peptides containing disulfide bonds 7, 8, and 9 were identified with carbamidomethylated (CAM) cysteine residues.
Selected Disulfide-Linked Peptide Ions with Varied Amino Acid Compositions and Charge States that Were Studied for Fragmentation Patterns under HCD and FT-CID
| # bond | bond name | Cys position in protein | disulfide-linked peptide sequence | peptide class | # ion | charge | precursor | replicate spectra |
|---|---|---|---|---|---|---|---|---|
| 2 | CH1 | 147–203 | STSGGTAALGCLVK | tryptic | 1 | 6 | 1320.494 | 42 |
| LSSVVTVPSSSLGTQTYICNVNHKPSNTK | 2 | 7 | 1131.996 | 157 | ||||
| 3 | CH2 | 264–324 | TPEVTCVVVDVSHEDPEVK | tryptic | 3 | 3 | 777.04 | 424 |
| 4 | 4 | 583.032 | 955 | |||||
| *EVTCVVVDVSHEDPEVK | semi-tryptic | 5 | 3 | 711.006 | 18 | |||
| *TPEVTCVVVDVSHED | 6 | 3 | 625.854 | 20 | ||||
| 7 | 4 | 469.717 | 14 | |||||
| 4 | CH3 | 370–428 | NQVSLTCLVK | tryptic | 8 | 4 | 962.213 | 149 |
| 9 | 5 | 769.972 | 582 | |||||
| 10 | 6 | 641.811 | 41 | |||||
| 11 | 7 | 550.268 | 29 | |||||
| 6 | CL | 133–193 | SGTASVVCLLNNFYPR | tryptic | 12 | 3 | 1186.257 | 17 |
| 13 | 4 | 889.945 | 277 | |||||
| 14 | 5 | 712.157 | 475 | |||||
| 7 | H-L | 223–213 | SCDK | missed cleavage | 15 | 3 | 421.169 | 30 |
Notes: The symbol * indicates semitryptic peptides. # bond and bond name: sequential number and name of bonds given by this work. Cys position in protein: Cys residue in protein sequence. Peptide class: tryptic digest products. # ion: sequential number of selected ions. Charge: charge states. Precursor m/z: the mass/charge for peptide ions in MS1. Replicate spectra: repeat spectra identifications.
Figure 2Summed abundances for each of the nine native disulfide bonds in the NISTmAb obtained by median results of triplicate analyses from three separate tryptic digests at 0.25, 2, and 18 h using partially reducing and non-reducing conditions. (A) Partially reduced peptide mapping using low concentration of TCEP at pH 8; (B) non-reduced peptide mapping at pH 8.
Figure 3Levels of scrambling occurring at pH 8 for each of Cys sites are observed from non-reduction and partial reduction experiments. Partial Red.: reduction by TCEP and alkylation by IAM. Non-Red.: no reduction and alkylation for antibody digestion. Cysteines are labeled by their residue number in L or H chain, e.g., HC264 denotes Cys at residue 264 of the H chain.
Figure 4Peptide identifications containing native disulfide bonds obtained from four triplicate analyses of the 18 h tryptic digest using different parameters for four protocol variations: non-reducing and partially reducing conditions at pH 7 and pH 8, respectively.
Figure 5Different types of product ions in the MS/MS spectrum of a triply charged disulfide-linked peptide, SGTASVVCLLNNFYPR_SS_YACEVTHQGLSSPVTK, at m/z 1186.257 with less than 5 ppm deviation from its theoretical value, from FT-CID fragmentation at NCE of 35%. (A) Sequencing disulfide linkage and (B) fully annotated MS/MS spectrum. The * symbol in green color represents fragments from amide bond cleavages and contains the intact disulfide bond. The annotation in red color denotes fragments from C–S bond cleavage.
Fragment Ion Annotation in an FT-CID MS/MS Spectrum of the Charge 3+ Disulfide-Linked Peptide, SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSSPVTKa
| class | group | cleavage site | mass error in ppm | assignment of fragment | charge | sequence | percent abundance | |
|---|---|---|---|---|---|---|---|---|
| intact disulfide bond fragment | 1a | single C-N cleavage in either peptide | 985.8223 | 1.1 | y9 | 3 | CLLNNFYPR | 5.5 |
| 1075.204 | 1.9 | y12-NH3 | 3 | SVVCLLNNFYPR | 18.5 | |||
| 1080.879 | 1.4 | y12 | 3 | SVVCLLNNFYPR | 25.4 | |||
| 1131.5491 | 2.2 | b(2)16-H2O | 3 | SGTASVVCLLNNFYPR | 10.2 | |||
| 1308.6421 | 1 | b9-H2O | 2 | SGTASVVCL | 6.4 | |||
| 1312.1265 | 1 | b(2)8-H2O | 2 | SGTASVVCLLNNFYPR | 19.5 | |||
| 1321.1309 | 0.3 | b(2)8 | 2 | SGTASVVCLLNNFYPR | 32.8 | |||
| 1385.1595 | 0.2 | b(2)9 | 2 | SGTASVVCLLNNFYPR | 47 | |||
| 1413.6871 | 4.6 | b11-H5NO | 2 | SGTASVVCLLN | 19.9 | |||
| 1431.2117 | 0.4 | b11 | 2 | SGTASVVCLLN | 26.5 | |||
| 1478.229 | 0.5 | y9 | 2 | CLLNNFYPR | 100 | |||
| 1488.2318 | 1.3 | b12 | 2 | SGTASVVCLLNN | 17.7 | |||
| 1504.7283 | 3.3 | b(2)12-H2O | 2 | SGTASVVCLLNNFYPR | 30 | |||
| 1513.7325 | 2.6 | b(2)12 | 2 | SGTASVVCLLNNFYPR | 27.7 | |||
| 1527.7645 | 1.4 | y10 | 2 | VCLLNNFYPR | 98.5 | |||
| 1539.2399 | 3.8 | b(2)13-2H2O | 2 | SGTASVVCLLNNFYPR | 38.4 | |||
| 1553.2588 | 2.7 | b13-NH3 | 2 | SGTASVVCLLNNF | 7.2 | |||
| 1557.2498 | 3.3 | b(2)13 | 2 | SGTASVVCLLNNFYPR | 48.7 | |||
| 1561.7668 | 0.7 | b13 | 2 | SGTASVVCLLNNF | 26 | |||
| 1577.3018 | 3.3 | y11 | 2 | VVCLLNNFYPR | 22.8 | |||
| 1612.3059 | 4.1 | y12-NH3 | 2 | SVVCLLNNFYPR | 18.7 | |||
| 1625.7781 | 1.8 | b14-H5NO | 2 | SGTASVVCLLNNFY | 6.9 | |||
| 1629.3011 | 0.6 | a14 | 2 | SGTASVVCLLNNFY | 6.6 | |||
| 1634.7865 | 0.1 | b14-NH3 | 2 | SGTASVVCLLNNFY | 15.6 | |||
| 1641.3032 | 2.8 | a(2)15 | 2 | SGTASVVCLLNNFYPR | 18.2 | |||
| 1643.2998 | 0.1 | b14 | 2 | SGTASVVCLLNNFY | 55.1 | |||
| 1646.7877 | 2.6 | b(2)15-NH3 | 2 | SGTASVVCLLNNFYPR | 30.1 | |||
| 1655.3022 | 1.8 | b(2)15 | 2 | SGTASVVCLLNNFYPR | 86.6 | |||
| 1696.8263 | 1.5 | b(2)16-H2O | 2 | SGTASVVCLLNNFYPR | 22.1 | |||
| 1705.8307 | 1 | b(2)16 | 2 | SGTASVVCLLNNFYPR | 31.8 | |||
| 1b | double C-N cleavage in one peptide | 1533.7521 | 3.1 | int5-15 | 2 | SVVCLLNNFYP | 13.1 | |
| 1c | double C-N cleavage in both peptides | 1306.1199 | 4.1 | [b(2)13-y10] | 2 | VYACEVTHQGLSS | 8.1 | |
| 1329.6398 | 1.2 | [a14:b(2)11] | 2 | SGTASVVCLLNNFY | 7 | |||
| 1373.158 | 2.8 | [a14:b(2)12] | 2 | SGTASVVCLLNNFY | 44.1 | |||
| 1404.179 | 4.9 | [b(2)14:y11] | 2 | VYACEVTHQGLSSP | 11 | |||
| cleaved disulfide bond fragments | 2a | C-S cleavage | 892.9614 | 1.2 | y(2)17-H2S | 2 | VYACEVTHQGLSSPVTK | 3.5 |
| 925.9394 | 3.3 | y(2)17 + S | 2 | VYACEVTHQGLSSPVTK | 11.2 | |||
| 1456.7136 | 4.1 | y12 + S | 1 | SVVCLLNNFYPR | 8.2 | |||
| 1772.8417 | 2.3 | y16 + S | 1 | SGTASVVCLLNNFYPR | 17.1 | |||
| 1850.8861 | 4.6 | y(2)17 + S | 1 | VYACEVTHQGLSSPVTK | 10.1 | |||
| fragments without disulfide bond | 3a | single C-N cleavage in both peptides | 435.2348 | 0.6 | y3 | 1 | YPR | 7.5 |
| 444.2815 | 0.4 | y(2)4 | 1 | PVTK | 12.8 | |||
| 531.3135 | 0.4 | y(2)5 | 1 | SPVTK | 7.3 | |||
| 582.3021 | 2.3 | y4 | 1 | FYPR | 10 | |||
| 618.3459 | 0.3 | y(2)6 | 1 | SSPVTK | 8.8 | |||
| 696.3465 | 0.2 | y5 | 1 | NFYPR | 18.5 | |||
| 788.4503 | 1.2 | y(2)8 | 1 | GLSSPVTK | 21.2 | |||
| 810.3889 | 0.5 | y6 | 1 | NNFYPR | 51.8 | |||
| 899.4845 | 1.4 | y(2)9-NH3 | 1 | QGLSSPVTK | 13.5 | |||
| 916.511 | 1.3 | y(2)9 | 1 | QGLSSPVTK | 12 | |||
| 923.4728 | 0.6 | y7 | 1 | LNNFYPR | 40.9 | |||
| 1036.5577 | 0.3 | y8 | 1 | LLNNFYPR | 29.9 | |||
| 1053.5677 | 1 | y(2)10 | 1 | HQGLSSPVTK | 23 | |||
| 1154.6155 | 0.8 | y(2)11 | 1 | THQGLSSPVTK | 44.5 | |||
| 1382.7231 | 3.1 | y(2)13 | 1 | EVTHQGLSSPVTK | 11.5 |
Notes: Class and group: fragment class and group. Cleav. site: peptide bond or disulfide bond. m/z: the mass/charge ratio for fragment ion. Error ppm: mass deviation between observed and theoretical m/z values. z: charge state of fragment ion. Sequence: sequence of fragment ion. Abund. % relative abundance of specific product ions with respect to the base peak in a MS2 spectrum.
Figure 6Comparison of three ion categories from 15 disulfide-linked peptide ions (Table ) by percent ion intensity. DSB fragments: ions linked by a disulfide bond. Non-DSB fragments: ions not containing a disulfide bond. Unassigned: unassigned ions.
Figure 7Distribution of disulfide-bond cleavages among five disulfide linkages obtained from 3200 high-quality MS2 spectra. Pks refers to fragment ions arising from disulfide cleavage.
Figure 8MS2 analysis of the charge/energy-dependence changes in HCD fragmentation of S–S and C–S bonds of disulfide-linked peptide, TPEVTCVVVDVSHEDPEVK_SS_CK at (A) m/z 583.032 of the charge 4+ ion at an NCE of 20 and (B) m/z 466.627 of the charge 5+ ion at an NCE of 24. The intensity axis is scaled down to an intensity range from 0 to 12% relative to the base peak to show low levels of disulfide bond cleavage in the HCD spectrum. * fragments contain an intact disulfide bond. C–S and S–S indicate cleavage from disulfide bonds.
Figure 9HCD spectrum of the charge 7+ disulfide-linked peptide ion, NQVSLTCLVK_SS_WQQGNVFSCSVMHEALHNHYTQK, at NCE = 24 shows dominant double cleavage that generated a number of major internal fragments containing a disulfide bond (in square brackets []). * fragments contain an intact disulfide bond.
Figure 10Head-to-tail plot of experimental and reference HCD tandem mass spectra of an ion containing a native disulfide bond CH2. It shows an experimental ESI-MS/MS spectrum acquired from 2D LC–MS/MS analysis of reduced human plasma reference material NIST SRM 1950 matching to the charge 4+ disulfide-linked peptide spectrum in the library, “TPEVTCVVVDVSHEDPEVK_SS_CK”. The library spectral similarity score is 930 (1000 as the highest).