Literature DB >> 33555887

Comprehensive Analysis of Tryptic Peptides Arising from Disulfide Linkages in NISTmAb and Their Use for Developing a Mass Spectral Library.

Qian Dong1, Xinjian Yan1, Yuxue Liang1, Sanford P Markey1, Sergey L Sheetlin1, Concepcion A Remoroza1, William E Wallace1, Stephen E Stein1.   

Abstract

This work presents methods for identifying and then creating a mass spectral library for disulfide-linked peptides originating from the NISTmAb, a reference material of the humanized IgG1k monoclonal antibody (RM 8671). Analyses involved both partially reduced and non-reduced samples under neutral and weakly basic conditions followed by nanoflow liquid chromatography tandem mass spectrometry (LC-MS/MS). Spectra of peptides containing disulfide bonds are identified by both MS1 ion and MS2 fragment ion data in order to completely map all the disulfide linkages in the NISTmAb. This led to the detection of 383 distinct disulfide-linked peptide ions, arising from fully tryptic cleavage, missed cleavage, irregular cleavage, complex Met/Trp oxidation mixtures, and metal adducts. Fragmentation features of disulfide bonds under low-energy collision dissociation were examined. These include (1) peptide bond cleavage leaving disulfide bonds intact; (2) disulfide bond cleavage, often leading to extensive fragmentation; and (3) double cleavage products resulting from breakages of two peptide bonds or both peptide and disulfide bonds. Automated annotation of various complex MS/MS fragments enabled the identification of disulfide-linked peptides with high confidence. Peptides containing each of the nine native disulfide bonds were identified along with 86 additional disulfide linkages arising from disulfide bond shuffling. The presence of shuffled disulfides was nearly completely abrogated by refining digest conditions. A curated spectral library of 702 disulfide-linked peptide spectra was created from this analysis and is publicly available for free download. Since all IgG1 antibodies have the same constant regions, the resulting library can be used as a tool for facile identification of "hard-to-find" disulfide-bonded peptides. Moreover, we show that one may identify such peptides originating from IgG1 proteins in human serum, thereby serving as a means of monitoring the completeness of protein reduction in proteomics studies. Data are available via ProteomeXchange with identifier PXD023358.

Entities:  

Keywords:  IgG1; MS spectral library; NISTmAb; cysteine; disulfide; disulfide bond fragmentation; disulfide-linked peptides; full MS scan

Mesh:

Substances:

Year:  2021        PMID: 33555887      PMCID: PMC9278810          DOI: 10.1021/acs.jproteome.0c00823

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   5.370


Introduction

Disulfide bridges between cysteines (Cys) stabilize the three-dimensional structure of proteins, aiding with protein folding and ultimately enabling protein functionality.[1,2] Dysregulated structural disulfide bonds are characteristic of certain human diseases, such as various neurodegenerative diseases and cancers.[3] Therefore, detailed information on disulfide connectivity is critical for protein structure–function studies and biotherapeutics structure integrity assessment. Typical proteomics experiments dissociate disulfide bonds by reduction as part of the denaturing process and then cap Cys residues by alkylation to prevent the disulfide bond from re-forming. These well-established protein digestion methods can fully sequence individual proteins and identify a large fraction of low-abundance modifications in protein digests. Recently, two detailed mass spectral library-based examinations of the NISTmAb, a reference material of the humanized IgG1κ monoclonal antibody (RM 8671),[4] and its glycosylation were reported.[5,6] In contrast, MS analysis of disulfide bonds requires a different analytical approach. Recent advances in mass spectrometry have enabled the development of customized LC–MS strategies for disulfide bond mapping in proteins.[7−9] A major requirement in sample preparation is to prevent non-native disulfide formation (i.e., disulfide bond artifacts). The disulfide shuffling occurs spontaneously in the presence of free cysteines at pH 7.5–8.5, which is typical for protein digestion in standard proteomics protocols.[10,11] These artifacts can be minimized by blocking free cysteines and using acidic to neutral pH conditions. Methods for determining disulfide patterns include liquid chromatographic ultraviolet profile comparison between reduced and non-reduced peptides, direct mass spectrometry detection of disulfide without reduction or partial reduction, chemical labeling identification, and tandem mass spectrometry sequencing of non-reduced peptides.[12−17] Significant challenges remain in disulfide analyses due to complex intertwined disulfides, disulfide shuffling, incomplete digestion of non-reduced proteins, large peptide masses, and low ionization efficiency of disulfide-linked peptides.[18−24] Tandem mass spectra of peptides containing disulfide bonds are generally very complex because of various collisional dissociation products arising from cleavage of the component peptide backbone as well as disulfide bonds. There are no diagnostic fragments characteristic of disulfide spectra unlike other post-translational modifications that are readily detected by predominant ions resulting from residue side chain cleavage (e.g., oxonium ions of glycans, phosphoric acid (H3PO4) neutral loss of O-phosphorylation). Thus, most available bioinformatics tools currently are incapable of recognizing various disulfide-linked ions and interpreting their product ion spectra. The present study is a continuation of our previous work,[5,6] in which two comprehensive spectral libraries were developed for characterizing peptides, glycopeptides, and other biologically modified peptides derived from the NISTmAb. These libraries contain a large number of low-abundance modifications occurring on a wide range of residues. Notably, many modifications are not reported by most commonly used peptide identification programs since modified peptides do not fragment primarily by peptide backbone cleavages. Instead, their side chain cleavage reactions can dominate their product ion spectra. This study extends the prior work to enable the identification of another class of hard-to-find peptide products of digestion, namely, disulfide-bonded peptides. To identify and confirm the identity of these complex peptides, datasets generated by two digestion protocols, non-reduction and partial reduction, were chosen from a large amount of data obtained by a series of common protocols used in creating the above spectral libraries.[5] The data analysis method developed in this work employs full scan MS spectra to identify potential precursor ions and validates them by a detailed fragmentation analysis of their tandem mass spectra (MS/MS spectra). Resulting spectra then serve to create a searchable mass spectral library. As a bonus, we find and demonstrate the utility of these spectra for identifying digested, but incompletely reduced IgG1-derived peptides in human serum, thereby providing a unique measure of reduction effectiveness.

Materials and Methods

Materials

The primary sample 8670 NISTmAb (PS 8670, an in-house primary standard material), lot 3f1b, is an IgG1k mAb derived from a separate production lot of NISTmAb 8671 expressed in murine suspension culture.[4] It was obtained from the Bioanalytical Science Group at NIST. Digestion reagents guanidine hydrochloride, dithiothreitol (DTT), tris(2-carboxyethyl)phosphine (TCEP), N-ethylmaleimide (NEM), and iodoacetamide (IAM) were purchased from Sigma-Aldrich (St. Louis, MO). A revised Tris buffer (pH = 7) was prepared via adjustment of tris base solution (200 mmol) by HCl solution (1 mol/L). Sequencing-grade trypsin was purchased from Promega (Madison, WI). RapiGest, brand-name for sodium 3-[(2methyl-2-undecyl-1,3-dioxolan-4-yl)methoxy]-1-propanesulfonate, was purchased from Waters (Milford, MA); Zeba spin columns (7 K molecular weight cutoff (MWCO)) were purchased from Thermo Fisher Scientific (Waltham, MA). Chromatographic separations were performed on an Acclaim pepmap100 nano column (150 mm × 75 μm, C18, 3 μm particle size, 100 Å pore size, Dionex, Sunnyvale, CA).

Tryptic Digestion

To generate a wide variety of tryptic peptides that were collected in the NISTmAb peptide spectral library, parameters for a series of digestion protocol variations were used during denaturing, reduction, and alkylation of the NISTmAb. The details are described in our previous paper.[5] Among them, the two protocols of the non-reduced and partially reduced samples prepared under these general experimental conditions (pH = 8) were used in this study, therefore, disulfide bond rearrangements could occur. To prevent these shuffling reactions, two alternative methods are used for sample preparation at pH 7. All methods used in analyzing disulfides are described below.

General Experimental Conditions (pH = 8)

NISTmAb (500 μg) was denatured at room temperature for 10 min using guanidine hydrochloride (6 M) in 50 μL of 100 mmol/L Tris HCI (Tris(hydroxymethyl)aminomethane hydrochloride) buffer (pH = 8). Reduction and alkylation were varied as follows: (a) the sample was neither reduced nor alkylated or (b) the sample was partially reduced by adding 5 μL of 200 mmol/L TCEP to the above denatured mixture and incubating at room temperature for 1 h, followed by alkylation with 10 μL of 200 mmol/L IAM at room temperature in the dark for 1 h. The alkylated protein solution was desalted with a Zeba spin column (7 K MWCO) prior to adding 10 μg of Promega trypsin. The resulting mixture was subjected to digestion with sequencing-grade trypsin at 37 °C for 0.25, 2, and 18 h.

Non-reduction and Alkylation with NEM (pH = 7)

500 μg of NISTmAb was denatured at room temperature for 10 min using 6 M guanidine in 50 μL of tris buffer (pH = 7). The alkylation reagent NEM was used because of its thiol specificity and stability at pH 7, contrasting to IAM with optimal reaction conditions at pH 8, which most likely results in disulfide shuffling.[8] 10 μL of 200 mmol/L NEM was added to the above denatured mixture and incubating at room temperature in the dark for 1 h. The alkylated protein solution was desalted as described above. Trypsin (10 μg) was added to digest the solution at 37 °C for 0.25, 2, and 18 h. The reaction was quenched with 5 μL of formic acid (50% v/v).

Reduction with TCEP and Alkylation with NEM (pH = 7)

500 μg of NISTmAb was denatured using 6 M of guanidine; reduction was performed by adding 5 μL of 200 mmol/L TCEP to the above denatured mixture and incubating at room temperature for 1 h, followed by alkylation with 20 μL of 200 mmol/L NEM at room temperature in the dark for 1 h. Then, the alkylated protein solution was desalted as described above. Trypsin (10 μg) was added to digest the protein solution at 37 °C for 0.25, 2, and 18 h. The reaction was quenched with 5 μL of formic acid (50% v/v).

LC–MS/MS Analysis

The above digests (0.2 μg) were analyzed on a Dionex Ultimate 3000 RSLC Nano LC with an Acclaim pepmap100 column with a nanospray source connected to one of two mass spectrometers: a Q Exactive Hybrid Quadrupole-Orbitrap mass spectrometer or an Orbitrap Fusion Lumos mass spectrometer (Thermo Fisher Scientific, Waltham, MA) in the positive ion mode. Mobile phase A consisted of 0.1% formic acid in water, and mobile phase B consisted of 0.1% formic acid in ACN. The peptides were eluted by increasing mobile phase B from 1% to 90% over 120 min. Data was collected using a data-dependent mode with a dynamic exclusion of 20 s. The top 10 most abundant precursor ions were selected from a 250 m/z to 1850 m/z full scan for HCD and ion trap fragmentation (FT-CID) with normalized collisional energy (NCE) parameters (NCE of 16, 20, 24, 32, and 36). The resolution of full MS scan and MS/MS scan was set at 120,000 and 30,000 on the Fusion Lumos and at 70,000 and 17,500 on the Q Exactive, respectively.

Data Analysis

MS1 analysis of All Detectable Ion Clusters in a LC–MS Run

All detectable MS1 ions in a LC–MS run were analyzed with an in-house program, NIST ProMS.[25] In a LC–MS peptide mapping analysis, each peptide ion can be detected via a set of MS1 (LC–MS) peaks with specific isotopes, abundance, and retention times (RT). This set of peaks is denoted as an ion cluster by ProMS. ProMS identifies isotopes of each ion cluster to determine its charge state, retention time, abundance, and monoisotopic m/z in all LC–MS/MS runs. The abundance of an ion cluster is calculated using the peak area of all observable isotope peaks. The RT of an ion cluster refers to a time when the accumulated abundance is the half of the total abundances. Additionally, the RT range of a peptide is defined as the first and last eluting values of the peptide.

Estimation of the Median RT Range

The RPLC column can generate consistent peptide RT under same experimental conditions, with small variations from run to run. In this study, the median value of RT range for specific experiments is used in a discriminant filter to remove unrelated identifications. The first step is to extract the RT range of the most abundant ion of each peptide; then the median RT range is calculated from replicate runs under identical experiment conditions.

Disulfide Peptide Analysis

The IgG1 disulfide bond structure (see Figure S1) is highly conserved with a simple disulfide connectivity pattern because most disulfide bonds are constrained within an individual heavy/light domain. In all but one case, component peptides can be found to be connected by a single native disulfide bond. The exception involves the identical peptides from two heavy chains linking to each other in the hinge region by two disulfide bonds (HC229-HC229 and HC232-HC232). More complex disulfide connectivity patterns involving multiple, intertwined disulfide bonds and the proximity of Cys residues in a peptide sequence require manual investigation or alternative MS fragmentation techniques. These are outside the scope of this paper.

Spectra of Disulfide-Linked Peptides

Each disulfide-bonded peptide in this study is composed of two component peptides: peptide 1 (P1) and peptide 2 (P2). In this work, when they are in separate polypeptide chains (heavy, H, or light, L), the one from the heavy chain is shown first. When in the same chain, they are shown in order of sequence number. The example given below is a peptide connected by two consecutive Cys residues in the constant region of the L chain. It is represented in two ways in this paper: (2) SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSSPVTK. For assigning product ions of disulfide-bonded peptides, each spectrum can be considered as a mixture produced from P1 modified by P2 at the Cys residue and P2 modified by P1 at the Cys residue. For example MS/MS spectrum = b and y ion series from “SGTASVVC(P2-2H)LLNNFYPR” + b(2) and y(2) ion series from “VYAC(P1-2H)EVTHQGLSSPVTK” As shown in the expression, in order to distinguish fragments from different components in disulfide-containing spectra, ions from P2 are labeled by “(2)” following ion series of a, b, or y.

Classification of Product Ions

Most of the product ions of disulfide bond fragmentation observed in this work fall into three main groups, as shown in Table .
Table 1

Classification of the Three Major Groups of Product Ions Observed in the Spectra of Disulfide Peptide Discussed in This Section, SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSSPVTKa

Symbols |, -H, and +S, -2H and +0 denote cleavage of amide, C–S, and S–S bonds, respectively.

Symbols |, -H, and +S, -2H and +0 denote cleavage of amide, C–S, and S–S bonds, respectively. Group 1: Fragments with an intact disulfide bond. They are further divided into three subgroups. Ions resulting from single peptide backbone cleavage in P1 or P2 are labeled using the standard notation of a/a(2), b/b(2), and y/y(2) ion series, shown in Subgroup 1a of Table . Ions arising from double cleavage in P1 or P2 are denoted as internal ions, such as “intVCLL” shown in Subgroup 1b. If a fragment sequence contains more than four residues, the position range is used, such as int7_12. When ions are composed of two fragments (P1 and P2 fragments), they are denoted by both product ions, such as “[y10:b(2)5]” in Subgroup 1c. Group 2: Specific fragment ions involving a cleavage of disulfide bonds, i.e., carbon–sulfur (C–S) and sulfur–sulfur (S–S). C–S bond cleavage yields fragments (Subgroup 2a) with a modification of persulfide (+S, 31.9721 Da) or dehydroalanine (-H2S, −33.9877 Da) at Cys, while S–S bond cleavage generates fragments (Subgroup 2b) with a modification of Cys (+0 Da) and Cys thioaldehyde (-2H, −2.0156 Da). Group 3: Ions are due to single or double peptide backbone cleavages but do not contain a disulfide bond, so a conventional annotation is used (Subgroups 3a and 3b).

Product Ion Assignment

A software tool was developed for matching experimental MS/MS fragment ions from HCD and FT-CID spectra to the in silico fragmentation of the component peptides as well as disulfide bonds. First, an analysis of fragment isotopic distributions was performed, which allows the accurate mass determination of product ion monoisotopic peaks in MS/MS spectra. The deviation threshold to accept an isotopic assignment is 20 ppm. For annotating fragments, the program makes assignments in the order of the occurrence frequency of fragment ions: (1) neutral loss of precursor ions; (2) amide bond cleavage ion series with and without a disulfide bond and their common neutral losses (H2O, NH3, NH3H2O, or 2H2O); (3) disulfide cleaved at C–S and S–S bonds; (4) double cleavage with and without a disulfide bond; and (5) neutral losses (C2H4O and C2H6O2) from peptide bond cleavage ion series. All fragments that have reasonable isotope patterns are analyzed, and those with deviation within 20 ppm between observed and theoretical m/z values are annotated.

Results

Identification of Disulfide Bonds

Data Analysis Strategy

Disulfide-linked peptides cannot be identified by most current database searching tools since they assume linear peptides. In recent disulfide studies,[7−9] the anticipated polypeptides containing disulfide linkages are often used for initial searching of MS2 spectra acquired from non-reduced or partially reduced digests, and the spectra matched by theoretical mass are then confirmed by product ion assignment. However, because of the complexity of disulfide bond fragmentation, many of such peptides may be undetected or misidentified. To address this issue, we used a previously developed alternative workflow for characterizing hard-to-identify peptides and modifications derived from the NISTmAb in an LC–MS/MS analysis.[5] Our method starts with searching for MS1 ion clusters to find all disulfide-containing candidates of interest based on the protein sequence and then assigns their product ion spectra to the in silico fragmentation of specific disulfide-linked peptides for validating identification results. Figure is a flowchart of the process of identifying spectra of disulfide-linked peptides, which are subsequently used for creating a spectral library. First, theoretical m/z values are computed for all potential disulfide-linked peptides derived from known native and potentially scrambled disulfide bond linkages based on protein sequence (step 1). Second, all detectable MS1 peaks are analyzed to determine their monoisotopic m/z, charge, abundance, and RT of each ion cluster (step 2). In the third step, the initial identification is conducted by matching observed m/z values derived from step 2 with the precursor m/z values in step 1. The median RT of these matched ions in the same digestion conditions is calculated and used to filter identifications of ions with inconsistent RT. After disulfide-linked peptides are tentatively identified, step 4 extracts MS2 spectra and conducts fragment ion analysis by interpreting their product ion spectra. In step 5, replicate spectra are selected based on product ion assignments. The selected spectrum should include at least 60% of peak abundances that can be assigned to fragments of disulfide-specific structure and peptide sequence. Lastly, all validated replicate spectra are merged to form consensus spectra for inclusion in a spectral library (step 6).
Figure 1

Building library of reference mass spectra for disulfide analysis.

Building library of reference mass spectra for disulfide analysis.

Identification of Native (Expected) NISTmAb Disulfide Linkages

Initially, we tested the above workflow with all MS and MS/MS data obtained by a wide range of common digestion protocols under alkaline pH conditions (see Table S1) that were used in developing NISTmAb spectral libraries.[5,6] We then compared different digestion profiles using both the number and ion intensities of confident identifications of all peptide classes including native disulfide-bonded peptides (see Figure S2A,B). Both non-reduction and partial reduction protocols were selected as the preferred digestion conditions under which the identification of relatively higher abundances and large numbers of disulfide-containing peptides were achieved. For developing spectral libraries, the experiments were designed to generate the widest possible variety of peptides of analytical concern. No attempts were made to optimize digestion; instead, we employed distinct but complementary protocols to achieve a broad coverage of peptides and modifications. Considering that disulfide bond shuffling can be easily induced under these common alkaline pH conditions, non-reducing and partially reducing experiments under neutral pH for protein digestions were performed in order to control shuffling induced by LC–MS analysis, as described in the Materials and Methods section. The NISTmAb, a typical IgG1 protein, is composed of four polypeptide chains including two identical heavy (H) and two identical light (L) chains (Figure S1). Each H chain has a variable domain (VH) and three constant domains (CH1, CH2, and CH3), while each L chain has a variable domain (VL) and a constant domain (CL). In total, the protein has 12 individual structural domains. Each domain contains a single disulfide bond or an intrachain disulfide bond. The two H chains are linked to each other in the hinge region (Hinge) by two interchain disulfide bonds, and each H chain is connected to an L chain by an interchain disulfide bond (H–L). Thus, the NISTmAb has a total of 12 intrachain disulfide bonds and four interchain disulfide bonds. In this peptide mapping study, there are nine distinct native disulfide bonds (Table ) that needed verification for the NISTmAb due to the symmetrical molecular structure formed by these antibodies.
Table 2

Disulfide-Linked Peptides Detected for Each of the Nine Native Disulfide Linkages Obtained from the Triplicate LC–MS/MS Analyses of the 18 h NISTmAb Digest Partially Reduced with TCEP and Alkylated with IAMg,h

bond nameposition in proteinclassmissed cleaved sitespeptide sequencemasschargeabsolute abund.
1a VHHC22-HC97tryptic 25%0eESGPALVKPTQTLTLTCTFSGFSLSTAGMSVGWIR_SS_VTNMDPADTATYYCAR5431.5874,5,63.6 × 1010
missed cleav.1eESGPALVKPTQTLTLTCTFSGFSLSTAGMSVGWIR_SS_VTNMDPADTATYYCARD8213.8825,6,79.3 × 1010
MIFNFYFDVWGQGTTVTVSSASTK
2ESGPALVKPTQTLTLTCTFSGFSLSTAGMSVGWIR_SS_VTNMDPADTATYYCARDMI9381.5116,78.6 × 1009
FNFYFDVWGQGTTVTVSSASTKGPSVFPLAPSSK
2ESGPALVKPTQTLTLTCTFSGFSLSTAGMSVGWIRQPPGK_SS_VTNMDPADTATYYC8721.1635,6,7,83.2 × 1009
ARDMIFNFYFDVWGQGTTVTVSSASTK
3ESGPALVKPTQTLTLTCTFSGFSLSTAGMSVGWIR_SS_DTSKNQVVLKVTNMDPADT9326.5016,7,82.3 × 1009
ATYYCARDMIFNFYFDVWGQGTTVTVSSASTK
3ESGPALVKPTQTLTLTCTFSGFSLSTAGMSVGWIRQPPGK_SS_VTNMDPADTATYYC9888.7927,88.3 × 1008
ARDMIFNFYFDVWGQGTTVTVSSASTKGPSVFPLAPSSK
semi- tryptic0ESGPALVKPTQTLTLTCTFSGF_SS_VTNMDPADTATYYCAR4085.9064,52.4 × 1009
0ESGPALVKPTQTLTLTCTFSGFSL_SS_VTNMDPADTATYYCAR4286.02233.4 × 1008
        
2a CH1HC147-HC203tryptic 94%0eSTSGGTAALGCLVK_SS_DYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSSV7916.9195,6,7,81.8 × 1011
VTVPSSSLGTQTYICNVNHKPSNTK
missed cleav.1fSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSSVVTVPSSS7898.9095,64.4 × 1009
LGTQTYICNVNHKPSNTK
1STSGGTAALGCLVK_SS_DYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSSVVTV8259.116,74.8 × 1008
PSSSLGTQTYICNVNHKPSNTKVDK
semi- tryptic0STSGGTAALGCLVK_SS_SWNSGALTSGVHTFPAVLQSSGLYSLSSVVTVPSSSLGTQT6869.4285,6,74.5 × 1009
YICNVNHKPSNTK
0STSGGTAALGCLVK_SS_TVSWNSGALTSGVHTFPAVLQSSGLYSLSSVVTVPSSSLGTQ7069.54461.8 × 1009
TYICNVNHKPSNTK
0STSGGTAALGCLVK_SS_SLSSVVTVPSSSLGTQTYICNVNHKPSNTK4409.2215,63.3 × 1008
0STSGGTAALGCLVK_SS_SVVTVPSSSLGTQTYICNVNHKPSNTK4122.0734,5,62.2 × 1008
0STSGGTAALGCLVK_SS_SSVVTVPSSSLGTQTYICNVNHKPSNTK4209.1054,51.4 × 1008
0STSGGTAALGCLVK_SS_TVPSSSLGTQTYICNVNHKPSNTK3836.9044,59.3 × 1007
        
3a CH2HC264-HC324tryptic 96%0eTPEVTCVVVDVSHEDPEVK_SS_CK2328.0982,3,4,52.5 × 1011
missed cleav.2TPEVTCVVVDVSHEDPEVK_SS_VVSVLTVLHQDWLNGKEYKCK4537.2874,5,6,76.6 × 1009
3TPEVTCVVVDVSHEDPEVK_SS_VVSVLTVLHQDWLNGKEYKCKVSNK4965.5265,61.5 × 1009
1TPEVTCVVVDVSHEDPEVK_SS_EYKCK2748.2994,59.4 × 1008
1TPEVTCVVVDVSHEDPEVK_SS_CKVSNK2756.3364,52.8 × 1008
semi- tryptic0TPEVTCVVVDVSHED_SS_CK1874.83945.1 × 1008
0TPEVTCVVVDVSH_SS_CK1630.7724.7 × 1008
0TPEVTCVVVD_SS_CK1307.612,33.4 × 1008
        
4a CH3HC370-HC428tryptic 99%0eNQVSLTCLVK_SS_WQQGNVFSCSVMHEALHNHYTQK3844.8243,4,5,6,79.9 × 1010
semi- tryptic0NQVSLTCLVK_SS_VFSCSVMHEALHNHYTQK3231.5634,57.6 × 1007
        
5b VLLC23- LC87tryptic 92%0VTITCSASSR_SS_FSGSGSGTEFTLTISSLQPDDFATYYCFQGSGYPFTFGGGTK5446.464,51.4 × 1010
missed cleav.1DIQMTQSPSTLSASVGDRVTITCSASSR_SS_FSGSGSGTEFTLTISSLQPDDFATYYCFQ7320.34451.0 × 1009
GSGYPFTFGGGTK
2VTITCSASSR_SS_FSGSGSGTEFTLTISSLQPDDFATYYCFQGSGYPFTFGGGTKVEIKR6071.8515,62.0 × 1008
        
6b CLLC133-LC193tryptic 66%0SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSSPVTK3555.7493,4,51.2 × 1011
missed cleav.1SGTASVVCLLNNFYPR_SS_HKVYACEVTHQGLSSPVTK3820.9033,4,5,6,76.2 × 1010
2SGTASVVCLLNNFYPR_SS_ADYEKHKVYACEVTHQGLSSPVTK4427.1685,66.0 × 1008
semi- tryptic0SGTASVVCLLNNFYPR_SS_VYACEVTH2658.25744.5 × 1007
        
7c H-LHC223-LC213missed cleav.1SCDKTHTC(CAM)PPC(CAM)PAPELLGGPSVFLFPPKPK_SS_GEC3467.6393,4,5,6,72.6 × 1010
2SCDKTHTC(CAM)PPC(CAM)PAPELLGGPSVFLFPPKPK_SS_SFNRGEC3971.8834,5,6,73.3 × 1009
3SCDKTHTC(CAM)PPC(CAM)PAPELLGGPSVFLFPPKPKDTLMISR_SS_SFNRGEC4788.2994,5,62.7 × 1009
1SCDK_SS_SFNRGEC1260.4862,3,47.8 × 1008
        
8 and 9d HingeHC229-HC229 and HC232-HC232tryptic 82%0THTCPPCPAPELLGGPSVFLFPPKPK_2(SS)_THTCPPCPAPELLGGPSVFLFPPKPK5456.7994,5,6,7,86.7 × 1010
missed cleav.1THTCPPCPAPELLGGPSVFLFPPKPK_2(SS)_SC(CAM)DKTHTCPPCPAPELLGGPSVFLFPPKPK5889.9625,6,7,8,91.2 × 1010
2SC(CAM)DKTHTCPPCPAPELLGGPSVFLFPPKPK_2(SS)_SC(CAM)DKTHTCPPCPAPELLGGPSVFLFPPKPK6323.1254,5,6,7,8,92.2 × 1009
semi- tryptic0THTCPPCPAPELLGGPSVFLFPPKPK_2(SS)_THTCPPCPAPELLGGPSVF4649.29855.4 × 1008

VH, CH1, CH2, and CH3 refer to the variable and constant domains 1 to 3 in H chain, respectively.

VL and CL represent the variable and constant domains of the L chain.

H–L is the interchain bond linking the H and L chains.

Hinge region contains the two disulfide bonds connecting the two H chains.

Oxidized and/or metallated forms were also detected.

Intrapeptide form.

Notes: Bond: disulfide bond named by domains. Position: Cys position in each protein chain. Class: peptide class. Miss cleav.: total number of missed cleavage sites. Peptide sequence: sequence of two component peptides linked by a disulfide bond. Mass: mass of disulfide-linked peptide. z: charge state of disulfide-linked peptide. Absolute abund.: summed absolute abundance over all charge states of a peptide.

In Column 3, the fully tryptic peptide is shown in bold, and the relative abundance is calculated as the percentage of the tryptic peptide abundance over the total peptide abundance of the linkage site. Note that several peptides containing disulfide bonds 7, 8, and 9 were identified with carbamidomethylated (CAM) cysteine residues.

VH, CH1, CH2, and CH3 refer to the variable and constant domains 1 to 3 in H chain, respectively. VL and CL represent the variable and constant domains of the L chain. H–L is the interchain bond linking the H and L chains. Hinge region contains the two disulfide bonds connecting the two H chains. Oxidized and/or metallated forms were also detected. Intrapeptide form. Notes: Bond: disulfide bond named by domains. Position: Cys position in each protein chain. Class: peptide class. Miss cleav.: total number of missed cleavage sites. Peptide sequence: sequence of two component peptides linked by a disulfide bond. Mass: mass of disulfide-linked peptide. z: charge state of disulfide-linked peptide. Absolute abund.: summed absolute abundance over all charge states of a peptide. In Column 3, the fully tryptic peptide is shown in bold, and the relative abundance is calculated as the percentage of the tryptic peptide abundance over the total peptide abundance of the linkage site. Note that several peptides containing disulfide bonds 7, 8, and 9 were identified with carbamidomethylated (CAM) cysteine residues. Disulfide analyses were done by identifying all potential disulfide-linked peptide classes from the tryptic digests of the NISTmAb, including fully tryptic cleavage, expected missed cleavages, irregular cleavage, ESI-induced adducts, and other modifications. Most disulfide-containing peptides are connected by a single disulfide bond. The exception is the peptides in the hinge region that contain two disulfide bonds located in the proximity of each other to connect two H chains, as described in the Materials and Methods section. Table shows the nine distinct native disulfide bonds mapped in the NISTmAb through the identification of the disulfide-linked peptides corresponding to each linkage. The table presents MS1 peptide data acquired from the triplicate LC–MS analyses of the 18 h solution tryptic digest of the NISTmAb partially reduced with TCEP and alkylated with IAM at pH 8. These identified disulfide-linked peptides range in mass from 1260 to 9888 Da with charge states from +2 to +9. A large majority of fully tryptic and miscleaved disulfide peptides are detected with multiple charges. Furthermore, more highly charged ions of tryptic peptides with missed cleavage are found, providing complementary information for the linkage analysis. Disulfide peptides containing multiple Met/Trp residues are detected in the composite spectrum generated from multiple mono-/dioxidized products. Disulfide peptides originated from irregular cleavage (semitryptic) were identified. Metallated ions formed by abundant tryptic disulfide peptides through electrospray ionization (ESI) were also found and verified by their precursor mass and coelution. The identification of these disulfide peptide variants facilitates reliable disulfide bond analysis and provides additional fragmentation information as shown in the Fragmentation section. Table summarizes the LC–MS analyses of disulfide-linked peptides for each linkage and provides the name of disulfide bonds, the linkage location in the protein, class, the number of missed-cleaved sites, sequence of peptides, mass, observed charge states, and absolute ion signal intensities.

Major Disulfide Tryptic Products

Disulfide-bonded peptide pairs are classified as disulfide tryptic peptides when all termini are consistent with simple tryptic cleavage. Other peptide varieties such as oxidized and metallated disulfide peptides can originate from modification to these peptides. Table shows that major disulfide tryptic peptides of seven (bonds 2 to 6 and 8 to 9) out of nine linkages produced 66 to 99% of the total disulfide peptide abundance (column 3 in bold of Table ). In contrast, the percent abundance of tryptic peptides of bond 1 was low (25%) and tryptic peptides of bond 7 was not detectable, indicating the need for alternative peptide classes (e.g., missed cleavage and/or irregular cleavage) for a complete analysis of disulfides in antibodies.

Missed Cleavage Products

Disulfide bonds generally hinder proteolytic digestion by common enzymes;[22] thus, peptides with up to four missed-cleavage sites were sought. This led to the identification of 152 disulfide-bonded peptides with missed-cleavage sites, which accounted for 40% of the total identified peptides. As seen from Table , the number of missed-cleavage sites are shown in Column 4 and they were observed in all disulfide bonds except the bond CH3. Varying levels of tryptic peptides with missed cleavage are identifiable with multiple charge states. The detection of such peptides is an important supplementary disulfide determination, especially when their tryptic counterparts are too small to be detected (H–L, bond 7) or low abundance species (VH, bond 1). Four alternative disulfide peptides containing one to three missed cleavages were observed for the bond H–L at the multiple charge states shown in Table , ensuring confidence in the assignment of this disulfide linkage. Furthermore, the presence of reduced and alkylated Cys residues with the disulfide-bonded peptides (Bonds H–L and Hinge) can increase their ion signals, and thus their abundances presented in Table are significant even with overnight digestion. Missed-cleavage products produced additional fragmentation patterns and their spectra are part of the spectral library.

Irregular Cleavage Products

Irregular cleavage products, involving a single semitryptic cleavage, are sought during the MS1 ion cluster search. In total, 27 such peptides, arising from major tryptic peptide cleavage, were identified, with most of them generated from protein digestion, whereas a few were produced from electrospray ionization (ESI) in-source fragmentation. Although these are minor products resulting from larger disulfide species, they can provide, as short peptides, useful information to aid in fragmentation analysis. For example, the semitryptic peptide ions 5 to 7 in Table are selected for characterizing disulfide bond fragmentation because their MS/MS spectral data contain useful structural information.
Table 4

Selected Disulfide-Linked Peptide Ions with Varied Amino Acid Compositions and Charge States that Were Studied for Fragmentation Patterns under HCD and FT-CID

# bondbond nameCys position in proteindisulfide-linked peptide sequencepeptide class# ionchargeprecursor m/zreplicate spectra
2CH1147–203STSGGTAALGCLVK_SS_DYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYStryptic161320.49442
LSSVVTVPSSSLGTQTYICNVNHKPSNTK271131.996157
3CH2264–324TPEVTCVVVDVSHEDPEVK_SS_CKtryptic33777.04424
44583.032955
*EVTCVVVDVSHEDPEVK_SS_CKsemi-tryptic53711.00618
*TPEVTCVVVDVSHED_SS_CK63625.85420
74469.71714
4CH3370–428NQVSLTCLVK_SS_WQQGNVFSCSVMHEALHNHYTQKtryptic84962.213149
95769.972582
106641.81141
117550.26829
6CL133–193SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSSPVTKtryptic1231186.25717
134889.945277
145712.157475
7H-L223–213SCDK_SS_SFNRGECmissed cleavage153421.16930

Notes: The symbol * indicates semitryptic peptides. # bond and bond name: sequential number and name of bonds given by this work. Cys position in protein: Cys residue in protein sequence. Peptide class: tryptic digest products. # ion: sequential number of selected ions. Charge: charge states. Precursor m/z: the mass/charge for peptide ions in MS1. Replicate spectra: repeat spectra identifications.

Complex Met/Trp Oxidation Mixtures of Disulfide Species

Several Met/Trp (methionine/tryptophan) oxidation products are observed with two disulfide linkages of VH and CH3 (bonds 1 and 4, Table ) where multiple Met/Trp are present, in the partial reducing experiments. They are likely generated from two different sources: in-sample and in-source, which is consistent with previous observations.[5,26] Oxidized species with more than one Met/Trp residue often produce a mixture of disulfide-linked peptide spectra. Structural differences of the oxidation mixtures are due to position variations of Met or Trp or both in the sequence, making them difficult to recognize using conventional software tools. Selected ion chromatograms of oxidized disulfide-bonded peptides containing methionine/tryptophan are shown in Figure S3. Their MS2 mixture spectra are manually validated and collected in the library.

Comparison of Tryptic Digestion Profiles of Partially/Non-reduced Native Disulfide Bonds at pH 8

We conducted a comparative evaluation of the above two protocols to estimate their digestion efficiency for disulfide mapping using MS1 ion signal intensities of the unique identifications of disulfide-containing peptides. The digestion of partially reduced and alkylated NISTmAb produces three groups of Cys-containing peptides: major alkylated peptides, relatively abundant disulfide-linked peptides, and minor reduced peptides, whereas the non-reduced and non-alkylated digest almost exclusively yields peptides containing disulfide bonds. This comparison was conducted using disulfide-linked peptides. The summed abundances for each linkage site displayed in the bar graph from Figure A,B were obtained from two LC–MS analyses at pH 8 for partially reduced and non-reduced proteins digested for 0.25, 2, or 18 h. The site-specific quantitation combined the ion intensities of multiply charged disulfide tryptic peptides and peptides with miscleavages at each disulfide bond. It is well-known that LC–MS abundance measurement of low-intensity ions, such as disulfide-linked ions, is very challenging given the intrinsic bias against low abundance peptides, their lower ionization efficiency in LC – ESI – MS, and possible run-to-run variation in ion suppression and others. We used the site-specific quantitation as a measure of the relative detectability of the nine native disulfide bonds by the LC–MS analyses with the selected two protocols. At short digestion times (0.25 and 2 h), mostly low-abundance species were observed and no disulfide-containing peptides were detected from the L chain variable regions as well as from the hinge region. At 18 h, most of the major disulfide-linked peptides were observed with significantly higher signal intensities shown in Figure A but not in non-reduced datasets (Figure B, except the CH2 linkage), which may suggest that a longer time is needed to digest non-reduced proteins.
Figure 2

Summed abundances for each of the nine native disulfide bonds in the NISTmAb obtained by median results of triplicate analyses from three separate tryptic digests at 0.25, 2, and 18 h using partially reducing and non-reducing conditions. (A) Partially reduced peptide mapping using low concentration of TCEP at pH 8; (B) non-reduced peptide mapping at pH 8.

Summed abundances for each of the nine native disulfide bonds in the NISTmAb obtained by median results of triplicate analyses from three separate tryptic digests at 0.25, 2, and 18 h using partially reducing and non-reducing conditions. (A) Partially reduced peptide mapping using low concentration of TCEP at pH 8; (B) non-reduced peptide mapping at pH 8. In the digestion time course experiments for 0.25, 2, and 18 h at alkaline pH, a total of 129 native disulfide-bonded peptides were identified by the partial reduction condition with the low concentration of TCEP, whereas 85 of such peptides were detected using non-reducing and non-alkylating conditions. The spectra of these identifications are included in the spectral library. A major advantage of the pH 8 partial reduction approach is that partially reduced disulfides and alkylated cysteines greatly improve protein digestion efficiency for linear and alkylated peptides[5] as well as for disulfide-containing peptides. This also allows the identification of disulfide-bonded peptides along with any non-disulfide-containing peptides in the same experiment.

Detection of Scrambled (Unexpected) Disulfide Bonds at pH 8 in the LC–MS/MS Analysis

After the native NISTmAb disulfide linkages were verified using the non–/partially reducing LC–MS/MS analyses, we applied the same workflow to find disulfides with unexpected linkages. All 136 possible linkages that can be formed from the 16 Cys residues were sought. Only unmodified disulfide peptides are sought. The results reveal that these reactions readily occur at nearly all Cys residues under alkaline conditions (pH 8), with the exception of LC87. In addition to the nine expected native disulfide linkages, 86 non-native disulfide linkages were observed from 109 disulfide-scrambled tryptic peptides. Figure displays the level of scrambling occurring in the 18 h tryptic digest at pH 8 for every Cys residue of the NISTmAb as the percent abundance of the total intensities of all disulfide bonds from each Cys including native and scrambled. In the partial reduction of the protein digest, 8 out of 16 Cys residues exhibit over 15% disulfide scrambling. In particular, HC203, LC193, and LC213 comprise 33, 31, and 72% of all disulfide abundances, respectively. The non-reduction condition generated much less scrambling as compared to the partial reduction, although HC223 and LC213 show higher level of scrambling at 42 and 68%, respectively. It is evident that when using pH 8 for protein digestion, whether non-reduced or partially reduced samples, wide spread disulfide rearrangement could occur. LC213 was detected as a hotspot for both conditions.
Figure 3

Levels of scrambling occurring at pH 8 for each of Cys sites are observed from non-reduction and partial reduction experiments. Partial Red.: reduction by TCEP and alkylation by IAM. Non-Red.: no reduction and alkylation for antibody digestion. Cysteines are labeled by their residue number in L or H chain, e.g., HC264 denotes Cys at residue 264 of the H chain.

Levels of scrambling occurring at pH 8 for each of Cys sites are observed from non-reduction and partial reduction experiments. Partial Red.: reduction by TCEP and alkylation by IAM. Non-Red.: no reduction and alkylation for antibody digestion. Cysteines are labeled by their residue number in L or H chain, e.g., HC264 denotes Cys at residue 264 of the H chain. To minimize disulfide bond rearrangement at basic pH, two sets of control experiments (non-reduction and partial reduction) were performed by adjusting the buffer pH to 7 and using NEM as the alkylating reagent due to its thiol specificity and stability at or below pH 7, as described in the Experimental Section. Over 90% of the disulfide bond artifacts detected at pH 8 disappeared or were only found at the MS1 trace levels when neutral pH and NEM were used. Seven peptides with scrambled disulfides identified from MS/MS spectra are shown in Table S2A. Only four low-abundance non-native disulfide bonds were detected in the non-reduced digest, and just one was detected in the partial reduction digest. Seven Cys residues (HC22, HC147, HC264, HC324, HC370, LC23, and LC213) are involved in the formation of scrambled disulfide bonds detected at trace levels. This result illustrates that sample preparation using neutral or lower pH significantly decreases shuffling. To gain a better understanding of the pH 7 data, the identification of native disulfide-containing peptides at pH 7 is also compared to that at pH 8. Figure shows the number of disulfide-bonded peptides detected in each of four triplicate analyses of the 18 h tryptic digest under non-reduction and partial-reduction at pH 7 and pH 8, respectively. The notable variation in the number of identified ions is observed at all linkages by different digestion protocols, although more agreeable results are seen at four linkages (CH1, CH2, CH3, and CL). In all cases, the partial reduction at pH 8 yields more disulfide-containing peptides than others. At pH 8 conditions, a total of 114 different ions containing native disulfide bonds were identified, while 55 of such ions were identified under pH 7 conditions (see Table S2B). It is evident that at pH 7, each linkage yields fewer ions than those at pH 8; specifically, no detectable ions are found at two out of the nine disulfide linkages. These results suggest that pH is a critical parameter for effective disulfide analysis, and the selection of the preferable digestion conditions should consider the detectability of both native and scrambled disulfide bonded peptides.
Figure 4

Peptide identifications containing native disulfide bonds obtained from four triplicate analyses of the 18 h tryptic digest using different parameters for four protocol variations: non-reducing and partially reducing conditions at pH 7 and pH 8, respectively.

Peptide identifications containing native disulfide bonds obtained from four triplicate analyses of the 18 h tryptic digest using different parameters for four protocol variations: non-reducing and partially reducing conditions at pH 7 and pH 8, respectively.

Fragmentation Analysis of Disulfide-Linked Peptides

A central objective of this work was to completely map all the disulfide linkages in the NISTmAb by identifying as many disulfide-bonded peptides and their product ions as possible. This was done in the following manner: Manually annotate MS/MS spectra of disulfide-linked peptides. Develop an identification tool for automatic fragment assignment using information from the previous step. Apply this tool to all spectra containing disulfide bonds that were identified by MS1 ion analysis. Develop an evaluation tool for assessing the thoroughness and accuracy of annotations by examining the mass accuracy and abundance ratio of assigned/unassigned product ions. If step 4 finds incorrect or missed annotations, go back to step 1 to obtain new knowledge of unknown fragments, and then work through steps 1 to 5 until no more significant ions are unassigned.

Overview of Different Types of Fragments Observed

In this section, we introduce different types of product ions containing a disulfide bond with an example spectrum. Then, we describe the general fragmentation features of HCD and CID spectra obtained in FT MS/MS analysis of disulfide-bonded peptides under low collisional energy. Figure shows an annotated MS/MS spectrum of a peptide containing an intrachain disulfide bond. A total of 100 unique product ions were assigned, identified with relative abundance greater than 1% of the base peak, which constituted 92% of the total ion intensity. Table summarizes the 55 fragments having a relative abundance above 5% with a mass accuracy within 5 ppm. These fragments are organized into three groups (see column 1 of Table and Table in the Materials and Methods section for the detailed notation).
Figure 5

Different types of product ions in the MS/MS spectrum of a triply charged disulfide-linked peptide, SGTASVVCLLNNFYPR_SS_YACEVTHQGLSSPVTK, at m/z 1186.257 with less than 5 ppm deviation from its theoretical value, from FT-CID fragmentation at NCE of 35%. (A) Sequencing disulfide linkage and (B) fully annotated MS/MS spectrum. The * symbol in green color represents fragments from amide bond cleavages and contains the intact disulfide bond. The annotation in red color denotes fragments from C–S bond cleavage.

Table 3

Fragment Ion Annotation in an FT-CID MS/MS Spectrum of the Charge 3+ Disulfide-Linked Peptide, SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSSPVTKa

classgroupcleavage sitem/zmass error in ppmassignment of fragmentchargesequencepercent abundance
intact disulfide bond fragment1asingle C-N cleavage in either peptide985.82231.1y93CLLNNFYPR_SS_VYACEVTHQGLSSPVTK5.5
1075.2041.9y12-NH33SVVCLLNNFYPR_SS_VYACEVTHQGLSSPVTK18.5
1080.8791.4y123SVVCLLNNFYPR_SS_VYACEVTHQGLSSPVTK25.4
1131.54912.2b(2)16-H2O3SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSSPVT10.2
1308.64211b9-H2O2SGTASVVCL_SS_VYACEVTHQGLSSPVTK6.4
1312.12651b(2)8-H2O2SGTASVVCLLNNFYPR_SS_VYACEVTH19.5
1321.13090.3b(2)82SGTASVVCLLNNFYPR_SS_ YACEVTH32.8
1385.15950.2b(2)92SGTASVVCLLNNFYPR_SS_VYACEVTHQ47
1413.68714.6b11-H5NO2SGTASVVCLLN_SS_VYACEVTHQGLSSPVTK19.9
1431.21170.4b112SGTASVVCLLN_SS_VYACEVTHQGLSSPVTK26.5
1478.2290.5y92CLLNNFYPR_SS_VYACEVTHQGLSSPVTK100
1488.23181.3b122SGTASVVCLLNN_SS_VYACEVTHQGLSSPVTK17.7
1504.72833.3b(2)12-H2O2SGTASVVCLLNNFYPR_SS_VYACEVTHQGLS30
1513.73252.6b(2)122SGTASVVCLLNNFYPR_SS_VYACEVTHQGLS27.7
1527.76451.4y102VCLLNNFYPR_SS_VYACEVTHQGLSSPVTK98.5
1539.23993.8b(2)13-2H2O2SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSS38.4
1553.25882.7b13-NH32SGTASVVCLLNNF_SS_VYACEVTHQGLSSPVTK7.2
1557.24983.3b(2)132SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSS48.7
1561.76680.7b132SGTASVVCLLNNF_SS_VYACEVTHQGLSSPVTK26
1577.30183.3y112VVCLLNNFYPR_SS_VYACEVTHQGLSSPVTK22.8
1612.30594.1y12-NH32SVVCLLNNFYPR_SS_VYACEVTHQGLSSPVTK18.7
1625.77811.8b14-H5NO2SGTASVVCLLNNFY_SS_VYACEVTHQGLSSPVTK6.9
1629.30110.6a142SGTASVVCLLNNFY_SS_VYACEVTHQGLSSPVTK6.6
1634.78650.1b14-NH32SGTASVVCLLNNFY_SS_VYACEVTHQGLSSPVTK15.6
1641.30322.8a(2)152SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSSPV18.2
1643.29980.1b142SGTASVVCLLNNFY_SS_VYACEVTHQGLSSPVTK55.1
1646.78772.6b(2)15-NH32SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSSPV30.1
1655.30221.8b(2)152SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSSPV86.6
1696.82631.5b(2)16-H2O2SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSSPVT22.1
1705.83071b(2)162SGTASVVCLLNNFYPR_SS_VYACEVTHQGLSSPVT31.8
1bdouble C-N cleavage in one peptide1533.75213.1int5-152SVVCLLNNFYP_SS_VYACEVTHQGLSSPVTK13.1
1cdouble C-N cleavage in both peptides1306.11994.1[b(2)13-y10]2VYACEVTHQGLSS_SS_VCLLNNFYPR8.1
1329.63981.2[a14:b(2)11]2SGTASVVCLLNNFY_SS_VYACEVTHQGL7
1373.1582.8[a14:b(2)12]2SGTASVVCLLNNFY_SS_VYACEVTHQGLS44.1
1404.1794.9[b(2)14:y11]2VYACEVTHQGLSSP_SS_VVCLLNNFYPR11
cleaved disulfide bond fragments2aC-S cleavage892.96141.2y(2)17-H2S2VYACEVTHQGLSSPVTK3.5
925.93943.3y(2)17 + S2VYACEVTHQGLSSPVTK11.2
1456.71364.1y12 + S1SVVCLLNNFYPR8.2
1772.84172.3y16 + S1SGTASVVCLLNNFYPR17.1
1850.88614.6y(2)17 + S1VYACEVTHQGLSSPVTK10.1
fragments without disulfide bond3asingle C-N cleavage in both peptides435.23480.6y31YPR7.5
444.28150.4y(2)41PVTK12.8
531.31350.4y(2)51SPVTK7.3
582.30212.3y41FYPR10
618.34590.3y(2)61SSPVTK8.8
696.34650.2y51NFYPR18.5
788.45031.2y(2)81GLSSPVTK21.2
810.38890.5y61NNFYPR51.8
899.48451.4y(2)9-NH31QGLSSPVTK13.5
916.5111.3y(2)91QGLSSPVTK12
923.47280.6y71LNNFYPR40.9
1036.55770.3y81LLNNFYPR29.9
1053.56771y(2)101HQGLSSPVTK23
1154.61550.8y(2)111THQGLSSPVTK44.5
1382.72313.1y(2)131EVTHQGLSSPVTK11.5

Notes: Class and group: fragment class and group. Cleav. site: peptide bond or disulfide bond. m/z: the mass/charge ratio for fragment ion. Error ppm: mass deviation between observed and theoretical m/z values. z: charge state of fragment ion. Sequence: sequence of fragment ion. Abund. % relative abundance of specific product ions with respect to the base peak in a MS2 spectrum.

Different types of product ions in the MS/MS spectrum of a triply charged disulfide-linked peptide, SGTASVVCLLNNFYPR_SS_YACEVTHQGLSSPVTK, at m/z 1186.257 with less than 5 ppm deviation from its theoretical value, from FT-CID fragmentation at NCE of 35%. (A) Sequencing disulfide linkage and (B) fully annotated MS/MS spectrum. The * symbol in green color represents fragments from amide bond cleavages and contains the intact disulfide bond. The annotation in red color denotes fragments from C–S bond cleavage. Notes: Class and group: fragment class and group. Cleav. site: peptide bond or disulfide bond. m/z: the mass/charge ratio for fragment ion. Error ppm: mass deviation between observed and theoretical m/z values. z: charge state of fragment ion. Sequence: sequence of fragment ion. Abund. % relative abundance of specific product ions with respect to the base peak in a MS2 spectrum. In this example, group 1 contains 35 peptide backbone fragments all with an intact disulfide linkage. For example, the dominant peak at m/z 1478.229 of group 1a, as shown in Figure A, corresponds to the doubly charged ion y9 of P1 linked to P2 via a disulfide bond. Aside from these ions generated by single peptide bond cleavage, more complex products arise from double cleavage such as the ions in the groups 1b and 1c. For instance, the peak at m/z 1533.752 formed by double cleavage of amide bonds was identified as the doubly charged ion involving the P1 residues from 5 to 15 linked to P2 via a disulfide bond, labeled Int5_15 in Group 1b of Figure A. Another example is the doubly charged internal ion at m/z 1373.158 consisting of the fragment a14 of P1 and the fragment b(2)12 of P2 connected by a disulfide bond (Group 1c of Figure A). Overall, group 1 comprises a majority (71%) of total product ion intensity and provides detailed peptide sequence. This example spectrum also contains five product ions arising from disulfide cleavage as shown in red color in Figure B and Group 2a of Table . They were formed through the loss of H2S (e.g., charge 2+ y(2)17-H2S) or the addition of S (e.g., the singly and doubly charged ion y(2)17 + S) in the C–S bond cleavage products. Group 3 includes 15 standard peptide fragments without a disulfide bond. Some of them are highly abundant, such as the singly charged y6 of P1 and y(2)11 of P2. This group represents common backbone fragments, useful for sequencing both P1 and P2.

Key Features of Disulfide Linkage Fragmentation Patterns

Low-energy collision dissociation of disulfide-containing peptides generates product ions, which contain intact, cleaved, and no disulfide bonds. Cleavage of an S–S bond leads to the formation of fragments containing free Cys or Cys thioaldehyde (−2 Da) while cleavage of the C–S bond yields fragments with Cys persulfide (+32 Da) or dehydroalanine (−34 Da). The dissociation of amide bonds can produce typical b/y ions, with or without intact and cleaved disulfide bonds. We conducted a comprehensive analysis of disulfide bonds under fragmentation by low energy collision dissociation for 15 multiply charged disulfide-linked peptide ions (Table ). They include ions of tryptic, semitryptic, and miscleaved peptides. Different types of fragment ions arising from the disulfide bond were examined using spectra acquired at NCE values of 16%, 20%, and 24% for HCD as well as 35% for FT-CID. The last column shows the number of replicate spectra used in the calculation of median values. Three types of fragmentation patterns were observed, (1) amide bond cleavage products containing an intact disulfide bond are predominant fragments; (2) disulfide bond cleavages leading to minor yet extensive fragmentation; and (3) double cleavage products are common and significant fragments. These key features are more fully described below. Notes: The symbol * indicates semitryptic peptides. # bond and bond name: sequential number and name of bonds given by this work. Cys position in protein: Cys residue in protein sequence. Peptide class: tryptic digest products. # ion: sequential number of selected ions. Charge: charge states. Precursor m/z: the mass/charge for peptide ions in MS1. Replicate spectra: repeat spectra identifications.

Key Feature 1: Fragments Containing an Intact Disulfide Bond Predominate under HCD and FT-CID

For the purpose of finding major fragments in disulfide spectra, we examined a variety of product ions and divided them into three categories, as shown in Figure . One involves fragments containing an intact disulfide bond, labeled DSB fragments in this analysis, the second are fragments not containing Cys, labeled non-DSB fragments, and the last set includes all unassigned peaks.
Figure 6

Comparison of three ion categories from 15 disulfide-linked peptide ions (Table ) by percent ion intensity. DSB fragments: ions linked by a disulfide bond. Non-DSB fragments: ions not containing a disulfide bond. Unassigned: unassigned ions.

Comparison of three ion categories from 15 disulfide-linked peptide ions (Table ) by percent ion intensity. DSB fragments: ions linked by a disulfide bond. Non-DSB fragments: ions not containing a disulfide bond. Unassigned: unassigned ions. DSB fragments make the largest relative contribution in the spectra for all peptides studied, with each yielding from 46 to 76% of total product ion intensity. Non-DSB fragments from peptide bond cleavage accounted for 14 to 44% of total ion intensity in the spectra, providing complementary sequencing information to DSB fragments. The unassigned peaks appeared to primarily arise from low-abundance fragments that cannot be recognized due to the lack of isotopic data.

Key Feature 2: Extensive Disulfide-Bond Cleavage Observed at Low Levels

Disulfide-bond fragmentation involving Cys side chain breakage (i.e., C–S and S–S bonds) complements disulfide connectivity information. Such product ions have been reported as rare compared to amide bond breakage via the gas phase fragmentation of disulfide-linked peptides by low-energy collision dissociation.[27,28] In contrast, we observed extensive cleavage products, although at low levels, arising from both S–S and C–S bonds, including free Cys, Cys thioaldehyde (−2 Da), Cys persulfide (+32 Da), and dehydroalanine (−34 Da). Figure shows the distribution of observed disulfide cleavage products resulting from over 3200 good-quality MS2 spectra of the five disulfide bonds studied in Table . The spectra were divided into four groups in terms of the number of disulfide cleavage products observed (e.g., peaks = 0 for no peaks from disulfide cleavage; ≤3; between 4 to 7; and >7). The percentages of spectra in each group over total replicate spectra (Column 9 of Table ) were calculated for each bond linkage. These data indicate that most of the linkage sites produced up to three fragments resulting from disulfide fragmentation, while two disulfide linkages (CH2 and H–L) yield prevalent disulfide bond cleavage. Over 93% (H–L) and 60% (CH2) of their total spectra contain 4 to 15 cleavage products arising from C–S bonds and S–S bonds. This is because the two sites are made up of smaller peptides: SCDK_SS_SFNRGEC and TPEVTCVVVDVSHEDPEVK_SS_CK and its semitryptic peptides, which are more prone to undergoing S–S and C–S bond cleavages. In our data, fragments resulting from disulfide cleavage were typically observed in low abundance; for example, the summed abundance of the C–S and S–S bond cleavage is from 1 to 5% of the total product ion intensity.
Figure 7

Distribution of disulfide-bond cleavages among five disulfide linkages obtained from 3200 high-quality MS2 spectra. Pks refers to fragment ions arising from disulfide cleavage.

Distribution of disulfide-bond cleavages among five disulfide linkages obtained from 3200 high-quality MS2 spectra. Pks refers to fragment ions arising from disulfide cleavage. We observed more ions arising from S–S bond cleavage at lower charge states, whereas more ions from C–S bond cleavage were observed at higher charge states. Figure demonstrates the charge/energy-dependent changes in the HCD fragmentation of S–S and C–S bonds in the charges 4+ and 5+ TPEVTCVVVDVSHEDPEVK_SS_CK. Their fully annotated MS/MS peak lists can be found in Table S3A,B. More intense S–S bond cleavage is evident in charge 4+ at HCD NCE = 20 in Figure A, while more C–S bond cleavage products are seen in the charge 5+ at HCD NCE = 24 in Figure B. This observation suggests that (a) the number of mobile protons that are required to induce dissociation[29] appears to have a significant effect on C–S bond cleavage and (b) with fewer mobile protons and lower NCE values, the S–S bond may tend to yield more cleavage as compared to the C–S bond.
Figure 8

MS2 analysis of the charge/energy-dependence changes in HCD fragmentation of S–S and C–S bonds of disulfide-linked peptide, TPEVTCVVVDVSHEDPEVK_SS_CK at (A) m/z 583.032 of the charge 4+ ion at an NCE of 20 and (B) m/z 466.627 of the charge 5+ ion at an NCE of 24. The intensity axis is scaled down to an intensity range from 0 to 12% relative to the base peak to show low levels of disulfide bond cleavage in the HCD spectrum. * fragments contain an intact disulfide bond. C–S and S–S indicate cleavage from disulfide bonds.

MS2 analysis of the charge/energy-dependence changes in HCD fragmentation of S–S and C–S bonds of disulfide-linked peptide, TPEVTCVVVDVSHEDPEVK_SS_CK at (A) m/z 583.032 of the charge 4+ ion at an NCE of 20 and (B) m/z 466.627 of the charge 5+ ion at an NCE of 24. The intensity axis is scaled down to an intensity range from 0 to 12% relative to the base peak to show low levels of disulfide bond cleavage in the HCD spectrum. * fragments contain an intact disulfide bond. C–S and S–S indicate cleavage from disulfide bonds.

Key Feature 3: Double Cleavage of Disulfide-Linked Peptide Ions Is Common

They are frequently observed for all disulfide peptides investigated in this study. The percent abundances of double cleavage ions range from 10 to 35% of total product ion intensity. The formation of disulfide-bonded double cleavage ions can be quite complex. They can be grouped into two categories. One involves either double cleavage of one peptide (internal ions) or a cleavage from each of two component peptides, and the other results from the cleavage of both peptide backbone and disulfide bonds (see Group 2 of Table ). Following backbone fragmentation, further fragmentation of the resultant fragments may be derived by the breakage of the disulfide bond. The number and abundance level of double cleavage products are energy- and charge-dependent. As expected, higher collision energy and charge values appear to increase the portion of double cleavages. As illustrated in Figure , dominant double cleavage products were observed in the HCD spectrum of the charge 7+ CH3 disulfide peptide. The large proportion of multiply charged major products (shown in green color) observed in the high m/z region can be assigned as arising from the sequential cleavage of both component peptides. The product ions were analyzed for the peptide in 4+, 5+, 6+, and 7+ charge states, showing that the relative abundances of double cleavages at each charge are 1, 4, 10, and 22%, respectively.
Figure 9

HCD spectrum of the charge 7+ disulfide-linked peptide ion, NQVSLTCLVK_SS_WQQGNVFSCSVMHEALHNHYTQK, at NCE = 24 shows dominant double cleavage that generated a number of major internal fragments containing a disulfide bond (in square brackets []). * fragments contain an intact disulfide bond.

HCD spectrum of the charge 7+ disulfide-linked peptide ion, NQVSLTCLVK_SS_WQQGNVFSCSVMHEALHNHYTQK, at NCE = 24 shows dominant double cleavage that generated a number of major internal fragments containing a disulfide bond (in square brackets []). * fragments contain an intact disulfide bond.

Consensus Mass Spectral Library

We compiled a library of consensus mass spectra of all disulfide-linked peptide ions acquired in this study from the tryptic digest of the NISTmAb. Each of them arises from multiple spectra of the same peptide ion at a given collision energy. They were derived from the LC–MS/MS analysis of 82 runs under non-reducing and partial reducing conditions. Spectra were included only if at least 60% of total peak abundance could be confidently annotated. Over 10,000 replicate spectra underlie these consensus mass spectra. The spectral library contains 702 consensus spectra for 149 native and 234 scrambled disulfide-bonded peptide ions (e.g., sequence/modification/charge). Each entry in the library provides information on protein, Cys position in sequence, disulfide bonds, precursor m/z, normalized collision energy, identification score, percentages of assigned/unassigned abundances, and number of replicate spectra used for creating the consensus spectrum. Peaks in each spectrum are thoroughly annotated. This library is freely available at https://chemdata.nist.gov/dokuwiki/doku.php?id=peptidew:lib:disulfidepeptides.

Use of the Library of Disulfide-Linked Spectra in General Antibody Analysis

One potential utility of the mass spectral library is to evaluate the effectiveness of reduction in peptide mapping sample preparation. We demonstrate this using LC–MS/MS results from three different datasets, all of which were performed under conventional reducing conditions: Humira (a commercially available mAb drug),[30] human plasma (a NIST reference material, SRM 1950),[31,32] and NISTmAb prepared without second reduction.[5] Note that the former two samples were prepared with a second reduction step. This analysis was carried out using the NIST MSQC Pipeline.[33] It identified 5, 11, and 29 disulfide-linked peptide spectra in Humira, NIST SRM 1950, and NISTmAb, respectively, using a match score threshold of 450 and mass error of less than 10 ppm. All identifications were manually confirmed and are shown in Table S4. Figure shows an example of a charge 4+ peptide ion containing a native disulfide linkage, CH2 detected in the 2D LC–MS/MS analysis of reduced human plasma reference material. In the reduced Humira experiments, we found two native disulfide bonds. In the NIST 1950 plasma reference material, two native and two scrambled linkages were found. Furthermore, in the NISTmAb analysis, which did not include a second reduction step, all five native disulfide bonds of constant regions as well as four scrambled disulfide bonds were detected. These observations suggest that (1) low levels of disulfide-linked ions can be identified even after reduction in LC–MS analysis and (2) the second reduction step often implemented in standard peptide mapping experiments[34] is justified as fewer disulfide bonds were detected when this extra reduction step was done in the Humira and NIST SRM 1950 analyses.
Figure 10

Head-to-tail plot of experimental and reference HCD tandem mass spectra of an ion containing a native disulfide bond CH2. It shows an experimental ESI-MS/MS spectrum acquired from 2D LC–MS/MS analysis of reduced human plasma reference material NIST SRM 1950 matching to the charge 4+ disulfide-linked peptide spectrum in the library, “TPEVTCVVVDVSHEDPEVK_SS_CK”. The library spectral similarity score is 930 (1000 as the highest).

Head-to-tail plot of experimental and reference HCD tandem mass spectra of an ion containing a native disulfide bond CH2. It shows an experimental ESI-MS/MS spectrum acquired from 2D LC–MS/MS analysis of reduced human plasma reference material NIST SRM 1950 matching to the charge 4+ disulfide-linked peptide spectrum in the library, “TPEVTCVVVDVSHEDPEVK_SS_CK”. The library spectral similarity score is 930 (1000 as the highest).

Conclusions

In this study, we performed a detailed analysis of disulfide-linked peptides generated in the tryptic digestion of a reference monoclonal antibody, the NISTmAb. This was done both for understanding the fragmentation of these complex disulfide-linked peptides and for creating a reference mass spectral library for helping others to readily identify these peptides. For this purpose, a data analysis workflow of combining MS and MS/MS was developed, using MS data to assign all possible disulfide-containing peptide spectra and then MS/MS data to resolve any ambiguous identifications. This approach helps circumvent the tedious manual inspection of MS/MS spectra necessary for the protein disulfide bond profile with high sensitivity and accuracy. Because the method searches the complete combination of all possible disulfide bonds in proteins based on their sequence, all native as well as non-native disulfide-containing peptides are covered in searches. Hence, this approach could be useful for routine disulfide mapping in general proteomics applications. In our study, this led to the identification and annotation of 428 disulfide-linked peptide ions with a variety of origins: fully tryptic, missed cleavages, irregular cleavages, containing Met/Trp oxidation, and metallated adducts. Based on these results, we effectively characterized the relative level of all the nine native disulfide bonds and detected 86 additional disulfide linkages from shuffling in the LC–MS/MS analysis of the NISTmAb. As expected, the S–S bond shuffling at pH 8 was greatly minimized under neutral condition pH 7. This analysis highlights the unique difficulties in a comprehensive analysis of disulfide-linked peptides especially where reduction is incomplete, either by design or not, and where fragmentation lacks readily identifiable diagnostic fragment ions. Transforming these results into a spectral library will enable others to efficiently identify both native and scrambled disulfide peptides in routine proteomics analyses containing IgG1 antibodies. We showed that with a reference library of confidently assigned spectra, the unambiguous assignment of tandem mass spectra (MS/MS) to disulfide-linked peptide sequences can be easily done. We also demonstrated that this library can identify disulfide bonds of IgG1 proteins in reduced human serum digests, thereby providing a unique measure of the effectiveness of reduction methods in proteomics experiments.
  31 in total

Review 1.  Formation and transfer of disulphide bonds in living cells.

Authors:  Carolyn S Sevier; Chris A Kaiser
Journal:  Nat Rev Mol Cell Biol       Date:  2002-11       Impact factor: 94.444

2.  Tandem mass spectral libraries of peptides in digests of individual proteins: Human Serum Albumin (HSA).

Authors:  Qian Dong; Xinjian Yan; Lisa E Kilpatrick; Yuxue Liang; Yuri A Mirokhin; Jeri S Roth; Paul A Rudnick; Stephen E Stein
Journal:  Mol Cell Proteomics       Date:  2014-06-02       Impact factor: 5.911

3.  Mass Spectral Library of Acylcarnitines Derived from Human Urine.

Authors:  Xinjian Yan; Sanford P Markey; Ramesh Marupaka; Qian Dong; Brian T Cooper; Yuri A Mirokhin; William E Wallace; Stephen E Stein
Journal:  Anal Chem       Date:  2020-04-23       Impact factor: 6.986

4.  Comprehensive identification of protein disulfide bonds with pepsin/trypsin digestion, Orbitrap HCD and Spectrum Identification Machine.

Authors:  Chuanlong Cui; Tong Liu; Tong Chen; Johanna Lu; Ian Casaren; Diogo Borges Lima; Paulo Costa Carvalho; Annie Beuve; Hong Li
Journal:  J Proteomics       Date:  2018-12-14       Impact factor: 4.044

Review 5.  Recent mass spectrometry-based techniques and considerations for disulfide bond characterization in proteins.

Authors:  Jude C Lakbub; Joshua T Shipman; Heather Desaire
Journal:  Anal Bioanal Chem       Date:  2017-12-18       Impact factor: 4.142

6.  Ranking the susceptibility of disulfide bonds in human IgG1 antibodies by reduction, differential alkylation, and LC-MS analysis.

Authors:  Hongcheng Liu; Chris Chumsae; Georgeen Gaza-Bulseco; Karen Hurkmans; Czeslaw H Radziejewski
Journal:  Anal Chem       Date:  2010-06-15       Impact factor: 6.986

7.  Proteolytic 18O labeling for comparative proteomics: evaluation of endoprotease Glu-C as the catalytic agent.

Authors:  Kristy J Reynolds; Xudong Yao; Catherine Fenselau
Journal:  J Proteome Res       Date:  2002 Jan-Feb       Impact factor: 4.466

8.  Steric effects in peptide and protein exchange with activated disulfides.

Authors:  Jason Kerr; Jessica L Schlosser; Donald R Griffin; Darice Y Wong; Andrea M Kasko
Journal:  Biomacromolecules       Date:  2013-07-19       Impact factor: 6.988

9.  Free sulfhydryl in recombinant monoclonal antibodies.

Authors:  Wei Zhang; Marta J Czupryn
Journal:  Biotechnol Prog       Date:  2002 May-Jun

10.  Development of a Standard Reference Material for metabolomics research.

Authors:  Karen W Phinney; Guillaume Ballihaut; Mary Bedner; Brandi S Benford; Johanna E Camara; Steven J Christopher; W Clay Davis; Nathan G Dodder; Gauthier Eppe; Brian E Lang; Stephen E Long; Mark S Lowenthal; Elizabeth A McGaw; Karen E Murphy; Bryant C Nelson; Jocelyn L Prendergast; Jessica L Reiner; Catherine A Rimmer; Lane C Sander; Michele M Schantz; Katherine E Sharpless; Lorna T Sniegoski; Susan S-C Tai; Jeanice B Thomas; Thomas W Vetter; Michael J Welch; Stephen A Wise; Laura J Wood; William F Guthrie; Charles R Hagwood; Stefan D Leigh; James H Yen; Nien-Fan Zhang; Madhu Chaudhary-Webb; Huiping Chen; Zia Fazili; Donna J LaVoie; Leslie F McCoy; Shahzad S Momin; Neelima Paladugula; Elizabeth C Pendergrast; Christine M Pfeiffer; Carissa D Powers; Daniel Rabinowitz; Michael E Rybak; Rosemary L Schleicher; Bridgette M H Toombs; Mary Xu; Mindy Zhang; Arthur L Castle
Journal:  Anal Chem       Date:  2013-12-03       Impact factor: 6.986

View more
  1 in total

1.  Streamlining the Characterization of Disulfide Bond Shuffling and Protein Degradation in IgG1 Biopharmaceuticals Under Native and Stressed Conditions.

Authors:  Jill Coghlan; Alexander Benet; Preethi Kumaran; Michael Ford; Lawrie Veale; St John Skilton; Sergei Saveliev; Anna A Schwendeman
Journal:  Front Bioeng Biotechnol       Date:  2022-03-14
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.