Literature DB >> 24611550

Effects of calibration approaches on the accuracy for LC-MS targeted quantification of therapeutic protein.

Eslam Nouri-Nigjeh¹, Ming Zhang, Tao Ji, Haoying Yu, Bo An, Xiaotao Duan, Joseph Balthasar, Robert W Johnson, Jun Qu.

Abstract

LC-MS provides a promising alternative to ligand-binding assays for quantification of therapeutic proteins and biomarkers. As LC-MS methodology is based on the analysis of proteolytic peptides, calibration approaches utilizing various calibrators and internal standards (I.S.) have been developed. A comprehensive assessment of the accuracy and reliability of these approaches is essential but has yet been reported. Here we performed a well-controlled and systematic comparative study using quantification of monoclonal-antibody in plasma as the model system. Method development utilized a high-throughput orthogonal-array-optimization, and two sensitive and stable signature-peptides (SP) from different domains were selected based on extensive evaluations in plasma matrix. With the purities of all protein/peptide standards corrected by quantitative amino acid analysis (AAA), five calibration approaches using stable-isotope-labeled (SIL) I.S. were thoroughly compared, including those at peptide, extended-peptide, and protein levels and two "hybrid" approaches (i.e., protein calibrator with SIL-peptide or SIL-extended-peptide I.S.). These approaches were further evaluated in parallel for a 15 time point, preclinical pharmacokinetic study. All methods showed good precision (CV% < 20%). When examined with protein-spiked plasma QC, peptide-level calibration exhibited severe negative biases (-23 to -62%), highly discordant results between the two SP (deviations of 38-56%), and misleading pharmacokinetics assessments. Extended-peptide calibration showed significant improvements but still with unacceptable accuracy. Conversely, protein-level and the two hybrid calibrations achieved good quantitative accuracy (error < 10%), concordant results by two SP (deviations < 15%), and correct pharmacokinetic parameters. Hybrid approaches were found to provide a cost-effective means for accurate quantification without the costly SIL-protein. Other key findings include (i) using two SP provides a versatile gauge for method reliability; (ii) evaluation of peptide stability in the matrix before SP selection is critical; and (iii) using AAA to verify purities of protein/peptide calibrators ensures accurate quantitation. These results address fundamental calibration issues that have not been adequately investigated in published studies and will provide valuable guidelines for the "fit for purpose" development of accurate LC-MS assays for therapeutic proteins and biomarkers in biological matrices.

Entities: Chemical Disease Gene Species

Mesh：

Substances：
Proteins

Year: 2014 PMID： 24611550 PMCID： PMC3982980 DOI： 10.1021/ac5001477

Source DB: PubMed Journal: Anal Chem ISSN： 0003-2700 Impact factor: 6.986

Therapeutic proteins, and in particular monoclonal antibodies (mAb), have recently gained enormous success due to their high specificity, efficacy, and lower risks of immunogenicity.[1−5] These agents exhibit desired pharmacological characteristics such as long serum half-lives, high potency, and limited off-target toxicity.[6,7] However, protein drugs show more complex pharmacokinetic (PK) behaviors than small-molecule drugs.[6,7] Studying the pharmacokinetics of therapeutic proteins requires highly accurate quantification methods that enable the correct estimation of drug concentrations in plasma.[6,8] Conventionally, ELISA (enzyme-linked immunosorbent assay) is utilized for this purpose owing to its high sensitivity and analytical throughput. However, ELISA methods are often matrix- and species-dependent, and the method development is often time-consuming and costly, which is especially problematic in the early phases of drug discovery and development.[9,10] By comparison, liquid chromatography mass spectrometry (LC–MS) using selected reactions monitoring (SRM) is often matrix- and species-independent, and method development is generally faster than that for ELISA; moreover, LC–MS assays can be readily multiplexed, providing multiple potential advantages versus ELISA[11−14] Most LC–MS-based methods quantify protein by measuring a selected proteolytic signature peptide (SP) that serves as a surrogate for the intact protein. For this reason, several different calibration approaches exist at the peptide,[15,16] extended-peptide,[17,18] and protein levels.[8,19] The choice of calibrators and stable-isotope-labeled (SIL) internal standards (I.S.) is among the most critical factors governing the reliability and accuracy of the LC–MS-based quantification.[17,20−22] Peptide-level calibration is the most widely practiced approach, which employs one synthesized SP as the calibrator and a SIL-analog of the SP as the I.S. (spiked after digestion).[16,23] This approach enables a straightforward and facile development of quantitative methods, and both the calibrators and SIL-I.S. are readily available from commercial sources. Nevertheless, the use of an SIL-peptide as I.S. only corrects variations caused by LC–MS analysis but not the upstream steps such as sample preparation and digestion (Figure 1A).[21] Moreover, owing to the use of a peptide calibrator, this approach actually derives protein concentrations based on the measured SP concentrations in the digest, with the assumption that the efficiencies of sample preparation and digestion are close to 100%, which may not be true.[24] For example, tryptic digestion is rarely complete and can be partially nonspecific.[25] Consequently, this approach may result in significant bias that is not readily perceivable when the method is validated by spiking synthesized peptides into matrix digests, a prevalent practice for peptide-level calibration. Our lab and others have observed that for a wide variety of proteins, peptide-level calibration approaches resulted in significant quantitative bias when examined with protein-spiked QC samples.[15,19,24,26,27] Furthermore, when two unique SP were selected for the quantification of one protein, the peptide-level calibration approach may give two sets of highly discordant results for the same protein.[21,24,26]

Figure 1

Illustrations of various calibration approaches for targeted protein quantification and the study design of the comparative study. (A) The three options of internal standard (I.S.) methods used for targeted protein quantification. Full length stable-isotope-labeled (SIL) protein I.S. is added to the sample before any preparation; SIL-extended-peptide I.S. is added right before digestion, and SIL-peptide I.S. is spiked into the digest mixture after digestion. (B) Scheme of the comparison. The peptide- and extended-peptide-level calibrations and two hybrid calibrations were compared against the protein-level calibration (the Gold Standard), for quantitative performances in both protein-spiked QC samples and a full rat PK study. (C) The calibrators and I.S. employed for each calibration approach for quantification of the anti-HCV mAb in plasma. For each calibration approach, two sets of calibration curves (one for each SP) were independently established and then used for quantification. More recently, extended-peptide calibration approaches were introduced to address concerns related to digestion efficiency. The calibrator consists of a synthesized extended-peptide containing the SP sequence and (typically) 3–6 flanking residues extended from both the N- and C-termini. A SIL-extended-peptide is used as the I.S., which is spiked prior to digestion.[12,18,28,29] With cleavable sites at both ends of the SP sequence, this approach may help to compensate for the bias and variation introduced in the digestion step (e.g., missed cleavage and peptide degradation[12]). For example, Neubert et al. recently demonstrated that the extended-peptide calibration approach enabled accurate and sensitive quantification of small protein biomarkers, such as nerve growth factors, in plasma.[18] Artificial QconCAT proteins that contain multiple extended-peptides have been engineered to enable multiplexed quantification of a large number of targets.[30] The protein-level calibration method employs full-length protein as the calibrator and full-length SIL-protein as I.S., which is spiked directly into samples before sample preparation. This method provides high accuracy and precision by correcting the system bias and variation introduced by any of the sample preparation and enzymatic digestions steps[19,21−23] and, therefore, is considered the “Gold Standard” for LC–MS-based targeted protein quantification.[31−33] However, a drawback of protein-level calibration is that the production of full-length SIL-protein with high isotope purity can be costly and time-consuming[23] and is impractical for many classes of proteins. A systematic, comparative evaluation of the accuracy and precision of these calibration approaches in biological/pharmaceutical matrices will provide valuable information for therapeutic protein quantification. To our knowledge, such an investigation has not been conducted previously. In this study, we comprehensively compared the quantitative performance of five internal-standard calibration approaches, including the three above-mentioned calibration approaches (peptide-calibrator/SIL-peptide-I.S., extended-peptide-calibrator/SIL-extended-peptide-I.S., and protein-calibrator/SIL-protein-I.S.) and two “hybrid” approaches (protein-calibrator/SIL-peptide-I.S. and protein-calibrator/SIL-extended-protein-I.S.). Quantification of a mAb in plasma was used as the model system (illustrated in Figure 1). SP candidates were experimentally discovered, and the most sensitive and stable SP were selected based on evaluations in plasma digest. Two unique SP from different domains of the mAb were chosen to serve as a gauge for quantitative reliability. To avoid possible bias arising from differing purities of calibration standards, the purities of all protein, peptide, and extended-peptide standards were accurately measured by quantitative amino acid analysis (AAA). The accuracy and precision of each approach were assessed using QC samples prepared by spiking pure protein into plasma. Finally, we compared the five calibration approaches in a full pharmacokinetic assessment using a rat model.

Experimental Procedure

Materials and Reagents

A chimeric anti-Hepatitis C virus antibody (HCV-mAb, ∼170 kD) and its full-length stable-isotope-labeled (SIL) cognate were obtained from AbbVie (formerly Abbott Laboratories, North Chicago, IL). Sequencing-grade trypsin was from Promega (Madison WI). Protease, phosphatase, and kinase inhibitor cocktail tablets were from Roche (Basel, Switzerland). Bicinchoninic acid (BCA) protein assay reagents were from Pierce (Rockford, IL). HPLC-grade methanol, acetonitrile, acetone, and water were from B&J (Muskegon, MI). LC–MS-grade formic acid was from Fluka (Buchs, Switzerland). Tris(2-carboxyethyl) phosphine (TCEP), Tris, iodoacetamide (IAA), and phosphate-buffered saline were obtained from Sigma–Aldrich (St. Louis, MO). Signature peptides, extended signature peptides, their isotope-labeled cognates {i.e., GPSVFPLAPSSK[15N,13C] (the GPS peptide), TVAAPSVFIFPPSDEQLK[15N,13C] (the TVA peptide), ASTKGPSVFPLAPSSK[15N,13C]STSG (the extended GPS peptide), and EIKRTVAAPSVFIFPPSDEQLK[15N,13C]SGTA (the extended TVA peptide)}, and their nonlabeled versions were synthesized by New England Peptide (Gardner, MA) with nominal purity >95%. The peptide and extended-peptide stock solutions were prepared in 50% acetonitrile. The actual purities of all standards were calibrated using values measured accurately by quantitative amino acid analysis (AAA). Isotopic purities for synthesized peptide and extended-peptide SIL-standards were >99.9% and ∼97% (for lysine residues) for the SIL-protein produced by SILAC.

Identification of Signature Peptides (SP) via an Experimental Strategy and Development of the LC–MS Method

In this study, the optimal SP were identified using an experimental strategy, which included (i) the discovery of SP candidates by LC-Orbitrap MS analysis; (ii) a high-throughput, on-the-fly orthogonal array optimization to optimize SRM conditions for all candidates in plasma digest, and (iii) selection of two stable and sensitive SP based on the thorough evaluation in the plasma matrix. Details are in the Supporting Information.

Preparation of Plasma Samples

The rat plasma samples were subjected to protein denaturing, reduction/alkylation, and then a precipitation/on-pellet-digestion procedure that provides an efficient sample cleanup and optimal peptide recovery.[26,27,34] An aliquot of 2 μL of rat plasma sample was diluted 20-fold by PBS (100 mM, pH7.4) containing 0.5% SDS. Proteins in the diluted sample was reduced by TCEP (2 mM) for 15 min at 37 °C and then alkylated with IAA (100 mM) at 37 °C for 30 min. Both steps were performed under darkness with agitation. The protein content was precipitated in two steps, by adding 1 vol of chilled acetone at −20 °C and vortexing for 1 min to generate fine-sized protein aggregates, followed by adding 5 vol of chilled acetone. The samples were centrifuged at 20000g for 30 min, and the supernatant was removed carefully and the protein pellet was allowed to air-dry. Tris buffer (50 mM, pH 8.2) containing trypsin was added to a final E/S ratio of 1:20 (w/w). The solution was then incubated at 37 °C and vortexed at 500 rpm for 6 h to dissolve the pellet. A second aliquot of trypsin was added at an E/S ratio of 1:15 (w/w), and the mixture was incubated at 37 °C overnight to achieve complete digestion. Formic acid was added to a final concentration of 1% (v/v) to terminate the digestion. The sample was then centrifuged at (20000g) at 4 °C for 30 min, and 2/3 of the supernatant was transferred to the autosampler vial for LC–MS analysis.

LC–SRM–MS Analysis

Quantitative analysis was performed on a microflow-LC system coupled to a triple-quadruple MS.[35] An Agillent 1100 capillary HPLC was interfaced to a Thermo Scientific Quantum Ultra EMR triple-quadruple mass spectrometer via an ESI source (San Jose, CA). The separation was performed on a Zorbax-SB-C18 Stablebond column (150 mm × 0.5 mm i.d., 3.5 μm, and 100 Å pore size). A 20 min gradient was employed with a 10 μL injection volume, and the flow rate was 15 μL/min. A and B solvents were mixtures of water:acetonitrile:formic acid at 98:2:0.1 and 15:85:0.1, respectively. The gradient was 80% A to 72.5% A in 3 min and 55% A in 9 min, and then 100% B for 3 min followed by column equilibration at 80% A for 5 min. The column temperature was 50 °C. The spray voltage was 3.5 kV, and the capillary temperature was 320 °C. Quantification was performed using selected reaction monitoring of the transitions m/z 593.8→699.5 for GPS and m/z 973.8→913.4 for TVA. The optimized collision energy and tube-lens voltages were set at 21 eV/100 V and 29 eV/120 V based on the conditions obtained by orthogonal array optimization (OAO). Additional transitions (e.g., m/z 593.8→846.4 and m/z 973.8→1060.0 for GPS and TVA, respectively) were monitored to confirm the accurate assignment of the chromatography peaks. I.S. peptides were monitored using transitions of m/z 597.8→707.4 and m/z 977.8→ 921.2 for GPS and TVA, respectively. The dwell time was 100 ms for each transition. Q1 and Q3 resolution were both set at 0.7 fwhm (full width at half-maximum). The selected reaction monitoring data was processed by LCquan from Thermo Scientific. The signal-to-noise ratios (S/N) were determined as the intensity of the ion-current peak signals of target peptides over the baseline chemical noises. The lower limit of quantification, LLOQ, was determined as at least three folds of the lower limit-of-detection (LOD) of the least sensitive signature peptide of the same protein.

Establishment of the Five Calibration Approaches

The five different calibration approaches (Figure 1B) were prepared across the protein concentration range of 0.1–200 μg/mL (or equal molar concentrations of peptides or extended peptides). The actual purities of all standards were accurately quantified by quantitative amino acid analysis (AAA). Each calibration method was prepared following its own procedure (c.f. Figure 1, panels A and C). For each calibration approach, two calibration curves were prepared independently for the two SP. For each SP, the calibration curve was constructed by plotting the peak area ratios of the SP and I.S. versus the corresponding mAb concentrations. Linear regression with a 1/x2 weighting factor was employed. The procedures for preparing the calibration solutions are specified below.

Protein-Level Calibration

The protein-level calibration solutions were prepared by spiking 0.1, 0.2, 0.5, 1, 5, 20, 40, and 200 μg/mL pure mAb (purity was calibrated by AAA, same below) in pooled blank rat plasma before any preparation procedure; meanwhile, full-length SIL-HCV mAb was spiked at the concentration of 500 ng/mL. All calibration samples were experimentally processed and analyzed by LC-SRM-MS, following the above-mentioned protocols.

Peptide-Level Calibration

Peptide-level calibration solutions were prepared with adding GPS peptide at 0.032, 0.064, 0.16, 0.32, 1.6, 6.4, 12.8, and 64 ng/mL and TVA peptide at 0.052, 0.104, 0.26, 0.52, 2.6, 10.4, 20.8, and 104 ng/mL into digested rat plasma, which had been diluted 50-fold from the original plasma sample before the digestion procedure (c.f. the preparation procedure). The above concentration ranges correspond to the same molarities of 0.1–200 μg/mL mAb protein in plasma. The levels of SIL-GPS and TVA I.S. were 2.00 and 3.25 ng/mL, respectively.

Extended-Peptide Level Calibration

Standard solutions were prepared by spiking 0.052, 0.104, 0.26, 0.52, 2.6, 10.4, 20.8, and 104 ng/mL extended-GPS peptide and 0.076, 0.152, 0.376, 0.752, 3.75, 15, 30, and 150 ng/mL extended-TVA-peptide to the diluted and prepared plasma before digestion. These concentration ranges correspond to the same molarities of 0.1–200 μg/mL mAb protein in plasma. The I.S., SIL-extended-GPS, and SIL-extended-TVA were added at 3.25 and 4.68 ng/mL, respectively.

Hybrid Calibrations

Two varieties of hybrid calibrations were investigated. For both of the hybrid calibration approaches, the protein calibrator was spiked at 0.1, 0.2, 0.5, 1, 5, 20, 40, and 200 μg/mL HCV mAb antibody to blank rat plasma before any preparation step, which is the same as the protein-level calibration approach. For the hybrid approach using SIL-extended-peptide I.S., the SIL-extended-peptide I.S. was added at the same levels as that of the extended-peptide-level calibration approach, after sample preparation but before digestion; for the hybrid approach using SIL-peptide I.S., the protein-spiked plasma samples were prepared and digested, and then the SIL-peptide I.S. was added after digestion at the same levels as that of the peptide-level calibration approach.

Validation and Evaluation of Quantitative Performance Using Protein-Spiked QC Samples

The five calibration approaches were individually validated using the “default” method for each [i.e., spiking blank plasma samples (or sample digest) with the corresponding calibrators (peptides, extended-peptides or protein) of each calibration approach]. Details are shown in the Supporting Information. In order to evaluate the real accuracy of these approaches for the quantification of protein in plasma, another batch of QC samples were prepared by spiking the AAA-certified anti-HCV-mAb (full-length protein) into blank rat plasma at three levels (1.6, 10, and 80 μg/mL). Aliquots of these QC samples were prepared and analyzed by every calibration approach in triplicate for each of two different days (day 1 and day 14, N = 6). For each sample and each calibration approach, two sets of quantitative results were independently obtained by the two SP, and the agreement between the two values was evaluated.

Comparison of the Five Approaches in a Full Preclinical PK Study

Animal protocols were approved by the Institutional Animal Care and Use Committee of the SUNY-Buffalo. Male Sprague–Dawley rats were given a single intravenous dose of 2 mg/kg of the mAb (N = 4), and 50 μL of blood was collected at each of the 15 time points after administration (i.e., 1, 2, 4, 6, 12, 24, 48, 72, 120, 168, 216, 264, 360, 504, and 648 h). The plasma was procured, and after vortexing, five 2 μL aliquots of the same plasma sample were taken, which were quantified with each of the five different calibration methods. Two sets of quantitative values were independently obtained using the two SP. On the basis of the time–concentration curves obtained by these calibration approaches, the PK parameters were assessed with use of WinNonlin (Pharsight Corporation, Palo Alto, CA).

Results and Discussion

Selection of Two SP and Development of the LC–MS Method

In order to conduct an unbiased comparison of the calibration approaches, it is important to select the optimal SP that are stable, sensitive, and specific to the target protein (anti-HCV mAb, sequence shown in Figure 1 of the Supporting Information). To achieve this goal, instead of using an in silico method to predict the optimal SP, we employed an experimental strategy to discover and optimize many SP candidates, and then to evaluate these candidates in target matrices prior to SP selection.[24,26,27] Briefly, the pool of SP candidates was generated by a data-dependent nano-LC-LTQ/Orbitrap analysis and a stringent filtering step.[26,27] The list of qualified SP candidates is shown in Table 1 of the Supporting Information. To evaluate these candidates, the anti-HCV mAb was spiked to blank plasma and then the samples were prepared and digested; the optimal LC–MS conditions of all SP candidates shown in Table 1 of the Supporting Information were accurately obtained by a high-throughput and on-the-fly orthogonal-array-optimization (OAO)[24,26,27] within one single LC–MS run. With the use of the developed LC–MS conditions, all candidates were thoroughly assessed for stability and signal-to-noise ratios (S/N) in the plasma digest. Selection of unstable peptides as SP carries a significant risk of quantitative bias/variation, which cannot be corrected by SIL-peptide I.S.[24,26,36] For this reason, we evaluated the stability of the 15 candidates (Table 1 of the Supporting Information) by continuous incubation of the digest mixture under conditions respectively mimicking tryptic digestion (37 °C, pH 8.5, 24 h) and queuing in a cooled autosampler (4 °C, pH 2.8, 48 h). Details are presented in the Experimental section of the Supporting Information. Interestingly, 5 out of the 15 SP candidates showed considerable degradation (>20%) in at least one of the conditions, and some of these unstable peptides were of high abundance (Figure 2 and Table 1 of the Supporting Information). These observations underscore the potential risks of selecting SP merely based on the signal intensity. Among the stable peptides, two unique SP showing the highest S/N from heavy chain (GPSVFPLAPSSK, referred as the GPS peptide) and light chain (TVAAPSVFIFPPSDEQLK, the TVA peptide) were selected. The SP and their extended peptides (ASTKGPSVFPLAPSSKSTSG and EIKRTVAAPSVFIFPPSDEQLKSGTA) are synthesized in both isotope labeled and nonlabeled forms.

Figure 2

Evaluation of the stability of the 15 candidates for signature peptides (SP) from the anti-HCV mAb, by continuous incubation of the digest mixture under conditions mimicking (A) prolonged tryptic digestion (37 °C, pH 8.5, 24 h) and (B) queuing in a cooled autosampler (4 °C, pH 2.8, 48 h), respectively. The peptides are represented by the first three residues of their sequence (the full list is in Table 1 of the Supporting Information). GPS and TVA peptides from the heavy and light chains of the mAb, respectively, were finally selected as the SP (shown in red and blue). To enable an unbiased comparison of the five calibration approaches, the purities of all peptide, extended-peptide, and protein calibrators were quantified by the AAA and the mean quantitative values were corroborated for every amino acid residue to ensure accuracy. The results are shown in Table 2 of the Supporting Information. Clearly, the actual purities of all standards were markedly lower than the nominal values. For example, though the manufacturer-labeled purity for the TVA peptide was >95%, the true purity was only 65%. Manufacturers use HPLC-UV or electrophoresis methods to determine protein/peptide purities, which are unable to detect nonresponsive contaminations (e.g., dried buffer salts) and thus may result in a positive bias. As the standards were produced by highly reputable providers, such biases may be prevalent and thereby performing AAA to verify calibration standards is advisable to ensure quantitative accuracy. Plasma samples were subjected to precipitation and overnight digestion, a procedure demonstrated to afford high peptide recovery, high reproducibility, and excellent digestion efficiency,[24,27,34,37] followed by LC–MS analysis. Typical chromatograms of the two SP in plasma digest are shown in Figure 2 of the Supporting Information. Good selectivity was achieved for both SP. The detection limits were 15 and 30 ng of mAb protein per mL of plasma for GPS and TVA peptides, respectively. In this study, the lower limits of quantification (LLOQ) for both SP were validated at 100 ng protein per milliliters of plasma.

Comparison of the Five Calibration Approaches with Spiked QC Samples

Calibration Approaches at Peptide, Extended-Peptide, and Protein Levels

To ensure an objective comparison, the calibration curves at the peptide, extended-peptide, and protein levels were prepared in the same protein concentration range of 0.1–200 μg/mL in plasma (or equal molarities of peptides/extended-peptides, the same as below). For every approach, two calibration curves were independently prepared for the two SP. All calibration curves showed excellent linearity (R2 ≥ 0.99, Table 3 of the Supporting Information). Then we validated these calibration approaches using QC samples spiked with their corresponding calibrators (e.g. spiking pure synthesized peptides to digested blank plasma for peptide calibration), extended-peptides right before digestion for extended-peptide calibration, and pure protein before sample preparation for protein calibration. With the use of these “by default” validation approaches that have been widely practiced,[16] all methods exhibited excellent accuracy and precision at the three QC levels, as shown in Table 3 of the Supporting Information. However, the validation practices used for peptide and extended-peptide calibrations may not correctly reflect the accuracy of protein quantification because the target analyte is the protein rather than a peptide or extended-peptide. Subsequently, we sought to evaluate the quantitative performance of these three calibration approaches using QC prepared by spiking AAA-certified full-length protein into blank plasma at three levels (1.6, 10, and 80 μg/mL). The results are illustrated in Figure 3A, which plots the quantitative biases by the two SP (on the x- and y-axis) in replicate measurements (N = 6 in two different days). Detailed quantitative data is shown in Table 4 of the Supporting Information. All methods showed excellent precision. However, the peptide-level calibration clearly resulted in severe negative bias with both SP (Figure 3A); moreover, the quantitative values by the two SP differed substantially (Figure 3A and Table 4 of the Supporting Information). These problems may arise from the incomplete peptide recoveries in the preparation and digestion steps, and that the recoveries for the two SP may be substantially different. Incomplete recovery of SP appears to be inevitable, even if thoroughly optimized preparation and digestion strategies were employed.[15,19,20,24,26,31] The use of the extended-peptide calibration approach was found to markedly reduce the quantitative biases and discrepancy between the two SP (Figure 3A), probably because the incomplete peptide recovery during digestion was at least partially compensated for.[12,18,38] Nevertheless, that accuracy achieved by extended-peptide calibration is still insufficient (Figure 3A). Previously Neubert et al. demonstrated that extended-peptide calibration achieved excellent quantitative accuracy for a small cytokine protein.[18] We speculate that the different observations between the two studies may reflect that the extended-peptide could well-resemble the digestion behavior of the SP domain in small proteins, but not so in a much larger protein such as a mAb.[39,40]

Figure 3

Two-dimensional representations of the quantitative biases by (A) peptide-, extended-peptide-, and protein-level calibration approaches and (B) the two “hybrid” calibration approaches (c.f., Figure 1). QC samples were prepared by spiking blank plasma with pure protein at three levels: 1.6, 10, and 80 μg/mL. The purities of all standards were accurately measured by the quantitative amino acid analysis method to eliminate bias arising from possible inaccurate purity. Five aliquots of each QC sample were individually prepared and analyzed in replicates by the five calibration approaches. Each sample was analyzed three times in each of two different days (day 1 and day 14, N = 6, shown as individual data points). For every calibration method, the quantitative values were obtained independently using the two signature peptides (SP) (i.e., the GPS and TVA peptides). The two axes represent the quantitative biases by the two SP. The red box in the center of each panel denotes the zone of <20% bias, while the golden box signifies the zone of <10% bias. Protein-level calibration utilizing full-length protein as calibrator and full-length SIL-protein as I.S. has demonstrated excellent quantitative performance regardless of the efficiencies of the preparation and digestion procedures,[19,21,23] and therefore can be used as the “Gold Standard” for targeted protein quantification. Consistent with this notion, in this study, excellent accuracy and precision were observed at all QC levels by the protein-level calibration (Figure 3A). Though protein-level calibration provides high accuracy and precision, SIL-proteins must be produced in high-yields to be cost-effective, limiting their production to those for which optimized cell lines have been developed.[20] Additionally, since these proteins are generally produced by the SILAC method,[41] achieving very high isotopic purity is often both technically challenging and cost prohibitive. In this study, a 97% isotope purity (i.e., 3% of the mAb molecules is not labeled) was obtained for the SIL-anti-HCV mAb following multiple cycles of metabolic labeling, in comparison with >99.9% isotopic purities that are typically seen in synthesized SIL-peptides. To avoid sample contamination by the I.S., the SIL-protein has to be spiked at a relatively low level (<30-fold of detect limit in this study) and thus may limit the flexibility of method development. Typical chromatograms illustrating this problem are shown in Figure 3 of the Supporting Information.

Hybrid Calibration Approaches

To achieve accurate quantification, one plausible alternative to the protein-level calibration is the “hybrid” calibration methods (i.e., employing the full-length protein as calibrator and SIL-peptide or SIL-extended peptide as I.S.). When reproducible preparation and digestion steps are achieved, the use of full-length proteins as calibrator will offset the peptide losses in these steps even without using SIL-protein I.S. and, therefore, may result in accurate protein quantification. Our previous works as well as these by others showed accurate quantification of liver enzymes and mAb, using protein calibrator and SIL-peptide I.S.[12,24,26,27,42] Here we evaluated the quantitative performance of two hybrid calibration methods: protein-calibrator with SIL-peptide I.S. and protein-calibrator with SIL-extended-peptide-I.S., using QC prepared by spiking blank plasma with pure protein. Excellent quantitative accuracy and precision were observed for both methods at the three protein levels (N = 6, Figure 3B and Table 4 of the Supporting Information). The high level of accuracy and precision also suggested high reproducibility of the preparation and digestion procedure used in this study. These results demonstrated that hybrid approaches have a good potential for accurate targeted quantification in the absence of SIL-protein.

Comprehensive Comparison of Calibration Approaches in a Full Pharmacokinetics Study in Rats

The above comparisons were carried out in blank plasma spiked with known amounts of pure protein. In order to examine whether the above observations hold true for “real-world” applications, we further compared the five calibration approaches in parallel for a full pharmacokinetics analysis. Four rats (N = 4) received a single, clinically relevant dose of mAb (2.0 mg/kg), and then plasma samples were collected at 15 time points ranging from 0.5 to 648 h post injection (see Experimental Section). Five 2 μL aliquots of each sample were taken out, and then each aliquot was prepared and analyzed by one of the five calibration approaches. The time courses measured by these approaches are shown in Figure 4 (panels A–E). Protein-level calibration was utilized as the gold standard to estimate the range of true protein concentrations in the time-course samples (Figure 4A) and to define the acceptable error zone (±15%, as indicated by the gray bands in Figure 4 (panels A–F). As a cross validation, results by a parallel ELISA quantification also fell well within this zone (Figure 4F). Consistent with the findings in protein-spiked QC samples, the peptide-level calibration resulted in substantially underestimated concentrations in all time-course samples; moreover, for the same set of plasma samples, quantification independently by the two SP resulted in two substantially different sets of concentration–time curves (Figure 4B). Extended-peptide calibration approach showed a significant improvement over the peptide-level calibration (Figure 4C); however, considerable underestimation was still observed and most of the data points are not in the acceptable range; furthermore, the results by the two SP also differed markedly (Figure 4C). In contrast, both of the hybrid calibration methods afforded accurate measurement in all time-course samples (i.e., all data points are within the acceptable zone, Figure 4, panels D–E), and the results by the two SP agreed well with each other. Time course data at each time points are shown in Table 5 of the Supporting Information. The extents of inconsistency between the two SP in the time course samples are summarized in Figure 5. For protein-level and hybrid calibration approaches, excellent agreements between the SP were observed (deviation <15%), which is in sharp contrast to peptide- and extended-peptide-level calibration methods, where the deviations were >15% in a vast majority of the samples. These observations highlight the benefits of using multiple SP as a gauge for reliability of the quantitative method and results.

Figure 4

Figure 5

The extent of inconsistency between the two SP in the time course samples. The deviation between quantitative results obtained by the two signature peptides was used as a gauge for the reliability of the quantification method. For each of the calibration approaches, the average deviation of samples within the same time point group was calculated (N = 4 per time point). The threshold of acceptance was deviation < 15% (red line) between the two SP.

Comparison of the five calibration approaches and ELISA for a full PK study in rats. The time courses of anti-HCV mAb were measured by the five approaches in parallel: (A) protein-level calibration, (B) peptide-level calibration, (C) extended-peptide-level calibration, and two “hybrid approaches” employing protein as a calibrator with (D) SIL-peptides or (E) SIL-extended-peptides as I.S. (F) The results of a parallel ligand binding assay (samples from the four animals were pooled at each time point). PK parameters calculated based on the time courses determined by each calibration approach are shown in (G). Rats (N = 4) were given a single dose (I.V., 2 mg/kg) of anti-HCV mAb, and blood samples were collected at 15 time points post injection. Five aliquots of each sample were individually prepared and analyzed, respectively, by the five calibration approaches. Quantitative data were obtained independently by the two SP. The ±15% error zones were estimated based on the mean values obtained by the protein-level calibration approach (the Gold Standard). The extent of inconsistency between the two SP in the time course samples. The deviation between quantitative results obtained by the two signature peptides was used as a gauge for the reliability of the quantification method. For each of the calibration approaches, the average deviation of samples within the same time point group was calculated (N = 4 per time point). The threshold of acceptance was deviation < 15% (red line) between the two SP. Finally, to assess the effects of calibration approaches on pharmacokinetics assessment, we calculated some basic pharmacokinetic parameters based on the time course data (Figure 4G and Table 1). The values measured by the protein-level calibration defined the threshold of correct estimation (±15%, Figure 4G). Both the peptide- and extended-peptide level calibrations severely overestimated the steady state volume of distribution (Vss) and clearance (Cl) of the mAb and underestimated the AUC, while the half-life values (T1/2) were correct because this parameter is not dependent on the absolute level of drug; additionally, two discordant sets of PK parameters were obtained by the two SP (Table 1). Per contra, the hybrid methods achieved correct estimation of all parameters by both SP (Table 1). Interestingly, the PK parameters estimated by the two SP are more concordant using the hybrid approach with SIL-peptide I.S. (hybrid#1), than using SIL-extended-peptide I.S. (hybrid#2). This observation probably rooted from the possible different digestion rates between the extended-peptides and the corresponding domains in the protein.

Table 1

Pharmacokinetic Parameters Calculated Based on the Quantitative Results Obtained by Different Calibration Approaches and Signature Peptides

	AUC (μg h/mL)	Cl (mL/h)	Vss (mL)	T_1/2 (h)
protein calibration, GPS	7274	0.083	28.9	260.7
protein calibration, TVA	7113	0.085	28.8	260.8
aprotein calibration, mean of two SP	7194	0.084	28.9	260.9
protein calibration, deviation between the two SP	2%	2%	0%	0%
hybrid no. 2 (SIL-extended-peptide I.S.), GPS	7551	0.080	30.6	298.5
hybrid no. 2 (SIL-extended-peptide I.S.), TVA	6763	0.089	32.3	263.9
hybrid no. 2, mean of two SP	7155	0.084	31.4	281.2
hybrid no. 2, deviation between the two SP	8%	8%	4%	9%
hybrid no. 1 (SIL-peptide I.S.), GPS	7244	0.083	27.9	252.5
hybrid no. 1 (SIL-peptide I.S.), TVA	7125	0.085	28.9	261.1
hybrid no. 1, mean of two SP	7212	0.084	28.3	252.5
hybrid no. 1, deviation between the two SP	1%	1%	2%	2%
extended-peptide calibration, GPS	5774	0.105	38.2	271.1
extended-peptide calibration, TVA	3696	0.163	49.5	239.2
extended-peptide calibration, mean of two SP	4721	0.128	43.2	255.6
extended-peptide calibration, deviation between SP	31%	31%	18%	9%
peptide calibration, GPS	5640	0.107	35.9	252.5
peptide calibration, TVA	2995	0.202	74.7	280.7
peptide calibration, mean of two SP	4293	0.141	49.0	260.4
peptide calibration, deviation between the two SP	43%	43%	50%	7%

Parameters were calculated based on the mean of the concentrations obtained by the two SP.

Conclusions

In biomedical/pharmaceutical research and industry, accurate measurement of protein, drug, or biomarker in biological matrices is often critical. To facilitate the development of accurate LC–MS-based methods, we systematically evaluated the quantitative accuracy achieved by various calibration approaches. For each approach, the target mAb protein was independently quantified by two SP that were stringently selected based on a thorough evaluation of all candidates for stability and sensitivity. Under the sample preparation, digestion, and LC–MS methods developed here, all calibration approaches exhibited excellent quantitative precision. Though peptide-level calibration achieved excellent accuracy when validated with peptide-spiked QC (the “default” method for validation of peptide-level calibration), severe negative bias, and discordant results by the two SP were observed when this approach was examined with protein-spiked QC, suggesting that reliance on peptide calibration may carry a “hidden” but significant risk of quantitative error. Extended-peptide calibration showed noticeable improvement but negative biases remained, probably reflecting the fact that a short extended-peptide may not resemble the behavior of a large protein such as the mAb. Conversely, good quantitative accuracy was observed for the three approaches using full-length protein as the calibrator, including the protein-level calibration (the gold standard) and two hybrid approaches. While the protein-level calibration requires full-length SIL-protein which is expensive and may have isotope purity issues, the hybrid methods circumvent this need by employing SIL-peptide or SIL-extended-peptide as I.S. without perceivably compromising quantitative accuracy. Therefore, when reproducible sample preparation is achieved, the hybrid methods may provide a cost-effective alternative to protein-level calibration for accurate quantification. This study also demonstrated that the peptide- and extended-peptide level calibration approaches may result in misleading biological/pharmaceutical information, such as the incorrect pharmacokinetic assessments observed in the thorough pharmacokinetic study. Again the hybrid calibration approaches resulted in accurate pharmacokinetic assessments. This work also underscored the benefits of using two SP from different domains of a protein to gauge the reliability of quantification, which is capable of perceiving quantitative errors arising from such factors as methodological bias and truncation or modification of the target protein. Also highlighted in this study are the importance of using AAA-certified standards for accurate absolute quantification and the necessity of evaluating peptide stabilities prior to the selection of SP, two issues that are often overlooked. This study addressed important calibration issues in the developing field of LC–MS-based protein quantification, which has not been adequately addressed in published studies. The observations from this study should be of considerable value for practitioners as they develop “fit for purpose” quantitative assays. Preclinical and clinical PK studies require accurate quantification of therapeutic proteins and biomarkers. However, while the peptide-level calibration approach may not accurately reveal absolute protein concentrations in biological matrices, it is straightforward to develop at low cost and high throughput and will perform well for relative quantification; for example, it is an excellent and powerful tool for early estimation of T1/2 or multiplexed evaluation of protein biomarkers in preclinical species.[13]

42 in total

1. Evaluation of a cleavable stable isotope labeled synthetic peptide for absolute protein quantification using LC-MS/MS.

Authors: David R Barnidge; Gregory D Hall; Jeanne L Stocker; David C Muddiman
Journal: J Proteome Res Date: 2004 May-Jun Impact factor: 4.466

Review 2. Antibody pharmacokinetics and pharmacodynamics.

Authors: Evelyn D Lobo; Ryan J Hansen; Joseph P Balthasar
Journal: J Pharm Sci Date: 2004-11 Impact factor: 3.534

Review 3. Protein therapeutics: a summary and pharmacological classification.

Authors: Benjamin Leader; Quentin J Baca; David E Golan
Journal: Nat Rev Drug Discov Date: 2008-01 Impact factor: 84.694

4. The implications of proteolytic background for shotgun proteomics.

Authors: Paola Picotti; Ruedi Aebersold; Bruno Domon
Journal: Mol Cell Proteomics Date: 2007-05-28 Impact factor: 5.911

5. Quantitative, multiplexed assays for low abundance proteins in plasma by targeted mass spectrometry and stable isotope dilution.

Authors: Hasmik Keshishian; Terri Addona; Michael Burgess; Eric Kuhn; Steven A Carr
Journal: Mol Cell Proteomics Date: 2007-10-15 Impact factor: 5.911

6. Isotope-labeled protein standards: toward absolute quantitative proteomics.

Authors: Virginie Brun; Alain Dupuis; Annie Adrait; Marlène Marcellin; Damien Thomas; Magali Court; François Vandenesch; Jérôme Garin
Journal: Mol Cell Proteomics Date: 2007-09-11 Impact factor: 5.911

7. Utility of cleavable isotope-coded affinity-tagged reagents for quantification of low-copy proteins induced by methylprednisolone using liquid chromatography/tandem mass spectrometry.

Authors: Jun Qu; William J Jusko; Robert M Straubinger
Journal: Anal Chem Date: 2006-07-01 Impact factor: 6.986

8. Multiplexed absolute quantification in proteomics using artificial QCAT proteins of concatenated signature peptides.

Authors: Robert J Beynon; Mary K Doherty; Julie M Pratt; Simon J Gaskell
Journal: Nat Methods Date: 2005-08 Impact factor: 28.547

9. Stable isotope labeling by amino acids in cell culture, SILAC, as a simple and accurate approach to expression proteomics.

Authors: Shao-En Ong; Blagoy Blagoev; Irina Kratchmarova; Dan Bach Kristensen; Hanno Steen; Akhilesh Pandey; Matthias Mann
Journal: Mol Cell Proteomics Date: 2002-05 Impact factor: 5.911

10. Multiplexed absolute quantification for proteomics using concatenated signature peptides encoded by QconCAT genes.

Authors: Julie M Pratt; Deborah M Simpson; Mary K Doherty; Jenny Rivers; Simon J Gaskell; Robert J Beynon
Journal: Nat Protoc Date: 2006 Impact factor: 13.491

14 in total

1. Pharmacokinetic Analysis of a Novel Human EGFRvIII:CD3 Bispecific Antibody in Plasma and Whole Blood Using a High-Resolution Targeted Mass Spectrometry Approach.

Authors: Teilo H Schaller; Matthew W Foster; J Will Thompson; Ivan Spasojevic; Deimante Normantaite; M Arthur Moseley; Luis Sanchez-Perez; John H Sampson
Journal: J Proteome Res Date: 2019-07-19 Impact factor: 4.466

2. Development of an antibody-free ID-LC MS method for the quantification of procalcitonin in human serum at sub-microgram per liter level using a peptide-based calibration.

Authors: Huu-Hien Huynh; Amandine Bœuf; Maxence Derbez-Morin; Anne-Marie Dupuy; Béatrice Lalere; Vincent Delatour; Joëlle Vinh
Journal: Anal Bioanal Chem Date: 2021-05-14 Impact factor: 4.142

Review 3. Toward sensitive and accurate analysis of antibody biotherapeutics by liquid chromatography coupled with mass spectrometry.

Authors: Bo An; Ming Zhang; Jun Qu
Journal: Drug Metab Dispos Date: 2014-09-02 Impact factor: 3.922

4. Sensitive, High-Throughput, and Robust Trapping-Micro-LC-MS Strategy for the Quantification of Biomarkers and Antibody Biotherapeutics.

Authors: Ming Zhang; Bo An; Yang Qu; Shichen Shen; Wei Fu; Yuan-Ju Chen; Xue Wang; Rebeccah Young; John M Canty; Joseph P Balthasar; Keeley Murphy; Debadeep Bhattacharyya; Jonathan Josephs; Luca Ferrari; Shaolian Zhou; Surendra Bansal; Faye Vazvaei; Jun Qu
Journal: Anal Chem Date: 2018-01-08 Impact factor: 6.986

Review 5. Quantitative proteomics in cardiovascular research: global and targeted strategies.

Authors: Xiaomeng Shen; Rebeccah Young; John M Canty; Jun Qu
Journal: Proteomics Clin Appl Date: 2014-07-14 Impact factor: 3.494

6. "Catch-and-Release" Anti-Carcinoembryonic Antigen Monoclonal Antibody Leads to Greater Plasma and Tumor Exposure in a Mouse Model of Colorectal Cancer.

Authors: Frank A Engler; Joseph Ryan Polli; Tommy Li; Bo An; Michael Otteneder; Jun Qu; Joseph P Balthasar
Journal: J Pharmacol Exp Ther Date: 2018-05-07 Impact factor: 4.030

7. Biomarker Assay Validation by Mass Spectrometry.

Authors: Carmen Fernández-Metzler; Brad Ackermann; Fabio Garofolo; Mark E Arnold; Binodh DeSilva; Huidong Gu; Omar Laterza; Yan Mao; Mark Rose; Faye Vazvaei-Smith; Rick Steenwyk
Journal: AAPS J Date: 2022-05-09 Impact factor: 4.009

Review 8. Clinical potential of mass spectrometry-based proteogenomics.

Authors: Bing Zhang; Jeffrey R Whiteaker; Andrew N Hoofnagle; Geoffrey S Baird; Karin D Rodland; Amanda G Paulovich
Journal: Nat Rev Clin Oncol Date: 2019-04 Impact factor: 66.675

9. Impact of Sample Matrix on Accuracy of Peptide Quantification: Assessment of Calibrator and Internal Standard Selection and Method Validation.

Authors: Samuel L Arnold; Faith Stevison; Nina Isoherranen
Journal: Anal Chem Date: 2015-12-14 Impact factor: 6.986

10. Development of an LC-MS/MS Proposed Candidate Reference Method for the Standardization of Analytical Methods to Measure Lipoprotein(a).

Authors: Santica M Marcovina; Noémie Clouet-Foraison; Marlys L Koschinsky; Mark S Lowenthal; Allen Orquillas; Michael B Boffa; Andrew N Hoofnagle; Tomáš Vaisar
Journal: Clin Chem Date: 2021-03-01 Impact factor: 12.167