| Literature DB >> 21685906 |
Jeffrey R Whiteaker1, Chenwei Lin, Jacob Kennedy, Liming Hou, Mary Trute, Izabela Sokal, Ping Yan, Regine M Schoenherr, Lei Zhao, Uliana J Voytovich, Karen S Kelly-Spratt, Alexei Krasnoselsky, Philip R Gafken, Jason M Hogan, Lisa A Jones, Pei Wang, Lynn Amon, Lewis A Chodosh, Peter S Nelson, Martin W McIntosh, Christopher J Kemp, Amanda G Paulovich.
Abstract
High-throughput technologies can now identify hundreds of candidate protein biomarkers for any disease with relative ease. However, because there are no assays for the majority of proteins and de novo immunoassay development is prohibitively expensive, few candidate biomarkers are tested in clinical studies. We tested whether the analytical performance of a biomarker identification pipeline based on targeted mass spectrometry would be sufficient for data-dependent prioritization of candidate biomarkers, de novo development of assays and multiplexed biomarker verification. We used a data-dependent triage process to prioritize a subset of putative plasma biomarkers from >1,000 candidates previously identified using a mouse model of breast cancer. Eighty-eight novel quantitative assays based on selected reaction monitoring mass spectrometry were developed, multiplexed and evaluated in 80 plasma samples. Thirty-six proteins were verified as being elevated in the plasma of tumor-bearing animals. The analytical performance of this pipeline suggests that it should support the use of an analogous approach with human samples.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21685906 PMCID: PMC3232032 DOI: 10.1038/nbt.1900
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 54.908
Figure 1Multistage, targeted proteomic pipeline for triage and verification of biomarker candidates. (a) Overview of the workflow used to triage and verify candidate biomarkers, showing the flux of candidates at each stage of the pipeline. (b) Required resources for implementing the proteomic pipeline. The overall timeline includes time for data collection and analysis. For Q-SRM and immuno-SRM measurements, the overall timeline includes synthetic peptide quality control, development of SRM methods, acquisition of response curves and data analysis (but not the time required to generate antibodies, which can be interspersed with other activities). Instrument demands are summarized independently to provide an estimate of the required laboratory resources to carry out the study. Additional reagent costs (e.g., peptide standards and antibodies) are required for Q-SRM and immuno-SRM assays. Finally, the required personnel used in each phase of the study are denoted as full-time equivalents.
Metrics of performance and verification results for quantitative SRM-based assays
| Gene.Peptide | LOD
| LOQ
| Median %CV | LOQ %CV | Preclinical
| Clin. Apparent
| ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| fmol | Peptide (ng/ml) | Protein (ng/ml) | fmol | Peptide (ng/ml) | Protein (ng/ml) | AUC | AUC | |||||
| 0.6 | 0.9 | 9.2 | 1.4 | 2.0 | 21.2 | 8.3 | 11.2 | 0.70 | 0.092 | 0.51 | 0.764 | |
| 0.5 | 0.5 | 17.3 | 1.4 | 1.3 | 43.4 | 5.8 | 9.7 | 0.59 | 0.508 | 0.63 | 0.326 | |
| 0.8 | 1.0 | 10.7 | 1.8 | 2.3 | 23.6 | 9.5 | 5.6 | 0.54 | 0.746 | 0.70 | 0.144 | |
| 9.0 | 12.2 | 586.7 | 24.3 | 33.2 | 1592.3 | 4.6 | 7.2 | 0.56 | 0.654 | 0.65 | 0.281 | |
| 0.1 | 0.1 | 1.2 | 0.2 | 0.2 | 2.7 | 8.1 | 6.6 | 0.53 | 0.813 | 0.80 | 0.035 | |
| 2.8 | 2.5 | 247.3 | 5.8 | 5.3 | 518.8 | 2.9 | 3.1 | 0.76 | 0.050 | 0.72 | 0.095 | |
| 0.4 | 0.4 | 9.8 | 0.8 | 0.8 | 20.0 | 3.3 | 3.3 | 0.71 | 0.090 | 0.69 | 0.139 | |
| 0.3 | 0.2 | 8.5 | 0.5 | 0.4 | 17.7 | 3.9 | 7.2 | 0.62 | 0.296 | 0.50 | 0.980 | |
| 1.8 | 2.2 | 39.0 | 3.9 | 4.8 | 86.7 | 5.1 | 8.6 | 0.55 | 0.694 | 0.74 | 0.087 | |
| 2.1 | 2.5 | 89.0 | 3.5 | 4.1 | 147.8 | 5.4 | 4.9 | 0.55 | 0.694 | 0.59 | 0.445 | |
| 2.7 | 2.2 | 124.3 | 7.1 | 5.7 | 324.6 | 5.1 | 10.1 | 0.52 | 0.873 | 0.55 | 0.721 | |
| 9.8 | 11.6 | 420.2 | 24.3 | 28.8 | 1045.9 | 4.2 | 3.6 | 0.64 | 0.346 | 0.50 | 0.975 | |
| 0.7 | 0.6 | 74.8 | 1.7 | 1.5 | 189.4 | 8.3 | 9.1 | 0.77 | 0.027 | 0.64 | 0.318 | |
| 0.1 | 0.1 | 1.1 | 0.3 | 0.2 | 2.5 | 8.0 | 8.8 | 0.52 | 0.833 | 0.73 | 0.105 | |
| 4.7 | 5.5 | 294.5 | 12.2 | 14.1 | 754.8 | 14.9 | 5.5 | 0.69 | 0.174 | 0.67 | 0.224 | |
| 0.8 | 0.5 | 27.7 | 1.9 | 1.3 | 64.3 | 5.6 | 4.1 | 0.81 | 0.015 | 0.68 | 0.172 | |
| 4.8 | 7.0 | 82.6 | 10.5 | 15.4 | 182.0 | 4.3 | 4.1 | 0.73 | 0.060 | 0.6 | 0.448 | |
| 1.7 | 1.4 | 33.7 | 3.6 | 2.9 | 70.3 | 3.7 | 5.5 | 0.72 | 0.102 | 0.74 | 0.057 | |
| 0.9 | 0.9 | 170.9 | 2.1 | 2.1 | 386.7 | 6.5 | 6.5 | 0.70 | 0.132 | 0.67 | 0.189 | |
| 15.3 | 21.8 | 283.5 | 40.2 | 57.1 | 744.6 | 2.7 | 2.8 | 0.59 | 0.530 | 0.65 | 0.256 | |
Three to four transitions were measured for each target in the equivalent of 1.5 μl of depleted plasma, and three repeats (three LC-SRM-MS analyses) were performed at ten concentration points (and a blank sample). The best transition for each target was selected as the basis for the quantitative assay. A linear regression algorithm was used for fitting the ten serial dilution data points for each curve. The LOD of each target was obtained from the average of the blank measurements plus three times the s.d. The LOQ was obtained from the average of the blank measurements plus ten times the s.d. with additional criteria that the CV of measurements of the concentration points near the calculated LOQ were <25%[40,41]. Protein LOD/LOQ estimates assume complete trypsin digestion.
Assays followed by an asterisk (*) were characterized by reverse curves, as described in Supplementary Results Section 4. ROC curves and corresponding AUC values were determined for each candidate, and P values were determined by permutation testing, as described in Supplementary Results Section 6. Candidates determined to be significantly elevated (that is, verified; AUC ≥ 0.8 and P ≤ 0.01 in the plasma of animals with clinically apparent tumors are in boldface. (Although Aldoc, Chi3l1 and Lbp meet significance based on P-values, their levels are near their assay LODs.) Peptide sequences are abbreviated to the first four amino acids (full sequences are shown in Supplementary Worksheet 4a).
Metrics of performance and verification results for immuno-SRM assays
| Gene.Peptide | LOD
| LOQ
| Median %CV | LOQ %CV | Preclinical
| Clinical Apparent
| ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| fmol | Peptide (ng/ml) | Protein (ng/ml) | fmol | Peptide (ng/ml) | Protein (ng/ml) | AUC | AUC | |||||
| 24.0 | 2.4 | 498.6 | 61.7 | 6.2 | 1280.3 | 18.2 | 23.3 | 0.56 | 0.682 | 0.54 | 0.672 | |
| 12.2 | 2.0 | 43.4 | 25.8 | 4.2 | 92.2 | 13.2 | 3.8 | 0.54 | 0.779 | 0.50 | 0.983 | |
| 3.4 | 0.5 | 22.1 | 7.5 | 1.0 | 48.1 | 15.7 | 13.1 | 0.50 | 0.498 | 0.82 | 0.025 | |
| 2.2 | 0.3 | 6.8 | 4.1 | 0.6 | 12.9 | 12.1 | 9.8 | 0.65 | 0.256 | 0.80 | 0.015 | |
| 8.0 | 1.8 | 45.9 | 16.6 | 3.8 | 95.3 | 11.2 | 4.9 | 0.64 | 0.279 | 0.72 | 0.107 | |
| 13.7 | 3.1 | 61.1 | 34.9 | 7.9 | 155.6 | 13.3 | 7.3 | 0.59 | 0.567 | 0.73 | 0.057 | |
| 48.6 | 7.5 | 258.0 | 98.3 | 15.1 | 522.1 | 14.3 | 6.2 | 0.51 | 0.920 | 0.53 | 0.818 | |
| 1.2 | 0.2 | 1.9 | 1.8 | 0.3 | 2.9 | 8.4 | 8.2 | 0.61 | 0.398 | 0.61 | 0.418 | |
| 2.4 | 0.5 | 8.1 | 4.1 | 0.8 | 13.8 | 9.3 | 4.5 | 0.66 | 0.254 | 0.76 | 0.072 | |
| 7.9 | 1.9 | 49.2 | 14.0 | 3.3 | 87.2 | 15.9 | 10.6 | 0.70 | 0.134 | 0.67 | 0.226 | |
| 0.8 | 0.1 | 3.9 | 1.0 | 0.1 | 5.1 | 11.5 | 11.5 | 0.64 | 0.341 | 0.61 | 0.470 | |
| 9.3 | 1.8 | 32.7 | 22.3 | 4.3 | 78.2 | 11.9 | 8.9 | 0.62 | 0.415 | 0.58 | 0.540 | |
| 3.3 | 0.8 | 11.7 | 7.2 | 1.6 | 25.4 | 16.3 | 2.8 | 0.65 | 0.254 | 0.61 | 0.353 | |
| 8.6 | 1.8 | 162.2 | 15.5 | 3.2 | 293.4 | 18.8 | 9.8 | 0.65 | 0.249 | 0.51 | 0.963 | |
| 5.3 | 0.7 | 16.8 | 11.4 | 1.4 | 36.6 | 7.7 | 4.9 | 0.51 | 0.925 | 0.73 | 0.082 | |
| 5.5 | 1.7 | 26.1 | 9.6 | 2.9 | 45.3 | 8.5 | 18.0 | 0.56 | 0.610 | 0.51 | 0.945 | |
| 10.1 | 1.7 | 47.6 | 26.4 | 4.5 | 124.2 | 16.3 | 16.9 | 0.69 | 0.142 | 0.60 | 0.483 | |
| 5.6 | 1.1 | 32.2 | 12.1 | 2.3 | 69.9 | 11.2 | 5.8 | 0.63 | 0.328 | 0.52 | 0.888 | |
| 10.2 | 1.5 | 22.3 | 18.4 | 2.7 | 40.2 | 6.0 | 5.4 | 0.50 | 0.498 | 0.60 | 0.391 | |
| 1.3 | 0.3 | 12.8 | 1.8 | 0.4 | 17.8 | 13.8 | 7.4 | 0.66 | 0.296 | 0.66 | 0.204 | |
| 3.4 | 1.0 | 49.5 | 8.2 | 2.5 | 118.5 | 16.7 | 2.2 | 0.56 | 0.622 | 0.63 | 0.378 | |
| 54.0 | 8.1 | 223.0 | 128.8 | 19.4 | 531.9 | 42.1 | 7.8 | 0.51 | 0.920 | 0.67 | 0.219 | |
| 4.6 | 0.7 | 13.2 | 7.4 | 1.1 | 21.2 | 8.6 | 2.2 | 0.75 | 0.057 | 0.69 | 0.147 | |
| 35.1 | 4.8 | 146.6 | 86.4 | 11.8 | 361.1 | 13.6 | 4.5 | 0.67 | 0.177 | 0.55 | 0.498 | |
| 0.2 | 0.0 | 1.8 | 0.5 | 0.1 | 4.1 | 12.1 | 8.3 | 0.53 | 0.826 | 0.50 | 0.498 | |
Three to four transitions were measured for each target in the equivalent of 10 μl of digested plasma, and three repeats (three SISCAPA process replicates) were performed at ten concentration points (and a blank sample). The best transition for each target was selected as the basis for the quantitative assay. A linear regression algorithm was used for fitting the ten serial dilution data points for each curve. The LOD of each target was obtained from the average of the blank measurements plus three times the s.d. The LOQ was obtained from the average of the blank measurements plus ten times the s.d. with additional criteria that the CV of measurements of the concentration points near the calculated LOQ were <25%[40,41]. Protein LOD/LOQ estimates assume complete trypsin digestion. Assays for candidates followed by asterisk (*) were characterized by reverse curves, as described in Supplementary Results Section 5. ROC curves and corresponding AUC values were determined for each candidate, and P-values were determined by permutation testing (Supplementary Results Section 6). Candidates determined to be significantly elevated (that is, verified; AUC ≥ 0.8 and P ≤ 0.01) in the plasma of animals with clinically apparent tumors are in boldface. Peptide sequences are abbreviated to the first four amino acids (full sequences are shown in Supplementary Worksheet 5a).
Figure 2Concentrations of biomarker candidates in plasma of tumor-bearing animals compared to controls. The median value is plotted as a line with each box displaying the distribution of the inner quartiles with whiskers showing 95% of the data. Results shown are derived from the SRM-MS–based measurements in Supplementary Worksheet 6a (worksheet labeled “clinically apparent”). Gray and blue data points represent Q-SRM data, whereas orange and red data points represent immuno-SRM data. Cases and controls were matched with respect to age and gender, housed in the same cage, euthanized on the same day, and derived from different animals than those used in the discovery and prioritization experiments. (Although Aldoc and Chi3l1 meet significance based on P-values, their levels are near their assay LODs.) CA, clinically apparent tumors; CTRL, matched controls.
Figure 4Immuno-SRM and ELISA data confirm elevation of Mfge8 in plasma of tumor-bearing animals. (a) Response curve for the immuno-SRM-MS assay targeting a proteotypic peptide from Mfge8. The %CV is 10% (at the LOQ), LOD is 20.9 ng protein/ml plasma and LOQ is 48.4 ng protein/ml plasma. (b) Distributions of the concentration of Mfge8 (inferred from the peptide concentration and assuming 100% recovery) in the mouse cohort with clinically apparent tumors (CA) and matched controls (CTRL). The median value is plotted as a line with each box displaying the distribution of the inner quartiles and whiskers showing 95% of the data. Results shown are derived from the immuno-SRM-MS–based measurements in Supplementary Worksheet 6a. (c) Box plots show the distributions of the concentration of the Mfge8 protein in the mouse cohort with clinically apparent tumors (CA) and matched controls (CTRL), determined using a commercially available ELISA assay. These are the identical plasma samples analyzed by immuno-SRM-MS in b. The median value is plotted as a line with each box displaying the distribution of the inner quartiles and whiskers showing 95% of the data. (d) ROC curve generated from the immuno-SRM-MS–based measurements of Mfge8 in the clinically apparent tumor cohort (blue line) and the preclinical mouse cohort (gold line). The plot was derived from the immuno-SRM-MS data in Supplementary Worksheet 6a,b.
Figure 3SRM-MS, ELISA and western blot analysis data confirm elevation of Lcn2 in plasma of tumor-bearing animals. (a) Response curve for the SRM-MS assay targeting a proteotypic peptide from Lcn2. The analytic %CV is 1.9% (at the LOQ), LOD is 10.5 ng protein/ml plasma and LOQ is 22.5 ng protein/ml plasma. (b) Distributions of concentrations of Lcn2 (inferred from the peptide concentration and assuming 100% recovery) in the mouse cohort with clinically apparent tumors (CA) and controls (CTRL). The median value is plotted as a line with each box displaying the distribution of the inner quartiles and whiskers showing 95% of the data. Results derived from the SRM-MS–based measurements in Supplementary Worksheet 6a. (c) Western blot analysis showing elevation of Lcn2 in plasma pools of tumor-bearing animals (lane 2) compared with healthy controls (lane 1). (d) Distributions of the concentrations of the Lcn2 in the mouse cohort with clinically apparent tumors (CA) and matched controls (CTRL), determined using a commercially available ELISA assay. The median value is plotted as a line with each box displaying the distribution of the inner quartiles and whiskers showing 95% of the data. Note that these are not the same plasma samples analyzed by SRM-MS in b. (e) ROC curve generated from the SRM-MS–based measurements of Lcn2 in the clinically apparent tumor cohort (blue line) and the preclinical mouse cohort (gold line).
Accession codes for ProteomeCommons.org
| Data set | Description | Tranche hash string |
|---|---|---|
| AIMS and LC-MS/MS of SCX fraction 1 | AIMS and LC-MS/MS RAW data | K3lkD96CyN8D6deSXTB5NlyqoNLgEOu C3tH0CRL+HDS1vkEodI/jPoXqv7k2q8P 3iXBM7T36wznmTqGRMueg54PEeuAA AAAAAAAeIQ== |
| AIMS and LC-MS/MS of SCX fraction 2 | AIMS and LC-MS/MS RAW data | ehRyE2rw4mrIkVvP+AEzgfwCOQOfgpmI bCousVQbbcpii8/rAR3qu8MstEKwg6x X+sOeIAS9r652yWc6Hw7ehWatyhcAAA AAAAAQmA== |
| AIMS and LC-MS/MS of SCX fraction 3 | AIMS and LC-MS/MS RAW data | 9/JrdiG2NiVFBww0qQ6FL+V4vJMCnUoU 6jdk3FEbLMXfpEjvGtBU/2i4DG0eY1XNBj CtMzxjd86U96n+D2VpCdf1aYcAAAAAAAA g7g== |
| AIMS and LC-MS/MS of SCX fraction 4 | AIMS and LC-MS/MS RAW data | Erqd2AkdAcUMmnPnmMCxh7Ak0vHe6/l 6S7smI8uz5ncpxmRLTT5fGoUu/re92EC Qvvz4Zn9z3ZSR+E7xNkssqN7j8moAAA AAAAAh5Q== |
| AIMS and LC-MS/MS of SCX fraction 5 | AIMS and LC-MS/MS RAW data | hg6it116AddlZ4/hG3Poiup3n9aj1NqKg dO4TpWlfqgUqjBSAPcIU4FOGQlolVlYC 3cA3La/YWiZOwfrBTrl41c6cygAAAAAAA Af8Q== |
| AIMS and LC-MS/MS of SCX fraction 6 | AIMS and LC-MS/MS RAW data | mK1cV+hKDZWRkOvty20dR4LOvJ2CrE OO55Rmu/RYauMzlE2Cay23Tl+0Eb94G fLjj9H65+ASL1PUu1tJ0cs5tZ6+MUgAA AAAAAAf8Q== |
| AIMS and LC-MS/MS of SCX fraction 7 | AIMS and LC-MS/MS RAW data | g3AUiwyHrtdHIhF5puFStf0CdHSSrHkb7H 0IuRhjyl0oTQYSiTtYRGzh9sCZCQe8E3IPX 4rgieHQkg2pMjhZ1cRlZjsAAAAAAAAf8Q== |
| AIMS and LC-MS/MS of SCX fraction 8 | AIMS and LC-MS/MS RAW data | cxVO8d9hxVqGN6GlA0fZR/+U0khGL8SV RAmdq46x6ovoMyKoo0NMVfI9UUv/TO+ k6L6Gijis7TEvxl/wx9uu7Orm+qIAAAAAA AAf8Q== |
| AIMS and LC-MS/MS of SCX fraction 9 | AIMS and LC-MS/MS RAW data | cijirLqD3wTBYAIIoQA+dz/TVWWYJrwgUzGzTi4Y9vnZY48m6C6uqR 3J6FdjRwyzP4QNJhfVI5hNub65jxfx+ jFhs+8AAAAAAAAf8Q== |
| AIMS and LC-MS/MS of SCX fraction 10 | AIMS and LC-MS/MS RAW data | AhG0zSjIacyC4j+pwv6dbBf8GDxQHvRz+ Nwcld11wWPHqwhiHl/tQDVnBFiRS7pdK 5ZLh8Zwz30bHhYvrbha60erA3cAAAAAA AAgDA== |
| AIMS and LC-MS/MS of SCX fraction 1 | AIMS and LC-MS/MS mzXML data | T61O2OmxA97AvDQ2099GblLUNi0wl4 oNgKZuFlflK0CMC9/iBv2qPutYlmb+i4G TqCuGkHeXEl0MdEXS34EPo9irZ7EAAA AAAAAL8w== |
| AIMS and LC-MS/MS of SCX fraction 2 | AIMS and LC-MS/MS mzXML data | LoJt/2J49bDTJOOmMx4UooWsoFL61A0 eexVz7KVHIoKvz91lWiC6vP6hnEtwjVPT QQ50ccPWV2s4y3oFXfEFHU0Ylc0AAAA AAAASvw== |
| AIMS and LC-MS/MS of SCX fraction 3 | AIMS and LC-MS/MS mzXML data | hQi8L5AbeWrHiy9Lnkx6n/cbfcrnyraDZB nMwcOWAqoDHNtUEByQOGu10wl698E YXu63CWI7PRZBD75mSVmlBiUMrf4AA AAAAAAiXQ== |
| AIMS and LC- MS/MS of SCX fraction 4 | AIMS and LC-MS/MS mzXML data | 5bDXI5/I8fjPk3j9gmI4hL6SBPEDdi8KG d7VsPI5+OToODJQ+qdxKXJw1VLph1pY4 YCzSAZLr2amRZjZ1CZsLxgGOV4AAAAA AAAiXQ== |
| AIMS and LC-MS/MS of SCX fraction 5 | AIMS and LC-MS/MS mzXML data | dgE2su595zuURoADids8A/PE6REgxsCs N3n3IrPDm+w9luMhqFfsRPOAp5MJei1y OMQcwDcRk+TatQ3206sLhIqD+/YAAAA AAAAgYQ== |
| AIMS and LC-MS/MS of SCX fraction 6 | AIMS and LC-MS/MS mzXML data | O5YapJumqtXlpeLbiRPvkMdMK5MHOlq 7UYzzDjF08vOu/5PZCmrJT2Pq2MvjFVrI Fz76uHSFiit9JXg6RswCdcPwH3MAAAA AAAAgYQ== |
| AIMS and LC-MS/MS of SCX fraction 7 | AIMS and LC-MS/MS mzXML data | v4Tnih9rrX+QugLO41s/h16puHrm0XyZOiiu Y9dPm2Kj5nS8NRD4htTriKbvb28aCIXJX6 2Nj1E+x26wZQ39bjOtYvIAAAAAAAAgYQ== |
| AIMS and LC-MS/MS of SCX fraction 8 | AIMS and LC-MS/MS mzXML data | eqCLA6T8Toopu5b1pFnV2A6h+WN1pspYkaj b8ia/v9RnKxfcVNoDexbVWrrMfwJn5wmwZM 2Zct6MexIQ1G0ztJXelvMAAAAAAAAgYQ== |
| AIMS and LC-MS/MS of SCX fraction 9 | AIMS and LC-MS/MS mzXML data | atBRwAIMFiao9s1W3FmWB8p4/Z52pbP EQLMijsKHIxVfsUnkNOB9cEIXV4B+qrU HPZ8w8TJrT8DnfWzMkF6+f5h0gxYAAA AAAAAgYQ== |
| AIMS and LC-MS/MS of SCX fraction 10 | AIMS and LC-MS/MS mzXML xx/88UT4UsBb+07o/tmMib4h75mlB7s data | 0RV8VHh/OYNqsnjSUKsmwOXB4lyzCzA GWjeGnEYWwIPEXkEk2m5wMN6W8jnQ AAAAAAAAgfA== |
| mouse_plasma_qmrm_siscapa_data | Integration results rIh4keM2uJWcRrPe9dS5IP91eDvVFyuV from response curves | JGBayQr7V8H8r9+cugheWgCtpr8HAHC xbCPK6rBFa/4JIhC9EBv0luInFZAAAAAA AABawQ== |