| Literature DB >> 27237338 |
Jeffrey J Jones1, Bruce E Wilcox1, Ryan W Benz1, Naveen Babbar1, Genna Boragine1, Ted Burrell1, Ellen B Christie1, Lisa J Croner1, Phong Cun1, Roslyn Dillon1, Stefanie N Kairs1, Athit Kao1, Ryan Preston1, Scott R Schreckengaust1, Heather Skor1, William F Smith1, Jia You1, W Daniel Hillis2, David B Agus3, John E Blume4.
Abstract
INTRODUCTION: Colorectal cancer (CRC) testing programs reduce mortality; however, approximately 40% of the recommended population who should undergo CRC testing does not. Early colon cancer detection in patient populations ineligible for testing, such as the elderly or those with significant comorbidities, could have clinical benefit. Despite many attempts to identify individual protein markers of this disease, little progress has been made. Targeted mass spectrometry, using multiple reaction monitoring (MRM) technology, enables the simultaneous assessment of groups of candidates for improved detection performance.Entities:
Keywords: Classification; Colorectal cancer; Machine learning; Mass spectrometry, Multiple reaction monitoring
Mesh:
Substances:
Year: 2016 PMID: 27237338 PMCID: PMC8961700 DOI: 10.1016/j.clcc.2016.02.004
Source DB: PubMed Journal: Clin Colorectal Cancer ISSN: 1533-0028 Impact factor: 4.481
Figure 1Flow Diagram Depicting the Steps Involved to Reduce an Initial List of Candidate Protein Biomarkers to a Viable Multiple Reaction Monitoring Assay. In Brief, Target Proteins Underwent In Silico Tryptic Digestion From Which Peptides Were Down selected by Both In Silico Modeling and Empirical Measurements to an Interim List of Candidate Peptides. These Candidate Peptides Each Have 5 Transitions Optimized for Instrument Response and Evaluated for Matrix Interference. Additional Down selection for the Final Assay, Based on Performance Metrics, Resulted in 337 Peptides, Having 2 Representative Transitions per Each Peptide
Abbreviations: CRC = colorectal cancer; LCMS = liquid chromatography-mass spectrometry.
Summary of Patient Demographics and Clinical Annotations for 138 Discovery Set and 136 Validation Set Samples
| Variable | Discovery (n = 138) | Validation (n = 136) | ||
|---|---|---|---|---|
|
| ||||
| Control | CRC | Control | CRC | |
| Total | 69 | 69 | 68 | 68 |
| ProteoGenex | 24 | 24 | 24 | 24 |
| Asterand | 24 | 24 | 24 | 24 |
| CapitalBio | 21 | 21 | 20 | 20 |
| Gender | ||||
| Male | 29 | 29 | 28 | 28 |
| Female | 40 | 40 | 40 | 40 |
| Mean age (years) | 56.8 | 60.5 | 58.0 | 62.0 |
| CRC stage | NA | NA | ||
| I | 13 | 16 | ||
| II | 35 | 35 | ||
| III | 15 | 14 | ||
| IV | 6 | 3 | ||
| CRC lesion location | NA | NA | ||
| Colon | 33 | 39 | ||
| Rectum | 34 | 26 | ||
| Rectosigmoid junction | 2 | 3 | ||
Abbreviations: CRC = colorectal cancer; NA = not applicable.
Figure 2Plot of Concurrent Assay Transitions Across Mass Spectrometry (MS) Elution Time. Median Chromatography Full Width at Half Maximum for Heavy Peptides Was 3.4 Seconds, 8.6 Seconds at Baseline. Within the 30-Minute Chromatography Profile, Each Analyte Was Allocated a 42-Second Window for Data Acquisition With the MS Instrument in Dynamic Multiple Reaction Monitoring Acquisition Mode Resulting in an Number of Points Across Each Peak of 16.2 ± 5.4
Figure 3Box Plots (Whiskers at 95% Confidence Interval [CI]) of Differences of Measured Heavy Peptide Retention Times From Expected Times for Each Sample Injection. The Close Monitoring of Retention Time Drift Was Used to Justify the Exchange of the Main Chromatography Column Owing to Significant Risk to Losing Peak Measurements (A). Events for Column Exchange Were Triggered by the Lower 95% CI at 21 Seconds or a Loss of Approximately 17 Heavy Peptide Transitions. Additionally, a Chromatography Column Was Exchanged Owing to Risk of Liquid Chromatography Over Pressure (B)
Figure 4Calibration Curves for a Randomly Selected Set of Heavy Peptide Transitions, Showing the 5-Point Daily Calibration Curve Covering Individual Peptide Concentrations of 250 fmol/μL to 0.025 fmol/μL. All 12 Days, on Each Instrument, Are Represented in the Point Cluster. A Loess Smooth Line Was Plotted to Guide the Eye
Abbreviations: AUC = area under the curve; QQQ = triple quadropole.
Proteins, Peptides, and Transitions Constituting 13-Protein/15-Transition Validated Classifier for CRC Detection Using Targeted MS-MRM
| Protein Description | Protein ID | Peptide | Transition |
|---|---|---|---|
| α1-Acid glycoprotein 1 | A1AG1_HUMAN | NWGLSVYADK | y7 |
| α1-Acid glycoprotein 1 | A1AG1_HUMAN | SDVVYTDWK | y5 |
| α1-Antitrypsin | A1AT_HUMAN | SVLGQLGITK | y7 |
| α-Amylase 2B | AMY2B_HUMAN | LVGLLDLALEKDYVR | b3 |
| Clusterin | CLUS_HUMAN | EPQDTYHYLPFSLPHR | y3 |
| Complement component C9 | CO9_HUMAN | TEHYEEQIEAFK | y2 |
| Delta(3,5)-Delta(2,4)-dienoyl-CoA isomerase, mitochondrial | ECH1_HUMAN | LRDLLTR | b3 |
| Ferritin light chain | FRIL_HUMAN | GGRALFQDIK | b3 |
| Gelsolin | GELS_HUMAN | AGALNSNDAFVLK | b4 |
| Gelsolin | GELS_HUMAN | AGALNSNDAFVLK | y7 |
| Metalloproteinase inhibitor 1 | TIMP1_HUMAN | GFQALGDAADIR | b4 |
| Osteopontin | OSTP_HUMAN | AIPVAQDLNAPSDWDSR | y9 |
| Selenium-binding protein 1 | SBP1_HUMAN | EPLGPALAHELR | y6 |
| Seprase | SEPR_HUMAN | LGVYEVEDQITAVR | y8 |
| Spondin-2 | SPON2_HUMAN | HSLVSFVVR | y8 |
Abbreviations: CRC = colorectal cancer; MRM = multiple reaction monitoring; MS = mass spectrometry.
Figure 5Average Receiver Operating Characteristic (ROC) Curve From the 15-Transition and 13-Protein Classifier Model Applied to the Discovery Set Data in Cross-Validation Assessment. The Plot Represents the Average of the 10 ROC Curves Obtained by Combining Model Predictions for All Test Set Samples Across the 10 Folds of Each Inner Replicate of the Cross-Validation Procedure. The Mean Area Under the Curve of These 10 ROCs Was 0.82
Figure 6Validation Set Receiver Operating Characteristic (ROC) Curve for the Locked Discovery Set Model Applied to the Validation Set (Black Line). The Associated Area Under the Curve Was 0.91. The ROC Curves From the Individual Transition Components of the Classifier Model Are Shown in Light Gray for Comparative Assessment Against the Combined Marker Panel Performance
Results for Tests for Significance of Incorrectly Called Samples as Function of Individual Cohort as Assessed by Panel Component Permutation Analysis
| Protein | Samples With Incorrect Results | Samples With Correct Result | |||||
|---|---|---|---|---|---|---|---|
|
| |||||||
| Asterand | CapitalBio | ProteoGenex | Asterand | CapitalBio | ProteoGenex | ||
| A1AG1 | 4 | 9 | 6 | 44 | 31 | 42 | 1.00 |
| A1AT | 4 | 11 | 7 | 44 | 29 | 41 | 0.88 |
| AMY2B | 4 | 10 | 9 | 44 | 30 | 39 | 1.00 |
| CLUS | 4 | 12 | 8 | 44 | 28 | 40 | 0.48 |
| CO9 | 10 | 10 | 10 | 38 | 30 | 38 | 1.00 |
| ECH1 | 4 | 13 | 6 | 44 | 27 | 42 | 0.14 |
| FRIL | 4 | 11 | 7 | 44 | 29 | 41 | 0.88 |
| GELS | 7 | 9 | 10 | 41 | 31 | 38 | 1.00 |
| OSTP | 4 | 12 | 6 | 44 | 28 | 42 | 0.29 |
| SBP1 | 5 | 11 | 7 | 43 | 29 | 41 | 1.00 |
| SEPR | 2 | 10 | 8 | 46 | 30 | 40 | 0.20 |
| SPON2 | 3 | 13 | 7 | 45 | 27 | 41 | 0.06 |
| TIMP1 | 5 | 11 | 8 | 43 | 29 | 40 | 1.00 |