| Literature DB >> 30763335 |
Yan Chen1,2, Jonathan Vu1,2, Mitchell G Thompson1,2,3, William A Sharpless1,2, Leanne Jade G Chan1,2, Jennifer W Gin1,2, Jay D Keasling1,2,4,5,6, Paul D Adams1,4,7, Christopher J Petzold1,2.
Abstract
Recent improvements in the speed and sensitivity of liquid chromatography-mass spectrometry systems have driven significant progress toward system-wide characterization of the proteome of many species. These efforts create large proteomic datasets that provide insight into biological processes and identify diagnostic proteins whose abundance changes significantly under different experimental conditions. Yet, these system-wide experiments are typically the starting point for hypothesis-driven, follow-up experiments to elucidate the extent of the phenomenon or the utility of the diagnostic marker, wherein many samples must be analyzed. Transitioning from a few discovery experiments to quantitative analyses on hundreds of samples requires significant resources both to develop sensitive and specific methods as well as analyze them in a high-throughput manner. To aid these efforts, we developed a workflow using data acquired from discovery proteomic experiments, retention time prediction, and standard-flow chromatography to rapidly develop targeted proteomic assays. We demonstrated this workflow by developing MRM assays to quantify proteins of multiple metabolic pathways from multiple microbes under different experimental conditions. With this workflow, one can also target peptides in scheduled/dynamic acquisition methods from a shotgun proteomic dataset downloaded from online repositories, validate with appropriate control samples or standard peptides, and begin analyzing hundreds of samples in only a few minutes.Entities:
Mesh:
Year: 2019 PMID: 30763335 PMCID: PMC6375547 DOI: 10.1371/journal.pone.0211582
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Workflow to rapidly develop SRM targeted proteomic methods from shotgun proteomic data.
(a) Identification of proteins from either in-house shotgun DDA acquisition using an LC-QToF or data downloaded from public MS data repositories; (b) construction of a proteome spectral library from raw data containing retention times for a set of host-specific reference peptides and peptides of interest; (c) prediction of targeted peptide retention time based on the spectral library and measured retention times of reference peptides in a new chromatography gradient; (d) predicted RT of targeted peptide used without further methods optimization in a rapid SRM method.
Fig 2Comparison of the accuracy of peptide retention time prediction for different RT windows from proteomic libraries constructed from in-house and public MS data.
The percentage of peptides that were detected of a selection of five hundred (500) peptides from (a) an in-house E. coli library; (b) an in-house S. cerevisiae library; and (c) a S. cerevisiae library downloaded from the Chorus Project for chromatographic gradients of 120 minutes (Circle), 20 minutes (Triangle), and 2 minutes (Diamond).
Summary of discovery proteomic-based spectral libraries of microbes commonly used in biotechnology research and development.
| Organisms | Capabilities/Utility | Unique peptides | Total proteins | Source |
|---|---|---|---|---|
| Model organism; Wide range of engineering tools | 6994 | 1017 | This study | |
| Aromatic compound degradation; Redox enzymes; Stress tolerance | 1498 | 549 | This study | |
| Amino acid production; Consumes a broad range of carbon sources | 1123 | 358 | This study | |
| Plant mutagenesis | 1365 | 483 | This study | |
| Lipid production; Lignin monomer utilization | 1903 | 682 | This study | |
| Versatile metabolism; Aromatic compound degradation | 5281 | 1483 | [ | |
| Model organism; Robustness and tolerance towards harsh fermentation conditions | 32476 | 4184 | [ |
Fig 3Examples of rapid MRM target method development for quantifying pathway proteins in Peptides of pathway proteins selected from spectral libraries generated via in-house shotgun proteomics or online databases.
Fig 4(A) Central carbon pathways (glycolysis, lysine degradation, aromatic monomer degradation pathways, and tricarboxylic acid (TCA) cycle) in The error bar shows the standard deviation of measured peak area of three biological replicates. Statistical significance of p-coumarate and 5-aminovalerate against glucose were calculated by moderated t-test with the limma package in R, and resulting p-values were adjusted using the Benjamini-Hochberg (BH) method. *, **, and *** indicate adjusted P < 0.05, 0.01 and 0.001, respectively.