| Literature DB >> 25455107 |
Hong Wang1, Yanling Yang, Yuxin Li, Bing Bai, Xusheng Wang, Haiyan Tan, Tao Liu, Thomas G Beach, Junmin Peng, Zhiping Wu.
Abstract
The development of high-resolution liquid chromatography (LC) is essential for improving the sensitivity and throughput of mass spectrometry (MS)-based proteomics. Here we present systematic optimization of a long gradient LC-MS/MS platform to enhance protein identification from a complex mixture. The platform employed an in-house fabricated, reverse-phase long column (100 μm × 150 cm, 5 μm C18 beads) coupled to Q Exactive MS. The column was capable of achieving a peak capacity of ∼700 in a 720 min gradient of 10-45% acetonitrile. The optimal loading level was ∼6 μg of peptides, although the column allowed loading as many as 20 μg. Gas-phase fractionation of peptide ions further increased the number of peptide identification by ∼10%. Moreover, the combination of basic pH LC prefractionation with the long gradient LC-MS/MS platform enabled the identification of 96,127 peptides and 10,544 proteins at 1% protein false discovery rate in a post-mortem brain sample of Alzheimer's disease. Because deep RNA sequencing of the same specimen suggested that ∼16,000 genes were expressed, the current analysis covered more than 60% of the expressed proteome. Further improvement strategies of the LC/LC-MS/MS platform were also discussed.Entities:
Keywords: AD proteome; long LC column; mass spectrometry
Mesh:
Substances:
Year: 2014 PMID: 25455107 PMCID: PMC4324436 DOI: 10.1021/pr500882h
Source DB: PubMed Journal: J Proteome Res ISSN: 1535-3893 Impact factor: 4.466
Figure 1Evaluation of the reproducibility of long LC column coupled to Q Exactive MS. (A) Illustration of the setup of long LC column (100 μm × 150 cm, 5 μm C18 particles) coupled to Q Exactive MS. (B) Base peak chromatographs of three technically repeated runs. About 1 μg of rat brain tryptic peptide mixture was loaded on the column and then eluted in a 10–45% acetonitrile gradient over 4 h. (C) Comparison of accepted peptide spectrum matches (PSMs), peptide, and protein identifications.
Figure 2Optimization of the loading amount of rat brain peptides for LC–MS/MS identification. Various amounts of rat brain peptides were loaded on the long column and analyzed by a 4 h gradient. (A) Number of detected peptides with different loading levels. (B) Protein identification with different loading levels. (C) Effect of different peptide loading amount on the global distribution of peak width for major peptide ions. (D) The effect of loading amounts on the peak width of protein TBB3 peptide NSSYFVEWIPNNVK.
Figure 3Optimization of the LC gradient buffer for peptide elution. ∼2 μg of peptides was loaded on the long column and eluted in a 10–45% gradient of acetonitrile over 4 h. The LC elution profile was represented by total ion current (solid black line) along with the gradient (dotted black line). The number of identified peptides every 2 min was plotted (solid red line). About 157 ± 42 peptides were identified in every two min.
Figure 4Optimization of the LC gradient time for peptide elution. (A) Peak capacities plotted against gradient time. Peak capacities were calculated by dividing the average peak width of major peptide ions in a LC run over entire gradient time. (B) Correlation between the number of identified peptides/proteins and gradient time. (C) Number of detected peptides was in a linear relationship with the peak capacity.
Figure 5Deep proteomics analysis of AD brain tissue. (A) Flowchart of the procedure. (B) Chromatograph of basic pH RPLC prefractionation of peptides (upper panel) monitored at 214 nm and an example base peak chromatograph of acidic pH long gradient RPLC–MS/MS (lower panel).(C) Basic pH RPLC fractionation yielded even partitioning of peptides, which led to similar number of identified proteins in concaternated, pooled fractions. (D) Majority of the peptides was solely identified in one fraction.
Figure 6Comparison of deep proteomics and RNA-seq data from the same AD brain tissue. (A) Histogram of FPKM distribution of RNA-seq and proteomics data. The open bar represents the distribution of protein coding gene numbers detected by RNaseq, and the gray bar indicates the distribution of protein coding gene numbers validated by MS with different FPKM values. (B) Scatter plot of spectra counts per thousand amino acid of proteomic data versus FPKM of RNA-seq data.