| Literature DB >> 26439836 |
Kaijie Xiao1, Fan Yu1, Houqin Fang1, Bingbing Xue1, Yan Liu1, Zhixin Tian1.
Abstract
It has long been an analytical challenge to accurately and efficiently resolve extremely dense overlapping isotopic envelopes (OIEs) in protein tandem mass spectra to confidently identify proteins. Here, we report a computationally efficient method, called OIE_CARE, to resolve OIEs by calculating the relative deviation between the ideal and observed experimental abundance. In the OIE_CARE method, the ideal experimental abundance of a particular overlapping isotopic peak (OIP) is first calculated for all the OIEs sharing this OIP. The relative deviation (RD) of the overall observed experimental abundance of this OIP relative to the summed ideal value is then calculated. The final individual abundance of the OIP for each OIE is the individual ideal experimental abundance multiplied by 1 + RD. Initial studies were performed using higher-energy collisional dissociation tandem mass spectra on myoglobin (with direct infusion) and the intact E. coli proteome (with liquid chromatographic separation). Comprehensive data at the protein and proteome levels, high confidence and good reproducibility were achieved. The resolving method reported here can, in principle, be extended to resolve any envelope-type overlapping data for which the corresponding theoretical reference values are available.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26439836 PMCID: PMC4593959 DOI: 10.1038/srep14755
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Database Search Results of Three Technical Replicate HCD Spectra (S1, S2 and S3) of Myoglobin.
| Matching b/yions | Sequencecoverage (%) | Peptide bondcoverage (%) | Interpretedisotopic peaks (%) | Interpretedabundance (%) | |
|---|---|---|---|---|---|
| S1 | 99 | 100.0 | 47.4 | 95.6 | 99.2 |
| S2 | 88 | 100.0 | 40.8 | 95.6 | 99.2 |
| S3 | 88 | 100.0 | 39.5 | 95.6 | 99.2 |
| 92 ± 6 | 100.0 ± 0.0 | 42.5 ± 4.2 | 95.6 ± 0.1 | 99.2 ± 0.0 |
Figure 1Matching vs. non-matching b/y ions from the forward, random and reverse database search of the HCD spectra of myoglobin.
The error bars are the result of three technical replicates.
Database Search Results of Three Technical Replicate RPLC-MSMS Datasets (D1, D2 and D3) of the E. coli Intact Proteome with a Spectrum-Level FDR of 1%. PrSMs = Protein Spectrum Matches.
| PrSMs | Proteo-forms | Sequencecoverage(%) | Peptidebondcoverage (%) | Interpretedisotopic peaks(%) | Interpretedabundance(%) | |
|---|---|---|---|---|---|---|
| D1 | 4478 | 103 | 76.7 | 26.0 | 61.1 | 71.5 |
| D2 | 5445 | 107 | 73.4 | 26.5 | 62.9 | 73.4 |
| D3 | 5392 | 105 | 73.7 | 27.4 | 61.7 | 71.5 |
| 5105 ± 544 | 105 ± 2 | 74.6 ± 1.8 | 26.7 ± 0.7 | 61.9 ± 0.9 | 72.1 ± 1.1 |
Figure 2Graphical fragmentation map of the identified proteoform RL7_ECOLI (P0A7K2) with S1 acetylation and K81 methylation.
Resolving the OIPs among y10-1+, y20-2+ and y72-7+ in the HCD Spectrum of Myoglobin Using the OIE_CARE Method.*
| A | B | C | D | E | F | G | H | |
|---|---|---|---|---|---|---|---|---|
| 1 | ||||||||
| 2 | Exp. | Exp. abun. (observed) | Exp. abun. (ideal) | Exp. abun. (ideal) | Exp. abun. (ideal) | Deviation | ||
| 3 | 1142.617676 | 480992.312500 | 46615.469396 | 416521.895787 | 66542.651159 | −0.09 | ||
| 4 | ||||||||
| 5 | Theo. | Theo. rel. abun. | Exp. | Exp. abun. | Exp. rel. abun. | IPMD | IPAD | IPAD |
| 6 | 1141.607714 | 63.64 | 1141.613770 | 128926.921875 | 63.64 | 5 | 0 | |
| 7 | 1142.610419 | 23.01 | 1142.617676 | 42330.617979 | 20.89 | 6 | −2 | 214 |
| 8 | ||||||||
| 9 | Theo. | Theo. rel. abun. | Exp. | Exp. abun. | Exp. rel. abun. | IPMD | IPAD | IPAD |
| 10 | 1142.611961 | 100.00 | 1142.617676 | 378235.583145 | 90.81 | 5 | −9 | 15 |
| 11 | 1143.113357 | 67.65 | 1143.120117 | 281777.062500 | 67.65 | 6 | 0 | |
| 12 | ||||||||
| 13 | Theo. | Theo. rel. abun. | Exp. | Exp. abun. | Exp. rel. abun. | IPMD | IPAD | IPAD |
| 14 | 1142.607764 | 55.06 | 1142.617676 | 60426.111376 | 50.00 | 9 | −5 | 343 |
| 15 | 1142.894250 | 100.00 | 1142.910522 | 120854.796875 | 100.00 | 14 | 0 | |
For OIP of m/z 1142.617676:
Step 1: The ideal exp. abundance for y10-1+ (E3), y20-2+ (F3) and y72-7+ (G3) is calculated using B7*D6/B6 and B10*D11/B11, and B14*D15/B15, respectively.
Step 2: The deviation (H3) between the observed and ideal values is calculated using (D3 − (E3 + F3 + G3))/(E3 + F3 + G3).
Step 3: The final individual exp. abun. for y10-1+ (D7), y20-2+ (D10) and y72-7+ (D14) is calculated using E3*(1 + H3), F3*(1 + H3), G3*(1 + H3), respectively.
*Only OIP of m/z 1142.617676 and the isotopic peaks used for normalization are listed here. The resolving of the other four OIPs of these three product ions and y144-14+ was performed using the same method. The full iEF information is provided in Table S6.
#The equivalent IPAD values without OIE_CARE. Exp. = experimental, theo. = theoretical, and abun. = abundance.
Figure 3The iEF maps of y72-7+ and orthogonal plots of experimental vs. theoretical relative abundance of all interpreted isotopic peaks (with IPAD ≥ 0) without (A,C) and with (B,D) OIE_CARE resolving of OIEs for one of the HCD spectra of myoglobin.
The bars and circles in (A,B) are the experimental and theoretical data, respectively. Rel. = relative, Exp. = experimental, and theo. = theoretical.
Figure 4The orthogonal plot of matching b and y ions with vs. without using the OIE_CARE method for the unique E. coli proteoforms.
The orthogonal dotted line is added as a visual guide.
Figure 5Interpreted isotopic peaks and abundance of the myoglobin HCD spectra when two incremental combinatorial ions series, b/y and b/y + b/y-NL (including a and a-NL), are searched separately.
The error bars arise from three technical replicates. IPs = Isotopic Peaks.
Figure 6Venn diagrams of the matching b/y ions of the three technical replicate HCD spectra (S1, S2, and S3) of myoglobin (A) and identified unique proteoforms of the three technical replicate RPLC-MSMS datasets (D1, D2, and D3) of the E. coli intact proteome (B).