| Literature DB >> 30367573 |
Kaiyuan Zhu1, Xiaowen Liu2,3.
Abstract
BACKGROUND: Top-down homogeneous multiplexed tandem mass (HomMTM) spectra are generated from modified proteoforms of the same protein with different post-translational modification patterns. They are frequently observed in the analysis of ultramodified proteins, some proteoforms of which have similar molecular weights and cannot be well separated by liquid chromatography in mass spectrometry analysis.Entities:
Keywords: Graph algorithms; Mass spectrometry; Multiplexed mass spectra; Top-down
Mesh:
Substances:
Year: 2018 PMID: 30367573 PMCID: PMC6101081 DOI: 10.1186/s12859-018-2273-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Illustration of the conversion from the HomMTM spectral identification problem to the MSkSF problem. A deconvoluted HomMTM spectrum generated from two modified proteoforms of the protein GKGKLKAKE with one expected PTM: acetylation on K, is used as an example. a Each peak corresponds to a potential prefix residue mass of a proteoform of GKGKLKAKE satisfying that the prefix residue mass or its complementary suffix residue mass matches an experimental fragment mass. Potential masses for the prefix GKGKL matched to experimental masses are shown in the red dotted box. b A graph with 10 layers is constructed based on the masses in and the peaks in (a). Each vertex in layer i, 0≤i≤10, corresponds to a mass in and those with dotted circles are removed because they are not on any path from the source to the sink. The capacity of a vertex is the ratio (shown in percentage) between the intensity of the mass and the sum of the intensities of all masses corresponding to vertices with solid circles in the same layer. The solution to the MSkSF problem is the two blue paths with flows 70 and 30 (in percentage), which correspond to two proteoforms GK[Acetylation]GK[Acetylation]LKAKE with relative abundance 70% and GKGK[Acetylation]LK[Acetylation]AKE with relative abundance 30%
Five expected PTMs are allowed in the identification and quantification of histone H4 proteoforms
| PTM | Monoisotopic mass (Da) | Amino acids |
|---|---|---|
| Acetylation | 42.01056 | R, K |
| Methylation | 14.01565 | R, K |
| Dimethylation | 28.03130 | R, K |
| Trimethylation | 42.04695 | R |
| Phosphorylation | 79.96633 | S, T, Y |
Fig. 2Comparison of the numbers of matched fragment ions. The numbers of matched fragment ions are compared for the 184 spectra identified by both the proposed method and MS-Align-E. For each spectrum, the difference between the number of fragment ions matched to the proteoform pair reported by the proposed method and that matched to the single proteoform reported by MS-Align-E is computed
Fig. 3The sizes of graphs used in HomMTM spectral interpretation. The numbers of vertices and edges in the graph generated from the histone H4 protein and five PTMs (acetylation, methylation, dimethylation, trimethylation, phosphorylation) increase significantly when the bound for the sum of mass shifts introduced by PTMs increases from 50 Da to 600 Da