| Literature DB >> 21342541 |
William Murad1, Rahul Singh, Ten-Yang Yen.
Abstract
BACKGROUND: Determining the disulfide (S-S) bond pattern in a protein is often crucial for understanding its structure and function. In recent research, mass spectrometry (MS) based analysis has been applied to this problem following protein digestion under both partial reduction and non-reduction conditions. However, this paradigm still awaits solutions to certain algorithmic problems fundamental amongst which is the efficient matching of an exponentially growing set of putative S-S bonded structural alternatives to the large amounts of experimental spectrometric data. Current methods circumvent this challenge primarily through simplifications, such as by assuming only the occurrence of certain ion-types (b-ions and y-ions) that predominate in the more popular dissociation methods, such as collision-induced dissociation (CID). Unfortunately, this can adversely impact the quality of results.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21342541 PMCID: PMC3044266 DOI: 10.1186/1471-2105-12-S1-S12
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1MS-based approach diagrammatic representation. (A) Once a protein is digested, the theoretically possible disulfide bonded peptides are compared with experimentally obtained precursor ions. In order to confirm each correspondence, the possible disulfide bonded fragment ions are next compared with experimentally generated MS/MS spectra. (B) Most of the different fragment ions (and their nomenclature) that can be observed. Ions types not represented here include b and y ions which have either lost a water molecule (bo, yo) or have lost an ammonia molecule (b*, y*).
Figure 2Multiple-ion spectra analysis. This figure illustrates the presence of multiple ions types (in green) after CID. In the first spectrum, note the presence of b and y ions with high intensity in the fragmentation of the precursor ion with sequence: FFLQGIQLNTILPDAR, for the protein Lysozyme [Swiss-Prot: P11279]. In the second spectrum, a, b, b, and y ions (all with high intensity) can be observed after the fragmentation of a precursor ion existing in the protein Pratelet glycoprotein 4 [Swiss-Prot P16671].
Figure 3Two-stage matching spectra for protein ST8SiaIV. (A) In the first-stage (DMS vs. PMS), the theoretical disulfide-bonded structure is matched with the doubly charged precursor ion with highest intensity, whose m/z = 1082.9. (B) For this initial match, the disulfide-bonded peptide pair is fragmented and the fragments are matched with the MS/MS spectrum for the precursor ion (FMS vs. TMS), generating a list of validation matches.
Abbreviations and their definitions
| Abbreviation | Definition |
|---|---|
| Set of mass values corresponding to all possible disulfide-bonded peptide structures that can be obtained from a digested protein. | |
| Set of mass values of ions that undergo dissociation to produce product ions (set of precursor ions). | |
| Correspondence obtained when the difference between the detected mass of a targeted ion from the | |
| Initial Match threshold. Threshold used to define a mass window centered on a | |
| Set of trimmed mass values from the | |
| Peptide Mass: cysteine-containing peptide mass value. | |
| Temporary mass set containing possible disulfide bonded peptide structures. | |
| Set of mass values of every disulfide-bonded fragment structure that can be obtained from fragment ions, which can be of types | |
| Set of mass values of the product ions obtained after the MS/MS step (MS/MS spectra). | |
| Correspondence obtained when the difference between a precursor ion fragment mass from | |
| Validation Match threshold. Threshold used to define a mass window centered at a | |
| Set containing the mass values of fragment ions generated by the method GENFRAGS(.) in the APROX-FMS routine. |
Figure 4Pseudo code for APROX-DMS and TRIM routines
Running APROX-DMS on the ST8SiaIV bond
| Property | Value |
|---|---|
| {716, 728, 749, 863, 864, 891, 976, 1096, 1105, 1161, 1204, 1274, 1359, 1367, 1418, 1480, 1593, 1733, 1754, 1846, 1863, 1864, 1976, 2179, 2292, 2351, 2617, 2737, 2822} | |
| {2050.5} | |
| 0.02530 | |
| 1.0 | |
| {728, 749, 863, 891, 976, 1105, 1161, 1204, 1274, 1367, 1418, 1480, 1593, 1639, 1702, 1754, 1846, 1908, 1994, | |
| {716, 864, 1096, 1359, 1476, 1733, 1863, 1864, 1865, 1867, 1976, 2022} | |
| {2050.0} |
CCP: the mass values of all cysteine-containing peptides. PMS: a disulfide-bonded precursor ion mass. TrimSet: all the disulfide-bonded structures trimmed from the set of feasible combinations of cysteine-containing peptides. For this example, 37.5% of the structures were trimmed and the correct IM was found.
Figure 5Pseudo code for APROX-FMS routine
DMS and FMS mass space sizes comparison
| Protein | Disulfide Bond | Full Search (exponential) | Proposed Search (polynomial) | DMS decrease | FMS decrease | ||
|---|---|---|---|---|---|---|---|
| DMS size | FMS size | DMS size | FMS size | ||||
| Beta-LG | C82C176 | 2152 | 2169 | 1870 | 78 | 13.1% | 96.4% |
| ST8Sia IV | C142C292 | 1246 | 1792 | 1038 | 106 | 16.7% | 94.1% |
| C156C356 | 1246 | 2640 | 1038 | 255 | 16.7% | 90.3% | |
| FucT VII | C318C321 | 581 | 115 | 528 | 34 | 9.1% | 70.4% |
| C68C76 | 879 | 103 | 681 | 41 | 22.5% | 60.2% | |
| C211C214 | 879 | 1819 | 681 | 107 | 22.5% | 94.1% | |
| B1,4-GalT | C134C176 | 2149 | 1189 | 1127 | 77 | 47.6% | 93.5% |
| C247C266 | 2149 | 5480 | 1127 | 426 | 47.6% | 92.2% | |
| Average DMS and FMS decrease | 21.8% | 86.4% | |||||
Figure 6Comparison of the computational time (in seconds) for the exhaustive and partial generation of Table 3. On average there was a 49.5% decrease in time to compute the DMS and 88.7% decrease in time to compute the FMS. The computations were carried out on an Intel T2390 1.86 GHz single-core processor with 1GB RAM.
Figure 7Spectra samples from tryptic digested protein FucT VII. Spectra (m/z vs. normalized intensity) illustrating the confirmatory matches (whose intensity values were at least 10% of the maximum intensity) found for the disulfide bond between cysteines C318-C321 in protein FucT VII. The spectrum in the left shows the matches found when multiple ions were considered. The spectrum in the right shows the matches when only b/y-ions were considered.
Comparison with predictive methods
| Protein | Known Pattern | Proposed Algorithm | DiANNA 1.1 | DISULFIND | PreCys |
|---|---|---|---|---|---|
| ST8Sia IV | C142C292, C156C356, | C11C156, C142C292, C169C356 | None | C142C356, C156C292 | |
| Beta-LG | C82C176, C122C135 | C12C137, C82C176, C126C135 | None | None | |
| FucT VII | C68C76, C211C214, C318C321 | C68C321, C76C211, C214C318 | C76C318 | C68C76, C211C214, C318C321 | |
| B1,4-GalT | C134C176, C247C266 | C23C176, C30C144, C266C341 | None | C134C247, C176C266 | |
| C2GnT-I | C59C413, C100C172, C151C199, C372C381 | C13C172, C59C217, C151C234, C199C372, C381C413 | Not supported | C59C381, C100C372, C151C172, C199C413 | |
| Lysozyme | C24C145, C48C133 | C24C145, C48C133, C82C98, C94C112 | C24C145, C48C133, C82C98, C94C112 | C82C145 | |
| FT III | C81C338, C91C341 | C16C91, C81C143, C129C338 | None | C81C91 | |
| Aldolase | None | C73C339, C135C290, C115C240, C178C202 | None | None | |
| Aspa | None | C4C275, C60C217, C66C151, C123C145 | None | None |
Comparison with MassMatrix
| Protein | Known Pattern | Proposed Method | MassMatrix |
|---|---|---|---|
| ST8Sia IV | C142C292, C156C356 | C142C292 [ | |
| Beta-LG | C82C176, C122C135 | C82C176 [ | |
| FucT VII | C68C76, C211C214, C318C321 | C68C76 [ | |
| B1,4-GalT | C134C176, C247C266 | C134C176 [ | |
| C2GnT-I | C59C413, C100C172, C151C199, C372C381 | None | |
| Lysozyme | C24C145, C48C133 | C48C133 [ | |
| FT III | C81C338, C91C341 | None | |
| Aldolase | None | None | |
| Aspa | None | None |
The score (Vs) of each disulfide bond and the confidence scores (pp and pp2 values) are shown in brackets, respectively.
Sensitivity, specificity, accuracy and Mathew’s correlation coefficient results for all nine proteins analyzed
| Protein | ||||
|---|---|---|---|---|
| ST8Sia IV | 1.00 | 1.00 | 1.00 | 1.00 |
| Beta-LG | 0.50 | 1.00 | 0.95 | 0.69 |
| FucT VII | 1.00 | 1.00 | 1.00 | 1.00 |
| C2GnT-I | 0.75 | 1.00 | 0.98 | 0.86 |
| Lysozyme | 1.00 | 1.00 | 1.00 | 1.00 |
| B1,4-GalT | 1.00 | 1.00 | 1.00 | 1.00 |
| FT III | 0.50 | 1.00 | 0.94 | 0.69 |
| Aldolase | X | 1.00 | 1.00 | X |
| Aspa | X | 1.00 | 1.00 | X |