| Literature DB >> 23368815 |
Abstract
BACKGROUND: Disulfide bonds constitute one of the most important cross-linkages in proteins and significantly influence protein structure and function. At the state-of-the-art, various methodological frameworks have been proposed for identification of disulfide bonds. These include among others, mass spectrometry-based methods, sequence-based predictive approaches, as well as techniques like crystallography and NMR. Each of these frameworks has its advantages and disadvantages in terms of pre-requisites for applicability, throughput, and accuracy. Furthermore, the results from different methods may concur or conflict in parts.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23368815 PMCID: PMC3549834 DOI: 10.1186/1471-2105-14-S2-S20
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Results of S-S bond determination on a set of Glycosyltransferases using MS-based and sequence-based methods.
| Swiss-Prot ID | Known S-S bonds | Methods that detected the bond | Methodology |
|---|---|---|---|
| MassMatrix | MS-based | ||
| MassMatrix | MS-based | ||
| MassMatrix, DisLocate, DiANNA 1.1 | MS-based, Sequence-based | ||
| - | - | ||
| MassMatrix, PreCys | MS-based, Sequence-based (cysteine separation profile) | ||
| MassMatrix, PreCys | As above | ||
| MassMatrix, PreCys | As above | ||
| MassMatrix | MS-based | ||
| MassMatrix | MS-based | ||
| - | - | ||
| - | - | ||
| - | - | ||
| - | - | ||
| DISULFIND, DiANNA 1.1 | Sequence-based | ||
| MassMatrix, DiANNA 1.1, DISULFIND | MS-based, Sequence-based | ||
| - | - | ||
| - | - |
The molecules and their Swiss-Prot ID are: ST8Sia IV [Swiss-Prot:Q92187], Beta-lactoglobulin [Swiss-Prot:P02754], FucT VII [Swiss-Prot:Q11130], C2GnT-I [Swiss-Prot:Q09324], Lysozyme [Swiss-Prot:P00698], FT III [Swiss-Prot:P21217], β1-4GalT [Swiss-Prot:P08037], Aldolase [Swiss-Prot:P00883], and Aspa [Swiss-Prot:Q9R1T5]. The methods investigated include MassMatrix [21], DisLocate [24], DISULFIND [14], PreCys [4], and DiANNA 1.1 [17].
Figure 1Tandem mass spectra for protein C2GnT-I. (Left) Confirmatory spectrum for disulfide bond C151-C199 for protein C2GnT-I (chymotryptic digested). (Right) Confirmatory spectrum for disulfide bond C372-C381 determined for protein C2GnT-I (tryptic digested).
Baseline: the S-S bonds determined by MS2DB+, SVM, and CSP.
| Proteins | Bonds determined by different methods | |||
|---|---|---|---|---|
| Known Linkages | MS2DB+ | SVM | CSP | |
| ST8Sia IV | 142-292 | 142-292 | 142-292 | |
| 156-356 | 156-356 | 156-356 | ||
| Beta-LG | 82-176 | - | - | |
| - | 122-135 | 122-135 | ||
| - | ||||
| FucT VII | 68-76 | 68-76 | 68-76 | |
| 211-214 | 211-214 | 211-214 | ||
| 318-321 | 318-321 | 318-321 | ||
| Β1,4-GalT | 134-176 | - | 134-176 | |
| 247-266 | - | 247-266 | ||
| - | - | |||
| C2GnT-I | 59-413 | - | - | |
| 372-381 | 372-381 | - | ||
| - | 100-172 | - | ||
| 151-199 | - | - | ||
| - | - | |||
| - | - | |||
| - | - | |||
| - | - | |||
| Lysozyme | 24-145 | 24-145 | - | |
| 48-133 | - | - | ||
| - | - | |||
| - | - | |||
| FT III | 81-338 | - | - | |
| - | - | - | ||
| - | ||||
| - | ||||
False positives are indicated in italics.
The overall performance of MS2DB+, SVM, and CSP.
| Methods | c | |||
|---|---|---|---|---|
| 0.821 | 1.000 | 0.982 | 0.891 | |
| 0.571 | 0.977 | 0.964 | 0.581 | |
| 0.500 | 0.969 | 0.959 | 0.459 |
Illustration of the key steps in the proposed method
| Correct S-S topology: C134-176, C247-C266 | ||
|---|---|---|
| Initial Connectivity {pairs(bond, belief)} | ||
| MS/MS | SVM | CSP |
| Primitive hypothesis | ||
| Power Set | ||
| Power set scoring (per method) | ||
| MS/MS | SVM | CSP |
| Global S-S connectivity (per combination rule) | ||
| Dempster rule | Campos rule | Shafer rule |
| *Yager rule: no S-S bonds were found (belief assignments were lower than 0.01) | ||
Known and determined S-S bonds for all molecules by combining MS2DB+, SVM, and CSP using the Shafer rule.
| Protein | Known Bonds | Bonds Found | Belief |
|---|---|---|---|
| ST8Sia IV | 0.66, 0.68 | ||
| Beta-LG | 0.49, 0.36 | ||
| FucT VII | 0.43, 0.26, 0.54 | ||
| Β1,4-GalT | 0.35, 0.36 | ||
| C2GnT-I | 0.07, 0.08, 0.26,0.06 | ||
| Lysozyme | 0.31, 0.13, 0.10, 0.10 | ||
| FT III | 0.67 |
The true bonds are shown in bold.
Quantitative characterization of the combination of MS-based and sequence-based methods (MS2DB+, SVM, and CSP) using the Shafer rule.
| Protein | c | |||
|---|---|---|---|---|
| ST8Sia IV | 1.00 | 1.00 | 1.00 | 1.00 |
| Beta-LG | 1.00 | 1.00 | 1.00 | 1.00 |
| FucT VII | 1.00 | 1.00 | 1.00 | 1.00 |
| Β1,4-GalT | 1.00 | 1.00 | 1.00 | 1.00 |
| C2GnT-I | 1.00 | 1.00 | 1.00 | 1.00 |
| Lysozyme | 1.00 | 0.94 | 0.94 | 0.69 |
| FT III | 0.50 | 1.00 | 0.95 | 0.69 |
Numeric characterization of the four rules
| Methods | c | |||
|---|---|---|---|---|
| 0.821 | 1.000 | 0.982 | 0.891 | |
| 0.857 | 0.992 | 0.978 | 0.772 | |
| 0.714 | 0.992 | 0.965 | 0.589 | |
| 0.857 | 0.977 | 0.965 | 0.797 | |
| 0.929 | 0.992 | 0.985 | 0.911 |
Results from combining MassMatrix with the sequence-based methods.
| Swiss-Prot ID | Known Bonds | MassMatrix | Dempster, Campos, and Shafer rules | Yager rule |
|---|---|---|---|---|
| 142-292 | 142-292 | 142-292 | ||
| 156-356 | 156-356 | 156-356 | ||
| 82-176 | 82-176 | 82-176 | ||
| - | 122-135 | 122-135 | ||
| 68-76 | 68-76 | 68-76 | ||
| 211-214 | 211-214 | 211-214 | ||
| 318-321 | 318-321 | 318-321 | ||
| 134-176 | 134-176 | - | ||
| 247-266 | 247-266 | - | ||
| - | - | - | ||
| - | - | - | ||
| - | - | - | ||
| - | 372-381 | 372-381 | ||
| - | 24-145 | 24-145 | ||
| 48-133 | 48-133 | 48-133 |