| Literature DB >> 28155636 |
Abstract
BACKGROUND: De novo peptide sequencing via tandem mass spectrometry (MS/MS) has been developed rapidly in recent years. With the use of spectra pairs from the same peptide under different fragmentation modes, performance of de novo sequencing is greatly improved. Currently, with large amount of spectra sequenced everyday, spectra libraries containing tens of thousands of annotated experimental MS/MS spectra become available. These libraries provide information of the spectra properties, thus have the potential to be used with de novo sequencing to improve its performance.Entities:
Keywords: De novo peptide sequencing; Electron transfer dissociation; Higher-energy collisional dissociation; Spectra library
Mesh:
Substances:
Year: 2016 PMID: 28155636 PMCID: PMC5259825 DOI: 10.1186/s12859-016-1330-0
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Relationships for selection and score calculation
| Relationship | Ions selected | Score calculation on |
|---|---|---|
| spectrum | ||
| ∣( |
|
|
| and ∣( | (middle ion) | |
| ∣ |
|
|
| ∣ |
|
Scores calculated using spectra library SL
| Feature | Score calculation |
|---|---|
| Amino acid difference |
|
| Ion complementarity |
|
Number of spectra and charges in each dataset used in the experiments
| Dataset | Number of | Charge of | Number of |
|---|---|---|---|
| total spectra | spectra | selected pair | |
| SCX _ | 1952 | +2 to +6 | 161 |
| SCX _ | 612 | ||
| SCX _ | 2557 | +2 to +5 | 249 |
| SCX _ | 1298 |
Parameters used in the experiments
| Parameter | Role in the method | Value |
|---|---|---|
| Threshold | Peak selection in spectra merging | 0.01Da |
| Number of tags | De novo sequencing | 10 per experimental spectrum |
| Stop threshold | Path extending in sequencing | 10 in each partial segment |
| Output number | Candidate output | 3 per spectra pair |
Significant score calculated using spectra libraries
| Score | Calculation on NIST-HCD | Calculation on SynthETD |
|---|---|---|
|
| 0.79 | 0.42 |
|
| 0.92 | |
|
| 0.61 | 0.66 |
Full length peptide sequencing accuracy comparison on two HCD and ETD dataset pairs
| Dataset | Number of | Accuracy of | Accuracy of the |
|---|---|---|---|
| spectra | previous | proposed | |
| pairs | method | method | |
| SCX _ | 161 | 83.53% | 86.96% |
| and SCX _ | |||
| SCX _ | 249 | 94.78% | 95.16% |
| and SCX _ |
Accuracy comparison on SCX _HCD_decon and SCX _ETD_decon dataset pair with different output
| Method | Output first | Output first and second | Output top three |
|---|---|---|---|
| Previous | 65.22% | 72.05% | 83.85% |
| Proposed | 76.40% | 80.12% | 86.96% |
Accuracy comparison on SCX _HCD_no_decon and SCX _ETD_no_decon dataset pair with different output
| Method | Output first | Output first and second | Output top three |
|---|---|---|---|
| Previous | 80.43% | 84.04% | 94.78% |
| Proposed | 82.71% | 87.94% | 95.16% |
Computational time comparison on two HCD and ETD dataset pairs
| Dataset | Number | Time (sec.) per | Time (sec.) per |
|---|---|---|---|
| of spectra | pair using | pair using | |
| pairs | previous method | proposed method | |
| SCX _ | 161 | 3.97 | 2.16 |
| and SCX _ | |||
| SCX _ | 294 | 2.35 | 1.79 |
| and SCX _ |