| Literature DB >> 31443634 |
Hyunwoo Kim1, Sangjeong Lee2, Heejin Park3.
Abstract
BACKGROUND: One of the most important steps in peptide identification is to estimate the false discovery rate (FDR). The most commonly used method for estimating FDR is the target-decoy search strategy (TDS). While this method is simple and effective, it is time/space-inefficient because it searches a database that is twice as large as the original protein database. This inefficiency problem becomes more evident as protein databases get bigger and bigger. We propose a target-small decoy search strategy and present a rigorous verification that it reduces the database size and search time while retaining the accuracy of target-decoy search strategy (TDS).Entities:
Keywords: False discovery rate; Target-decoy search; Target-small decoy search
Mesh:
Substances:
Year: 2019 PMID: 31443634 PMCID: PMC6708216 DOI: 10.1186/s12859-019-3034-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Comparison of PSMs between TDS and our method using Comet. (a) 1/2 decoy database; (b) 1/4 decoy database; (c) 1/6 decoy database; (d) 1/8 decoy database. Using the UniProt database
Fig. 2Comparison of search times for various decoy database sizes when the search time of the original decoy database is 100%. It shows that is little change in search time when the decoy database is reduced to 1/4 or less. #TARGET: (#DECOY/N) indicates the size at which the decoy database is reduced compared to the target. (a), (b) Using the UniProt database. (c), (d) Using the SwissProt database
Fig. 3Comparison of ratios among FPRatio (blue bars), UPRatio (red bars) and DBRatio (yellow bars). (a) Using the UniProt database. (b) Using the SwissProt database. #DECOY/#TARGET indicates the ratio of decoy to target for each method
Fig. 4The percentages of target/decoy PSMs among the ranks 1–5 PSMs in the Comet results using shifted HEK293 data. Blue bars represent the percentage of target PSMs and red bars represent that of decoy PSMs. (a) Original decoy database; (b) 1/2 decoy database; (c) 1/4 decoy database; (d) 1/6 decoy database; (e) 1/8 decoy database. Using the UniProt database