| Literature DB >> 30537264 |
Mark V Ivanov1,2, Lev I Levitsky2, Julia A Bubis2, Mikhail V Gorshkov2.
Abstract
Shotgun proteomics workflows for database protein identification typically include a combination of search engines and postsearch validation software based mostly on machine learning algorithms. Here, a new postsearch validation tool called Scavager employing CatBoost, an open-source gradient boosting library, which shows improved efficiency compared with the other popular algorithms, such as Percolator, PeptideProphet, and Q-ranker, is presented. The comparison is done using multiple data sets and search engines, including MSGF+, MSFragger, X!Tandem, Comet, and recently introduced IdentiPy. Implemented in Python programming language, Scavager is open-source and freely available at https://bitbucket.org/markmipt/scavager.Entities:
Keywords: machine learning; postsearch validation; proteomics
Mesh:
Year: 2018 PMID: 30537264 DOI: 10.1002/pmic.201800280
Source DB: PubMed Journal: Proteomics ISSN: 1615-9853 Impact factor: 3.984