Literature DB >> 28452485

Systematic Comparison of False-Discovery-Rate-Controlling Strategies for Proteogenomic Search Using Spike-in Experiments.

Honglan Li1, Jonghun Park2, Hyunwoo Kim3, Kyu-Baek Hwang1, Eunok Paek2.   

Abstract

Proteogenomic searches are useful for novel peptide identification from tandem mass spectra. Usually, separate and multistage approaches are adopted to accurately control the false discovery rate (FDR) for proteogenomic search. Their performance on novel peptide identification has not been thoroughly evaluated, however, mainly due to the difficulty in confirming the existence of identified novel peptides. We simulated a proteogenomic search using a controlled, spike-in proteomic data set. After confirming that the results of the simulated proteogenomic search were similar to those of a real proteogenomic search using a human cell line data set, we evaluated the performance of six FDR control methods-global, separate, and multistage FDR estimation, respectively, coupled to a target-decoy search and a mixture model-based method-on novel peptide identification. The multistage approach showed the highest accuracy for FDR estimation. However, global and separate FDR estimation with the mixture model-based method showed higher sensitivities than others at the same true FDR. Furthermore, the mixture model-based method performed equally well when applied without or with a reduced set of decoy sequences. Considering different prior probabilities for novel and known protein identification, we recommend using mixture model-based methods with separate FDR estimation for sensitive and reliable identification of novel peptides from proteogenomic searches.

Entities:  

Keywords:  false discovery rate control; novel peptide identification; proteogenomic search; simulation; spike-in data

Mesh:

Substances:

Year:  2017        PMID: 28452485     DOI: 10.1021/acs.jproteome.7b00033

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  6 in total

1.  Peptide identifications and false discovery rates using different mass spectrometry platforms.

Authors:  Krishna D B Anapindi; Elena V Romanova; Bruce R Southey; Jonathan V Sweedler
Journal:  Talanta       Date:  2018-01-31       Impact factor: 6.057

2.  Proteomics in non-human primates: utilizing RNA-Seq data to improve protein identification by mass spectrometry in vervet monkeys.

Authors:  J Michael Proffitt; Jeremy Glenn; Anthony J Cesnik; Avinash Jadhav; Michael R Shortreed; Lloyd M Smith; Kylie Kavanagh; Laura A Cox; Michael Olivier
Journal:  BMC Genomics       Date:  2017-11-13       Impact factor: 3.969

3.  Target-small decoy search strategy for false discovery rate estimation.

Authors:  Hyunwoo Kim; Sangjeong Lee; Heejin Park
Journal:  BMC Bioinformatics       Date:  2019-08-23       Impact factor: 3.169

4.  Transfer posterior error probability estimation for peptide identification.

Authors:  Xinpei Yi; Fuzhou Gong; Yan Fu
Journal:  BMC Bioinformatics       Date:  2020-05-04       Impact factor: 3.169

5.  Cancer neoantigen prioritization through sensitive and reliable proteogenomics analysis.

Authors:  Bo Wen; Kai Li; Yun Zhang; Bing Zhang
Journal:  Nat Commun       Date:  2020-04-09       Impact factor: 14.919

6.  Protein-gene Expression Nexus: Comprehensive characterization of human cancer cell lines with proteogenomic analysis.

Authors:  Daejin Hyung; Min-Jeong Baek; Jongkeun Lee; Juyeon Cho; Hyoun Sook Kim; Charny Park; Soo Young Cho
Journal:  Comput Struct Biotechnol J       Date:  2021-08-17       Impact factor: 7.271

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.