Literature DB >> 35790756

Resolving missing protein problems using functional class scoring.

Bertrand Jern Han Wong1, Weijia Kong1,2,3, Wilson Wen Bin Goh4,5,6, Limsoon Wong2.   

Abstract

Despite technological advances in proteomics, incomplete coverage and inconsistency issues persist, resulting in "data holes". These data holes cause the missing protein problem (MPP), where relevant proteins are persistently unobserved, or sporadically observed across samples, hindering biomarker discovery and proper functional characterization. Network-based approaches can provide powerful solutions for resolving these issues. Functional Class Scoring (FCS) is one such method that uses protein complex information to recover missing proteins with weak support. However, FCS has not been evaluated on more recent proteomic technologies with higher coverage, and there is no clear way to evaluate its performance. To address these issues, we devised a more rigorous evaluation schema based on cross-verification between technical replicates and evaluated its performance on data acquired under recent Data-Independent Acquisition (DIA) technologies (viz. SWATH). Although cross-replicate examination reveals some inconsistencies amongst same-class samples, tissue-differentiating signal is nonetheless strongly conserved, confirming that FCS selects for biologically meaningful networks. We also report that predicted missing proteins are statistically significant based on FCS p values. Despite limited cross-replicate verification rates, the predicted missing proteins as a whole have higher peptide support than non-predicted proteins. FCS also predicts missing proteins that are often lost due to weak specific peptide support.
© 2022. The Author(s).

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35790756      PMCID: PMC9256666          DOI: 10.1038/s41598-022-15314-3

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.996


  28 in total

1.  Exploring gene expression data with class scores.

Authors:  Paul Pavlidis; Darrin P Lewis; William Stafford Noble
Journal:  Pac Symp Biocomput       Date:  2002

Review 2.  Why Batch Effects Matter in Omics Data, and How to Avoid Them.

Authors:  Wilson Wen Bin Goh; Wei Wang; Limsoon Wong
Journal:  Trends Biotechnol       Date:  2017-03-25       Impact factor: 19.536

3.  OpenSWATH enables automated, targeted analysis of data-independent acquisition MS data.

Authors:  Hannes L Röst; George Rosenberger; Pedro Navarro; Ludovic Gillet; Saša M Miladinović; Olga T Schubert; Witold Wolski; Ben C Collins; Johan Malmström; Lars Malmström; Ruedi Aebersold
Journal:  Nat Biotechnol       Date:  2014-03       Impact factor: 54.908

Review 4.  Design principles for clinical network-based proteomics.

Authors:  Wilson Wen Bin Goh; Limsoon Wong
Journal:  Drug Discov Today       Date:  2016-05-27       Impact factor: 7.851

5.  Evaluating feature-selection stability in next-generation proteomics.

Authors:  Wilson Wen Bin Goh; Limsoon Wong
Journal:  J Bioinform Comput Biol       Date:  2016-08-03       Impact factor: 1.122

Review 6.  Review, evaluation, and discussion of the challenges of missing value imputation for mass spectrometry-based label-free global proteomics.

Authors:  Bobbie-Jo M Webb-Robertson; Holli K Wiberg; Melissa M Matzke; Joseph N Brown; Jing Wang; Jason E McDermott; Richard D Smith; Karin D Rodland; Thomas O Metz; Joel G Pounds; Katrina M Waters
Journal:  J Proteome Res       Date:  2015-04-22       Impact factor: 4.466

7.  Kinetic analysis of npBAF to nBAF switching reveals exchange of SS18 with CREST and integration with neural developmental pathways.

Authors:  Brett T Staahl; Jiong Tang; Wei Wu; Alfred Sun; Aaron D Gitler; Andrew S Yoo; Gerald R Crabtree
Journal:  J Neurosci       Date:  2013-06-19       Impact factor: 6.167

8.  The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression.

Authors:  Thomas Derrien; Rory Johnson; Giovanni Bussotti; Andrea Tanzer; Sarah Djebali; Hagen Tilgner; Gregory Guernec; David Martin; Angelika Merkel; David G Knowles; Julien Lagarde; Lavanya Veeravalli; Xiaoan Ruan; Yijun Ruan; Timo Lassmann; Piero Carninci; James B Brown; Leonard Lipovich; Jose M Gonzalez; Mark Thomas; Carrie A Davis; Ramin Shiekhattar; Thomas R Gingeras; Tim J Hubbard; Cedric Notredame; Jennifer Harrow; Roderic Guigó
Journal:  Genome Res       Date:  2012-09       Impact factor: 9.043

9.  Finding consistent disease subnetworks across microarray datasets.

Authors:  Donny Soh; Difeng Dong; Yike Guo; Limsoon Wong
Journal:  BMC Bioinformatics       Date:  2011-11-30       Impact factor: 3.169

10.  Benchmarking human protein complexes to investigate drug-related systems and evaluate predicted protein complexes.

Authors:  Min Wu; Qi Yu; Xiaoli Li; Jie Zheng; Jing-Fei Huang; Chee-Keong Kwoh
Journal:  PLoS One       Date:  2013-02-06       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.