Literature DB >> 25359890

Measuring the wisdom of the crowds in network-based gene function inference.

W Verleyen1, S Ballouz1, J Gillis1.   

Abstract

MOTIVATION: Network-based gene function inference methods have proliferated in recent years, but measurable progress remains elusive. We wished to better explore performance trends by controlling data and algorithm implementation, with a particular focus on the performance of aggregate predictions.
RESULTS: Hypothesizing that popular methods would perform well without hand-tuning, we used well-characterized algorithms to produce verifiably 'untweaked' results. We find that most state-of-the-art machine learning methods obtain 'gold standard' performance as measured in critical assessments in defined tasks. Across a broad range of tests, we see close alignment in algorithm performances after controlling for the underlying data being used. We find that algorithm aggregation provides only modest benefits, with a 17% increase in area under the ROC (AUROC) above the mean AUROC. In contrast, data aggregation gains are enormous with an 88% improvement in mean AUROC. Altogether, we find substantial evidence to support the view that additional algorithm development has little to offer for gene function prediction.
AVAILABILITY AND IMPLEMENTATION: The supplementary information contains a description of the algorithms, the network data parsed from different biological data resources and a guide to the source code (available at: http://gillislab.cshl.edu/supplements/).
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25359890     DOI: 10.1093/bioinformatics/btu715

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  Exploiting single-cell expression to characterize co-expression replicability.

Authors:  Megan Crow; Anirban Paul; Sara Ballouz; Z Josh Huang; Jesse Gillis
Journal:  Genome Biol       Date:  2016-05-06       Impact factor: 13.583

2.  Enhancing gene regulatory network inference through data integration with markov random fields.

Authors:  Michael Banf; Seung Y Rhee
Journal:  Sci Rep       Date:  2017-02-01       Impact factor: 4.379

3.  Optimal Function Prediction of Key Aberrant Genes in Early-onset Preeclampsia Using a Modified Network-based Guilt by Association Method.

Authors:  Jing Wang; Yanping Bi; Junxia Li; Yanfang Tian; Xue Yang; Zhongfang Sun
Journal:  Iran J Public Health       Date:  2018-11       Impact factor: 1.429

4.  Evaluation of critical data processing steps for reliable prediction of gene co-expression from large collections of RNA-seq data.

Authors:  Alexis Vandenbon
Journal:  PLoS One       Date:  2022-01-28       Impact factor: 3.240

5.  DTW-MIC Coexpression Networks from Time-Course Data.

Authors:  Samantha Riccadonna; Giuseppe Jurman; Roberto Visintainer; Michele Filosi; Cesare Furlanello
Journal:  PLoS One       Date:  2016-03-31       Impact factor: 3.240

6.  Ligand Similarity Complements Sequence, Physical Interaction, and Co-Expression for Gene Function Prediction.

Authors:  Matthew J O'Meara; Sara Ballouz; Brian K Shoichet; Jesse Gillis
Journal:  PLoS One       Date:  2016-07-28       Impact factor: 3.240

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.