Literature DB >> 22268703

Accurate evaluation and analysis of functional genomics data and methods.

Casey S Greene1, Olga G Troyanskaya.   

Abstract

The development of technology capable of inexpensively performing large-scale measurements of biological systems has generated a wealth of data. Integrative analysis of these data holds the promise of uncovering gene function, regulation, and, in the longer run, understanding complex disease. However, their analysis has proved very challenging, as it is difficult to quickly and effectively assess the relevance and accuracy of these data for individual biological questions. Here, we identify biases that present challenges for the assessment of functional genomics data and methods. We then discuss evaluation methods that, taken together, begin to address these issues. We also argue that the funding of systematic data-driven experiments and of high-quality curation efforts will further improve evaluation metrics so that they more-accurately assess functional genomics data and methods. Such metrics will allow researchers in the field of functional genomics to continue to answer important biological questions in a data-driven manner.
© 2012 New York Academy of Sciences.

Entities:  

Mesh:

Year:  2012        PMID: 22268703      PMCID: PMC4753770          DOI: 10.1111/j.1749-6632.2011.06383.x

Source DB:  PubMed          Journal:  Ann N Y Acad Sci        ISSN: 0077-8923            Impact factor:   5.691


  28 in total

Review 1.  A probabilistic view of gene function.

Authors:  Andrew G Fraser; Edward M Marcotte
Journal:  Nat Genet       Date:  2004-06       Impact factor: 38.330

2.  Whole-genome annotation by using evidence integration in functional-linkage networks.

Authors:  Ulas Karaoz; T M Murali; Stan Letovsky; Yu Zheng; Chunming Ding; Charles R Cantor; Simon Kasif
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-23       Impact factor: 11.205

Review 3.  Data integration and genomic medicine.

Authors:  Brenton Louie; Peter Mork; Fernando Martin-Sanchez; Alon Halevy; Peter Tarczy-Hornoch
Journal:  J Biomed Inform       Date:  2006-03-09       Impact factor: 6.317

4.  The impact of incomplete knowledge on evaluation: an experimental benchmark for protein function prediction.

Authors:  Curtis Huttenhower; Matthew A Hibbs; Chad L Myers; Amy A Caudy; David C Hess; Olga G Troyanskaya
Journal:  Bioinformatics       Date:  2009-06-26       Impact factor: 6.937

5.  Bayesian Markov Random Field analysis for protein function prediction based on network data.

Authors:  Yiannis A I Kourmpetis; Aalt D J van Dijk; Marco C A M Bink; Roeland C H J van Ham; Cajo J F ter Braak
Journal:  PLoS One       Date:  2010-02-24       Impact factor: 3.240

6.  PILGRM: an interactive data-driven discovery platform for expert biologists.

Authors:  Casey S Greene; Olga G Troyanskaya
Journal:  Nucleic Acids Res       Date:  2011-06-07       Impact factor: 16.971

7.  Finding function: evaluation methods for functional genomic data.

Authors:  Chad L Myers; Daniel R Barrett; Matthew A Hibbs; Curtis Huttenhower; Olga G Troyanskaya
Journal:  BMC Genomics       Date:  2006-07-25       Impact factor: 3.969

8.  The Gene Ontology's Reference Genome Project: a unified framework for functional annotation across species.

Authors: 
Journal:  PLoS Comput Biol       Date:  2009-07-03       Impact factor: 4.475

9.  Computationally driven, quantitative experiments discover genes required for mitochondrial biogenesis.

Authors:  David C Hess; Chad L Myers; Curtis Huttenhower; Matthew A Hibbs; Alicia P Hayes; Jadine Paw; John J Clore; Rosa M Mendoza; Bryan San Luis; Corey Nislow; Guri Giaever; Michael Costanzo; Olga G Troyanskaya; Amy A Caudy
Journal:  PLoS Genet       Date:  2009-03-20       Impact factor: 5.917

10.  A critical assessment of Mus musculus gene function prediction using integrated genomic evidence.

Authors:  Lourdes Peña-Castillo; Murat Tasan; Chad L Myers; Hyunju Lee; Trupti Joshi; Chao Zhang; Yuanfang Guan; Michele Leone; Andrea Pagnani; Wan Kyu Kim; Chase Krumpelman; Weidong Tian; Guillaume Obozinski; Yanjun Qi; Sara Mostafavi; Guan Ning Lin; Gabriel F Berriz; Francis D Gibbons; Gert Lanckriet; Jian Qiu; Charles Grant; Zafer Barutcuoglu; David P Hill; David Warde-Farley; Chris Grouios; Debajyoti Ray; Judith A Blake; Minghua Deng; Michael I Jordan; William S Noble; Quaid Morris; Judith Klein-Seetharaman; Ziv Bar-Joseph; Ting Chen; Fengzhu Sun; Olga G Troyanskaya; Edward M Marcotte; Dong Xu; Timothy R Hughes; Frederick P Roth
Journal:  Genome Biol       Date:  2008-06-27       Impact factor: 13.583

View more
  12 in total

1.  Unsupervised Extraction of Stable Expression Signatures from Public Compendia with an Ensemble of Neural Networks.

Authors:  Jie Tan; Georgia Doing; Kimberley A Lewis; Courtney E Price; Kathleen M Chen; Kyle C Cady; Barret Perchuk; Michael T Laub; Deborah A Hogan; Casey S Greene
Journal:  Cell Syst       Date:  2017-07-12       Impact factor: 10.304

2.  Implicating candidate genes at GWAS signals by leveraging topologically associating domains.

Authors:  Gregory P Way; Daniel W Youngstrom; Kurt D Hankenson; Casey S Greene; Struan Fa Grant
Journal:  Eur J Hum Genet       Date:  2017-08-09       Impact factor: 4.246

3.  Assessing identity, redundancy and confounds in Gene Ontology annotations over time.

Authors:  Jesse Gillis; Paul Pavlidis
Journal:  Bioinformatics       Date:  2013-01-06       Impact factor: 6.937

4.  Parametric Bayesian priors and better choice of negative examples improve protein function prediction.

Authors:  Noah Youngs; Duncan Penfold-Brown; Kevin Drew; Dennis Shasha; Richard Bonneau
Journal:  Bioinformatics       Date:  2013-03-19       Impact factor: 6.937

5.  Negative example selection for protein function prediction: the NoGO database.

Authors:  Noah Youngs; Duncan Penfold-Brown; Richard Bonneau; Dennis Shasha
Journal:  PLoS Comput Biol       Date:  2014-06-12       Impact factor: 4.475

6.  Organellar genome assembly methods and comparative analysis of horticultural plants.

Authors:  Xuelin Wang; Feng Cheng; Dekai Rohlsen; Changwei Bi; Chunyan Wang; Yiqing Xu; Suyun Wei; Qiaolin Ye; Tongming Yin; Ning Ye
Journal:  Hortic Res       Date:  2018-01-10       Impact factor: 6.793

7.  Chapter 2: Data-driven view of disease biology.

Authors:  Casey S Greene; Olga G Troyanskaya
Journal:  PLoS Comput Biol       Date:  2012-12-27       Impact factor: 4.475

8.  Predicting targeted drug combinations based on Pareto optimal patterns of coexpression network connectivity.

Authors:  Nadia M Penrod; Casey S Greene; Jason H Moore
Journal:  Genome Med       Date:  2014-04-30       Impact factor: 11.117

9.  Machine Learning Analysis Identifies Drosophila Grunge/Atrophin as an Important Learning and Memory Gene Required for Memory Retention and Social Learning.

Authors:  Balint Z Kacsoh; Casey S Greene; Giovanni Bosco
Journal:  G3 (Bethesda)       Date:  2017-11-06       Impact factor: 3.154

10.  CommWalker: correctly evaluating modules in molecular networks in light of annotation bias.

Authors:  M D Luecken; M J T Page; A J Crosby; S Mason; G Reinert; C M Deane
Journal:  Bioinformatics       Date:  2018-03-15       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.