Literature DB >> 17848398

Improved detection of overrepresentation of Gene-Ontology annotations with parent child analysis.

Steffen Grossmann1, Sebastian Bauer, Peter N Robinson, Martin Vingron.   

Abstract

MOTIVATION: High-throughput experiments such as microarray hybridizations often yield long lists of genes found to share a certain characteristic such as differential expression. Exploring Gene Ontology (GO) annotations for such lists of genes has become a widespread practice to get first insights into the potential biological meaning of the experiment. The standard statistical approach to measuring overrepresentation of GO terms cannot cope with the dependencies resulting from the structure of GO because they analyze each term in isolation. Especially the fact that annotations are inherited from more specific descendant terms can result in certain types of false-positive results with potentially misleading biological interpretation, a phenomenon which we term the inheritance problem.
RESULTS: We present here a novel approach to analysis of GO term overrepresentation that determines overrepresentation of terms in the context of annotations to the term's parents. This approach reduces the dependencies between the individual term's measurements, and thereby avoids producing false-positive results owing to the inheritance problem. ROC analysis using study sets with overrepresented GO terms showed a clear advantage for our approach over the standard algorithm with respect to the inheritance problem. Although there can be no gold standard for exploratory methods such as analysis of GO term overrepresentation, analysis of biological datasets suggests that our algorithm tends to identify the core GO terms that are most characteristic of the dataset being analyzed.

Entities:  

Mesh:

Year:  2007        PMID: 17848398     DOI: 10.1093/bioinformatics/btm440

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  162 in total

1.  Gene expression divergence recapitulates the developmental hourglass model.

Authors:  Alex T Kalinka; Karolina M Varga; Dave T Gerrard; Stephan Preibisch; David L Corcoran; Julia Jarrells; Uwe Ohler; Casey M Bergman; Pavel Tomancak
Journal:  Nature       Date:  2010-12-09       Impact factor: 49.962

2.  Genotype and phenotypes of an intestine-adapted Escherichia coli K-12 mutant selected by animal passage for superior colonization.

Authors:  Andrew J Fabich; Mary P Leatham; Joe E Grissom; Graham Wiley; Hongshing Lai; Fares Najar; Bruce A Roe; Paul S Cohen; Tyrrell Conway
Journal:  Infect Immun       Date:  2011-03-21       Impact factor: 3.441

3.  Optimization criteria and biological process enrichment in homologous multiprotein modules.

Authors:  Luqman Hodgkinson; Richard M Karp
Journal:  Proc Natl Acad Sci U S A       Date:  2013-06-11       Impact factor: 11.205

4.  Deeply conserved chordate noncoding sequences preserve genome synteny but do not drive gene duplicate retention.

Authors:  Andrew L Hufton; Susanne Mathia; Helene Braun; Udo Georgi; Hans Lehrach; Martin Vingron; Albert J Poustka; Georgia Panopoulou
Journal:  Genome Res       Date:  2009-08-24       Impact factor: 9.043

5.  GO-Module: functional synthesis and improved interpretation of Gene Ontology patterns.

Authors:  Xinan Yang; Jianrong Li; Younghee Lee; Yves A Lussier
Journal:  Bioinformatics       Date:  2011-03-17       Impact factor: 6.937

6.  GO-Bayes: Gene Ontology-based overrepresentation analysis using a Bayesian approach.

Authors:  Song Zhang; Jing Cao; Y Megan Kong; Richard H Scheuermann
Journal:  Bioinformatics       Date:  2010-02-21       Impact factor: 6.937

7.  A Bayesian extension of the hypergeometric test for functional enrichment analysis.

Authors:  Jing Cao; Song Zhang
Journal:  Biometrics       Date:  2013-12-09       Impact factor: 2.571

8.  Multiset Statistics for Gene Set Analysis.

Authors:  Michael A Newton; Zhishi Wang
Journal:  Annu Rev Stat Appl       Date:  2015-04       Impact factor: 5.810

9.  The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease.

Authors:  Peter N Robinson; Sebastian Köhler; Sebastian Bauer; Dominik Seelow; Denise Horn; Stefan Mundlos
Journal:  Am J Hum Genet       Date:  2008-10-23       Impact factor: 11.025

10.  GOGrapher: A Python library for GO graph representation and analysis.

Authors:  Brian Muller; Adam J Richards; Bo Jin; Xinghua Lu
Journal:  BMC Res Notes       Date:  2009-07-07
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.