Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Improved detection of overrepresentation of Gene-Ontology annotations with parent child analysis.

Literature DB >> 17848398

Improved detection of overrepresentation of Gene-Ontology annotations with parent child analysis.

Steffen Grossmann¹, Sebastian Bauer, Peter N Robinson, Martin Vingron.

Abstract

MOTIVATION: High-throughput experiments such as microarray hybridizations often yield long lists of genes found to share a certain characteristic such as differential expression. Exploring Gene Ontology (GO) annotations for such lists of genes has become a widespread practice to get first insights into the potential biological meaning of the experiment. The standard statistical approach to measuring overrepresentation of GO terms cannot cope with the dependencies resulting from the structure of GO because they analyze each term in isolation. Especially the fact that annotations are inherited from more specific descendant terms can result in certain types of false-positive results with potentially misleading biological interpretation, a phenomenon which we term the inheritance problem.
RESULTS: We present here a novel approach to analysis of GO term overrepresentation that determines overrepresentation of terms in the context of annotations to the term's parents. This approach reduces the dependencies between the individual term's measurements, and thereby avoids producing false-positive results owing to the inheritance problem. ROC analysis using study sets with overrepresented GO terms showed a clear advantage for our approach over the standard algorithm with respect to the inheritance problem. Although there can be no gold standard for exploratory methods such as analysis of GO term overrepresentation, analysis of biological datasets suggests that our algorithm tends to identify the core GO terms that are most characteristic of the dataset being analyzed.

Entities: Species

Mesh：

Year: 2007 PMID： 17848398 DOI： 10.1093/bioinformatics/btm440

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

162 in total

1. Gene expression divergence recapitulates the developmental hourglass model.

Authors: Alex T Kalinka; Karolina M Varga; Dave T Gerrard; Stephan Preibisch; David L Corcoran; Julia Jarrells; Uwe Ohler; Casey M Bergman; Pavel Tomancak
Journal: Nature Date: 2010-12-09 Impact factor: 49.962

2. Genotype and phenotypes of an intestine-adapted Escherichia coli K-12 mutant selected by animal passage for superior colonization.

Authors: Andrew J Fabich; Mary P Leatham; Joe E Grissom; Graham Wiley; Hongshing Lai; Fares Najar; Bruce A Roe; Paul S Cohen; Tyrrell Conway
Journal: Infect Immun Date: 2011-03-21 Impact factor: 3.441

Improved detection of overrepresentation of Gene-Ontology annotations with parent child analysis.

1. Gene expression divergence recapitulates the developmental hourglass model.

2. Genotype and phenotypes of an intestine-adapted Escherichia coli K-12 mutant selected by animal passage for superior colonization.

3. Optimization criteria and biological process enrichment in homologous multiprotein modules.

4. Deeply conserved chordate noncoding sequences preserve genome synteny but do not drive gene duplicate retention.

5. GO-Module: functional synthesis and improved interpretation of Gene Ontology patterns.

6. GO-Bayes: Gene Ontology-based overrepresentation analysis using a Bayesian approach.

7. A Bayesian extension of the hypergeometric test for functional enrichment analysis.

8. Multiset Statistics for Gene Set Analysis.

9. The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease.

10. GOGrapher: A Python library for GO graph representation and analysis.