Soumya Raychaudhuri1, Russ B Altman. 1. Department of Genetics, Stanford Medical Informatics, 251 Campus Drive, MSOB X-215, Stanford University, Stanford, CA 94305-5479, USA.
Abstract
MOTIVATION: Many experimental and algorithmic approaches in biology generate groups of genes that need to be examined for related functional properties. For example, gene expression profiles are frequently organized into clusters of genes that may share functional properties. We evaluate a method, neighbor divergence per gene (NDPG), that uses scientific literature to assess whether a group of genes are functionally related. The method requires only a corpus of documents and an index connecting the documents to genes. RESULTS: We evaluate NDPG on 2796 functional groups generated by the Gene Ontology consortium in four organisms: mouse, fly, worm and yeast. NDPG finds functional coherence in 96, 92, 82 and 45% of the groups (at 99.9% specificity) in yeast, mouse, fly and worm respectively.
MOTIVATION: Many experimental and algorithmic approaches in biology generate groups of genes that need to be examined for related functional properties. For example, gene expression profiles are frequently organized into clusters of genes that may share functional properties. We evaluate a method, neighbor divergence per gene (NDPG), that uses scientific literature to assess whether a group of genes are functionally related. The method requires only a corpus of documents and an index connecting the documents to genes. RESULTS: We evaluate NDPG on 2796 functional groups generated by the Gene Ontology consortium in four organisms: mouse, fly, worm and yeast. NDPG finds functional coherence in 96, 92, 82 and 45% of the groups (at 99.9% specificity) in yeast, mouse, fly and worm respectively.
Authors: Selina S Dwight; Midori A Harris; Kara Dolinski; Catherine A Ball; Gail Binkley; Karen R Christie; Dianna G Fisk; Laurie Issel-Tarver; Mark Schroeder; Gavin Sherlock; Anand Sethuraman; Shuai Weng; David Botstein; J Michael Cherry Journal: Nucleic Acids Res Date: 2002-01-01 Impact factor: 16.971
Authors: W M Gelbart; M Crosby; B Matthews; W P Rindone; J Chillemi; S Russo Twombly; D Emmert; M Ashburner; R A Drysdale; E Whitfield; G H Millburn; A de Grey; T Kaufman; K Matthews; D Gilbert; V Strelets; C Tolstoshev Journal: Nucleic Acids Res Date: 1997-01-01 Impact factor: 16.971
Authors: S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman Journal: Nucleic Acids Res Date: 1997-09-01 Impact factor: 16.971
Authors: J M Cherry; C Adler; C Ball; S A Chervitz; S S Dwight; E T Hester; Y Jia; G Juvik; T Roe; M Schroeder; S Weng; D Botstein Journal: Nucleic Acids Res Date: 1998-01-01 Impact factor: 16.971
Authors: Adam J Richards; Brian Muller; Matthew Shotwell; L Ashley Cowart; Bäerbel Rohrer; Xinghua Lu Journal: Bioinformatics Date: 2010-06-15 Impact factor: 6.937
Authors: Lijing Xu; Nicholas Furlotte; Yunyue Lin; Kevin Heinrich; Michael W Berry; Ebenezer O George; Ramin Homayouni Journal: PLoS One Date: 2011-04-14 Impact factor: 3.240