Literature DB >> 20702400

Identifying informative subsets of the Gene Ontology with information bottleneck methods.

Bo Jin1, Xinghua Lu.   

Abstract

MOTIVATION: The Gene Ontology (GO) is a controlled vocabulary designed to represent the biological concepts pertaining to gene products. This study investigates the methods for identifying informative subsets of GO terms in an automatic and objective fashion. This task in turn requires addressing the following issues: how to represent the semantic context of GO terms, what metrics are suitable for measuring the semantic differences between terms, how to identify an informative subset that retains as much as possible of the original semantic information of GO.
RESULTS: We represented the semantic context of a GO term using the word-usage-profile associated with the term, which enables one to measure the semantic differences between terms based on the differences in their semantic contexts. We further employed the information bottleneck methods to automatically identify subsets of GO terms that retain as much as possible of the semantic information in an annotation database. The automatically retrieved informative subsets align well with an expert-picked GO slim subset, cover important concepts and proteins, and enhance literature-based GO annotation. AVAILABILITY: http://carcweb.musc.edu/TextminingProjects/.

Mesh:

Substances:

Year:  2010        PMID: 20702400      PMCID: PMC2944202          DOI: 10.1093/bioinformatics/btq449

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  19 in total

1.  Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation.

Authors:  P W Lord; R D Stevens; A Brass; C A Goble
Journal:  Bioinformatics       Date:  2003-07-01       Impact factor: 6.937

2.  The Gene Ontology Annotation (GOA) Database--an integrated resource of GO annotations to the UniProt Knowledgebase.

Authors:  Evelyn Camon; Daniel Barrell; Vivian Lee; Emily Dimmer; Rolf Apweiler
Journal:  In Silico Biol       Date:  2003-12-01

3.  A new method to measure the semantic similarity of GO terms.

Authors:  James Z Wang; Zhidian Du; Rapeeporn Payattakool; Philip S Yu; Chin-Fu Chen
Journal:  Bioinformatics       Date:  2007-03-07       Impact factor: 6.937

4.  The Unified Medical Language System.

Authors:  D A Lindberg; B L Humphreys; A T McCray
Journal:  Methods Inf Med       Date:  1993-08       Impact factor: 2.176

5.  Assessing the functional coherence of gene sets with metrics based on the Gene Ontology graph.

Authors:  Adam J Richards; Brian Muller; Matthew Shotwell; L Ashley Cowart; Bäerbel Rohrer; Xinghua Lu
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

6.  Information theory applied to the sparse gene ontology annotation network to predict novel gene function.

Authors:  Ying Tao; Lee Sam; Jianrong Li; Carol Friedman; Yves A Lussier
Journal:  Bioinformatics       Date:  2007-07-01       Impact factor: 6.937

7.  The TREC 2004 genomics track categorization task: classifying full text biomedical documents.

Authors:  Aaron M Cohen; William R Hersh
Journal:  J Biomed Discov Collab       Date:  2006-03-14

8.  Multi-label literature classification based on the Gene Ontology graph.

Authors:  Bo Jin; Brian Muller; Chengxiang Zhai; Xinghua Lu
Journal:  BMC Bioinformatics       Date:  2008-12-08       Impact factor: 3.169

9.  A relation based measure of semantic similarity for Gene Ontology annotations.

Authors:  Brendan Sheehan; Aaron Quigley; Benoit Gaudin; Simon Dobson
Journal:  BMC Bioinformatics       Date:  2008-11-04       Impact factor: 3.169

Review 10.  Getting started in text mining.

Authors:  K Bretonnel Cohen; Lawrence Hunter
Journal:  PLoS Comput Biol       Date:  2008-01       Impact factor: 4.475

View more
  12 in total

1.  Using predictive specificity to determine when gene set analysis is biologically meaningful.

Authors:  Sara Ballouz; Paul Pavlidis; Jesse Gillis
Journal:  Nucleic Acids Res       Date:  2017-02-28       Impact factor: 16.971

2.  Signal-Oriented Pathway Analyses Reveal a Signaling Complex as a Synthetic Lethal Target for p53 Mutations.

Authors:  Songjian Lu; Chunhui Cai; Gonghong Yan; Zhuan Zhou; Yong Wan; Vicky Chen; Lujia Chen; Gregory F Cooper; Lina M Obeid; Yusuf A Hannun; Adrian V Lee; Xinghua Lu
Journal:  Cancer Res       Date:  2016-10-10       Impact factor: 12.701

3.  Integrating genome and functional genomics data to reveal perturbed signaling pathways in ovarian cancers.

Authors:  Songjian Lu; Xinghua Lu
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2012-03-19

4.  Quality of computationally inferred gene ontology annotations.

Authors:  Nives Skunca; Adrian Altenhoff; Christophe Dessimoz
Journal:  PLoS Comput Biol       Date:  2012-05-31       Impact factor: 4.475

5.  Identifying Driver Genomic Alterations in Cancers by Searching Minimum-Weight, Mutually Exclusive Sets.

Authors:  Songjian Lu; Kevin N Lu; Shi-Yuan Cheng; Bo Hu; Xiaojun Ma; Nicholas Nystrom; Xinghua Lu
Journal:  PLoS Comput Biol       Date:  2015-08-28       Impact factor: 4.475

6.  Optimal Threshold Determination for Interpreting Semantic Similarity and Particularity: Application to the Comparison of Gene Sets and Metabolic Pathways Using GO and ChEBI.

Authors:  Charles Bettembourg; Christian Diot; Olivier Dameron
Journal:  PLoS One       Date:  2015-07-31       Impact factor: 3.240

7.  Semantic particularity measure for functional characterization of gene sets using gene ontology.

Authors:  Charles Bettembourg; Christian Diot; Olivier Dameron
Journal:  PLoS One       Date:  2014-01-28       Impact factor: 3.240

8.  Conceptualization of molecular findings by mining gene annotations.

Authors:  Vicky Chen; Xinghua Lu
Journal:  BMC Proc       Date:  2013-12-20

9.  From data towards knowledge: revealing the architecture of signaling systems by unifying knowledge mining and data mining of systematic perturbation data.

Authors:  Songjian Lu; Bo Jin; L Ashley Cowart; Xinghua Lu
Journal:  PLoS One       Date:  2013-04-23       Impact factor: 3.240

10.  Measuring the evolution of ontology complexity: the gene ontology case study.

Authors:  Olivier Dameron; Charles Bettembourg; Nolwenn Le Meur
Journal:  PLoS One       Date:  2013-10-11       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.