Literature DB >> 25495798

Methodology for the inference of gene function from phenotype data.

Joao A Ascensao1,2, Mary E Dolan3, David P Hill4, Judith A Blake5.   

Abstract

BACKGROUND: Biomedical ontologies are increasingly instrumental in the advancement of biological research primarily through their use to efficiently consolidate large amounts of data into structured, accessible sets. However, ontology development and usage can be hampered by the segregation of knowledge by domain that occurs due to independent development and use of the ontologies. The ability to infer data associated with one ontology to data associated with another ontology would prove useful in expanding information content and scope. We here focus on relating two ontologies: the Gene Ontology (GO), which encodes canonical gene function, and the Mammalian Phenotype Ontology (MP), which describes non-canonical phenotypes, using statistical methods to suggest GO functional annotations from existing MP phenotype annotations. This work is in contrast to previous studies that have focused on inferring gene function from phenotype primarily through lexical or semantic similarity measures.
RESULTS: We have designed and tested a set of algorithms that represents a novel methodology to define rules for predicting gene function by examining the emergent structure and relationships between the gene functions and phenotypes rather than inspecting the terms semantically. The algorithms inspect relationships among multiple phenotype terms to deduce if there are cases where they all arise from a single gene function. We apply this methodology to data about genes in the laboratory mouse that are formally represented in the Mouse Genome Informatics (MGI) resource. From the data, 7444 rule instances were generated from five generalized rules, resulting in 4818 unique GO functional predictions for 1796 genes.
CONCLUSIONS: We show that our method is capable of inferring high-quality functional annotations from curated phenotype data. As well as creating inferred annotations, our method has the potential to allow for the elucidation of unforeseen, biologically significant associations between gene function and phenotypes that would be overlooked by a semantics-based approach. Future work will include the implementation of the described algorithms for a variety of other model organism databases, taking full advantage of the abundance of available high quality curated data.

Entities:  

Mesh:

Year:  2014        PMID: 25495798      PMCID: PMC4302099          DOI: 10.1186/s12859-014-0405-z

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  23 in total

1.  Additional gene ontology structure for improved biological reasoning.

Authors:  Simen Myhre; Henrik Tveit; Torulf Mollestad; Astrid Laegreid
Journal:  Bioinformatics       Date:  2006-06-20       Impact factor: 6.937

2.  Predicting phenotype from patterns of annotation.

Authors:  Oliver D King; Jeffrey C Lee; Aimée M Dudley; Daniel M Janse; George M Church; Frederick P Roth
Journal:  Bioinformatics       Date:  2003       Impact factor: 6.937

3.  Mapping Gene Ontology to proteins based on protein-protein interaction data.

Authors:  Minghua Deng; Zhidong Tu; Fengzhu Sun; Ting Chen
Journal:  Bioinformatics       Date:  2004-01-29       Impact factor: 6.937

4.  Information theory applied to the sparse gene ontology annotation network to predict novel gene function.

Authors:  Ying Tao; Lee Sam; Jianrong Li; Carol Friedman; Yves A Lussier
Journal:  Bioinformatics       Date:  2007-07-01       Impact factor: 6.937

5.  Distinct enhancers at the Pax3 locus can function redundantly to regulate neural tube and neural crest expressions.

Authors:  Karl R Degenhardt; Rita C Milewski; Arun Padmanabhan; Mayumi Miller; Manvendra K Singh; Deborah Lang; Kurt A Engleka; Meilin Wu; Jun Li; Diane Zhou; Nicole Antonucci; Li Li; Jonathan A Epstein
Journal:  Dev Biol       Date:  2010-01-04       Impact factor: 3.582

6.  Systematic analysis of experimental phenotype data reveals gene functions.

Authors:  Robert Hoehndorf; Nigel W Hardy; David Osumi-Sutherland; Susan Tweedie; Paul N Schofield; Georgios V Gkoutos
Journal:  PLoS One       Date:  2013-04-16       Impact factor: 3.240

7.  Gene ontology function prediction in mollicutes using protein-protein association networks.

Authors:  Antonio Gómez; Juan Cedano; Isaac Amela; Antoni Planas; Jaume Piñol; Enrique Querol
Journal:  BMC Syst Biol       Date:  2011-04-12

8.  Cytoscape 2.8: new features for data integration and network visualization.

Authors:  Michael E Smoot; Keiichiro Ono; Johannes Ruscheinski; Peng-Liang Wang; Trey Ideker
Journal:  Bioinformatics       Date:  2010-12-12       Impact factor: 6.937

9.  Gene Ontology annotations: what they mean and where they come from.

Authors:  David P Hill; Barry Smith; Monica S McAndrews-Hill; Judith A Blake
Journal:  BMC Bioinformatics       Date:  2008-04-29       Impact factor: 3.169

10.  The Mouse Genome Database: integration of and access to knowledge about the laboratory mouse.

Authors:  Judith A Blake; Carol J Bult; Janan T Eppig; James A Kadin; Joel E Richardson
Journal:  Nucleic Acids Res       Date:  2013-11-26       Impact factor: 16.971

View more
  1 in total

1.  Allele, phenotype and disease data at Mouse Genome Informatics: improving access and analysis.

Authors:  Susan M Bello; Cynthia L Smith; Janan T Eppig
Journal:  Mamm Genome       Date:  2015-07-11       Impact factor: 2.957

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.