Literature DB >> 15059832

Gene annotation from scientific literature using mappings between keyword systems.

Antonio J Pérez1, Carolina Perez-Iratxeta, Peer Bork, Guillermo Thode, Miguel A Andrade.   

Abstract

MOTIVATION: The description of genes in databases by keywords helps the non-specialist to quickly grasp the properties of a gene and increases the efficiency of computational tools that are applied to gene data (e.g. searching a gene database for sequences related to a particular biological process). However, the association of keywords to genes or protein sequences is a difficult process that ultimately implies examination of the literature related to a gene.
RESULTS: To support this task, we present a procedure to derive keywords from the set of scientific abstracts related to a gene. Our system is based on the automated extraction of mappings between related terms from different databases using a model of fuzzy associations that can be applied with all generality to any pair of linked databases. We tested the system by annotating genes of the SWISS-PROT database with keywords derived from the abstracts linked to their entries (stored in the MEDLINE database of scientific references). The performance of the annotation procedure was much better for SWISS-PROT keywords (recall of 47%, precision of 68%) than for Gene Ontology terms (recall of 8%, precision of 67%). AVAILABILITY: The algorithm can be publicly accessed and used for the annotation of sequences through a web server at http://www.bork.embl.de/kat

Mesh:

Substances:

Year:  2004        PMID: 15059832     DOI: 10.1093/bioinformatics/bth207

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  15 in total

1.  Genestrace: phenomic knowledge discovery via structured terminology.

Authors:  Michael N Cantor; Indra Neil Sarkar; Olivier Bodenreider; Yves A Lussier
Journal:  Pac Symp Biocomput       Date:  2005

2.  PhenoGO: assigning phenotypic context to gene ontology annotations with natural language processing.

Authors:  Yves Lussier; Tara Borlawsky; Daniel Rappaport; Yang Liu; Carol Friedman
Journal:  Pac Symp Biocomput       Date:  2006

3.  Mining experimental evidence of molecular function claims from the literature.

Authors:  Colleen E Crangle; J Michael Cherry; Eurie L Hong; Alex Zbyslaw
Journal:  Bioinformatics       Date:  2007-10-17       Impact factor: 6.937

4.  Information theory applied to the sparse gene ontology annotation network to predict novel gene function.

Authors:  Ying Tao; Lee Sam; Jianrong Li; Carol Friedman; Yves A Lussier
Journal:  Bioinformatics       Date:  2007-07-01       Impact factor: 6.937

5.  PESCADOR, a web-based tool to assist text-mining of biointeractions extracted from PubMed queries.

Authors:  Adriano Barbosa-Silva; Jean-Fred Fontaine; Elisa R Donnard; Fernanda Stussi; J Miguel Ortega; Miguel A Andrade-Navarro
Journal:  BMC Bioinformatics       Date:  2011-11-09       Impact factor: 3.307

6.  Literature mining for the discovery of hidden connections between drugs, genes and diseases.

Authors:  Raoul Frijters; Marianne van Vugt; Ruben Smeets; René van Schaik; Jacob de Vlieg; Wynand Alkema
Journal:  PLoS Comput Biol       Date:  2010-09-23       Impact factor: 4.475

7.  Combining evidence, biomedical literature and statistical dependence: new insights for functional annotation of gene sets.

Authors:  Marc Aubry; Annabelle Monnier; Celine Chicault; Marie de Tayrac; Marie-Dominique Galibert; Anita Burgun; Jean Mosser
Journal:  BMC Bioinformatics       Date:  2006-05-04       Impact factor: 3.169

8.  GOAnnotator: linking protein GO annotations to evidence text.

Authors:  Francisco M Couto; Mário J Silva; Vivian Lee; Emily Dimmer; Evelyn Camon; Rolf Apweiler; Harald Kirsch; Dietrich Rebholz-Schuhmann
Journal:  J Biomed Discov Collab       Date:  2006-12-20

9.  Protein function prediction using text-based features extracted from the biomedical literature: the CAFA challenge.

Authors:  Andrew Wong; Hagit Shatkay
Journal:  BMC Bioinformatics       Date:  2013-02-28       Impact factor: 3.169

10.  Amplification of the Gene Ontology annotation of Affymetrix probe sets.

Authors:  Enrique M Muro; Carolina Perez-Iratxeta; Miguel A Andrade-Navarro
Journal:  BMC Bioinformatics       Date:  2006-03-20       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.