Literature DB >> 17942445

Mining experimental evidence of molecular function claims from the literature.

Colleen E Crangle1, J Michael Cherry, Eurie L Hong, Alex Zbyslaw.   

Abstract

MOTIVATION: The rate at which gene-related findings appear in the scientific literature makes it difficult if not impossible for biomedical scientists to keep fully informed and up to date. The importance of these findings argues for the development of automated methods that can find, extract and summarize this information. This article reports on methods for determining the molecular function claims that are being made in a scientific article, specifically those that are backed by experimental evidence.
RESULTS: The most significant result is that for molecular function claims based on direct assays, our methods achieved recall of 70.7% and precision of 65.7%. Furthermore, our methods correctly identified in the text 44.6% of the specific molecular function claims backed up by direct assays, but with a precision of only 0.92%, a disappointing outcome that led to an examination of the different kinds of errors. These results were based on an analysis of 1823 articles from the literature of Saccharomyces cerevisiae (budding yeast). AVAILABILITY: The annotation files for S.cerevisiae are available from ftp://genome-ftp.stanford.edu/pub/yeast/data_download/literature_curation/gene_association.sgd.gz. The draft protocol vocabulary is available by request from the first author.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17942445      PMCID: PMC3041023          DOI: 10.1093/bioinformatics/btm495

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  25 in total

1.  The ENZYME database in 2000.

Authors:  A Bairoch
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Evaluation of text data mining for database curation: lessons learned from the KDD Challenge Cup.

Authors:  Alexander S Yeh; Lynette Hirschman; Alexander A Morgan
Journal:  Bioinformatics       Date:  2003       Impact factor: 6.937

3.  Gene annotation from scientific literature using mappings between keyword systems.

Authors:  Antonio J Pérez; Carolina Perez-Iratxeta; Peer Bork; Guillermo Thode; Miguel A Andrade
Journal:  Bioinformatics       Date:  2004-04-01       Impact factor: 6.937

Review 4.  Development of FuGO: an ontology for functional genomics investigations.

Authors:  Patricia L Whetzel; Ryan R Brinkman; Helen C Causton; Liju Fan; Dawn Field; Jennifer Fostel; Gilberto Fragoso; Tanya Gray; Mervi Heiskanen; Tina Hernandez-Boussard; Norman Morrison; Helen Parkinson; Philippe Rocca-Serra; Susanna-Assunta Sansone; Daniel Schober; Barry Smith; Robert Stevens; Christian J Stoeckert; Chris Taylor; Joe White; Andrew Wood
Journal:  OMICS       Date:  2006

5.  Identifying gene ontology concepts in natural-language text.

Authors:  C E Crangle; A Zbyslaw
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2004

6.  Glycogen synthase phosphatase interacts with heat shock factor to activate CUP1 gene transcription in Saccharomyces cerevisiae.

Authors:  J T Lin; J T Lis
Journal:  Mol Cell Biol       Date:  1999-05       Impact factor: 4.272

7.  RMI1/NCE4, a suppressor of genome instability, encodes a member of the RecQ helicase/Topo III complex.

Authors:  Michael Chang; Mohammed Bellaoui; Chaoying Zhang; Ridhdhi Desai; Pavel Morozov; Lissette Delgado-Cruzata; Rodney Rothstein; Greg A Freyer; Charles Boone; Grant W Brown
Journal:  EMBO J       Date:  2005-05-12       Impact factor: 11.598

8.  The FlyBase database of the Drosophila genome projects and community literature.

Authors: 
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

9.  Overview of BioCreAtIvE: critical assessment of information extraction for biology.

Authors:  Lynette Hirschman; Alexander Yeh; Christian Blaschke; Alfonso Valencia
Journal:  BMC Bioinformatics       Date:  2005-05-24       Impact factor: 3.169

10.  Learning statistical models for annotating proteins with function information using biomedical text.

Authors:  Soumya Ray; Mark Craven
Journal:  BMC Bioinformatics       Date:  2005-05-24       Impact factor: 3.169

View more
  8 in total

1.  Getting started in text mining: part two.

Authors:  Andrey Rzhetsky; Michael Seringhaus; Mark B Gerstein
Journal:  PLoS Comput Biol       Date:  2009-07-31       Impact factor: 4.475

Review 2.  Functional annotations for the Saccharomyces cerevisiae genome: the knowns and the known unknowns.

Authors:  Karen R Christie; Eurie L Hong; J Michael Cherry
Journal:  Trends Microbiol       Date:  2009-07-02       Impact factor: 17.079

3.  Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study.

Authors:  Maria C Costanzo; Julie Park; Rama Balakrishnan; J Michael Cherry; Eurie L Hong
Journal:  Database (Oxford)       Date:  2011-03-15       Impact factor: 3.451

4.  CvManGO, a method for leveraging computational predictions to improve literature-based Gene Ontology annotations.

Authors:  Julie Park; Maria C Costanzo; Rama Balakrishnan; J Michael Cherry; Eurie L Hong
Journal:  Database (Oxford)       Date:  2012-03-20       Impact factor: 3.451

5.  A questions-based investigation of consumer mental-health information.

Authors:  Colleen E Crangle; Joyce Brothers Kart
Journal:  PeerJ       Date:  2015-03-31       Impact factor: 2.984

6.  Gene Ontology density estimation and discourse analysis for automatic GeneRiF extraction.

Authors:  Julien Gobeill; Imad Tbahriti; Frédéric Ehrler; Anaïs Mottaz; Anne-Lise Veuthey; Patrick Ruch
Journal:  BMC Bioinformatics       Date:  2008-04-11       Impact factor: 3.169

7.  Semi-automated curation of protein subcellular localization: a text mining-based approach to Gene Ontology (GO) Cellular Component curation.

Authors:  Kimberly Van Auken; Joshua Jaffery; Juancarlos Chan; Hans-Michael Müller; Paul W Sternberg
Journal:  BMC Bioinformatics       Date:  2009-07-21       Impact factor: 3.169

8.  ECO-CollecTF: A Corpus of Annotated Evidence-Based Assertions in Biomedical Manuscripts.

Authors:  Elizabeth T Hobbs; Stephen M Goralski; Ashley Mitchell; Andrew Simpson; Dorjan Leka; Emmanuel Kotey; Matt Sekira; James B Munro; Suvarna Nadendla; Rebecca Jackson; Aitor Gonzalez-Aguirre; Martin Krallinger; Michelle Giglio; Ivan Erill
Journal:  Front Res Metr Anal       Date:  2021-07-13
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.