Literature DB >> 23424147

Detection of protein catalytic sites in the biomedical literature.

Karin Verspoor1, Andrew Mackinlay, Judith D Cohn, Michael E Wall.   

Abstract

This paper explores the application of text mining to the problem of detecting protein functional sites in the biomedical literature, and specifically considers the task of identifying catalytic sites in that literature. We provide strong evidence for the need for text mining techniques that address residue-level protein function annotation through an analysis of two corpora in terms of their coverage of curated data sources. We also explore the viability of building a text-based classifier for identifying protein functional sites, identifying the low coverage of curated data sources and the potential ambiguity of information about protein functional sites as challenges that must be addressed. Nevertheless we produce a simple classifier that achieves a reasonable ∼69% F-score on our full text silver corpus on the first attempt to address this classification task. The work has application in computational prediction of the functional significance of protein sites as well as in curation workflows for databases that capture this information.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23424147      PMCID: PMC3664919     

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  13 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Beyond annotation transfer by homology: novel protein-function prediction methods to assist drug discovery.

Authors:  Yanay Ofran; Marco Punta; Reinhard Schneider; Burkhard Rost
Journal:  Drug Discov Today       Date:  2005-11-01       Impact factor: 7.851

3.  Manual curation is not sufficient for annotation of genomic databases.

Authors:  William A Baumgartner; K Bretonnel Cohen; Lynne M Fox; George Acquaah-Mensah; Lawrence Hunter
Journal:  Bioinformatics       Date:  2007-07-01       Impact factor: 6.937

4.  The structural and content aspects of abstracts versus bodies of full text journal articles are different.

Authors:  K Bretonnel Cohen; Helen L Johnson; Karin Verspoor; Christophe Roeder; Lawrence E Hunter
Journal:  BMC Bioinformatics       Date:  2010-09-29       Impact factor: 3.169

5.  Text mining improves prediction of protein functional sites.

Authors:  Karin M Verspoor; Judith D Cohn; Komandur E Ravikumar; Michael E Wall
Journal:  PLoS One       Date:  2012-02-29       Impact factor: 3.240

6.  Automated extraction and semantic analysis of mutation impacts from the biomedical literature.

Authors:  Nona Naderi; René Witte
Journal:  BMC Genomics       Date:  2012-06-18       Impact factor: 3.969

7.  BioLemmatizer: a lemmatization tool for morphological processing of biomedical text.

Authors:  Haibin Liu; Tom Christiansen; William A Baumgartner; Karin Verspoor
Journal:  J Biomed Semantics       Date:  2012-04-01

8.  Binding MOAD, a high-quality protein-ligand database.

Authors:  Mark L Benson; Richard D Smith; Nickolay A Khazanov; Brandon Dimcheff; John Beaver; Peter Dresslar; Jason Nerothin; Heather A Carlson
Journal:  Nucleic Acids Res       Date:  2007-11-30       Impact factor: 16.971

9.  Annotation of protein residues based on a literature analysis: cross-validation against UniProtKb.

Authors:  Kevin Nagel; Antonio Jimeno-Yepes; Dietrich Rebholz-Schuhmann
Journal:  BMC Bioinformatics       Date:  2009-08-27       Impact factor: 3.169

10.  Literature mining of protein-residue associations with graph rules learned through distant supervision.

Authors:  Ke Ravikumar; Haibin Liu; Judith D Cohn; Michael E Wall; Karin Verspoor
Journal:  J Biomed Semantics       Date:  2012-10-05
View more
  5 in total

1.  PDF text classification to leverage information extraction from publication reports.

Authors:  Duy Duc An Bui; Guilherme Del Fiol; Siddhartha Jonnalagadda
Journal:  J Biomed Inform       Date:  2016-04-01       Impact factor: 6.317

2.  Associating disease-related genetic variants in intergenic regions to the genes they impact.

Authors:  Geoff Macintyre; Antonio Jimeno Yepes; Cheng Soon Ong; Karin Verspoor
Journal:  PeerJ       Date:  2014-10-23       Impact factor: 2.984

3.  Mutation extraction tools can be combined for robust recognition of genetic variants in the literature.

Authors:  Antonio Jimeno Yepes; Karin Verspoor
Journal:  F1000Res       Date:  2014-01-21

4.  Annotating the biomedical literature for the human variome.

Authors:  Karin Verspoor; Antonio Jimeno Yepes; Lawrence Cavedon; Tara McIntosh; Asha Herten-Crabb; Zoë Thomas; John-Paul Plazzer
Journal:  Database (Oxford)       Date:  2013-04-12       Impact factor: 3.451

5.  Literature mining of genetic variants for curation: quantifying the importance of supplementary material.

Authors:  Antonio Jimeno Yepes; Karin Verspoor
Journal:  Database (Oxford)       Date:  2014-02-10       Impact factor: 3.451

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.