Literature DB >> 10902198

Two applications of information extraction to biological science journal articles: enzyme interactions and protein structures.

K Humphreys1, G Demetriou, R Gaizauskas.   

Abstract

Information extraction technology, as defined and developed through the U.S. DARPA Message Understanding Conferences (MUCs), has proved successful at extracting information primarily from newswire texts and primarily in domains concerned with human activity. In this paper we consider the application of this technology to the extraction of information from scientific journal papers in the area of molecular biology. In particular, we describe how an information extraction system designed to participate in the MUC exercises has been modified for two bioinformatics applications: EMPathIE, concerned with enzyme and metabolic pathways; and PASTA, concerned with protein structure. Progress to date provides convincing grounds for believing that IE techniques will deliver novel and effective ways for scientists to make use of the core literature which defines their disciplines.

Entities:  

Mesh:

Substances:

Year:  2000        PMID: 10902198     DOI: 10.1142/9789814447331_0048

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  11 in total

1.  High-recall protein entity recognition using a dictionary.

Authors:  Zhenzhen Kou; William W Cohen; Robert F Murphy
Journal:  Bioinformatics       Date:  2005-06       Impact factor: 6.937

2.  Quantitative assessment of dictionary-based protein named entity tagging.

Authors:  Hongfang Liu; Zhang-Zhi Hu; Manabu Torii; Cathy Wu; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2006-06-23       Impact factor: 4.497

3.  BioTagger-GM: a gene/protein name recognition system.

Authors:  Manabu Torii; Zhangzhi Hu; Cathy H Wu; Hongfang Liu
Journal:  J Am Med Inform Assoc       Date:  2008-12-11       Impact factor: 4.497

4.  PreBIND and Textomy--mining the biomedical literature for protein-protein interactions using a support vector machine.

Authors:  Ian Donaldson; Joel Martin; Berry de Bruijn; Cheryl Wolting; Vicki Lay; Brigitte Tuekam; Shudong Zhang; Berivan Baskin; Gary D Bader; Katerina Michalickova; Tony Pawson; Christopher W V Hogue
Journal:  BMC Bioinformatics       Date:  2003-03-27       Impact factor: 3.169

5.  A text-mining system for extracting metabolic reactions from full-text articles.

Authors:  Jan Czarnecki; Irene Nobeli; Adrian M Smith; Adrian J Shepherd
Journal:  BMC Bioinformatics       Date:  2012-07-23       Impact factor: 3.169

6.  Applications of natural language processing in biodiversity science.

Authors:  Anne E Thessen; Hong Cui; Dmitry Mozzherin
Journal:  Adv Bioinformatics       Date:  2012-05-22

7.  A graph-search framework for associating gene identifiers with documents.

Authors:  William W Cohen; Einat Minkov
Journal:  BMC Bioinformatics       Date:  2006-10-10       Impact factor: 3.169

8.  Mutationmapper: a tool to aid the mapping of protein mutation data.

Authors:  Shabana Vohra; Philip C Biggin
Journal:  PLoS One       Date:  2013-08-09       Impact factor: 3.240

9.  An integrated text mining framework for metabolic interaction network reconstruction.

Authors:  Preecha Patumcharoenpol; Wanwipa Vongsangnak; Narumol Doungpan; Asawin Meechai; Bairong Shen; Jonathan H Chan
Journal:  PeerJ       Date:  2016-03-21       Impact factor: 2.984

10.  Identification of transcription factor contexts in literature using machine learning approaches.

Authors:  Hui Yang; Goran Nenadic; John A Keane
Journal:  BMC Bioinformatics       Date:  2008-04-11       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.