Literature DB >> 16046493

Extraction of regulatory gene/protein networks from Medline.

Jasmin Saric1, Lars Juhl Jensen, Rossitza Ouzounova, Isabel Rojas, Peer Bork.   

Abstract

MOTIVATION: We have previously developed a rule-based approach for extracting information on the regulation of gene expression in yeast. The biomedical literature, however, contains information on several other equally important regulatory mechanisms, in particular phosphorylation, which we now expanded for our rule-based system also to extract.
RESULTS: This paper presents new results for extraction of relational information from biomedical text. We have improved our system, STRING-IE, to capture both new types of linguistic constructs as well as new types of biological information [i.e. (de-)phosphorylation]. The precision remains stable with a slight increase in recall. From almost one million PubMed abstracts related to four model organisms, we manage to extract regulatory networks and binary phosphorylations comprising 3,319 relation chunks. The accuracy is 83-90% and 86-95% for gene expression and (de-)phosphorylation relations, respectively. To achieve this, we made use of an organism-specific resource of gene/protein names considerably larger than those used in most other biology related information extraction approaches. These names were included in the lexicon when retraining the part-of-speech (POS) tagger on the GENIA corpus. For the domain in question, an accuracy of 96.4% was attained on POS tags. It should be noted that the rules were developed for yeast and successfully applied to both abstracts and full-text articles related to other organisms with comparable accuracy. AVAILABILITY: The revised GENIA corpus, the POS tagger, the extraction rules and the full sets of extracted relations are available from http://www.bork.embl.de/Docu/STRING-IE

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16046493     DOI: 10.1093/bioinformatics/bti597

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  51 in total

Review 1.  Natural Language Processing methods and systems for biomedical ontology learning.

Authors:  Kaihong Liu; William R Hogan; Rebecca S Crowley
Journal:  J Biomed Inform       Date:  2010-07-18       Impact factor: 6.317

2.  Using text to build semantic networks for pharmacogenomics.

Authors:  Adrien Coulet; Nigam H Shah; Yael Garten; Mark Musen; Russ B Altman
Journal:  J Biomed Inform       Date:  2010-08-17       Impact factor: 6.317

3.  Systematic discovery of in vivo phosphorylation networks.

Authors:  Rune Linding; Lars Juhl Jensen; Gerard J Ostheimer; Marcel A T M van Vugt; Claus Jørgensen; Ioana M Miron; Francesca Diella; Karen Colwill; Lorne Taylor; Kelly Elder; Pavel Metalnikov; Vivian Nguyen; Adrian Pasculescu; Jing Jin; Jin Gyoon Park; Leona D Samson; James R Woodgett; Robert B Russell; Peer Bork; Michael B Yaffe; Tony Pawson
Journal:  Cell       Date:  2007-06-14       Impact factor: 41.582

4.  Bayesian inference of protein-protein interactions from biological literature.

Authors:  Rajesh Chowdhary; Jinfeng Zhang; Jun S Liu
Journal:  Bioinformatics       Date:  2009-04-15       Impact factor: 6.937

5.  Extracting causal relations on HIV drug resistance from literature.

Authors:  Quoc-Chinh Bui; Breanndán O Nualláin; Charles A Boucher; Peter M A Sloot
Journal:  BMC Bioinformatics       Date:  2010-02-23       Impact factor: 3.169

6.  A comprehensive benchmark of kernel methods to extract protein-protein interactions from literature.

Authors:  Domonkos Tikk; Philippe Thomas; Peter Palaga; Jörg Hakenberg; Ulf Leser
Journal:  PLoS Comput Biol       Date:  2010-07-01       Impact factor: 4.475

7.  BSQA: integrated text mining using entity relation semantics extracted from biological literature of insects.

Authors:  Xin He; Yanen Li; Radhika Khetani; Barry Sanders; Yue Lu; Xu Ling; Chengxiang Zhai; Bruce Schatz
Journal:  Nucleic Acids Res       Date:  2010-07       Impact factor: 16.971

8.  PTM-Switchboard--a database of posttranslational modifications of transcription factors, the mediating enzymes and target genes.

Authors:  Logan Everett; Antony Vo; Sridhar Hannenhalli
Journal:  Nucleic Acids Res       Date:  2008-10-15       Impact factor: 16.971

Review 9.  Regulation by transcription factors in bacteria: beyond description.

Authors:  Enrique Balleza; Lucia N López-Bojorquez; Agustino Martínez-Antonio; Osbaldo Resendis-Antonio; Irma Lozada-Chávez; Yalbi I Balderas-Martínez; Sergio Encarnación; Julio Collado-Vides
Journal:  FEMS Microbiol Rev       Date:  2009-01       Impact factor: 16.408

10.  Automated recognition of brain region mentions in neuroscience literature.

Authors:  Leon French; Suzanne Lane; Lydia Xu; Paul Pavlidis
Journal:  Front Neuroinform       Date:  2009-09-01       Impact factor: 4.081

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.