Large-scale extraction of gene regulation for model organisms in an ontological context.

Jasmin Saric, Lars J Jensen, Isabel Rojas.

Abstract

This paper presents an approach that uses syntactosemantic rules to extract relational information from biomedical abstracts. The results show that high-precision results can be achieved by overcoming the hurdle of technical terminology. From 58,664 abstracts related to baker's yeast, we extracted a regulatory network comprising 441 pairwise relations with an accuracy of 83-90%. To achieve this, we used a resource of gene/protein names considerably larger than those used in most other biology-related information extraction approaches. This list of names was included in the lexicon of our part-of-speech tagger, which was retrained for use on molecular biology abstracts. For the domain in question, an accuracy of 93.6-97.7% was attained on part-of-speech tags. The method can be easily adapted to organisms other than yeast, allowing us to extract many more biologically relevant relations. The main reason for the comparable precision rates is the ontological model, which was built beforehand and guided the manual coding of the syntactosemantic rules.

Year:  2005        PMID: 15972005

Source DB:  PubMed          Journal:  In Silico Biol        ISSN: 1386-6338


  7 in total

1.  Text-mining solutions for biomedical research: enabling integrative biology. (Review)

Authors:  Dietrich Rebholz-Schuhmann; Anika Oellrich; Robert Hoehndorf
Journal:  Nat Rev Genet       Date:  2012-11-14       Impact factor: 53.242

2.  Improving the extraction of complex regulatory events from scientific text by using ontology-based inference.

Authors:  Jung-Jae Kim; Dietrich Rebholz-Schuhmann
Journal:  J Biomed Semantics       Date:  2011-10-06

3.  Term identification methods for consumer health vocabulary development.

Authors:  Qing T Zeng; Tony Tse; Guy Divita; Alla Keselman; Jon Crowell; Allen C Browne; Sergey Goryachev; Long Ngo
Journal:  J Med Internet Res       Date:  2007-02-28       Impact factor: 5.428

4.  Graph theory enables drug repurposing--how a mathematical model can drive the discovery of hidden mechanisms of action.

Authors:  Ruggero Gramatica; T Di Matteo; Stefano Giorgetti; Massimo Barbiani; Dorian Bevec; Tomaso Aste
Journal:  PLoS One       Date:  2014-01-09       Impact factor: 3.240

5.  BioCAD: an information fusion platform for bio-network inference and analysis.

Authors:  Doheon Lee; Sangwoo Kim; Younghoon Kim
Journal:  BMC Bioinformatics       Date:  2007-11-27       Impact factor: 3.169

6.  Automatic reconstruction of a bacterial regulatory network using Natural Language Processing.

Authors:  Carlos Rodríguez-Penagos; Heladia Salgado; Irma Martínez-Flores; Julio Collado-Vides
Journal:  BMC Bioinformatics       Date:  2007-08-07       Impact factor: 3.169

7.  Text-mining assisted regulatory annotation.

Authors:  Stein Aerts; Maximilian Haeussler; Steven van Vooren; Obi L Griffith; Paco Hulpiau; Steven J M Jones; Stephen B Montgomery; Casey M Bergman
Journal:  Genome Biol       Date:  2008-02-13       Impact factor: 13.583
