Literature DB >> 19398370

Learning ontological rules to extract multiple relations of genic interactions from text.

Alain-Pierre Manine1, Erick Alphonse, Philippe Bessières.   

Abstract

INTRODUCTION: Information extraction (IE) systems have been proposed in recent years to extract genic interactions from bibliographical resources. They are limited to single interaction relations, and have to face a trade-off between recall and precision, by focusing either on specific interactions (for precision), or general and unspecified interactions of biological entities (for recall). Yet, biologists need to process more complex data from literature, in order to study biological pathways. An ontology is an adequate formal representation to model this sophisticated knowledge. However, the tight integration of IE systems and ontologies is still a current research issue, a fortiori with complex ones that go beyond hierarchies.
METHOD: We propose a rich modeling of genic interactions with an ontology, and show how it can be used within an IE system. The ontology is seen as a language specifying a normalized representation of text. First, IE is performed by extracting instances from natural language processing (NLP) modules. Then, deductive inferences on the ontology language are completed, and new instances are derived from previously extracted ones. Inference rules are learnt with an inductive logic programming (ILP) algorithm, using the ontology as the hypothesis language, and its instantiation on an annotated corpus as the example language. Learning is set in a multi-class setting to deal with the multiple ontological relations.
RESULTS: We validated our approach on an annotated corpus of gene transcription regulations in the Bacillus subtilis bacterium. We reach a global recall of 89.3% and a precision of 89.6%, with high scores for the ten semantic relations defined in the ontology.

Entities:  

Mesh:

Year:  2009        PMID: 19398370     DOI: 10.1016/j.ijmedinf.2009.03.005

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.046


  4 in total

1.  Event extraction of bacteria biotopes: a knowledge-intensive NLP-based approach.

Authors:  Zorana Ratkovic; Wiktoria Golik; Pierre Warnier
Journal:  BMC Bioinformatics       Date:  2012-06-26       Impact factor: 3.169

2.  BioNLP Shared Task--The Bacteria Track.

Authors:  Robert Bossy; Julien Jourde; Alain-Pierre Manine; Philippe Veber; Erick Alphonse; Maarten van de Guchte; Philippe Bessières; Claire Nédellec
Journal:  BMC Bioinformatics       Date:  2012-06-26       Impact factor: 3.169

3.  Overview of the gene regulation network and the bacteria biotope tasks in BioNLP'13 shared task.

Authors:  Robert Bossy; Wiktoria Golik; Zorana Ratkovic; Dialekti Valsamou; Philippe Bessières; Claire Nédellec
Journal:  BMC Bioinformatics       Date:  2015-07-13       Impact factor: 3.169

4.  A semantic-based method for extracting concept definitions from scientific publications: evaluation in the autism phenotype domain.

Authors:  Saeed Hassanpour; Martin J O'Connor; Amar K Das
Journal:  J Biomed Semantics       Date:  2013-08-12
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.