Literature DB >> 12460629

Learning anchor verbs for biological interaction patterns from published text articles.

Vasileios Hatzivassiloglou1, Wubin Weng.   

Abstract

Much of knowledge modeling in the molecular biology domain involves interactions between proteins, genes, various forms of RNA, small molecules, etc. Interactions between these substances are typically extracted and codified manually, increasing the cost and time for modeling and substantially limiting the coverage of the resulting knowledge base. In this paper, we describe an automatic system that learns from text interaction verbs; these verbs can then form the core of automatically retrieved patterns which model classes of biological interactions. We investigate text features relating verbs with genes and proteins, and apply statistical tests and a logistic regression statistical model to determine whether a given verb belongs to the class of interaction verbs. Our system, AVAD, achieves over 87% precision and 82% recall when tested on an 11 million word corpus of journal articles. In addition, we compare the automatically obtained results with a manually constructed database of interaction verbs and show that the automatic approach can significantly enrich the manual list by detecting rarer interaction verbs that were omitted from the database.

Mesh:

Year:  2002        PMID: 12460629     DOI: 10.1016/s1386-5056(02)00054-0

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.046


  5 in total

1.  Bayesian inference of protein-protein interactions from biological literature.

Authors:  Rajesh Chowdhary; Jinfeng Zhang; Jun S Liu
Journal:  Bioinformatics       Date:  2009-04-15       Impact factor: 6.937

Review 2.  What the papers say: text mining for genomics and systems biology.

Authors:  Nathan Harmston; Wendy Filsell; Michael P H Stumpf
Journal:  Hum Genomics       Date:  2010-10       Impact factor: 4.639

3.  Connecting the dots between PubMed abstracts.

Authors:  M Shahriar Hossain; Joseph Gresock; Yvette Edmonds; Richard Helm; Malcolm Potts; Naren Ramakrishnan
Journal:  PLoS One       Date:  2012-01-03       Impact factor: 3.240

4.  Automatic extraction of protein-protein interactions using grammatical relationship graph.

Authors:  Kaixian Yu; Pei-Yau Lung; Tingting Zhao; Peixiang Zhao; Yan-Yuan Tseng; Jinfeng Zhang
Journal:  BMC Med Inform Decis Mak       Date:  2018-07-23       Impact factor: 2.796

5.  Identification of transcription factor contexts in literature using machine learning approaches.

Authors:  Hui Yang; Goran Nenadic; John A Keane
Journal:  BMC Bioinformatics       Date:  2008-04-11       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.