Literature DB >> 25863986

A semi-supervised learning framework for biomedical event extraction based on hidden topics.

Deyu Zhou1, Dayou Zhong2.   

Abstract

OBJECTIVES: Scientists have devoted decades of efforts to understanding the interaction between proteins or RNA production. The information might empower the current knowledge on drug reactions or the development of certain diseases. Nevertheless, due to the lack of explicit structure, literature in life science, one of the most important sources of this information, prevents computer-based systems from accessing. Therefore, biomedical event extraction, automatically acquiring knowledge of molecular events in research articles, has attracted community-wide efforts recently. Most approaches are based on statistical models, requiring large-scale annotated corpora to precisely estimate models' parameters. However, it is usually difficult to obtain in practice. Therefore, employing un-annotated data based on semi-supervised learning for biomedical event extraction is a feasible solution and attracts more interests. METHODS AND MATERIAL: In this paper, a semi-supervised learning framework based on hidden topics for biomedical event extraction is presented. In this framework, sentences in the un-annotated corpus are elaborately and automatically assigned with event annotations based on their distances to these sentences in the annotated corpus. More specifically, not only the structures of the sentences, but also the hidden topics embedded in the sentences are used for describing the distance. The sentences and newly assigned event annotations, together with the annotated corpus, are employed for training.
RESULTS: Experiments were conducted on the multi-level event extraction corpus, a golden standard corpus. Experimental results show that more than 2.2% improvement on F-score on biomedical event extraction is achieved by the proposed framework when compared to the state-of-the-art approach.
CONCLUSION: The results suggest that by incorporating un-annotated data, the proposed framework indeed improves the performance of the state-of-the-art event extraction system and the similarity between sentences might be precisely described by hidden topics and structures of the sentences.
Copyright © 2015 Elsevier B.V. All rights reserved.

Keywords:  Biomedical event extraction; K nearest neighbor; Latent Dirichlet allocation; Semi-supervised learning

Mesh:

Year:  2015        PMID: 25863986     DOI: 10.1016/j.artmed.2015.03.004

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  4 in total

1.  Conditional Probability Joint Extraction of Nested Biomedical Events: Design of a Unified Extraction Framework Based on Neural Networks.

Authors:  Yan Wang; Jian Wang; Huiyi Lu; Bing Xu; Yijia Zhang; Santosh Kumar Banbhrani; Hongfei Lin
Journal:  JMIR Med Inform       Date:  2022-06-07

2.  A multiple distributed representation method based on neural network for biomedical event extraction.

Authors:  Anran Wang; Jian Wang; Hongfei Lin; Jianhai Zhang; Zhihao Yang; Kan Xu
Journal:  BMC Med Inform Decis Mak       Date:  2017-12-20       Impact factor: 2.796

3.  A biomedical event extraction method based on fine-grained and attention mechanism.

Authors:  Xinyu He; Ping Tai; Hongbin Lu; Xin Huang; Yonggong Ren
Journal:  BMC Bioinformatics       Date:  2022-07-29       Impact factor: 3.307

4.  Support Vector Machine with Ensemble Tree Kernel for Relation Extraction.

Authors:  Xiaoyong Liu; Hui Fu; Zhiguo Du
Journal:  Comput Intell Neurosci       Date:  2016-03-22
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.