Literature DB >> 19616641

Feature generation and representations for protein-protein interaction classification.

Man Lan1, Chew Lim Tan, Jian Su.   

Abstract

Automatic detecting protein-protein interaction (PPI) relevant articles is a crucial step for large-scale biological database curation. The previous work adopted POS tagging, shallow parsing and sentence splitting techniques, but they achieved worse performance than the simple bag-of-words representation. In this paper, we generated and investigated multiple types of feature representations in order to further improve the performance of PPI text classification task. Besides the traditional domain-independent bag-of-words approach and the term weighting methods, we also explored other domain-dependent features, i.e. protein-protein interaction trigger keywords, protein named entities and the advanced ways of incorporating Natural Language Processing (NLP) output. The integration of these multiple features has been evaluated on the BioCreAtIvE II corpus. The experimental results showed that both the advanced way of using NLP output and the integration of bag-of-words and NLP output improved the performance of text classification. Specifically, in comparison with the best performance achieved in the BioCreAtIvE II IAS, the feature-level and classifier-level integration of multiple features improved the performance of classification 2.71% and 3.95%, respectively.

Mesh:

Year:  2009        PMID: 19616641     DOI: 10.1016/j.jbi.2009.07.004

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  6 in total

1.  Document triage for identifying protein-protein interactions affected by mutations: a neural network ensemble approach.

Authors:  Ling Luo; Zhihao Yang; Hongfei Lin; Jian Wang
Journal:  Database (Oxford)       Date:  2018-01-01       Impact factor: 3.451

2.  Detection of interaction articles and experimental methods in biomedical literature.

Authors:  Gerold Schneider; Simon Clematide; Fabio Rinaldi
Journal:  BMC Bioinformatics       Date:  2011-10-03       Impact factor: 3.169

3.  Extraction of Protein-Protein Interaction from Scientific Articles by Predicting Dominant Keywords.

Authors:  Shun Koyabu; Thi Thanh Thuy Phan; Takenao Ohkawa
Journal:  Biomed Res Int       Date:  2015-12-10       Impact factor: 3.411

4.  Exploiting graph kernels for high performance biomedical relation extraction.

Authors:  Nagesh C Panyam; Karin Verspoor; Trevor Cohn; Kotagiri Ramamohanarao
Journal:  J Biomed Semantics       Date:  2018-01-30

5.  Automatic assignment of prokaryotic genes to functional categories using literature profiling.

Authors:  Raul Torrieri; Francislon S Oliveira; Guilherme Oliveira; Roney S Coimbra
Journal:  PLoS One       Date:  2012-10-15       Impact factor: 3.240

6.  Protein-protein interaction extraction with feature selection by evaluating contribution levels of groups consisting of related features.

Authors:  Thi Thanh Thuy Phan; Takenao Ohkawa
Journal:  BMC Bioinformatics       Date:  2016-07-25       Impact factor: 3.169

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.