Literature DB >> 20180257

Predicting protein-protein relationships from literature using latent topics.

Tatsuya Aso1, Koji Eguchi.   

Abstract

This paper investigates applying statistical topic models to extract and predict relationships between biological entities, especially protein mentions. A statistical topic model, Latent Dirichlet Allocation (LDA) is promising; however, it has not been investigated for such a task. In this paper, we apply the state-of-the-art Collapsed Variational Bayesian Inference and Gibbs Sampling inference to estimating the LDA model. We also apply probabilistic Latent Semantic Analysis (pLSA) as a baseline for comparison, and compare them from the viewpoints of log-likelihood, classification accuracy and retrieval effectiveness. We demonstrate through experiments that the Collapsed Variational LDA gives better results than the others, especially in terms of classification accuracy and retrieval effectiveness in the task of the protein-protein relationship prediction.

Mesh:

Substances:

Year:  2009        PMID: 20180257

Source DB:  PubMed          Journal:  Genome Inform        ISSN: 0919-9454


  3 in total

1.  Inferring functional modules of protein families with probabilistic topic models.

Authors:  Sebastian Ga Konietzny; Laura Dietz; Alice C McHardy
Journal:  BMC Bioinformatics       Date:  2011-05-09       Impact factor: 3.169

2.  Exploiting topic modeling to boost metagenomic reads binning.

Authors:  Ruichang Zhang; Zhanzhan Cheng; Jihong Guan; Shuigeng Zhou
Journal:  BMC Bioinformatics       Date:  2015-03-18       Impact factor: 3.169

3.  Evaluation of clustering and topic modeling methods over health-related tweets and emails.

Authors:  Juan Antonio Lossio-Ventura; Sergio Gonzales; Juandiego Morzan; Hugo Alatrista-Salas; Tina Hernandez-Boussard; Jiang Bian
Journal:  Artif Intell Med       Date:  2021-05-07       Impact factor: 7.011

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.