Literature DB >> 17473318

Data mining and predictive modeling of biomolecular network from biomedical literature databases.

Xiaohua Hu1, Daniel D Wu.   

Abstract

In this paper, we present a novel approach Bio-IEDM (Biomedical Information Extraction and Data Mining) to integrate text mining and predictive modeling to analyze biomolecular network from biomedical literature databases. Our method consists of two phases. In phase 1, we discuss a semisupervised efficient learning approach to automatically extract biological relationships such as protein-protein interaction, protein-gene interaction from the biomedical literature databases to construct the biomolecular network. Our method automatically learns the patterns based on a few user seed tuples and then extracts new tuples from the biomedical literature based on the discovered patterns. The derived biomolecular network forms a large scale-free network graph. In phase 2, we present a novel clustering algorithm to analyze the biomolecular network graph to identify biologically meaningful subnetworks (communities). The clustering algorithm considers the characteristics of the scale-free network graphs and is based on the local density of the vertex and its neighborhood functions that can be used to find more meaningful clusters with different density level. The experimental results indicate our approach is very effective in extracting biological knowledge from a huge collection of biomedical literature. The integration of data mining and information extraction provides a promising direction for analyzing the biomolecular network.

Mesh:

Substances:

Year:  2007        PMID: 17473318     DOI: 10.1109/TCBB.2007.070211

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  7 in total

1.  Inquiry diagnosis of coronary heart disease in Chinese medicine based on symptom-syndrome interactions.

Authors:  Guo-Zheng Li; Sheng Sun; Mingyu You; Ya-Lei Wang; Guo-Ping Liu
Journal:  Chin Med       Date:  2012-04-05       Impact factor: 5.455

2.  The Text-mining based PubChem Bioassay neighboring analysis.

Authors:  Lianyi Han; Tugba O Suzek; Yanli Wang; Steve H Bryant
Journal:  BMC Bioinformatics       Date:  2010-11-08       Impact factor: 3.169

3.  Learning an enriched representation from unlabeled data for protein-protein interaction extraction.

Authors:  Yanpeng Li; Xiaohua Hu; Hongfei Lin; Zhihao Yang
Journal:  BMC Bioinformatics       Date:  2010-04-16       Impact factor: 3.169

4.  Open Biomedical Ontology-based Medline exploration.

Authors:  Weijian Xuan; Manhong Dai; Barbara Mirel; Jean Song; Brian Athey; Stanley J Watson; Fan Meng
Journal:  BMC Bioinformatics       Date:  2009-05-06       Impact factor: 3.169

5.  Automatic extraction of protein-protein interactions using grammatical relationship graph.

Authors:  Kaixian Yu; Pei-Yau Lung; Tingting Zhao; Peixiang Zhao; Yan-Yuan Tseng; Jinfeng Zhang
Journal:  BMC Med Inform Decis Mak       Date:  2018-07-23       Impact factor: 2.796

6.  A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets.

Authors:  Li Zhang; Jiamei Hu; Qianzhi Xu; Fang Li; Guozheng Rao; Cui Tao
Journal:  BMC Med Inform Decis Mak       Date:  2020-12-14       Impact factor: 2.796

7.  Mining and state-space modeling and verification of sub-networks from large-scale biomolecular networks.

Authors:  Xiaohua Hu; Fang-Xiang Wu
Journal:  BMC Bioinformatics       Date:  2007-08-31       Impact factor: 3.169

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.