Literature DB >> 23828786

Towards building a disease-phenotype knowledge base: extracting disease-manifestation relationship from literature.

Rong Xu1, Li Li, Quanqiu Wang.   

Abstract

MOTIVATION: Systems approaches to studying phenotypic relationships among diseases are emerging as an active area of research for both novel disease gene discovery and drug repurposing. Currently, systematic study of disease phenotypic relationships on a phenome-wide scale is limited because large-scale machine-understandable disease-phenotype relationship knowledge bases are often unavailable. Here, we present an automatic approach to extract disease-manifestation (D-M) pairs (one specific type of disease-phenotype relationship) from the wide body of published biomedical literature. DATA AND METHODS: Our method leverages external knowledge and limits the amount of human effort required. For the text corpus, we used 119 085 682 MEDLINE sentences (21 354 075 citations). First, we used D-M pairs from existing biomedical ontologies as prior knowledge to automatically discover D-M-specific syntactic patterns. We then extracted additional pairs from MEDLINE using the learned patterns. Finally, we analysed correlations between disease manifestations and disease-associated genes and drugs to demonstrate the potential of this newly created knowledge base in disease gene discovery and drug repurposing.
RESULTS: In total, we extracted 121 359 unique D-M pairs with a high precision of 0.924. Among the extracted pairs, 120 419 (99.2%) have not been captured in existing structured knowledge sources. We have shown that disease manifestations correlate positively with both disease-associated genes and drug treatments.
CONCLUSIONS: The main contribution of our study is the creation of a large-scale and accurate D-M phenotype relationship knowledge base. This unique knowledge base, when combined with existing phenotypic, genetic and proteomic datasets, can have profound implications in our deeper understanding of disease etiology and in rapid drug repurposing. AVAILABILITY: http://nlp.case.edu/public/data/DMPatternUMLS/

Entities:  

Mesh:

Year:  2013        PMID: 23828786      PMCID: PMC4068009          DOI: 10.1093/bioinformatics/btt359

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  44 in total

1.  GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles.

Authors:  C Friedman; P Kra; H Yu; M Krauthammer; A Rzhetsky
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

Review 2.  Recent approaches to the prioritization of candidate disease genes.

Authors:  Nadezhda T Doncheva; Tim Kacprowski; Mario Albrecht
Journal:  Wiley Interdiscip Rev Syst Biol Med       Date:  2012-06-11

3.  A knowledge-driven conditional approach to extract pharmacogenomics specific drug-gene relationships from free text.

Authors:  Rong Xu; Quanqiu Wang
Journal:  J Biomed Inform       Date:  2012-04-27       Impact factor: 6.317

4.  PubChem as a source of polypharmacology.

Authors:  Bin Chen; David Wild; Rajarshi Guha
Journal:  J Chem Inf Model       Date:  2009-09       Impact factor: 4.956

5.  Inferring disease and gene set associations with rank coherence in networks.

Authors:  TaeHyun Hwang; Wei Zhang; Maoqiang Xie; Jinfeng Liu; Rui Kuang
Journal:  Bioinformatics       Date:  2011-08-08       Impact factor: 6.937

Review 6.  Network medicine: a network-based approach to human disease.

Authors:  Albert-László Barabási; Natali Gulbahce; Joseph Loscalzo
Journal:  Nat Rev Genet       Date:  2011-01       Impact factor: 53.242

7.  PhenomeNET: a whole-phenome approach to disease gene discovery.

Authors:  Robert Hoehndorf; Paul N Schofield; Georgios V Gkoutos
Journal:  Nucleic Acids Res       Date:  2011-07-06       Impact factor: 16.971

8.  A computational method based on the integration of heterogeneous networks for predicting disease-gene associations.

Authors:  Xingli Guo; Lin Gao; Chunshui Wei; Xiaofei Yang; Yi Zhao; Anguo Dong
Journal:  PLoS One       Date:  2011-09-02       Impact factor: 3.240

9.  The impact of cellular networks on disease comorbidity.

Authors:  Juyong Park; Deok-Sun Lee; Nicholas A Christakis; Albert-László Barabási
Journal:  Mol Syst Biol       Date:  2009-04-07       Impact factor: 11.429

10.  Drug discovery using chemical systems biology: repositioning the safe medicine Comtan to treat multi-drug and extensively drug resistant tuberculosis.

Authors:  Sarah L Kinnings; Nina Liu; Nancy Buchmeier; Peter J Tonge; Lei Xie; Philip E Bourne
Journal:  PLoS Comput Biol       Date:  2009-07-03       Impact factor: 4.475

View more
  32 in total

1.  PhenoPredict: A disease phenome-wide drug repositioning approach towards schizophrenia drug discovery.

Authors:  Rong Xu; QuanQiu Wang
Journal:  J Biomed Inform       Date:  2015-07-04       Impact factor: 6.317

2.  Drug repositioning for prostate cancer: using a data-driven approach to gain new insights.

Authors:  QuanQiu Wang; Rong Xu
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

3.  Large-scale automatic extraction of side effects associated with targeted anticancer drugs from full-text oncological articles.

Authors:  Rong Xu; QuanQiu Wang
Journal:  J Biomed Inform       Date:  2015-03-27       Impact factor: 6.317

4.  Retrospective analysis of health claims to evaluate pharmacotherapies with potential for repurposing: Association of bupropion and stimulant use disorder remission.

Authors:  Emily R Hankosky; Heather M Bush; Linda P Dwoskin; Daniel R Harris; Darren W Henderson; Guo-Qiang Zhang; Patricia R Freeman; Jeffery C Talbert
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

5.  Context-sensitive network-based disease genetics prediction and its implications in drug discovery.

Authors:  Yang Chen; Rong Xu
Journal:  Bioinformatics       Date:  2017-04-01       Impact factor: 6.937

6.  Automatic construction of a large-scale and accurate drug-side-effect association knowledge base from biomedical literature.

Authors:  Rong Xu; QuanQiu Wang
Journal:  J Biomed Inform       Date:  2014-06-10       Impact factor: 6.317

7.  DenguePredict: An Integrated Drug Repositioning Approach towards Drug Discovery for Dengue.

Authors:  QuanQiu Wang; Rong Xu
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

8.  PedAM: a database for Pediatric Disease Annotation and Medicine.

Authors:  Jinmeng Jia; Zhongxin An; Yue Ming; Yongli Guo; Wei Li; Xin Li; Yunxiang Liang; Dongming Guo; Jun Tai; Geng Chen; Yaqiong Jin; Zhimei Liu; Xin Ni; Tieliu Shi
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

9.  Drug repurposing for glioblastoma based on molecular subtypes.

Authors:  Yang Chen; Rong Xu
Journal:  J Biomed Inform       Date:  2016-09-30       Impact factor: 6.317

10.  Combining Human Disease Genetics and Mouse Model Phenotypes towards Drug Repositioning for Parkinson's disease.

Authors:  Yang Chen; Xiaoshu Cai; Rong Xu
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.