A statistical framework for biomedical literature mining.

Dongjun Chung, Andrew Lawson, W Jim Zheng.

Abstract

In systems biology, it is of great interest to identify new genes that have not previously been reported to be associated with biological pathways related to various functions and diseases. Identifying these new pathway-modulating genes not only promotes understanding of pathway regulation mechanisms but also allows identification of novel therapeutic targets. Recently, the biomedical literature has come to be regarded as a valuable resource for investigating pathway-modulating genes. Most currently available approaches are based on the co-occurrence of genes within an abstract, but these approaches have been reported to perform suboptimally because 70% of abstracts contain information on only a single gene. To overcome this limitation, we propose a novel statistical framework based on the concept of the ontology fingerprint, which uses gene ontology to extract information from large biomedical literature data. The proposed framework simultaneously identifies pathway-modulating genes and facilitates interpretation of the functions of these new genes. We also propose a computationally efficient posterior inference procedure based on a Metropolis-Hastings within Gibbs sampler for parameter updates and the poor man's reversible jump Markov chain Monte Carlo approach for model selection. We evaluate the proposed statistical framework with simulation studies, experimental validation, and an application to studies of pathway-modulating genes in yeast. The R implementation of the proposed model is currently available at https://dongjunchung.github.io/bayesGO/.
Copyright © 2017 John Wiley & Sons, Ltd.
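
For readers unfamiliar with the term, the general idea of a Metropolis-Hastings within Gibbs sampler can be illustrated with a minimal sketch on a toy model. This is not the paper's model or the bayesGO implementation: the normal likelihood, the flat priors on `mu` and `log(sigma)`, and the names `mh_within_gibbs` and `log_cond_log_sigma` are all assumptions chosen for illustration. One parameter is drawn exactly from its full conditional (a Gibbs step), while the other is updated with a random-walk Metropolis-Hastings step.

```python
import math
import random


def log_cond_log_sigma(log_sigma, mu, data):
    """Log full conditional of log(sigma) given mu, up to a constant.

    Toy assumption: data are Normal(mu, sigma^2) with a flat prior on log(sigma).
    """
    sigma = math.exp(log_sigma)
    sse = sum((y - mu) ** 2 for y in data)
    return -len(data) * log_sigma - sse / (2.0 * sigma ** 2)


def mh_within_gibbs(data, n_iter=5000, step=0.1, seed=0):
    """Sample (mu, sigma) for the toy normal model.

    Returns the list of draws and the acceptance rate of the MH step.
    """
    rng = random.Random(seed)
    n = len(data)
    ybar = sum(data) / n
    mu, log_sigma = ybar, 0.0
    draws = []
    accepted = 0
    for _ in range(n_iter):
        # Gibbs step: under a flat prior on mu, the full conditional
        # mu | sigma, data is Normal(ybar, sigma^2 / n), so draw it exactly.
        sigma = math.exp(log_sigma)
        mu = rng.gauss(ybar, sigma / math.sqrt(n))

        # Metropolis-Hastings step: symmetric random-walk proposal on
        # log(sigma), accepted with probability min(1, density ratio).
        proposal = log_sigma + rng.gauss(0.0, step)
        log_ratio = (log_cond_log_sigma(proposal, mu, data)
                     - log_cond_log_sigma(log_sigma, mu, data))
        # 1 - random() lies in (0, 1], which avoids taking log(0).
        if math.log(1.0 - rng.random()) < log_ratio:
            log_sigma = proposal
            accepted += 1

        draws.append((mu, math.exp(log_sigma)))
    return draws, accepted / n_iter
```

In practice the early draws would be discarded as burn-in and the proposal scale `step` tuned so that the acceptance rate is neither close to 0 nor close to 1.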

Keywords:  Bayesian hierarchical model; biological pathway; gene ontology; literature search; ontology fingerprint

Year: 2017; PMID: 28675924; PMCID: PMC5657248; DOI: 10.1002/sim.7384

Source DB: PubMed; Journal: Stat Med; ISSN: 0277-6715; Impact factor: 2.373

