Literature DB >> 26384564

Beegle: from literature mining to disease-gene discovery.

Sarah ElShal1, Léon-Charles Tranchevent2, Alejandro Sifrim3, Amin Ardeshirdavani4, Jesse Davis5, Yves Moreau4.   

Abstract

Disease-gene identification is a challenging process that has multiple applications within functional genomics and personalized medicine. Typically, this process involves both finding genes known to be associated with the disease (through literature search) and carrying out preliminary experiments or screens (e.g. linkage or association studies, copy number analyses, expression profiling) to determine a set of promising candidates for experimental validation. This requires extensive time and monetary resources. We describe Beegle, an online search and discovery engine that attempts to simplify this process by automating the typical approaches. It starts by mining the literature to quickly extract a set of genes known to be linked with a given query, then it integrates the learning methodology of Endeavour (a gene prioritization tool) to train a genomic model and rank a set of candidate genes to generate novel hypotheses. In a realistic evaluation setup, Beegle has an average recall of 84% in the top 100 returned genes as a search engine, which improves the discovery engine by 12.6% in the top 5% prioritized genes. Beegle is publicly available at http://beegle.esat.kuleuven.be/.
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2015        PMID: 26384564      PMCID: PMC4737179          DOI: 10.1093/nar/gkv905

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  26 in total

1.  Association of genes to genetically inherited diseases using data mining.

Authors:  Carolina Perez-Iratxeta; Peer Bork; Miguel A Andrade
Journal:  Nat Genet       Date:  2002-05-13       Impact factor: 38.330

2.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

Review 3.  Recent approaches to the prioritization of candidate disease genes.

Authors:  Nadezhda T Doncheva; Tim Kacprowski; Mario Albrecht
Journal:  Wiley Interdiscip Rev Syst Biol Med       Date:  2012-06-11

4.  Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles.

Authors:  Warren A Cheung; Bf Francis Ouellette; Wyeth W Wasserman
Journal:  Genome Med       Date:  2012-09-28       Impact factor: 11.117

5.  Text-derived concept profiles support assessment of DNA microarray data for acute myeloid leukemia and for androgen receptor stimulation.

Authors:  Rob Jelier; Guido Jenster; Lambert C J Dorssers; Bas J Wouters; Peter J M Hendriksen; Barend Mons; Ruud Delwel; Jan A Kors
Journal:  BMC Bioinformatics       Date:  2007-01-18       Impact factor: 3.169

6.  Anni 2.0: a multipurpose text-mining tool for the life sciences.

Authors:  Rob Jelier; Martijn J Schuemie; Antoine Veldhoven; Lambert C J Dorssers; Guido Jenster; Jan A Kors
Journal:  Genome Biol       Date:  2008-06-12       Impact factor: 13.583

7.  Identification of novel risk genes associated with type 1 diabetes mellitus using a genome-wide gene-based association analysis.

Authors:  Ying-Hua Qiu; Fei-Yan Deng; Min-Jing Li; Shu-Feng Lei
Journal:  J Diabetes Investig       Date:  2014-04-02       Impact factor: 4.232

8.  GeneDistiller--distilling candidate genes from linkage intervals.

Authors:  Dominik Seelow; Jana Marie Schwarz; Markus Schuelke
Journal:  PLoS One       Date:  2008-12-05       Impact factor: 3.240

9.  TXTGate: profiling gene groups with text-based information.

Authors:  Patrick Glenisson; Bert Coessens; Steven Van Vooren; Janick Mathys; Yves Moreau; Bart De Moor
Journal:  Genome Biol       Date:  2004-05-28       Impact factor: 13.583

10.  ENDEAVOUR update: a web resource for gene prioritization in multiple species.

Authors:  Léon-Charles Tranchevent; Roland Barriot; Shi Yu; Steven Van Vooren; Peter Van Loo; Bert Coessens; Bart De Moor; Stein Aerts; Yves Moreau
Journal:  Nucleic Acids Res       Date:  2008-05-28       Impact factor: 16.971

View more
  10 in total

1.  Combining Literature Mining and Machine Learning for Predicting Biomedical Discoveries.

Authors:  Balu Bhasuran
Journal:  Methods Mol Biol       Date:  2022

2.  Cytoplasmic LMO2-LDB1 Complex Activates STAT3 Signaling through Interaction with gp130-JAK in Glioma Stem Cells.

Authors:  Cheol Gyu Park; Sang-Hun Choi; Seon Yong Lee; Kiyoung Eun; Min Gi Park; Junseok Jang; Hyeon Ju Jeong; Seong Jin Kim; Sohee Jeong; Kanghun Lee; Hyunggee Kim
Journal:  Cells       Date:  2022-06-26       Impact factor: 7.666

3.  Gene prioritization using Bayesian matrix factorization with genomic and phenotypic side information.

Authors:  Pooya Zakeri; Jaak Simm; Adam Arany; Sarah ElShal; Yves Moreau
Journal:  Bioinformatics       Date:  2018-07-01       Impact factor: 6.937

4.  Candidate Genes and MiRNAs Linked to the Inverse Relationship Between Cancer and Alzheimer's Disease: Insights From Data Mining and Enrichment Analysis.

Authors:  Cristina Battaglia; Marco Venturin; Aleksandra Sojic; Nithiya Jesuthasan; Alessandro Orro; Roberta Spinelli; Massimo Musicco; Gianluca De Bellis; Fulvio Adorni
Journal:  Front Genet       Date:  2019-09-24       Impact factor: 4.599

5.  PRIORI-T: A tool for rare disease gene prioritization using MEDLINE.

Authors:  Aditya Rao; Thomas Joseph; Vangala G Saipradeep; Sujatha Kotte; Naveen Sivadasan; Rajgopal Srinivasan
Journal:  PLoS One       Date:  2020-04-21       Impact factor: 3.240

6.  Collaborative Cross Mice Yield Genetic Modifiers for Pseudomonas aeruginosa Infection in Human Lung Disease.

Authors:  Nicola Ivan Lorè; Barbara Sipione; Gengming He; Lisa J Strug; Hanifa J Atamni; Alexandra Dorman; Richard Mott; Fuad A Iraqi; Alessandra Bragonzi
Journal:  mBio       Date:  2020-03-03       Impact factor: 7.867

7.  RNA Sequencing of CD4+ T Cells in Relapsing-Remitting Multiple Sclerosis Patients at Relapse: Deciphering the Involvement of Novel genes and Pathways.

Authors:  Saeed Talebi; Samaneh Maleknia; Fahimeh Palizban; Zahra Salehi; Abdorreza Naser Moghadasi; Kaveh Kavousi; Mohammad Ali Sahraian; Maryam Izad
Journal:  J Mol Neurosci       Date:  2021-07-21       Impact factor: 3.444

8.  Drug repositioning or target repositioning: A structural perspective of drug-target-indication relationship for available repurposed drugs.

Authors:  Daniele Parisi; Melissa F Adasme; Anastasia Sveshnikova; Sarah Naomi Bolz; Yves Moreau; Michael Schroeder
Journal:  Comput Struct Biotechnol J       Date:  2020-04-13       Impact factor: 7.271

9.  Interpreting pathways to discover cancer driver genes with Moonlight.

Authors:  Antonio Colaprico; Catharina Olsen; Matthew H Bailey; Gabriel J Odom; Thilde Terkelsen; Tiago C Silva; André V Olsen; Laura Cantini; Andrei Zinovyev; Emmanuel Barillot; Houtan Noushmehr; Gloria Bertoli; Isabella Castiglioni; Claudia Cava; Gianluca Bontempi; Xi Steven Chen; Elena Papaleo
Journal:  Nat Commun       Date:  2020-01-03       Impact factor: 14.919

10.  Preclinical validation of therapeutic targets predicted by tensor factorization on heterogeneous graphs.

Authors:  Saee Paliwal; Alex de Giorgio; Daniel Neil; Jean-Baptiste Michel; Alix Mb Lacoste
Journal:  Sci Rep       Date:  2020-10-26       Impact factor: 4.379

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.