Literature DB >> 21977986

ProDiGe: Prioritization Of Disease Genes with multitask machine learning from positive and unlabeled examples.

Fantine Mordelet1, Jean-Philippe Vert.   

Abstract

BACKGROUND: Elucidating the genetic basis of human diseases is a central goal of genetics and molecular biology. While traditional linkage analysis and modern high-throughput techniques often provide long lists of tens or hundreds of disease gene candidates, the identification of disease genes among the candidates remains time-consuming and expensive. Efficient computational methods are therefore needed to prioritize genes within the list of candidates, by exploiting the wealth of information available about the genes in various databases.
RESULTS: We propose ProDiGe, a novel algorithm for Prioritization of Disease Genes. ProDiGe implements a novel machine learning strategy based on learning from positive and unlabeled examples, which allows to integrate various sources of information about the genes, to share information about known disease genes across diseases, and to perform genome-wide searches for new disease genes. Experiments on real data show that ProDiGe outperforms state-of-the-art methods for the prioritization of genes in human diseases.
CONCLUSIONS: ProDiGe implements a new machine learning paradigm for gene prioritization, which could help the identification of new disease genes. It is freely available at http://cbio.ensmp.fr/prodige.

Entities:  

Mesh:

Year:  2011        PMID: 21977986      PMCID: PMC3215680          DOI: 10.1186/1471-2105-12-389

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  30 in total

1.  Estimating the support of a high-dimensional distribution.

Authors:  B Schölkopf; J C Platt; J Shawe-Taylor; A J Smola; R C Williamson
Journal:  Neural Comput       Date:  2001-07       Impact factor: 2.026

2.  Learning gene functional classifications from multiple data types.

Authors:  Paul Pavlidis; Jason Weston; Jinsong Cai; William Stafford Noble
Journal:  J Comput Biol       Date:  2002       Impact factor: 1.479

3.  Association of genes to genetically inherited diseases using data mining.

Authors:  Carolina Perez-Iratxeta; Peer Bork; Miguel A Andrade
Journal:  Nat Genet       Date:  2002-05-13       Impact factor: 38.330

4.  Efficient peptide-MHC-I binding prediction for alleles with few known binders.

Authors:  Laurent Jacob; Jean-Philippe Vert
Journal:  Bioinformatics       Date:  2007-12-14       Impact factor: 6.937

5.  A human phenome-interactome network of protein complexes implicated in genetic disorders.

Authors:  Kasper Lage; E Olof Karlberg; Zenia M Størling; Páll I Olason; Anders G Pedersen; Olga Rigina; Anders M Hinsby; Zeynep Tümer; Flemming Pociot; Niels Tommerup; Yves Moreau; Søren Brunak
Journal:  Nat Biotechnol       Date:  2007-03       Impact factor: 54.908

6.  Walking the interactome for prioritization of candidate disease genes.

Authors:  Sebastian Köhler; Sebastian Bauer; Denise Horn; Peter N Robinson
Journal:  Am J Hum Genet       Date:  2008-03-27       Impact factor: 11.025

7.  Large-scale analysis of the human and mouse transcriptomes.

Authors:  Andrew I Su; Michael P Cooke; Keith A Ching; Yaron Hakak; John R Walker; Tim Wiltshire; Anthony P Orth; Raquel G Vega; Lisa M Sapinoso; Aziz Moqrich; Ardem Patapoutian; Garret M Hampton; Peter G Schultz; John B Hogenesch
Journal:  Proc Natl Acad Sci U S A       Date:  2002-03-19       Impact factor: 11.205

8.  Speeding disease gene discovery by sequence based candidate prioritization.

Authors:  Euan A Adie; Richard R Adams; Kathryn L Evans; David J Porteous; Ben S Pickard
Journal:  BMC Bioinformatics       Date:  2005-03-14       Impact factor: 3.169

9.  Mendelian Inheritance in Man and its online version, OMIM.

Authors:  Victor A McKusick
Journal:  Am J Hum Genet       Date:  2007-03-08       Impact factor: 11.025

10.  POCUS: mining genomic sequence annotation to predict disease genes.

Authors:  Frances S Turner; Daniel R Clutterbuck; Colin A M Semple
Journal:  Genome Biol       Date:  2003-10-10       Impact factor: 13.583

View more
  33 in total

1.  Multi-task learning for pKa prediction.

Authors:  Grigorios Skolidis; Katja Hansen; Guido Sanguinetti; Matthias Rupp
Journal:  J Comput Aided Mol Des       Date:  2012-06-20       Impact factor: 3.686

Review 2.  Systems genetics in "-omics" era: current and future development.

Authors:  Hong Li
Journal:  Theory Biosci       Date:  2012-11-09       Impact factor: 1.919

3.  Inferring Protein Sequence-Function Relationships with Large-Scale Positive-Unlabeled Learning.

Authors:  Hyebin Song; Bennett J Bremer; Emily C Hinds; Garvesh Raskutti; Philip A Romero
Journal:  Cell Syst       Date:  2020-11-18       Impact factor: 10.304

4.  Prediction and validation of gene-disease associations using methods inspired by social network analyses.

Authors:  U Martin Singh-Blom; Nagarajan Natarajan; Ambuj Tewari; John O Woods; Inderjit S Dhillon; Edward M Marcotte
Journal:  PLoS One       Date:  2013-05-01       Impact factor: 3.240

5.  Positive-unlabeled learning for the prediction of conformational B-cell epitopes.

Authors:  Jing Ren; Qian Liu; John Ellis; Jinyan Li
Journal:  BMC Bioinformatics       Date:  2015-12-09       Impact factor: 3.169

6.  Functional networks inference from rule-based machine learning models.

Authors:  Nicola Lazzarini; Paweł Widera; Stuart Williamson; Rakesh Heer; Natalio Krasnogor; Jaume Bacardit
Journal:  BioData Min       Date:  2016-09-05       Impact factor: 2.522

7.  SGFSC: speeding the gene functional similarity calculation based on hash tables.

Authors:  Zhen Tian; Chunyu Wang; Maozu Guo; Xiaoyan Liu; Zhixia Teng
Journal:  BMC Bioinformatics       Date:  2016-11-04       Impact factor: 3.169

8.  Positive-unlabeled learning for disease gene identification.

Authors:  Peng Yang; Xiao-Li Li; Jian-Ping Mei; Chee-Keong Kwoh; See-Kiong Ng
Journal:  Bioinformatics       Date:  2012-08-24       Impact factor: 6.937

9.  Ensemble positive unlabeled learning for disease gene identification.

Authors:  Peng Yang; Xiaoli Li; Hon-Nian Chua; Chee-Keong Kwoh; See-Kiong Ng
Journal:  PLoS One       Date:  2014-05-09       Impact factor: 3.240

10.  Inductive matrix completion for predicting gene-disease associations.

Authors:  Nagarajan Natarajan; Inderjit S Dhillon
Journal:  Bioinformatics       Date:  2014-06-15       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.