Literature DB >> 17646288

Kernel-based data fusion for gene prioritization.

Tijl De Bie1, Léon-Charles Tranchevent, Liesbeth M M van Oeffelen, Yves Moreau.   

Abstract

MOTIVATION: Hunting disease genes is a problem of primary importance in biomedical research. Biologists usually approach this problem in two steps: first a set of candidate genes is identified using traditional positional cloning or high-throughput genomics techniques; second, these genes are further investigated and validated in the wet lab, one by one. To speed up discovery and limit the number of costly wet lab experiments, biologists must test the candidate genes starting with the most probable candidates. So far, biologists have relied on literature studies, extensive queries to multiple databases and hunches about expected properties of the disease gene to determine such an ordering. Recently, we have introduced the data mining tool ENDEAVOUR (Aerts et al., 2006), which performs this task automatically by relying on different genome-wide data sources, such as Gene Ontology, literature, microarray, sequence and more.
RESULTS: In this article, we present a novel kernel method that operates in the same setting: based on a number of different views on a set of training genes, a prioritization of test genes is obtained. We furthermore provide a thorough learning theoretical analysis of the method's guaranteed performance. Finally, we apply the method to the disease data sets on which ENDEAVOUR (Aerts et al., 2006) has been benchmarked, and report a considerable improvement in empirical performance. AVAILABILITY: The MATLAB code used in the empirical results will be made publicly available.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17646288     DOI: 10.1093/bioinformatics/btm187

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  38 in total

Review 1.  Computational tools for prioritizing candidate genes: boosting disease gene discovery.

Authors:  Yves Moreau; Léon-Charles Tranchevent
Journal:  Nat Rev Genet       Date:  2012-07-03       Impact factor: 53.242

2.  Improving biomarker list stability by integration of biological knowledge in the learning process.

Authors:  Tiziana Sanavia; Fabio Aiolli; Giovanni Da San Martino; Andrea Bisognin; Barbara Di Camillo
Journal:  BMC Bioinformatics       Date:  2012-03-28       Impact factor: 3.169

3.  Automated annotation of Drosophila gene expression patterns using a controlled vocabulary.

Authors:  Shuiwang Ji; Liang Sun; Rong Jin; Sudhir Kumar; Jieping Ye
Journal:  Bioinformatics       Date:  2008-07-16       Impact factor: 6.937

Review 4.  Candidate gene prioritization.

Authors:  Ali Masoudi-Nejad; Alireza Meshkin; Behzad Haji-Eghrari; Gholamreza Bidkhori
Journal:  Mol Genet Genomics       Date:  2012-08-15       Impact factor: 3.291

5.  Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities.

Authors:  Marinka Zitnik; Francis Nguyen; Bo Wang; Jure Leskovec; Anna Goldenberg; Michael M Hoffman
Journal:  Inf Fusion       Date:  2018-09-21       Impact factor: 12.975

Review 6.  Kernel methods for large-scale genomic data analysis.

Authors:  Xuefeng Wang; Eric P Xing; Daniel J Schaid
Journal:  Brief Bioinform       Date:  2014-07-22       Impact factor: 11.622

7.  Multimodal classification of Alzheimer's disease and mild cognitive impairment.

Authors:  Daoqiang Zhang; Yaping Wang; Luping Zhou; Hong Yuan; Dinggang Shen
Journal:  Neuroimage       Date:  2011-01-12       Impact factor: 6.556

8.  ProDiGe: Prioritization Of Disease Genes with multitask machine learning from positive and unlabeled examples.

Authors:  Fantine Mordelet; Jean-Philippe Vert
Journal:  BMC Bioinformatics       Date:  2011-10-06       Impact factor: 3.169

9.  L2-norm multiple kernel learning and its application to biomedical data fusion.

Authors:  Shi Yu; Tillmann Falck; Anneleen Daemen; Leon-Charles Tranchevent; Johan Ak Suykens; Bart De Moor; Yves Moreau
Journal:  BMC Bioinformatics       Date:  2010-06-08       Impact factor: 3.169

10.  Genetic-linkage mapping of complex hereditary disorders to a whole-genome molecular-interaction network.

Authors:  Ivan Iossifov; Tian Zheng; Miron Baron; T Conrad Gilliam; Andrey Rzhetsky
Journal:  Genome Res       Date:  2008-04-16       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.