Literature DB >> 15130933

A statistical framework for genomic data fusion.

Gert R G Lanckriet1, Tijl De Bie, Nello Cristianini, Michael I Jordan, William Stafford Noble.   

Abstract

MOTIVATION: During the past decade, the new focus on genomics has highlighted a particular challenge: to integrate the different views of the genome that are provided by various types of experimental data.
RESULTS: This paper describes a computational framework for integrating and drawing inferences from a collection of genome-wide measurements. Each dataset is represented via a kernel function, which defines generalized similarity relationships between pairs of entities, such as genes or proteins. The kernel representation is both flexible and efficient, and can be applied to many different types of data. Furthermore, kernel functions derived from different types of data can be combined in a straightforward fashion. Recent advances in the theory of kernel methods have provided efficient algorithms to perform such combinations in a way that minimizes a statistical loss function. These methods exploit semidefinite programming techniques to reduce the problem of finding optimizing kernel combinations to a convex optimization problem. Computational experiments performed using yeast genome-wide datasets, including amino acid sequences, hydropathy profiles, gene expression data and known protein-protein interactions, demonstrate the utility of this approach. A statistical learning algorithm trained from all of these data to recognize particular classes of proteins--membrane proteins and ribosomal proteins--performs significantly better than the same algorithm trained on any single type of data. AVAILABILITY: Supplementary data at http://noble.gs.washington.edu/proj/sdp-svm

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15130933     DOI: 10.1093/bioinformatics/bth294

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  117 in total

Review 1.  Computational tools for prioritizing candidate genes: boosting disease gene discovery.

Authors:  Yves Moreau; Léon-Charles Tranchevent
Journal:  Nat Rev Genet       Date:  2012-07-03       Impact factor: 53.242

Review 2.  Genomic similarity and kernel methods II: methods for genomic information.

Authors:  Daniel J Schaid
Journal:  Hum Hered       Date:  2010-07-03       Impact factor: 0.444

Review 3.  Methods for biological data integration: perspectives and challenges.

Authors:  Vladimir Gligorijević; Nataša Pržulj
Journal:  J R Soc Interface       Date:  2015-11-06       Impact factor: 4.118

4.  Optimized approach to decision fusion of heterogeneous data for breast cancer diagnosis.

Authors:  Jonathan L Jesneck; Loren W Nolte; Jay A Baker; Carey E Floyd; Joseph Y Lo
Journal:  Med Phys       Date:  2006-08       Impact factor: 4.071

5.  Automated annotation of Drosophila gene expression patterns using a controlled vocabulary.

Authors:  Shuiwang Ji; Liang Sun; Rong Jin; Sudhir Kumar; Jieping Ye
Journal:  Bioinformatics       Date:  2008-07-16       Impact factor: 6.937

6.  Integrative approaches for predicting protein function and prioritizing genes for complex phenotypes using protein interaction networks.

Authors:  Xiaotu Ma; Ting Chen; Fengzhu Sun
Journal:  Brief Bioinform       Date:  2013-06-19       Impact factor: 11.622

7.  The impact of incomplete knowledge on evaluation: an experimental benchmark for protein function prediction.

Authors:  Curtis Huttenhower; Matthew A Hibbs; Chad L Myers; Amy A Caudy; David C Hess; Olga G Troyanskaya
Journal:  Bioinformatics       Date:  2009-06-26       Impact factor: 6.937

8.  Subtyping of Gliomaby Combining Gene Expression and CNVs Data Based on a Compressive Sensing Approach.

Authors:  Wenlong Tang; Hongbao Cao; Ji-Gang Zhang; Junbo Duan; Dongdong Lin; Yu-Ping Wang
Journal:  Adv Genet Eng       Date:  2012-01-16

9.  Graph- and rule-based learning algorithms: a comprehensive review of their applications for cancer type classification and prognosis using genomic data.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Brief Bioinform       Date:  2020-03-23       Impact factor: 11.622

10.  Protein-ligand interaction prediction: an improved chemogenomics approach.

Authors:  Laurent Jacob; Jean-Philippe Vert
Journal:  Bioinformatics       Date:  2008-08-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.