Literature DB >> 12015889

Learning gene functional classifications from multiple data types.

Paul Pavlidis1, Jason Weston, Jinsong Cai, William Stafford Noble.   

Abstract

In our attempts to understand cellular function at the molecular level, we must be able to synthesize information from disparate types of genomic data. We consider the problem of inferring gene functional classifications from a heterogeneous data set consisting of DNA microarray expression measurements and phylogenetic profiles from whole-genome sequence comparisons. We demonstrate the application of the support vector machine (SVM) learning algorithm to this functional inference task. Our results suggest the importance of exploiting prior information about the heterogeneity of the data. In particular, we propose an SVM kernel function that is explicitly heterogeneous. In addition, we describe feature scaling methods for further exploiting prior knowledge of heterogeneity by giving each data type different weights.

Mesh:

Substances:

Year:  2002        PMID: 12015889     DOI: 10.1089/10665270252935539

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  50 in total

1.  Functional modules by relating protein interaction networks and gene expression.

Authors:  Sabine Tornow; H W Mewes
Journal:  Nucleic Acids Res       Date:  2003-11-01       Impact factor: 16.971

Review 2.  Methods for biological data integration: perspectives and challenges.

Authors:  Vladimir Gligorijević; Nataša Pržulj
Journal:  J R Soc Interface       Date:  2015-11-06       Impact factor: 4.118

3.  The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes.

Authors:  Andreas Ruepp; Alfred Zollner; Dieter Maier; Kaj Albermann; Jean Hani; Martin Mokrejs; Igor Tetko; Ulrich Güldener; Gertrud Mannhaupt; Martin Münsterkötter; H Werner Mewes
Journal:  Nucleic Acids Res       Date:  2004-10-14       Impact factor: 16.971

4.  Optimized approach to decision fusion of heterogeneous data for breast cancer diagnosis.

Authors:  Jonathan L Jesneck; Loren W Nolte; Jay A Baker; Carey E Floyd; Joseph Y Lo
Journal:  Med Phys       Date:  2006-08       Impact factor: 4.071

5.  Large datasets in biomedicine: a discussion of salient analytic issues.

Authors:  Anshu Sinha; George Hripcsak; Marianthi Markatou
Journal:  J Am Med Inform Assoc       Date:  2009-08-28       Impact factor: 4.497

6.  CCR2 modulates inflammatory and metabolic effects of high-fat feeding.

Authors:  Stuart P Weisberg; Deborah Hunter; Reid Huber; Jacob Lemieux; Sarah Slaymaker; Kris Vaddi; Israel Charo; Rudolph L Leibel; Anthony W Ferrante
Journal:  J Clin Invest       Date:  2005-12-08       Impact factor: 14.808

7.  Multiple kernel learning with random effects for predicting longitudinal outcomes and data integration.

Authors:  Tianle Chen; Donglin Zeng; Yuanjia Wang
Journal:  Biometrics       Date:  2015-07-14       Impact factor: 2.571

8.  XML-based approaches for the integration of heterogeneous bio-molecular data.

Authors:  Marco Mesiti; Ernesto Jiménez-Ruiz; Ismael Sanz; Rafael Berlanga-Llavori; Paolo Perlasca; Giorgio Valentini; David Manset
Journal:  BMC Bioinformatics       Date:  2009-10-15       Impact factor: 3.169

9.  Fast integration of heterogeneous data sources for predicting gene function with limited annotation.

Authors:  Sara Mostafavi; Quaid Morris
Journal:  Bioinformatics       Date:  2010-05-27       Impact factor: 6.937

10.  Proteome-wide prediction of novel DNA/RNA-binding proteins using amino acid composition and periodicity in the hyperthermophilic archaeon Pyrococcus furiosus.

Authors:  Kosuke Fujishima; Mizuki Komasa; Sayaka Kitamura; Haruo Suzuki; Masaru Tomita; Akio Kanai
Journal:  DNA Res       Date:  2007-06-15       Impact factor: 4.458

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.