Stephen W Hartley1, Paola Sebastiani. 1. National Institutes of Health/National Human Genome Research Institute, 5625 Fishers Lane, Rockville, MD 20850, USA. stephen.hartley@nih.gov
Abstract
MOTIVATION: Although several studies have used Bayesian classifiers for risk prediction using genome-wide single nucleotide polymorphism (SNP) datasets, no software can efficiently perform these analyses on massive genetic datasets and can accommodate multiple traits. RESULTS: We describe the program PleioGRiP that performs a genome-wide Bayesian model search to identify SNPs associated with a discrete phenotype and uses SNPs ranked by Bayes factor to produce nested Bayesian classifiers. These classifiers can be used for genetic risk prediction, either selecting the classifier with optimal number of features or using an ensemble of classifiers. In addition, PleioGRiP implements an extension to the Bayesian search and classification and can search for pleiotropic relationships in which SNPs are simultaneously associated with two or more distinct phenotypes. These relationships can be used to generate connected Bayesian classifiers to predict the phenotype of interest either using genetic data alone or in combination with the secondary phenotype(s). AVAILABILITY: PleioGRiP is implemented in Java, and it is available from http://hdl.handle.net/2144/4367. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Although several studies have used Bayesian classifiers for risk prediction using genome-wide single nucleotide polymorphism (SNP) datasets, no software can efficiently perform these analyses on massive genetic datasets and can accommodate multiple traits. RESULTS: We describe the program PleioGRiP that performs a genome-wide Bayesian model search to identify SNPs associated with a discrete phenotype and uses SNPs ranked by Bayes factor to produce nested Bayesian classifiers. These classifiers can be used for genetic risk prediction, either selecting the classifier with optimal number of features or using an ensemble of classifiers. In addition, PleioGRiP implements an extension to the Bayesian search and classification and can search for pleiotropic relationships in which SNPs are simultaneously associated with two or more distinct phenotypes. These relationships can be used to generate connected Bayesian classifiers to predict the phenotype of interest either using genetic data alone or in combination with the secondary phenotype(s). AVAILABILITY: PleioGRiP is implemented in Java, and it is available from http://hdl.handle.net/2144/4367. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Sebastian Okser; Terho Lehtimäki; Laura L Elo; Nina Mononen; Nina Peltonen; Mika Kähönen; Markus Juonala; Yue-Mei Fan; Jussi A Hernesniemi; Tomi Laitinen; Leo-Pekka Lyytikäinen; Riikka Rontu; Carita Eklund; Nina Hutri-Kähönen; Leena Taittonen; Mikko Hurme; Jorma S A Viikari; Olli T Raitakari; Tero Aittokallio Journal: PLoS Genet Date: 2010-09-30 Impact factor: 5.917
Authors: Stephen W Hartley; Stefano Monti; Ching-Ti Liu; Martin H Steinberg; Paola Sebastiani Journal: Front Genet Date: 2012-09-11 Impact factor: 4.599
Authors: Paola Sebastiani; Nadia Solovieff; Andrew T Dewan; Kyle M Walsh; Annibale Puca; Stephen W Hartley; Efthymia Melista; Stacy Andersen; Daniel A Dworkis; Jemma B Wilk; Richard H Myers; Martin H Steinberg; Monty Montano; Clinton T Baldwin; Josephine Hoh; Thomas T Perls Journal: PLoS One Date: 2012-01-18 Impact factor: 3.240