Literature DB >> 21672907

The application of naive Bayes model averaging to predict Alzheimer's disease from genome-wide data.

Wei Wei1, Shyam Visweswaran, Gregory F Cooper.   

Abstract

OBJECTIVE: Predicting patient outcomes from genome-wide measurements holds significant promise for improving clinical care. The large number of measurements (eg, single nucleotide polymorphisms (SNPs)), however, makes this task computationally challenging. This paper evaluates the performance of an algorithm that predicts patient outcomes from genome-wide data by efficiently model averaging over an exponential number of naive Bayes (NB) models.
DESIGN: This model-averaged naive Bayes (MANB) method was applied to predict late onset Alzheimer's disease in 1411 individuals who each had 312,318 SNP measurements available as genome-wide predictive features. Its performance was compared to that of a naive Bayes algorithm without feature selection (NB) and with feature selection (FSNB). MEASUREMENT: Performance of each algorithm was measured in terms of area under the ROC curve (AUC), calibration, and run time.
RESULTS: The training time of MANB (16.1 s) was fast like NB (15.6 s), while FSNB (1684.2 s) was considerably slower. Each of the three algorithms required less than 0.1 s to predict the outcome of a test case. MANB had an AUC of 0.72, which is significantly better than the AUC of 0.59 by NB (p<0.00001), but not significantly different from the AUC of 0.71 by FSNB. MANB was better calibrated than NB, and FSNB was even better in calibration. A limitation was that only one dataset and two comparison algorithms were included in this study.
CONCLUSION: MANB performed comparatively well in predicting a clinical outcome from a high-dimensional genome-wide dataset. These results provide support for including MANB in the methods used to predict outcomes from large, genome-wide datasets.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21672907      PMCID: PMC3128400          DOI: 10.1136/amiajnl-2011-000101

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  18 in total

1.  A mathematical approach to medical diagnosis. Application to congenital heart disease.

Authors:  H R WARNER; A F TORONTO; L G VEASEY; R STEPHENSON
Journal:  JAMA       Date:  1961-07-22       Impact factor: 56.272

Review 2.  The pursuit of genome-wide association studies: where are we now?

Authors:  Chee Seng Ku; En Yun Loy; Yudi Pawitan; Kee Seng Chia
Journal:  J Hum Genet       Date:  2010-03-19       Impact factor: 3.172

3.  Approaches for evaluating rare polymorphisms in genetic association studies.

Authors:  Qizhai Li; Hong Zhang; Kai Yu
Journal:  Hum Hered       Date:  2010-03-24       Impact factor: 0.444

Review 4.  A review of feature selection techniques in bioinformatics.

Authors:  Yvan Saeys; Iñaki Inza; Pedro Larrañaga
Journal:  Bioinformatics       Date:  2007-08-24       Impact factor: 6.937

5.  Shifting paradigm of association studies: value of rare single-nucleotide polymorphisms.

Authors:  Ivan P Gorlov; Olga Y Gorlova; Shamil R Sunyaev; Margaret R Spitz; Christopher I Amos
Journal:  Am J Hum Genet       Date:  2008-01       Impact factor: 11.025

6.  On Jim Watson's APOE status: genetic information is hard to hide.

Authors:  Dale R Nyholt; Chang-En Yu; Peter M Visscher
Journal:  Eur J Hum Genet       Date:  2008-10-22       Impact factor: 4.246

7.  Fine mapping of the chromosome 10q11-q21 linkage region in Alzheimer's disease cases and controls.

Authors:  Margaret Daniele Fallin; Megan Szymanski; Ruihua Wang; Adrian Gherman; Susan S Bassett; Dimitrios Avramopoulos
Journal:  Neurogenetics       Date:  2010-02-25       Impact factor: 2.660

Review 8.  Common vs. rare allele hypotheses for complex diseases.

Authors:  Nicholas J Schork; Sarah S Murray; Kelly A Frazer; Eric J Topol
Journal:  Curr Opin Genet Dev       Date:  2009-05-28       Impact factor: 5.578

Review 9.  A century of Alzheimer's disease.

Authors:  Michel Goedert; Maria Grazia Spillantini
Journal:  Science       Date:  2006-11-03       Impact factor: 47.728

10.  A high-density whole-genome association study reveals that APOE is the major susceptibility gene for sporadic late-onset Alzheimer's disease.

Authors:  Keith D Coon; Amanda J Myers; David W Craig; Jennifer A Webster; John V Pearson; Diane Hu Lince; Victoria L Zismann; Thomas G Beach; Doris Leung; Leslie Bryden; Rebecca F Halperin; Lauren Marlowe; Mona Kaleem; Douglas G Walker; Rivka Ravid; Christopher B Heward; Joseph Rogers; Andreas Papassotiropoulos; Eric M Reiman; John Hardy; Dietrich A Stephan
Journal:  J Clin Psychiatry       Date:  2007-04       Impact factor: 4.384

View more
  29 in total

Review 1.  Bayesian networks in neuroscience: a survey.

Authors:  Concha Bielza; Pedro Larrañaga
Journal:  Front Comput Neurosci       Date:  2014-10-16       Impact factor: 2.380

2.  Visualizing the operating range of a classification system.

Authors:  George Hripcsak
Journal:  J Am Med Inform Assoc       Date:  2012-01-16       Impact factor: 4.497

3.  Computationally translating molecular discoveries into tools for medicine: translational bioinformatics articles now featured in JAMIA.

Authors:  Atul J Butte; Nigam H Shah
Journal:  J Am Med Inform Assoc       Date:  2011 Jul-Aug       Impact factor: 4.497

4.  Evaluation of a two-stage framework for prediction using big genomic data.

Authors:  Xia Jiang; Richard E Neapolitan
Journal:  Brief Bioinform       Date:  2015-03-18       Impact factor: 11.622

5.  A comparative analysis of methods for predicting clinical outcomes using high-dimensional genomic datasets.

Authors:  Xia Jiang; Binghuang Cai; Diyang Xue; Xinghua Lu; Gregory F Cooper; Richard E Neapolitan
Journal:  J Am Med Inform Assoc       Date:  2014-04-15       Impact factor: 4.497

6.  Sex-specific patterns and differences in dementia and Alzheimer's disease using informatics approaches.

Authors:  Jay Geronimo Ronquillo; Merritt Rachel Baer; William T Lester
Journal:  J Women Aging       Date:  2016-04-22

7.  Improving Personalized Clinical Risk Prediction Based on Causality-Based Association Rules.

Authors:  Chih-Wen Cheng; May D Wang
Journal:  ACM BCB       Date:  2015-09

8.  ICU Outcome Predictions using Physiologic Trends in the First Two Days.

Authors:  Mehmet Kayaalp
Journal:  Comput Cardiol (2010)       Date:  2012

9.  Doubly Optimized Calibrated Support Vector Machine (DOC-SVM): an algorithm for joint optimization of discrimination and calibration.

Authors:  Xiaoqian Jiang; Aditya Menon; Shuang Wang; Jihoon Kim; Lucila Ohno-Machado
Journal:  PLoS One       Date:  2012-11-06       Impact factor: 3.240

10.  Hierarchical Naive Bayes for genetic association studies.

Authors:  Alberto Malovini; Nicola Barbarini; Riccardo Bellazzi; Francesca de Michelis
Journal:  BMC Bioinformatics       Date:  2012-09-07       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.