Literature DB >> 22777693

Use of support vector machines for disease risk prediction in genome-wide association studies: concerns and opportunities.

Florian Mittag1, Finja Büchel, Mohamad Saad, Andreas Jahn, Claudia Schulte, Zoltan Bochdanovits, Javier Simón-Sánchez, Mike A Nalls, Margaux Keller, Dena G Hernandez, J Raphael Gibbs, Suzanne Lesage, Alexis Brice, Peter Heutink, Maria Martinez, Nicholas W Wood, John Hardy, Andrew B Singleton, Andreas Zell, Thomas Gasser, Manu Sharma.   

Abstract

The success of genome-wide association studies (GWAS) in deciphering the genetic architecture of complex diseases has fueled the expectations whether the individual risk can also be quantified based on the genetic architecture. So far, disease risk prediction based on top-validated single-nucleotide polymorphisms (SNPs) showed little predictive value. Here, we applied a support vector machine (SVM) to Parkinson disease (PD) and type 1 diabetes (T1D), to show that apart from magnitude of effect size of risk variants, heritability of the disease also plays an important role in disease risk prediction. Furthermore, we performed a simulation study to show the role of uncommon (frequency 1-5%) as well as rare variants (frequency <1%) in disease etiology of complex diseases. Using a cross-validation model, we were able to achieve predictions with an area under the receiver operating characteristic curve (AUC) of ~0.88 for T1D, highlighting the strong heritable component (∼90%). This is in contrast to PD, where we were unable to achieve a satisfactory prediction (AUC ~0.56; heritability ~38%). Our simulations showed that simultaneous inclusion of uncommon and rare variants in GWAS would eventually lead to feasible disease risk prediction for complex diseases such as PD. The used software is available at http://www.ra.cs.uni-tuebingen.de/software/MACLEAPS/.
© 2012 Wiley Periodicals, Inc.

Entities:  

Mesh:

Year:  2012        PMID: 22777693      PMCID: PMC5968822          DOI: 10.1002/humu.22161

Source DB:  PubMed          Journal:  Hum Mutat        ISSN: 1059-7794            Impact factor:   4.878


  28 in total

Review 1.  Genomewide association studies and assessment of the risk of disease.

Authors:  Teri A Manolio
Journal:  N Engl J Med       Date:  2010-07-08       Impact factor: 91.245

Review 2.  Genome-wide association studies for common diseases and complex traits.

Authors:  Joel N Hirschhorn; Mark J Daly
Journal:  Nat Rev Genet       Date:  2005-02       Impact factor: 53.242

3.  Towards predictive genetic testing of complex diseases.

Authors:  A Cecile J W Janssens; Cornelia M van Duijn
Journal:  Eur J Epidemiol       Date:  2006-12-08       Impact factor: 8.082

4.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

Review 5.  Validating, augmenting and refining genome-wide association signals.

Authors:  John P A Ioannidis; Gilles Thomas; Mark J Daly
Journal:  Nat Rev Genet       Date:  2009-05       Impact factor: 53.242

6.  Harnessing the information contained within genome-wide association studies to improve individual prediction of complex disease risk.

Authors:  David M Evans; Peter M Visscher; Naomi R Wray
Journal:  Hum Mol Genet       Date:  2009-06-24       Impact factor: 6.150

7.  Genome-wide association study confirms BST1 and suggests a locus on 12q24 as the risk loci for Parkinson's disease in the European population.

Authors:  Mohamad Saad; Suzanne Lesage; Aude Saint-Pierre; Jean-Christophe Corvol; Diana Zelenika; Jean-Charles Lambert; Marie Vidailhet; George D Mellick; Ebba Lohmann; Franck Durif; Pierre Pollak; Philippe Damier; François Tison; Peter A Silburn; Christophe Tzourio; Sylvie Forlani; Marie-Anne Loriot; Maurice Giroud; Catherine Helmer; Florence Portet; Philippe Amouyel; Mark Lathrop; Alexis Elbaz; Alexandra Durr; Maria Martinez; Alexis Brice
Journal:  Hum Mol Genet       Date:  2010-11-17       Impact factor: 6.150

8.  Designing candidate gene and genome-wide case-control association studies.

Authors:  Krina T Zondervan; Lon R Cardon
Journal:  Nat Protoc       Date:  2007       Impact factor: 13.491

9.  Imputation of sequence variants for identification of genetic risks for Parkinson's disease: a meta-analysis of genome-wide association studies.

Authors:  Michael A Nalls; Vincent Plagnol; Dena G Hernandez; Manu Sharma; Una-Marie Sheerin; Mohamad Saad; J Simón-Sánchez; Claudia Schulte; Suzanne Lesage; Sigurlaug Sveinbjörnsdóttir; Kári Stefánsson; Maria Martinez; John Hardy; Peter Heutink; Alexis Brice; Thomas Gasser; Andrew B Singleton; Nicholas W Wood
Journal:  Lancet       Date:  2011-02-01       Impact factor: 79.321

10.  Predictive testing for complex diseases using multiple genes: fact or fiction?

Authors:  A Cecile J W Janssens; Yurii S Aulchenko; Stefano Elefante; Gerard J J M Borsboom; Ewout W Steyerberg; Cornelia M van Duijn
Journal:  Genet Med       Date:  2006-07       Impact factor: 8.822

View more
  10 in total

Review 1.  The basics of data, big data, and machine learning in clinical practice.

Authors:  David Soriano-Valdez; Ingris Pelaez-Ballestas; Amaranta Manrique de Lara; Alfonso Gastelum-Strozzi
Journal:  Clin Rheumatol       Date:  2020-06-05       Impact factor: 2.980

2.  SNPs selection using support vector regression and genetic algorithms in GWAS.

Authors:  Fabrízzio Condé de Oliveira; Carlos Cristiano Hasenclever Borges; Fernanda Nascimento Almeida; Fabyano Fonseca e Silva; Rui da Silva Verneque; Marcos Vinicius G B da Silva; Wagner Arbex
Journal:  BMC Genomics       Date:  2014-10-27       Impact factor: 3.969

3.  Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications.

Authors:  Paul Thottakkara; Tezcan Ozrazgat-Baslanti; Bradley B Hupf; Parisa Rashidi; Panos Pardalos; Petar Momcilovic; Azra Bihorac
Journal:  PLoS One       Date:  2016-05-27       Impact factor: 3.240

4.  Combining Multiple Hypothesis Testing with Machine Learning Increases the Statistical Power of Genome-wide Association Studies.

Authors:  Bettina Mieth; Marius Kloft; Juan Antonio Rodríguez; Sören Sonnenburg; Robin Vobruba; Carlos Morcillo-Suárez; Xavier Farré; Urko M Marigorta; Ernst Fehr; Thorsten Dickhaus; Gilles Blanchard; Daniel Schunk; Arcadi Navarro; Klaus-Robert Müller
Journal:  Sci Rep       Date:  2016-11-28       Impact factor: 4.379

5.  Prediction of Smoking Behavior From Single Nucleotide Polymorphisms With Machine Learning Approaches.

Authors:  Yi Xu; Liyu Cao; Xinyi Zhao; Yinghao Yao; Qiang Liu; Bin Zhang; Yan Wang; Ying Mao; Yunlong Ma; Jennie Z Ma; Thomas J Payne; Ming D Li; Lanjuan Li
Journal:  Front Psychiatry       Date:  2020-05-14       Impact factor: 4.157

6.  Comparison of regression imputation methods of baseline covariates that predict survival outcomes.

Authors:  Nicole Solomon; Yuliya Lokhnygina; Susan Halabi
Journal:  J Clin Transl Sci       Date:  2020-09-04

7.  Influence of Feature Encoding and Choice of Classifier on Disease Risk Prediction in Genome-Wide Association Studies.

Authors:  Florian Mittag; Michael Römer; Andreas Zell
Journal:  PLoS One       Date:  2015-08-18       Impact factor: 3.240

8.  Parkinson's disease: dopaminergic nerve cell model is consistent with experimental finding of increased extracellular transport of α-synuclein.

Authors:  Finja Büchel; Sandra Saliger; Andreas Dräger; Stephanie Hoffmann; Clemens Wrzodek; Andreas Zell; Philipp J Kahle
Journal:  BMC Neurosci       Date:  2013-11-06       Impact factor: 3.288

9.  Orchestrated increase of dopamine and PARK mRNAs but not miR-133b in dopamine neurons in Parkinson's disease.

Authors:  Falk Schlaudraff; Jan Gründemann; Michael Fauler; Elena Dragicevic; John Hardy; Birgit Liss
Journal:  Neurobiol Aging       Date:  2014-03-22       Impact factor: 4.673

10.  Genetic sharing and heritability of paediatric age of onset autoimmune diseases.

Authors:  Yun R Li; Sihai D Zhao; Jin Li; Jonathan P Bradfield; Maede Mohebnasab; Laura Steel; Julie Kobie; Debra J Abrams; Frank D Mentch; Joseph T Glessner; Yiran Guo; Zhi Wei; John J Connolly; Christopher J Cardinale; Marina Bakay; Dong Li; S Melkorka Maggadottir; Kelly A Thomas; Haijun Qui; Rosetta M Chiavacci; Cecilia E Kim; Fengxiang Wang; James Snyder; Berit Flatø; Øystein Førre; Lee A Denson; Susan D Thompson; Mara L Becker; Stephen L Guthery; Anna Latiano; Elena Perez; Elena Resnick; Caterina Strisciuglio; Annamaria Staiano; Erasmo Miele; Mark S Silverberg; Benedicte A Lie; Marilynn Punaro; Richard K Russell; David C Wilson; Marla C Dubinsky; Dimitri S Monos; Vito Annese; Jane E Munro; Carol Wise; Helen Chapel; Charlotte Cunningham-Rundles; Jordan S Orange; Edward M Behrens; Kathleen E Sullivan; Subra Kugathasan; Anne M Griffiths; Jack Satsangi; Struan F A Grant; Patrick M A Sleiman; Terri H Finkel; Constantin Polychronakos; Robert N Baldassano; Eline T Luning Prak; Justine A Ellis; Hongzhe Li; Brendan J Keating; Hakon Hakonarson
Journal:  Nat Commun       Date:  2015-10-09       Impact factor: 17.694

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.