Literature DB >> 31049651

Statistical learning approaches in the genetic epidemiology of complex diseases.

Anne-Laure Boulesteix1, Marvin N Wright2,3, Sabine Hoffmann4, Inke R König5.   

Abstract

In this paper, we give an overview of methodological issues related to the use of statistical learning approaches when analyzing high-dimensional genetic data. The focus is set on regression models and machine learning algorithms taking genetic variables as input and returning a classification or a prediction for the target variable of interest; for example, the present or future disease status, or the future course of a disease. After briefly explaining the basic motivation and principle of these methods, we review different procedures that can be used to evaluate the accuracy of the obtained models and discuss common flaws that may lead to over-optimistic conclusions with respect to their prediction performance and usefulness.

Entities:  

Keywords:  Cross-validation; High-dimensional data; Omics data; Prognostic model; Regression; Validation

Mesh:

Year:  2019        PMID: 31049651     DOI: 10.1007/s00439-019-01996-9

Source DB:  PubMed          Journal:  Hum Genet        ISSN: 0340-6717            Impact factor:   4.132


  46 in total

Review 1.  Critical review of published microarray studies for cancer outcome and guidelines on statistical analysis and reporting.

Authors:  Alain Dupuy; Richard M Simon
Journal:  J Natl Cancer Inst       Date:  2007-01-17       Impact factor: 13.506

2.  Strategies for developing prediction models from genome-wide association studies.

Authors:  Jincao Wu; Ruth M Pfeiffer; Mitchell H Gail
Journal:  Genet Epidemiol       Date:  2013-10-25       Impact factor: 2.135

3.  Prediction of coronary heart disease using risk factor categories.

Authors:  P W Wilson; R B D'Agostino; D Levy; A M Belanger; H Silbershatz; W B Kannel
Journal:  Circulation       Date:  1998-05-12       Impact factor: 29.690

4.  Extensions of net reclassification improvement calculations to measure usefulness of new biomarkers.

Authors:  Michael J Pencina; Ralph B D'Agostino; Ewout W Steyerberg
Journal:  Stat Med       Date:  2010-11-05       Impact factor: 2.373

Review 5.  Risk estimation and risk prediction using machine-learning methods.

Authors:  Jochen Kruppa; Andreas Ziegler; Inke R König
Journal:  Hum Genet       Date:  2012-07-03       Impact factor: 4.132

6.  Bias in error estimation when using cross-validation for model selection.

Authors:  Sudhir Varma; Richard Simon
Journal:  BMC Bioinformatics       Date:  2006-02-23       Impact factor: 3.169

7.  Leveraging functional annotations in genetic risk prediction for human complex diseases.

Authors:  Yiming Hu; Qiongshi Lu; Ryan Powles; Xinwei Yao; Can Yang; Fang Fang; Xinran Xu; Hongyu Zhao
Journal:  PLoS Comput Biol       Date:  2017-06-08       Impact factor: 4.475

8.  A measure of the impact of CV incompleteness on prediction error estimation with application to PCA and normalization.

Authors:  Roman Hornung; Christoph Bernau; Caroline Truntzer; Rory Wilson; Thomas Stadler; Anne-Laure Boulesteix
Journal:  BMC Med Res Methodol       Date:  2015-11-04       Impact factor: 4.615

Review 9.  Improved prediction of complex diseases by common genetic markers: state of the art and further perspectives.

Authors:  Bent Müller; Arndt Wilcke; Anne-Laure Boulesteix; Jens Brauer; Eberhard Passarge; Johannes Boltze; Holger Kirsten
Journal:  Hum Genet       Date:  2016-02-02       Impact factor: 4.132

10.  The revival of the Gini importance?

Authors:  Stefano Nembrini; Inke R König; Marvin N Wright
Journal:  Bioinformatics       Date:  2018-11-01       Impact factor: 6.937

View more
  2 in total

1.  Special issue on 'Genetic epidemiology of complex diseases: impact of population history and modelling assumptions'.

Authors:  Amke Caliebe; Michael Nothnagel
Journal:  Hum Genet       Date:  2020-01       Impact factor: 4.132

2.  Introduction to statistical simulations in health research.

Authors:  Anne-Laure Boulesteix; Rolf Hh Groenwold; Michal Abrahamowicz; Harald Binder; Matthias Briel; Roman Hornung; Tim P Morris; Jörg Rahnenführer; Willi Sauerbrei
Journal:  BMJ Open       Date:  2020-12-13       Impact factor: 2.692

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.