Literature DB >> 20339902

Don't split your data.

Henrik Källberg1, Lars Alfredsson, Maria Feychting, Anders Ahlbom.   

Abstract

False positive findings are a common problem in whole genome association studies. In this commentary we show that nothing is gained by randomly splitting a data sample to two equal sized subsets, where the first data subset is used for explorative purposes and the other sub set is used to confirm the findings in the first subset. We compare the random splitting procedure to using the full data sample for analysis, by using a Bayesian perspective with consideration taken to prior probability of a false positive finding.

Mesh:

Year:  2010        PMID: 20339902     DOI: 10.1007/s10654-010-9447-3

Source DB:  PubMed          Journal:  Eur J Epidemiol        ISSN: 0393-2990            Impact factor:   8.082


  4 in total

1.  Assessing the probability that a positive report is false: an approach for molecular epidemiology studies.

Authors:  Sholom Wacholder; Stephen Chanock; Montserrat Garcia-Closas; Laure El Ghormli; Nathaniel Rothman
Journal:  J Natl Cancer Inst       Date:  2004-03-17       Impact factor: 13.506

2.  The problem of multiple inference in studies designed to generate hypotheses.

Authors:  D C Thomas; J Siemiatycki; R Dewar; J Robins; M Goldberg; B G Armstrong
Journal:  Am J Epidemiol       Date:  1985-12       Impact factor: 4.897

3.  Methodological Issues in Multistage Genome-wide Association Studies.

Authors:  Duncan C Thomas; Graham Casey; David V Conti; Robert W Haile; Juan Pablo Lewinger; Daniel O Stram
Journal:  Stat Sci       Date:  2009-11-01       Impact factor: 2.901

4.  Two-stage designs for gene-disease association studies.

Authors:  Jaya M Satagopan; David A Verbel; E S Venkatraman; Kenneth E Offit; Colin B Begg
Journal:  Biometrics       Date:  2002-03       Impact factor: 2.571

  4 in total
  2 in total

Review 1.  Supervised Machine Learning: A Brief Primer.

Authors:  Tammy Jiang; Jaimie L Gradus; Anthony J Rosellini
Journal:  Behav Ther       Date:  2020-05-16

2.  Genetic predictors of response to serotonergic and noradrenergic antidepressants in major depressive disorder: a genome-wide analysis of individual-level data and a meta-analysis.

Authors:  Katherine E Tansey; Michel Guipponi; Nader Perroud; Guido Bondolfi; Enrico Domenici; David Evans; Stephanie K Hall; Joanna Hauser; Neven Henigsberg; Xiaolan Hu; Borut Jerman; Wolfgang Maier; Ole Mors; Michael O'Donovan; Tim J Peters; Anna Placentino; Marcella Rietschel; Daniel Souery; Katherine J Aitchison; Ian Craig; Anne Farmer; Jens R Wendland; Alain Malafosse; Peter Holmans; Glyn Lewis; Cathryn M Lewis; Tine Bryan Stensbøl; Shitij Kapur; Peter McGuffin; Rudolf Uher
Journal:  PLoS Med       Date:  2012-10-16       Impact factor: 11.069

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.