Literature DB >> 20052610

Estimating effect sizes in genome-wide association studies.

József Bukszár1, Edwin J C G van den Oord.   

Abstract

Knowledge about the proportion of markers without effects (p( 0 )) and the effect sizes in large scale genetic studies is important to understand the basic properties of the data and for applications such as the control of false discoveries and designing adequately powered replication studies. Many p(0) estimators have been proposed. However, high dimensional data sets typically comprise a large range of effect sizes and it is unclear whether the estimated p(0) is related to the whole range, including markers with very small effects, or just the markers with large effects. In this article we develop an estimation procedure that can be used in all scenarios where the test statistic distribution under the alternative can be characterized by a single parameter (e.g. non-centrality parameter of the non-central chi-square or F distribution). The estimation procedure starts with estimating the largest effect in the data set, then the second largest effect, then the third largest effect, etc. We stop when the effect sizes become so small that they cannot be estimated precisely anymore for the given sample size. Once the individual effect sizes are estimated, they can be used to calculate an interpretable estimate of p(0). Thus, our method results in both an interpretable estimate of p(0) as well as estimates of the effect sizes present in the whole marker set by repeatedly estimating a single parameter. Simulations suggest that the effects are estimated precisely with only a small upward bias. The R codes that compute the effect estimates are freely downloadable from the website: http://www.people.vcu.edu/~jbukszar/.

Entities:  

Mesh:

Year:  2010        PMID: 20052610      PMCID: PMC3923086          DOI: 10.1007/s10519-009-9321-9

Source DB:  PubMed          Journal:  Behav Genet        ISSN: 0001-8244            Impact factor:   2.805


  18 in total

1.  Estimation of the number of "true" null hypotheses in multivariate analysis of neuroimaging data.

Authors:  F E Turkheimer; C B Smith; K Schmidt
Journal:  Neuroimage       Date:  2001-05       Impact factor: 6.556

2.  Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium.

Authors:  Christopher S Carlson; Michael A Eberle; Mark J Rieder; Qian Yi; Leonid Kruglyak; Deborah A Nickerson
Journal:  Am J Hum Genet       Date:  2003-12-15       Impact factor: 11.025

3.  Estimating the occurrence of false positives and false negatives in microarray studies by approximating and partitioning the empirical distribution of p-values.

Authors:  Stan Pounds; Stephan W Morris
Journal:  Bioinformatics       Date:  2003-07-01       Impact factor: 6.937

4.  Comparison of methods for estimating the number of true null hypotheses in multiplicity testing.

Authors:  Huey-miin Hsueh; James J Chen; Ralph L Kodell
Journal:  J Biopharm Stat       Date:  2003-11       Impact factor: 1.051

5.  Multiple-testing strategy for analyzing cDNA array data on gene expression.

Authors:  Robert R Delongchamp; John F Bowyer; James J Chen; Ralph L Kodell
Journal:  Biometrics       Date:  2004-09       Impact factor: 2.571

6.  The 'miss rate' for the analysis of gene expression data.

Authors:  Jonathan Taylor; Robert Tibshirani; Bradley Efron
Journal:  Biostatistics       Date:  2005-01       Impact factor: 5.899

7.  Accurate and efficient power calculations for 2 x m tables in unmatched case-control designs.

Authors:  József Bukszár; Edwin J C G van den Oord
Journal:  Stat Med       Date:  2006-08-15       Impact factor: 2.373

8.  A whole genome scan for quantitative trait loci affecting milk protein percentage in Israeli-Holstein cattle, by means of selective milk DNA pooling in a daughter design, using an adjusted false discovery rate criterion.

Authors:  M O Mosig; E Lipkin; G Khutoreskaya; E Tchourzyna; M Soller; A Friedmann
Journal:  Genetics       Date:  2001-04       Impact factor: 4.562

9.  The distribution of the effects of genes affecting quantitative traits in livestock.

Authors:  B Hayes; M E Goddard
Journal:  Genet Sel Evol       Date:  2001 May-Jun       Impact factor: 4.297

10.  Genomewide association analysis followed by a replication study implicates a novel candidate gene for neuroticism.

Authors:  Edwin J C G van den Oord; Po-Hsiu Kuo; Annette M Hartmann; B Todd Webb; Hans-Jürgen Möller; John M Hettema; Ina Giegling; József Bukszár; Dan Rujescu
Journal:  Arch Gen Psychiatry       Date:  2008-09
View more
  2 in total

1.  Exploring transposable element-based markers to identify allelic variations underlying agronomic traits in rice.

Authors:  Haidong Yan; David C Haak; Song Li; Linkai Huang; Aureliano Bombarely
Journal:  Plant Commun       Date:  2021-12-20

2.  Towards accurate estimation of the proportion of true null hypotheses in multiple testing.

Authors:  Shu-Dong Zhang
Journal:  PLoS One       Date:  2011-04-22       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.