| Literature DB >> 24419042 |
L C Lazzeroni1, Y Lu2, I Belitskaya-Lévy3.
Abstract
Scientists often interpret P-values as measures of the relative strength of statistical findings. This is common practice in large-scale genomic studies where P-values are used to choose which of numerous hypothesis test results should be pursued in subsequent research. In this study, we examine P-value variability to assess the degree of certainty P-values provide. We develop prediction intervals for the P-value in a replication study given the P-value observed in an initial study. The intervals depend on the initial value of P and the ratio of sample sizes between the initial and replication studies, but not on the underlying effect size or initial sample size. The intervals are valid for most large-sample statistical tests in any context, and can be used in the presence of single or multiple tests. While P-values are highly variable, future P-value variability can be explicitly predicted based on a P-value from an initial study. The relative size of the replication and initial study is an important predictor of the P-value in a subsequent replication study. We provide a handy calculator implementing these results and apply them to a study of Alzheimer's disease and recent findings of the Cross-Disorder Group of the Psychiatric Genomics Consortium. This study suggests that overinterpretation of very significant, but highly variable, P-values is an important factor contributing to the unexpectedly high incidence of non-replication. Formal prediction intervals can also provide realistic interpretations and comparisons of P-values associated with different estimated effect sizes and sample sizes.Entities:
Mesh:
Year: 2014 PMID: 24419042 PMCID: PMC4255087 DOI: 10.1038/mp.2013.184
Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 15.992
Figure 1Sampling distribution of −log p. Simulated independent values of −log p from 1000 identical studies of the same SNP in the same population using a one-sided test. Within each study, sample size = 2000, OR = 1.74, and risk to major homozygote = 9.5%. Genotype frequencies based on minor allele frequency of 30% under Hardy–Weinberg equilibrium. Dashed vertical line corresponds to P = 0.05. Overlaid curve is the theoretical exponential density under the null hypothesis that the OR = 1. Upper range of the density, which reaches 2.3 when −log P = 0, is not shown. SNP, single nucleotide polymorphism.
Figure 2Ninety-five percent (95%) prediction intervals for replication P-values. A vertical line extending from the bottom to the top of the appropriate funnel gives the prediction interval given the original −log p on the x axis. P-values based on one-sided tests. Dashed lines at P = 0.05. Black arrows correspond to the two most significant SNP P-values observed by Furney et al. in an analysis of 939 individuals. Blue arrows correspond to the same P-values after applying a selection bias correction for the 488 911 SNPs tested in that study. (a) Prediction intervals with equal study sizes. Left (red) funnel: prediction intervals for a single test. Right (partially hidden, yellow funnel): prediction intervals with Bonferroni selection bias correction for picking the best of 1M tests in the initial study, Furney et al.[8] (b) Prediction intervals with unequal study sizes and no selection bias correction. Bottom funnel: prediction intervals when replication study size is one-fourth the size of the original study. Top funnel: prediction intervals when replication study is four times the size of the original study. SNP, single nucleotide polymorphism.
Figure 3Ninety-five percent (95%) prediction intervals for the most significant P-value reported by Furney et al.[8] as a function of replication sample size. Above funnel in red outline shows intervals without selection bias correction. Below funnel in yellow outline, which is partly hidden by the upper funnel, shows intervals with selection bias correction. Dashed line at P = 0.05.
Predictions based on recent results from the Psychiatric Genomics Consortium
| | | | | | ||
| rs2535629 | ADHD | 2787 | 2635 | 0.201 | 0.70 (0.00, 4.29) | 1.63 (0.00, 9.37) |
| ASD | 4949 | 5314 | 0.196 | 0.71 (0.00, 4.32) | 1.02 (0.00, 5.92) | |
| BPD | 6990 | 4820 | 6.61E−06 | 5.18 (1.08, 12.47) | 7.40 (1.79, 17.03) | |
| MDD | 9227 | 7383 | 0.000216 | 3.67 (0.45, 10.01) | 3.77 (0.48, 10.26) | |
| Schizophrenia | 9379 | 7736 | 6.71E−05 | 4.17 (0.65, 10.85) | 4.17 (0.65, 10.85) | |
| All | 33 332 | 27 888 | 2.54E−12 | 11.60 (4.63, 21.83) | 3.67 (0.86, 8.48) | |
| rs11191454 | ADHD | 2787 | 2635 | 0.355 | 0.45 (0.00, 3.66) | 0.99 (0.00, 7.72) |
| ASD | 4949 | 5314 | 0.143 | 0.84 (0.00, 4.64) | 1.22 (0.00, 6.42) | |
| BPD | 6990 | 4820 | 0.0107 | 1.97 (0.00, 6.99) | 2.73 (0.01, 9.25) | |
| MDD | 9227 | 7383 | 0.0156 | 1.81 (0.00, 6.68) | 1.86 (0.00, 6.83) | |
| Schizophrenia | 9379 | 7736 | 3.48E−06 | 5.46 (1.21, 12.91) | 5.46 (1.21, 12.91) | |
| All | 33 332 | 27 888 | 1.39E−08 | 7.86 (2.43, 16.52) | 2.57 (0.36, 6.74) | |
| rs1024582 | ADHD | 2787 | 2635 | 0.127 | 0.90 (0.00, 4.76) | 2.16 (0.00, 10.63) |
| ASD | 4949 | 5314 | 0.892 | 0.05 (0.00, 2.44) | 0.06 (0.00, 3.12) | |
| BPD | 6990 | 4820 | 1.12E−06 | 5.95 (1.44, 13.67) | 8.53 (2.35, 18.75) | |
| MDD | 9227 | 7383 | 0.12 | 0.92 (0.00, 4.82) | 0.94 (0.00, 4.92) | |
| Schizophrenia | 9379 | 7736 | 2.84E−05 | 4.55 (0.80, 11.46) | 4.55 (0.80, 11.46) | |
| All | 33 332 | 27 888 | 1.87E−08 | 7.73 (2.36, 16.33) | 2.53 (0.35, 6.67) | |
| rs2799573 | ADHD | 2787 | 2635 | 0.00691 | 2.16 (0.00, 7.35) | 5.75 (0.37, 17.72) |
| ASD | 4949 | 5314 | 0.238 | 0.62 (0.00, 4.11) | 0.89 (0.00, 5.61) | |
| BPD | 6990 | 4820 | 0.0617 | 1.21 (0.00, 5.46) | 1.64 (0.00, 7.10) | |
| MDD | 9227 | 7383 | 0.00108 | 2.97 (0.21, 8.81) | 3.05 (0.22, 9.02) | |
| Schizophrenia | 9379 | 7736 | 0.00161 | 2.79 (0.15, 8.51) | 2.79 (0.15, 8.51) | |
| All | 33 332 | 27 888 | 4.29E−08 | 7.37 (2.17, 15.80) | 2.42 (0.30, 6.50) | |
Abbreviations: ADHD, attention deficit-hyperactivity disorder; ASD, autism spectrum disorder; BPD, bipolar disorder; MDD, major depressive disorder.
Controls include family-based pseudo-controls.
The observed value of −log p is identical to the median of the first prediction interval, based on the original sample size.
ESS = effective sample size is calculated as 4C1C2/(C1+C2).
ESS = 16 957 corresponds to ESS in the original schizophrenia study. Accordingly, the two schizophrenia intervals are identical.