| Literature DB >> 30478944 |
Ting Zhang1, Lei Sun1,2.
Abstract
When evaluating a newly developed statistical test, an important step is to check its type 1 error (T1E) control using simulations. This is often achieved by the standard simulation design S0 under the so-called "theoretical" null of no association. In practice, the whole-genome association analyses scan through a large number of genetic markers ( <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>G</mml:mi></mml:math> s) for the ones associated with an outcome of interest ( <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>Y</mml:mi></mml:math> ), where <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>Y</mml:mi></mml:math> comes from an alternative while the majority of <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>G</mml:mi></mml:math> s are not associated with <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>Y</mml:mi></mml:math> ; the <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>Y</mml:mi> <mml:mo>-</mml:mo> <mml:mi>G</mml:mi></mml:math> relationships are under the "empirical" null. This reality can be better represented by two other simulation designs, where design S1.1 simulates <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>Y</mml:mi></mml:math> from analternative model based on <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>G</mml:mi></mml:math> , then evaluates its association with independently generated <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mrow><mml:mrow/> <mml:msub><mml:mi>G</mml:mi> <mml:mrow><mml:mi>n</mml:mi> <mml:mi>e</mml:mi> <mml:mi>w</mml:mi></mml:mrow> </mml:msub> </mml:mrow> </mml:math> ; while design S1.2 evaluates the association between permutated <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>Y</mml:mi></mml:math> and <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>G</mml:mi></mml:math> . More than a decade ago, Efron (2004) has noted the important distinction between the "theoretical" and "empirical" null in false discovery rate control. Using scale tests for variance heterogeneity, direct univariate, and multivariate interaction tests as examples, here we show that not all null simulation designs are equal. In examining the accuracy of a likelihood ratio test, while simulation design S0 suggested the method being accurate, designs S1.1 and S1.2 revealed its increased empirical T1E rate if applied in real data setting. The inflation becomes more severe at the tail and does not diminish as sample size increases. This is an important observation that calls for new practices for methods evaluation and T1E control interpretation.Entities:
Keywords: interaction; simulation; type 1 error; variance heterogeneity; whole-genome association scans
Mesh:
Substances:
Year: 2018 PMID: 30478944 PMCID: PMC6518945 DOI: 10.1002/gepi.22172
Source DB: PubMed Journal: Genet Epidemiol ISSN: 0741-0395 Impact factor: 2.135
Summary of the two data‐generating models for indirect and direct × interaction testing, and evaluating the “theoretical” null simulation designs S0 versus the two “empirical” null simulation designs S1.1 and S1.2, as described in Sections 2.3.1 and 2.3.2
| Introduce variance heterogeneity by | Introduce variance heterogeneity by | |
|---|---|---|
| Null model for S0 |
|
|
| Alternative models for S1.1 and S1.2 |
|
|
| Parameters |
|
|
| Sample size |
|
|
| Nominal T1E level |
|
|
| Replications |
|
|
| nrep.out = 100 | nrep.out = 100 |
If was available for direct × testing, the Aschard et al. (2013) model coincides with Model I of Rao and Province (2016), except was . T1E rate is first estimated from nrep.in simulation replicates in an inner loop in which is fixed (similar to one whole‐genome × interaction scan), then averaged over nrep.out simulation replicates in an outer loop in which varies.
MAF: minor allele frequency.
Summary of the data‐generating models for direct multivariate × interaction testing, and evaluating the “theoretical” null simulation designs S0 versus the two “empirical” null simulation designs S1.1 and S1.2, as described in Section 2.3.3
| Null model for S0 |
|
| Alternative models for S1.1 and S1.2 |
|
| MAF for | Large: |
| Small: | |
| Parameters |
|
| Sample size |
|
| Nominal T1E level |
|
| Replications |
|
T1E rate is first estimated from nrep.in simulation replicates in an inner loop in which is fixed (similar to one whole‐genome gene‐based × interaction scan), then averaged over nrep.out simulation replicates in an outer loop in which varies.
MAF: minor allele frequency.
Simulation results of interaction scenario 1: Single , and missing
|
| |||||||
|---|---|---|---|---|---|---|---|
|
| 0.0 | 0.2 | 0.4 | 0.6 | 0.8 | 1 | |
| Location | |||||||
|
| S0 | 5.029 | 5.027 | 5.026 | 5.027 | 5.027 | 5.026 |
| S1.1 | 5.039 | 5.021 | 5.021 | 5.019 | 5.014 | 5.010 | |
| S1.2 | 4.997 | 5.023 | 5.022 | 5.020 | 5.017 | 5.011 | |
|
| S0 | 5.002 | 4.998 | 4.997 | 4.998 | 4.998 | 4.997 |
| S1.1 | 5.014 | 4.992 | 4.993 | 4.991 | 4.985 | 4.981 | |
| S1.2 | 4.974 | 4.994 | 4.993 | 4.990 | 4.988 | 4.983 | |
| Scale | |||||||
|
| S0 | 5.035 | 5.083 | 5.081 | 5.081 | 5.081 | 5.079 |
| S1.1 | 5.029 | 5.188 |
|
|
|
| |
| S1.2 | 5.031 | 5.198 |
|
|
|
| |
|
| S0 | 4.956 | 4.898 | 4.898 | 4.898 | 4.898 | 4.896 |
| S1.1 | 4.989 | 4.901 | 4.904 | 4.905 | 4.909 | 4.907 | |
| S1.2 | 4.906 | 4.922 | 4.912 | 4.911 | 4.915 | 4.908 | |
Empirical T1E rates of and location tests for mean difference in across the three groups, and of and scale tests for variance difference in , based on the “theoretical” null design of S0 and the alternative “empirical” null designs of S1.1 and S1.2. The alternative data were generated using the Aschard's genetic model as described in Table 1. Empirical T1E rates outside are bolded.
Simulation results of interaction scenario 1: Single , and missing; effect of the nominal level
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Location |
| S0 | 5.016 | 0.999 | 0.985 | 0.988 |
| S1.1 | 5.009 | 1.007 | 0.988 | 0.990 | ||
| S1.2 | 5.011 | 1.009 | 1.033 | 1.013 | ||
|
| S0 | 5.008 | 0.998 | 0.989 | 0.990 | |
| S1.1 | 4.982 | 0.998 | 0.982 | 0.991 | ||
| S1.2 | 4.982 | 0.999 | 0.983 | 0.997 | ||
| Scale |
| S0 | 5.009 | 1.002 | 1.024 | 1.033 |
| S1.1 |
|
|
|
| ||
| S1.2 |
|
|
|
| ||
|
| S0 | 4.955 | 0.961 | 0.938 | 0.971 | |
| S1.1 | 4.964 | 0.978 | 0.932 | 0.952 | ||
| S1.2 | 4.962 | 0.970 | 0.953 | 0.958 |
Empirical T1E rates of and location tests for mean difference in across the three groups, and of and scale tests for variance difference in , based on the “theoretical” null design of S0 and the alternative “empirical” null designs of S1.1 and S1.2. The alternative data were generated using the Aschard's genetic model as described in Table 1, focusing on the extreme case of large interaction effect, . Empirical T1E rates outside are boded.
Simulation results of interaction scenario 1: Single , and missing; effect of sample size
|
| |||
|---|---|---|---|
|
|
| ||
| Location | |||
|
| S0 | 5.011 | 5.012 |
| S1.1 | 4.992 | 5.003 | |
| S1.2 | 4.934 | 4.989 | |
|
| S0 | 5.011 | 5.012 |
| S1.1 | 4.993 | 5.003 | |
| S1.2 | 4.934 | 4.989 | |
| Scale | |||
|
| S0 | 5.103 | 5.165 |
| S1.1 |
|
| |
| S1.2 |
|
| |
|
| S0 | 4.924 | 4.945 |
| S1.1 | 4.905 | 4.965 | |
| S1.2 | 4.874 | 4.825 | |
Empirical T1E rates of and location tests for mean difference in across the three groups, and of and scale tests for variance difference in , based on the “theoretical” null design of S0 and the alternative “empirical” null designs of S1.1 and S1.2. The alternative data were generated using the Cao's genetic model as described in Table 1, and using two difference sample sizes of and . Empirical T1E rates outside are bolded.
Figure 1Comparison of the asymptotic distribution (black solid) and finite‐sample distribution (red dashed) of under the “empirical” null, with the asymptotic distribution (, blue dot‐dashed) of under the “theoretical” null. Vertical lines correspond the quantile cutoffs for
Simulation results of interaction scenario 2: Single , and known
|
|
| ||||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
| |
|
| S0 | 5.034 | 1.017 | 1.029 | 5.033 | 1.007 | 0.979 |
| S1.1 |
|
|
|
|
|
| |
| S1.2 |
|
|
|
|
|
| |
|
| S0 | 4.982 | 1.003 | 1.002 | 5.021 | 1.004 | 0.972 |
| S1.1 |
|
|
|
|
|
| |
| S1.2 |
|
|
|
|
|
| |
Empirical T1E rates of the LRT and Score tests, based on the ‘theoretical’ null design of S0 and the alternative ‘empirical’ null designs of S1.1 and S1.2. The alternative Y1 data were generated using the Aschard's genetic model as described in Tables 2 when β = 1, but E was assumed to be known in this case and direction interaction testing was possible. Empirical T1E rates outside are bolded.
Simulation results of interaction scenario 3: Multiple s, and known
| Small MAF (rare) | large MAF (common) | ||||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
| |
|
| S0 | 4.991 | 0.982 | 0.985 | 4.983 | 1.001 | 0.992 |
| S1.1 |
|
|
|
|
|
| |
| S1.2 |
|
|
|
|
|
| |
|
| S0 | 4.904 | 0.949 | 0.895 | 4.934 | 0.976 | 0.917 |
| S1.1 | 5.038 | 1.074 | 1.059 |
|
|
| |
| S1.2 | 5.034 | 1.065 | 1.110 |
|
|
| |
|
| S0 |
|
|
|
|
|
|
| S1.1 |
|
|
|
|
|
| |
| S1.2 |
|
|
|
|
|
| |
Empirical T1E rates of the , , and tests of jointly testing for multiple interaction effects, based on the “theoretical” null design of S0 and the alternative “empirical” null designs of S1.1 and S1.2. Models and parameters values are given in Table 2. Empirical T1E rates outside are bolded.
MAF: minor allele frequency.