| Literature DB >> 31664079 |
Yi Li1, Orna Levran2, JongJoo Kim3, Tiejun Zhang4,5, Xingdong Chen6, Chen Suo7,8.
Abstract
It is extremely expensive to conduct large sample size array- or sequencing based genome scale association studies. For a quantitative trait, an extreme case-control study design may improve the power and reduce the cost of variant calling. We investigated the performance of extreme study design when various proportions of samples are selected from the tails of phenotype distribution. Using simulations, we show that when risk genotypes become rare in the population and effect size is relatively small, it is beneficial to carry out an extreme sampling study. In particular, the number of selected cases and controls can even be unbalanced such that power is further increased, compared with a balanced selection. Our application to two data sets: methadone dose data and yearling weight data, demonstrated that similar results for full data analysis can be obtained using extreme sampling with only a fraction of the data. Using power analysis with simulated data and an experimental data application, we conclude that when full data is unavailable due to restricted budget, it is rewarding to employ an extreme sampling design in the sense that there can be immense cost reductions and qualitatively similar power as in the full data analysis.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31664079 PMCID: PMC6820758 DOI: 10.1038/s41598-019-51790-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Illustration defining numbers of individuals with different genotypes from the two ends of the trait distribution.
| AA | AB | BB | Total | |
|---|---|---|---|---|
| Upper tail | UAA | UAB | UBB | U |
| Lower tail | LAA | LAB | LBB | L |
| Total | YAA | YAB | YBB | U + L |
Figure 1Power of extreme study design under the dominant and recessive model. Relationship between power and various fractions in selecting cases and controls, under the dominant model (left panel) and the recessive model (right panel). Crosses represent achieved power when we analyze the phenotype value quantitatively.
Figure 2Power of extreme study design under the multiplicative model. Relationship between power and various fractions in selecting cases and controls, under the multiplicative model. Crosses represent achieved power when we analyze the phenotype value quantitatively.
Figure 3Power versus fractions in selecting equal number of cases and controls. Relationship between power and various fractions in selecting cases and controls, under the dominant model (left panel) and the recessive model (right panel). Crosses represent achieved power when we analyze the phenotype value quantitatively.
Figure 4Power versus fractions in selecting cases and controls. Relationship between power and various unequal fractions in selecting cases and controls, under the dominant model (left panel) and the recessive model (right panel). Crosses represent achieved power when we analyze the phenotype value quantitatively.
Comparisons between extreme sampling, random sampling and XP-GWAS.
| δ | Fractions | Full data | Extreme sampling | Random sampling | XP-GWAS | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dominant | Recessive | Multiplicative | Dominant | Recessive | Multiplicative | Dominant | Recessive | Multiplicative | Dominant | Recessive | Multiplicative | ||
| 1 | 0.1 | 0.05 | 0.05 | 0.05 | 0.0501 | 0.0515 | 0.0501 | 0.0500 | 0.0504 | 0.0500 | |||
| 0.2 | 0.0500 | 0.0505 | 0.0500 | 0.0500 | 0.0500 | 0.0500 | |||||||
| 0.3 | 0.0500 | 0.0500 | 0.0500 | 0.0501 | 0.0500 | 0.0501 | |||||||
| 0.4 | 0.0500 | 0.0512 | 0.0500 | 0.0500 | 0.0510 | 0.0507 | |||||||
| 0.5 | 0.0500 | 0.0508 | 0.0500 | 0.0505 | 0.0511 | 0.0500 | |||||||
| 1.1 | 0.1 | 0.1105 | 0.07 | 0.1629 | 0.0923 | 0.0580 | 0.1178 | 0.0591 | 0.0518 | 0.0688 | 0.0520 | 0.0410 | 0.0657 |
| 0.2 | 0.1013 | 0.0655 | 0.1421 | 0.0749 | 0.0585 | 0.0935 | 0.0606 | 0.0500 | 0.0806 | ||||
| 0.3 | 0.0966 | 0.0611 | 0.1355 | 0.0839 | 0.0617 | 0.1142 | 0.0639 | 0.0535 | 0.0844 | ||||
| 0.4 | 0.0947 | 0.0690 | 0.1305 | 0.0972 | 0.0688 | 0.1402 | |||||||
| 0.5 | 0.0906 | 0.0635 | 0.1205 | 0.1058 | 0.0668 | 0.1553 | |||||||
| 1.2 | 0.1 | 0.3188 | 0.1361 | 0.5604 | 0.2248 | 0.0883 | 0.3597 | 0.0964 | 0.0643 | 0.1371 | 0.0826 | 0.0522 | 0.1502 |
| 0.2 | 0.2654 | 0.1138 | 0.4473 | 0.1495 | 0.0809 | 0.2468 | 0.0961 | 0.0613 | 0.1977 | ||||
| 0.3 | 0.2609 | 0.1033 | 0.4457 | 0.2023 | 0.0973 | 0.3625 | 0.1069 | 0.0665 | 0.2061 | ||||
| 0.4 | 0.2556 | 0.1098 | 0.4378 | 0.2686 | 0.1156 | 0.4741 | |||||||
| 0.5 | 0.2197 | 0.1008 | 0.3816 | 0.3091 | 0.1272 | 0.5507 | |||||||
| 1.3 | 0.1 | 0.6297 | 0.2467 | 0.915 | 0.4398 | 0.1468 | 0.7124 | 0.1546 | 0.0827 | 0.2712 | 0.1349 | 0.0673 | 0.3131 |
| 0.2 | 0.5299 | 0.1946 | 0.8309 | 0.2837 | 0.1182 | 0.5206 | 0.167 | 0.0831 | 0.4136 | ||||
| 0.3 | 0.5240 | 0.1905 | 0.8283 | 0.4198 | 0.1593 | 0.7177 | 0.1823 | 0.088 | 0.4253 | ||||
| 0.4 | 0.5175 | 0.1883 | 0.8190 | 0.5428 | 0.2070 | 0.8442 | |||||||
| 0.5 | 0.4465 | 0.1665 | 0.7473 | 0.6287 | 0.2403 | 0.9136 | |||||||
| 1.4 | 0.1 | 0.8755 | 0.4107 | 0.9961 | 0.6940 | 0.2283 | 0.9414 | 0.2450 | 0.1084 | 0.4777 | 0.2095 | 0.0899 | |
| 0.2 | 0.7951 | 0.3155 | 0.9816 | 0.4741 | 0.1794 | 0.8046 | 0.2639 | 0.1148 | 0.6773 | ||||
| 0.3 | 0.7894 | 0.3079 | 0.9807 | 0.6667 | 0.2574 | 0.9420 | 0.2843 | 0.1200 | 0.6892 | ||||
| 0.4 | 0.7775 | 0.3042 | 0.978 | 0.7987 | 0.3432 | 0.9858 | |||||||
| 0.5 | 0.6981 | 0.2651 | 0.9514 | 0.8777 | 0.4012 | 0.9956 | |||||||
Individuals with MD > T2 and MD < T1 are selected as cases and controls, respectively.
| Controls (MD < T1) | Cases (MD > T2) | ||||
|---|---|---|---|---|---|
| 90th | 80th | 70th | 60th | 50th | |
| 10th | 2 | 3 | 4 | 3 | 3 |
| 20th | 4 | 10 | 13 | 5 | 7 |
| 30th | 7 | 9 | 16 | 4 | 7 |
| 40th | 7 | 8 | 11 | 4 | 10 |
| 50th | 6 | 8 | 15 | 5 | 10 |
| 10th | 0.014 | 0.002 | 0.001 | 0.002 | 0.001 |
| 20th | 0.006 | 0.008 | 0.016 | 0.004 | 0.010 |
| 30th | 0.005 | 0.017 | 0.010 | 0.009 | 0.024 |
| 40th | 0.006 | 0.018 | 0.005 | 0.012 | 0.020 |
| 50th | 0.003 | 0.014 | 0.008 | 0.006 | 0.009 |