| Literature DB >> 34863089 |
Shuai Wang1, James B Meigs2,3,4, Josée Dupuis5.
Abstract
BACKGROUND: Advancements in statistical methods and sequencing technology have led to numerous novel discoveries in human genetics in the past two decades. Among phenotypes of interest, most attention has been given to studying genetic associations with continuous or binary traits. Efficient statistical methods have been proposed and are available for both types of traits under different study designs. However, for multinomial categorical traits in related samples, there is a lack of efficient statistical methods and software.Entities:
Keywords: Categorical; EGEE; Family samples; Framingham heart study; GWAS; Multinomial; Ordinal; Score test; Sequencing; Wald test
Mesh:
Year: 2021 PMID: 34863089 PMCID: PMC8642939 DOI: 10.1186/s12864-021-08107-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Simulation results of type-I error for family-based samples
| MAF | Robust Score test | Wald test | Logistic regression | ||||||
|---|---|---|---|---|---|---|---|---|---|
| α = 0.01 | α = 0.001 | α = 0.0001 | α = 0.01 | α = 0.001 | α = 0.0001 | α = 0.01 | α = 0.001 | α = 0.0001 | |
| 0.01 | 0.014 | 0.0020 | 0.0003 | 0.012 | 0.0024 | 0.00058 | 0.023 | 0.0023 | 0.0006 |
| 0.02 | 0.013 | 0.0020 | 0.0003 | 0.010 | 0.0011 | 0.0003 | 0.021 | 0.0027 | 0.0004 |
| 0.03 | 0.012 | 0.0017 | 0.0002 | 0.009 | 0.0012 | 0.0002 | 0.022 | 0.0025 | 0.0004 |
| 0.04 | 0.011 | 0.0014 | 0.0002 | 0.007 | 0.0010 | 0.0002 | 0.022 | 0.0025 | 0.0006 |
| 0.05 | 0.011 | 0.0013 | 0.0002 | 0.011 | 0.0008 | 0.0002 | 0.021 | 0.0026 | 0.0002 |
| 0.1 | 0.011 | 0.0010 | 0.0001 | 0.009 | 0.0008 | 0.0001 | 0.021 | 0.0024 | 0.0003 |
| 0.2 | 0.010 | 0.0010 | 0.0001 | 0.010 | 0.0010 | 0.0001 | 0.019 | 0.0033 | 0.0004 |
| 0.3 | 0.010 | 0.0010 | 0.0001 | 0.011 | 0.0013 | 0.0001 | 0.021 | 0.0033 | 0.0011 |
Simulation results of type-I error for unrelated samples
| MAF | Score test | Wald test | Logistic regression | |||
|---|---|---|---|---|---|---|
| α = 0.01 | α = 0.001 | α = 0.01 | α = 0.001 | α = 0.01 | α = 0.001 | |
| 0.01 | 0.011 | 0.0010 | 0.008 | 0.0006 | 0.011 | 0.0011 |
| 0.02 | 0.010 | 0.0016 | 0.010 | 0.0016 | 0.012 | 0.0014 |
| 0.03 | 0.012 | 0.0012 | 0.011 | 0.0010 | 0.011 | 0.0014 |
| 0.04 | 0.011 | 0.0010 | 0.010 | 0.0010 | 0.011 | 0.0010 |
| 0.05 | 0.010 | 0.0010 | 0.006 | 0.0004 | 0.010 | 0.0005 |
| 0.1 | 0.010 | 0.0010 | 0.010 | 0.0008 | 0.010 | 0.0010 |
| 0.2 | 0.010 | 0.0010 | 0.009 | 0.0004 | 0.010 | 0.0010 |
| 0.3 | 0.009 | 0.0011 | 0.009 | 0.0006 | 0.010 | 0.0010 |
Simulation results of type-I error for family-based samples for ordinal traits
| MAF | Robust Score test | Wald test | ||||
|---|---|---|---|---|---|---|
| α = 0.01 | α = 0.001 | α = 0.0001 | α = 0.01 | α = 0.001 | α = 0.0001 | |
| 0.01 | 0.010 | 0.0008 | 0.00009 | 0.012 | 0.0013 | 0.00019 |
| 0.02 | 0.009 | 0.0008 | 0.00008 | 0.011 | 0.0011 | 0.00013 |
| 0.03 | 0.010 | 0.0010 | 0.00009 | 0.011 | 0.0012 | 0.00012 |
| 0.04 | 0.009 | 0.0009 | 0.00008 | 0.010 | 0.0011 | 0.00011 |
| 0.05 | 0.010 | 0.0009 | 0.00010 | 0.011 | 0.0011 | 0.00014 |
| 0.1 | 0.010 | 0.0010 | 0.00009 | 0.010 | 0.0011 | 0.00009 |
| 0.2 | 0.010 | 0.0009 | 0.00009 | 0.010 | 0.0010 | 0.00012 |
| 0.3 | 0.009 | 0.0009 | 0.00010 | 0.010 | 0.0010 | 0.00012 |
Power results for family-based samples
| MAF | α=0.01 | α=0.001 | α = 5 × 10−8 | |||
|---|---|---|---|---|---|---|
| 0.01 | score | 97.2 | score | 92.4 | score | 42.5 |
| Wald | 96.7 | Wald | 90.2 | Wald | 29.8 | |
| 0.02 | score | 96.5 | score | 89.1 | score | 33.0 |
| Wald | 96.6 | Wald | 86.4 | Wald | 24.4 | |
| 0.03 | score | 95.5 | score | 87.5 | score | 25.6 |
| Wald | 95.1 | Wald | 84.6 | Wald | 20.5 | |
| 0.04 | score | 94.9 | score | 85.4 | score | 23.4 |
| Wald | 94.6 | Wald | 82.4 | Wald | 17.7 | |
| 0.05 | score | 94.3 | score | 83.6 | Score | 20.9 |
| Wald | 93.5 | Wald | 81.2 | Wald | 15.8 | |
| 0.1 | score | 93.0 | score | 78.6 | score | 13.8 |
| Wald | 94.3 | Wald | 79.6 | Wald | 11.6 | |
| 0.2 | score | 89.4 | score | 71.7 | score | 7.6 |
| Wald | 91.1 | Wald | 74.3 | Wald | 8.0 | |
| 0.3 | score | 87.4 | score | 68.2 | score | 6.4 |
| Wald | 89.0 | Wald | 71.0 | Wald | 6.5 | |
Power results for unrelated samples
| MAF | α=0.01 | α=0.001 | α = 5 × 10−8 | |||
|---|---|---|---|---|---|---|
| 0.01 | score | 95.0 | score | 85.8 | score | 26.5 |
| Wald | 94.1 | Wald | 82.8 | Wald | 15.6 | |
| Logistic (LRT) | 92.9 | Logistic (LRT) | 79.6 | Logistic (LRT) | 11.4 | |
| 0.02 | score | 93.3 | score | 82.3 | score | 20.0 |
| Wald | 92.8 | Wald | 80.6 | Wald | 14.6 | |
Logistic (LRT) | 91.6 | Logistic (LRT) | 77.4 | Logistic (LRT) | 10.6 | |
| 0.03 | score | 92.7 | score | 81.1 | score | 15.9 |
| Wald | 92.4 | Wald | 79.5 | Wald | 12.7 | |
Logistic (LRT) | 91.2 | Logistic (LRT) | 76.8 | Logistic (LRT) | 9.5 | |
| 0.04 | score | 92.4 | score | 79.2 | score | 14.1 |
| Wald | 92.0 | Wald | 78.2 | Wald | 11.3 | |
Logistic (LRT) | 90.8 | Logistic (LRT) | 75.4 | Logistic (LRT) | 8.6 | |
| 0.05 | score | 92.0 | score | 77.9 | score | 13.1 |
| Wald | 91.9 | Wald | 77.3 | Wald | 10.9 | |
Logistic (LRT) | 90.9 | Logistic (LRT) | 74.7 | Logistic (LRT) | 8.2 | |
| 0.1 | score | 91.3 | score | 75.7 | score | 10.4 |
| Wald | 91.0 | Wald | 74.9 | Wald | 9.5 | |
Logistic (LRT) | 90.3 | Logistic (LRT) | 73.2 | Logistic (LRT) | 7.8 | |
| 0.2 | score | 89.9 | score | 73.2 | score | 8.1 |
| Wald | 89.7 | Wald | 73.0 | Wald | 7.5 | |
Logistic (LRT) | 89.3 | Logistic (LRT) | 71.9 | Logistic (LRT) | 6.8 | |
| 0.3 | score | 89.2 | score | 72.2 | score | 7.0 |
| Wald | 89.3 | Wald | 71.8 | Wald | 6.5 | |
Logistic (LRT) | 89.0 | Logistic (LRT) | 71.5 | Logistic (LRT) | 6.3 | |
Fig. 1Manhattan plot of diabesity using the FHS data and Hapmap imputed genotypes
Fig. 2QQ-plot of diabesity using the FHS data and Hapmap imputed genotypes
Top SNPs and the closest genes
| Chr | Lead SNP | bp | Loci | |
|---|---|---|---|---|
| 5 | rs16875172 | 1.17*10^(− 7) to 5.34*10^(− 7) | 5:77422013–5:77559950 | |
| 7 | rs528144 | 2.99*10^(− 7) to 5.56*10^(− 6) | 7:99257162–7:99918674 | |
| 13 | rs1925751 | 7.34*10^(−7) to 5.97*10^(− 6) | 13:66763957–13: 66795551 |
Fig. 3Manhattan plot of “ordinal” diabesity using the FHS data and Hapmap imputed genotypes (proposed ordinal approach)
Fig. 4Manhattan plot of “ordinal” diabesity using the FHS data and Hapmap imputed genotypes (POLMM approach)
Computing time of robust score and Wald tests on a i7-8565u processor with 16GB RAM
| Sample size | Robust score test | Wald test |
|---|---|---|
| 5000 (182 families) | 3.09 s (initial) + 1.17 s per SNP | 3.44 s per SNP |
| 10,000 (364 families) | 4.47 (initial) + 2.45 s per SNP | 6.92 s per SNP |
| 20,000 (728 families) | 5.47 (initial) + 5.25 s per SNP | 10.95 s per SNP |
Fig. 5All possible family structures