| Literature DB >> 32366212 |
Jianjun Zhang1, Xuan Guo2, Samantha Gonzales2, Jingjing Yang3, Xuexia Wang4.
Abstract
BACKGROUND: In the last decade, a large number of common variants underlying complex diseases have been identified through genome-wide association studies (GWASs). Summary data of the GWASs are freely and publicly available. The summary data is usually obtained through single marker analysis. Gene-based analysis offers a useful alternative and complement to single marker analysis. Results from gene level association tests can be more readily integrated with downstream functional and pathogenic investigations. Most existing gene-based methods fall into two categories: burden tests and quadratic tests. Burden tests are usually powerful when the directions of effects of causal variants are the same. However, they may suffer loss of statistical power when different directions of effects exist at the causal variants. The power of quadratic tests is not affected by the directions of effects but could be less powerful due to issues such as the large number of degree of freedoms. These drawbacks of existing gene based methods motivated us to develop a new powerful method to identify disease associated genes using existing GWAS summary data. METHODS ANDEntities:
Keywords: Burden tests; Genome-wide association studies (GWAS); Quadratic test methods; Truncated statistic method
Year: 2020 PMID: 32366212 PMCID: PMC7199321 DOI: 10.1186/s12859-020-3511-0
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Estimated type I error rates for different test methods
| ST | S2T | AT | GATES | aSPU | TS | |
|---|---|---|---|---|---|---|
| 1×10−3 | 1.02×10−3 | 0.97×10−3 | 1.04×10−3 | 1.05×10−3 | 1.00×10−3 | 1.00×10−3 |
| 1×10−4 | 1.09×10−4 | 0.91×10−4 | 1.06×10−4 | 0.99×10−4 | 1.02×10−4 | 1.00×10−4 |
| 1×10−5 | 1.10×10−5 | 0.95×10−5 | 1.00×10−5 | 0.90×10−5 | 1.10×10−5 | 1.00×10−5 |
| 2.8×10−6 | 4.00×10−6 | 3.00×10−6 | 2.50×10−6 | 2.00×10−6 | 3.00×10−6 | 3.00×10−6 |
Estimated power (%) under 2.8×10−6 significance level for different tests. Data are simulated from N(×△,). has 3 nonzero elements with different signs
| nonzero △ | nonzero | AT | S2T | ST | GATES | aSPU | TS |
|---|---|---|---|---|---|---|---|
| (4,2,1) | (1,1,1) | 68.0 | 8.5 | 68.5 | 16.5 | 83.5 | 95.5 |
| (4,4,2) | (1,1,-1) | 69.0 | 45.0 | 45.0 | 28.5 | 65.6 | 93.8 |
| (2,5,4) | (1,-1,-1) | 92.5 | 74.5 | 76.0 | 65.5 | 62.0 | 98.2 |
| U(1,5) | (1,1,1) | 86.0 | 37.0 | 87.0 | 24.0 | 86.0 | 97.8 |
| U(2,6) | (1,1,-1) | 71.5 | 71.5 | 19.0 | 58.0 | 68.0 | 92.0 |
| U(2,6) | (1,-1,-1) | 70.0 | 70.5 | 19.5 | 65.5 | 64.5 | 92.5 |
| N(3,4) | (1,1,1) | 83.0 | 60.0 | 83.5 | 74.5 | 88.5 | 95.5 |
| N(3,4) | (1,1,-1) | 68.0 | 70.0 | 33.0 | 72.5 | 69.5 | 85.7 |
| N(3,4) | (1,-1,-1) | 66.0 | 61.5 | 33.0 | 78.0 | 73.5 | 87.1 |
Fig. 1Venn diagram of the number of significant genes identified by TS, aSPU, GATES, and GW for SCZ
Comparison of the four methods using the PGC SCZ and UKB T2D GWAS summary data
| Methods | aSPU | GATES | GW | TS |
|---|---|---|---|---|
| PGC SCZ | ||||
| Total significant genes (m,%) | 76 (31, 40.8%) | 73 (23, 31.5%) | 93 (32, 34.4%) | 215 (43, 20%) |
| Unique significant genes (u,%) | 4 (1, 25%) | 14 (0, 0%) | 15 (1, 6.7%) | 155 (13, 8.4%) |
| UKB T2D | ||||
| Total significant genes (m,%) | 54 (25, 46.3%) | 40 (12, 30%) | 57 (27, 47.4%) | 217 (47, 21.7%) |
| Unique significant genes (u,%) | 0 (0, 0%) | 7 (1, 14.3%) | 1 (0, 0%) | 155 (17, 11%) |
Note: m denotes the number of significant SNPs in GWAS and u denotes the number of significant SNPs in GWAS. GW denotes a combination of ST, S2T, and AT.
Fig. 2Venn diagram of the number of significant genes identified by TS, aSPU, GATES, and GW for UKB
Verification study for UKB T2D using GWAS summary data obtained from DIAGRAM
| Methods | Number of significant genes from UKB | Verified genes from Diagram | Verfied percentage |
|---|---|---|---|
| GW | 57 | 5 | 8.7% |
| aSPU | 54 | 5 | 9.2% |
| GATES | 40 | 8 | 20.0% |
| TS | 217 | 32 | 14.7% |
Note: GW denotes a combination of ST, S2T, and AT.