| Literature DB >> 35885903 |
Shijia Yan1, Qiuying Sha1, Shuanglin Zhang1.
Abstract
Recently, gene-based association studies have shown that integrating genome-wide association studies (GWAS) with expression quantitative trait locus (eQTL) data can boost statistical power and that the genetic liability of traits can be captured by polygenic risk scores (PRSs). In this paper, we propose a new gene-based statistical method that leverages gene-expression measurements and new PRSs to identify genes that are associated with phenotypes of interest. We used a generalized linear model to associate phenotypes with gene expression and PRSs and used a score-test statistic to test the association between phenotypes and genes. Our simulation studies show that the newly developed method has correct type I error rates and can boost statistical power compared with other methods that use either gene expression or PRS in association tests. A real data analysis figure based on UK Biobank data for asthma shows that the proposed method is applicable to GWAS.Entities:
Keywords: PRS; TWAS; gene-base association studies
Mesh:
Year: 2022 PMID: 35885903 PMCID: PMC9318573 DOI: 10.3390/genes13071120
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Estimated type I error rates of the seven methods for different sample sizes of GWAS data sets (5000, 10,000, and 20,000) and different genes (gene1, geme2, and gene3). Type I error rates are evaluated using 1000-replicates sample at significance level of .
|
| Gene | TWAS | PRSB | PRST | PRSQ | TWAS-PRSB | TWAS-PRST | TWAS-PRSQ |
|---|---|---|---|---|---|---|---|---|
| 5000 | 1 | 0.044 | 0.056 | 0.062 | 0.057 | 0.056 | 0.057 | 0.058 |
| 2 | 0.048 | 0.051 | 0.048 | 0.050 | 0.063 | 0.061 | 0.063 | |
| 3 | 0.046 | 0.042 | 0.045 | 0.045 | 0.049 | 0.051 | 0.050 | |
| 10,000 | 1 | 0.044 | 0.055 | 0.057 | 0.051 | 0.060 | 0.063 | 0.058 |
| 2 | 0.054 | 0.046 | 0.047 | 0.049 | 0.052 | 0.047 | 0.047 | |
| 3 | 0.050 | 0.052 | 0.054 | 0.056 | 0.060 | 0.057 | 0.046 | |
| 20,000 | 1 | 0.043 | 0.049 | 0.047 | 0.047 | 0.054 | 0.051 | 0.055 |
| 2 | 0.039 | 0.040 | 0.040 | 0.041 | 0.043 | 0.044 | 0.047 | |
| 3 | 0.040 | 0.042 | 0.039 | 0.047 | 0.040 | 0.042 | 0.042 |
Figure 1Powers of the seven tests versus the total effect size for quantitative traits with . The proportion of causal variants is 0.2. Models 1–3 correspond to genes 1–3, for which we only used the eQTL with the largest weight to generate gene expression; Models 4–6 correspond to genes 1–3, for which we used the two eQTLs with the first two largest weights to generate gene expression.
Figure 2Powers of the seven tests versus the total effect size for quantitative traits with . The proportion of causal variants is 0.2. Models 1–3 correspond to genes 1–3, for which we only used the eQTL with the largest weight to generate the gene expression; Models 4–6 correspond to genes 1–3, for which we used two eQTLs with the first two largest weights to generate gene expression.
Figure 3Powers of the seven tests versus the total effect size for quantitative traits with . The proportion of causal variants is 0.2. Models 1–3 correspond to genes 1–3, for which we only use the eQTL with the largest weight to generate gene expression; Models 4–6 correspond to genes 1–3, for which we use two eQTLs with the first two largest weights to generate gene expression.
The number of genes identified by seven methods under different settings. The numbers in the parentheses indicate the number of identified genes that are reported in TWAS hub (http://twas-hub.org/; accessed on 2 January 2022).
| Setting | TWAS | PRSB | PRST | PRSQ | TWAS-PRSB | TWAS-PRST | TWAS-PRSQ |
|---|---|---|---|---|---|---|---|
|
| 47 | 190 | 198 | 218 (124) | 198 | 195 | 212 |
|
| 65 | 257 (149) | 249 (148) | 258 (152) | 249 | 247 | 268 |
|
| 82 | 319 (185) | 312 (186) | 337 (203) | 304 | 297 | 324 |