| Literature DB >> 33798195 |
Shizhen Tang1,2, Aron S Buchman3, Philip L De Jager4, David A Bennett3, Michael P Epstein1, Jingjing Yang1.
Abstract
Transcriptome-wide association studies (TWAS) have been widely used to integrate transcriptomic and genetic data to study complex human diseases. Within a test dataset lacking transcriptomic data, traditional two-stage TWAS methods first impute gene expression by creating a weighted sum that aggregates SNPs with their corresponding cis-eQTL effects on reference transcriptome. Traditional TWAS methods then employ a linear regression model to assess the association between imputed gene expression and test phenotype, thereby assuming the effect of a cis-eQTL SNP on test phenotype is a linear function of the eQTL's estimated effect on reference transcriptome. To increase TWAS robustness to this assumption, we propose a novel Variance-Component TWAS procedure (VC-TWAS) that assumes the effects of cis-eQTL SNPs on phenotype are random (with variance proportional to corresponding reference cis-eQTL effects) rather than fixed. VC-TWAS is applicable to both continuous and dichotomous phenotypes, as well as individual-level and summary-level GWAS data. Using simulated data, we show VC-TWAS is more powerful than traditional TWAS methods based on a two-stage Burden test, especially when eQTL genetic effects on test phenotype are no longer a linear function of their eQTL genetic effects on reference transcriptome. We further applied VC-TWAS to both individual-level (N = ~3.4K) and summary-level (N = ~54K) GWAS data to study Alzheimer's dementia (AD). With the individual-level data, we detected 13 significant risk genes including 6 known GWAS risk genes such as TOMM40 that were missed by traditional TWAS methods. With the summary-level data, we detected 57 significant risk genes considering only cis-SNPs and 71 significant genes considering both cis- and trans- SNPs, which also validated our findings with the individual-level GWAS data. Our VC-TWAS method is implemented in the TIGAR tool for public use.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33798195 PMCID: PMC8046351 DOI: 10.1371/journal.pgen.1009482
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Power comparison of Burden-TWAS and VC-TWAS methods under simulation scenarios with 20% true causal eQTL for gene expression (i.e., p = 0.2) in the test gene.
Phenotypes simulated from Model I (A, C, E) and Model II (B, D, F) were considered. Various gene expression heritability and various types of SNP weights were considered, including those derived from PrediXcan method, DPR method, and filtered DPR weights in panel (A, B). Various test sample sizes (400, 800, 1232) were considered in panel (B, D). The VC-TWAS approach of using only summary-level GWAS data were validated in panel (E, F).
Type I errors under null simulation studies for Burden-TWAS and VC-TWAS with p = 0.2, = 0.1, using variant weights given by DPR, filtered DPR, and PrediXcan.
| Significance Level | Burden-TWAS | VC-TWAS | ||||
|---|---|---|---|---|---|---|
| DPR | Filtered DPR | PrediXcan | DPR | Filtered DPR | PrediXcan | |
| 1.00×10−2 | 9.82×10−3 | 9.86×10−3 | 9.27×10−3 | 9.43×10−3 | 9.46×10−3 | 9.23×10−3 |
| 1.00×10−4 | 8.64×10−5 | 8.44×10−5 | 9.95×10−5 | 8.64×10−5 | 9.05×10−5 | 8.24×10−5 |
| 2.50×10−6 | 2.00×10−6 | 2.00×10−6 | 2.00×10−6 | 2.00×10−6 | 1.00×10−6 | 6.00×10−6 |
Fig 2Manhattan plots of meta VC-TWAS for AD clinical diagnosis (A) and VC-TWAS of global AD pathology (B) with filtered DPR weights. Genes with FDR < 0.05 are colored in (A), with red for novel risk genes and blue for known AD risk genes. Genes with FDR < 0.05 in meta VC-TWAS of AD clinical diagnosis and p-value < 0.0013 in VC-TWAS of global AD pathology are colored in red in (B).
Significant genes for phenotype AD clinical diagnosis by meta VC-TWAS with filtered DPR weights.
Significant genes have FDR < 0.05 by meta-TWAS with ROS/MAP and Mayo Clinic cohorts. AD risk genes identified by previous GWAS are shaded in grey.
| Gene name | CHR | Start | End | P-value | FDR |
|---|---|---|---|---|---|
| 19 | 44,645,710 | 44,664,462 | 1.11×10−16 | 1.40×10−12 | |
| 19 | 45,542,298 | 45,574,214 | 4.44×10−16 | 2.81×10−12 | |
| 19 | 45,666,187 | 45,681,485 | 3.60×10−14 | 1.52×10−10 | |
| 19 | 45,394,477 | 45,406,935 | 9.05×10−13 | 2.86×10−9 | |
| 19 | 45,754,550 | 45,808,541 | 4.62×10−16 | 1.17×10−8 | |
| 19 | 45,882,892 | 45,909,607 | 1.82×10−10 | 3.84×10−7 | |
| 19 | 45,457,848 | 45,496,598 | 5.71×10−8 | 1.03×10−4 | |
| 19 | 46,112,660 | 46,148,726 | 1.88×10−7 | 2.97×10−4 | |
| 19 | 46,213,887 | 46,234,151 | 4.13×10−7 | 5.80×10−4 | |
| 19 | 45,174,724 | 45,187,631 | 3.93×10−6 | 4.68×10−3 | |
| 19 | 46,171,502 | 46,185,704 | 4.07×10−6 | 4.68×10−3 | |
| 19 | 45,504,695 | 45,541,452 | 6.63×10−6 | 6.99×10−3 | |
| 19 | 44,617,548 | 44,637,255 | 2.59×10−5 | 2.51×10−2 |
a: Genes with significant p-values <0.0013 (Bonferroni correction with respect to 13 genes and 3 phenotypes) for at least one AD pathology phenotype
Fig 3Manhattan plots of VC-TWAS using IGAP summary data with filtered cis-eQTL DPR weights (A) and TWAS locus zoom plots for the loci on chromosome 11 (B) and chromosome 19 (C). Significant genes with FDR < 0.05 are colored, with red for significant ones only identified by using IGAP summary data and blue for the ones replicating our VC-TWAS findings using the individual-level GWAS data of ROS/MAP and Mayo Clinic cohort. Between gene R2 in (B, C) were calculated with respect to GReX values. The R2 in locus zoom plot denoted by various colors for the dots is the squared correlation of GReX between the most significant gene and other neighborhood genes. The heatmap is based on the squared correlation matrix of GReX.
Fig 4Manhattan plot of VC-TWAS using IGAP summary data with BGW cis- and trans- eQTL weights.
Significant genes with FDR < 0.05 are colored.
Significant genes identified by VC-TWAS using IGAP summary statistics data with BGW cis- and trans- eQTL weights, which are either known GWAS loci or shown to be related with AD or other neurological diseases by previous studies.
AD risk genes identified by previous GWAS are shaded in grey.
| Gene Name | CHROM | Start | End | P-value | FDR |
|---|---|---|---|---|---|
| 1 | 155,916,644 | 155,966,129 | 1.75×10−16 | 2.46×10−13 | |
| 1 | 173,833,037 | 173,838,020 | 1.33×10−22 | 3.11×10−19 | |
| 2 | 24,714,782 | 24,993,571 | 2.42×10−25 | 8.51×10−22 | |
| 2 | 25,016,004 | 25,045,245 | 4.07×10−7 | 1.85×10−4 | |
| 2 | 162,280,842 | 162,841,792 | 8.80×10−6 | 2.58×10−3 | |
| 5 | 109,025,066 | 109,205,326 | 6.30×10−8 | 3.70×10−5 | |
| 5 | 140,079,746 | 140,086,266 | 1.83×10−6 | 6.43×10−4 | |
| 6 | 32,485,119 | 32,498,064 | 7.81×10−7 | 3.23×10−4 | |
| 7 | 17,338,245 | 17,385,776 | 2.11×10−5 | 5.62×10−3 | |
| 7 | 27,870,191 | 28,220,362 | 7.49×10−8 | 4.22×10−5 | |
| 7 | 100,487,614 | 100,494,594 | 5.93×10−5 | 1.39×10−2 | |
| 8 | 1,449,530 | 1,656,642 | 1.18×10−5 | 3.38×10−3 | |
| 9 | 3,218,296 | 3,526,004 | 4.69×10−5 | 1.16×10−2 | |
| 10 | 101,370,281 | 101,380,535 | 1.55×10−7 | 7.79×10−5 | |
| 10 | 105,156,404 | 105,206,049 | 2.52×10−8 | 1.69×10−5 | |
| 11 | 124,753,586 | 124,768,396 | 1.13×10−6 | 4.30×10−4 | |
| 12 | 109,460,893 | 109,525,831 | 1.50×10−10 | 1.41×10−7 | |
| 14 | 23,765,111 | 23,772,057 | 2.11×10−5 | 5.62×10−3 | |
| 15 | 65,294,844 | 65,321,977 | 3.10×10−5 | 8.09×10−3 | |
| 15 | 72,533,521 | 72,565,340 | 6.77×10−6 | 2.07×10−3 | |
| 16 | 9,852,375 | 10,276,611 | 2.22×10−8 | 1.56×10−5 | |
| 16 | 58,549,382 | 58,554,431 | 3.44×10−5 | 8.81×10−3 | |
| 17 | 5,264,257 | 5,323,480 | 2.65×10−7 | 1.24×10−4 | |
| 17 | 26,083,791 | 26,127,555 | 9.55×10−5 | 2.07×10−2 | |
| 19 | 45,174,723 | 45,187,631 | 1.46×10−5 | 4.03×10−3 | |
| 19 | 45,417,920 | 45,422,606 | 2.80×10−6 | 9.63×10−4 |
a: Genes shown to be related with AD or other neurological diseases by previous studies