| Literature DB >> 35175464 |
Ling Li1,2,3, Zhifen Chen1,3, Moritz von Scheidt1,3, Shuangyue Li1,3, Andrea Steiner1,3, Ulrich Güldener1,3, Simon Koplev4, Angela Ma4, Ke Hao4, Calvin Pan5, Aldons J Lusis5,6,7, Shichao Pang1,3, Thorsten Kessler1,3,7, Raili Ermel8, Katyayani Sukhavasi8, Arno Ruusalepp8,9, Julien Gagneur2, Jeanette Erdmann10,11, Jason C Kovacic12,13,14, Johan L M Björkegren4,9,15, Heribert Schunkert16,17.
Abstract
The majority of risk loci identified by genome-wide association studies (GWAS) are in non-coding regions, hampering their functional interpretation. Instead, transcriptome-wide association studies (TWAS) identify gene-trait associations, which can be used to prioritize candidate genes in disease-relevant tissue(s). Here, we aimed to systematically identify susceptibility genes for coronary artery disease (CAD) by TWAS. We trained prediction models of nine CAD-relevant tissues using EpiXcan based on two genetics-of-gene-expression panels, the Stockholm-Tartu Atherosclerosis Reverse Network Engineering Task (STARNET) and the Genotype-Tissue Expression (GTEx). Based on these prediction models, we imputed gene expression of respective tissues from individual-level genotype data on 37,997 CAD cases and 42,854 controls for the subsequent gene-trait association analysis. Transcriptome-wide significant association (i.e. P < 3.85e-6) was observed for 114 genes. Of these, 96 resided within previously identified GWAS risk loci and 18 were novel. Stepwise analyses were performed to study their plausibility, biological function, and pathogenicity in CAD, including analyses for colocalization, damaging mutations, pathway enrichment, phenome-wide associations with human data and expression-traits correlations using mouse data. Finally, CRISPR/Cas9-based gene knockdown of two newly identified TWAS genes, RGS19 and KPTN, in a human hepatocyte cell line resulted in reduced secretion of APOB100 and lipids in the cell culture medium. Our CAD TWAS work (i) prioritized candidate causal genes at known GWAS loci, (ii) identified 18 novel genes to be associated with CAD, and iii) suggested potential tissues and pathways of action for these TWAS CAD genes.Entities:
Keywords: Coronary artery disease; Genetically regulated expression; Genome-wide association study; Transcriptome-wide association study
Mesh:
Year: 2022 PMID: 35175464 PMCID: PMC8852935 DOI: 10.1007/s00395-022-00917-8
Source DB: PubMed Journal: Basic Res Cardiol ISSN: 0300-8428 Impact factor: 12.416
Fig. 1The study design. Step 1, we trained prediction models using EpiXcan from two eQTL panels, the Stockholm-Tartu Atherosclerosis Reverse Network Engineering Task (STARNET) and the Genotype-Tissue Expression (GTEx) for nine tissues. Step 2, the prediction models were applied to impute genetically regulated expression (GReX) from individual-level genotype data of ten CARDIoGRAMplusC4D sets and UK Biobank (UKB). Step 3, we associated transcriptome-wide GReX with risk of coronary artery disease (CAD) (Supplementary Results) and identified 114 transcriptome-wide significant genes (TWAS genes). Of these, 96 resided within genome-wide significant (GWAS) loci and 18 outside of known GWAS loci (novel genes). Step 4, we tested the plausibility of novel TWAS genes by conducting colocalization analysis and studying effects of damaging mutations, as well as gene set enrichment analyses. Step 5, we explored potential mechanisms of novel genes by testing association with risk traits of CAD in human genotype data of UKB, and association between expressions and risk traits of CAD in atherosclerosis mouse models from the Hybrid Mouse Diversity Panel (HMDP). Lastly, we carried out CRISPR/Cas9-based knockdown experiment for two novel genes RGS19 and KPTN in human hepatocyte cell lines to experimentally validate related functions
Fig. 2Manhattan plot of CAD TWAS results. The association results from STARNET- and GTEx-based models were integrated by lowest P values. The blue line marks P = 3.85e−6, i.e. transcriptome-wide significance. Each point corresponds to an association test between gene-tissue pair. 18 novel TWAS genes were highlighted. Supplementary Fig. 6 identifies all genes identified by their genetically-modulated association signals. The color code identifies the tissue in which the genes were differentially expressed by genetic means: AOR aorta, COR coronary artery, MAM mammary artery, BLD blood, LIV liver, SF subcutaneous fat, VAF visceral abdominal fat, SKLM skeletal muscle
Fig. 3Tissue distribution of 114 TWAS genes of CAD. a Number of transcriptome-wide significant genes across tissues. b Heatmap plot of 38 genes identified in more than one tissues. The color codes indicate direction of effects. Cells marked with * represent significant gene-tissue pairs (P < 3.85e−6). AOR aorta, COR coronary artery, MAM mammary artery, BLD blood, LIV liver, SF subcutaneous fat, VAF visceral abdominal fat, SKLM skeletal muscle, TIB tibial artery
18 TWAS genes residing outside of published GWAS loci
| Cytoband | Gene | Tissue | SE | Froma | ||
|---|---|---|---|---|---|---|
| 2p22.3 | LIV | − 3.383 | 0.044 | 3.04E−06 | STARNET | |
| 3q21.3 | VAF | 2.566 | 0.059 | 1.36E−06 | STARNET | |
| 4p14 | VAF | 4.026 | 0.050 | 3.44E−09 | GTEx | |
| 4p14 | BLD | 4.845 | 0.037 | 1.80E−06 | GTEx | |
| 5p13.2 | COR | 5.596 | 0.047 | 7.70E−10 | GTEx | |
| 6q16.1 | MAM | − 5.246 | 0.038 | 1.62E−06 | STARNET | |
| 6q16.1 | BLD | − 4.687 | 0.038 | 8.70E−05 | STARNET | |
| 6q16.1 | BLD | − 4.955 | 0.042 | 3.96E−07 | GTEx | |
| 6q21 | SF | 4.320 | 0.059 | 1.91E−06 | STARNET | |
| 6q25.3 | LIV | − 3.187 | 0.025 | 3.53E−06 | STARNET | |
| 9p21.3 | VAF | 8.348 | 0.068 | 1.44E−12 | GTEx | |
| 9q34.3 | SKLM | − 3.015 | 0.061 | 1.74E−06 | STARNET | |
| 12p11.21 | VAF | 2.285 | 0.065 | 1.79E−07 | STARNET | |
| 12p12.3 | SF | − 3.412 | 0.040 | 5.67E−07 | GTEx | |
| 12q14.3 | VAF | − 2.355 | 0.030 | 1.19E−07 | GTEx | |
| 16p11.2 | COR | 3.347 | 0.056 | 2.59E−06 | GTEx | |
| 16q22.1 | AOR | 4.491 | 0.029 | 5.67E−06 | STARNET | |
| 16q22.1 | AOR | 6.570 | 0.031 | 1.19E−07 | GTEx | |
| 16q24.3 | LIV | 0.189 | 0.041 | 8.32E−07 | GTEx | |
| 19p13.11 | SKLM | 4.647 | 0.030 | 3.52E−08 | GTEx | |
| 19q13.32 | LIV | − 3.076 | 0.076 | 2.17E−06 | STARNET | |
| 20q13.33 | LIV | − 4.913 | 0.028 | 1.52E−06 | GTEx | |
| 20q13.33 | VAF | − 4.545 | 0.030 | 4.63E− 07 | GTEx | |
| 20q13.33 | SKLM | − 5.026 | 0.024 | 1.42E−06 | STARNET | |
| 20q13.33 | SKLM | − 5.298 | 0.018 | 9.29E−07 | GTEx |
TWAS transcriptome-wide association study, STARNET the Stockholm-Tartu Atherosclerosis Reverse Network Engineering panel, GTEx the Genotype-Tissue Expression panel, AOR aorta, COR coronary artery, MAM mammary artery, BLD blood, LIV liver, SF subcutaneous fat, VAF visceral abdominal fat, SKLM skeletal muscle
aAssociation statistics from either STARNET- or GTEx-based models
Fig. 4Effects of damaging variants in TWAS genes on CAD and its risk traits. Sign(beta)*−log10(p) displays direction and significance of gene-trait associations. When the Sign(beta)*−log10(P) > 8, they were trimmed to 8. The gene-trait association pairs reached Bonferroni-significance P < 3.44e−5 were highlighted in box. CAD coronary artery disease, LDL-C low-density lipoproteins cholesterol, VLDL-C very low-density lipoprotein cholesterol, HDL-C high density lipoproteins cholesterol, APOA apolipoprotein A, APOB apolipoprotein B, TC total cholesterol, TG triglycerides, CRP C-reactive protein, BMI body mass index
Fig. 5Novel risk genes were associated with lipid traits. a Data from UK Biobank (UKB) indicated that lead variants inside the boundary of risk genes were associated with lipid traits with Bonferroni-corrected significance levels (*P < 8.09e−6), or by genome-wide significance (**P < 5e−8). b Expression levels of novel genes were likewise associated with lipid traits and aortic lesion area in an atherosclerosis mouse model from the hybrid mouse diversity panel (HMDP). *P < 0.05; **FDR < 0.05. LDL-C low-density lipoproteins cholesterol, VLDL-C very low-density lipoprotein cholesterol, HDL-C high density lipoproteins cholesterol, APOA apolipoprotein A, APOB apolipoprotein B, TC total cholesterol, TG triglycerides, FFA free fatty acid
Fig. 6Targeting of KPTN and RGS19 reduced lipids and APOB secretion of human liver cells. a Two sgRNAs were used to target the exon4 of KPTN (shared exon among isoforms) in a Cas9-expressing huh7 liver cell line. The dual CRISPR strategy created a 40 bp frame shift deletion in the gene and profound reduction of KPTN at both mRNA and protein levels (Supplementary Fig. 11c, d). The primers (P-Fw and P-Rv) used for analyzing the CRISPR editing as indicated. b The same strategy was used for RGS19 targeting, which resulted in a 130 bp frame shift deletion in the gene, and reduction of mRNA and protein (Supplementary Fig. 11c, d). c Reduced triglyceride and cholesterol levels in knockout (KO) cell lines were detected by colorimetric method and APOB100 secretion was measured by human APOB100 Elisa (n = 6). Triglyceride, cholesterol, and APOB100 levels were normalized to total protein and compared between the KO and control (CTR) cell lines