| Literature DB >> 35886906 |
Zoha Kamali1,2, Jacob M Keaton3,4, Shaghayegh Haghjooy Javanmard5,6, Todd L Edwards7, Harold Snieder1, Ahmad Vaez1,2.
Abstract
Recent genome-wide association studies uncovered part of blood pressure's heritability. However, there is still a vast gap between genetics and biology that needs to be bridged. Here, we followed up blood pressure genome-wide summary statistics of over 750,000 individuals, leveraging comprehensive epigenomic and transcriptomic data from blood with a follow-up in cardiovascular tissues to prioritise likely causal genes and underlying blood pressure mechanisms. We first prioritised genes based on coding consequences, multilayer molecular associations, blood pressure-associated expression levels, and coregulation evidence. Next, we followed up the prioritised genes in multilayer studies of genomics, epigenomics, and transcriptomics, functional enrichment, and their potential suitability as drug targets. Our analyses yielded 1880 likely causal genes for blood pressure, tens of which are targets of the available licensed drugs. We identified 34 novel genes for blood pressure, supported by more than one source of biological evidence. Twenty-eight (82%) of these new genes were successfully replicated by transcriptome-wide association analyses in a large independent cohort (n = ~220,000). We also found a substantial mediating role for epigenetic regulation of the prioritised genes. Our results provide new insights into genetic regulation of blood pressure in terms of likely causal genes and involved biological pathways offering opportunities for future translation into clinical practice.Entities:
Keywords: blood pressure; epigenome; functional enrichment; gene expression; genome
Mesh:
Year: 2022 PMID: 35886906 PMCID: PMC9323755 DOI: 10.3390/ijms23147557
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
Figure 1Summary of the methods as well as the main results of prioritization and follow-up analyses using blood pressure GWAS summary statistics. Pink boxes with dashed borders represent methods and their underlying boxes with gray border detail the tools. Pink boxes with solid black borders represent results and their underlying boxes with gray border provide small pictures of results. High resolution figures of heatmap and graphs can be found at the results section. BP: blood pressure; GWAS: genome-wide association study; TWAS: transcriptome-wide association study; nsSNPs: nonsynonymous single nucleotide polymorphisms; MX: MetaXcan; SMR: summary data-based Mendelian randomization; mQTL: methylation quantitative trait loci; eQTL: expression quantitative trait loci; CVS: cardiovascular system.
Figure 2In silico lookup of BP loci associated with different molecular traits at the genome-wide significance level (p < 5 × 10−8; from QTL tests). The outer layer represents the genomic position of variants with QTL associations and is split up into 22 chromosomes with banded colours showing the cytobands. The inner layers with red lines (from the outer one to the inner one) are as follows: DNA methylation, transcription, protein, and metabolite levels. The height of the red lines is representative of −log10(P). The seven assigned loci are those with evidence of simultaneous associations with all four molecular traits.
Figure 3Average Z squares of the genes’ associations with BP traits across 48 GTEx tissues using both MetaXcan (MX) and summary data-based MR (SMR) TWAS. Colour intensity set to higher values. The order of tissues is based on the mean of the average Z squares of the genes across BP traits and the two TWAS approaches.
Figure 4Hierarchical relationships of the top 10 enriched pathways for BP in a Gene Ontology (GO) tree. Black connectors mean “is a” and blue connectors mean “part of”. Coloured in red are the top 10 significantly enriched (FDR < 0.01) GO terms.
Figure 5Network plot showing significantly enriched pathways resulting from DEPICT gene set enrichment analysis (FDR < 0.01) and clustered based on coregulation data provided by DEPICT. The outer layer contains pathways with the centrality degree ≤ 10, the middle—with the centrality degree of 10 < x ≤ 20, and the central circle contains pathways with the highest degree (>20), meaning the largest number of connections, which imply their importance in network survival. Among all, heart development has the maximum degree and the lowest p-value. Nodes are colour-coded based on the p-value.
Figure 6Comparison of the physical distance of 219 genes with and 613 genes without any functional evidence (A), as well as of 219 genes with different sources of evidence (B) to their nearest DNA methylation (DNAm) sites. DNAm signals are generated by the MSMR analysis (SMR analysis of GWAS vs. mQTL) and are mapped to their nearest genes based on Ensemble GRCh37 release 98.
The most highly prioritised genes with evidence from at least four different prioritization methods suggested for further follow-up studies.
| Gene | Evidence | nsSNP | Effect Direction in the Prioritised Tissues a | Likely Causal in Kidneys b | Reported c/Predicted d,e Mechanism(s) | Druggability | Drug Name | |
|---|---|---|---|---|---|---|---|---|
|
| NS, MX, SMR, | rs12628603 |
| (SBP, PP) † | Abnormal vascular endothelial cell morphology | |||
|
| NS, MX, SMR, | rs3088040 | (SBP, DBP, PP) † | × | FLG2 PPI subnetwork | # | ||
|
| NS, MX, SMR, | rs2306272 | − | (SBP, PP) † | EGF, FGF, and ERBB signalling pathways | |||
|
| NS, MX, SMR, | rs3747481 | −−−−−− | (DBP, PP) | KAT2A PPI subnetwork | |||
|
| NS, MX, SMR, | rs6762208 | (SBP, DBP) † | Wnt signalling pathway | # | |||
|
| NS, MX, SMR, | rs1049205 | −−−−−− | (SBP, DBP, PP) | Epithelium development, likely mediated by methylation | |||
|
| NS, MX, SMR, | rs7978353 | −−−− | (SBP, DBP) † | FGF signalling pathway | |||
|
| NS, MX, SMR, | rs267561 | −−−−−− | (SBP, DBP, PP) | Dilated heart atrium and abnormal epithelium morphology, likely mediated by methylation | # | ||
|
| MX, SMR, | (DBP) † | Chromatin modification, likely mediated by methylation | # | ||||
|
| MX, SMR, | (SBP, DBP) † (PP) † | Hemopoietic or lymphoid organ development, likely mediated by methylation | ¥ | TOSEDOSTAT | |||
|
| MX, SMR, | −−↓−−− | (DBP) | Cell–matrix adhesion, likely mediated by methylation | # | |||
|
| MX, SMR, | − | (SBP, DBP, PP) † | × | EGF, FGF, and ERBB signalling pathways, likely mediated by methylation | ¥ | INK-128, OSI-027, AZD-8055 | |
|
| MX, SMR, | −−−−−↓ | (SBP) (DBP) | × | Increased cell proliferation, increased urine protein level, kidney failure, likely mediated by methylation | # | ||
|
| NS, MX, SMR, 3× SMR | rs6468171 | − | (PP) † | Ribosomal biogenesis, likely mediated by methylation | |||
|
| ML, MX, SMR, 3× SMR |
| (SBP, DBP) † | Angiogenesis, likely mediated by methylation. | ¥ | TOSEDOSTAT | ||
* Genes with an asterisk have not yet been functionally studied for BP regulation based on an ISI Web of Science search TS (i.e., topic) = (Gene_name AND (“blood pressure” OR hyp*tension)). a From TWAS analyses using GTEx eQTL data; the arrows show the direction of effect for the association of gene expression and BP traits in the prioritised tissues, including tibial artery, aorta, skeletal muscle, subcutaneous adipose tissue, fibroblasts, and heart left ventricle, using MetaXcan and SMR analyses, respectively. TWAS associations with the BP trait of evidence for each gene are shown, and the trait of evidence is mentioned next to them. † Nominally significant associations for at least one of the traits of evidence, with consistent direction of effect across both TWAS approaches in any of the prioritised tissues. Dashes mean the gene data are not available in the GTEx dataset(s). b From the study by Eales et al. [22]. c From literature. d From DEPICT or GeneMania enrichment analyses. e The mediating mechanism by which the gene is regulated; from 3× SMR analyses of GWAS, eQTL, and mQTL data. Evidence: any biological evidence from our post-GWAS analyses; nsSNP, nonsynonymous SNP; NS, genes with nsSNP linked (r2 > 0.5) to GWAS loci; ML, multilayer molecular associations; MX, MetaXcan; SMR, summary-based Mendelian randomization. ¥ indicates that the gene is targeted by (a) known drug(s); # indicates that the gene is druggable according to the DGI database.
Summary of the 28 newly identified genes and their replication in the MVP cohort using TWAS analysis.
| Gene Name | No. of Evidence | Trait(s) of Evidence | min_P_rep | Druggability | Drug Name(s) |
|---|---|---|---|---|---|
|
| 3 | SBP, PP | 2.39 × 10−2 | # | |
|
| 3 | DBP | 5.35 × 10−3 | # | |
|
| 3 | PP | 1.62 × 10−2 | ||
|
| 3 | SBP, DBP | 1.07 × 10−2 | # | |
|
| 2 | SBP, PP | 4.61 × 10−7 | ¥ | METARAMINOL *, LY-2584702, LY-2780301, XL-418, BETHANIDINE SULFATE, SIROLIMUS, MSC-2363318A |
|
| 2 | SBP, DBP, PP | 2.03 × 10−2 | # | |
|
| 2 | DBP | 7.13 × 10−3 | ||
|
| 2 | DBP, PP | 4.42 × 10−4 | ||
|
| 2 | DBP | 1.43 × 10−2 | # | |
|
| 2 | DBP | 2.03 × 10−3 | ¥ | ESTRAMUSTINE |
|
| 2 | SBP | 5.00 × 10−4 | ||
|
| 2 | SBP, PP | 1.70 × 10−2 | # | |
|
| 2 | SBP, DBP | 5.09 × 10−5 | ||
|
| 2 | DBP, PP | 1.55 × 10−8 | ||
|
| 2 | SBP, DBP | 5.48 × 10−3 | ||
|
| 2 | DBP | 1.75 × 10−2 | ||
|
| 2 | SBP, PP | 1.45 × 10−10 | ||
|
| 2 | SBP, DBP | 2.33 × 10−2 | ||
|
| 2 | SBP, DBP, PP | 7.30 × 10−4 | ||
|
| 2 | SBP, DBP, PP | 9.45 × 10−4 | ||
|
| 2 | PP | 5.32 × 10−4 | ||
|
| 2 | SBP, DBP, PP | 3.21 × 10−7 | ||
|
| 2 | SBP, DBP | 1.95 × 10−4 | ||
|
| 2 | SBP, PP | 1.07 × 10−2 | ||
|
| 2 | SBP, DBP, PP | 2.00 × 10−5 | ||
|
| 2 | SBP, DBP | 4.04 × 10−2 | ||
|
| 2 | SBP, DBP, PP | 8.34 × 10−6 | ||
|
| 2 | SBP, DBP | 1.35 × 10−3 |
SBP, systolic blood pressure; DBP, diastolic blood pressure; PP, pulse pressure; min_P_rep, minimum p-value of replication. ¥ indicates that the gene is targeted by (a) known drug(s); # indicates that the gene is druggable according to the DGI database. * METARAMINOL is a known drug with the primary indication for hypotension.
Figure 7Potential regulatory mechanism for UTP11L through which a BP-associated variant controls BP-associated gene expression. SNP rs4360494 is positively correlated with the expression of UTP11L. At the same time, this SNP is inversely associated with the methylation level of three sites at the promoter and the early sequence of this gene as well as the upstream enhancer and an active CTCF-binding site. The overall hypothesis is that rs4360494 regulates gene expression by controlling the methylation of the promoter and the enhancer as well as the CTCF-binding site. CTCF increases the expression of UTP11L in the presence of the rs4360494 G allele by involving enhancer activity through DNA remodelling, and disruption of CTCF binding due to the methylated DNA in the presence of the rs4360494 C allele decreases the expression of the UTP11L gene.