Literature DB >> 28135244

Genome-wide association analysis identifies novel blood pressure loci and offers biological insights into cardiovascular risk.

Helen R Warren1,2, Evangelos Evangelou3,4, Claudia P Cabrera1,2, He Gao3,5, Meixia Ren1,2, Borbala Mifsud1, Ioanna Ntalla1, Praveen Surendran6, Chunyu Liu7,8,9, James P Cook10, Aldi T Kraja11, Fotios Drenos12,13, Marie Loh3,14, Niek Verweij15,16,17,18, Jonathan Marten19, Ibrahim Karaman3, Marcelo P Segura Lepe3,20, Paul F O'Reilly21, Joanne Knight22, Harold Snieder23, Norihiro Kato24, Jiang He25, E Shyong Tai26,27, M Abdullah Said15, David Porteous28, Maris Alver29, Neil Poulter30, Martin Farrall31, Ron T Gansevoort32, Sandosh Padmanabhan33, Reedik Mägi29, Alice Stanton34, John Connell35, Stephan J L Bakker36, Andres Metspalu29, Denis C Shields37, Simon Thom38, Morris Brown1,2, Peter Sever38, Tõnu Esko16,29, Caroline Hayward19, Pim van der Harst15, Danish Saleheen39,40,41, Rajiv Chowdhury6, John C Chambers3,42,43,44, Daniel I Chasman45,46, Aravinda Chakravarti47, Christopher Newton-Cheh16,17,18, Cecilia M Lindgren16,48,49, Daniel Levy7,9, Jaspal S Kooner43,50,51, Bernard Keavney52,53, Maciej Tomaszewski52,53, Nilesh J Samani54,55, Joanna M M Howson6, Martin D Tobin56, Patricia B Munroe1,2, Georg B Ehret47,57, Louise V Wain56.   

Abstract

Elevated blood pressure is the leading heritable risk factor for cardiovascular disease worldwide. We report genetic association of blood pressure (systolic, diastolic, pulse pressure) among UK Biobank participants of European ancestry with independent replication in other cohorts, and robust validation of 107 independent loci. We also identify new independent variants at 11 previously reported blood pressure loci. In combination with results from a range of in silico functional analyses and wet bench experiments, our findings highlight new biological pathways for blood pressure regulation enriched for genes expressed in vascular tissues and identify potential therapeutic targets for hypertension. Results from genetic risk score models raise the possibility of a precision medicine approach through early lifestyle intervention to offset the impact of blood pressure-raising genetic variants on future cardiovascular disease risk.

Entities:  

Mesh:

Year:  2017        PMID: 28135244      PMCID: PMC5972004          DOI: 10.1038/ng.3768

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


Elevated blood pressure (BP) is a strong, heritable1–4 and modifiable driver of risk for stroke and coronary artery disease and a leading cause of global mortality and morbidity5,6. At the time of analysis, genome-wide association study (GWAS) meta-analyses, and analyses of bespoke or exome content, have identified and replicated genetic variants of mostly modest or weak effect on blood pressure at over 120 loci7–11. Here, we report association analyses between BP traits and genetic variants among ˜150,000 participants in UK Biobank, a prospective cohort study of 500,000 men and women aged 40-69 years with extensive baseline phenotypic measurements, stored biological samples12, and follow-up by electronic health record linkage13. We undertake independent replication in large international consortia and other cohorts, providing robust validation of our findings and new biological insights into BP regulation. Our study design is summarized in Fig. 1. Briefly, data are available for 152,249 UK Biobank participants genotyped using a customised array (including GWAS and exome content) and with genome-wide imputation based on 1000 Genomes and UK10K sequencing data14. (Further details on the UK Biobank imputation are available at the UK Biobank website.) After quality measures and exclusions (see Online Methods), we study 140,886 unrelated individuals of European ancestry with two seated clinic BP measurements using the Omron HEM-7015IT device (Supplementary Table 1). We carry out GWAS analyses of systolic (SBP), diastolic (DBP) and pulse pressure (PP) using single-variant linear regression under an additive model, based on ˜9.8 million single nucleotide variants (SNVs) with minor allele frequency (MAF) ≥1% and imputation quality score (INFO) >0.1. For SNVs with P <1x10-6, we take forward for replication the sentinel SNV (i.e. with lowest P-value) at each locus, defined by linkage disequilibrium (LD) r2 < 0.2, within a 1Mb interval. We similarly analyze exome content for variants with MAF ≥0.01%, including rare variants, taking into replication the sentinel SNV (P < 1x10-5) from loci that are non-overlapping (r2 <0.2) with the GWAS findings. Overall we took sentinel SNVs from 240 loci into replication: 218 from GWAS and 22 from exome analysis (r2 < 0.2 and >500kb from previously reported BP SNVs at the time of analysis and not annotated to previously reported BP genes; Supplementary Table 2).
Figure 1

Study design schematic for discovery and validation of loci. N: sample size; QC: Quality Control; PCA: Principal Component Analysis; BP: blood pressure; SBP: systolic BP; DBP: diastolic BP; PP: pulse pressure; SNVs: single nucleotide variants; BMI: body mass index; UKB: UK Biobank; UKBL: UK BiLEVE; GWAS: Genome-wide association study; MAF: Minor Allele Frequency; P: P-value; LD: Linkage Disequilibrium; 1000G: 1000 Genomes. UKBBvsUKBL: a binary indicator variable for UK Biobank vs UK BiLEVE to adjust for the different genotyping chips

The replication resources comprise individuals of European ancestry from a large BP meta-analysis consortium (ICBP cohorts listed in Supplementary Note) and further cohorts with 1000 Genomes data for GWAS (Supplementary Table 3), and two large BP exome consortia. We use P <5x10-8 to denote genome-wide significance in the combined (discovery and replication) meta-analyses, with P < 0.01 for support in the replication data alone and concordant direction of effect. Additionally, we take forward for replication potential secondary signals at 51 previously reported BP loci at the time of analysis (excluding the HLA region). To better understand the functional consequences of our findings, we carry out a series of in silico investigations and experimental analysis of gene expression in relevant vascular tissue for selected putative functional SNVs (Supplementary Fig. 1).

Results

Genetic variants at novel and previously unvalidated loci

Of the 240 loci taken forward to replication, we validate 107 loci at P < 5x10-8, of which 102 derive from the GWAS analysis replicated and meta-analyzed in a total of 330,956 individuals (Tables 1-3; Supplementary Fig. 2a-c; Supplementary Fig. 3a), and a further five from the exome analysis in a total of 422,604 individuals (Tables 1-3 and Supplementary Fig. 3b; Supplementary Tables 4, 5 and 6). Thirty-two of these validated loci are novel findings. Since the time of analysis, the remaining 75 loci have also been reported in another study15, although at least 53 of these were previously unvalidated (Tables 1-3), hence we now validate these loci for the first time. We therefore present results here for all 107 validated loci in our study. Most SNVs also show association with hypertension in the UK Biobank data, for example 93 of the 107 validated sentinel SNVs are nominally significant (P < 0.01) (Supplementary Table 7).
Table 1

Loci validated with SBP as primary trait: combined meta-analysis results from (a) GWAS and (b) Exome for the sentinel variant

(a) GWAS
LocusChrPosrsIDEAEAFNBetaSEPNote
NADK-CPSF3L11,685,921rs139385870D0.5281,890-0.3520.051.3x10-12GIU
CELA2A115,798,197rs3820068A0.81310,7760.4250.061.1x10-12GIU
GTF2B189,360,158rs10922502A0.62323,666-0.3820.052.2x10-15GIU
FOSL2228,635,740rs7562T0.52319,9420.2630.051.9x10-8
PRKD3237,517,566rs13420463A0.77330,3070.3560.057.0x10-11GIU
METTL21A-AC079767.32208,526,140rs55780018T0.54304,567-0.3910.055.9x10-16GIU
RYK3134,000,025rs9859176T0.4322,4280.3220.051.3x10-11G
NPNT4106,911,742rs13112725C0.76306,3700.4350.061.5x10-14GIU
TMEM161B587,514,515rs10059921T0.08298,543-0.5260.094.0x10-9GIU
FBN25127,868,199rs6595838A0.3328,4010.3440.057.6x10-12GIU
CASC15622,130,601rs6911827T0.45326,4710.2960.052.0x10-10GIU
TFAP2D650,683,009rs78648104T0.92305,426-0.4810.081.3x10-8
MKLN17131,059,056rs13238550A0.4325,6470.3310.051.9x10-12
HIPK27139,463,264rs1011018A0.2325,110-0.3290.061.5x10-8
ZFAT8135,612,745rs894344A0.6329,834-0.2580.053.2x10-8
PAX210102,604,514rs112184198A0.1323,791-0.6590.083.6x10-18GIU
MCF2L13113,636,156rs9549328T0.23313,7870.3180.061.5x10-8GI
FERMT21453,377,540rs9888615T0.29326,235-0.3180.053.5x10-10GIU
PPP2R5E1463,928,546rs8016306A0.8329,8690.3350.063.7x10-9
ABHD17C1581,013,037rs35199222A0.45323,4070.3220.055.2x10-12GI
CFDP11675,331,044rs11643209T0.42309,242-0.3390.051.8x10-12GI
CRK171,333,598rs12941318T0.49299,739-0.2690.052.5x10-8GIU
ACOX11773,949,045rs2467099T0.22326,401-0.3070.063.3x10-8GIU

(b) Exome
SSPN1226,438,189rs6487543A0.77244,8420.30.056.3x10-10

Locus: named according to nearest annotated gene(s); Chr: chromosome; Pos: build 37; EA: effect allele; EAF: EA frequency in UK Biobank; Beta: effect estimate; SE: Standard Error of effect; P: P-value; N: total sample size analyzed; Note: indicates loci published since our analysis15 from GERA (G), GERA+ICBP(HapMap) (GI) or GERA+ICBP(HapMap)+UKB (GIU) analyses.

Table 3

Loci validated with PP as primary trait: combined meta-analysis results from (a) GWAS and (b) Exome for the sentinel variant

(a) GWAS
LocusChrPosrsIDEAEAFNBetaSEPNote
chr1mb919,441,949rs9662255A0.43310,618-0.2070.031.9x10-10GIU
SF3A3138,455,891rs4360494C0.55282,8510.2780.033.7x10-16G
RP4-710M16.1-PPAP2B156,576,924rs112557609A0.35325,9520.2270.036.8x10-12
FGGY159,653,742rs3889199A0.71329,4860.3510.031.8x10-24G
C2orf43220,881,840rs2289081C0.36329,140-0.2230.035.5x10-12GI
PRKCE246,363,336rs11690961A0.88327,8470.340.053.9x10-12GIU
CEP68265,283,972rs74181299T0.62324,2240.230.039.6x10-13GIU
TCF7L1285,491,365rs11689667T0.54330,6340.1760.031.7x10-8GIU
FN12216,300,482rs1250259A0.74325,485-0.3140.048.7x10-19G
GATA23128,201,889rs62270945T0.03279,9250.6070.11.8x10-9GIU
PALLD4169,717,148rs1566497A0.42320,9480.2360.031.9x10-13GI
chr4mb1744174,584,663rs17059668C0.92313,277-0.3320.062.8x10-8
LHFPL2577,837,789rs10057188A0.46325,985-0.2050.036.7x10-11GIU
GJA16121,781,390rs11154027T0.47316,7080.2070.031.1x10-10
ESR16152,397,912rs36083386I0.11323,3030.4390.051.5x10-18G
FNDC16159,699,125rs449789C0.14325,5840.3590.052.4x10-15GIU
THBS26169,587,103rs1322639A0.78319,8660.3160.044.8x10-17G
SUGCT740,447,971rs76206723A0.1328,162-0.3460.057.4x10-12GIU
SLC20A2842,324,765rs2978456T0.55304,964-0.1880.031.2x10-8GIU
TRAPPC98141,060,027rs4454254A0.63330,022-0.2610.035.1x10-16
SCAI9127,900,996rs72765298T0.87316,271-0.3740.052.7x10-14GI
KIAA14621030,317,073rs9337951A0.34299,6460.280.042.5x10-15G
ARHGAP121032,082,658rs10826995T0.71327,373-0.2120.031.1x10-9GIU
PRDM111145,208,141rs11442819I0.11326,483-0.2790.057.1x10-9GIU
NOX41189,224,453rs2289125A0.21307,682-0.3770.049.1x10-22G
CEP16411117,283,676rs8258T0.38327,0380.2360.032.9x10-13G
CCDC411294,880,742rs139236208A0.1291,244-0.3630.061.6x10-10G
RP11-61O1.11498,587,630rs9323988T0.63327,551-0.2120.034.1x10-11GIU
VAC141670,755,610rs117006983A0.01250,7660.9860.144.1x10-12
CDH131683,045,790rs7500448A0.75321,9580.3290.041.1x10-19G
KIAA0753176,473,828rs7226020T0.56303,389-0.2560.032.3x10-14GIU
TP53-SLC2A4177,571,752rs78378222T0.99294,0530.9040.141.8x10-10GIU
KCNH4-HSD17B11740,317,241rs79089478T0.97318,3260.5840.13.1x10-9
PYY1742,060,631rs62080325A0.66315,689-0.1860.034.0x10-8
MRC21760,767,151rs740698T0.56311,450-0.2280.033.1x10-12
SLC14A21843,097,750rs7236548A0.18330,0750.3520.042.0x10-18G
SLC24A32019,465,907rs6081613A0.28315,5460.2630.041.6x10-13GIU
ARVCF2219,967,980rs12628032T0.3310,2920.240.035.5x10-12GIU
XRCC62242,038,786rs73161324T0.05267,7220.4960.072.8x10-11

(b) Exome
CD341208,024,820rs12731740T0.1279,078-0.2490.041.1x10-8
ZNF638271,627,539rs3771371T0.57280,285-0.160.035.8x10-9GIU
CRACR2B11828,916rs7126805A0.73145,1620.2220.043.3x10-9

Locus: named according to nearest annotated gene(s); Chr: chromosome; Pos: build 37; EA: effect allele; EAF: EA frequency in UK Biobank; Beta: effect estimate; SE: Standard Error of effect; P: P-value; N: total sample size analyzed; Note: indicates loci published since our analysis15 from GERA (G), GERA+ICBP(HapMap) (GI) or GERA+ICBP(HapMap)+UKB (GIU) analyses.

Of the 107 validated loci, 24 are reported for association with SBP as the primary trait (most significant from combined meta-analysis), 41 for DBP and 42 for PP, although many loci are associated with more than one BP trait (Supplementary Fig. 4). For example, in the combined meta-analysis, 24 validated loci are associated with both SBP and DBP, 11 with SBP and PP, one locus with DBP and PP and four loci (NADK-CPSF3L, GTF2B, METTL21A-AC079767.3 and PAX2) with all three traits at genome-wide significance (Fig. 2).
Figure 2

Venn diagram of 107 validated loci from our study. This shows concordance of significant associations across the three blood pressure phenotypes for the 107 validated sentinel variants (Tables 1-3) from both the GWAS and exome analyses, according to genome-wide significance in the combined meta-analysis. The locus names labelled within the Venn Diagram correspond to Tables 1-3, and relate to the nearest annotated gene. The loci names in bold font highlight the 32 novel loci which are reported for the first time in our study.

After conditional analysis on the sentinel SNV we identify an independent validated secondary SNV at five of the 107 loci (Supplementary Table 8a; Supplementary Table 9). Compared with previously reported SNVs at the time of analysis, the contribution of our validated loci increases the percentage trait variance explained by ˜1%, e.g. to 3.56% for SBP. We report signals at known hypertension drug targets, including the angiotensin converting enzyme (ACE) locus (rs4308, P = 6.8 x 10-14, ACE-inhibitors), CACNA2D2 (rs743757, P = 2.4 x 10-10, calcium channel blockers), MME (rs143112823 in the RP11-439C8.2 locus, P = 1.4 x 10-14, omapatrilat), ADRA2B (rs2579519 in the GPAT2-FAHD2CP locus, P = 4.8 x 10-12, beta blockers), SLC14A2 (rs7236548, P = 2.0 x 10-18, nifedipine), and phosphodiesterase 5A (PDE5A; rs66887589, P = 3.4 x 10-15, sildenafil). Additionally, we evaluate our validated SNVs, where available, in cohorts of non-European ancestry9–11, while recognizing that these analyses are likely underpowered (Supplementary Table 10). We find concordance in direction of effect (P <0.05) for GWAS SNVs for all three BP traits among individuals of East Asian ancestry and for DBP for South Asian ancestry, also for exome SNVs among individuals of Hispanic ancestry, pointing to cosmopolitan effects for many of the BP associated variants. A PhenoScanner16 search showed that 27 of our 107 validated sentinel SNVs (or proxies; r2 ≥ 0.8) exhibit genome-wide significant associations with other traits (Supplementary Fig. 5), including coronary artery disease and myocardial infarction (where BP is likely on the causal pathway17), cardiovascular risk factors (e.g. lipids, height, body mass index) and non-cardiovascular traits (e.g. lung function, cancer, Alzheimer’s).

Variants at previously reported loci at time of analysis

In conditional analyses, we identify 22 secondary SNVs (17 common, one rare, four low-frequency variants) that are conditionally independent of the BP associated SNVs at 16 previously reported loci at the time of analysis (Supplementary Table 8b; Supplementary Tables 11 and 12). One rare variant (rs138582164, MAF=0.1%) in the CDH17 locus anticipated to act as an exonic stop/gain mutation at the GEM gene is associated with a relatively large effect on PP (3.5 mm Hg per allele copy, Supplementary Table 8b). At three previously reported loci (EBF1, PDE3A, JAG1) we identify multiple independent secondary SNVs in addition to the previously reported SNVs (Supplementary Table 11). The UK Biobank data show support (P < 0.01) for 119 of 122 previously reported BP loci at the time of analysis (159 of 163 SNVs) for one or more BP traits (Supplementary Fig. 2 a-c; Supplementary Table 13). We do not show support for one SNV (rs11066280, RPL6-ALDH1) identified from a GWAS of East Asian ancestry18, which may indicate ancestry-specific effects. We compare the MAF and effect sizes in UK Biobank with published results of previously reported variants (Supplementary Fig. 6), indicating consistency of results between the two sources of data. We also examine findings for low-frequency and rare gene mutations previously reported to be associated with monogenic hypertension disorders19 and included on the UK Biobank gene array. Despite lack of power overall, the variant with the lowest P-value (rs387907156; KLH3; MAF=0.02%) has a seemingly large effect on BP: 8.2 mm Hg (SE=4.1, P = 0.046) per allele for SBP; 5.6 mm Hg (SE=2.6, P = 0.048) for PP (Supplementary Table 14).

Functional analyses

We annotate the 107 validated loci to 212 genes (based on LD r2 ≥0.8) and seek putative function from in silico analyses and gene expression experiments. Candidate genes with the strongest supporting evidence are indicated in the last column of Supplementary Table 4 with an indication of the supporting data source. All genome-wide significant variants in LD (r2>0.8) with the variants reported here, ranked by supporting evidence, are annotated in Supplementary Table 15. Of the 107 validated sentinel SNVs three are Indels; all other variants are single nucleotide polymorphisms (SNPs). We identify non-synonymous SNVs at 13 of the 107 validated loci (Supplementary Table 16), three of which are predicted to be damaging (ANNOVAR) in TFAP2D (rs78648104), NOX4 (rs56061986) and CCDC141 (rs17362588, reported to be associated with heart rate20) (Supplementary Fig. 5a). Beyond the coding regions we identify 29 SNVs in 3’UTRs which are predicted to significantly weaken or cause loss of miRNA regulation by altering the recognition motif in seven genes, and strengthen or create target sites for miRNA binding in 13 genes (based on miRNASNP db, Supplementary Table 16). From our expression Quantitative Trait locus (eQTL) analysis (GTEx), 59 of the 107 validated loci contain variants with eQTLs in at least one tissue (Supplementary Table 17); arterial tissue has the largest number of loci with eQTLs (Supplementary Fig. 7), with targeted in silico analysis showing six loci with eQTLs in arterial tissue (Supplementary Table 16). For example, the GTEx tibial artery eQTL in SF3A3 (rs4360494) shows strong in silico supporting evidence, including an arterial DNase I site within which the major C allele removes a predicted AP-2 binding site (Supplementary Fig. 8). Hence we prioritized this gene for in vitro functional analysis (see below). By considering all loci reported here (our 107 validated loci, and previously reported loci at the time of analysis), our DEPICT analysis identifies enrichment of expression across 31 tissues and cells (Supplementary Fig. 9; Supplementary Table 18), with greatest enrichment in the arteries (P = 1.9 x 10-6, false discovery rate (FDR) < 1%). We use FORGE to investigate and identify significant (FDR, P <0.05) cell type specific enrichment within DNase I hypersensitive sites in a range of tissues including dermal and lung microvascular endothelial cell types, and cardiac fibroblasts (Supplementary Fig. 10). For a set of curated candidate regulatory SNVs from our 107 validated loci (see Supplementary Note), widespread enrichment is found in microvascular endothelium, aortic smooth muscle, aortic fibroblasts, vascular epithelium, heart and skin (Supplementary Fig. 10). In addition, we identify significant enrichment of histone marks in a wide range of cell types, including strong enrichment seen for H3K4Me3 (an activating modification found near promoters) marks in umbilical vein endothelial cells (HUVEC) (Supplementary Fig. 11). To explore expression at the level of cardiovascular cell types specifically, we use Fantom5 reference transcript expression data (see Online Methods) to cluster the 212 genes annotated to our 107 validated loci according to tissue specificity (Supplementary Fig. 12), with the significantly clustered genes forming four tissue-specific clusters, including a vascular smooth muscle cell (VSMC) and fibroblast cluster, an endothelial cell cluster (including probable endothelial cells in highly vascularized tissues), and a combined vascular cell cluster. Additionally, Ingenuity pathway analysis and upstream transcriptional analysis show enrichment of canonical pathways implicated in cardiovascular disease, including those targeted by antihypertensive drugs, such as the alpha-adrenergic, CXCR4, endothelin signalling and angiotensin receptor pathways (Supplementary Table 19). In keeping with vascular mediation of genetic influence we identify diphenyleneiodonium, an inhibitor of flavin-containing oxidases, including NAD(P)H oxidase (NOX), which is reported to reverse endothelial dysfunction (and hypertension) in a rat model21. To identify long range target genes of non-coding variants, we use chromatin interaction (Hi-C) data from HUVEC, as enhancers and silencers often form chromatin loops with their target promoter. In most loci the strongest promoter interaction involves a gene in high LD with the SNV, but for 21 loci we find a distal potential target gene (Supplementary Table 16). Pathway analysis of the distal genes shows greatest enrichment in regulators of cardiac hypertrophy. We evaluate pleiotropy using the Genomic Regions Enrichment of Annotations Tool (GREAT) to study enrichment of mouse phenotype and human disease ontology terms across all loci reported here. These highlight cardiovascular system abnormalities and vascular disease as the most highly enriched terms (Supplementary Fig. 5b & 5c). Collectively evidence from eQTLs, DEPICT, DNase I sites, histone marks, Hi-C data and ontological analyses indicates predominant vascular and cardiovascular tissue involvement for genes within the BP associated loci. We also look for association of our validated sentinel SNVs with metabolomic signatures. Three SNVs within the NOX4, KCNH4 and LHFPL2 loci show significant associations (family-wise error rate < 5%) with lipoprotein sub-fractions from 1H Nuclear Magnetic Resonance (NMR) spectroscopy analysis of 2,000 Airwave study samples (Supplementary Tables 20 and 21). The results for these variants suggest a link between BP regulation and lipid metabolism. Eleven SNVs (including at LHFPL2 locus) show association (family wise error rate < 5%) with metabolites in blood or urine from the publicly available “Metabolomics GWAS Server” resource based on mass spectrometry22,23 (Supplementary Table 21), including sugar acids, sphingolipids, fatty acids, glycerophospholipids, organic acids and benzene derivatives. Several genes and variants with putative function are highlighted in our in silico analysis as having biological support (e.g. eQTLs or nsSNVs) and those with novelty and tractability to laboratory investigation (e.g. expression in available tissue models) are prioritized. Sentinel variants in three genes which were highly significant in the combined meta-analysis (Tables 2 and 3) are selected for experimental testing and were successfully genotyped, each for at least 100 samples. We select ADAMTS7 due to strong biological support (e.g. mouse knockout phenotype), SF3A3 due to eQTLs, and NOX4 as it contains a rare nsSNV (Supplementary Table 9) in addition to common variant associations. We use quantitative polymerase chain reaction (qPCR) to study the impact of these sentinel variants on gene expression in human VSMCs and endothelial cells (ECs) (see Online Methods). For SF3A3, the major C allele of variant rs4360494 associated with increased PP (0.278 mmHg ±0.03, P=3.7x10-16, N=307,682) is associated with SF3A3 expression in human VSMCs, although not in endothelial cells (Supplementary Fig. 13a); and the T allele of SNV rs62012628 in ADAMTS7 associated with lower DBP (0.238 mmHg ±0.03, P=5.1x10-12, N=244,143), is associated with reduced ADAMTS7 expression in human VSMCs (Supplementary Fig. 13b), while the minor A allele of SNV rs2289125 at the NOX4 locus associated with lower PP (-0.377 mmHg ±0.04, P=9.1x10-22, N=282,851) correlates with increased NOX4 expression in ECs though not VSMCs (Supplementary Fig. 13c). Our study thus finds evidence for novel cis-eQTLs in ADAMTS7 and NOX4 in addition to validating the previously reported GTEx eQTL in SF3A3, and supports the vascular expression of these genes.
Table 2

Loci validated with DBP as primary trait: combined meta-analysis results from (a) GWAS and (b) Exome for the sentinel variant

(a) GWAS
LocusChrPosrsIDEAEAFNBetaSEPNote
chr1mb25125,030,470rs6686889T0.25322,5750.1850.033.6x10-9
DNM31172,357,441rs12405515T0.56328,543-0.1650.031.4x10-9GIU
GPATCH21217,718,789rs12408022T0.26320,9830.1980.032.4x10-10GIU
CDC42BPA1227,252,626rs10916082A0.73327,636-0.1770.038.4x10-9
WNT3A1228,191,075rs2760061A0.47312,7610.230.032.1x10-16GIU
SDCCAG81243,471,192rs953492A0.46325,2530.220.037.4x10-16G
ADCY3225,139,596rs55701159T0.89321,0520.2850.047.2x10-11
SLC8A1240,567,743rs4952611T0.58309,395-0.1570.034.0x10-8
AC016735.1243,167,878rs76326501A0.91318,1270.4190.053.6x10-18
GPAT2-FAHD2CP296,675,166rs2579519T0.63311,557-0.1970.034.8x10-12
TEX412145,646,072rs1438896T0.3329,2780.2340.032.0x10-15GIU
CCDC1412179,786,068rs79146658T0.91321,318-0.3110.052.4x10-10G
TMEM194B2191,439,591rs7592578T0.19304,672-0.240.049.5x10-12
TNS12218,668,732rs1063281T0.6315,354-0.20.031.3x10-12GIU
CAMKV-ACTBP13349,913,705rs36022378T0.8319,983-0.2020.034.7x10-9GIU
CACNA2D2350,476,378rs743757C0.14328,8360.2450.042.4x10-10GIU
FAM208A356,726,646rs9827472T0.37323,058-0.1770.034.3x10-10GIU
RP11-439C8.23154,707,967rs143112823A0.09297,343-0.4030.051.4x10-14GIU
SENP23185,317,674rs12374077C0.35327,5130.1630.039.2x10-9GIU
PDE5A4120,509,279rs66887589T0.52324,397-0.2150.033.4x10-15GIU
POC5575,038,431rs10078021T0.63314,172-0.1640.031.3x10-8G
CPEB45173,377,636rs72812846A0.28312,601-0.2090.032.2x10-11GIU
PKHD1651,832,494rs13205180T0.49325,4190.1680.037.0x10-10GIU
PDE10A6166,178,451rs147212971T0.06296,010-0.360.061.6x10-9GIU
SLC35F16118,572,486rs9372498A0.08330,6250.3340.051.8x10-11GIU
SNX318101,676,675rs2978098A0.54324,4240.1650.031.5x10-9
RP11-273G15.28144,060,955rs62524579A0.53268,645-0.1750.033.8x10-9GIU
MTAP921,801,530rs4364717A0.55327,173-0.1750.031.3x10-10
BDNF1127,728,102rs11030119A0.31330,002-0.1630.032.9x10-8GIU
MYEOV1169,079,707rs67330701T0.09276,760-0.3670.052.1x10-12GIU
RP11-321F6.11566,869,072rs7178615A0.37318,076-0.1790.032.6x10-10
ADAMTS71579,070,000rs62012628T0.29244,143-0.2380.035.1x10-12
chr15mb951595,312,071rs12906962T0.68319,952-0.2210.035.6x10-14GIU
PPL164,943,019rs12921187T0.43326,469-0.1740.032.5x10-10G
FBXL191630,936,743rs72799341A0.24324,5020.1850.035.8x10-9GIU
CMIP1681,574,197rs8059962T0.42319,839-0.170.031.3x10-9
ACE1761,559,625rs4308A0.37319,3940.2130.036.8x10-14GIU
MAPK41848,142,854rs745821T0.76330,9540.1890.031.4x10-9
CCNE11930,294,991rs62104477T0.33320,3470.1770.031.2x10-9GIU
PLCB1208,626,271rs6108168A0.25327,368-0.2110.031.1x10-11

(b) Exome
MRAS3138,119,952rs2306374T0.84281,715-0.1840.037.4x10-9GIU

Locus: named according to nearest annotated gene(s); Chr: chromosome; Pos: build 37; EA: effect allele; EAF: EA frequency in UK Biobank; Beta: effect estimate; SE: Standard Error of effect; P: P-value; N: total sample size analyzed; Note: indicates loci published since our analysis15 from GERA (G), GERA+ICBP(HapMap) (GI) or GERA+ICBP(HapMap)+UKB (GIU) analyses.

Genetic risk score analyses

We create an unbiased genetic risk score (GRS) (Supplementary Table 22) to evaluate, in an independent cohort (Airwave, see Online Methods), the impact of the combination of all loci reported here on BP levels and risk of hypertension. When compared with the lowest quintile of the distribution of the GRS, individuals >50 years in the highest quintile have sex-adjusted mean SBP higher by 9.3 mm Hg (95% CI 6.9 to 11.7 mm Hg, P =1.0 x 10-13) and an over two-fold higher risk of hypertension (OR 2.32 95% CI 1.76 to 3.06; P=2.8 x 10-9) compared with individuals in the lowest quintile (Fig. 3; Supplementary Table 23). Similar results were obtained from GRS associations with BP and hypertension within UK Biobank (Supplementary Table 24). In UK Biobank – based on self-reported health data, record linkage to Hospital Episode Statistics and mortality follow-up data (Supplementary Table 25) – we show that the GRS is associated with increased risk of stroke, coronary heart disease and all cardiovascular outcomes; comparing the upper and lower fifths of the GRS distribution, sex-adjusted odds ratios are 1.34 (95% CI 1.20 to 1.49, P =1.5×10-7), 1.38 (95% CI 1.30 to 1.47, P= 4.3×10-23) and 1.35 (95% CI 1.27 to 1.42, P=1.3×10-25) respectively (Fig. 3; Supplementary Table 26). Results are also provided for incident-only cases (Supplementary Table 27).
Figure 3

Distribution of Genetic Risk Score (GRS) and its relationship with blood pressure, hypertension and CVD outcomes. The GRS is based on all reported loci: both previously reported loci at the time of analysis; and all validated blood pressure variants from this study. (a) Distribution of GRS in Airwave and sex-adjusted odds ratio of hypertension in age 50+ comparing each of the upper four GRS quintiles with the lowest quintile; dotted lines represent the upper 95% confidence intervals. (b) Mean blood pressures and standard deviation in bracket in Airwave age 50+ across GRS quintiles. (c) Distribution of GRS in UKB and sex-adjusted odds ratio of CVD, CAD and stroke comparing each of the upper four GRS quintiles with the lowest quintile; dotted lines represent the upper 95% confidence intervals. (d) Count of CVD, CAD and stroke (events and deaths) across GRS quintiles in UKB participants

Discussion

A key attribute of this study is the combination of a large, single discovery sample with standardized BP measurement and dense 1000 Genomes/UK10K imputation, yielding a high quality dataset with ˜9.8 million variants14, taking advantage of major international consortia for parallel replication of common and low-frequency variants. In total we include GWAS data from 330,956 individuals and exonic SNVs from a total of 422,604 individuals. This strategy resulted in 107 robustly validated loci for BP traits, including 32 loci that have not previously been reported, and at least 53 further loci validated for the first time. Despite its size, our study is still under-powered to find low-frequency variants. Our findings are mostly common variants, with similarly modest effect sizes as variants previously reported at the time of analysis (Supplementary Fig. 14). The lack of rare variant discovery could also be due to the challenge of detecting rare variants from imputed data. There may be greater potential for identifying rare variants from the future release of genetic data for all 500,000 UK Biobank participants. Our findings point to new biology as well as highlighting gene regions in systems that have previously been implicated in the genetics of BP. Several of our validated loci affect atherosclerosis or vascular remodelling (ADAMTS7, THBS2, CFDP1) and exhibit locus pleiotropy in prior genome-wide association studies for coronary artery disease or carotid intimal-media thickness24–26 (Supplementary Fig. 5a and Fig. 4). In previous work we have shown that expression of ADAMTS7 is upregulated and increases vascular smooth muscle cell migration in response to vascular injury in relation to a distinct coronary artery variant (rs3825807, not in LD with our sentinel SNV; r2 = 0.17)27. In endothelial cells ADAMTS7 encodes a metalloproteinase to cleave thrombospondin-1 encoded by THBS2 which leads to reduced endothelial cell migration and plays a role in neo-intimal repair in the vessel wall27. Our functional work indicates that the allele associated with lower DBP is also associated with lower ADAMTS7 expression in human VSMCs; this fits with the murine knockout that exhibits reduced atherosclerosis. SF3A3 encodes a splicing factor with no prior links to BP other than our reported association and eQTL. At the CFDP1 locus our sentinel SNV is in high LD (r2 = 0.95) with a variant previously associated with carotid intimal-medial thickness. Collectively our findings highlight a potential common mechanism among these genes in vascular remodelling that has previously been observed in small resistance arteries in essential hypertension28.
Figure 4

Summary of gene cardiovascular expression from validated loci. Genes are shown on the basis of their tissue expression and supporting evidence summarised in Supplementary Table 16, based on Knockout (KO) phenotype, previously reported blood pressure biology or a strong functional rationale: eQTL (expression Quantitative Trait Loci), nsSNV (non-synonymous SNV), Hi-C. Multiple lines of evidence indicate the central importance of the vasculature in blood pressure regulation and we thus highlight existing drugged (*) and druggable (#) targets among these genes. Illustrations used elements with permission from Servier Medical Art. We note that some druggable genes may carry a safety liability, such as GJA1, which has known association with QT interval20

NADPH oxidase 4 (NOX4) has an established role in the endothelium where it enhances vasodilatation and reduces blood pressure in vivo29. This oxidase generates reactive oxygen species in the endothelium and may contribute to salt sensitive hypertension in the kidney and the vasculature30–32. We found that the allele of the common variant at the NOX4 locus correlates with increased tissue specific NOX4 expression in endothelial cells rather than VSMCs (Supplementary Fig. 13c). NOX4 mediates endothelial cell apoptosis and facilitates vascular collagen synthesis contributing to endothelial dysfunction and arterial stiffness, and may explain the association with PP33,34. We identify several loci containing genes involved in vascular signalling and second messenger systems such as PDE5A and PDE10A35–37. The phosphodiesterase PDE5A hydrolyzes cyclic GMP and is inhibited by sildenafil which leads to vasodilatation38. This finding fits with our previous discoveries of a role for gene loci encoding elements of natriuretic peptide-nitric oxide pathway and guanylate cyclase signalling systems in BP regulation18,39,40. Our findings strengthen the case for evaluating the opportunity to repurpose PDE5A inhibitors for use in hypertension. The importance of microvascular function is emphasised by the solute carrier transporters such as SLC14A2 encoding a urea transporter, which has previously been linked to autosomal dominant Streeten type orthostatic hypotensive disorder41 and BP response to nifedipine, a calcium channel blocker antihypertensive drug42. SLC8A1 encodes a sodium calcium exchanger expressed in cardiomyocytes which alters cardiac contractility and hypertrophy and shows abnormal BP in SLC8A1 transgenic mice43. Variants at SLC35F1 have previously been associated with resting heart rate and ventricular size which could contribute to BP elevation44. We also identify loci that are involved in cardiovascular development (GATA2, KIAA1462, FBN2, FN1 and HAND2) such as fibrillin 2 (FBN2) which overlaps in action with fibrillin 1 in development of the aortic matrix45–49. In addition, fibronectin expression is increased in hypertension and in atherosclerosis but it may also play a role in the development of the heart49–51. Our analysis validates loci containing genes with prior physiological connection to BP such as BDNF, FAM208A, and CACNA2D252–54. The neurotrophin Brain Derived Neurotrophic Factor (BDNF) modulates angiotensin 11 in the brain to elevate BP in experimental models; higher serum levels correlate with reduced risk of cardiovascular disease and mortality52. In experimental models FAM208A, which is thought to be a transcription factor, is a strong candidate for a QTL for BP54. The gene CACNA2D2 encodes a subunit of the L-type calcium channel that is most abundantly expressed in the atrium and in neurones and may be a target for negatively chronotropic and inotropic calcium channel antagonists which reduce BP55. We examine long range genomic interactions using Hi-C, whereby the promoter region has a strong chromatin interaction with a novel SNV. One example is EPAS1, which is ˜200kb away from the SNV (rs11690961). It encodes hypoxia-inducible factor 2alpha, which affects catecholamine homeostasis, protects against heart failure and mutations in the gene are associated with pulmonary hypertension56. Another such gene is INHBA, 1.3Mb away from the SNV (rs12531683), which is elevated in pulmonary hypertension and contributes to vascular remodelling by inducing expression of endothelin-1 and plasminogen activator inhibitor-1 in pulmonary smooth muscle cells57. Our observation of 9-10 mm Hg higher BP at age 50+ years when comparing the top vs bottom fifths of the BP GRS distribution has potential clinical and public health implications. We stratified by age due to a significant interaction of the GRS with age (P ranging between 9.96×10-11 and 1.16×10-3 for interaction with continuous BP traits, P = 0.012 for hypertension). Measuring the GRS in early life raises the possibility of adopting an early precision medicine approach to offset the genetic risk through lifestyle intervention (i.e. reduced sodium intake, increased potassium intake, maintenance of optimal weight, low adult alcohol consumption and regular exercise)58–60. Studies of non-pharmacologic approaches to BP control indicate that 10 mm Hg or more reduction in SBP is an achievable goal through lifestyle measures alone61, while recent evidence suggests that favorable lifestyle may offset the cardiovascular sequelae associated with high genetic risk62. As the above data are observational, the extent to which adherence to lifestyle recommendations amongst high genetic risk individuals might result in favorable outcomes remains uncertain; given the substantial effect of GRS on BP by middle-age, the potential for adopting early lifestyle intervention amongst individuals at high genetic risk, along with population-wide measures to lower BP, warrants further study. Since the completion of our study, another BP GWAS using UK Biobank data has been published15, as part of a larger single-stage combined meta-analysis without replication; it reported a total of 316 loci, including 241 loci identified from the meta-analysis involving UK Biobank that were not tested for validation. Of the 107 validated loci reported in our study, 32 are discovered and validated for the first time in our analysis of UK Biobank. In addition, 75 sentinel SNVs are in LD (r2 ≥ 0.2) with the recently reported loci15 and we validate at least 53 of these for the first time in our study (indicated by “GIU” in Tables 1-3). Furthermore we note that 49 of the reported loci from the recent study15 did not validate in our large independent replication resource. In summary we describe 107 validated loci for BP offering new biology, identifying potential new therapeutic targets and raising the possibility of a precision medicine approach to modify risk of hypertension and cardiovascular outcomes. Altogether, this represents a major advance in our understanding of the genetic architecture of BP.

Data Availability Statement

The data generated during the current study are available from the UK Biobank data repository (http://biota.osc.ox.ac.uk), which can be accessed by researchers upon application. This includes the derived GWAS analysis results summary data from our UK Biobank discovery data for all three BP traits. The genetic and phenotypic UK Biobank data are also available upon application to the UK Biobank (https://www.ukbiobank.ac.uk). All replication data generated during this study are included in the published article. For example, association results of look-up variants from our replication analyses and the subsequent combined meta-analyses are contained within all Supplementary Tables provided.

Online Methods

UK Biobank data

Our GWAS analysis is performed using data from the interim release of the first ˜150k UK Biobank (UKB) participants (Supplementary Note): ˜100k individuals from UK Biobank genotyped at ˜800,000 single nucleotide variants (SNVs) with a custom Affymetrix UK Biobank Axiom Array chip and ˜50k individuals genotyped with a custom Affymetrix UK BiLEVE Axiom Array chip from the UK BiLEVE study63, a subset of UKB. SNVs were imputed centrally by UKB using a merged UK10K sequencing + 1000G imputation reference panel. UK Biobank array design and protocols are available on the UK Biobank website.

Quality control

Following QC procedures already carried out centrally by UKB, we exclude discordant SNVs and samples with QC failures, gender discordance and high heterozygosity/missingness. We further restrict our data to a subset of individuals of European ancestry. By applying kmeans clustering to the Principal Component Analysis (PCA) data a total of N=145,315 Europeans remain (Supplementary Fig. 15). We use the kinship data to exclude 1st and 2nd degree relatives, with N=141,647 unrelated individuals remaining. Finally we restrict our data to non-pregnant individuals with two automated BP measurements available, resulting in a maximum of N=140,886 individuals for analysis (Supplementary Note).

Phenotypic data

After calculating the mean SBP and DBP values from the two BP measurements, we adjust for medication use by adding 15 and 10 mmHg to SBP and DBP, respectively, for individuals reported to be taking BP-lowering medication (21.4% of individuals)64. PP is calculated as SBP minus DBP, according to the medication-adjusted traits. Hypertension, used in secondary analyses, is defined as: (i) SBP ≥ 140 mmHg, or (ii) DBP ≥ 90 mmHg, (iii) or taking BP-lowering medication; otherwise individuals are classified as non-hypertensive. Descriptive summary statistics are provided for all individuals (Supplementary Table 1).

Statistical methods

Statistical approaches used for the discovery and replication of loci are reported in detail below. We also describe methods used for: identification of secondary signals; lookups in non-European populations and for monogenic BP genes; functional and experimental methods; construction of a genetic risk score for analysis with BP traits and cardiovascular outcomes. All P-values are from two-sided tests.

Analysis models

For the GWAS, we perform linear regression analyses of the three (untransformed) continuous, medication-adjusted BP traits (SBP, DBP, PP) for all measured and imputed genetic variants in dosage format using SNPTEST software65 under an additive genetic model. We carry out a similar analysis for the exome content. Quantile-quantile plots are shown in Supplementary Fig. 16. Each analysis includes the following covariates: sex, age, age2, body mass index, top ten PCs and a binary indicator variable for UK Biobank vs UK BiLEVE to adjust for the different genotyping chips. We also run an association analysis within UKB for validated BP SNVs and hypertension using logistic regression under an additive model with adjustments as above. There are 76,554 hypertensive cases and the 64,384 remaining participants are treated as non-hypertensive controls. This sample size is slightly larger than the N=140,866 used in the main analyses, since participants with only one BP measurement, but with reported BP-lowering medication, could be included as hypertensive.

Previously reported variants

We compile a list of all SNVs previously reported to be associated with BP at the time of analysis (Supplementary Table 13). This list includes all published SNVs which have been identified and validated from previous GWAS, CardioMetabochip and exome chip projects7–11. We augment this list to include all 34,459 SNVs in Linkage Disequilibrium (LD) with these previously reported SNVs, according to a threshold of r2 ≥ 0.2. Results for all these variants are extracted for each of the three BP traits, to check previously reported BP associations in the UKB data, according to whether the sentinel SNV or a variant at the locus in LD (r2 ≥ 0.2) with it showed evidence of support (P < 0.01) for association with at least one of the three BP traits.

Replication strategy

We use three independent external data sets for replication (Supplementary Note). First, for the GWAS analysis based on advanced 1000G imputation enhanced by UK10K data we consider SNVs with MAF ≥ 1% and perform a reciprocal replication exchange with the International Consortium of Blood Pressure (ICBP) 1000G meta-analysis (max N = 150,134). The imputation strategy for ICBP 1000G meta-analysis is based on an earlier imputation grid for the 1000G project. In addition, we recruit further cohorts with 1000G data which had not contributed to the ICBP-1000G discovery meta-analysis: ASCOT-UK (N = 3,803), ASCOT-SC (N = 2,462), BRIGHT (N = 1,791), Generation Scotland (GS) (N = 9,749), EGCUT (N = 5,468), Lifelines (N = 13,292) and PREVEND (N = 3,619). This gives a total of N = 190,318 independent replication samples for the GWAS analysis. Second, because the UK Biobank and UK BiLEVE genotyping chips contain exome content, we sought replication from two BP exome consortia (European exome consortium and the Cohorts for Heart and Ageing research in Genome Epidemiology – CHARGE BP exome consortium), to allow validation of coding variants and variants with lower frequency. The European exome consortium (N = 161,926) and CHARGE consortium (N = 119,792) give a total of N = 281,718 independent replication samples for the exome analysis. Note that the lookups for GWAS and exome discovery are distinct sets of SNVs. Loci are assigned sequentially, prioritising the primary GWAS discovery first, then considering any remaining loci with non-overlapping exome content for replication in the independent exome replication resources.

Statistical criteria for replication

For the GWAS discovery, there are ˜9.8 million SNVs with MAF ≥ 1% and INFO > 0.1. We consider for follow-up any SNVs with P < 1x10-6 for any of the three BP traits. For the exome discovery, there are 149,026 exome SNVs (Supplementary Note) which were polymorphic with INFO > 0.1; for follow-up we consider all SNVs with MAF ≥ 0.01% and P < 1x10-5. All such SNVs are annotated to loci according to both an LD threshold of r2 ≥ 0.2 and a 1Mb interval region (see Supplementary Note), and signals are classified either as belonging to unvalidated loci, or being potential secondary signals at previously reported loci at the time of analysis.

Selection of variants for follow-up

The sentinel (most significant) SNV from each association signal is selected for follow-up, all of which are pairwise-independent by LD (r2 < 0.2). For the GWAS discovery, we check that potential lookup SNVs are covered within the ICBP-1000G replication data (Supplementary Note; Supplementary Tables 28 and 29). Of the 235 novel loci containing previously unreported SNVs with MAF ≥ 1%, INFO > 0.1 and P<1x10-6, 218 are covered, and similarly 100 of the 123 potential secondary SNVs at 51 of the 54 previously reported BP loci are available for follow-up. For the exome discovery, by following up SNVs with MAF ≥ 0.01%, INFO > 0.1 and P < 1x10-5 across the three BP traits, we carry forward for replication sentinel SNVs at 22 unvalidated loci, and potential secondary SNVs at three previously reported loci at the time of analysis. We produce locus zoom plots for each of the lookup variants.

Replication meta-analyses

The replication and combined meta-analyses are performed within METAL software66 using fixed effects inverse variance weighted meta-analysis (Supplementary Note). The combined meta-analysis of both the UKB discovery (N = 140,886) and GWAS replication meta-analysis (max N = 190,070) include a total maximum sample size of N = 330,956. For the exome combined meta-analysis, we synthesize data from the UKB discovery exome content (max N=140,866), with the replication dataset from both exome consortia (total max N=281,718), giving a maximum sample size of N=422,604.

Validation Criteria

In our study a signal is declared validated if it satisfies ALL of the following three criteria: the sentinel SNV is genome-wide significant (P < 5×10-8) in the combined meta-analysis for any of the three BP traits; the sentinel SNV shows evidence of support (P < 0.01) in the replication meta-analysis alone for association with the most significantly associated BP trait from the combined meta-analysis (NB: P < 0.01 is more stringent than a range of thresholds calculated according to False Discovery Rate (FDR), see Supplementary Methods); the sentinel SNV has concordant direction of effect between the UKB discovery and the replication meta-analysis for the most significantly associated BP trait from the combined meta-analysis.

Secondary signals

By conditional analysis within UKB data we assess all validated secondary signals from our validated and previously reported loci at the time of analysis for independence from the sentinel or previously reported SNV, respectively (Supplementary Note). We declare a secondary signal to be independent of the previously reported SNV if there is less than a 1.5 fold difference between the main association and conditional association P-values on a –log10 scale, i.e. if –log10(P) / -log10(P_cond) < 1.5. Note that the lookup criteria already ensure that the secondary variant is not in LD (r2 < 0.2) with the previously reported SNV. If more than one SNV in a region is found to be independent we undertake further rounds of iterative conditional analysis.

Lookups in non-European ancestries

As a secondary analysis, we look up 102 and 5 validated SNVs from the GWAS and exome analyses, respectively, in non-European ancestry samples. These comprise analysis of East Asian (N = 31,513) and South Asian (N = 33,115) ancestry data from the iGEN-BP consortium11 for the GWAS lookups, and South Asian (N = 25,937), African American (N = 21,488) and Hispanic (N = 4,581) ancestry data from the CHARGE BP exome consortium10 and CHD+ Exome consortium9, for the exome content lookups (Supplementary Note). We carry out a binomial (sign) test based on the number of SNVs with consistent directions of effect between UKB and each of the non-European ancestry samples.

Monogenic blood pressure gene lookups

The UKB arrays include some rare coding variants for monogenic disorders. We collate a list of all specific mutation variants within genes known to be associated with monogenic BP disorders19. Results from the UKB association analyses for all three BP traits are extracted for any of these SNVs directly covered within the UKB dataset (Supplementary Table 14). Note that a search of proxies did not augment the list of available variants, so results are reported for the specific variants only.

Functional analyses

In order to prioritize associated SNVs, we use an integrative bioinformatics approach to collate functional annotation (Supplementary Table 30) at both the variant and gene level for each SNV within the BP loci (all SNVs in LD r2 ≥ 0.8 with the BP-associated SNVs). At the variant level we use ANNOVAR67 to obtain comprehensive functional characterization of variants, including gene location, conservation and amino acid substitution impact based on a range of prediction tools including SIFT and polyphen2. All nonsynonymous variants were predicted damaging by two or more methods. We use the University of California Santa Cruz (UCSC) genome browser to review sequence specific context of SNVs in relation to function, particularly in the Encyclopedia of DNA Elements (ENCODE) dataset68. We use the UCSC table browser to annotate SNVs in ENCODE regulatory regions. We evaluate SNVs for impact on putative micro RNA target sites in the 3’ un-translated regions (3’UTR) of transcripts by a query of the miRNASNP database69. We evaluate all SNVs in LD (r2 ≥ 0.8) with our validated sentinel SNVs for evidence of mediation of expression quantitative trait loci (eQTL) in all 44 tissues using the Genotype-Tissue Expression (GTEx) database, in order to identify validated loci which are highly expressed, and to highlight specific tissue types which show eQTLs for a large proportion of validated loci. We further seek to identify validated loci with the strongest evidence of eQTL associations in arterial tissue, in particular. At the gene level, we use Ingenuity Pathway Analysis (IPA) software (IPA®,QIAGEN Redwood City) to review genes with prior links to BP, based on annotation with the “Blood Pressure” Medline Subject Heading (MESH) term which is annotated to 684 genes. We also use IPA to identify genes which interact with BP MESH annotated genes, and evaluate genes for evidence of small molecule druggability based on queries of Chembl and Drug Gene Interaction database. We then perform overall enrichment testing across all loci. Firstly, we use DEPICT70 (Data-driven Expression Prioritized Integration for Complex Traits) to identify highly expressed tissues and cells within the BP loci. DEPICT uses a large number of microarrays (˜37k) to identify cells and tissues where the genes are highly expressed and uses precomputed GWAS phenotypes to adjust for co-founding sources. DEPICT provides a P-value of enrichment and false discovery rates adjusted P-values for each tissue/cells tested. Furthermore, to investigate regulatory regions, we employ a two tiered approach to investigate cell type specific enrichment within DNase I sites using FORGE, which tests for enrichment of SNVs within DNase I sites in 123 cell types from the Epigenomics Roadmap Project and ENCODE71 (Supplementary Note). Validated sentinel SNVs from our study are analysed along with previously reported SNVs at the time of analysis and secondary signals (with P-value < 1×10-4) to evaluate the overall tissue specific enrichment of BP associated variants. In a second analysis we use FORGE (with no LD filter) to investigate directly our curated candidate regulatory SNVs for overlap with cell-specific DNase I signals. GenomeRunner72 is used to search for enrichment of validated and previously reported sentinel SNVs with histone modification mark genomic features (Supplementary Note). Relevant cardiovascular tissue expression is investigated using Fantom5 reference transcript expression data (fantom.gsc.riken.jp/5) (Supplementary Note). We use IPA (IPA®,QIAGEN Redwood City) to identify biological pathways and transcriptional upstream regulators enriched for genes within the BP loci. The transcriptional upstream regulator analysis aims to identify transcription factors, compounds, drugs, kinases and other molecules, for which the target is one of the BP genes under investigation. We query SNVs against PhenoScanner16 to investigate trait pleiotropy, extracting all association results with nominal significance at P < 0.05 for full reporting (Supplementary Table 16), and then extract genome-wide significant results to highlight the validated loci with strongest evidence of association with other traits (Supplementary Fig. 5a). We also use the Genomic Regions Enrichment of Annotations Tool (GREAT) to study gene set enrichment of mouse phenotype and disease ontology terms within our validated and previously reported loci at the time of analysis, using default SNV to gene mapping settings73. We carry out metabolomics analysis using two sets of data. First we use 1H NMR lipidomics data on plasma from a subset of 2,000 participants of the Airwave Health Monitoring Study74,75 (Supplementary Note). For each validated BP-associated SNV we ran association tests with the lipidomics data using linear regression analyses, adjusted for age and sex. We computed significance thresholds using a permutation derived family wise error rate (5%) to account for the high correlation structure of these data (ENT=35)76. We also test each validated SNV against published genome-wide vs metabolome-wide associations in plasma and urine using publicly available data from the “Metabolomics GWAS Server” to identify metabolites that have been associated with variants of interest at P < 3.0 x 10-4 (Bonferroni corrected P for validated signals)22,23.

Experimental methods

We prioritize genes for laboratory testing on the basis of evidence for SNV function (including coding variants, eQTLs and Hi-C interactions), biological support for relevance to BP (from literature review) and transgenic phenotype. We perform genotyping and Quantitative Reverse-Transcription Polymerase Chain Reaction (q RT-PCR) for the selected sentinel variants of interest using human vascular smooth muscle cells and endothelial cells and test for expression levels (Supplementary Note; Supplementary Table 31). All three SNVs were tested using an additive model.

Genetic risk scores

Genetic risk scores (GRS) are constructed using the independent Airwave study74 data to assess the combined effect of the BP-associated variants on BP and risk of hypertension (Supplementary Note), whilst avoiding bias by “winners curse”. We create weighted GRSs for all pairwise-independent, LD-filtered (r2 < 0.2) previously reported variants at the time of analysis and our validated variants (sentinel and secondary SNVs) combined, using available SNVs (Supplementary Table 22). For the previously reported variants, we weight BP increasing alleles by the beta coefficients from the UKB analysis. For our validated variants, beta coefficients of the replication meta-analysis are used as independent, unbiased weights. For the variance explained analyses within the independent Airwave cohort, we use three trait-specific GRSs (SBP, DBP, PP). Each GRS includes all variants, but weights are trait-specific, using the beta coefficients from the analysis of each of the three different BP traits, e.g. the SBP-GRS is weighted by the beta coefficients from the SBP-GWAS. To calculate the percent of variance for each BP trait explained by its corresponding trait-specific GRS, not accounted for by known factors, we generate the residuals from the regression model of each trait against covariates of age, age2, sex and body mass index. We then fit a second linear model for the trait residuals with all the variants in the GRS plus the top 10 PCs. For risk score analyses we calculate a single BP GRS, as the average of the SBP and DBP GRSs. We standardize the average GRS to have mean of zero and standard deviation of one. We assess the association of the continuous average GRS variable with each BP trait by simple linear regression. We also run a logistic regression to examine the association of the average GRS with risk of hypertension. We perform each analysis both with and without adjustment for sex. We test for interaction between age (< 50, and ≥50 years) and the effect of the GRS on BP. We then compare BP levels and risk of hypertension for individuals in the top and bottom 20% of the GRS distribution at ≥50 years using linear and logistic regression, respectively. We also assess the association of the average BP GRS with cardiovascular outcomes in the UKB data. We include all pairwise-independent previously reported BP variants at the time of analysis, and our validated variants. We use logistic regression with binary outcome variables for coronary heart disease, stroke and cardiovascular disease (see Supplementary Note) and GRS as explanatory variable (with and without sex adjustment).
  75 in total

1.  Vascular Nox4: a multifarious NADPH oxidase.

Authors:  Rhian M Touyz; Augusto C Montezano
Journal:  Circ Res       Date:  2012-04-27       Impact factor: 17.367

2.  Exploring the landscape of the genome.

Authors:  Michael R Barnes
Journal:  Methods Mol Biol       Date:  2010

3.  Phosphodiesterase 5A inhibition induces Na+/H+ exchanger blockade and protection against myocardial infarction.

Authors:  Néstor G Pérez; Martín R Piaggio; Irene L Ennis; Carolina D Garciarena; Celina Morales; Eduardo M Escudero; Oscar H Cingolani; Gladys Chiappe de Cingolani; Xiao-Ping Yang; Horacio E Cingolani
Journal:  Hypertension       Date:  2007-03-05       Impact factor: 10.190

4.  Endothelial Nox4 NADPH oxidase enhances vasodilatation and reduces blood pressure in vivo.

Authors:  Robin Ray; Colin E Murdoch; Minshu Wang; Celio X Santos; Min Zhang; Sara Alom-Ruiz; Narayana Anilkumar; Alexandre Ouattara; Alison C Cave; Simon J Walker; David J Grieve; Rebecca L Charles; Philip Eaton; Alison C Brewer; Ajay M Shah
Journal:  Arterioscler Thromb Vasc Biol       Date:  2011-03-17       Impact factor: 8.311

5.  Biological interpretation of genome-wide association studies using predicted gene functions.

Authors:  Tune H Pers; Juha M Karjalainen; Yingleong Chan; Harm-Jan Westra; Andrew R Wood; Jian Yang; Julian C Lui; Sailaja Vedantam; Stefan Gustafsson; Tonu Esko; Tim Frayling; Elizabeth K Speliotes; Michael Boehnke; Soumya Raychaudhuri; Rudolf S N Fehrmann; Joel N Hirschhorn; Lude Franke
Journal:  Nat Commun       Date:  2015-01-19       Impact factor: 14.919

6.  Vascular stiffness and increased pulse pressure in the aging cardiovascular system.

Authors:  Jochen Steppan; Viachaslau Barodka; Dan E Berkowitz; Daniel Nyhan
Journal:  Cardiol Res Pract       Date:  2011-08-02       Impact factor: 1.866

7.  Nox4 and redox signaling mediate TGF-β-induced endothelial cell apoptosis and phenotypic switch.

Authors:  F Yan; Y Wang; X Wu; H M Peshavariya; G J Dusting; M Zhang; F Jiang
Journal:  Cell Death Dis       Date:  2014-01-23       Impact factor: 8.469

8.  PhenoScanner: a database of human genotype-phenotype associations.

Authors:  James R Staley; James Blackshaw; Mihir A Kamat; Steve Ellis; Praveen Surendran; Benjamin B Sun; Dirk S Paul; Daniel Freitag; Stephen Burgess; John Danesh; Robin Young; Adam S Butterworth
Journal:  Bioinformatics       Date:  2016-06-17       Impact factor: 6.937

9.  Evaluating the contribution of genetics and familial shared environment to common disease using the UK Biobank.

Authors:  María Muñoz; Ricardo Pong-Wong; Oriol Canela-Xandri; Konrad Rawlik; Chris S Haley; Albert Tenesa
Journal:  Nat Genet       Date:  2016-07-18       Impact factor: 38.330

10.  The Airwave Health Monitoring Study of police officers and staff in Great Britain: rationale, design and methods.

Authors:  Paul Elliott; Anne-Claire Vergnaud; Deepa Singh; David Neasham; Jeanette Spear; Andy Heard
Journal:  Environ Res       Date:  2014-09-06       Impact factor: 6.498

View more
  203 in total

1.  Hypertension: From GWAS to functional genomics-based precision medicine.

Authors:  David L Mattson; Mingyu Liang
Journal:  Nat Rev Nephrol       Date:  2017-03-06       Impact factor: 28.314

2.  Single-cell transcriptomics of the mouse kidney reveals potential cellular targets of kidney disease.

Authors:  Jihwan Park; Rojesh Shrestha; Chengxiang Qiu; Ayano Kondo; Shizheng Huang; Max Werth; Mingyao Li; Jonathan Barasch; Katalin Suszták
Journal:  Science       Date:  2018-04-05       Impact factor: 47.728

3.  Associations of Combined Genetic and Lifestyle Risks With Incident Cardiovascular Disease and Diabetes in the UK Biobank Study.

Authors:  M Abdullah Said; Niek Verweij; Pim van der Harst
Journal:  JAMA Cardiol       Date:  2018-08-01       Impact factor: 14.676

4.  Ordered multinomial regression for genetic association analysis of ordinal phenotypes at Biobank scale.

Authors:  Christopher A German; Janet S Sinsheimer; Yann C Klimentidis; Hua Zhou; Jin J Zhou
Journal:  Genet Epidemiol       Date:  2019-12-26       Impact factor: 2.135

Review 5.  Over 1000 genetic loci influencing blood pressure with multiple systems and tissues implicated.

Authors:  Claudia P Cabrera; Fu Liang Ng; Hannah L Nicholls; Ajay Gupta; Michael R Barnes; Patricia B Munroe; Mark J Caulfield
Journal:  Hum Mol Genet       Date:  2019-11-21       Impact factor: 6.150

6.  A multi-ancestry genome-wide study incorporating gene-smoking interactions identifies multiple new loci for pulse pressure and mean arterial pressure.

Authors:  Yun Ju Sung; Lisa de Las Fuentes; Thomas W Winkler; Daniel I Chasman; Amy R Bentley; Aldi T Kraja; Ioanna Ntalla; Helen R Warren; Xiuqing Guo; Karen Schwander; Alisa K Manning; Michael R Brown; Hugues Aschard; Mary F Feitosa; Nora Franceschini; Yingchang Lu; Ching-Yu Cheng; Xueling Sim; Dina Vojinovic; Jonathan Marten; Solomon K Musani; Tuomas O Kilpeläinen; Melissa A Richard; Stella Aslibekyan; Traci M Bartz; Rajkumar Dorajoo; Changwei Li; Yongmei Liu; Tuomo Rankinen; Albert Vernon Smith; Salman M Tajuddin; Bamidele O Tayo; Wei Zhao; Yanhua Zhou; Nana Matoba; Tamar Sofer; Maris Alver; Marzyeh Amini; Mathilde Boissel; Jin Fang Chai; Xu Chen; Jasmin Divers; Ilaria Gandin; Chuan Gao; Franco Giulianini; Anuj Goel; Sarah E Harris; Fernando P Hartwig; Meian He; Andrea R V R Horimoto; Fang-Chi Hsu; Anne U Jackson; Candace M Kammerer; Anuradhani Kasturiratne; Pirjo Komulainen; Brigitte Kühnel; Karin Leander; Wen-Jane Lee; Keng-Hung Lin; Jian'an Luan; Leo-Pekka Lyytikäinen; Colin A McKenzie; Christopher P Nelson; Raymond Noordam; Robert A Scott; Wayne H H Sheu; Alena Stančáková; Fumihiko Takeuchi; Peter J van der Most; Tibor V Varga; Robert J Waken; Heming Wang; Yajuan Wang; Erin B Ware; Stefan Weiss; Wanqing Wen; Lisa R Yanek; Weihua Zhang; Jing Hua Zhao; Saima Afaq; Tamuno Alfred; Najaf Amin; Dan E Arking; Tin Aung; R Graham Barr; Lawrence F Bielak; Eric Boerwinkle; Erwin P Bottinger; Peter S Braund; Jennifer A Brody; Ulrich Broeckel; Brian Cade; Archie Campbell; Mickaël Canouil; Aravinda Chakravarti; Massimiliano Cocca; Francis S Collins; John M Connell; Renée de Mutsert; H Janaka de Silva; Marcus Dörr; Qing Duan; Charles B Eaton; Georg Ehret; Evangelos Evangelou; Jessica D Faul; Nita G Forouhi; Oscar H Franco; Yechiel Friedlander; He Gao; Bruna Gigante; C Charles Gu; Preeti Gupta; Saskia P Hagenaars; Tamara B Harris; Jiang He; Sami Heikkinen; Chew-Kiat Heng; Albert Hofman; Barbara V Howard; Steven C Hunt; Marguerite R Irvin; Yucheng Jia; Tomohiro Katsuya; Joel Kaufman; Nicola D Kerrison; Chiea Chuen Khor; Woon-Puay Koh; Heikki A Koistinen; Charles B Kooperberg; Jose E Krieger; Michiaki Kubo; Zoltan Kutalik; Johanna Kuusisto; Timo A Lakka; Carl D Langefeld; Claudia Langenberg; Lenore J Launer; Joseph H Lee; Benjamin Lehne; Daniel Levy; Cora E Lewis; Yize Li; Sing Hui Lim; Ching-Ti Liu; Jianjun Liu; Jingmin Liu; Yeheng Liu; Marie Loh; Kurt K Lohman; Tin Louie; Reedik Mägi; Koichi Matsuda; Thomas Meitinger; Andres Metspalu; Lili Milani; Yukihide Momozawa; Thomas H Mosley; Mike A Nalls; Ubaydah Nasri; Jeff R O'Connell; Adesola Ogunniyi; Walter R Palmas; Nicholette D Palmer; James S Pankow; Nancy L Pedersen; Annette Peters; Patricia A Peyser; Ozren Polasek; David Porteous; Olli T Raitakari; Frida Renström; Treva K Rice; Paul M Ridker; Antonietta Robino; Jennifer G Robinson; Lynda M Rose; Igor Rudan; Charumathi Sabanayagam; Babatunde L Salako; Kevin Sandow; Carsten O Schmidt; Pamela J Schreiner; William R Scott; Peter Sever; Mario Sims; Colleen M Sitlani; Blair H Smith; Jennifer A Smith; Harold Snieder; John M Starr; Konstantin Strauch; Hua Tang; Kent D Taylor; Yik Ying Teo; Yih Chung Tham; André G Uitterlinden; Melanie Waldenberger; Lihua Wang; Ya Xing Wang; Wen Bin Wei; Gregory Wilson; Mary K Wojczynski; Yong-Bing Xiang; Jie Yao; Jian-Min Yuan; Alan B Zonderman; Diane M Becker; Michael Boehnke; Donald W Bowden; John C Chambers; Yii-Der Ida Chen; David R Weir; Ulf de Faire; Ian J Deary; Tõnu Esko; Martin Farrall; Terrence Forrester; Barry I Freedman; Philippe Froguel; Paolo Gasparini; Christian Gieger; Bernardo Lessa Horta; Yi-Jen Hung; Jost Bruno Jonas; Norihiro Kato; Jaspal S Kooner; Markku Laakso; Terho Lehtimäki; Kae-Woei Liang; Patrik K E Magnusson; Albertine J Oldehinkel; Alexandre C Pereira; Thomas Perls; Rainer Rauramaa; Susan Redline; Rainer Rettig; Nilesh J Samani; James Scott; Xiao-Ou Shu; Pim van der Harst; Lynne E Wagenknecht; Nicholas J Wareham; Hugh Watkins; Ananda R Wickremasinghe; Tangchun Wu; Yoichiro Kamatani; Cathy C Laurie; Claude Bouchard; Richard S Cooper; Michele K Evans; Vilmundur Gudnason; James Hixson; Sharon L R Kardia; Stephen B Kritchevsky; Bruce M Psaty; Rob M van Dam; Donna K Arnett; Dennis O Mook-Kanamori; Myriam Fornage; Ervin R Fox; Caroline Hayward; Cornelia M van Duijn; E Shyong Tai; Tien Yin Wong; Ruth J F Loos; Alex P Reiner; Charles N Rotimi; Laura J Bierut; Xiaofeng Zhu; L Adrienne Cupples; Michael A Province; Jerome I Rotter; Paul W Franks; Kenneth Rice; Paul Elliott; Mark J Caulfield; W James Gauderman; Patricia B Munroe; Dabeeru C Rao; Alanna C Morrison
Journal:  Hum Mol Genet       Date:  2019-08-01       Impact factor: 6.150

7.  Genome-wide discovery of genetic loci that uncouple excess adiposity from its comorbidities.

Authors:  Lam O Huang; Alexander Rauch; Eugenia Mazzaferro; Michael Preuss; Stefania Carobbio; Cigdem S Bayrak; Nathalie Chami; Zhe Wang; Ursula M Schick; Nancy Yang; Yuval Itan; Antonio Vidal-Puig; Marcel den Hoed; Susanne Mandrup; Tuomas O Kilpeläinen; Ruth J F Loos
Journal:  Nat Metab       Date:  2021-02-22

8.  Type 2 Diabetes and Hypertension.

Authors:  Dianjianyi Sun; Tao Zhou; Yoriko Heianza; Xiang Li; Mengyu Fan; Vivian A Fonseca; Lu Qi
Journal:  Circ Res       Date:  2019-03-15       Impact factor: 17.367

9.  Chrm3 Gene and M3 Muscarinic Receptors Contribute to Salt-Sensitive Hypertension.

Authors:  Allen W Cowley
Journal:  Hypertension       Date:  2018-09       Impact factor: 10.190

10.  Development and evaluation of a transfusion medicine genome wide genotyping array.

Authors:  Yuelong Guo; Michael P Busch; Mark Seielstad; Stacy Endres-Dighe; Connie M Westhoff; Brendan Keating; Carolyn Hoppe; Aarash Bordbar; Brian Custer; Adam S Butterworth; Tamir Kanias; Alan E Mast; Steve Kleinman; Yontao Lu; Grier P Page
Journal:  Transfusion       Date:  2018-11-20       Impact factor: 3.157

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.