Literature DB >> 34621299

HIF-1α Pulmonary Phenotype Wide Association Study Unveils a Link to Inflammatory Airway Conditions.

Jelte Kelchtermans1,2, Xiao Chang2, Michael E March2, Frank Mentch2, Patrick M A Sleiman2, Hakon Hakonarson1,2.   

Abstract

Despite experimental data linking HIF-1α dysfunction to inflammatory airway conditions, the effect of single nucleotide polymorphisms within the HIF1A gene on these conditions remains poorly understood. In the current study, we complete a phenotype wide association study to assess the link between SNPs with known disease associations and respiratory phenotypes. We report two SNPs of the HIF1A gene, the intronic rs79865957 and the missense rs41508050. In these positions the A and the T allele are significantly associated with allergic rhinitis and acute bronchitis and bronchiolitis, respectively. These findings further support the role of HIF-1α in inflammatory pulmonary conditions and may serve as a basis to refine our understanding of other HIF-1α associated phenotypes.
Copyright © 2021 Kelchtermans, Chang, March, Mentch, Sleiman and Hakonarson.

Entities:  

Keywords:  HIF1A; SNP; airway; bronchiolitis; hypoxia inducible factor; inflammation; phenotype; rhinitis

Year:  2021        PMID: 34621299      PMCID: PMC8490729          DOI: 10.3389/fgene.2021.756645

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


Introduction

Hypoxia inducible factor (HIF) is central in the mammalian response to hypoxia. (Majmundar et al., 2010) HIF-1 is a nuclear factor that consists of a Hypoxia inducible factor 1α (HIF-1α) and a Hypoxia inducible factor 1β (HIF-1β) subunit. (Wang and Semenza, 1993; Gladek et al., 2017) While HIF-1β is stable regardless of the oxygen concentration, HIF-1α is rapidly degraded under normoxic conditions. (Yu et al., 1998; Gladek et al., 2017) Under hypoxic conditions however, HIF-1α is stabilized, leading to formation of HIF-1. (Ke and Costa, 2006; Slemc and Kunej, 2016) HIF-1 then in turn acts as a transcription factor, affecting over 98 target genes associated with up to 20 biological pathways. (Ke and Costa, 2006; Slemc and Kunej, 2016) Given this central role, it comes as no surprise that variations within the highly conserved HIF1A gene have been associated with a wide array of pathologic conditions. (Majmundar et al., 2010) Apart from playing an important role in normal lung development, HIFs have been shown to play a central role in the development of multiple pulmonary conditions, including pulmonary hypertension, Chronic obstructive pulmonary disease (COPD) and lung cancer angiogenesis. (Shimoda and Semenza, 2011) Despite this, within pulmonology, to date, variations within the HIF1A gene have only been associated with COPD and lung cancer. (Chan et al., 2017; Gladek et al., 2017; Paradowska-Gorycka et al., 2018; Wang et al., 2018; Hoang et al., 2019; Huang et al., 2020) Our current study sets out to examine the association between single nucleotide polymorphisms (SNPs) in the HIF1A gene and respiratory phenotypes. By starting with SNPs of interest, the Phenotype Wide Association Study (PheWAS) design flips the direction of inference commonly used in genome-wide association studies (GWAS). (Bush et al., 2016) To do so, it integrates data captured from patient’s electronic health records (EHRs) with their genetic information. The major benefit of this approach is that it allows us to focus our efforts specifically on SNPs with known disease associations within this master regulator gene, improving the likelihood that found associations are based on molecular mechanisms that are relevant to the disease phenotypes uncovered.

Methods

Single Nucleotide Polymorphisms Selection

SNPs were selected from enriched literature review, including recently completed review by Gladek et al. (Gladek et al., 2017) All studies identified with keywords “HIF1a” and “variant” published after their literature review was completed, were reviewed. SNPs significantly associated with human disease were included in the current study (Table 1).
TABLE 1

Minor allele frequency for all SNPs included in the current study, unless otherwise indicated the data was pulled from the genome aggregation database (Karczewski et al., 2020).

SNP IDSubstitutionMAF
rs1957757T>C0.326–0.952
rs12434438G>A0.117–0.841a
rs10873142C>T0.338–0.914
rs41508050C>T0.000292–0.0189
rs2301113C>A0.188–0.881
rs11549465C>T0.0366–0.155
rs11549467G>A0.000574–0.0431
rs199775054G>C0.000–0.001b
rs113182457rs60361955insGTUnavailable
rs2057482T>C0.658–0.939
rs2783778C>T0.184–0.870
rs7148720T>C0.00575–0.151
rs1535679A>C0.185–0.872
rs28708675A>T0.000218–0.370
rs1319462G>A0.682–0.944
rs1957755G>A0.000–0.075
rs41362550T>C0.0283–0.0755
rs7143164G>C0.0485–0.523
rs1951795A>C0.227–0.913
rs12435848A>G0.249–0.912
rs2301104G>C0.000344–0.0147
rs10129270G>A0.0311–0.373
rs8005745T>A0.369–0.952
rs779897997C>AUnavailable
rs4899056T>C0.262–0.948
rs11158358G>C0.655–0.928
rs2301111G>C0.222–0.902
rs966824T>C0.717–0.977
rs41492849C>T0.0000648–0.00207
rs34005929G>A0.000459–0.00943
rs61755645A>T0.00161–0.0150
rs4902080T>C0.654–0.977
rs4902082C>T0.226–0.862
rs17099207G>A0.237–0.399
rs142179458G>A0.000574–0.0282
rs12434439G>C0.110–0.506
rs76308410C>T0.0621–0.149
rs74481028A>G0.0759–0.226
rs7161527T>C0.686–0.939
rs10147275T>G0.680–0.939
rs2301108A>G0.379–0.952
rs79865957G>A0.000230–0.00210

Data from the Allele Frequency Aggregator (Phan Yj et al., 2020).

Data from the 1000 Genomes Project (Auton et al., 2015).

Minor allele frequency for all SNPs included in the current study, unless otherwise indicated the data was pulled from the genome aggregation database (Karczewski et al., 2020). Data from the Allele Frequency Aggregator (Phan Yj et al., 2020). Data from the 1000 Genomes Project (Auton et al., 2015).

Population

Subjects were drawn from The Children’s Hospital of Philadelphia (CHOP) biorepository at the Center for Applied Genomics (CAG). The pediatric samples included in this biorepository are linked to subjects’ EMRs. All subjects have consented to both genomic analysis and EMR mining (Gottesman et al., 2013).

Genotype Imputation

Genotype data were generated by the Center for Applied Genomics on patients recruited from CHOP and were acquired on four major genotyping arrays (HumanHapMap550, 610Q, OMNI2.5M and the GSA array). Where possible, data from similar arrays were merged. Data were filtered for genotype missingness (geno 0.1), individual missingness (0.02), and minor allele frequency (MAF) (0.01) using PLINK v1.9. (Chang et al., 2015) Data were imputed using the TOPMed v2 reference panel on the TOPMed Imputation Server. (Fuchsberger et al., 2015; Das et al., 2016; Taliun et al., 2021) Imputed genotypes were filtered on combinations of Rsq (imputation quality metric) and MAF [(MAF ≥ 0.05 and Rsq > 0.3) OR (MAF < 0.05 and Rsq > 0.5)] using BCFTools v1.10.2, and only SNPs that remained in 85% of samples were retained for use in PheWAS analysis (Danecek et al., 2021).

Ancestry Identification

Subjects in the PheWAS cohort were separated by ancestry based on the results of principal component analysis (PCA). PCA was performed using flashpca on approximately 2.4 million imputed SNPs with MAF >0.01 that had been pruned for linkage disequilibrium using PLINK v1.9 (Abraham and Inouye, 2014; Chang et al., 2015) The first three principle components were plotted, and ancestry designation was performed by comparison to the reference genotypes from the HapMap consortium. (Altshuler et al., 2010) The complete dataset contained 71,600 individuals: 34,410 Caucasians, 31,507 African Americans, 2644 Hispanics, and 3039 East Asians.

PheWAS

A PheWAS was conducting using the published PheWAS R package from Carroll et al. (v0.99.5-5). (Carroll et al., 2014) International Classification of Diseases 9 (ICD-9) codes were obtained from an anonymized extraction of the Children’s Hospital of Philadelphia diagnosis database that contained subjects that had been recruited into the patient collection of the Center for Applied Genomics. Counts of the occurrence of each ICD-9 code for each subject were generated, and the resulting table was converted into the PheWAS phenotype table by a function in the R package. Subjects were included in the case group for each PheWAS phenotype if they possessed two or more occurrences of any of the ICD-9 codes that composed the phenotype in question. Subjects were listed as controls for the PheWAS phenotype if they lacked the case-defining ICD-9 codes, as well as ICD-9 codes corresponding to closely related phenotypes. Conversion from ICD-9 codes to PheWAS phenotypes was performed using the default translation table included in the R package. Phenotypes were analyzed in the PheWAS if they were represented by 20 or more cases in the cohort. The subject’s sex and age were included as covariates in the analysis, as were the 10 flashpca generated principle components and a variable representing the group in which genotyping array had been imputed. Genotypes were extracted from the imputed data as allele dose information to preserve some information regarding genotype probability, and the allele doses were used as the genotype inputs to the PheWAS. The PheWAS analysis was performed individually on each PCA-defined ancestry, and then a meta-analysis was performed combining all four ancestries using the PheWAS-meta function provided in the PheWAS R package. For the association test, a logistic regression model, adjusted for age and sex was used. For defining significance in this study, we set a FDR threshold of 0.05. As a total of 2146 traits were analyzed, the over-conservative significance threshold based on Bonferroni correction was p = 2.3 × 10–5.

In Silico Validation

SNP’s significantly associated with respiratory disease were validated in an independent cohort by querying the publicly available Open Target Genetics database. (Ghoussaini et al., 2021) The Ensembl VEP was then used to assess the likely effect of these variants. (McLaren et al., 2016) To assess chromatin state and regulatory potential associated with the locations of the SNPs, other publicly available databases including Haploreg and Encode were queried.

Results

We found 42 SNPs that have been previously associated with different medical conditions, including various cancers, cardiovascular diseases, metabolic disorders and (auto) immune diseases. This includes the 34 SNPs identified by Gladek et al. (Gladek et al., 2017) In addition, eight more SNPs were identified in studies published after their literature review was completed (Chan et al., 2017; Paradowska-Gorycka et al., 2018; Wang et al., 2018; Hoang et al., 2019; Huang et al., 2020). Of the 42 SNPs included in our PheWAS, nine were significantly associated with at least one disorder. Table 2 summarizes the data for all the SNP-phenotype associations passing False Discovery Rate (FDR) or Bonferroni test. Most of the detected associations were from cohorts with less than 500 cases. However, the A allele of SNP rs79865957 was found to be significantly associated with allergic rhinitis (Figure 1) in a European cohort of 4,348 cases and 18,794 controls with an allele frequency of 0.08%. The OR was 2.86, Beta 1.05, SE 0.25 and p-value 3.48E−05. The second, rs41508050, the T allele was significantly associated with acute bronchitis and bronchiolitis (Figure 2) in an African American cohort of 2,234 cases and 21,463 controls with an allele frequency of 0.18%. The OR was 0.32, Beta 1.21, SE 3.36 and p-value 0.0001.
TABLE 2

SNP-phenotype associations passing False Discovery Rate (FDR) or Bonferroni test.

PopulationphecodeDescriptionsnpbetaORSEPn_casesn_controlsbonferronifdr
European573.3Hepatomegalyrs617556451.7280338760.4098013085.6295745782.48E−0510730,884TRUETRUE
 251.1Hypoglycemiars617556451.339524940.3376509723.8172296637.27E−0526428,603FALSETRUE
 476Allergic rhinitisrs798659571.1118144280.2787938883.0398690176.66E−05434818,794FALSETRUE
 155.1Malignant neoplasm of liver, primaryrs1421794582.97285E+1586170822.24Inf02032,275TRUETRUE
 961.1Poisoning/allergy of sulfonamidesrs14217945874.0538699817.243137651.45E+321.75E−0511828,063TRUETRUE
 696Psoriasis and related disordersrs287086751.3468067150.3401122453.8451273197.50E−0522029,124FALSETRUE
African242Thyrotoxicosis with or without goiterrs1421794583.4912518430.84804108732.827016283.84E−057228,558TRUETRUE
 569Other disorders of intestiners1421794583.299476230.8027196827.098441863.95E−059728,861TRUETRUE
 531Peptic ulcer (excl. esophageal)rs1421794584.4013473851.14870120481.560688320.000127322029,609FALSETRUE
 578.9Hemorrhage of gastrointestinal tractrs1421794584.1690900791.13750126664.656592920.0002472133127,843FALSETRUE
 427.11Paroxysmal supraventricular tachycardiars1421794583.948183871.10694708551.841131030.000361463826,415FALSETRUE
 300.9Posttraumatic stress disorderrs19517950.6714639110.1629193521.9571002453.76E−059123,629TRUETRUE
 300.9Posttraumatic stress disorderrs23011110.7205050230.1624031772.0554710089.14E−069123,629TRUETRUE
 379.2Disorders of vitreous bodyrs340059295.0242270521.198711093152.05268182.77E−052125,418TRUETRUE
 573.5Jaundice (not of newborn)rs340059293.0273181920.74278218320.641800954.59E−0523528,047FALSETRUE
 483Acute bronchitis and bronchiolitisrs415080501.2118424240.3176053343.3596688870.000135874223421,463FALSETRUE
 327.7Sleep related movement disordersrs798659574.7913845831.207932763120.46805137.29E−053523,361FALSETRUE
Asian195Cancer, suspected or otherrs23011042.8517546930.67149747917.318143222.17E−05252616TRUETRUE
 348Other conditions of brainrs23011041.9293174250.4921522576.8848092378.85E−05512399TRUETRUE
 801.1Fracture of footrs79865957−3.2555E+1452249787.9700202474TRUETRUE
FIGURE 1

PheWAS results for rs79865957.

FIGURE 2

PheWAS results for rs41508050.

SNP-phenotype associations passing False Discovery Rate (FDR) or Bonferroni test. PheWAS results for rs79865957. PheWAS results for rs41508050. Using the Open Target Genetics database rs79865957, the A allele was found to have been previously positively associated with both chronic airway obstruction (OR 1.94, p-value 0.0019, Beta 0.663) and asthma (OR 1.34, p-value 0.033, Beta 0.292). It has also been negatively associated with paternal chronic bronchitis/emphysema (OR 0.75, p-value 0.0069, Beta −0.293). Using Ensemble Variable Effect Predictor (VEP), it was found to be a likely intron variant for HIF1Α. For rs41508050, the T allele was previously negatively correlated with “Bring up phlegm/sputum/mucus on most days” (OR 0.72, p-value 0.0026, Beta −0.328) and is a missense variant for HIF1Α. The publicly available HaploReg tool was queried for both SNPs. SNP rs79865957 has four SNPs in linkage disequilibrium (r2 ≥ 0.8), two of which (rs76269977 and rs142660658) are intronic in the HIF1A gene. It is located in a regulatory region but not in a constrained sequence. It has histone H3K4me1_Enh enhancer marks in a lung carcinoma line and both H3K4me1_Enh and H3K27ac in a fetal lung fibroblast line. It is also a DNAse hypersensitivity site in a fetal lung fibroblast line. SNP rs41508050 has no other SNPs in linkage disequilibrium, is in a regulatory region and in a constrained sequence both by Genomic Evolutionary Rate Profiling and SiPhycons. It has histone H3K27ac_Enh marks in both lung fibroblast and lung carcinoma lines and is a DNase hypersensitivity site in a lung carcinoma cell line. Looking at Encode it had RFX5 bound in the GM12878 lymphoblastoid cell line. (Table 3)
TABLE 3

Summary of chromatin state and regulatory potential associated with the locations of the SNPs.

Locationrs79865957rs41508050
Reference alleleGC
Varriant alleleAT
Regulatory region?YesYes
Constrained sequence?NoYes
DNAse hypersensitivity site?In fetal fibroblast lineIn lung carcinoma cell line
Associated Histone markersH3K4me1_Enh and H3K27acH3K27ac_Enh
EncodeNo proteins boundRFX5 bound
Summary of chromatin state and regulatory potential associated with the locations of the SNPs.

Discussion

We present the results of a HIF-1α PheWAS analysis focused on association with respiratory phenotypes. We identified two SNPs that are significantly associated with respiratory disease. Given the allele rarity in our patient population, the Open Target Genetics database was queried in further support. This resource integrates knowledge derived from the UK Biobank with published data from other sources and provides an independent cohort to validate our findings. (Baumann and Cabassa, 2020) The prior associations with allergic airway disease in the form of asthma for rs79865957 and association with bringing up phlegm/sputum/mucus for rs41508050 are consistent with the respective associations with allergic rhinitis and acute bronchitis and bronchiolitis in our cohort, suggesting the association may be driven by the underlying biological “inflammation” process which is the central driver across all these phenotypes involving different organs. To address the likely impact of these variants we used the Ensembl VEP and the publicly available HaploReg tool (Ward and Kellis, 2012; McLaren et al., 2016), both of which underscore the possible significance of both variants. Adding to the evidence supporting a functional impact are the previously published associations between rs79865957 and diabetic kidney disease and between rs41508050 and angina versus myocardial infarction as initial presentation of coronary disease (Hlatky et al., 2007; Huang et al., 2020). Previously, variations within the HIF1A gene have been associated with COPD, lung cancer and a host of non-pulmonary conditions. (Gladek et al., 2017) Both the SNPs reported here had prior significant disease associations. First, rs79865957 was previously associated with diabetic kidney disease in a Han Chinese population. (Nava-Salazar et al., 2011) While to our knowledge the functional consequences of this SNP have not been eluded, the authors hypothesized that in a high glucose environment HIF1A transcription may be stimulated. Additionally, rs41508050 has a known association with the development of stable angina as opposed to myocardial infarction as initial presentation of coronary artery disease. (Hlatky et al., 2007) In vitro studies have previously linked this variant with a higher transcriptional activity. (Nava-Salazar et al., 2011) However, to our knowledge, the current study is the first to report on the association between SNPs of the HIF1A gene and allergic rhinitis, acute bronchitis and bronchiolitis. The reported association with allergic rhinitis is consistent with previously published experimental data highlighting the role of HIF-1α in allergic airway pathology. In an allergic airway disease model, HIF-1α inhibition decreased Th2 inflammation as measured by reduced IL-4, IL-5 and IL-13. (Kim et al., 2010) Beyond this, in a mouse model downregulation of HIF-1 or blockade of HIF-1α reduced cellular infiltrate in peribronchial lung tissues, thickness of smooth muscle and eosinophil infiltration. (Huerta-Yepez et al., 2008) Likewise, the role of HIF-1α in bronchiolitis is supported by experimental data on the consequences of HIF-1α stabilization by the Respiratory Syncytium Virus. (Kilani et al., 2004) Traditionally, GWAS identify SNPs significantly associated with human disease. These findings are then used to guide animal studies aiming to prove a causal link between the SNP and the disease. As briefly discussed above, the PheWAS design flips this process. It allowed us to look specifically at a highly conserved gene known to play a central role in the diseases of interest. In doing so, we were able to narrow down the list of SNPs within the HIF1A gene that play a potential role in respiratory pathology. Beyond this, we were able to detect significant effects of rare allelic variants. Conversely, this study design by definition excludes variants on other genes. While this is a limitation of the current study, given the hypoxemia dependent stabilization of HIF-1α and the experimental data supporting a role of HIF-1α in pulmonary conditions as outlined above it seemed reasonable to focus on HIF1A. Future studies may expand on the current work by including other members of the HIF family. Furthermoree, knowing that SNPs within the HIF1A gene are associated with respiratory diseases future studies can now refine our understanding of the associated phenotypes by looking at differences between patients with and without these SNPs.
  33 in total

1.  R PheWAS: data analysis and plotting tools for phenome-wide association studies in the R environment.

Authors:  Robert J Carroll; Lisa Bastarache; Joshua C Denny
Journal:  Bioinformatics       Date:  2014-04-14       Impact factor: 6.937

2.  Missense Variants in HIF1A and LACC1 Contribute to Leprosy Risk in Han Chinese.

Authors:  Dong Wang; Yu Fan; Mahadev Malhi; Rui Bi; Yong Wu; Min Xu; Xiu-Feng Yu; Heng Long; Yu-Ye Li; Deng-Feng Zhang; Yong-Gang Yao
Journal:  Am J Hum Genet       Date:  2018-04-26       Impact factor: 11.025

Review 3.  HIF1A gene polymorphisms and human diseases: Graphical review of 97 association studies.

Authors:  I Gladek; J Ferdin; S Horvat; G A Calin; T Kunej
Journal:  Genes Chromosomes Cancer       Date:  2017-03-31       Impact factor: 5.006

4.  General involvement of hypoxia-inducible factor 1 in transcriptional response to hypoxia.

Authors:  G L Wang; G L Semenza
Journal:  Proc Natl Acad Sci U S A       Date:  1993-05-01       Impact factor: 11.205

5.  SNPs in PRKCA-HIF1A-GLUT1 are associated with diabetic kidney disease in a Chinese Han population with type 2 diabetes.

Authors:  Yan Huang; Li Jin; Hairong Yu; Guozhi Jiang; Claudia H T Tam; Song Jiang; Chunxia Zheng; Feng Jiang; Rong Zhang; Hong Zhang; Juliana C N Chan; Ronald C W Ma; Weiping Jia; Cheng Hu; Zhihong Liu
Journal:  Eur J Clin Invest       Date:  2020-05-29       Impact factor: 4.686

6.  Polymorphisms in the hypoxia-inducible factor 1 alpha gene in Mexican patients with preeclampsia: A case-control study.

Authors:  Sonia Nava-Salazar; Elly N Sánchez-Rodríguez; C Adriana Mendoza-Rodríguez; Carlos Moran; Juan F Romero-Arauz; Marco A Cerbón
Journal:  BMC Res Notes       Date:  2011-03-17

7.  Fast principal component analysis of large-scale genome-wide data.

Authors:  Gad Abraham; Michael Inouye
Journal:  PLoS One       Date:  2014-04-09       Impact factor: 3.240

8.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

9.  Genetic variants of HIF1α are associated with right ventricular fibrotic load in repaired tetralogy of Fallot patients: a cardiovascular magnetic resonance study.

Authors:  Thanh T Hoang; Paulo Henrique Manso; Sharon Edman; Laura Mercer-Rosa; Laura E Mitchell; Anshuman Sewda; Michael D Swartz; Mark A Fogel; A J Agopian; Elizabeth Goldmuntz
Journal:  J Cardiovasc Magn Reson       Date:  2019-08-19       Impact factor: 5.364

10.  Reframing implementation science to address inequities in healthcare delivery.

Authors:  Ana A Baumann; Leopoldo J Cabassa
Journal:  BMC Health Serv Res       Date:  2020-03-12       Impact factor: 2.655

View more
  1 in total

1.  Host Factor Interaction Networks Identified by Integrative Bioinformatics Analysis Reveals Therapeutic Implications in COPD Patients With COVID-19.

Authors:  Wenjiang Zheng; Ting Wang; Peng Wu; Qian Yan; Chengxin Liu; Hui Wu; Shaofeng Zhan; Xiaohong Liu; Yong Jiang; Hongfa Zhuang
Journal:  Front Pharmacol       Date:  2021-12-23       Impact factor: 5.810

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.