Literature DB >> 25519355

Pathway-based analysis of rare and common variants to test for association with blood pressure.

Huda Alsulami1, Xiaofeng Liu2, Joseph Beyene3.   

Abstract

Our goal is to test the effect of both rare and common variants in a blood pressure study. We use a pathway-based approach, gene-set enrichment analysis, to search for related genes affecting 4 phenotypes: systolic blood pressure, diastolic blood pressure, the difference between each of them and mean arterial pressure, which is a weighted linear combination of systolic and diastolic blood pressure. Using the real Genetic Analysis Workshop 18 data, we consider both rare and common variants in our analysis and incorporate other covariates by using a recently proposed test statistic. Our study identified a commonly enriched gene set/pathway for the two derived phenotypes we analyzed: the difference between systolic and diastolic blood pressure and mean arterial pressure, but none is identified with the individual blood pressure phenotypes. The gene CD47, in the enriched gene pathway/set, was reported in previous studies to be related to blood pressure. The findings are not surprising because the sample size we use in our analysis is small, and hence power to detect small but important effects is likely inadequate.

Entities:  

Year:  2014        PMID: 25519355      PMCID: PMC4143690          DOI: 10.1186/1753-6561-8-S1-S101

Source DB:  PubMed          Journal:  BMC Proc        ISSN: 1753-6561


Background

Worldwide, hypertension contributes to more than 10 million deaths and it affects one-third of the adult population per year [1]. It was predicted that the incidence of hypertension among adults in 2025 will reach 1.56 billion and contribute to approximately 54% of stroke and 47% of ischemic heart disease. Furthermore, it is a major risk factor for cardiovascular disease [2]. Several factors, including genetic, environmental, and demographic factors, play a major role in the development of hypertension. However, it is believed that 30% to 60% of the variability in blood pressure (BP) is inherited [1]. Many genome-wide association studies (GWAS) have been conducted to identify single-nucleotide polymorphism (SNPs) that are significantly associated with systolic blood pressure (SBP), diastolic blood pressure (DBP), and/or hypertension. Meta-analysis findings of the Global BPgen (Global Blood Pressure Genetics) consortium (n = 34,433) and CHARGE (The Cohorts for Heart and Aging Research in Genome Epidemiology) consortium (n = 29,136) based on populations of European ancestry identified 4 loci significantly associated with SBP (ATP2B1, CYP17A1, PLEKHA7, SH2B3), 6 associated with DBP (ATP2B1, CACNB2, CSK-ULK3, SH2B3, TBX3-TBX5, ULK4), and 1 associated with hypertension (ATP2B1) [1]. However, a genome-wide association study by Adeyemo et al [3] based on a population of African Americans (n = 1017) identified significant loci for SBP in or near the genes PMS1, SLC24A4, YWHA7, IPO7, and CACANA1H, while no significant loci were discovered to be associated with DBP or hypertension. Unlike single-gene analysis, pathway-based approaches consider multiple genes that are related together within gene sets/pathways; these pathways are predefined gene sets from biological databases. The aim of pathway-based approaches is to assess the significance of these sets/pathways by evaluating the enrichment of genes within a pathway at the top of a list of ranked genes [4-6]. Pathway-based analysis was originally applied to gene expression data; however, it has also been applied to GWAS data [4]. In this paper, we use a pathway-based approach based on Gene Set Enrichment Analysis (GSEA) [4]. We consider both rare and common variants and incorporate other covariates, including age, gender, use of antihypertensive medications, and smoking status. Our main focus is to test the effects of both rare and common variants on SBP, DBP, the difference between them (SBP-DBP), and mean arterial pressure (defined as MAP = [⅔ DBP] + [⅓ SBP]) by applying GSEA.

Methods

Phenotype and covariate data description

This data set was provided by the organizers of the Genetic Analysis Workshop 18 (GAW18). From this data set, we considered the 157 unrelated individuals with their phenotypes. Phenotypes were taken at 4 time points and included systolic and DBP measurements and hypertension. The following covariates were also provided: age, smoking status, antihypertensive medications usage, and gender. In our analysis, we used the baseline data; among the 157 unrelated individuals we had 129 individuals who had been genotyped. Table 1 summarizes the data.
Table 1

Descriptive statistics for phenotypes and covariates at baseline for 129 unrelated individuals

VariableSummary measure*
SBP128.4 ± 21.8
DBP71.8 ± 9.2
MAP90.7 ± 11.6
SBP-DBP56.6 ± 19
Hypertension (Yes, No)129 (29.5)
Age52.9 ± 15.6
Sex (Female)129 (60.5)
Medications use (Yes, No)129 (20.2)
Smoking status (Yes, No)129 (24.8)

*Mean ± SD for continuous variables; n(%) for categorical variables.

Descriptive statistics for phenotypes and covariates at baseline for 129 unrelated individuals *Mean ± SD for continuous variables; n(%) for categorical variables.

Genotype data description

Genotype data were provided only for odd-numbered autosomal chromosomes. In this paper, we focus on variants on chromosome 3 (as suggested by the GAW18 organizers to allow comparisons of findings with other GAW18 contributions).

Pathway-based analysis

We considered 4 phenotypes of interest SBP, DBP, SBP-DBP, and MAP and performed pathway-based analysis. We followed the following steps [4,5]:

Step 1: Mapping SNPs to genes

Among the 1,215,296 SNPs on chromosome three, 523,147 SNPs were mapped to 1224 known genes using NBCI2R.

Step 2: Obtaining test statistics for genes

We considered both rare and common variants and other covariates (age, smoking status, medications use, and gender) to assign a test statistic for each gene. VW-TOW (variable weight test for testing the effect of an optimally weighted combination of variants) [8] was used to construct test statistics and their p values. Assume that we have n individuals who have been genotyped at M variants and is the trait of interest for the ith individual. Each individual has a genotypic score T where {0,1,2} denotes the number of copies of the minor allele for the mth variant of the ith individual. We used a minor allele frequency (MAF) threshold of less than 1% to define rare variants. To test the effect of the optimally weighted combination (TOW) of variants , we used the statistic: where are the optimal weights. To test the effect of both rare and common variants, we applied TOW to each of them separately; and denote these statistics, respectively. Then we used the test statistic of VW-TOW: where is the p value of the test and . To evaluate the p value of , we used the permutation test. We incorporated the other covariates T for each individual i , by adjusting and using linear regression: And by using the residuals and , the following TOW and VW-TOW were used:

Step 3: Pathway analysis

We ranked all the genes (N), that had p values and test statistics, based on their statistical significance from the smallest to the largest p values. From step 2, we had (N = 1187) genes for SBP and DBP and (N = 1188) genes for SBP-DBP and MAP. Using the GSEA method [6], we evaluated the significance of predefined gene sets/pathways obtained from online pathway databases (The Molecular Signatures Database) [9]. We used the c2 curated gene sets (v3.1), which are compiled from online pathway databases, publications in PubMed, and knowledge of domain experts [9], which consisted of 4850 gene sets, but we only considered 3638 sets that had at least 1 gene from chromosome 3 and at least 10 genes in total. Of the 3638 pathways, 69.1% have between 1 and 5 genes on chromosome 3, while 24.1% of the pathways have between 6 and 20 genes and 6.8% have between 21 and 140 genes on the same chromosome. Then we calculated the enrichment score (ES) for each set/pathway using a weighted Kolmogorov-Smirnov-like running-sum statistic. This statistic describes the overrepresentation of the genes within the set at the top of the ranked genes. We then adjusted for different sizes of genes using 1000 gene-based permutations (π) and calculated the normalized enrichment score (NES) for each set (S). To estimate the significance level of NES for each set/pathway, we used the gene-based permutation approach to obtain the empirical p values of the NES. We used 1000 gene-set permutations and then we considered the set/pathway to be significantly enriched if its false discovery rate (FDR) q value is less than 0.05. We implemented the analysis using the GSEApreranked tool included in the GSEA software [6,7].

Results

Considering common and rare variants from chromosome 3 with other covariates, and applying GSEA to our data, we ranked the top 10 gene sets/pathways based on their FDR q values for each phenotype. These ranked genes are listed in Tables 2, 3, 4, and 5 for MAP, (SBP-DBP), SBP, and DBP phenotypes, respectively. We found that no gene sets were enriched when we considered SBP or DBP. However, we were able to identify 1 significant enriched gene pathway from c2 curated gene sets (Table 2) with MAP. Interestingly, the same pathway was declared to be significantly enriched with the difference between SBP and DBP phenotype (Table 3). We identified the same gene pathway (Koyama_Sema3B_Targets_DN) in both phenotypes, and this pathway had been shown to be related to different kinds of cancer [10,11]. In this pathway, 12 of 18 genes on chromosome 3 contributed to the enrichment score and the most interesting gene in this pathway is CD47. Several articles [12,13] reported that this gene regulates BP.
Table 2

The top 10 gene sets/pathways from c2 curated gene sets ranked by FDR q values for MAP

Pathway nameNo. genes*ESNESFDR q value
KOYAMA_SEMA3B_TARGETS_DN180.6112.2260.040

HUANG_GATA2_TARGETS_UP110.5591.7690.903

ONO_FOXP3_TARGETS_DN50.7241.7570.929

BENPORATH_ES_240.8591.9380.967

CARDOSO_RESPONSE_TO_GAMMA_RADIATION_AND_3AB30.8651.7700.980

ZHANG_TLX_TARGETS_60HR_UP220.4231.6250.981

LAIHO_COLORECTAL_CANCER_SERRATED_DN10.8911.2020.992

PID_INTEGRIN2_PATHWAY10.9151.2030.992

LIU_TARGETS_OF_VMYB_VS_CMYB_DN50.5011.2030.992

LIM_MAMMARY_STEM_CELL_DN210.3171.2030.992

*Number of genes on chromosome 3.

Table 3

The top 10 gene sets/pathways from c2 curated gene sets ranked by FDR q-values for the difference between SBP and DBP

Pathway nameNo. genes*ESNESFDR q value
KOYAMA_SEMA3B_TARGETS_DN180.6142.2270.042
ZHAN_MULTIPLE_MYELOMA_CD1_AND_CD2_UP40.8301.8670.822
TURASHVILI_BREAST_LOBULAR_CARCINOMA_VS_LOBULAR_NORMAL_UP70.6911.8880.823
BREDEMEYER_RAG_SIGNALING_NOT_VIA_ATM_DN40.8111.8440.871
LI_INDUCED_T_TO_NATURAL_KILLER_DN70.6931.9070.892
BENPORATH_ES_240.8581.9410.917
TONKS_TARGETS_OF_RUNX1_RUNX1T1_FUSION_SUSTAINED_IN_GRANULOCYTE_UP30.8691.7620.958
CHANG_CORE_SERUM_RESPONSE_DN210.2901.0910.976
WONG_ENDMETRIUM_CANCER_UP20.6251.0820.976
NOUZOVA_TRETINOIN_AND_H4_ACETYLATION180.2991.0900.976

*Number of genes on chromosome 3.

Table 4

The top 10 gene sets/pathways from c2 curated gene sets ranked by FDR q values for SBP

Pathway nameNo. genes*ESNESFDR q value
GOTTWEIN_TARGETS_OF_KSHV_MIR_K12_1160.5881.5540.623
SMIRNOV_RESPONSE_TO_IR_6HR_UP60.5831.5540.628
PID_SHP2_PATHWAY30.7381.5420.629
JIANG_VHL_TARGETS60.5881.5500.630
SHEDDEN_LUNG_CANCER_GOOD_SURVIVAL_A12180.4081.5550.631
PLASARI_TGFB1_SIGNALING_VIA_NFIC_10HR_UP50.6281.5430.632
CHEN_PDGF_TARGETS40.6681.5450.632
PID_IGF1_PATHWAY30.7591.5420.633
NAKAMURA_TUMOR_ZONE_PERIPHERAL_VS_CENTRAL_DN320.3491.5550.633
PID_BCR_5PATHWAY30.7641.5570.633

*Number of genes on chromosome 3.

Table 5

The top 10 gene sets/pathways from c2 curated gene sets ranked by FDR q values for DBP

Pathway nameNo. genes*ESNESFDRq value
PHONG_TNF_RESPONSE_VIA_P38_COMPLETE130.5831.7430.580
DELYS_THYROID_CANCER_DN110.6121.7480.599
CORRE_MULTIPLE_MYELOMA_DN30.8971.7330.603
SHEPARD_BMYB_MORPHOLINO_UP100.6091.7120.617
LI_INDUCED_T_TO_NATURAL_KILLER_UP170.5371.7240.617
WILCOX_PRESPONSE_TO_ROGESTERONE_UP60.7051.7000.621
WAMUNYOKOLI_OVARIAN_CANCER_LMP_DN130.5621.6740.632
KEGG_RENIN_ANGIOTENSIN_SYSTEM30.9321.7610.632
OSWALD_HEMATOPOIETIC_STEM_CELL_IN_COLLAGEN_GEL_DN110.6051.7510.633
REACTOME_POST_TRANSLATIONAL_PROTEIN_MODIFICATION160.5351.6850.644

*Number of genes on chromosome 3

The top 10 gene sets/pathways from c2 curated gene sets ranked by FDR q values for MAP *Number of genes on chromosome 3. The top 10 gene sets/pathways from c2 curated gene sets ranked by FDR q-values for the difference between SBP and DBP *Number of genes on chromosome 3. The top 10 gene sets/pathways from c2 curated gene sets ranked by FDR q values for SBP *Number of genes on chromosome 3. The top 10 gene sets/pathways from c2 curated gene sets ranked by FDR q values for DBP *Number of genes on chromosome 3 Because our pathway-based analysis is restricted to genes on chromosome 3, the number of pathways used for analysis exceeded the number of genes, which can have important implications in interpreting our findings. The results from our analyses should be interpreted cautiously.

Conclusions

Gene-set enrichment analysis considers multiple genes that are related biologically. In our data, we identified 1 identical enriched gene set/pathway with the MAP and the difference between SBP and DBP. The gene CD47 in this pathway was reported previously to be related to BP. Our analysis included only 129 unrelated individuals. Sample size plays a major role in identifying enriched gene sets/pathways, which could explain the lack of significant pathways in our analysis. Future studies can be done by applying GSEA on large family-based data where incorporating both rare and common variants, taking into account the correlations between individuals and increasing the sample size, may lead to new discoveries.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

HA designed the overall study; HA and XFL performed all of the data analysis. HA and JB drafted the manuscript and JB conceived the study and provided critical comments. All authors read and approved the final manuscript.
  12 in total

1.  Pathway-based approaches for analysis of genomewide association studies.

Authors:  Kai Wang; Mingyao Li; Maja Bucan
Journal:  Am J Hum Genet       Date:  2007-12       Impact factor: 11.025

2.  The race associated allele of Semaphorin 3B (SEMA3B) T415I and its role in lung cancer in African-Americans and Latino-Americans.

Authors:  Carmen J Marsit; John K Wiencke; Mei Liu; Karl T Kelsey
Journal:  Carcinogenesis       Date:  2005-04-14       Impact factor: 4.944

3.  Genetic variations in CYP17A1, CACNB2 and PLEKHA7 are associated with blood pressure and/or hypertension in She ethnic minority of China.

Authors:  Yinghua Lin; Xiaolan Lai; Bin Chen; Yuan Xu; Baoying Huang; Zichun Chen; Shaoheng Zhu; Jin Yao; Qiqin Jiang; Huibin Huang; Junping Wen; Gang Chen
Journal:  Atherosclerosis       Date:  2011-09-16       Impact factor: 5.162

4.  Thrombospondin-1 supports blood pressure by limiting eNOS activation and endothelial-dependent vasorelaxation.

Authors:  Eileen M Bauer; Yan Qin; Thomas W Miller; Russell W Bandle; Gabor Csanyi; Patrick J Pagano; Philip M Bauer; Jurgen Schnermann; David D Roberts; Jeff S Isenberg
Journal:  Cardiovasc Res       Date:  2010-07-07       Impact factor: 10.787

5.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

6.  Thrombospondin-1 and CD47 regulate blood pressure and cardiac responses to vasoactive stress.

Authors:  Jeff S Isenberg; Yan Qin; Justin B Maxhimer; John M Sipes; Daryl Despres; Jurgen Schnermann; William A Frazier; David D Roberts
Journal:  Matrix Biol       Date:  2009-01-21       Impact factor: 11.583

7.  Genome-wide association study of blood pressure and hypertension.

Authors:  Daniel Levy; Georg B Ehret; Kenneth Rice; Germaine C Verwoert; Lenore J Launer; Abbas Dehghan; Nicole L Glazer; Alanna C Morrison; Andrew D Johnson; Thor Aspelund; Yurii Aulchenko; Thomas Lumley; Anna Köttgen; Ramachandran S Vasan; Fernando Rivadeneira; Gudny Eiriksdottir; Xiuqing Guo; Dan E Arking; Gary F Mitchell; Francesco U S Mattace-Raso; Albert V Smith; Kent Taylor; Robert B Scharpf; Shih-Jen Hwang; Eric J G Sijbrands; Joshua Bis; Tamara B Harris; Santhi K Ganesh; Christopher J O'Donnell; Albert Hofman; Jerome I Rotter; Josef Coresh; Emelia J Benjamin; André G Uitterlinden; Gerardo Heiss; Caroline S Fox; Jacqueline C M Witteman; Eric Boerwinkle; Thomas J Wang; Vilmundur Gudnason; Martin G Larson; Aravinda Chakravarti; Bruce M Psaty; Cornelia M van Duijn
Journal:  Nat Genet       Date:  2009-05-10       Impact factor: 38.330

8.  PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes.

Authors:  Vamsi K Mootha; Cecilia M Lindgren; Karl-Fredrik Eriksson; Aravind Subramanian; Smita Sihag; Joseph Lehar; Pere Puigserver; Emma Carlsson; Martin Ridderstråle; Esa Laurila; Nicholas Houstis; Mark J Daly; Nick Patterson; Jill P Mesirov; Todd R Golub; Pablo Tamayo; Bruce Spiegelman; Eric S Lander; Joel N Hirschhorn; David Altshuler; Leif C Groop
Journal:  Nat Genet       Date:  2003-07       Impact factor: 38.330

9.  A genome-wide association study of hypertension and blood pressure in African Americans.

Authors:  Adebowale Adeyemo; Norman Gerry; Guanjie Chen; Alan Herbert; Ayo Doumatey; Hanxia Huang; Jie Zhou; Kerrie Lashley; Yuanxiu Chen; Michael Christman; Charles Rotimi
Journal:  PLoS Genet       Date:  2009-07-17       Impact factor: 5.917

10.  Pathway-based analysis of a genome-wide case-control association study of rheumatoid arthritis.

Authors:  Joseph Beyene; Pingzhao Hu; Jemila S Hamid; Elena Parkhomenko; Andrew D Paterson; David Tritchler
Journal:  BMC Proc       Date:  2009-12-15
View more
  2 in total

1.  Pathway analysis approaches for rare and common variants: insights from Genetic Analysis Workshop 18.

Authors:  Stella Aslibekyan; Marcio Almeida; Nathan Tintle
Journal:  Genet Epidemiol       Date:  2014-09       Impact factor: 2.135

2.  Association of polymorphisms in the aldosterone-regulated sodium reabsorption pathway with blood pressure among Hispanics.

Authors:  Bamidele O Tayo; Liping Tong; Richard S Cooper
Journal:  BMC Proc       Date:  2016-10-18
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.