Literature DB >> 29253858

A family-based genome-wide association study of chronic rhinosinusitis with nasal polyps implicates several genes in the disease pathogenesis.

Anton Bohman1, Julius Juodakis2, Martin Oscarsson3, Jonas Bacelis2, Mats Bende3, Åsa Torinsson Naluai4.   

Abstract

BACKGROUND: The pathogenesis of chronic rhinosinusitis with nasal polyps is largely unknown. Previous studies have given valuable information about genetic variants associated with this disease but much is still unexplained. Our goal was to identify genetic markers and genes associated with susceptibility to chronic rhinosinusitis with nasal polyps using a family-based genome-wide association study.
METHODS: 427 patients (293 males and 134 females) with CRSwNP and 393 controls (175 males and 218 females) were recruited from several Swedish hospitals. SNP association values were generated using DFAM (implemented in PLINK) and Efficient Mixed Model Association eXpedited (EMMAX). Analyses of pathway enrichment, gene expression levels and expression quantitative trait loci were then performed in turn.
RESULTS: None of the analysed SNPs reached genome wide significant association of 5.0 x 10-8. Pathway analyses using our top 1000 markers with the most significant association p-values resulted in 138 target genes. A comparison between our target genes and gene expression data from the NCBI Gene Expression Omnibus database showed significant overlap for 36 of these genes. Comparisons with data from expression quantitative trait loci showed the most skewed allelic distributions in cases with chronic rhinosinusitis with nasal polyps compared with controls for the genes HLCS, HLA-DRA, BICD2, VSIR and SLC5A1.
CONCLUSION: Our study indicates that HLCS, HLA-DRA, BICD2, VSIR and SLC5A1 could be involved in the pathogenesis of chronic rhinosinusitis with nasal polyps. HLA-DRA has been associated with chronic rhinosinusitis with nasal polyps in previous studies and HLCS, BICD2, VSIR and SLC5A1 may be new targets for future research.

Entities:  

Mesh:

Year:  2017        PMID: 29253858      PMCID: PMC5734761          DOI: 10.1371/journal.pone.0185244

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Chronic rhinosinusitis (CRS) as defined by the European Position Paper on Rhinosinusitis and Nasal Polyps (EPOS) [1] is classified into chronic rhinosinusitis with nasal polyps (CRSwNP) and chronic rhinosinusitis without nasal polyps (CRSsNP). CRSwNP is a disease characterized by benign outgrowths from the middle meatus of the nasal cavity and chronic sinonasal inflammation. CRSwNP is a common chronic disease and depending on the geographical area, 2–4% of the population is afflicted [2-4]. The disease causes individual suffering and a decreased quality of life [5,6]. Risk factors include asthma, male sex and increasing age. The disease often requires a combination of surgical and medical treatment. However, CRSwNP often recurs even after therapy. The aetiology of the disease is unknown. Several environmental factors have been suggested and previous studies have also shown an increased prevalence among relatives [7,8] and a higher rate of positive family history of CRSwNP among those affected [9-11], confirming a genetic susceptibility to the disease. Compared to the general population, having an afflicted family member increases the risk of disease five times [7]. Genetic studies on patients with CRSwNP could help to explain the pathogenesis of the disease and over time identify new drug targets leading to a more effective, individually tailored, therapy. Genetic association can be explored using candidate gene studies or genome-wide association studies (GWAS). Candidate gene studies usually investigate a small number of single-nucleotide polymorphisms (SNPs) or other types of genetic variation, in order to find or reject associations between the genetic variants and the disease in question. These studies rely on previous knowledge and hypotheses regarding which SNPs to suspect and investigate. In comparison, a GWAS investigates hundreds of thousands of SNPs across the whole genome and is therefore not reliant on previous knowledge or hypotheses regarding the pathogenesis of the investigated disease or trait. A large number of GWASs have been performed for various complex diseases such as diabetes and asthma which has led to the finding of novel genetic pathways [12]. There is currently no published GWAS performed only on patients with CRSwNP but there is a pooling-based GWAS done on patients with CRS (both CRSsNP and CRSwNP) [13] as well as several studies of candidate genes [14]. These studies have implicated several genes and pathways such as the cystic fibrosis transmembrane conductance regulator gene (CFTR) [15,16] and, among others, genes involved in immunity [17-20], inflammation [21,22], tissue remodelling [20] and arachidonic acid metabolism [23] Even though GWA studies have been successful in detecting genetic variants associated with many common diseases, the inability to explain most of the estimated heritability makes linkage analysis an alternative to detect possible rare variants. To this date, no published linkage analysis have been performed on subjects with CRSwNP. However, one such study has been performed on 8 subjects with CRS (not specified whether any of them had CRSwNP) which found a linkage signal on chromosome 7q [24]. A combination of a GWAS and linkage analysis such as a family-based GWAS could be a more potent way of identifying both common and rare variants associated with CRSwNP [25].

Aim

The aim of this study is to identify SNPs and genes associated with CRSwNP susceptibility using a family-based approach.

Materials and methods

Material

367 patients with CRSwNP (250 men and 117 women, mean age 52.3 years) from three Swedish ear, nose and throat clinics were recruited. These subjects were all known patients at their respective clinics, all of them fulfilled the EPOS criteria for CRSwNP [1] and had at least intermittently been on either intranasal or systemic corticosteroids, most of them had undergone at least one operation for the condition. In order to increase power level for genome-wide analysis, patients with associated diseases such as asthma or aspirin-exacerbated respiratory disease (AERD) were not excluded. A total of 453 first-degree relatives (218 men, 235 women, mean age 49.4 years) were also recruited. The study was carried out in accordance with the Declaration of Helsinki and was approved by the Ethics Committee at the University of Gothenburg, Sweden. Written consent was obtained from all participants.

Investigation

Nasal endoscopy was performed on all participants using a 2.7 mm rigid endoscope (KARL STORZ) and the participants were subsequently phenotyped as either having CRSwNP or being free from this disease. Additional data about asthma and corticosteroid medication (used to counter symptoms from either the upper or lower airways) was obtained via a structured interview. Peripheral blood samples were collected from each individual. DNA was extracted from whole blood using an in-house protocol at KBiosciences (LGC Genomics, Hoddeston UK). The HiSeq Illumina platform was used for genotyping. 144 of the samples were run on Illumina Omni Express bead chips and the remaining 676 on the Illumina Core Exome array.

Quality control

Individuals were removed if they showed >2% missing calls (all samples passed), heterozygosity >3 SDs above or below mean, >100 Mendelian errors). Markers with >2% missing calls, >3 Mendelian errors or those with minor allele frequency = 0 (i.e. monomorphic loci) were excluded. SNPs with low linkage disequilibrium among each other (LD) (r2<0.2) were selected for population structure analysis. As a reference, samples from the 1000 Genomes project, Phase 3 were retrieved. Principal component analysis was then performed using GCTA, a genome-wide complex trait analysis software [26], and the first three principal components were investigated. As a measure of non-European admixture in each sample, we calculated the Euclidean distance E from that sample to the mass centre of the CEU reference group. Individuals with E > 5 SD were removed (2 samples). Only autosomal markers shared in both genotyping platforms were retained. Finally, principal components analysis was performed to check for batch effects. Visual inspection of sample scores along the first three principal components showed no differences between batches.

Association testing

Two methods were used to generate SNP association values. First, we used DFAM, implemented in PLINK, which combines the transmission disequilibrium test (TDT), the sibling TDT and an allelic test for unrelated cases and controls in a single Cochran-Mantel-Haenszel test for each marker [27]. The second method was EMMAX (Efficient Mixed Model Association eXpedited), implemented in Golden Helix SNP & Variation Suite v8.3.4 [28]. This method involves computing an empirical relatedness matrix of the samples, and using this relatedness as a covariate in linear regression for each marker [29]. We performed this test using additive, dominant and recessive models, and the smallest p-value from the three models was assigned to each SNP. A commonly used level of significance in conventional GWA-studies on unrelated subjects is 5x10-8 [30] but many variants that do not reach this level of significance can still be true disease-influencing variation. The decision was therefore made to perform post-GWAS analyses for the 1000 SNPs with the lowest p-values for each method rather than adhering to a strict threshold for the level of significance.

Pathway enrichment analysis

The top 1000 markers with the most significant association p-values were combined into intervals of SNPs in high LD (defined by pairwise r2>0.25). This process was performed separately for DFAM and EMMAX association results. INRICH software was then used to detect possible pathway enrichment within these regions [31]. To calculate the significance of such overlaps, the process was repeated 50,000 times with random genomic regions, however matched in size and SNP density. INRICH analysis was performed separately using DFAM or EMMAX results and Gene Ontology (GO) or Kyoto Encyclopedia of Genes and Genomes (KEGG)-based gene-sets. The 20 gene-sets with the highest enrichment p-values were retrieved from each of these setups. INRICH produced a list of genes which were located close to the top GWAS ‘hits’ in the genome and that share functional annotations. All genes retrieved in this way from the four INRICH analyses (DFAM+GO, EMMAX+GO, DFAM+KEGG, EMMAX+KEGG) were combined together, creating a list of target NP genes implicated in this study.

Gene expression data

Publicly-available gene expression data, collected by Plager et al. [32], was retrieved from NCBI Gene Expression Omnibus (GEO) database ([33]; series accession number GSE23552). Per authors’ recommendation, two samples (aCRSm1 and aCRSm2) were excluded, leaving 20 case samples (all from patients with CRSwNP) and 17 control samples from either allergic rhinitis patients or healthy individuals. Expression levels between the case and control groups were compared using the GEO2R interface. All genes corresponding to probes with significant difference in expression levels (Benjamini-Hochberg FDR < 0.05) comprise the differentially-expressed gene set.

Expression quantitative trait loci (eQTL) analysis

Two datasets were used for eQTL analysis: Blood eQTL from Westra et al. [34] and Multiple Tissue Human Expression Resource (MuTHER) project [35]. These datasets are produced by microarray genotyping and expression profiling of selected tissue samples. SNP variations are then associated with gene expression patterns, resulting in a list of regulatory SNPs for each gene. In MuTHER project, the regulatory effects of each SNP were determined in adipose, skin tissues and lymphoblastoid cell lines (LCL). For each SNP-gene pair we have retained either LCL or skin data, corresponding to the tissue with more significant regulatory effect (i.e. lower p-value in skin samples meant that LCL data was discarded for that SNP-gene pair). We also excluded all SNPs with p-values > 0.05 or absolute effect size (regression coefficient β) of < 0.01. FDR of 0.5 was used as a cut-off for the Blood dataset, with no additional limits on effect size. To check for directed eQTL enrichment, all eQTL SNPs for each gene of interest were extracted and classified according to the direction of their regulatory effect (up-regulating or down-regulating). The frequency of the allele bearing the reported regulatory effect was then determined in our GWAS cases and controls using PLINK [27]. The marker was then assigned to a bin depending on whether the regulatory allele shows higher frequency in cases or in controls. In this way, a 2x2 contingency table was constructed for each gene, where all SNPs fall into one of four bins (up-regulating + less frequent in cases; up-regulating + more frequent in cases; down-regulating + less frequent in cases; down-regulating + more frequent in cases). Fisher’s test was used to test whether the regulatory effect and frequency difference are dependent. However, Fisher’s test does not account for the effect of LD between SNPs. Therefore, the empirical significance was calculated using an iterative procedure. Genes were ordered according to the number of eQTL SNPs remaining after all filters; genes found in the differentially-expressed NP set (as described in the previous section) were removed; for each gene of interest with n SNPs, 500 genes with the same number n of SNPs are retrieved; if less than 500 genes have the required number of SNPs, genes with n+1 (then n+2, n+3…) SNPs are also retrieved, and n SNPs are randomly selected for analysis in those genes. Each gene is analysed in the same manner as the target gene. Resulting empirical distribution of p-values is used to determine the empirical significance for the gene of interest. The workflow of the analysis is shown in Fig 1.
Fig 1

Workflow of analysis.

A. Initially, SNP association p-values are produced by an association test. Based on these values, top 1000 SNPs are selected and annotated to nearby genes. B. INRICH software is then used to detect over-represented gene-sets (empty circles denote all genes that were not detected by GWAS). Genes from top 20 such sets are retrieved, and only the ones overlapping with GWAS hits are analysed further. C. Using publicly available expression data from NP samples, this gene list is filtered to retain only differentially expressed genes. D. For each of these remaining targets, known eQTLs are assigned into bins based on effect direction (up- or down- regulating) and frequency distribution in our genotyping data (higher frequency in cases or in controls). Fisher's exact test is used to evaluate the observed distribution. 500 genes with equal or higher count of eQTLs are analysed in the same way, and statistic values generated from this control set are then compared with the target gene statistic to estimate the empirical significance.

Workflow of analysis.

A. Initially, SNP association p-values are produced by an association test. Based on these values, top 1000 SNPs are selected and annotated to nearby genes. B. INRICH software is then used to detect over-represented gene-sets (empty circles denote all genes that were not detected by GWAS). Genes from top 20 such sets are retrieved, and only the ones overlapping with GWAS hits are analysed further. C. Using publicly available expression data from NP samples, this gene list is filtered to retain only differentially expressed genes. D. For each of these remaining targets, known eQTLs are assigned into bins based on effect direction (up- or down- regulating) and frequency distribution in our genotyping data (higher frequency in cases or in controls). Fisher's exact test is used to evaluate the observed distribution. 500 genes with equal or higher count of eQTLs are analysed in the same way, and statistic values generated from this control set are then compared with the target gene statistic to estimate the empirical significance.

Results

Genetic analysis

Six samples were removed due to heterozygosity > 3 SDs above or below mean, three samples were removed due to > 100 Mendelian errors and one sample due to a mismatch between genotyped and indicated sex. The final dataset after quality-control consists of 782 individuals and 233 409 SNPs. Additional data about the subjects who passed quality control are presented in Table 1. Of the 406 individuals with nasal polyps, 350 were index patients with CRSwNP, 22 of the non-index patients knew they had polyps beforehand and 34 had newly discovered polyps.
Table 1

Descriptive statistics of the participants who passed quality control.

Nasal polyps n = 406Mean age 56.42, SD 14.3Unaffected subjects n = 376Mean age 47.85, SD 17.0p-value
Sex275 male, 131 female169 male, 207 female<0.0011
Asthma209 (51.5%)65 (17.3%)<0.0011
Use of corticosteroids for nasal symptoms261 (64.3%)24 (6.4%)<0.0011
Use of corticosteroids for pulmonary symptoms160 (39.4%)30 (8%)<0.0011

1 Comparison of the prevalence/proportion between the groups by Chi-2 test.

1 Comparison of the prevalence/proportion between the groups by Chi-2 test. The Manhattan plot from the DFAM analysis is provided in Fig 2 and the Manhattan plot from the EMMAX analysis in Fig 3. Table 2 lists the 30 SNPs with the strongest associations from the DFAM analysis and Table 3 lists the 30 SNPs with the strongest association values from the EMMAX analysis. None of tested SNPs reached a significance of 5x10-8, the top ranking SNP from the DFAM analysis was rs4629180 with a p-value of 1.47x10-6, the top ranking SNP from the EMMAX analysis was rs2491026 with a p-value of 0.00014.
Fig 2

Manhattan plot from the DFAM analysis.

Fig 3

Manhattan plot from the EMMAX analysis.

Table 2

Top 30 associated SNPs from DFAM analysis.

SNPCHRBPA1A2pOBSEXP
rs46291802102088370GA1.47E-06107125.1
rs2274576165469641GT1.94E-06140159.8
rs48514552102083183TC1.28E-05124141.3
rs1079044311121234131CA1.37E-05154137.2
rs45319002173125108CT2.01E-05199181.2
rs79413801123660635CT2.61E-05110125.9
rs79328071116438411GA2.66E-05176157.9
rs96546997157512750GA3.13E-05114129.5
rs3437911111301623TC3.37E-05147162.6
rs7445642218588959AC3.45E-056351.7
rs1326917932717956AG3.45E-05155172.6
rs23996851193295334CT4.82E-05175158.7
rs7299111121243716CT6.36E-05133118.3
rs100865068116988947GT6.57E-05182165.8
rs205226689743077CT6.67E-05163179.8
rs1275928226865646CA6.75E-05171188.8
rs102539097157516956CA6.96E-05159174.5
rs921705233474645CT7.11E-05144161.1
rs3016621361446809TG7.96E-054231.83
rs95483741339077255TC8.27E-05135151.9
rs65636181339021562AG9.35E-05134150.2
rs22230638117050089CT9.49E-05158143.4
rs1057713996714161GA1.03E-04147164.1
rs80965981828584774CA1.06E-042432.5
rs93647886165458279CA1.06E-04105120.7
rs100656555101192693CT1.14E-04133118.9
rs120532732223106526AG1.18E-0410388.87
rs1075019111121239305GA1.21E-04130115.8
rs1050722112108373750CT1.21E-048775.17
rs20163942172972971AG1.32E-04192208.4

CHR: Chromosome

BP: Base-pair, position of the SNP on the chromosome

A1: Allele 1

A2: Allele 2

p: p-value

OBS: Observed number of CRSwNP patients with allele 1

EXP: Expected number of CRSwNP patients with allele 1 under the assumption of random inheritance of alleles.

Beta: Effect size

Model: The model of inheritance that produced the smallest p-value in the analysis; ADD = additive, DOM = dominant, REC = recessive

Table 3

Top 30 associated SNPs from EMMAX analysis.

SNPCHRBPA1A2pBetaModel
rs24910261070802309AG0.00014-0.40REC
rs382431086386250CT0.00014-0.22REC
rs71277637463316TC0.000150.26REC
rs77139835177690978TC0.000150.24REC
rs7998558117265106CT0.000160.44REC
rs1671389813181518AC0.000170.49REC
rs967037013105738427TC0.000210.40REC
rs2463341623738355CT0.000210.14DOM
rs23664083159696099TG0.000220.10ADD
rs108973161162789763GA0.00023-0.16REC
rs65016581771661629CT0.000230.11ADD
rs98217813149362405GA0.00024-0.14DOM
rs7459662829682635CA0.00024-0.21REC
rs294482558845915GT0.000240.37REC
rs13727711867153636GA0.000250.29REC
rs6198783109677974GA0.00027-0.28REC
rs19414761837686439CT0.000290.13DOM
rs660895632577380GA0.000290.23REC
rs98601433109708722AG0.00030-0.16ADD
rs69207976166237461GT0.000330.18REC
rs125841121389159334TC0.000340.13DOM
rs1127898333186356CT0.000340.11ADD
rs8703911835885805TC0.000370.22REC
rs70863931058250470GA0.000370.16REC
rs10957577872919087GA0.000370.18REC
rs355687478505578CT0.00037-0.13DOM
rs22116782133339340TC0.00038-0.23REC
rs12736883131241312AG0.00039-0.14DOM

CHR: Chromosome

BP: Base-pair, position of the SNP on the chromosome

A1: Allele 1

A2: Allele 2

p: p-value

OBS: Observed number of CRSwNP patients with allele 1

EXP: Expected number of CRSwNP patients with allele 1 under the assumption of random inheritance of alleles.

Beta: Effect size

Model: The model of inheritance that produced the smallest p-value in the analysis; ADD = additive, DOM = dominant, REC = recessive

CHR: Chromosome BP: Base-pair, position of the SNP on the chromosome A1: Allele 1 A2: Allele 2 p: p-value OBS: Observed number of CRSwNP patients with allele 1 EXP: Expected number of CRSwNP patients with allele 1 under the assumption of random inheritance of alleles. Beta: Effect size Model: The model of inheritance that produced the smallest p-value in the analysis; ADD = additive, DOM = dominant, REC = recessive CHR: Chromosome BP: Base-pair, position of the SNP on the chromosome A1: Allele 1 A2: Allele 2 p: p-value OBS: Observed number of CRSwNP patients with allele 1 EXP: Expected number of CRSwNP patients with allele 1 under the assumption of random inheritance of alleles. Beta: Effect size Model: The model of inheritance that produced the smallest p-value in the analysis; ADD = additive, DOM = dominant, REC = recessive From the pathway enrichment analysis we extracted the top 20 gene-sets from each of the four INRICH analyses resulting in a combined list of 138 target CRSwNP genes from this study (S1 Table). Out of our 138 target genes from the INRICH analysis, 36 genes showed a significant difference in mRNA expression levels between nasal polyp tissue and normal tissue (from Plager et al. [32]) (Table 4)
Table 4

Overlap between genes from INRICH analysis and expression data (from Plager et al. [32]).

NameENTREZ IDSourceProbe IDAdjusted pLog2FCCHRFC
IL2RA3559GO + EMMAX32757293.87e-092.25104.75
PDGFD80310GO + EMMAX33890771.26e-050.83111.78
P4HA15033GO + DFAM32941590.00350.74101.67
HLA-DRA3122KEGG + EMMAX29031890.00820.6961.61
LAMB33914KEGG + EMMAX24537930.00840.6311.55
VSIR64115GO + DFAM32937240.00020.63101.55
TIAM17074KEGG + DFAM39286680.00620.44211.36
E2F31871KEGG + EMMAX28975760.00600.4461.36
SIPA16494GO + DFAM33354650.00270.42111.34
LYN4067KEGG + DFAM30989770.00990.4081.32
CACNA1D776KEGG + DFAM26243850.0140.3731.30
CPLX210814GO + DFAM28422550.00910.3551.28
COL16A11307GO + EMMAX24045460.00710.3011.23
PPARD5467GO + EMMAX29045970.00320.2961.23
BICD223299GO + EMMAX32149840.0300.2591.19
STXBP16812GO + DFAM31899320.031-0.2890.83
DOCK11793KEGG + DFAM32699390.016-0.29100.82
GNG1255970KEGG + DFAM24172720.037-0.3410.79
CAPN2824GO + EMMAX23821170.032-0.3510.79
USP138975GO + DFAM26540910.015-0.3530.79
ADCY1107KEGG + EMMAX30003420.0004-0.3770.78
NMD351068KEGG + EMMAX26505380.032-0.3730.78
ANK2287GO + DFAM27400670.0026-0.3740.77
OGFOD155239GO + DFAM36929280.022-0.38160.77
NDUFS54725KEGG + EMMAX23311780.019-0.4310.74
CPEB322849GO + EMMAX33002422.22e-05-0.49100.71
LRP54041GO + EMMAX33375160.0006-0.50110.71
HLCS3141KEGG + DFAM39311121.09e-05-0.51210.70
MYL910398KEGG + EMMAX38839210.0034-0.58200.70
PLCB123236KEGG + EMMAX38756420.0013-0.67200.63
FERMT210979GO + EMMAX35649190.0002-0.70140.62
CHRM31131GO + EMMAX23876060.0002-0.7310.60
LEPR3953GO + EMMAX23404330.0022-0.7610.59
PDE3A5139GO + EMMAX34074531.04e-06-1.16120.45
LYZ4069KEGG + EMMAX34215111.16e-06-1.46120.36
SLC5A16523KEGG + DFAM39432344.32e-05-1.62220.33

Name: HUGO gene ID

ENTREZ ID: gene ID from ENTREZ

Source: combination of pathways and methods that implicated this gene, GO = Gene Ontology, KEGG = Kyoto Encyclopedia of Genes and Genomes

Probe ID: ID of the corresponding probe from the expression dataset

Adjusted p: FDR adjustment

Log2FC: log2 fold change; 0 means no change, positive means up-regulation in NP samples, negative means down-regulation

CHR: chromosome

FC: fold change (1 means no change)

Name: HUGO gene ID ENTREZ ID: gene ID from ENTREZ Source: combination of pathways and methods that implicated this gene, GO = Gene Ontology, KEGG = Kyoto Encyclopedia of Genes and Genomes Probe ID: ID of the corresponding probe from the expression dataset Adjusted p: FDR adjustment Log2FC: log2 fold change; 0 means no change, positive means up-regulation in NP samples, negative means down-regulation CHR: chromosome FC: fold change (1 means no change) Finally, the eQTL analysis for the Blood eQTL dataset showed significantly skewed distributions of eQTLs in cases with CRSwNP compared to controls for HLCS (empirical p-value 0.014) HLA-DRA (empirical p-value 0.02) and BICD2 (empirical p 0.046) (Table 5). The same analysis performed with MuTHER eQTL dataset showed significantly skewed eQTL distribution in cases for VSIR (empirical p-value 0.006), HLCS (empirical p-value 0.014) and BICD2 (empirical p-value 0.016). SLC5A1 also had a skewed distribution with an empirical p-value of 0.052, these results are provided in Table 6.
Table 5

Results from the eQTL analysis, Blood dataset.

NamepEmpirical pLog2FCSNPsUp +Up -Down +Down -
HLCS0.000680.014-0.512211740
HLA-DRA0.00140.0200.692348110
BICD20.00820.0460.241601042
PDGFD0.00650.0660.831802160
NDUFS50.00980.080-0.43208138
TIAM10.0480.110.44105014
VSIR0.0270.160.632471511
LYZ0.0680.27-1.463241486
LEPR0.360.52-0.76441313612
OGFOD110.76-0.13132245
SIPA110.830.422613913

Name: HUGO gene ID

p: unadjusted p-value, produced by Fisher's test for the target gene

Empirical p: p-value, calculated from the cumulative distribution of Fisher's test p-values for similar genes

Log2FC: log2 fold change; 0 means no change, positive means up-regulation in NP samples, negative means down-regulation

SNPs: number of eQTL SNPs tested in this analysis

Up +, Up -, Down + and Down—form the 2x2 contingency table for Fisher's test: Up +: number of up-regulating SNPs that show increased frequency in cases

Up -: number of up-regulating SNPs that show similar or decreased frequency in cases

Down +: number of down-regulating SNPs that show increased frequency in cases

Down -: number of down-regulating SNPs that show similar or decreased frequency in cases

Table 6

Results from the eQTL analysis, MuTHER dataset.

NamepEmpirical pLog2FCSNPsUp +Up -Down +Down -
VSIR2.51e-050.0060.6351251510
HLCS3.88e-050.014-0.51581123213
BICD20.00160.0170.2433314124
SLC5A10.00550.052-1.622535161
PDGFD0.00670.0560.8335512144
CAPN20.100.11-0.35132461
LYN0.0270.140.402941294
PDE3A0.140.16-1.57141553
OGFOD10.0340.17-0.13601316238
PLCB10.0550.21-0.6740156712
FERMT20.110.25-0.702648104
ADCY10.160.34-0.37521811914
HLA-DRA0.320.370.69188244
TIAM10.230.410.4446189910
STXBP10.390.41-0.28223865
LRP50.310.42-0.5035411911
NDUFS50.390.44-0.432411364
IL2RA0.400.492.255116101213
ANK20.460.59-0.373061086
COL16A10.670.660.30237736
LAMB30.540.680.63431261411
CPEB30.720.71-0.4939161256
GNG1210.73-0.34161348
CACNA1D10.760.37233569
CPLX20.700.790.17268846
DOCK110.79-0.293477119
PPARD0.750.810.29391181010
LEPR10.88-0.767418191720

Name: HUGO gene ID

p: unadjusted p-value, produced by Fisher's test for the target gene

Empirical p: p-value, calculated from the cumulative distribution of Fisher's test p-values for similar genes

Log2FC: log2 fold change; 0 means no change, positive means up-regulation in NP samples, negative means down-regulation

SNPs: number of eQTL SNPs tested in this analysis

Up +, Up -, Down + and Down—form the 2x2 contingency table for Fisher's test:

Up +: number of up-regulating SNPs that show increased frequency in cases

Up -: number of up-regulating SNPs that show similar or decreased frequency in cases

Down +: number of down-regulating SNPs that show increased frequency in cases

Down -: number of down-regulating SNPs that show similar or decreased frequency in cases

Name: HUGO gene ID p: unadjusted p-value, produced by Fisher's test for the target gene Empirical p: p-value, calculated from the cumulative distribution of Fisher's test p-values for similar genes Log2FC: log2 fold change; 0 means no change, positive means up-regulation in NP samples, negative means down-regulation SNPs: number of eQTL SNPs tested in this analysis Up +, Up -, Down + and Down—form the 2x2 contingency table for Fisher's test: Up +: number of up-regulating SNPs that show increased frequency in cases Up -: number of up-regulating SNPs that show similar or decreased frequency in cases Down +: number of down-regulating SNPs that show increased frequency in cases Down -: number of down-regulating SNPs that show similar or decreased frequency in cases Name: HUGO gene ID p: unadjusted p-value, produced by Fisher's test for the target gene Empirical p: p-value, calculated from the cumulative distribution of Fisher's test p-values for similar genes Log2FC: log2 fold change; 0 means no change, positive means up-regulation in NP samples, negative means down-regulation SNPs: number of eQTL SNPs tested in this analysis Up +, Up -, Down + and Down—form the 2x2 contingency table for Fisher's test: Up +: number of up-regulating SNPs that show increased frequency in cases Up -: number of up-regulating SNPs that show similar or decreased frequency in cases Down +: number of down-regulating SNPs that show increased frequency in cases Down -: number of down-regulating SNPs that show similar or decreased frequency in cases

Discussion

None of the SNPs in this study reached the suggested genome-wide significance level of 5x10-8. However, by using pathway enrichment and post-GWAS analyses, we identified five interesting genes that could be involved in the pathogenesis of CRSwNP in this study: HLCS, HLA-DRA, BICD2, VSIR and SLC5A1. Of these, only HLA-DRA has been presented in previous studies on subjects with CRSwNP [36]. The present study is the first GWAS performed only on subjects with CRSwNP. Using family relationship data and non-transmitted genetic variation for control, as in the TDT, is both a strength and a potential weakness. In most chromosomal regions, we expect the related individuals to be more similar compared with completely unrelated controls. Therefore, when there are differences between related individuals these are more likely to be due to the disease than to general differences in a population (population stratification). In practice this is a strength because it would be expected to lead to less false positive results. However, due to the increased risk of CRSwNP among relatives [7] and the increased prevalence of polyps with higher age [4], some of the relatives who we have defined as not having polyps could develop CRSwNP later in life and therefore be falsely classified as controls in this study. This could possibly have led to missed markers and genes of potential importance. However, most of the relatives in this study are middle-aged or older (mean age 49.4 years) and the prevalence of nasal polyps among them is 13% which makes it unlikely that more than a few percent of the relatives are falsely classified as phenotype negative. One could also argue that there could be subjects with asymptomatic polyps among the 55 relatives who had nasal polyps during endoscopy. However, 21 of them knew they had polyps beforehand and only 34 were unaware of this. The heritability of CRSwNP makes it much more likely that first-degree relatives of patients with CRSwNP would inherit variants associated with CRSwNP than inherit variants associated with asymptomatic nasal polyps. In the event that there are many relatives with polyps but without the predisposition to develop CRS it would indeed influence the GWAS findings, however excluding relatives with polyps would most likely influence the result in a more negative way and severely limit the study. The HLCS gene was the most significant gene from the eQTL analysis in the Blood dataset and the second most significant in the MuTHER dataset. HLCS is under-expressed in nasal polyp tissue with significantly increased frequency of down-regulating alleles among cases in the eQTL analysis from both the MuTHER and the Blood datasets. This gene encodes the enzyme Holocarboxylase synthetase, which is important for biotin metabolism. Holocarboxylase synthetase itself has not been implicated in CRSwNP but a study from 2013 showed that the enzyme catalyses biotinylation of heat shock protein 72 thereby inducing the expression of the gene RANTES (regulated on activation normal T-expressed and presumably secreted) [37]. RANTES is implicated in multiple studies of CRSwNP; for example a study by Chao et al. found a positive correlation between plasma RANTES protein levels and severity of disease among patients with CRSwNP[38]. RANTES protein has also been detected in nasal polyps using immunological staining [39]. Although it seems counter-intuitive that HLCS is under-expressed when RANTES protein levels are higher in polyp tissue compared with controls, the up-regulation of RANTES might be a counter reaction to initially too low levels during an early disease phase. Further studies are needed to increase the knowledge about the role of HLCS in the pathogenesis of CRSwNP. HLA-DRA is over-expressed in polyp tissue and has a significantly skewed distribution of eQTLs from the Blood dataset where cases have an increased number of down-regulating alleles. HLA-DRA is one of the major histocompatibility complex, class II genes. Polymorphisms in this gene have been associated with the presence of nasal polyps in asthmatic patients [36]. Additionally, polymorphisms in other HLA class II genes have been linked to CRSwNP [18,40]. Using HLA typing on a series of 29 patients with nasal polyps, with or without asthma, Moloney and Oliver found a significant increase in the haplotype AI/B8 in patients with both nasal polyps and asthma[41]. BICD2 is over-expressed in nasal polyps and up-regulating alleles more common in controls and down regulating slightly more common in cases. The gene product is bicaudal D homolog 2, which has been shown to induce microtubule movement [42]. It is also linked to dominant congenital spinal muscular atrophy [43]. The relationship between gene-expression and eQTLs is reversed for HLA-DRA and BICD2 where cases have an increased number of down-regulating eQTLs even though the gene is over-expressed in polyp tissue. Over-production in the diseased state could possibly be the result of the body compensating for an under-production in the pre-disease state, which hypothetically could have contributed to the development of the disease. VSIR (V-set immunoregulatory receptor) is over-expressed in polyp tissue and up-regulating alleles from the MuTHER dataset are more common in our cases with CRSwNP compared with unaffected individuals. The gene codes for the protein V-type immunoglobulin domain-containing suppressor of T-cell activation, a member of the Ig superfamily. An experimental study has suggested that it could facilitate tumour invasiveness by regulating cell surface membrane-type 1 matrix metalloproteinase [44]. Lines et al. published an article showing that it also acts a negative checkpoint regulator that suppresses T cell activation [45]. It has not been implicated in CRSwNP in previous studies. Even though the empirical p-value is slightly higher than 0.05 (empirical p-value 0.052) this study also implicates SLC5A1 as borderline significant. SLC5A1 is under-expressed in polyp tissue and also has an increased frequency of down-regulating alleles in our CRSwNP cases. The gene-product, solute carrier family 5 (sodium/glucose cotransporter) member 1 (SGLT1), is part of a family of sodium-dependent glucose transporters. Once again, this gene has not been associated with CRSwNP but one article suggests a positive substrate cross-regulation of SGLT1 and CFTR [46]. The CFTR gene is highly associated with cystic fibrosis, which often has CRSwNP as one of its clinical features [47]. Furthermore, a study from Varon et al. found an association between CRSwNP (without any other clinical features of cystic fibrosis) and mutations in the CFTR locus [48]. In order to reach a power level necessary for genome-wide analysis we decided to include all patients with CRSwNP regardless of any other diseases which could be associated to this condition. Another reason for this is that the only largely accepted subgrouping of CRS is the division into either CRSsNP or CRSwNP. This is in all likelihood a gross simplification of the pathophysiological and genetic mechanisms behind these conditions and CRS as defined by EPOS is probably a result of a large number of different sub-diseases, each with their own genetic and/or environmental background of which little is known at the present. In this study we did not exclude participants based on potential subgroups since there is still uncertainty surrounding a division into subgroups other than CRSsNP and CRSwNP and we chose to focus on the phenotype CRSwNP itself. Similarly, we chose not to record allergy history due to the controversy surrounding allergy as a possible association to CRSwNP [49]. However, two possible genetic subgroups of CRSwNP; CRSwNP with concomitant asthma and aspirin-exacerbated respiratory disease (AERD) warrant attention in our minds. Since 51.5% of the participants with nasal polyps also had asthma it is possible that some of the associations could be due to an association with asthma. However, this is less likely since 17.3% of the subjects in the healthy group without polyps also had asthma. The high number of participants with both CRSwNP and asthma could have diluted the association analysis if their SNP-profiles differ significantly from subjects with CRSwNP but without asthma and potentially have made us miss markers of importance in the association analysis. This situation is likely to be countered by the gene enrichment and pathway analyses. HLCS, BICD2, and SLC5A1 have not been connected to asthma, VSIR was implicated with lung function decline in non-asthmatic patients in a genome-wide study published in 2012 but this association could not be confirmed by replication [50]. HLA-DRA is associated to asthma [51] but also, as mentioned above, associated to the presence of nasal polyps in a cohort of asthmatic patients [36]. AERD is another condition linked to CRSwNP, a meta-analysis published in 2015 found a large variation in the prevalence of AERD among patients with CRSwNP among the included studies, the overall prevalence was 9.7% [52]. One of the included studies is from the same geographical region as 96% of our test subjects and found that the prevalence of AERD among subjects with CRSwNP was 6/82 but the numbers were thought to be too small for any meaningful statistical analysis [53]. Although some of our participants probably suffer from AERD, the overall number is in all likelihood too small to influence the association analysis and post-GWAS analyses significantly. HLCS, BICD2, SLC5A1 and VSIR have not been associated with AERD in previous studies. HLA-DRA has not been linked to AERD but other HLA class II genes have [54], however, the same study that linked HLA-DRA to the presence of nasal polyps among asthmatic patients found two HLA-DRA polymorphisms to be potential markers for nasal polyp development in aspirin-tolerant asthma compared to the AERD subgroup [36]. With these caveats in mind, this study is the first of its kind. It is currently the only GWAS performed on CRSwNP and the only study that explores linkage and family-based genome-wide association with regards to this condition. Despite the issue of accurate phenotyping discussed above, this study suggests four novel genes as potential targets of interest for future research as well as once again implicate HLA-DRA.

Conclusion

This study suggests that HLA-DRA as well as four additional genes; HLCS, VSIR, BICD2 and SLC5A1, which have not been previously identified as associated with chronic rhinosinusitis with nasal polyps, could be important for the development of this disease.

List of target genes from the top 20 gene-sets in the INRICH analysis.

(DOCX) Click here for additional data file.
  51 in total

1.  Prevalence of nasal polyposis in France: a cross-sectional, case-control study.

Authors:  J M Klossek; F Neukirch; C Pribil; R Jankowski; E Serrano; I Chanal; A El Hasnaoui
Journal:  Allergy       Date:  2005-02       Impact factor: 13.146

2.  DQA1 and DQB1 association and nasal polyposis.

Authors:  Germán Fajardo-Dolci; Jesús Solorio-Abreu; Julio César Romero-Alvarez; Beatríz Zavaleta-Villa; Oscar Cerezo-Camacho; Rocío Jiménez-Lucio; Angélica Olivo-Díaz
Journal:  Otolaryngol Head Neck Surg       Date:  2006-08       Impact factor: 3.497

3.  HLA-DRA polymorphisms associated with risk of nasal polyposis in asthmatic patients.

Authors:  Jeong-Hyun Kim; Byung-Lae Park; Hyun Sub Cheong; Charisse Flerida A Pasaje; Joon Seol Bae; Jong Sook Park; Soo-Taek Uh; Yong-Hoon Kim; Mi-Kyeong Kim; Inseon S Choi; Byoung Whui Choi; Choon-Sik Park; Hyoung Doo Shin
Journal:  Am J Rhinol Allergy       Date:  2012 Jan-Feb       Impact factor: 2.467

4.  Familial aggregation of sinonasal polyps correlates with severity of disease.

Authors:  Noam A Cohen; Jeffrey S Widelitz; Alexander G Chiu; James N Palmer; David W Kennedy
Journal:  Otolaryngol Head Neck Surg       Date:  2006-04       Impact factor: 3.497

5.  A multi-centre study on quality of life and absenteeism in patients with CRS referred for endoscopic surgery.

Authors:  P Sahlstrand-Johnson; B Ohlsson; C Von Buchwald; M Jannert; Marianne Ahlner-Elmqvist
Journal:  Rhinology       Date:  2011-10       Impact factor: 3.681

6.  Familial risk of chronic rhinosinusitis with and without nasal polyposis: genetics or environment.

Authors:  Gretchen M Oakley; Karen Curtin; Quinn Orb; Carole Schaefer; Richard R Orlandi; Jeremiah A Alt
Journal:  Int Forum Allergy Rhinol       Date:  2015-02-12       Impact factor: 3.858

7.  Evaluation of HLA-A, -B, -Cw, and -DRB1 alleles frequency in Turkish patients with nasal polyposis.

Authors:  Bahar Keles; Tülin Cora; Hasan Acar; Hamdi Arbag; Ziya Inan; Kayhan Ozturk; Bedri Ozer
Journal:  Otolaryngol Head Neck Surg       Date:  2008-10       Impact factor: 3.497

8.  Interrelationship between the Na+/glucose cotransporter and CFTR in Caco-2 cells: relevance to cystic fibrosis.

Authors:  C Mailleau; J Capeau; M C Brahimi-Horn
Journal:  J Cell Physiol       Date:  1998-09       Impact factor: 6.384

9.  Mapping cis- and trans-regulatory effects across multiple tissues in twins.

Authors:  Elin Grundberg; Kerrin S Small; Åsa K Hedman; Alexandra C Nica; Alfonso Buil; Sarah Keildson; Jordana T Bell; Tsun-Po Yang; Eshwar Meduri; Amy Barrett; James Nisbett; Magdalena Sekowska; Alicja Wilk; So-Youn Shin; Daniel Glass; Mary Travers; Josine L Min; Sue Ring; Karen Ho; Gudmar Thorleifsson; Augustine Kong; Unnur Thorsteindottir; Chrysanthi Ainali; Antigone S Dimas; Neelam Hassanali; Catherine Ingle; David Knowles; Maria Krestyaninova; Christopher E Lowe; Paola Di Meglio; Stephen B Montgomery; Leopold Parts; Simon Potter; Gabriela Surdulescu; Loukia Tsaprouni; Sophia Tsoka; Veronique Bataille; Richard Durbin; Frank O Nestle; Stephen O'Rahilly; Nicole Soranzo; Cecilia M Lindgren; Krina T Zondervan; Kourosh R Ahmadi; Eric E Schadt; Kari Stefansson; George Davey Smith; Mark I McCarthy; Panos Deloukas; Emmanouil T Dermitzakis; Tim D Spector
Journal:  Nat Genet       Date:  2012-09-02       Impact factor: 38.330

10.  Mutations in BICD2 cause dominant congenital spinal muscular atrophy and hereditary spastic paraplegia.

Authors:  Emily C Oates; Alexander M Rossor; Majid Hafezparast; Michael Gonzalez; Fiorella Speziani; Daniel G MacArthur; Monkol Lek; Ellen Cottenie; Mariacristina Scoto; A Reghan Foley; Matthew Hurles; Henry Houlden; Linda Greensmith; Michaela Auer-Grumbach; Thomas R Pieber; Tim M Strom; Rebecca Schule; David N Herrmann; Janet E Sowden; Gyula Acsadi; Manoj P Menezes; Nigel F Clarke; Stephan Züchner; Francesco Muntoni; Kathryn N North; Mary M Reilly
Journal:  Am J Hum Genet       Date:  2013-05-09       Impact factor: 11.025

View more
  5 in total

Review 1.  Next Generation Sequencing and Bioinformatics Analysis of Family Genetic Inheritance.

Authors:  Aquillah M Kanzi; James Emmanuel San; Benjamin Chimukangara; Eduan Wilkinson; Maryam Fish; Veron Ramsuran; Tulio de Oliveira
Journal:  Front Genet       Date:  2020-10-23       Impact factor: 4.599

2.  Glutathione S-transferase gene polymorphisms and risk of nasal or colorectal polyposis.

Authors:  Yonglan Zhang; Haichao Zhang; Peng Lin; Guimin Zhang
Journal:  Biosci Rep       Date:  2019-01-25       Impact factor: 3.840

3.  Transcriptional Changes in Chronic Rhinosinusitis with Asthma Favor a Type 2 Molecular Endotype Independent of Polyp Status.

Authors:  Amarbir S Gill; Abigail Pulsipher; Jorgen S Sumsion; Gretchen M Oakley; Laurie W Leclair; Heather Howe; Richard R Orlandi; Jeremiah A Alt
Journal:  J Asthma Allergy       Date:  2021-04-21

4.  Multiple genetic variations of chronic rhinosinusitis with nasal polyps are associated with respiratory parameters in men with obstructive sleep apnea.

Authors:  Qianqian Zhang; Xiaoting Wang; Xiangyu Cheng; Xiaolin Wu; Yunhai Feng; Huajun Xu; Huaming Zhu; Hongliang Yi; Weitian Zhang; Xinyi Li; Haibo Ye
Journal:  Sleep Breath       Date:  2021-03-26       Impact factor: 2.816

5.  Regional differences in endoscopic sinus surgery in Finland: a nationwide register-based study.

Authors:  Sanna Toppila-Salmi; Heikki Rihkanen; Martti Arffman; Kristiina Manderbacka; Ilmo Keskimaki; Maija L Hytönen
Journal:  BMJ Open       Date:  2018-10-18       Impact factor: 2.692

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.