Literature DB >> 32603359

Age-of-onset information helps identify 76 genetic variants associated with allergic disease.

Manuel A R Ferreira1, Judith M Vonk2, Hansjörg Baurecht3,4, Ingo Marenholz5,6, Chao Tian7, Joshua D Hoffman8, Quinta Helmer9, Annika Tillander10, Vilhelmina Ullemar10, Yi Lu10, Sarah Grosche5,6,11, Franz Rüschendorf5, Raquel Granell12, Ben M Brumpton12,13,14, Lars G Fritsche13,15, Laxmi Bhatta13, Maiken E Gabrielsen13, Jonas B Nielsen16,17, Wei Zhou17, Kristian Hveem13, Arnulf Langhammer18, Oddgeir L Holmen13, Mari Løset13,19, Gonçalo R Abecasis13,15, Cristen J Willer15,16,17, Nima C Emami20,21, Taylor B Cavazos20, John S Witte20,21,22,23, Agnieszka Szwajda24, David A Hinds7, Norbert Hübner5, Stephan Weidinger3, Patrik Ke Magnusson10, Eric Jorgenson25, Robert Karlsson10, Lavinia Paternoster12, Dorret I Boomsma9, Catarina Almqvist10,26, Young-Ae Lee5,6, Gerard H Koppelman27.   

Abstract

Risk factors that contribute to inter-individual differences in the age-of-onset of allergic diseases are poorly understood. The aim of this study was to identify genetic risk variants associated with the age at which symptoms of allergic disease first develop, considering information from asthma, hay fever and eczema. Self-reported age-of-onset information was available for 117,130 genotyped individuals of European ancestry from the UK Biobank study. For each individual, we identified the earliest age at which asthma, hay fever and/or eczema was first diagnosed and performed a genome-wide association study (GWAS) of this combined age-of-onset phenotype. We identified 50 variants with a significant independent association (P<3x10-8) with age-of-onset. Forty-five variants had comparable effects on the onset of the three individual diseases and 38 were also associated with allergic disease case-control status in an independent study (n = 222,484). We observed a strong negative genetic correlation between age-of-onset and case-control status of allergic disease (rg = -0.63, P = 4.5x10-61), indicating that cases with early disease onset have a greater burden of allergy risk alleles than those with late disease onset. Subsequently, a multivariate GWAS of age-of-onset and case-control status identified a further 26 associations that were missed by the univariate analyses of age-of-onset or case-control status only. Collectively, of the 76 variants identified, 18 represent novel associations for allergic disease. We identified 81 likely target genes of the 76 associated variants based on information from expression quantitative trait loci (eQTL) and non-synonymous variants, of which we highlight ADAM15, FOSL2, TRIM8, BMPR2, CD200R1, PRKCQ, NOD2, SMAD4, ABCA7 and UBE2L3. Our results support the notion that early and late onset allergic disease have partly distinct genetic architectures, potentially explaining known differences in pathophysiology between individuals.

Entities:  

Mesh:

Year:  2020        PMID: 32603359      PMCID: PMC7367489          DOI: 10.1371/journal.pgen.1008725

Source DB:  PubMed          Journal:  PLoS Genet        ISSN: 1553-7390            Impact factor:   5.917


Introduction

In the last 10 years, at least 45 genome-wide association studies (GWAS) of allergic disease susceptibility were published: 25 for asthma (reviewed in [1]), three for hay fever (or allergic rhinitis) [2-4], eight for eczema (or atopic dermatitis) [5-12], four for food allergy [13-16] and six for allergy-related traits, namely atopic march [17], asthma with hay fever [18], allergies [19], allergic sensitization [2,20,21] and a combined asthma, hay fever and eczema phenotype [22]. Genetic risk variants identified in these studies provide a foundation to help us better understand why and how allergic disease develops in susceptible individuals. One twin study has previously indicated that the timing of asthma onset may be under genetic control [23]. In the first genome wide association study for asthma published in 2007, it was reported that the ORMDL3/GSDMA locus at chromosome 17q12 was specifically associated with childhood onset asthma[24]. This observation was subsequently confirmed, showing strong associations of this locus with childhood-onset asthma, potentially interacting with passive cigarette smoke exposure in early childhood [25] or as childhood onset asthma defined as asthma developing before 16 years of age) but not later onset asthma in the GABRIEL consortium [26] In subsequent stratified analyses in a multinational study, it was reported that the association of the 17q risk SNP rs7216389-T was confined to cases with early onset of asthma, particularly in early childhood (age: 0–5 years) and adolescence (age: 14–17 years), but a weaker association was observed for onset between 6 and 13 years of age, whereas no association was observed for adult-onset asthma [27]. This shows that defining cut-offs for age at onset of asthma is difficult, and that other approaches such as using a continuous age at onset might be beneficial. To our knowledge, only three studies have reported genetic variants that associate with the age at which allergic disease symptoms first develop. Forno et al. [28] studied asthma age-of-onset in 573 children and identified two variants that had a genome-wide significant association after combining the discovery and replication (n = 931) cohorts: rs9815663 near the CRBN gene on chromosome 3p26, and rs7927044 near ETS1 on 11q24. In a more recent GWAS of 5,462 cases with asthma, Sarnowski et al. [29] identified five variants associated with age-of-onset, located in/near: CYLD on 16q12 (rs1861760), IL1RL1 on 2q12 (rs10208293), HLA-DQA1 on 6p21 (rs9272346), IL33 on 9p24 (rs928413) and GSDMA on 17q12 (rs9901146). The latter four variants were previously reported to be associated with allergic disease susceptibility as well. Lastly, Ferreira et al. [22] reported that 26 of 136 variants associated with allergic disease risk were also associated with the age at which allergic symptoms first developed (n = 35,972). Amongst these were five variants for which the association with age-of-onset was genome-wide significant: rs61816761 in the FLG gene and rs12123821 near HRNR, both on chromosome 1q21; rs921650 in GSDMB on 17q12; rs10865050 in IL18R1 on 2q12; and rs7936323 near LRRC32 on 11q13. Two of the variants reported in Ferreira et al. (rs10865050 and rs921650) were in linkage disequilibrium (LD) with variants reported in Sarnowski et al., and so are unlikely to represent independent associations. Therefore, collectively across these three studies, 12 variants (2+5+5, including 10 in low LD with each other) were reported to associate with age-of-onset of allergic disease at the genome-wide significance level. Of interest, the joint association between age-at-onset and disease susceptibility at some of these loci [29] suggests that both phenotypes are genetically correlated, and so that combining information from both may improve power to identify variants that influence the aetiology of allergic disease. The main aim of this study was to identify novel loci that contribute to inter-individual variability in the age at which allergic symptoms first develop, considering information from the three most common allergic diseases: asthma, hay fever and eczema. Rather than study the age-of-onset of each disease separately, we adopted the multi-disease phenotype approach that we used recently to identify risk variants that are shared across different allergic diseases [22]. Specifically, we determined the earliest age at which asthma and/or hay fever and/or eczema first developed and then tested this single combined age-of-onset of allergic disease phenotype in a GWAS. In addition, we also tested if variants associated with disease age-of-onset were also associated with disease risk, as noted by Sarnowski et al. [29]. Lastly, we used multivariate association analysis to identify variants jointly associated with allergic disease age-of-onset and case-control status, which were missed by analyzing each phenotype alone.

Results

Genetic variants associated with the age-of-onset of allergic disease

Our study population consisted of n = 117,130 participants from the UK Biobank study ( who had a mean age of 55.5 years (range 38–72 years), with a mean (median) age at onset of any allergic disease of 26.3 (22) years, defined as the earliest age at which any allergic disease (asthma, hay fever or eczema) was first reported (see for distribution). We first performed a GWAS of a combined age-of-onset phenotype (n = 117,130 from the UK Biobank study. After adjusting the association results () for the observed LD-score regression intercept [30] of 1.025, we identified 4,160 variants with a genome-wide significant association with age-of-onset (P<3x10-8, ). Of these, 50 variants in 40 loci (i.e. regions >1 Mb apart) remained associated at that threshold after accounting for the effects of adjacent SNPs in joint association analysis (<10 Mb; and ), indicating that they represent statistically independent associations with age-of-onset. Henceforth, we refer to these SNPs as sentinel variants for age-of-onset. Two additional variants had a P<3x10-8 in the joint but not in the original single-SNP analysis (), both located in the major histocompatibility complex (MHC) locus. These represent secondary association signals at the MHC that were masked in the original GWAS by the association with other stronger nearby SNPs.

Summary of results from the GWAS of allergic disease age-of-onset in the UK Biobank study (n = 117,130).

UK Biobank participants reported age-of-onset for asthma and, in a single separate question, for hay fever/eczema. In this analysis, we took the earliest age-of-onset reported across these two questionnaire items and tested this phenotype for association with SNP allelic dosage. We identified 4,160 variants associated with age-of-onset at a P<3x10-8 (red circles), including 50 with a statistically independent association. # Frequency of effect allele in the allergic disease cases studied. Three of the 50 sentinel variants were in linkage disequilibrium (LD; r2>0.8) with variants previously reported to have a genome-wide significant association with asthma age-of-onset [29]: rs72823628 in IL18R1, rs7848215 near IL33 and rs4795400 in GSDMB. Similarly, an additional three variants were in LD with SNPs that we reported recently [22] to be associated with the same combined age-of-onset phenotype: rs61816761 in FLG, rs12123821 near HRNR and rs11236791 near LRRC32. On the other hand, to our knowledge, the remaining 44 sentinel variants have not previously been implicated in the age-of-onset of any allergic disease at the genome-wide significance level. Of the 12 specific variants previously reported to associate with allergic disease age-of-onset, 11 were tested in our current age-of-onset GWAS, of which nine had a highly significant and directionally concordant association (). For two variants, there was no evidence for association with the combined age-of-onset phenotype: rs1861760 near CYLD (P = 0.41), reported by Sarnowski et al. [29], and rs9815663 near CRBN (P = 0.67), reported by Forno et al. [28]. The second variant reported by Forno et al. had a MAF<1% and so it was not tested in our current age-of-onset GWAS. We did however test this variant ad-hoc and found that it was not significantly associated with age-of-onset (P = 0.35, not shown).

Potential impact of recall bias and phenotypic misclassification on SNP associations

All UK Biobank participants included in our analyses were adults (aged 38 to 70) at the time of data collection, and so recall bias might have affected the reported age-of-onset. Furthermore, proportionally, there were many individuals who reported late onset of allergic disease (e.g. 41% of asthmatics with onset ≥40 years old), which could have resulted from recall bias and/or phenotypic misclassification. We performed an additional set of analyses to determine if these potential confounders were likely to have had a major impact on the SNP associations described above. We addressed reliability of the age-of-onset information by comparing the self-reported age-of-onset between two surveys that were between 4–7 years apart. Age of onset was within 5 years accurate in 86% of cases. Subjects that reported less reliable information were likely to be older at enrollment. Older subjects were also less likely to report childhood onset asthma. When we analyzed the 50 sentinel variants in subjects who reported developing asthma as a child, and secondly, rhinitis as a child, we obtained highly consistent results, see and , respectively. We also replicated our findings in a prospective birth cohort ALSPAC, and show a high correlation of 0.67–0.825 of the effect size of our analysis with the results obtained in the ALSPAC study. Since the ALSPAC study prospectively assessed asthma, recall bias in this study is not a concern. Moreover, we correlated our findings for adult-onset asthma with two independent, published datasets of asthma GWAS performed by the GABRIEL consortium [26] and the TAGC consortium [31], and identified a substantial genetic correlation of rg of 0.62 and 0.66, respectively. We further correlated our UKBB results of adult onset asthma with an analysis of adult onset asthma in the HUNT study, and again observed a significant genetic correlation rg of 0.69. Further details of these analyses are provided in (page 9) and –, and .

Association with age-of-onset in individuals suffering from a single allergic disease

By analyzing an age-of-onset phenotype that considered information from asthma, hay fever and eczema, the GWAS described above was expected to identify variants that affect age-of-onset broadly across the three diseases. To formally address this possibility, we tested each of the 50 sentinel variants identified above for association with the age-of-onset of asthma, hay fever and eczema, in three separate analyses. Specifically, we analysed age-of-onset in three non-overlapping groups of individuals (): those who reported suffering only from asthma (n = 22,029), only from hay fever (n = 14,474) or only from eczema (n = 3,969). Within each of these groups, we tested the association between the 50 sentinel variants and disease age-of-onset, using BOLT-LMM. In individuals suffering from asthma only, 19 sentinel variants were associated with variation in age-of-onset at P<3.3x10-4 (43 at P<0.05), which corrects for 50 SNPs tested in 3 groups, despite the smaller sample size of this analysis (). For hay fever and eczema, there were respectively 8 and 5 SNPs associated with age-of-onset at that significance threshold (24 and 12 at P<0.05). Of note, the directional effect observed with the combined phenotype was the same as in the single disease analyses for most sentinel variants (100% for asthma, 94% for hay fever and 80% for eczema). Lastly, when we formally compared the effect of each sentinel variant on age-of-onset (i.e. the beta from the linear model) between pairs of diseases, we found that most variants (45 of 50, 90%) did not have significant disease-specific effects on age-of-onset (all pairwise comparisons with P>3.3x10-4; ). The exceptions were four variants located on chromosomes 1q21.3 (in/near TCHHL1, HRNR, FLG and SPRR2A) which had significantly stronger effects on age-of-onset of eczema, and one on 17q12 (in GSDMB) which had a stronger effect on the age-of-onset of asthma (). Therefore, we conclude that most (45 of 50) sentinel variants identified in the GWAS of the combined age-of-onset phenotype have similar effects when considering the age-of-onset separately for asthma, hay fever and eczema.

Variants with evidence for disease-specific effects on age-of-onset.

Each of the 50 variants identified in the GWAS of age-of-onset were tested for association with age-of-onset in three non-overlapping groups of individuals: those suffering from asthma only (n = 22,029), hay fever only (n = 14,474) and eczema only (n = 3,969). We then compared the effects (i.e. betas) obtained in these three groups. For 5 of the 50 variants (shown with an orange inner triangle), the effect on age-of-onset was significantly different (P<0.05/(3 x 50) = 3.3x10-4) between at least two groups. For a given variant, the vertices of the inner triangle point to the position along the edges of the outer triangle that corresponds to difference in effect observed between pairs of single-disease cases. For example, the rs61816761[A] allele, which is located in the FLG gene (filaggrin), had an effect on age-of-onset that was larger (absolute of difference = 0.42) in individuals suffering only from eczema when compared to individuals suffering only from hay fever (P = 4.3x10−8), consistent with this SNP having a stronger effect on the age-of-onset of eczema than of hay fever. For comparison, a variant with no significant differences when comparing the effect on age-of-onset in all three pairwise single-disease association analyses is also shown (rs705699, in the RAB5B gene). In this case, the difference in effect was approximately equal to 0 in the three pairwise comparisons. The color of the difference in effect reflects the significance of the corresponding z-score (see Methods): red for P< 3.2x10−4 (correction for multiple testing), blue for P<0.05 and black for P>0.05.

Association between age-of-onset sentinel variants and allergic disease risk

We then asked if the 50 sentinel variants were also likely to influence the risk of developing allergic disease, in addition to contributing to variation in age-of-onset amongst affected individuals. To this end, we investigated the association between each sentinel variant and a combined allergic disease phenotype, as reported in our recent GWAS [22]. After excluding the UK Biobank study from that GWAS, association results were based on data from 137,883 cases with asthma and/or hay fever and/or eczema, and 84,601 disease-free controls. Forty-eight of the 50 sentinel variants were tested in that GWAS, either directly or via a proxy (one variant), of which 38 (or 79%) were significantly associated with disease risk (P<0.001, which corrects for 48 tests; ). This includes 19 variants for which the association with disease risk was genome-wide significant (P<3x10-8); that is, variants that represent previously known risk factors for allergic disease. Notably, for all 48 variants tested, the allele associated with a higher disease risk was associated with a lower age-of-onset. Therefore, we conclude that the sentinel variants identified influence both the likelihood of developing any allergic disease as well as the age at which symptoms first develop. # Association results from 12 of the 13 individual studies reported in the Ferreira et al. 2017 allergic disease GWAS were included in this analysis (all except UK Biobank). Results from the individual studies were adjusted for the respective study-specific LD-score intercept and then combined using a fixed-effects meta-analysis, as described previously [22]. The LD-score intercept of this 12-study meta-analysis was 1.018 (attenuation ratio of 0.0717) * rs7559046 was not directly tested in the Ferreira et al. 2017 GWAS, and so we used a proxy instead (rs6547850, r = 0.93).

Genetic correlation between age-of-onset and disease case-control status

For all 50 sentinel age-of-onset variants, the allele that was associated with a lower age-of-onset was associated with a higher risk of allergic disease. This observation suggested that these two traits–age-of-onset and case-control status of allergic disease–have a substantial negative genetic correlation; to our knowledge, this has not been previously estimated. To understand the extent to which the same genetic variants contribute to variation in these two traits, we applied LD-score regression [30] to the summary statistics of our age-of-onset and allergic disease [22] GWAS. Based on 1.1 million common SNPs, the genetic correlation between the two traits was estimated to be -0.625 (SE = 0.038, P = 4.5x10-61). This estimate was not expected to be biased by the sample overlap between the two GWAS [32], which we confirmed when we excluded samples from the UK Biobank study from the allergic disease [22] GWAS (r = -0.612, SE = 0.046, P = 5.0x10-41). These results indicate that a substantial fraction of genetic variants are likely to influence both the liability to, and the age-of-onset of, allergic disease. Furthermore, for most (but not necessarily all) shared variants, the directional effect is such that variants that are associated with higher disease risk are associated with lower age-of-onset. More broadly, these results strongly suggest that a key risk factor that distinguishes individuals with early disease onset from those with late disease onset is the overall genetic burden inherited at allergy-associated SNPs. To illustrate this effect, we compared the distribution of age-of-onset between individuals with the highest (top 10%) and the lowest (bottom 10%) polygenic risk score (PRS) for allergic disease, constructed for each individual from the UK Biobank study based on information from 136 allergy risk variants that we reported recently [22]. This analysis was performed separately for asthma, hay fever and eczema, using the same single-disease case groups described above. For asthma, individuals with the lowest genetic burden of allergic disease (n = 2,202) had a median age-of-onset of 39 years, with only 14% having an age-of-onset before the age of 16; the distribution of age-of-onset was broadly consistent with a pattern of late disease onset (). In contrast, in the group with the highest genetic burden (n = 2,203), the median age-of-onset decreased to 29 years, with 35% of individuals reporting that asthma was diagnosed before the age of 16. In this group, there was a clear shift in the distribution of age-of-onset towards a pattern of early disease onset. Similar results were observed for hay fever and eczema (). Collectively, our results indicate that genetic risk factors for allergic disease are enriched in cases with early disease onset when compared to those with late disease onset.

Distribution of allergic disease age-of-onset as a function of a polygenic risk score (PRS) for allergic disease in UK Biobank participants who reported suffering from a single disease (asthma only, hay fever only and eczema only).

The PRS of each individual was calculated based on 136 SNPs that were associated with allergic disease risk in our recent GWAS [22]. The mean and median of each distribution are shown in red and blue, respectively.

Multivariate GWAS of allergic disease case-control status and age-of-onset

The high genetic correlation observed between case-control status and age-of-onset of allergic disease suggests that a large number of variants contribute to the heritability of both traits. We therefore hypothesized that multivariate association analysis would identify variants jointly associated with both traits that were missed in the single-trait analyses. To this end, we first adjusted the single-SNP results obtained in the age-of-onset and case-control [22] GWAS for the effects of the sentinel variants identified in the respective study. In the two resulting adjusted GWAS, there were no variants with an association significant at a P<3x10-8, as expected (. There was, however, an excess of significant associations when compared to the number expected by chance given the number of SNPs tested ( Many of these associations are likely to represent true positive findings that do not reach the stringent genome-wide significance threshold in each of those two univariate analyses. To help identify these, we then performed multivariate analysis of age-of-onset and case-control status, using metaUSAT [33], which is applicable to association summary statistics. Using this approach, we identified 281 variants with a multivariate P<3x10-8 ( and ), including 26 that were in low LD with each other (r2<0.05) and so that are likely to represent statistically independent associations (). However, the QQ Plots may indicate some inflation of the P values, so therefore, these data need to be interpreted with caution. The genomic inflation factor could not be calculated because metaUSAT does not have a closed form null distribution. Nonetheless, inflation of significant associations can be assessed by comparing the observed and expected number of associations significant at a given significance threshold. We observed 38%, 17%, 10%, 5.9% and 1.9% of SNPs tested with a multivariate P-value <0.5, <0.2, <0.1, <0.05 and <0.01, respectively, when the expectations under the null hypothesis of no association were 50%, 20%, 5% and 1%. For most variants, the association in each of the two univariate analyses was one to four orders of magnitude below genome-wide significance, which was exceeded in the multivariate analysis. For all variants, the allele associated with higher disease risk was associated with lower age-of-onset. Results obtained with the recently described MTAG multivariate approach [34] supported the associations identified with metaUSAT (). We conclude that these 26 variants represent risk factors for both the presence and early onset of allergic disease, which were only detectable when we combined information from the age-of-onset and case-control GWAS.

Summary of results from the multivariate analysis of allergic disease age-of-onset and allergic disease case-control status.

The GWAS of allergic disease age-of-onset was performed in the UK Biobank study (n = 117,130) as described in the main text. The GWAS of allergic disease case-control status included 360,838 individuals, has reported recently [22]. Single-SNP results from each GWAS were adjusted for the top independent associations (P<3x10-8) identified and then multivariate analysis was performed using metaUSAT [33]. We identified 281 variants with a multivariate P<3x10-8 (red circles), including 26 that were in low LD (r<0.05) with each other and so that are likely to represent statistically independent associations.

Sentinel variants not previously implicated in the aetiology of allergic disease

We then determined which of the sentinel variants identified in the age-of-onset and multivariate GWAS described above represented novel associations for allergic disease in general, that is, when considering all previously reported associations with P<5x10-8 for asthma, hay fever, eczema, food allergy and/or atopy. Of the 50 sentinel variants identified in our age-of-onset GWAS, 47 were in LD (r2>0.05) with variants previously reported to associate with allergic disease (). The remaining 3 represent novel associations for allergic disease: rs184587444 in SPRR2A, rs4971089 in KRTCAP2, and rs4809619 in EYA2 (). On the other hand, most 15 of the 26 sentinel variants identified in the multivariate GWAS represent novel associations for allergic disease ( and ), including for example rs7565907 in LCLAT1and rs11242709 near DUSP22. Thus, overall, by considering age-of-onset information, we identified 18 (3+15) novel genetic associations for allergic disease.

Likely target genes of sentinel variants identified in the age-of-onset and multivariate GWAS

To help understand how the 76 sentinel variants might influence allergic disease pathophysiology, we identified genes for which variation in gene expression and/or protein sequence was associated/determined by SNPs in LD with the sentinel variants. We first extracted association summary statistics from 101 published datasets of eQTL identified in five different broad tissue types relevant for allergic disease (). For each gene and for a given eQTL dataset, we then (i) identified cis eQTL in low LD (r2<0.05) with each other, which we refer to as “sentinel eQTL”; and (ii) determined if any of the 76 sentinel variants were in high LD (r2>0.8) with a sentinel eQTL. Using this approach, we found sentinel eQTL in LD with 26 of the 50 (52%) sentinel variants identified in the age-of-onset GWAS (), and with 15 of the 26 (58%) sentinel variants identified in the multivariate GWAS (). The sentinel eQTL implicated respectively 47 and 28 genes (one in common: HLA-DQB1) as likely targets of the sentinel variants identified in these two GWAS (). Second, we found 21 non-synonymous SNPs in 14 genes that were in high LD (r2>0.8) with sentinel variants identified in the age-of-onset or multivariate GWAS (). This list included, for example, four non-synonymous SNPs in the CD200R1 gene that were in complete LD (r2 = 1) with the sentinel variant identified in the multivariate GWAS. Of the 14 genes, seven were novel target predictions, that is, they were not identified in the eQTL analysis described above: FLG, EFNA1, SH2B3, TNFRSF14, HIST1H2BE, MLX and YDJC. Overall, when considering information from eQTL and non-synonymous SNPs, we identified 81 (47+27+7) likely target genes of the 76 sentinel variants identified in this study.

Association between the 76 sentinel variants and the risk of food allergy

Finally, we tested if the sentinel variants identified above were associated with food allergy case-control status, in children and adults separately. Although the discovery analysis that identified the sentinel variants for age of onset of allergy did not include food allergy, we hypothesized that these sentinel variants may also relate to food allergy. First, we extracted association results from GWAS that we published recently [15], comprising 497 children with food allergy diagnosed by oral food challenge in the GOFA study and 2,387 controls. This study comprised a highly selected group of children with early onset food allergy (mean age at diagnosis was 2.1 years). In that GWAS, nine of the 76 sentinel variants were significantly associated with food allergy after correcting for multiple testing (P<0.05/76 = 0.00065; ), namely those in/near FLG (four variants), KIF3A, LRRC32, RAD50, CYLD, and SERPINB7. Overall, there was a very close agreement in SNP associations between the age-of-onset and food allergy analyses (); for example, for 66 of 76 variants the allele associated with a lower age-of-onset onset was associated with a higher disease risk (binomial test P = 2x10-12). To assess the association between the 76 sentinel variants and food allergy risk in adults, we extracted association results from a GWAS of self-reported food allergy conducted in the adult GERA cohort [16], which included 5,108 subjects with self-reported food allergy, of whom 1,104 were admitted to hospital because of food allergy and 23,945 controls who did not report to have food allergy. In this GWAS, we compared the 1,104 subjects admitted to hospital because of food allergy to the 23.945 controls. No single variant was significantly associated with food allergy after correcting for multiple testing (). Across the 72 variants tested in both the child and adult food allergy GWAS, only 43 (60%) had a directionally consistent association, reflecting very little agreement between results from the two analyses. Overall, our results show that many variants associated with allergic disease age-of-onset also represent genetic risk factors for food allergy in young children but not (or less so) in adults. Moreover, the self-report of food allergy in the adult population is more subject to misclassification and may also have contributed to this latter observation.

Discussion

In this study, we identified (i) 50 variants associated with allergic disease age-of-onset; (ii) a significant negative genetic correlation between allergic disease age-of-onset and case-control status; (iii) 26 additional variants jointly associated with allergic disease age-of-onset and case-control status; (iv) 81 genes that are likely targets of sentinel variants identified in the age-of-onset or multivariate GWAS; and (v) nine variants (out of the 76) that are also associated with the risk of food allergy in young children. Amongst the 50 associations for allergic disease age-of-onset, six were reported in previous studies of age-of-onset [28,29], but the remaining 44 were novel associations for this phenotype. Conversely, of the 12 variants reported in previous studies, nine were associated in our GWAS, but three were not (in/near CYLD/NOD2 [29], CRBN [28] and ETS1 [28]). Possible explanations for the lack of association with these three variants in our GWAS is that their effect on age-of-onset is population specific or specific to asthma, the disease considered in the original studies. However, the three variants were also not significantly associated with age-of-onset when we restricted our analysis to cases who suffered only from asthma (P = 0.17, P = 0.64 and P = 0.26, respectively), which suggests that disease-specific effects are unlikely to explain the discordance. When we compared the effect of the 50 variants on the age-of-onset of each of the three individual diseases, we found significant differences only for five variants. Four of these had a stronger effect on the age-of-onset of eczema: those in/near HRNR (rs1213821), FLG (rs61816761), TCHHL1 (rs115045402), SPRR2A (rs184587444), all within a 1 Mb locus on chromosome 1q21. The former two represent known risk factors for allergic disease, with a stronger effect on eczema [22], consistent with our results. On the other hand, the latter two variants, which are relatively uncommon (MAF of 2.8% and 2.0%), have not previously been established as risk factors for allergic disease, although our results for age-of-onset suggest that this is very likely to be the case. We did not find any eQTL in LD with either variant; on the other hand, both variants are in low to moderate LD with rs558269137 (r2 = 0.46 and 0.24, respectively), which encodes the FLG 2282del4 mutation that is associated with eczema and ichthyosis vulgaris [35]. It is therefore possible that at one (or both) of these variants are tagging that mutation, which was not tested in our study. The fifth variant, rs4795400 in GSDMB, showed a stronger effect on the age-of-onset of asthma. This variant is in high LD (r2>0.8) with variants reported to associate with earlier age-of-onset for asthma (rs9901146) [29] and which are stronger risk factors for asthma when compared to hay fever and eczema (rs921650) [22], consistent with our results. For the remaining 45 variants identified in our GWAS, our results suggest that their effect on age-of-onset is comparable between the three individual diseases. We also investigated if the 50 variants that determined variation in age-of-onset amongst allergic disease cases also contributed to differences in case-control status amongst an independent sample of 222,484 individuals not part of the UK Biobank that we studied recently [22]. Remarkably, 39 of 48 variants with available results had a significant association with case-control status after accounting for multiple testing. Furthermore, for all 39 variants (and also for the other nine tested), the disease-predisposing allele was associated with a lower age-of-onset. These results suggested that case-control status and age-of-onset have a strong negative genetic correlation, which we confirmed (rg = -0.63) using genome-wide SNP data. We highlight two implications that arise from this observation. First, this observation confirms that many genetic variants, including those identified in our age-of-onset GWAS, determine both the lifetime risk of developing an allergic disease as well as the age at which symptoms first develop. As such, combining information from these two phenotypes can help identify variants that influence disease liability, as suggested previously [29]. Motivated by this prediction, we performed multivariate analysis of results from our GWAS of age-of-onset and our recently published GWAS of allergic disease case-control status, which also considered information from asthma, hay fever and eczema. Importantly, we used a multivariate approach (metaUSAT [33]) that was expected to increase power to detect an association with a variant that influences both traits, when compared to other methods that are also applicable to GWAS summary statistics (e.g. metaCCA [36]). In this analysis, we identified 26 variants that were missed by the individual GWAS, highlighting the substantial gain in power that can be obtained by combining information from age-of-onset and case-control status. Of these 26 variants, only six were in LD (r2>0.05) with variants previously reported in GWAS of allergic disease. Therefore, most represent new associations for both age-of-onset and disease risk. Since we were not able to formally replicate these findings in an independent study, we emphasize the importance of future studies to replicate our results. We also suggest that this approach could be extended to include other phenotypes that can be shown to have a significant genetic correlation with disease risk; for example, these could be disease severity or markers of allergic sensitization. Second, the large negative genetic correlation between case-control status and age-of-onset indicates that for most variants associated with both traits, the allele that is more common in allergic disease cases (when compared to controls) is also more common in cases with early onset disease (when compared to those with late onset disease). That is, individuals who inherit a larger overall burden of allergy-predisposing alleles are more likely to have early onset disease when compared to those who inherit a lower genetic burden, consistent with previous findings [37]. This shows that allergic disease risk alleles are more common in early onset disease, which might imply that allergic disease with late onset is less heritable (i.e. more ‘environmental’) than allergic disease with early onset. For example, it is conceivable that in late onset disease, environmental (more than genetic) risk factors dysregulate the expression of genes that play a key role in disease pathophysiology through epigenetic mechanisms, as we suggested recently [22]. But that may not necessarily be the case. Instead, it is possible that individuals develop late onset disease because they inherit risk alleles that influence asthma, hay fever and/or eczema pathophysiology through mechanisms that are not shared with early onset disease. Studies that address these possibilities are warranted. It is also important to highlight that we cannot rule out the possibility that recall bias might have contributed to the negative genetic correlation observed between age-of-onset and case-control status. This might have occurred if recall bias was less common amongst subjects who reported a younger age of onset. We used information from eQTL studies and non-synonymous SNPs to identify 81 genes that are likely targets of 48 of the 76 (63%) variants identified in either the age-of-onset or multivariate GWAS performed. In the (page 10–15), we discuss in greater detail 10 genes that are plausible targets of novel allergic disease variants identified in our study and that have a known function that is directly relevant to disease pathophysiology. In brief, the 10 genes are: ADAM15, a metalloproteinase which cleaves the toll like receptor adaptor molecule TRIF [38] and the low affinity IgE receptor [39]; FOSL2, a regulator of cell proliferation involved in B cell, Th17 cell and epidermal differentiation and function [40-42]; TRIM8, a ligase involved in post-translational modifications of proteins, including ubiquitination of TAK1 [43] and TRIF [44]; BMPR2, a receptor for the TGF-beta superfamily [45] that inhibits Smad-mediated signaling [46]; CD200R1, a surface glycoprotein that interacts with CD200 [47], which is known to suppress the activation of various immune cells, including macrophages [48], mast cells [49], monocytes [50] and dendritic cells [51]; PRKCQ, a protein kinase involved in the development and function of Th17 cells [52], Th2 cells [53], Tregs [54] and type 2 innate lymphoid cells [55]; NOD2, an intracellular pattern recognition receptor that upon activation by bacterial peptidoglycans [56] and viruses [57] promotes host defense through the production of inflammatory mediators [58-60]; SMAD4, a central regulator of TGF-beta signaling [61], involved in Th2 cytokine production [62], Treg [63] and Th17 differentiation [64], the expression of selectin ligands [65] and of the pro-allergic cytokine IL-9 [66]; ABCA7, a transporter protein that moves lipids across membranes [67], enhances phagocytosis of apoptotic cells by macrophages [68], promotes NKT cell development and function [69], and was suggested to play a role in keratinocyte differentiation [70]; and UBE2L3, an essential component of the post-translational protein ubiquitination pathway, which plays a major role in the regulation of inflammatory responses [71-75]. The combined age-of-onset phenotype analysed did not take into account information from food allergy, as this was not available in the UK Biobank study. To partly address this limitation, we tested if the 76 sentinel variants identified in the age-of-onset or multivariate GWAS were also associated with food allergy, both in children and adults. After correcting for multiple testing, nine variants were significantly associated with food allergy confirmed by oral challenge in young children of the GOFA study [15], including one variant located in a locus not previously reported in food allergy GWAS: rs8056255 near CYLD. As such, this variant represents a putative novel risk factor for food allergy, which should be studied in greater detail in future studies. On the other hand, there was no evidence that the sentinel variants for age-of-onset identified in our study were associated with food allergies (based on hospital admissions) in adults of the GERA cohort. The lack of agreement between the food allergy results obtained in the GOFA and GERA studies raises the possibility that genetic risk factors for food allergy in children and adults might be largely distinct, which warrants further investigation. Another potential limitation of our study is that age-of-onset reported by UK Biobank participants may have been affected by recall bias. For example, individuals with current disease symptoms at the time of data collection might have recalled early onset of disease more reliably than those who no longer suffered from allergies. We addressed this potential limitation by testing the association between the 50 sentinel variants identified in the age-of-onset GWAS in a subset of UK Biobank individuals who reported developing asthma as a child, specifically up to age 19. We found that the association between the 50 sentinel variants and age-of-onset in this smaller but more homogenous group of allergic disease cases was consistent with results obtained in the overall sample. Furthermore, we also found consistent associations when considering asthma onset recorded in children from the independent and prospective Avon Longitudinal Study of Parents and Children (ALSPAC) birth cohort. Similar results were observed for the 26 sentinel variants identified in the multivariate GWAS (). Therefore, the 76 sentinel variants reported in our study show a consistent pattern of association with allergic disease age-of-onset in two analyses for which recall bias was not a major concern. Similarly, we found that phenotypic misclassification amongst individuals who reported late onset of allergic disease, if present, was unlikely to have significantly affected our main findings. In addition, the collection of information in adulthood is likely to have caused overrepresentation of SNPs involved in persistent disease and underrepresentation of association related to disease that remitted earlier in life (transient disease). Finally, we showed that in a subset of cases that provided data on two different occasions 4–7 years apart, age of onset of asthma was within 5 years in 86% of cases. However, we were not able to investigate this reliability for eczema and hayfever separately, since only a combined question was available. Furthermore, we acknowledge that only 3% of asthmatics in UKBB provided data on two different occasions. Thus, recall bias may have reduced our power, but not have resulted in spurious results. In conclusion, we show that novel risk loci for allergic disease can be identified by extending the analytical approach that we reported recently [22] to the analysis of age-of-onset of asthma, hay fever and eczema. GWAS of other complex diseases might also benefit from considering age-of-onset information. We found 76 specific genetic associations with allergic disease, of which 28 had not previously been reported. We implicate 81 genes as likely targets of the associated variants and provide further evidence that individuals with early disease onset have a greater burden of genetic risk factors for allergic disease than individuals with late disease onset.

Methods

Definition of the combined age-of-onset phenotype and allergic disease status

We created a single age-of-onset phenotype for individuals from the UK Biobank study [76] that considered information from asthma, hay fever and eczema. Age-of-onset for food allergy was not available in the UK Biobank study and so was not considered in our analysis. Specifically, we extracted information from two data fields included in the touchscreen questionnaire. Field 3786, which asked “What was your age when the asthma was first diagnosed?”, and field 3761, which asked “What was your age when the hayfever, rhinitis or eczema was first diagnosed?”. After excluding individuals who were not genotyped (absent from the array data), non-missing information was available for 50,109 and 98,161 individuals, respectively for fields 3786 and 3761. The combined age-of-onset phenotype corresponded to the earliest age reported across these two fields, which was obtained for 127,382 individuals. Of these, we restricted our analysis to individuals who were determined to suffer from at least one allergic disease (asthma and/or hay fever and/or eczema) based on the criteria described in detail previously [22]. Briefly, “asthma cases” were those with (i) both a report of “Asthma” in field 6152 (self-reported medical conditions, specifically the question “Has a doctor ever told you that you have had any of the following conditions?”) and a code for asthma in field 20002 (verbal interview), or alternatively, an ICD10 code for asthma in fields 41202 (Diagnoses–main ICD10) or 41204 (Diagnoses–secondary ICD10); and (ii) no report of COPD in fields 6152 or 20002, nor of other respiratory diseases in field 20002. “Hay fever/eczema cases” were those who answered “Hay fever, allergic rhinitis or eczema” in field 6152. Lastly, allergic disease cases were individuals who were classified as an “asthma case” and/or “hay fever/eczema case”, a total of 124,616 individuals. To identify individuals who suffered from a single allergic disease, we had to classify hay fever and eczema status separately, which could not be determined using field 6152 per se (“Has a doctor ever told you that you have had any of the following conditions?”). This is because of the seven possible answers to this question, a single item covered the two different diseases, specifically the answer “Hayfever, allergic rhinitis or eczema”. To identify individuals who reported suffering specifically from hay fever, we instead considered information reported in the verbal interview (field 20002) and ICD10 codes (fields 41202 and 41204), as described above for asthma. This information was available for a subset of individuals who answered “Hayfever, allergic rhinitis or eczema” in field 6152. We then used the exact same approach (i.e. information from fields 20002, 41202 and 41204) to identify “eczema cases”.Using this approach, we were able to classify hay fever and eczema status separately, in addition to asthma, for 52,114 individuals. Of these, 975 suffered from all three diseases, 8,172 from two diseases and 42,967 from a single disease. Of the latter, 23,375 reported suffering from asthma only; 15,445 from hay fever only; and 4,147 from eczema only.

Association analysis of the combined age-of-onset phenotype

We first performed multi-dimensional scaling (MDS) analysis of allele sharing to identify individuals who clustered closely to Europeans of the 1000 Genomes project, as described previously [22]. Of the 124,616 individuals with a phenotype available for analysis, 117,530 clustered within 5 standard deviations of the mean for the first and second MDS components estimated using individuals from the five European ancestry groups (CEU, GBR, FIN, IBS and TSI) of the 1000 Genomes project. As such, these individuals were considered to have European ancestry and were retained for analysis. We then excluded 400 individuals who: (i) had self-reported sex different from genetically-inferred sex; (ii) were outliers when considering genotype missing rates and/or genome-wide heterozygosity levels; (iii) had more than 10 third degree relatives or were excluded from kinship inference; and/or (iv) were not present in the imputed dataset released in July 2017. The first three criteria were assessed based on information included in the QC file ukb_sqc_v2.txt released by the UK Biobank study. After these exclusions, phenotype and genotype data were available for 117,130 individuals. Of these, 22,029 reported suffering from asthma only; 14,474 from hay fever only; and 3,939 from eczema only. To maximize power, we did not select a subset of unrelated individuals for analysis but instead tested variants for association with age-of-onset using the linear mixed model implemented in BOLT-LMM [77], which includes a genetic relationship matrix (GRM) as a random effect in the model. The GRM was estimated based on 577,110 array SNPs obtained after quality control filters, namely a minor allele frequency (MAF) >1%, call rate >95% and Hardy-Weinberg equilibrium P-value >10−6. Age-of-onset was quantile-normalized prior to analysis; gender and an indicator of the genotyping array used were included as discrete covariates. Of the 92 million variants with imputed data released by the UK Biobank, we analysed 7,647,814 variants that (i) had a MAF >1%; (ii) were imputed based on the Haplotype Reference Consortium panel; (iii) had matching alleles when compared to genotype data from the 1000 Genomes project; and (iv) had a unique reference sequence (rs) number and genomic position (based on hg19). We used a P-value threshold of 3x10-8 for genome-wide significance, as suggested for studies that analyse variants with a MAF >1% [78].

Identification of variants with a statistically independent association with the combined age-of-onset phenotype

We used the approximate joint association analysis option of GCTA [79] to identify variants that remained associated with age-of-onset at a P<3x10-8 after accounting for the effects of nearby (<10 Mb) more strongly associated variants. In this analysis, LD was estimated based on a subset of 5,000 unrelated allergic disease cases from the UK Biobank study.

Association analysis of age-of-onset in individuals suffering from a single allergic disease

To understand if variants discovered through the analysis of the combined age-of-onset phenotype were likely to influence the age-of-onset of the three individual diseases considered (asthma, hay fever and eczema), we performed association analyses in adult cases from the UK Biobank study who reported suffering from a single allergic disease, as described in detail previously [22]. Specifically, we tested the association between selected variants and age-of-onset separately in three non-overlapping groups of individuals who suffered from a single allergic disease: asthma only cases (n = 22,029), hay fever only cases (n = 14,474), and eczema only cases (n = 3,969). These sample sizes are smaller than indicated above (23,375, 15,445 and 4,147, respectively) because individuals of non-European ancestry were not included in the association analysis. For each SNP, we then compared the effect on age-of-onset (i.e. beta from the linear model) between individual diseases (i.e. asthma vs. hay fever, asthma vs. eczema and hay fever vs. eczema), using the formula z = sigma / SE_sigma, where sigma = beta_diseaseA–beta_diseaseB, and SE_sigma = sqrt(SE_beta_diseaseA^2 + SE_beta_diseaseB^2), which follows a normal distribution. We recently observed that many variants associated with the case-control status of allergic disease–defined by the presence of asthma, hay fever and/or eczema–were also associated with variation in age-of-onset amongst allergic disease cases [22]. We therefore reasoned that the reverse would also be likely: that many variants associated with variation in age-of-onset would be associated with disease liability, as suggested by Sarnowski et al. [29]. To test this, for each variant with a genome-wide significant association with age-of-onset in the analysis described above, we extracted association results from our recent GWAS of allergic disease [22]; after excluding overlapping samples (the UK Biobank study, n = 138,354), results were based on the analysis of 222,484 individuals, which included 137,883 cases with asthma and/or hay fever and/or eczema, and 84,601 disease-free controls. If a variant of interest was not directly tested in that GWAS, we extracted results for the most correlated proxy (with r2>0.8), if available. To understand the extent to which the same genetic variants contribute to variation in disease age-of-onset and disease liability, we applied LD-score regression [32] to the association summary statistics obtained in the GWAS of the combined age-of-onset phenotype carried out as part of this study (n = 117,130) and in the GWAS of a combined allergic disease case-control phenotype (n = 360,838) that we reported recently [22]. The genetic correlation between the two GWAS (which is not biased by sample overlap) was estimated based on 1.1 million HapMap 3 SNPs, all with a MAF>1%, as recommended previously [32]. To illustrate graphically the observed genetic correlation between disease liability and age-of-onset, we compared the age-of-onset between individuals with a high and low polygenic burden of allergic disease risk SNPs. Specifically, for each individual in the UK Biobank study, we calculated a polygenic risk score (PRS) as the weighted average of the number of disease-predisposing alleles across the 136 sentinel variants that we identified in our recent GWAS of allergic disease case-control status [22]. Weights for each SNP corresponded to the allelic effect (i.e. beta) reported in that GWAS [22]. We then restricted our analysis to individuals that reported suffering from a single disease, as described above: asthma only cases (n = 22,029), hay fever only cases (n = 14,474), and eczema only cases (n = 3,969). Within each of these case groups, we identified individuals with a PRS in the top 10% and bottom 10% for that group, and then compared the distribution of age-of-onset between the two PRS groups using descriptive statistics.

Multivariate GWAS of allergic disease age-of-onset and case-control status

If two phenotypes have a genetic correlation that is significantly different from 0, then multivariate association analysis could potentially increase the power to identify variants that contribute to the heritability of both phenotypes, when compared to the alternative of testing each phenotype separately. Whether or not increased power is obtained with a multivariate test depends, for example, on the statistical approach used, the magnitude of the overall phenotypic correlation between the two phenotypes, as well as the magnitude and direction of the effect of the shared variant on the two phenotypes [33,80,81]. To analyse the joint association between single SNPs and allergic disease age-of-onset and case-control status, we used the recently described metaUSAT approach [33], for three main reasons. First, this approach is applicable to summary statistics from GWAS with unknown sample overlap. Second, it accommodates summary statistics from a mix of continuous and binary traits. And third, by combining two classes of tests (MANOVA and sum squared score tests), metaUSAT provides increased power over alternative methods when a SNP affects all phenotypes analysed [33]. MetaUSAT was applied to the summary statistics of the combined age-of-onset phenotype carried out as part of this study (n = 117,130) and the GWAS of a combined allergic disease case-control phenotype (n = 360,838) that we reported recently [22]. Because we were interested in identifying SNPs that were genome-wide significant (i.e. with P<3x10-8) in the multivariate but not in the separate univariate analyses, we first used approximate conditional analyses [79] to adjust the results from each GWAS for the effects of variants that had a statistically independent association at P<3x10-8 in the respective study (which were identified by the joint association analysis described above). After this adjustment, no single SNP had a P<3x10-8 in each of the two individual phenotype GWAS as expected. MetaUSAT was then applied to the adjusted GWAS; the Pearson correlation coefficient between z-scores of the two phenotypes was estimated based on SNPs not associated with either trait (i.e. with a P>0.05 for both). A new multivariate test of association with similar properties to metaUSAT (e.g. applicable to summary statistics of GWAS with sample overlap), was reported just prior to the submission of this manuscript for publication–the MTAG approach [34]. We therefore tested if the multivariate associations discovered with metaUSAT were also supported by results from this different approach. For each SNP, instead of a multivariate P-value, MTAG returns an effect estimate (beta, SE and P-value) for each phenotype that incorporates information contained in the GWAS of the other phenotype. This increases the effective sample size of each analysis and so it improves the power to detect associations with variants that are shared between the two phenotypes [34]. To determine if a sentinel variant was in LD with a SNP previously reported to associate with any allergic disease, we (i) identified all SNPs in LD (r2>0.05) with that sentinel variant, using genotype data from individuals of European descent from the 1000 Genomes Project [82] (n = 294, release 20130502_v5a); and (ii) determined if the sentinel variant or any of the correlated SNPs identified were reported to associate with any allergic disease (asthma, hay fever, eczema, food allergy or atopy) in the NHGRI-EBI GWAS catalog database [83], which was downloaded on the 10th of January 2018. This catalogue was supplemented with recent studies not included at the time of submission of this manuscript. [2, 31, 84].

Predicting target genes of sentinel variants based on LD with eQTL and non-synonymous SNPs

We performed the following steps to identify genes for which variation in gene expression and/or protein sequence was associated with sentinel SNPs identified in the age-of-onset and multivariate GWAS. First, we identified single nucleotide polymorphisms (SNPs) associated with variation in gene expression (i.e. expression quantitative trait loci (eQTL)) in published transcriptome studies of five broad tissue types relevant for allergic disease: individual immune cell types, lung, skin, spleen and whole-blood. We identified a total of 43 transcriptome studies reporting results from eQTL analyses in any one of those five tissue types (). Some studies included multiple cell types, experimental conditions and/or eQTL types, resulting in a total of 101 separate eQTL datasets. For each eQTL dataset, we then (i) downloaded the original publication tables/files containing results for the eQTL reported; (ii) extracted the SNP identifier, gene name, association P-value and directional effect (if available; beta/z-score and effect allele); (iii) excluded eQTL located >1 Mb of the respective gene (i.e. trans eQTL), because often these are thought to be mediated by cis effects [85]; (iv) excluded eQTL with an association P>8.9x10-10, a conservative threshold that corrects for 55,765 genes (based on GENCODE v19), each tested for association with 1,000 SNPs (as suggested by others [86-88]); and (v) for each gene, used the—clump procedure in PLINK [89] to reduce the list of eQTL identified (which often included many correlated SNPs) to a set of ‘sentinel eQTL’, defined as the SNPs with strongest association with gene expression and in low LD (r2<0.05, linkage disequilibrium (LD) window of 2 Mb) with each other. Second, we identified genes for which a sentinel eQTL reported in any of the 101 eQTL datasets described above was in high LD (r2>0.8) with a sentinel variant identified in the age-of-onset or multivariate GWAS. That is, we only considered genes for which there was high LD between a sentinel eQTL and a sentinel allergic disease variant, which reduces the chance of spurious co-localization. Third, we used wANNOVAR [90] to identify genes containing non-synonymous SNPs amongst all variants in LD (r2>0.8) with any sentinel variants. SNPs in LD with sentinel variants were identified using genotype data from individuals of European descent from the 1000 Genomes Project [83] (n = 294, release 20130502_v5a).

Association between sentinel variants and the risk of food allergy

Age-of-onset of food allergy was not available in the UK Biobank study and so it was not considered in our combined allergic disease phenotype. To partly address this limitation, we determined if variants identified in the age-of-onset or multivariate analysis also contribute to food allergy risk. Specifically, we extracted association results for selected variants from a GWAS of food allergy conducted in the GOFA cohort that we published recently [15], which included 497 children with a positive oral food challenge and 2,387 controls (the genomic inflation factor [λ] for this GWAS was 1.03). All cases were recruited at the time of diagnosis (mean age 2.1; 84% under the age of 4). The most common food allergy was observed with hen’s egg (58%), followed by peanut (44%) and cow’s milk (34%). To study the association between each variant and food allergy in adults, we extracted association results from a GWAS of food allergy conducted in the GERA cohort, which we also reported recently [16], including 1,104 adults with an ICD9 (i.e. hospital admission) code for food allergy and 23,945 controls. The genomic inflation factor for this GWAS was 1.01. Most cases were admitted to hospital because of an allergic reaction to fish/shellfish (58%), peanuts (16%), eggs (13%) or milk (8%). This study was approved by the Human Ethics Committee of the QIMR Berghofer Medical Research Institute. Approval of individual studies has been reported in Ferreira et al. [22].

Potential impact of recall bias on SNP associations with age-of-onset of allergic disease (page 4–6) Potential impact on SNP associations of phenotypic misclassification amongst individuals reporting late onset disease (page 7–9).

Ten genes that are predicted targets of novel allergic disease variants and that have a known function that is directly relevant to disease pathophysiology (page 10–15). Study Acknowledgments (page 16–19). References (page 20–23). (DOC) Click here for additional data file.

Distribution of allergic disease age-of-onset in UK Biobank participants (n = 117,130) who reported suffering from asthma and/or hay fever/eczema.

(DOCX) Click here for additional data file.

Distribution of the observed and expected association P-values for the GWAS of allergic disease age-of-onset in the UK Biobank study (n = 117,130).

(DOCX) Click here for additional data file.

Reliability of self-reported age-of-onset of asthma based on information provided by 1,650 UK Biobank participants at two time points.

(DOCX) Click here for additional data file.

Association between sentinel SNPs identified in the age-of-onset (Panel A) or multivariate GWAS (Panel B) and asthma age-of-onset in the subset of UK Biobank individuals who reported developing asthma as a child.

(DOCX) Click here for additional data file.

Association between sentinel SNPs identified in the age-of-onset (Panel A) or multivariate GWAS (Panel B) and hay fever age-of-onset in the subset of UK Biobank individuals who reported developing hay fever as a child.

(DOCX) Click here for additional data file.

Association between sentinel SNPs identified in the age-of-onset (Panel A) or multivariate GWAS (Panel B) and time to asthma onset in children of the ALSPAC study.

(DOCX) Click here for additional data file.

Summary of results from the GWAS of allergic disease age-of-onset in the UK Biobank study (n = 117,130), after adjusting single-SNP results for the effects of independently associated variants (i.e. with P<3x10-8 in the joint association analysis performed with GCTA.

(DOCX) Click here for additional data file.

Summary of results from the GWAS of allergic disease case-control status (n = 360,838) after adjusting single-SNP results for the effects of independently associated variants (i.e. with P<3x10-8 in the joint association analysis performed with GCTA).

(DOCX) Click here for additional data file.

Distribution of the observed and expected association P values for the GWAS of allergic disease age-of-onset in the UK Biobank study (n = 117,130), after adjusting single-SNP results for the effects of independently associated variants (i.e. with P<3x10-8 in the joint association analysis performed with GCTA.

(DOCX) Click here for additional data file.

Distribution of the observed and expected association P values for the GWAS of allergic disease case-control status (n = 360,838) after adjusting single-SNP results for the effects of independently associated variants (i.e. with P<3x10-8 in the joint association analysis performed with GCTA.

(DOCX) Click here for additional data file.

Distribution of the observed and expected association P values obtained in the multivariate analysis of the GWAS of allergic disease risk (n = 360,838) and GWAS of allergic disease age-of-onset (n = 117,130).

(DOCX) Click here for additional data file.

Association between sentinel SNPs identified in the age-of-onset (Panel A) or multivariate GWAS (Panel B) and food allergy in children.

(DOCX) Click here for additional data file.

Genome-wide association studies of gene expression levels queried to identify expression quantitative trait loci (i.e. eQTL).

(XLS) Click here for additional data file.

Descriptive statistics for the 117,130 allergic disease cases from the UK Biobank included in this study.

(XLS) Click here for additional data file.

Results from joint association analysis performed with GCTA 79.

(XLS) Click here for additional data file.

Variants associated with allergic disease age-of-onset at a P<3x10-8 in the joint association analysis but not in the original analysis.

(XLS) Click here for additional data file.

Association with allergic disease age-of-onset for variants previously reported in the literature.

(XLS) Click here for additional data file.

Association between the 50 sentinel variants and age-of-onset of asthma, hay fever and eczema separately.

(XLS) Click here for additional data file.

Sentinel SNPs in LD (r2>0.05) with variants previously reported to associate with at least one allergic disease.

(XLS) Click here for additional data file.

Results from the MTAG multivariate approach for the sentinel variants identified in the metaUSAT multivariate analysis of allergic-disease age-of-onset and allergic disease case-control status.

(XLS) Click here for additional data file.

Sentinel eQTL in LD (r2>0.5) with sentinel variants associated with allergic disease age-of-onset.

(XLS) Click here for additional data file.

Sentinel eQTL in LD (r2>0.8) with sentinel variants jointly associated with allergic disease age-of-onset and allergic-disease risk.

(XLS) Click here for additional data file.

Non-synonymous SNPs in LD (r2>0.8) with sentinel variants associated with allergic disease.

(XLS) Click here for additional data file.

Two independent associations with self-reported food allergy case-control status at a P<3x10-8.

(XLS) Click here for additional data file.

Association between the 76 sentinel SNPs and self-reported food allergy case-control status.

(XLS) Click here for additional data file.

Results of replication study of age-of-onset GWAS in ALSPAC.

(XLS) Click here for additional data file.
Table 1

Variants independently associated with allergic disease age-of-onset at a P<3x10-8.

ChrBpSentinel SNPGene contextAssociation with allergic disease age-of-onset (n = 117,130)Novel association for allergic disease
Effect alleleFreq#BetaSEP-value
1152029548rs115045402S100A11-[]-TCHHL1A0.028-0.2020.0141.7e-47No
1153051661rs184587444[SPRR2A]T0.020-0.1910.0173.8e-28Yes
1155142927rs4971089[KRTCAP2]G0.488-0.0240.0041.6e-08Yes
1172715702rs78037977FASLG-[]—TNFSF18A0.879-0.0440.0061.1e-11No
1173141960rs7521390TNFSF18—[]-TNFSF4C0.296-0.0310.0052.2e-11No
28451701rs13398375[LINC00299]T0.716-0.0290.0057.5e-10No
228644670rs7559046FOSL2-[]-PLB1C0.537-0.0310.0043.1e-13No
2228670437rs10187276SLC19A3-[]-CCL20T0.256-0.0340.0054.6e-13No
2242698640rs34290285[D2HGDH]G0.757-0.0360.0052.6e-13No
3188132110rs6780858[LPP]G0.524-0.0310.0041.3e-13No
438792340rs6531663TLR10-[]-TLR1T0.808-0.0670.0052.6e-38No
4103515055rs4648052[NFKB1]G0.633-0.0250.0047.0e-09No
4123141070rs45613035[KIAA1109]C0.104-0.0380.0072.2e-08No
4123403008rs45610037IL2-[]—IL21A0.227-0.0420.0051.5e-17No
5110164674rs7728612SLC25A46-[]—TSLPT0.167-0.0450.0069.9e-16No
5110470137rs6594499WDR36-[]-CAMK4C0.530-0.0290.0044.5e-12No
5132028858rs4705962[KIF3A]T0.234-0.0310.0052.4e-10No
631323012rs2854001[HLA-B]A0.237-0.0350.0054.5e-13No
632626015rs6905282HLA-DQA1-[]-HLA-DQB1A0.466-0.0420.0041.1e-23No
633033710rs73739621[HLA-DPA1]C0.096-0.0510.0083.7e-11No
750325815rs2085423C7orf72—[]-IKZF1A0.248-0.0350.0054.1e-13No
9136155000rs635634ABO-[]-SURF6T0.189-0.0310.0051.1e-08No
10104285594rs12572775[SUFU]A0.447-0.0330.0047.6e-15No
1165559266rs10791824[OVOL1]G0.575-0.0350.0041.8e-16No
1176299431rs55646091WNT11—[]-LRRC32A0.056-0.0890.0094.7e-22No
11118746769rs4938576DDX6-[]-CXCR5G0.592-0.0280.0042.1e-11No
11128161142rs61907712KIRREL3-AS3—[]—ETS1C0.813-0.0340.0051.8e-10No
1256384804rs705699[RAB5B]A0.432-0.0250.0043.6e-09No
1257493727rs3024971[STAT6]T0.900-0.0440.0073.3e-10No
12111973358rs597808[ATXN2]G0.523-0.0330.0041.9e-15No
12121202664rs9431[SPPL3]A0.497-0.0280.0042.4e-11No
1343034968rs1853573AKAP11—[]—TNFSF11G0.465-0.0290.0044.2e-12No
1468760527rs7140939[RAD51B]A0.403-0.0270.0041.9e-10No
14103256961rs56101042[TRAF3]A0.820-0.0360.0052.6e-11No
1561069988rs11071559[RORA]C0.877-0.0420.0065.3e-11No
1567455630rs56062135[SMAD3]T0.247-0.0290.0052.7e-09No
1590936225rs2601191[IQGAP1]T0.473-0.0280.0041.9e-11No
1611229589rs2041733[CLEC16A]T0.447-0.0310.0042.2e-13No
1738756969rs7216890CCR7-[]-SMARCE1T0.654-0.0330.0047.2e-14No
1851780408rs3017289MBD2-[]-POLIC0.289-0.0290.0052.4e-10No
1860009814rs4574025[TNFRSF11A]T0.540-0.0320.0042.7e-14No
1861442619rs12964116[SERPINB7]G0.037-0.1080.0116.2e-23No
198785744rs2918302ADAMTS10—[]-ACTL9A0.156-0.0380.0065.4e-11No
2045689783rs4809619[EYA2]G0.752-0.0290.0053.7e-09Yes
1152179152rs12123821RPTN-[]-HRNRT0.051-0.1380.0092.9e-48No
1152285861rs61816761[FLG]A0.026-0.2660.0142.8e-82No
2102928617rs72823628[IL18R1]G0.874-0.0760.0062.4e-33No
96213468rs7848215RANBP6—[]-IL33T0.273-0.0290.0057.5e-10No
1176295598rs11236791WNT11—[]-LRRC32A0.469-0.0460.0043.8e-28No
1738067020rs4795400[GSDMB]C0.541-0.0500.0046.0e-33No

# Frequency of effect allele in the allergic disease cases studied.

Table 2

Association between sentinel age-of-onset variants and allergic disease risk in an independent sample of 222,484 individuals studied by Ferreira et al. [22].

ChrBpSentinel SNPGene contextAssociation with allergic disease risk in Ferreira et al. 2017 (n = 222,484)*
Effect alleleOdds ratioSEP-value
1152029548rs115045402S100A11-[]-TCHHL1NANANANA
1152179152rs12123821RPTN-[]-HRNRT1.0910.0192.40E-06
1152285861rs61816761[FLG]A1.260.0368.90E-11
1153051661rs184587444[SPRR2A]NANANANA
1155142927rs4971089[KRTCAP2]G1.0060.0073.50E-01
1172715702rs78037977FASLG-[]—TNFSF18A1.0510.0116.70E-06
1173141960rs7521390TNFSF18—[]-TNFSF4C1.0470.0087.90E-10
28451701rs13398375[LINC00299]T1.0630.0081.20E-15
228644670rs7559046*FOSL2-[]-PLB1C1.0340.0073.50E-06
2102928617rs72823628[IL18R1]G1.1230.013.50E-32
2228670437rs10187276SLC19A3-[]-CCL20T1.0340.0085.70E-05
2242698640rs34290285[D2HGDH]G1.080.0112.00E-13
3188132110rs6780858[LPP]G1.0360.0075.00E-07
438792340rs6531663TLR10-[]-TLR1T1.0880.0083.30E-26
4103515055rs4648052[NFKB1]G1.0360.0078.70E-07
4123141070rs45613035[KIAA1109]C1.050.0139.50E-05
4123403008rs45610037IL2-[]—IL21A1.0690.0086.20E-16
5110164674rs7728612SLC25A46-[]—TSLPT1.0660.0091.80E-12
5110470137rs6594499WDR36-[]-CAMK4C1.0730.0071.70E-24
5132028858rs4705962[KIF3A]T1.0470.0082.30E-08
631323012rs2854001[HLA-B]A1.0610.0095.30E-11
632626015rs6905282HLA-DQA1-[]-HLA-DQB1A1.0630.0073.70E-17
633033710rs73739621[HLA-DPA1]C1.060.0131.10E-05
750325815rs2085423C7orf72—[]-IKZF1A1.0170.0083.60E-02
96213468rs7848215RANBP6—[]-IL33T1.070.0083.80E-18
9136155000rs635634ABO-[]-SURF6T1.0390.0099.60E-06
10104285594rs12572775[SUFU]A1.0150.0072.60E-02
1165559266rs10791824[OVOL1]G1.0330.0076.30E-06
1176295598rs11236791WNT11—[]-LRRC32A1.0880.0073.70E-34
1176299431rs55646091WNT11—[]-LRRC32A1.1880.0181.10E-22
11118746769rs4938576DDX6-[]-CXCR5G1.0370.0072.30E-07
11128161142rs61907712KIRREL3-AS3—[]—ETS1C1.0410.0095.60E-06
1256384804rs705699[RAB5B]A1.0390.0076.80E-08
1257493727rs3024971[STAT6]T1.0830.0124.00E-12
12111973358rs597808[ATXN2]G1.0290.0073.50E-05
12121202664rs9431[SPPL3]A1.0280.0074.50E-05
1343034968rs1853573AKAP11—[]—TNFSF11G1.0170.0071.20E-02
1468760527rs7140939[RAD51B]A1.0330.0081.60E-05
14103256961rs56101042[TRAF3]A1.0260.0092.90E-03
1561069988rs11071559[RORA]C1.0550.015.30E-08
1567455630rs56062135[SMAD3]T1.0620.0081.70E-13
1590936225rs2601191[IQGAP1]T1.0230.0071.40E-03
1611229589rs2041733[CLEC16A]T1.050.0071.10E-12
1738067020rs4795400[GSDMB]C1.0660.0075.70E-21
1738756969rs7216890CCR7-[]-SMARCE1T1.0260.0072.00E-04
1851780408rs3017289MBD2-[]-POLIC1.0130.0087.40E-02
1860009814rs4574025[TNFRSF11A]T1.0280.0074.80E-05
1861442619rs12964116[SERPINB7]G1.040.0216.00E-02
198785744rs2918302ADAMTS10—[]-ACTL9A1.0230.011.50E-02
2045689783rs4809619[EYA2]G1.0130.0089.70E-02

# Association results from 12 of the 13 individual studies reported in the Ferreira et al. 2017 allergic disease GWAS were included in this analysis (all except UK Biobank). Results from the individual studies were adjusted for the respective study-specific LD-score intercept and then combined using a fixed-effects meta-analysis, as described previously [22]. The LD-score intercept of this 12-study meta-analysis was 1.018 (attenuation ratio of 0.0717)

* rs7559046 was not directly tested in the Ferreira et al. 2017 GWAS, and so we used a proxy instead (rs6547850, r = 0.93).

Table 3

Variants jointly associated with allergic disease age-of-onset and allergic disease risk (multivariate P<3x10-8).

ChrBpSentinel SNPGene ContextmetaUSAT P-valueEffect allele and frequencyAssociation with allergic disease age-of-onset (n = 117,130)Association with allergic disease risk in Ferreira et al. 2017 (n = 360,838)Novel
BetaSEPBetaSEP
12510755rs10910095TNFRSF14-[]-FAM213B1.62e-08G0.868-0.0170.0066.0e-030.0410.0074.3e-08No
1212864992rs12068304[BATF3]2.32e-10G0.166-0.0260.0063.9e-060.0350.0075.8e-07No
230846848rs7565907[LCLAT1]6.66e-09T0.609-0.0170.0045.5e-050.0250.0051.4e-06Yes
237137123rs112844988[STRN]2.48e-08G0.368-0.0200.0042.2e-060.020.0051.4e-04Yes
2112268732rs143326447BCL2L11—[]—ANAPC18.22e-11C0.123-0.0300.0063.2e-060.0430.0082.3e-07No
2203487023rs72926957BMPR2-[]-FAM117B1.01e-09G0.706-0.0230.0054.7e-070.0230.0062.3e-05Yes
333047662rs35570272[GLB1]1.28e-08T0.404-0.0230.0048.2e-080.0160.0052.3e-03No
356605990rs6778373[CCDC66]1.51e-09A0.535-0.0170.0047.5e-050.0260.0052.2e-07Yes
3112643560rs9870568[CD200R1]4.35e-09C0.474-0.0160.0041.7e-040.0260.0053.1e-07Yes
5131952222rs6596086[RAD50]1.9e-08C0.200-0.0180.0057.5e-040.0320.0063.4e-07No
6209159rs11242709[]-DUSP221.11e-09T0.210-0.0290.0054.3e-080.0240.0073.0e-04Yes
626186200rs9379832HIST1H2BE-[]-HIST1H4D2.15e-08G0.26-0.0190.0056.5e-050.0270.0064.2e-06Yes
632624874rs28483633HLA-DQA1-[]-HLA-DQB12.77e-08C0.838-0.0350.0071.5e-070.0560.0192.8e-03No
6149909491rs9322188[GINM1]2.99e-08T0.313-0.0150.0057.3e-040.0280.0065.7e-07Yes
106625378rs2255088[PRKCQ-AS1]2.35e-10C0.335-0.0210.0041.5e-060.0270.0061.5e-06No
1250336638rs416959LOC283332-[]-AQP22.21e-09T0.824-0.0260.0063.4e-060.0300.0077.2e-06Yes
1650745926rs2066844[NOD2]1.13e-09T0.049-0.0430.0108.2e-060.0590.0121.5e-06No
1650885211rs8056255CYLD-[]—SALL13.26e-09A0.033-0.0470.0125.5e-050.0710.0146.8e-07Yes
1717624349rs77904527[RAI1]2.17e-09C0.147-0.0310.0062.5e-070.0280.0071.0e-04Yes
1740741013rs12951632[FAM134C]1e-09T0.719-0.0260.0053.4e-080.0200.0063.4e-04No
1776244926rs72901762TMEM235-[]-LOC1009962911.89e-08A0.711-0.0190.0053.5e-050.0260.0066.7e-06Yes
1848558415rs35014537[SMAD4]4.34e-09G0.384-0.0180.0043.1e-050.0250.0051.6e-06No
191170445rs4807630[SBNO2]1.89e-08T0.312-0.0200.0051.5e-050.0260.0061.6e-05No
2052258875rs4811448ZNF217-[]—SUMO1P12.6e-08C0.293-0.0170.0052.9e-040.0280.0061.2e-06Yes
2221939675rs5754217[UBE2L3]2.93e-08T0.196-0.0150.0053.8e-030.0340.0061.2e-07Yes
2237319947rs4437064[CSF2RB]2.44e-08G0.529-0.0090.0043.2e-020.0280.0053.0e-08Yes
Table 4

Genes with a sentinel eQTL (in italic) or non-synonymous SNP (in bold) in LD (r>0.8) with sentinel variants identified in the age-of-onset or multivariate GWAS.

ChrBpSentinel SNPGene contextLD between sentinel eQTL/non-synonymous SNP and sentinel GWAS SNP
r2 > 0.950.80 < r2 ≤ 0.95
Sentinel variants identified in the GWAS of age-of-onset
1152285861rs61816761[FLG]FLG-
1155142927rs4971089[KRTCAP2]-ADAM15, EFNA1
1173141960rs7521390TNFSF18—[]-TNFSF4TNFSF4-
228644670rs7559046FOSL2-[]-PLB1-FOSL2,RP11-373D23.3
2102928617rs72823628[IL18R1]MFSD9IL18RAP,IL1RL1
2228670437rs10187276SLC19A3-[]-CCL20CCL20-
3188132110rs6780858[LPP]-BCL6
438792340rs6531663TLR10-[]-TLR1TLR1-
4103515055rs4648052[NFKB1]-NFKB1
4123403008rs45610037IL2-[]—IL21KIAA1109-
5110470137rs6594499WDR36-[]-CAMK4-CAMK4,CTC-551A13.2,TSLP,WDR36
5132028858rs4705962[KIF3A]-KIF3A
631323012rs2854001[HLA-B]-HLA-C
633033710rs73739621[HLA-DPA1]-HLA-DPA1,HLA-DPB1,HLA-DQB1,TAPBP
10104285594rs12572775[SUFU]-ACTR1A,C10orf32,SUFU,TMEM180,TRIM8
1165559266rs10791824[OVOL1]EFEMP2,OVOL1,SNX32-
1256384804rs705699[RAB5B]RPS26,SUOX-
1257493727rs3024971[STAT6]NAB2,STAT6-
12111884608rs597808[ATXN2]SH2B3-
12121202664rs9431[SPPL3]SPPL3OASL
1343034968rs1853573AKAP11—[]—TNFSF11TNFSF11-
1561069988rs11071559[RORA]-RP11-554D20.1
1567455630rs56062135[SMAD3]AAGAB-
1590936225rs2601191[IQGAP1]-IQGAP1
1738067020rs4795400[GSDMB]GSDMBGSDMB,IKZF3,ORMDL3,ZPBP2
1738756969rs7216890CCR7-[]-SMARCE1SMARCE1-
1860009814rs4574025[TNFRSF11A]PIGN-
1861442619rs12964116[SERPINB7]SERPINB7-
Sentinel variants identified in the multivariate GWAS of age-of-onset and case-control status
12510755rs10910095TNFRSF14-[]-FAM213BTNFRSF14-
230846848rs7565907[LCLAT1]-LCLAT1
237137123rs112844988[STRN]STRN-
2203487023rs72926957BMPR2-[]-FAM117BBMPR2,FAM117B-
356605990rs6778373[CCDC66]CCDC66,FAM208A-
3112643560rs9870568[CD200R1]CD200R1-
5131952222rs6596086[RAD50]SLC22A5-
626186200rs9379832HIST1H2BE-[]-HIST1H4D-HIST1H2BE
632624874rs28483633HLA-DQA1-[]-HLA-DQB1HLA-DQA1,HLA-DQB1,HLA-DQB2HLA-DQA1
6149909491rs9322188[GINM1]PCMT1GINM1,LATS1,NUP43
106625378rs2255088[PRKCQ-AS1]PRKCQ-
1650745926rs2066844[NOD2]NOD2-
1650885211rs8056255CYLD-[]—SALL1-NOD2
1740741013rs12951632[FAM134C]FAM134C,MLX,PSMC3IPBECN1
1776244926rs72901762TMEM235-[]-LOC100996291RP11-219G17.4THA1P
1848558415rs35014537[SMAD4]RP11-729L2.2,SMAD4-
191170445rs4807630[SBNO2]-ABCA7
2221939675rs5754217[UBE2L3]CCDC116,UBE2L3YDJC
  90 in total

1.  A genome-wide association study to identify genetic determinants of atopy in subjects from the United Kingdom.

Authors:  Yize I Wan; David P Strachan; David M Evans; John Henderson; Tricia McKeever; John W Holloway; Ian P Hall; Ian Sayers
Journal:  J Allergy Clin Immunol       Date:  2010-11-20       Impact factor: 10.793

2.  Smad4 deficiency in T cells leads to the Th17-associated development of premalignant gastroduodenal lesions in mice.

Authors:  Jennifer Nancy Hahn; Vincent George Falck; Frank Robert Jirik
Journal:  J Clin Invest       Date:  2011-09-01       Impact factor: 14.808

Review 3.  Regulation of inflammasomes by ubiquitination.

Authors:  Joseph S Bednash; Rama K Mallampalli
Journal:  Cell Mol Immunol       Date:  2016-04-11       Impact factor: 11.530

4.  Differential regulation of Th2 and Th1 lung inflammatory responses by protein kinase C theta.

Authors:  Shahram Salek-Ardakani; Takanori So; Beth S Halteman; Amnon Altman; Michael Croft
Journal:  J Immunol       Date:  2004-11-15       Impact factor: 5.422

5.  A genome-wide association study of atopic dermatitis identifies loci with overlapping effects on asthma and psoriasis.

Authors:  Stephan Weidinger; Saffron A G Willis-Owen; Yoichiro Kamatani; Hansjörg Baurecht; Nilesh Morar; Liming Liang; Pauline Edser; Teresa Street; Elke Rodriguez; Grainne M O'Regan; Paula Beattie; Regina Fölster-Holst; Andre Franke; Natalija Novak; Caoimhe M Fahy; Mårten C G Winge; Michael Kabesch; Thomas Illig; Simon Heath; Cilla Söderhäll; Erik Melén; Göran Pershagen; Juha Kere; Maria Bradley; Agne Lieden; Magnus Nordenskjold; John I Harper; W H Irwin McLean; Sara J Brown; William O C Cookson; G Mark Lathrop; Alan D Irvine; Miriam F Moffatt
Journal:  Hum Mol Genet       Date:  2013-07-25       Impact factor: 6.150

6.  Terminal epidermal differentiation is regulated by the interaction of Fra-2/AP-1 with Ezh2 and ERK1/2.

Authors:  Stefanie Wurm; Jisheng Zhang; Juan Guinea-Viniegra; Fernando García; Javier Muñoz; Latifa Bakiri; Elena Ezhkova; Erwin F Wagner
Journal:  Genes Dev       Date:  2014-12-29       Impact factor: 11.361

7.  Mediation analysis demonstrates that trans-eQTLs are often explained by cis-mediation: a genome-wide analysis among 1,800 South Asians.

Authors:  Brandon L Pierce; Lin Tong; Lin S Chen; Ronald Rahaman; Maria Argos; Farzana Jasmine; Shantanu Roy; Rachelle Paul-Brutus; Harm-Jan Westra; Lude Franke; Tonu Esko; Rakibuz Zaman; Tariqul Islam; Mahfuzar Rahman; John A Baron; Muhammad G Kibriya; Habibul Ahsan
Journal:  PLoS Genet       Date:  2014-12-04       Impact factor: 5.917

8.  metaCCA: summary statistics-based multivariate meta-analysis of genome-wide association studies using canonical correlation analysis.

Authors:  Anna Cichonska; Juho Rousu; Pekka Marttinen; Antti J Kangas; Pasi Soininen; Terho Lehtimäki; Olli T Raitakari; Marjo-Riitta Järvelin; Veikko Salomaa; Mika Ala-Korpela; Samuli Ripatti; Matti Pirinen
Journal:  Bioinformatics       Date:  2016-02-19       Impact factor: 6.937

9.  Meta-analysis of genome-wide association studies identifies ten loci influencing allergic sensitization.

Authors:  Klaus Bønnelykke; Melanie C Matheson; Tune H Pers; Raquel Granell; David P Strachan; Alexessander Couto Alves; Allan Linneberg; John A Curtin; Nicole M Warrington; Marie Standl; Marjan Kerkhof; Ingileif Jonsdottir; Blazenka K Bukvic; Marika Kaakinen; Patrick Sleimann; Gudmar Thorleifsson; Unnur Thorsteinsdottir; Katharina Schramm; Svetlana Baltic; Eskil Kreiner-Møller; Angela Simpson; Beate St Pourcain; Lachlan Coin; Jennie Hui; Eugene H Walters; Carla M T Tiesler; David L Duffy; Graham Jones; Susan M Ring; Wendy L McArdle; Loren Price; Colin F Robertson; Juha Pekkanen; Clara S Tang; Elisabeth Thiering; Grant W Montgomery; Anna-Liisa Hartikainen; Shyamali C Dharmage; Lise L Husemoen; Christian Herder; John P Kemp; Paul Elliot; Alan James; Melanie Waldenberger; Michael J Abramson; Benjamin P Fairfax; Julian C Knight; Ramneek Gupta; Philip J Thompson; Patrick Holt; Peter Sly; Joel N Hirschhorn; Mario Blekic; Stephan Weidinger; Hakon Hakonarsson; Kari Stefansson; Joachim Heinrich; Dirkje S Postma; Adnan Custovic; Craig E Pennell; Marjo-Riitta Jarvelin; Gerard H Koppelman; Nicholas Timpson; Manuel A Ferreira; Hans Bisgaard; A John Henderson
Journal:  Nat Genet       Date:  2013-06-30       Impact factor: 38.330

10.  Integrated genome-wide association, coexpression network, and expression single nucleotide polymorphism analysis identifies novel pathway in allergic rhinitis.

Authors:  Supinda Bunyavanich; Eric E Schadt; Blanca E Himes; Jessica Lasky-Su; Weiliang Qiu; Ross Lazarus; John P Ziniti; Ariella Cohain; Michael Linderman; Dara G Torgerson; Celeste S Eng; Maria Pino-Yanes; Badri Padhukasahasram; James J Yang; Rasika A Mathias; Terri H Beaty; Xingnan Li; Penelope Graves; Isabelle Romieu; Blanca del Rio Navarro; M Towhid Salam; Hita Vora; Dan L Nicolae; Carole Ober; Fernando D Martinez; Eugene R Bleecker; Deborah A Meyers; W James Gauderman; Frank Gilliland; Esteban G Burchard; Kathleen C Barnes; L Keoki Williams; Stephanie J London; Bin Zhang; Benjamin A Raby; Scott T Weiss
Journal:  BMC Med Genomics       Date:  2014-08-02       Impact factor: 3.063

View more
  3 in total

1.  Accounting for age of onset and family history improves power in genome-wide association studies.

Authors:  Emil M Pedersen; Esben Agerbo; Oleguer Plana-Ripoll; Jakob Grove; Julie W Dreier; Katherine L Musliner; Marie Bækvad-Hansen; Georgios Athanasiadis; Andrew Schork; Jonas Bybjerg-Grauholm; David M Hougaard; Thomas Werge; Merete Nordentoft; Ole Mors; Søren Dalsgaard; Jakob Christensen; Anders D Børglum; Preben B Mortensen; John J McGrath; Florian Privé; Bjarni J Vilhjálmsson
Journal:  Am J Hum Genet       Date:  2022-02-08       Impact factor: 11.025

2.  NOD2 Signaling Circuitry during Allergen Sensitization Does Not Worsen Experimental Neutrophilic Asthma but Promotes a Th2/Th17 Profile in Asthma Patients but Not Healthy Subjects.

Authors:  Mélodie Bouté; Saliha Ait Yahia; Ying Fan; Daniel Alvarez-Simon; Han Vorng; Joanne Balsamelli; Julie Nanou; Patricia de Nadai; Cécile Chenivesse; Anne Tsicopoulos
Journal:  Int J Mol Sci       Date:  2022-10-06       Impact factor: 6.208

3.  Understanding How Asthma Starts: Longitudinal Patterns of Wheeze and the Chromosome 17q Locus.

Authors:  Gerard H Koppelman; Elin T G Kersten
Journal:  Am J Respir Crit Care Med       Date:  2021-04-01       Impact factor: 21.405

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.