| Literature DB >> 25710614 |
Xiumei Hong1, Ke Hao2, Christine Ladd-Acosta3, Kasper D Hansen4, Hui-Ju Tsai5, Xin Liu6, Xin Xu7, Timothy A Thornton8, Deanna Caruso1, Corinne A Keet9, Yifei Sun10, Guoying Wang1, Wei Luo11, Rajesh Kumar12, Ramsay Fuleihan12, Anne Marie Singh13, Jennifer S Kim14, Rachel E Story15, Ruchi S Gupta16, Peisong Gao17, Zhu Chen1, Sheila O Walker1, Tami R Bartell16, Terri H Beaty3, M Daniele Fallin18, Robert Schleimer19, Patrick G Holt20, Kari Christine Nadeau21, Robert A Wood22, Jacqueline A Pongracic12, Daniel E Weeks23, Xiaobin Wang24.
Abstract
Food allergy (FA) affects 2%-10% of US children and is a growing clinical and public health problem. Here we conduct the first genome-wide association study of well-defined FA, including specific subtypes (peanut, milk and egg) in 2,759 US participants (1,315 children and 1,444 parents) from the Chicago Food Allergy Study, and identify peanut allergy (PA)-specific loci in the HLA-DR and -DQ gene region at 6p21.32, tagged by rs7192 (P=5.5 × 10(-8)) and rs9275596 (P=6.8 × 10(-10)), in 2,197 participants of European ancestry. We replicate these associations in an independent sample of European ancestry. These associations are further supported by meta-analyses across the discovery and replication samples. Both single-nucleotide polymorphisms (SNPs) are associated with differential DNA methylation levels at multiple CpG sites (P<5 × 10(-8)), and differential DNA methylation of the HLA-DQB1 and HLA-DRB1 genes partially mediate the identified SNP-PA associations. This study suggests that the HLA-DR and -DQ gene region probably poses significant genetic risk for PA.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25710614 PMCID: PMC4340086 DOI: 10.1038/ncomms7304
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Study design and work flow diagram for the GWAS of food allergy
MQLS: Modified quasi-likelihood analyses; PA: peanut allergy; SNP: single nucleotide polymorphism; DMP: differentially methylated position.
Figure 2Quantile-quantile (Q-Q) and Manhattan plots for genome-wide associations of peanut allergy in 2,197 discovery samples of European ancestry
(a) Q-Q plots for all of the genotyped and/or imputed SNPs and SNPs outside of the HLA-DQ and -DR region; (b) Manhattan plots for the genotyped SNPs alone (upper panel) and for the genotyped plus imputed SNPs. SNPs above the dashed line reached genome-wide significance at 5×10−8, based on the modified quasi-likelihood score test.
The top loci associated with food allergy and the three most common types of food allergy in 2,197 discovery samples of European ancestry.
| SNP | CHR | Position | Nearest gene | Allele | MAF | ||||
|---|---|---|---|---|---|---|---|---|---|
| Food allergy | Peanut allergy | Egg allergy | Milk allergy | ||||||
| The 5 top loci for food allergy | |||||||||
| rs12121623 | 1 | 54931396 | G/T | 0.13 | 3.1×10−7 | 0.024 | 0.046 | 0.053 | |
| rs1318710 | 4 | 101436201 | A/G | 0.10 | 2.6×10−6 | 0.002 | 0.020 | 0.207 | |
| rs777717 | 2 | 195954734 | C/T | 0.08 | 4.7×10−6 | 0.008 | 0.057 | 0.214 | |
| rs10994607 | 10 | 62760742 | C/T | 0.04 | 7.1×10−6 | 0.250 | 0.028 | 4.6×10−5 | |
| rs6942407 | 7 | 86861313 | G/A | 0.21 | 8.2×10−6 | 7.6×10−4 | 0.016 | 0.003 | |
| The 5 top loci for peanut allergy | |||||||||
| rs9275596 | 6 | 32681631 | T/C | 0.35 | 0.006 | 6.8×10−10 | 0.509 | 0.247 | |
| rs7192 | 6 | 32411646 | G/T | 0.39 | 0.175 | 5.5×10−8 | 0.293 | 0.468 | |
| rs862942 | 14 | 26492233 | T/C | 0.07 | 2.7×10−4 | 3.0×10−6 | 0.169 | 0.026 | |
| rs4584173 | 8 | 135336557 | T/C | 0.40 | 0.031 | 3.6×10−6 | 0.168 | 0.386 | |
| rs10878354 | 12 | 66384885 | G/A | 0.23 | 0.031 | 5.1×10−6 | 0.603 | 0.451 | |
| The 5 top loci for egg allergy | |||||||||
| rs7717393 | 5 | 155753914 | C/G | 0.07 | 0.067 | 0.627 | 1.4×10−6 | 2.7×10−4 | |
| rs5961136 | 23 | 54802520 | T/G | 0.39 | 0.251 | 0.067 | 2.4×10−6 | 0.900 | |
| rs250585 | 16 | 23401076 | G/A | 0.17 | 0.003 | 0.101 | 3.8×10−6 | 0.021 | |
| rs16823014 | 2 | 169817713 | G/A | 0.05 | 0.352 | 0.563 | 4.4×10−6 | 0.052 | |
| rs6498482 | 16 | 13987719 | T/C | 0.39 | 0.022 | 0.158 | 4.8×10−6 | 0.006 | |
| The 5 top loci for milk allergy | |||||||||
| rs9898058 | 17 | 47818821 | C/T | 0.15 | 7.7×10−5 | 0.014 | 0.016 | 1.1×10−6 | |
| rs17032597 | 2 | 67055115 | C/A | 0.29 | 0.003 | 0.568 | 0.074 | 1.6×10−6 | |
| rs78405116 | 11 | 1892562 | G/T | 0.03 | 0.015 | 0.423 | 0.775 | 1.7×10−6 | |
| rs10994613 | 10 | 62780127 | G/A | 0.03 | 1.6×10−5 | 0.997 | 0.036 | 4.8×10−6 | |
| rs7833294 | 8 | 58008281 | C/T | 0.02 | 0.281 | 0.504 | 0.066 | 7.3×10−6 | |
Only the genotyped SNP with the minimum p-value based on the modified quantitative likelihood score test is shown for each gene.
SNP: single nucleotide polymorphism; CHR: chromosome; MAF: minor allele frequency.
Major/minor allele.
The minor allele frequency was calculated using the genotyping data from parents of European ancestry.
P-value was generated using the modified quantitative likelihood score (MQLS) test in the 2,197 discovery samples.
Figure 3Locus-specific plot of peanut allergy-associated loci reaching genome-wide significance
Each dot represents the -log10 (p-value) for one genotyped or imputed SNP based on the modified quasi-likelihood score test in 2,197 discovery samples of European ancestry. The estimated recombination rates from the 1000 Genomes Project data are shown as blue lines and the genomic locations of genes within the regions of interest are shown at the bottom. SNP color represents linkage disequilibrium with the most significant genotyped SNP (rs9275596). SNP annotations are indicated as follows: triangles: genotyped SNPs; circles: imputed SNPs.
The estimated effect sizes of the two top SNPs on risk of peanut allergy in the discovery and replication samples.
| SNP | Allele | Discovery
| Replication
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Case | Control | Uncertain | OR(95%CI) | Case | Control | OR(95%CI) | |||||
| rs7192 | G/T | 0.49 | 0.42 | 0.40 | 1.7 (1.4–2.1) | 5.5×10− | 0.50 | 0.31 | 1.8 (1.2–2.7) | 0.005 | 2.7×10− |
| rs9275596 | T/C | 0.46 | 0.37 | 0.35 | 1.7 (1.4–2.1) | 6.8×10− | 0.44 | 0.31 | 1.7 (1.1–2.6) | 0.022 | 6.3×10− |
|
| |||||||||||
| rs7192 | G/T | 0.43 | 0.46 | 0.36 | 1.2 (0.8–1.8) | 0.198 | 0.42 | 0.36 | 1.4 (0.7–3.1) | 0.375 | 0.147 |
| rs9275596 | T/C | 0.34 | 0.41 | 0.30 | 1.2 (0.8–1.8) | 0.327 | 0.25 | 0.40 | 0.6 (0.2–1.3) | 0.176 | 0.420 |
SNP: single nucleotide polymorphism; OR: odds ratio; CI: confidence interval; MQLS: Modified quasi-likelihood score; GEE: generalized estimating equation.
Major/minor allele. The major allele is the reference allele, and the minor allele is the effective allele.
Minor allele frequency is shown in each group.
Controls of uncertain phenotype, which included 1,148 parents and 589 children.
The GEE model was applied to estimate the effect size of each SNP (additive genetic model) on the risk of peanut allergy in children with and without peanut allergy, with adjustment for age and gender. For analyses in non-European subjects, besides age and gender, we also included as covariates the first three principal components from the genome-wide SNP genotypes, to control for potential population stratification.
P-value from the MQLS analyses in the 2,197 discovery samples.
Meta-analysis was performed based on the Stouffer’s weighted z-score method to combine association results from the MQLS analyses in the discovery sample and from the GEE analyses in the replication sample.
Controls of uncertain phenotype, which included 263 parents and 139 children.
Associations of the two top SNPs with peanut allergy based on various definitions in 2,197 discovery samples of European ancestry.
| PA Definitions | Cases | rs7192 | rs9275596 | ||||||
|---|---|---|---|---|---|---|---|---|---|
| OR | 95%CI | OR | 95%CI | ||||||
| CR & (psIgE ≥0.1kU/L or SPT ≥3mm) | 316 | 1.7 | 1.4–2.1 | 1.7×10−7 | 5.5×10−8 | 1.7 | 1.4–2.1 | 9.6×10−8 | 6.8×10−10 |
| CR in 2 hours & (psIgE ≥0.1 kU/L or SPT ≥3mm) | 286 | 1.8 | 1.4–2.2 | 1.5×10−7 | 6.7×10−8 | 1.8 | 1.4–2.2 | 1.8×10−7 | 1.9×10−9 |
| CR in 2 hours & (psIgE ≥0.35 kU/L or SPT ≥3mm) | 278 | 1.8 | 1.4–2.2 | 2.0×10−7 | 1.0×10−7 | 1.8 | 1.4–2.2 | 1.8×10−7 | 2.5×10−9 |
| CR in 2 hours & (psIgE ≥0.35 kU/L or SPT ≥5mm) | 276 | 1.7 | 1.4–2.2 | 4.2×10−7 | 3.6×10−7 | 1.7 | 1.4–2.2 | 3.0×10−7 | 6.7×10−9 |
| CR in 2 hours & (psIgE ≥15 kU/L or SPT ≥8mm) | 216 | 1.8 | 1.4–2.3 | 7.5×10−7 | 1.4×10−6 | 1.8 | 1.4–2.3 | 7.0×10−7 | 8.9×10−8 |
| CR in 2 hours & (psIgE ≥57 kU/L or SPT ≥8mm) | 187 | 1.8 | 1.4–2.3 | 4.2×10−6 | 2.3×10−5 | 1.8 | 1.4–2.2 | 9.2×10−6 | 4.2×10−6 |
SNP: single nucleotide polymorphism; PA: peanut allergy; OR: odds ratio; CI: confidence interval; GEE: generalized estimating equation; MQLS: modified quasi-likelihood score. CR: clinical allergic reaction to peanut ingestion; psIgE: peanut-specific IgE; SPT: skin prick test.
Using the minor allele (T allele) as the effective allele and the major allele (G allele) as the reference allele
Using the minor allele (C allele) as the effective allele and the major allele (T allele) as the reference allele.
The GEE model was applied to estimate the effect size of each SNP (additive genetic model) on the risk of PA in PA cases vs 733 non-PA children, with adjustment for age and gender and controlling for within-family relationship.
The MQLS test was applied to estimate the p-value by comparing PA cases, 144 non-allergic, non-sensitized normal controls, and 1,737 controls of uncertain phenotype.
Figure 4Differentially methylated positions (DMPs) associated with rs7192, rs9275596, and with peanut allergy, as well as DMPs that mediate genetic risk in peanut allergy
(4a) Diagram showing associations between genotype (rs7192: blue circle, and rs9275596: green circle) and DMPs, and between the genotype-dependent DMPs and peanut allergy (PA). The DMPs are denoted by empty circles. Dashed blue (for rs7192) and green (for rs9275596) lines represent significant associations between genotypes and DMPs at P < 5×10−8 based on linear regression models (N=218). In between the DMPs and gene tracks are purple bars that represent the genotype-dependent DMPs that are associated with PA based on linear regression models in 73 PA cases and 67 non-PA controls (DMPs as the outcomes). The two DMPs marked with purple triangles are DMP cg15982117 in the HLA-DRB1 gene and DMP cg18024368 in the HLA-DQB1 gene that may mediate genetic risk in PA, as determined by the causal inference test; (4b) DMP cg18024368 in the HLA-DQB1 gene that may mediate the association between the rs7192 genotype and PA risk in 73 PA cases and 67 non-PA controls. The left panel shows the association between the methylation level at DMP cg18024368 and the rs7192 genotype. The middle panel depicts the association between the methylation level at DMP cg18024368 and PA. The blue bars in the left and middle panels represent median methylation levels. The right panel shows the effect size of the observed rs7192-PA associations (represented by odds ratios [ORs] based on the logistic regression model) before and after adjusting for the methylation level at DMP cg18024368 (M), as well as adjusting for both the methylation level at DMP cg18024368 (M) and at DMP cg15982117 (M2). Error bars represent the 95% conference intervals of the estimated ORs.
Differentially methylated positions that mediate genetic risk in peanut allergy.
| DMP | DMP annotation
| DMP-rs7192 associations | DMP-rs9275596 associations | DMP-PA associations | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Position | Nearest gene | Location | Beta | Beta | Beta | rs7192 | rs9275596 | ||||
| cg17039645 | 32294503 | Gene Body | 0.04 | 4.4×10−14 | 0.03 | 7.1×10−9 | 0.05 | 5.4×10−7 | 0.162 | 0.155 | |
| cg18111114 | 32498493 | TSS1500 | 0.10 | 5.5×10−25 | 0.09 | 3.2×10−22 | 0.10 | 1.6×10−5 | 0.098 | 0.037 | |
| cg15982117 | 32552106 | Gene Body | −0.09 | 8.9×10−10 | −0.09 | 3.1×10−10 | −0.15 | 3.8×10−7 | 0.002 | 0.002 | |
| cg18024368 | 32632848 | Gene Body | −0.04 | 2.7×10−10 | −0.05 | 3.1×10−13 | −0.06 | 2.4×10−6 | 0.002 | 0.003 | |
DMP: differentially methylated position; PA: peanut allergy; CIT: causal inference tests; TSS: transcription start site.
Represented by Infinium HumanMethylation450 BeadChip probe name. For each gene, only the top DMP that was genome-wide significantly associated with both SNPs (p<5×10−8) and that had the strongest association with risk of PA is shown.
Location of the methylation CpG site, in relation to the nearest gene.
Linear regression models were applied to test SNP-DMP associations in a subset of 218 children of European ancestry with available GWAS data and DNA methylation data, with adjustment of age, gender and the estimated cell composition.
Adjusted methylation difference with an increase of one copy of the risk allele (T allele for rs7192; or C allele for rs9275596).
Linear regression models were applied to test the DMP-PA associations in 73 PA cases and 67 controls of European ancestry, with adjustment of age, gender and the estimated cell composition.
Adjusted methylation difference between 73 PA cases and 67 controls of European ancestry.
P<0.00069 (=0.05/72) represents the significance level after Bonferroni correction.
Causal inference tests (CIT) were performed for the 4 DMPs in 73 peanut allergic cases and 67 controls of European ancestry using the modified version for binary outcomes.
P <0.05 after Bonferroni correction for four tested DMPs and two SNPs (cutoff: P<0.006 for 8 tests).