| Literature DB >> 28199695 |
Amy V Jones1, Mera Tilley2, Alex Gutteridge3, Craig Hyde4, Michael Nagle5, Daniel Ziemek6, Donal Gorman7, Eric B Fauman8, Xing Chen4, Melissa R Miller5, Chao Tian9, Youna Hu9, David A Hinds9, Peter Cox10, Serena Scollen1.
Abstract
Understanding the interaction between humans and mosquitoes is a critical area of study due to the phenomenal burdens on public health from mosquito-transmitted diseases. In this study, we conducted the first genome-wide association studies (GWAS) of self-reported mosquito bite reaction size (n = 84,724), itchiness caused by bites (n = 69,057), and perceived attractiveness to mosquitoes (n = 16,576). In total, 15 independent significant (P < 5×10-8) associations were identified. These loci were enriched for immunity-related genes that are involved in multiple cytokine signalling pathways. We also detected suggestive enrichment of these loci in enhancer regions that are active in stimulated T-cells, as well as within loci previously identified as controlling central memory T-cell levels. Egger regression analysis between the traits suggests that perception of itchiness and attractiveness to mosquitoes is driven, at least in part, by the genetic determinants of bite reaction size.Our findings illustrate the complex genetic and immunological landscapes underpinning human interactions with mosquitoes.Entities:
Mesh:
Year: 2017 PMID: 28199695 PMCID: PMC5390679 DOI: 10.1093/hmg/ddx036
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Demographic characteristics of the GWAS cohort. Itch intensity numbers reflect exclusion of participants positive for a common immune condition
| Phenotype | Bite size ( | Itch intensity ( | Attractiveness ( | |
|---|---|---|---|---|
| total | 84 724 (100) | 69 057 (100) | 16 576 (100) | |
| Sex | male | 41 355 (48.8) | 35 460 (51.3) | 8122 (49.0) |
| female | 43 369 (51.2) | 33 597 (48.7) | 8454 (51.0) | |
| Age (years) | ≤30 | 9381 (11.1) | 8418 (12.2) | 2497 (15.1) |
| >30 and ≤45 | 24 578 (29.0) | 20 814 (30.1) | 5238 (31.6) | |
| >45 and ≤60 | 23 721 (28.0) | 18 778 (27.2) | 4354 (26.3) | |
| >60 | 27 044 (31.9) | 21 047 (30.5) | 4487 (27.0) |
Figure 1(A) Manhattan plot of GWS loci for mosquito bite size. The grey horizontal line corresponds to P= 5 × 10−8, and results above this threshold are shown in dark grey. Gene labels are annotated as the nearby genes to the significant SNPs. λ = 1.073. (B) Quantile-quantile plot. Observed P values versus theoretical P values under the null hypothesis of no association, plotted on a log scale. The solid black line is shown with a slope of 1, and dashed black lines represent a 95% confidence envelope under the assumption that the test results are independent.
Index SNPs that are associated with mosquito bite size variation (P < 1.0e-06)
| Marker name | Region | Chr | Position | SNP quality | Alleles (A/B) | EAF (B allele) | Effect size for B allele (95% CI) | Gene context | |
|---|---|---|---|---|---|---|---|---|---|
| rs377070 | 4q27 | 4 | 123629002 | 0.99 | C/G | 0.388 | −0.070 (−0.081, −0.060) | 1.2 × 10−36 | IL21--[]--BBS12 |
| rs3134995 | 6p21.32 | 6 | 32672089 | 0.99 | C/T | 0.242 | −0.065 (−0.077, −0.052) | 1.9 × 10−24 | HLA-DQB1--[]--HLA-DQA2 |
| rs5750339 | 22q12.3 | 22 | 37319589 | 0.96 | C/G | 0.457 | −0.052 (−0.062, −0.041) | 9.2 × 10−21 | [CSF2RB] |
| rs55722650 | 5q31.1 | 5 | 131607300 | >0.99 | C/T | 0.41 | −0.049 (−0.060, −0.038) | 4.8 × 10−19 | [PDLIM4] |
| rs2906856 | 12q15 | 12 | 68390201 | 0.99 | C/T | 0.353 | 0.039 (0.028, 0.050) | 7.9 × 10−12 | DYRK2---[]---IFNG |
| rs11751172 | 6p21.1 | 6 | 45552731 | 0.99 | C/T | 0.244 | −0.040 (−0.053, −0.028) | 1.1 × 10−10 | RUNX2--[]---CLIC5 |
| rs6754311 | 2q21.3 | 2 | 136707982 | 0.97 | C/T | 0.609 | 0.039 (0.027, 0.051) | 1.6 × 10−10 | [DARS] |
| rs143626010 | 16p11.2 | 16 | 28539398 | 0.91 | D/I | 0.611 | 0.033 (0.021, 0.044) | 1.5 × 10−8 | [NPIPL1] |
| rs7793919 | 7p22.1 | 7 | 4769030 | 0.94 | C/T | 0.435 | 0.031 (0.020, 0.042) | 2.8 × 10−8 | [FOXK1] |
| rs62447171 | 7p12.2 | 7 | 50309890 | >0.99 | A/G | 0.736 | 0.034 (0.022, 0.046) | 3.5 × 10−8 | C7orf72---[]--IKZF1 |
| rs3024971 | 12q13.3 | 12 | 57493727 | >0.99 | G/T | 0.896 | 0.047 (0.030, 0.065) | 5.9 × 10−8 | [STAT6] |
| rs3102960 | 2p25.1 | 2 | 8452497 | 0.98 | C/T | 0.793 | −0.036 (−0.049, −0.023) | 7.6 × 10−8 | []---ID2 |
| rs2230624 | 1p36.22 | 1 | 12175658 | 0.80 | A/G | 0.988 | 0.162 (0.102, 0.222) | 1.1 × 10−7 | [TNFRSF8] |
| rs9815073 | 3q28 | 3 | 188115682 | 0.80 | A/C | 0.657 | −0.033 (−0.046, −0.021) | 1.1 × 10−7 | [LPP] |
| rs112637405 | 4q28.2 | 4 | 130258336 | 0.63 | A/T | 0 | 1.946 (1.228, 2.664) | 1.1 × 10−7 | C4orf33---[] |
| rs6841258 | 4p14 | 4 | 40565426 | 0.85 | C/T | 0.179 | 0.039 (0.024, 0.054) | 3.6 × 10−7 | [RBM47] |
| rs78587638 | 2q14.2 | 2 | 119187509 | 0.98 | A/G | 0.842 | −0.037 (−0.051, −0.023) | 4.2 × 10−7 | INSIG2---[]---EN1 |
| rs12133641 | 1q21.3 | 1 | 154428283 | >0.99 | A/G | 0.395 | 0.028 (0.017, 0.038) | 4.3 × 10−7 | [IL6R] |
| rs2075533 | 9q32 | 9 | 117693631 | >0.99 | A/G | 0.572 | 0.027 (0.017, 0.038) | 4.7 × 10−7 | TNFSF8[]--TNC |
| rs921533 | 2q31.1 | 2 | 174551931 | 0.98 | A/T | 0.658 | 0.029 (0.018, 0.040) | 5.2 × 10−7 | CDCA7---[]---SP3 |
| rs2963854 | 5q35.2 | 5 | 175083754 | 0.84 | A/G | 0.196 | −0.034 (−0.047, −0.021) | 5.3 × 10−7 | SFXN1---[]-HRH2 |
| rs112724722 | 7q21.3 | 7 | 93546386 | >0.99 | A/G | 0.998 | −0.384 (−0.534, −0.234) | 5.3 × 10−7 | GNGT1-[]-GNG11 |
| rs969584 | 8q24.13 | 8 | 126620429 | 0.99 | C/G | 0.351 | −0.028 (−0.040, −0.017) | 5.4 × 10−7 | TRIB1---[]---FAM84B |
| rs71872522 | 1p31.1 | 1 | 77050728 | 0.73 | D/I | 1 | −1.797 (−2.505, −1.089) | 6.6 × 10−7 | [ST6GALNAC3] |
| rs200872239 | 1p36.11 | 1 | 25250916 | 0.77 | D/I | 0.313 | 0.032 (0.020, 0.045) | 6.8 × 10−7 | [RUNX3] |
Region, cytogenetic band; chr, chromosome; position, build 37 map position of the SNP; SNP quality is average r2 from imputation; alleles A and B are assigned based on their alphabetical order; EAF (B allele), the frequency of the effect allele, which is denoted here as allele B, across all study participants; effect size, magnitude of effect for the B allele; CI, confidence interval; P, λ adjusted significance level; gene context, gene(s) spanning or flanking (<1Mb away from) the index SNP: brackets indicate the position of the SNP, and dashes indicate distance to a flanking gene (−, >1 kb; −, >10kb; − −, >100kb).
Figure 2(A) Manhattan plot of GWS loci for itch intensity from mosquito bites, adjusted for bite size and excluding responders positive for common immune-related conditions. The grey horizontal line corresponds to P = 5 × 10−8, and results above this threshold are shown in dark grey. Gene labels are annotated as the nearby genes to the significant SNPs. λ = 1.042. (B) Quantile-quantile plot (insert). Observed P values versus theoretical P values under the null hypothesis of no association, plotted on a log scale. The solid black line is shown with a slope of 1, and dashed black lines represent a 95% confidence envelope under the assumption that the test results are independent.
Index SNPs that are associated with itch intensity from mosquito bites (P < 1.0e-06)
| Marker name | Region | Chr | Position | SNP quality | Alleles (A/B) | EAF(B allele) | Effect size for B allele (95% CI) | Gene context | |
|---|---|---|---|---|---|---|---|---|---|
| rs2248116 | 5q31.1 | 5 | 131804347 | 0.98 | A/C | 0.414 | 0.034 (0.027, 0.041) | 3.5 × 10−21 | |
| rs309394 | 4q27 | 4 | 123613670 | >0.99 | G/T | 0.433 | 0.024 (0.017, 0.031) | 4.9 × 10−11 | |
| rs12055445 | 6p21.32 | 6 | 32626739 | 0.84 | A/G | 0.513 | −0.025 (−0.032, −0.017) | 1.5 × 10−10 | |
| rs2523614 | 6p21.33 | 6 | 31320654 | 0.97 | C/T | 0.689 | 0.024 (0.016, 0.031) | 1.4 × 10−9 | |
| rs3814244 | 12q15 | 12 | 68413471 | >0.99 | A/T | 0.401 | −0.022 (−0.029, −0.015) | 1.9 × 10−9 | |
| rs4499342 | 19p13.2 | 19 | 8786913 | 0.98 | C/T | 0.147 | 0.029 (0.020, 0.039) | 2.7 × 10−9 | |
| rs778798 | 19p13.3 | 19 | 5839613 | >0.99 | A/C | 0.74 | −0.022 (−0.029, −0.014) | 8.6 × 10−8 | |
| rs5796229 | 12p13.31 | 12 | 6491256 | 0.75 | D/I | 0.691 | 0.023 (0.015, 0.032) | 9.1 × 10−8 | |
| rs183794680 | 15q24.1 | 15 | 74807954 | 0.56 | C/T | 0.996 | 0.468 (0.295, 0.640) | 1.1 × 10−7 | |
| rs72816448 | 2p16.3 | 2 | 48314623 | 0.81 | A/G | 0.977 | −0.071 (−0.097, −0.044) | 1.5 × 10−7 | |
| rs10117812 | 9q33.3 | 9 | 128251025 | >0.99 | C/G | 0.697 | 0.02 (0.013, 0.028) | 1.8 × 10−7 | |
| rs4978899 | 9q31.3 | 9 | 113032356 | 0.98 | C/T | 0.099 | 0.03 (0.018, 0.041) | 4.9 × 10−7 | |
| rs5756391 | 22q12.3 | 2 | 37298344 | >0.99 | A/G | 0.621 | −0.018 (−0.025, −0.011) | 5.5 × 10−7 | |
| rs140260076 | 5q23.1 | 5 | 117114517 | 0.82 | C/T | 0.017 | −0.073 (−0.102, −0.044) | 8.8 × 10−7 | |
| rs55665660 | 4q28.1 | 4 | 124786263 | 0.99 | C/T | 0.959 | −0.043 (−0.061, −0.026) | 9.7 × 10−7 |
The analysis was adjusted for bite size and excluding responders positive for common immune-related conditions). Region, cytogenetic band; chr, chromosome; position, build 37 map position of the SNP; SNP quality is average r2 from imputation; alleles A and B are assigned based on their alphabetical order; EAF (B allele), the frequency of the effect allele, which is denoted here as allele B, across all study participants; effect size, magnitude of effect for the B allele; CI, confidence interval; P, λ adjusted significance level; gene context, gene(s) spanning or flanking (<1Mb away from) the index SNP: brackets indicate the position of the SNP, and dashes indicate distance to a flanking gene (−, >1 kb; −, >10kb; − −, >100kb).
Figure 3(A) Manhattan plot showing GWS loci for perceived attractiveness to mosquitoes. The horizontal grey line corresponds to P= 5 × 10−8, and results above this threshold are shown in dark grey. Gene labels are annotated as the nearby genes to the significant SNPs. λ = 1.027. (B) Quantile-quantile plot (insert). Observed P values versus theoretical P values under the null hypothesis of no association, plotted on a log scale. The solid black line is shown with a slope of 1, and dashed black lines represent a 95% confidence envelope under the assumption that the test results are independent.
Index SNPs that are associated with perceived attractiveness to mosquitoes (P < 1.0e-06)
| SNP | Region | Chr | Position | SNP quality | Alleles (A/B) | EAF(B allele) | OR for B allele (95% CI) | Gene context | |
|---|---|---|---|---|---|---|---|---|---|
| rs309403 | 4q27 | 4 | 123616822 | 0.99 | C/T | 0.393 | 1.269 (1.210, 1.330) | 6.8 × 10−23 | |
| rs1858074 | 5q31.1 | 5 | 131371999 | >0.99 | A/G | 0.317 | 1.161 (1.105, 1.219) | 2.4 × 10−9 | |
| rs9268659 | 6p21.32 | 6 | 32410941 | >0.99 | C/T | 0.42 | 1.15 (1.098, 1.205) | 3.5 × 10−9 | |
| rs521977 | 6p21.33 | 6 | 31836827 | >0.99 | G/T | 0.276 | 1.14 (1.083, 1.200) | 5.9 × 10−7 | |
| rs76338894 | 1p36.31 | 1 | 5914245 | 0.79 | G/T | 0.949 | 1.345 (1.195, 1.513) | 6.5 × 10−7 | |
| rs3132479 | 6p21.33 | 6 | 31274752 | 0.99 | A/G | 0.466 | 0.891 (0.851, 0.933) | 8.2 × 10−7 | |
| rs139253612 | 11q24.1 | 11 | 123114582 | 0.98 | A/G | 0.985 | 1.57 (1.306, 1.887) | 9.7 × 10−7 |
Region, cytogenetic band; chr, chromosome; position, build 37 map position of the SNP; SNP quality is average r2 from imputation; alleles A and B are assigned based on their alphabetical order; EAF (B allele), the frequency of the effect allele, which is denoted here as allele B, across all study participants; CI, confidence interval; P, λ adjusted significance level; gene context, gene(s) spanning or flanking (<1Mb away from) the index SNP: brackets indicate the position of the SNP, and dashes indicate distance to a flanking gene (−, >1 kb; −, >10kb; − −, >100kb).
Figure 4Causal relationships between the three mosquito-related traits. Evidence provided by Egger regression revealed a high degree of overall pleiotropy between all three traits, and also suggested bite size may have a causal relationship on attractiveness to mosquitoes and itch intensity. P; slope coefficient significance level.
Summary of supporting evidence for GWS mosquito-associated loci
| HaploReg v4.1 annotation | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Region | Mosquito-related trait + SNP | Gene context | Immune disease/ trait | NS SNP | eQTL | Promoter | Enhancer | DNase | Proteins bound | 5' or 3' |
| 2q21.3 | S: rs6754311 | [DARS] | * | * | ||||||
| 4q27 | S: rs377070, A: rs309403, I: rs309394 | IL21--[]--BBS12 | * | * | * | * | ||||
| 5q31.1 | I: rs2248116, S: rs55722650 | C5orf56-[]--IRF1 | *** | * | * | * | * | * | ||
| 5q31.1 | A: rs1858074 | *** | * | * | * | |||||
| 6p21.1 | S: rs11751172 | RUNX2--[]---CLIC5 | * | * | ||||||
| 6p21.32 | S: rs3134995 | HLA-DQB1--[]--HLA-DQA2 | *** | * | * | * | * | |||
| 6p21.32 | I: rs12055445 | HLA-DQA1--[]HLA-DQB1 | *** | * | * | * | ||||
| 6p21.32 | A: rs9268659 | *** | * | * | * | * | * | |||
| 6p21.33 | I: rs2523614 | HLA-C--[]--MICA | * | * | * | |||||
| 7p12.2 | S: rs62447171 | C7orf72---[]--IKZF1 | *** | * | * | * | ||||
| 7p22.1 | S: rs7793919 | [FOXK1] | * | * | * | * | ||||
| 12q15 | S: rs2906856, I: rs3814244 | DYRK2---[]---IFNG | * | * | ||||||
| 16p11.2 | S: rs143626010 | [NPIPL1] | * | * | * | * | ||||
| 19p13.2 | I: rs4499342 | ADAMTS10---[]--ACTL9 | *** | * | * | |||||
| 22q12.3 | S: rs5750339 | [CSF2RB] | * | * | * | * | ||||
Cases are highlighted, by endpoint, where an index SNP is within 500 kb and r2 > 0.5 with a SNP that has been reported to be an eQTL index SNP (i.e. the strongest SNP associated with gene expression for a particular study, tissue and gene) from the literature. Datasets used for the eQTL lookup are contained in Supplementary Material, Table S34. Region, cytogenetic band; trait defined as I: mosquito itch intensity (adjusted for bite size and excluding responders positive for common immune-related conditions), S: mosquito bite size, A: mosquito attractiveness, followed by index association SNP (traits were grouped if index SNPs were in the same chromosomal region and highly correlated at r2 >0.5), gene context, gene(s) spanning or flanking (<1Mb away from) the index SNP: brackets indicate the position of the SNP, and dashes indicate distance to a flanking gene (−, >1 kb; −, >10kb; − −, >100kb); immune disease/trait, correlation with variant associated with other immune condition (na; not available), where strength of association is: **P < 5 × 10−7, ***P < 5 × 10−8 (Supplementary Material, Tables S18–S22); NS SNP, nonsynonymous SNP, r>0.5 with nonsynonymous SNP (Supplementary Material, Table S24); HaploReg annotation summary (*denotes positive for annotation, Supplementary Material, Table S26); eQTL, index SNP association with the expression of the listed gene(s); Promoter, co-localisation with promoter histone marks; Enhancer, co-localisation with enhancer histone marks, DNase, co-localisation with DNase I hypersensitivity marks; Proteins bound, co-localisation with protein binding detected by ChIP-sequencing; 5’ or 3’, index variant(s) in LD (r2 >0.8) with variants located in a genes 3’ or 5’ UTR region, and dashes indicate distance to a flanking gene (−, >1 kb; −, >10kb; − −, >100kb).
Figure 5Mosquito-trait pathway analysis. The highest scoring PPI subnetworks based on the PASCAL gene-trait association scores are illustrated for the bite size (A), itch intensity (B) and attractiveness (C) GWAS; nodes are sized and coloured according to P value (red = Gene-trait association P > 1 × 10−8, white = Gene-trait association P = 0).