| Literature DB >> 23326239 |
Rohina Rubicz1, Robert Yolken, Eugene Drigalenko, Melanie A Carless, Thomas D Dyer, Lara Bauman, Phillip E Melton, Jack W Kent, John B Harley, Joanne E Curran, Matthew P Johnson, Shelley A Cole, Laura Almasy, Eric K Moses, Nikhil V Dhurandhar, Ellen Kraig, John Blangero, Charles T Leach, Harald H H Göring.
Abstract
Infection with Epstein-Barr virus (EBV) is highly prevalent worldwide, and it has been associated with infectious mononucleosis and severe diseases including Burkitt lymphoma, Hodgkin lymphoma, nasopharyngeal lymphoma, and lymphoproliferative disorders. Although EBV has been the focus of extensive research, much still remains unknown concerning what makes some individuals more sensitive to infection and to adverse outcomes as a result of infection. Here we use an integrative genomics approach in order to localize genetic factors influencing levels of Epstein Barr virus (EBV) nuclear antigen-1 (EBNA-1) IgG antibodies, as a measure of history of infection with this pathogen, in large Mexican American families. Genome-wide evidence of both significant linkage and association was obtained on chromosome 6 in the human leukocyte antigen (HLA) region and replicated in an independent Mexican American sample of large families (minimum p-value in combined analysis of both datasets is 1.4×10(-15) for SNPs rs477515 and rs2516049). Conditional association analyses indicate the presence of at least two separate loci within MHC class II, and along with lymphocyte expression data suggest genes HLA-DRB1 and HLA-DQB1 as the best candidates. The association signals are specific to EBV and are not found with IgG antibodies to 12 other pathogens examined, and therefore do not simply reveal a general HLA effect. We investigated whether SNPs significantly associated with diseases in which EBV is known or suspected to play a role (namely nasopharyngeal lymphoma, Hodgkin lymphoma, systemic lupus erythematosus, and multiple sclerosis) also show evidence of associated with EBNA-1 antibody levels, finding an overlap only for the HLA locus, but none elsewhere in the genome. The significance of this work is that a major locus related to EBV infection has been identified, which may ultimately reveal the underlying mechanisms by which the immune system regulates infection with this pathogen.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23326239 PMCID: PMC3542101 DOI: 10.1371/journal.pgen.1003147
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1EBV seroprevalence, based on measurement of anti-EBNA-1 antibodies, by sex and age for SAFHS.
Sliding 15-year age windows are used to smooth the curves, and age shown is the midpoint of each age interval.
Heritability estimates of EBNA-1 quantitative antibody and discrete serostatus traits, with and without household effects.
| Trait | Sample size | Heritability without household effects ( | Heritability with household effects ( | Household effects ( |
| Quantitative | 1367 | 42.8% (2.9×10−22) | 42.8% (2.4×10−22) | 3.8% (0.210) |
| Discrete | 1047 | 68.5% (4.0×10−16) | 68.7% (3.8×10−16) | 9.8% (0.179) |
Intermediate antibody titers were coded as unknown (indeterminate), thereby reducing the number of individuals included in analyses of the dichotomous serostatus trait.
Figure 2Genome-wide joint linkage and association analysis results for EBNA-1 antibody traits for SAFHS.
(A) Quantitative antibody titer. (B) Discrete serostatus.
Association analysis given linkage.
| SNP | Location on chrom 6 (bp) | Nearest gene | Minor allele | Minor allele frequency | This study (SAFHS) | Replicate (SAFDGS) | Combined (SAFHS+SAFDGS) | |||
| Quantitative | Discrete | Quantitative | Discrete | Quantitative | Discrete | |||||
| rs3130048 | 31721718 |
| C | 0.138 |
| 6.17×10−8(0.51) | 0.010(−0.24) | 0.052(0.29) |
|
|
| rs652888 | 31959213 |
| C | 0.096 | 6.96×10−7(−0.36) |
| 0.005(−0.30) | 0.009(0.48) |
|
|
| rs204999 | 32217957 |
| G | 0.180 |
|
| 1.11×10−5(−0.38) | 9.39×10−6(0.58) |
|
|
| rs4248166 | 32474399 |
| C | 0.254 |
|
| 0.002(0.23) | 0.075(−0.21) |
|
|
| rs2294884 | 32475237 |
| C | 0.261 | 2.45×10−7(0.24) |
| 0.003(0.22) | 0.131(−0.18) |
|
|
| rs2294882 | 32475493 |
| G | 0.280 | 1.12×10−7(0.24) |
| 0.003(0.22) | 0.109(−0.19) |
|
|
| rs2294881 | 32475582 |
| G | 0.280 | 1.12×10−7(0.24) |
| 0.003(0.22) | 0.109(−0.19) |
|
|
| rs28362680 | 32478794 |
| T | 0.235 | 5.53×10−7(0.24) |
| 0.009(0.20) | 0.185(−0.16) |
|
|
| rs28362683 | 32480941 |
| T | 0.220 | 6.82×10−7(0.25) |
| 0.011(0.19) | 0.209(−0.15) |
|
|
| rs10947261 | 32481210 |
| T | 0.233 | 4.54×10−7(0.25) |
| 0.009(0.20) | 0.193(−0.16) |
|
|
| rs10947262 | 32481290 |
| T | 0.231 | 1.10×10−6(0.24) |
| 0.009(0.20) | 0.185(−0.16) |
|
|
| rs7192 | 32519624 |
| T | 0.389 | 7.81×10−7(0.21) |
| 0.010(0.16) | 0.163(−0.14) |
|
|
| rs2239803 | 32519811 |
| G | 0.453 | 2.06×10−6(0.20) |
| 0.016(0.17) | 0.240(−0.11) |
|
|
| rs7194 | 32520458 |
| G | 0.391 | 5.24×10−7(0.22) |
| 0.010(0.16) | 0.163(−0.14) |
|
|
| rs7195 | 32520517 |
| A | 0.391 | 5.24×10−7(0.22) |
| 0.010(0.16) | 0.163(−0.14) |
|
|
| rs2213586 | 32521072 |
| T | 0.391 | 5.24×10−7(0.22) |
| 0.010(0.16) | 0.163(−0.14) |
|
|
| rs2213585 | 32521128 |
| C | 0.391 | 5.24×10−7(0.22) |
| 0.010(0.16) | 0.163(−0.14) |
|
|
| rs2227139 | 32521437 |
| C | 0.391 | 5.24×10−7(0.22) |
| 0.010(0.16) | 0.163(−0.14) |
|
|
| rs7754768 | 32528157 |
| C | 0.402 | 5.06×10−7(0.21) |
| 0.010(0.16) | 0.137(−0.15) |
|
|
| rs9268832 | 32535767 |
| T | 0.385 | 9.63×10−8(0.23) |
| 0.029(0.14) | 0.197(−0.14) |
|
|
| rs477515 | 32677669 |
| T | 0.320 |
| 8.72×10−8(0.37) |
|
|
|
|
| rs2516049 | 32678378 |
| G | 0.320 |
| 8.72×10−8(0.37) |
|
|
|
|
Shown are all SNPs yielding genome-wide significant p-values with either the quantitative and/or the qualitative antibody phenotype in the SAFHS. The regression coefficients refer to the estimated change in the phenotype for each dose of the rarer SNP allele. For the SAFHS all significant genome-wide results (p≤5.29×10−8) are presented in bold lettering. After correcting for multiple testing during replication in the SADGS (we tested the entire HLA region, with 5689 available SNPs: p≤0.05/5689≈8.79×10−6), there are two significant SNPs in the replication sample. When using the combined sample of both studies (SAFHS+SAFDGS), all SNPs originally significant in the SAFHS discovery sample are highly significant.
Since we used a liability threshold model for analysis of the dichotomous trait (see methods section), the direction of effect on EBNA-1 discrete serostatus is opposite of the sign of the regression coefficient, but is in the same direction as the regression coefficient for the quantitative trait (e.g. for SNP rs204999, the minor allele is associated with a decrease in EBNA-1 antibody level and seronegativity).
Figure 3Association analysis results (conditional on linkage) of extended HLA region for the combined sample (SAFHS+SAFDGS).
The LD pattern was estimated based on SNP genotypes from study participants. SNPs in red are highly correlated with the top SNP associated with the EBNA-1 quantitative trait (rs477515/rs2516049). (A) Quantitative antibody titer. (B) Discrete serostatus.
Association analysis, given linkage, conditional on the top SNP.
| SNP | Location on chrom 6 (bp) | Nearest gene | Minor allele | Minor allele frequency |
| |
| Quantitative trait | Discrete trait | |||||
| rs2233971 | 31188302 |
| A | 0.003 | 0.291 (0.05) |
|
| rs2233956 | 31189184 |
| G | 0.067 |
| 4.93×10−6 (0.49) |
| rs3130557 | 31202682 |
| T | 0.029 |
| 9.60×10−5 (0.57) |
| rs2854008 | 31420517 |
| T | 0.195 |
| 2.53×10−4 (0.26) |
| rs2507997 | 31422760 |
| C | 0.190 |
| 2.11×10−4 (0.26) |
| rs3093970 | 31505856 |
| A | 0.003 | 0.233 (0.05) |
|
| rs3130070 | 31699787 |
| G | 0.083 |
|
|
| rs3130623 | 31705679 |
| T | 0.089 | 1.02×10−7 (−0.34) |
|
| rs3130626 | 31706468 |
| G | 0.084 |
|
|
| rs2736157 | 31708799 |
| C | 0.084 |
|
|
| rs3115663 | 31709822 |
| G | 0.088 |
|
|
| rs9267522 | 31711749 |
| G | 0.088 |
|
|
| rs10885 | 31712570 |
| T | 0.083 |
|
|
| rs3130628 | 31717251 |
| C | 0.088 |
|
|
| rs3130048 | 31721718 |
| C | 0.138 |
|
|
| rs3117583 | 31727555 |
| C | 0.088 |
|
|
| rs3130618 | 31740113 |
| A | 0.088 |
|
|
| rs652888 | 31959213 |
| C | 0.096 |
|
|
| rs204999 | 32217957 |
| G | 0.180 |
|
|
| rs9267947 | 32319196 |
| G | 0.391 | 9.24×10−8 (−0.19) |
|
| rs9273327 | 32731201 |
| C | 0.069 |
|
|
| rs2854275 | 32736406 |
| T | 0.064 |
|
|
The top SNP, rs477515/rs2516049, was determined based on analysis of the quantitative antibody trait in the combined sample (SAFHS and SAFDGS). Only the extended HLA region was analyzed. Shown are the SNPs yielding genome-wide significant p-values (p≤5.29×10−8, shown in bold) with either the quantitative and/or the qualitative antibody phenotype in the combined sample. The regression coefficients refer to the estimated change in the phenotype for each dose of the rarer SNP allele.
Since we used a liability threshold model for analysis of the dichotomous trait (see methods section), the direction of effect on EBNA-1 discrete serostatus is opposite of the sign of the regression coefficient, but is in the same direction as the regression coefficient for the quantitative trait (e.g. for SNP rs204999, the minor allele is associated with a decrease in EBNA-1 antibody level and seronegativity).
Association (conditional on linkage) results for 12 comparative pathogens.
| Pathogen | rs477515/rs2516049 | rs2854275 |
|
|
|
|
| CMV | 0.89, 0.45 | 0.99, 0.72 |
| HSV-1 | 0.86, 0.71 | 0.03, 0.58 |
| HSV-2 | 0.23, 0.15 | 0.73, 0.54 |
| HHV-6 | 0.19, 0.53 | 0.06, 0.88 |
| VZV | 0.06, 0.22 | 0.95, 0.31 |
| Ad36 | 0.54, 0.72 | 0.69, 0.88 |
| HAV | 0.61, 0.53 | 0.54, 0.62 |
| Influenza A | 0.08, 0.09 | 0.92, 0.50 |
| Influenza B | 0.43, 0.33 | 0.09, 0.28 |
| C. pneumoniae | 0.05, 0.11 | 0.21, 0.99 |
| H. pylorii | 0.45, 0.95 | 0.57, 0.33 |
| T. gondii | 0.68, 0.44 | 0.64, 0.18 |
Focus is on the independent EBNA-1 associated SNPs rs477515/rs2516049 and rs2854275. There is no significant association of the top EBV SNPs with any of the other pathogens, after applying a Bonferroni correction to account for multiple testing (p≤0.05/12≈0.004).
P-values for EBNA-1 association, conditional on linkage, analysis for top nasopharyngeal carcinoma SNPs.
| SNP | Reference | Chromosome | Base pair location | EBNA-1 quantitative trait: | EBNA-1 discrete trait: | Genes of interest |
| rs2212020 |
| 3 | 37492466 | 0.381 | 0.306 |
|
| rs6774494 |
| 3 | 170565327 | 0.225 | 0.583 |
|
| rs2267633 |
| 6 | 29678820 | 0.874 | 0.825 |
|
| rs2076483 |
| 6 | 29679524 | 0.874 | 0.825 |
|
| rs29230 |
| 6 | 29684372 | 0.589 | 0.340 |
|
| rs29232 |
| 6 | 29719410 | 0.723 | 0.241 |
|
| rs3129055 |
| 6 | 29778240 | 0.327 | 0.119 |
|
| rs9258122 |
| 6 | 29779719 | 0.327 | 0.119 |
|
|
|
|
|
|
| 0.003 |
|
| rs9260734 |
| 6 | 30040645 | 0.075 | 0.084 |
|
| rs3869062 |
| 6 | 30042870 | 0.649 | 0.915 |
|
| rs5009448 |
| 6 | 30048467 | 0.571 | 0.890 |
|
| rs16896923 |
| 6 | 30108666 | 0.808 | 0.802 |
|
| rs2894207 |
| 6 | 31371730 | 0.483 | 0.523 |
|
|
|
|
|
| 0.009 |
|
|
| rs1412829 |
| 9 | 22033926 | 0.290 | 0.106 |
|
| rs9418990 |
| 10 | 135187956 | 0.503 | 0.716 |
|
| rs915906 |
| 10 | 135193728 | 0.022 | 0.115 |
|
| rs2249695 |
| 10 | 135202158 | 0.053 | 0.188 |
|
| rs1536826 |
| 10 | 135207229 | 0.381 | 0.550 |
|
| rs1572072 |
| 13 | 23025210 | 0.046 | 0.012 |
|
| rs9510787 |
| 13 | 23103195 | 0.896 | 0.899 |
|
| rs927220 |
| 14 | 67837725 | 0.846 | 0.690 |
|
Bold = p-value significant after Bonferoni correction for multiple testing (0.05/23≈2.17×10−3).
P-values for EBNA-1 association, conditional on linkage, analysis for top Hodgkin lymphoma SNPs.
| SNP | Reference | Chromosome | Base pair location | EBNA-1 quantitative trait: | EBNA-1 discrete trait: | Genes of interest |
| rs1432295 |
| 2 | 60920170 | 0.266 | 0.681 |
|
| rs2734986 |
| 6 | 29926547 | 0.011 | 0.104 |
|
| rs6904029 |
| 6 | 30051046 | 0.043 | 0.309 |
|
| rs2248462 |
| 6 | 31554775 | 0.013 | 0.011 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 0.017 |
|
|
| rs6903608 |
| 6 | 32536263 | 0.102 | 0.189 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| rs2608053 |
| 8 | 129145014 | 0.200 | 0.882 |
|
| rs2019960 |
| 8 | 129261453 | 0.120 | 0.217 |
|
| rs501764 |
| 10 | 8133040 | 0.657 | 0.924 |
|
| rs485411 |
| 10 | 8133191 | 0.706 | 0.646 |
|
Bold = p-value significant after Bonferoni correction for multiple testing (0.05/13≈3.85×10−3).
P-values for EBNA-1 association, conditional on linkage, analysis for top systemic lupus erythematosus SNPs.
| SNP | Reference | Chromosome | Base pair location | EBNA-1 quantitative trait: | EBNA-1 discrete trait: | Genes of interest |
| rs2476601 |
| 1 | 114179091 | 0.625 | 0.671 |
|
| rs1801274 |
| 1 | 159746369 | 0.883 | 0.716 |
|
| rs2205960 |
| 1 | 171458098 | 0.506 | 0.773 |
|
| rs3024505 |
| 1 | 205006527 | 0.400 | 0.512 |
|
| rs13385731 |
| 2 | 33555394 | 0.465 | 0.746 |
|
| rs1990760 |
| 2 | 162832297 | 0.871 | 0.248 |
|
| rs3024866 |
| 2 | 191631086 | 0.530 | 0.514 |
|
| rs7574865 |
| 2 | 191672878 | 0.391 | 0.928 |
|
| rs3135945 |
| 3 | 48483970 | 0.020 | 0.408 |
|
| rs6445975 |
| 3 | 58345217 | 0.906 | 0.789 |
|
| rs10516487 |
| 4 | 102970099 | 0.184 | 0.859 |
|
| rs907715 |
| 4 | 123754503 | 0.549 | 0.149 |
|
| rs10036748 |
| 5 | 150438339 | 0.804 | 0.904 |
|
| rs2507987 |
| 6 | 31451012 | 0.400 | 0.523 |
|
| rs409558 |
| 6 | 31816126 | 0.376 | 0.653 |
|
| rs3131379 |
| 6 | 31829012 | 2.21×10−3 | 3.47×10−3 |
|
| rs1270942 |
| 6 | 32026839 | 2.21×10−3 | 3.47×10−3 |
|
|
|
|
|
| 2.84×10−3 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| rs2071351 |
| 6 | 33151908 | 0.587 | 0.929 |
|
| rs3117213 |
| 6 | 33172583 | 0.214 | 0.735 |
|
| rs11755393 |
| 6 | 34932614 | 0.612 | 0.682 |
|
| rs548234 |
| 6 | 106674727 | 0.776 | 0.876 |
|
| rs6568431 |
| 6 | 106695499 | 0.468 | 0.539 |
|
| rs2230926 |
| 6 | 138237759 | 0.868 | 0.218 |
|
| rs4917014 |
| 7 | 50276409 | 0.671 | 0.509 |
|
| rs10488631 |
| 7 | 128381419 | 0.226 | 0.163 |
|
| rs12537284 |
| 7 | 128505142 | 0.322 | 0.240 |
|
| rs6985109 |
| 8 | 10798995 | 0.837 | 0.554 |
|
| rs7812879 |
| 8 | 11377591 | 0.636 | 0.614 |
|
| rs13277113 |
| 8 | 11386595 | 0.777 | 0.689 |
|
| rs7829816 |
| 8 | 57011940 | 0.909 | 0.609 |
|
| rs1913517 |
| 10 | 49789060 | 0.738 | 0.768 |
|
| rs4963128 |
| 11 | 579564 | 0.960 | 0.735 |
|
| rs507230 |
| 11 | 35085748 | 0.075 | 0.062 |
|
| rs6590330 |
| 11 | 127816269 | 0.154 | 0.184 |
|
| rs10847697 |
| 12 | 127865338 | 0.329 | 0.625 |
|
| rs1385374 |
| 12 | 127866647 | 0.365 | 0.670 |
|
| rs7329174 |
| 13 | 40456110 | 0.782 | 0.378 |
|
| rs16972959 |
| 16 | 23808877 | 0.029 | 0.0377 |
|
| rs9888739 |
| 16 | 31220754 | 0.613 | 0.491 |
|
| rs2280381 |
| 16 | 84576134 | 0.464 | 0.164 |
|
| rs280519 |
| 19 | 10333933 | 0.403 | 0.984 |
|
| rs4810485 |
| 20 | 44181354 | 0.672 | 0.865 |
|
| rs463426 |
| 22 | 20139185 | 0.259 | 0.051 |
|
| rs131654 |
| 22 | 20247190 | 0.100 | 0.133 |
|
Bold = p-value significant after Bonferoni correction for multiple testing (0.05/47≈1.06×10−3).
P-values for EBNA-1 association, conditional on linkage, analysis for top multiple sclerosis SNPs.
| SNP | Reference | Chromosome | Base pair location | EBV quantitative trait: p-value for association analysis | EBV discrete trait: p-value for association analysis | Genes of interest |
| rs6604026 |
| 1 | 93076191 | 0.224 | 0.952 |
|
| rs12044852 |
| 1 | 116889302 | 0.762 | 0.474 |
|
| rs1335532 |
| 1 | 116902480 | 0.242 | 0.077 |
|
| rs2300747 |
| 1 | 116905738 | 0.182 | 0.093 |
|
| rs1109670 |
| 2 | 9167489 | 0.729 | 0.517 |
|
| rs651477 |
| 2 | 119112161 | 0.630 | 0.551 |
|
| rs882300 |
| 2 | 136692725 | 0.267 | 0.079 |
|
| rs908821 |
| 3 | 142023408 | 0.812 | 0.712 |
|
| rs1841770 |
| 3 | 149239376 | 0.027 | 0.096 |
|
| rs7672826 |
| 4 | 182636689 | 0.388 | 0.731 |
|
| rs6897932 |
| 5 | 35910332 | 0.606 | 0.749 |
|
| rs3129934 |
| 6 | 32444165 | 0.022 | 4.54×10−3 |
|
|
|
|
|
| 1.84×10−3 |
|
|
| rs3135338 |
| 6 | 32509195 | 0.088 | 0.013 |
|
|
|
|
|
| 1.33×10−3 |
|
|
|
|
|
|
|
|
|
|
| rs1529316 |
| 8 | 3815546 | 0.221 | 0.107 |
|
| rs1755289 |
| 9 | 17928351 | 0.043 | 0.141 |
|
| rs3780792 |
| 9 | 135825164 | 0.587 | 0.984 |
|
| rs2104286 |
| 10 | 6139051 | 0.774 | 0.928 |
|
| rs2503875 |
| 10 | 43134055 | 0.973 | 0.812 |
|
| rs1800693 |
| 12 | 6310270 | 0.199 | 0.224 |
|
| rs4149584 |
| 12 | 6312904 | 0.298 | 0.183 |
|
| rs1458175 |
| 12 | 40252128 | 0.280 | 0.437 |
|
| rs703842 |
| 12 | 56449006 | 0.450 | 0.729 |
|
| rs9523762 |
| 13 | 92129887 | 0.451 | 0.439 |
|
| rs744166 |
| 17 | 37767727 | 0.314 | 0.407 |
|
| rs397020 |
| 20 | 1153886 | 0.399 | 0.278 |
|
| rs6074022 |
| 20 | 44173603 | 0.802 | 0.797 | CD40 |
| rs1569723 |
| 20 | 44175471 | 0.802 | 0.797 | CD40 |
Bold = p-value significant after Bonferoni correction for multiple testing (0.05/30≈9.09×10−4).