| Literature DB >> 30332657 |
Vasiliki Lagou1, Josselyn E Garcia-Perez2, Ide Smets3, Lies Van Horebeek4, Marijne Vandebergh4, Liye Chen5, Klara Mallants4, Teresa Prezzemolo2, Kelly Hilven4, Stephanie Humblet-Baron2, Matthieu Moisse6, Philip Van Damme7, Guy Boeckxstaens8, Paul Bowness5, Bénédicte Dubois3, James Dooley2, Adrian Liston9, An Goris10.
Abstract
The immune system is highly diverse, but characterization of its genetic architecture has lagged behind the vast progress made by genome-wide association studies (GWASs) of emergent diseases. Our GWAS for 54 functionally relevant phenotypes of the adaptive immune system in 489 healthy individuals identifies eight genome-wide significant associations explaining 6%-20% of variance. Coding and splicing variants in PTPRC and COMMD10 are involved in memory T cell differentiation. Genetic variation controlling disease-relevant T helper cell subsets includes RICTOR and STON2 associated with Th2 and Th17, respectively, and the interferon-lambda locus controlling regulatory T cell proliferation. Early and memory B cell differentiation stages are associated with variation in LARP1B and SP4. Finally, the latrophilin family member ADGRL2 correlates with baseline pro-inflammatory interleukin-6 levels. Suggestive associations reveal mechanisms of autoimmune disease associations, in particular related to pro-inflammatory cytokine production. Pinpointing these key human immune regulators offers attractive therapeutic perspectives.Entities:
Keywords: adaptive immune system; association; autoimmunity; genetics; genome-wide association; immune phenotype; susceptibility
Mesh:
Substances:
Year: 2018 PMID: 30332657 PMCID: PMC6205839 DOI: 10.1016/j.celrep.2018.09.048
Source DB: PubMed Journal: Cell Rep Impact factor: 9.423
Replication of Previously Known Genotype-Immune Phenotype Associations
| Chr | Pos | rsID | EA | NEA | EAF | BETA (SE) | p Value | Trait | Candidate Genes | Trait ( |
|---|---|---|---|---|---|---|---|---|---|---|
| 2 | 38897074 | rs13011383 | G | A | 0.87 | −0.34 (0.12) | .0070 | CD4+ EMRA | TD CD4+ %GP | |
| 2 | 38921934 | rs7583259 | G | C | 0.49 | −0.31 (0.09) | .00051 | CD8+ EM | CD45RA− CD28− CD8br %P | |
| 2 | 87014377 | rs2944254 | C | T | 0.72 | 0.25 (0.09) | .0073 | CD4+ proliferating | CD4+ CD8dim AC | |
| 12 | 6899181 | rs2855537 | G | T | 0.76 | 0.18 (0.08) | .023 | TREC | naive (CD4+ CD8+) AC | |
| 17 | 33797371 | rs9916257 | T | G | 0.47 | −0.23 (0.06) | .00028 | NK | NK %GP |
We observed nominal significance for five genome-wide significant associations previously reported in the Sardinian population (Orrù et al., 2013). Trait names in Orrù et al. study: AC, absolute count; %GP, percentage of grandparental cells; NK, natural killer (cells); %P, percentage of parental cells; TD, terminally differentiated. BETA, effect; Chr, chromosome; EA, effect allele; EAF, effect allele frequency; NEA, non-effect allele; Pos, position in GRCh37; rsID, reference SNP identification; SE, standard error.
Figure 1Genome-wide Significant Genotype-Immune Phenotype Associations
(A) Circos plot demonstrating eight regions reaching genome-wide significant association with immune phenotypes. The y axis displays the negative logarithm of the p value. Variants reaching genome-wide significance (p < 5 × 10−8, dotted red line) are depicted in red, and the corresponding trait with which the variant is associated is indicated.
(B) Overview of the association of eight independent lead variants reaching genome-wide significance to at least one immune phenotype with all 54 immune phenotypes (see also Table S1 for definitions of immune phenotypes). Darkest colors indicate genome-wide significant associations, whereas red and blue colors distinguish a positive or negative direction of effect, respectively. Genome-wide (GW) significant, suggestive, nominal, and trend correspond to p values < 5 × 10−8, < 1 × 10−4, < 0.05, and < 0.10, respectively.
Novel Genome-wide Significant Genotype-Immune Phenotype Associations
| Chr | Pos | rsID | EA | NEA | EAF | BETA (SE) | % var | INFO | n | p value | Trait | Gene | Annotation |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 82196322 | rs9324185 | C | T | 0.37 | 0.36 (0.06) | 6.22 | 0.98 | 474 | 3.51 × 10−8 | IL-6 | G | |
| 1 | 198665917 | rs17612648 | G | C | 0.02 | −2.04 (0.26) | 15.70 | 0.883 | 362 | 6.22 × 10−15 | CD4+ EM | Co, Q, G, L | |
| 1 | 198665917 | rs17612648 | G | C | 0.02 | −1.76 (0.27) | 11.84 | 0.859 | 362 | 8.94 × 10−11 | CD4+ CM | Co, Q, G, L | |
| 1 | 198830942 | rs113116201 | C | T | 0.02 | 1.72 (0.29) | 12.26 | 0.996 | 255 | 7.58 × 10−9 | CD4+ EMRA | Co, Q, L | |
| 1 | 198830942 | rs113116201 | C | T | 0.02 | −2.17 (0.27) | 20.08 | 0.996 | 248 | 7.52 × 10−14 | CD8+ EM | Co, Q, L | |
| 4 | 128924522 | rs373482106 | C | CA | 0.23 | 0.49 (0.08) | 8.21 | 0.919 | 449 | 4.90 × 10−10 | KREC | Q, L, E | |
| 5 | 38974929 | rs16867919 | G | A | 0.21 | 0.43 (0.08) | 6.13 | 1 | 475 | 3.56 × 10−8 | Th2 | R, G, L | |
| 5 | 115413042 | rs6886944 | T | C | 0.67 | −0.54 (0.10) | 11.57 | 0.966 | 248 | 4.83 × 10−8 | CD8+ CM | Co, Q, Cs, N, L | |
| 7 | 21115110 | rs917812 | G | C | 0.22 | −0.46 (0.08) | 6.87 | 0.967 | 468 | 5.86 × 10−9 | Memory B | Cs, L, E | |
| 14 | 82778603 | rs1457990 | A | G | 0.51 | −0.36 (0.06) | 6.56 | 0.998 | 466 | 1.91 × 10−8 | Th17 | Cs, L, E | |
| 19 | 39745146 | rs10853728 | G | C | 0.67 | −0.48 (0.08) | 9.95 | 1 | 294 | 2.89 × 10−8 | proliferating Treg | IFNλ cluster | N, L |
Variants associated with p < 5 × 10−8. SNPs rs17612648 and rs113116201 at the PTPRC locus are in LD (r2 = 0.62 in EURs, r2 = 1 in CEU) and conditional analyses were not able to distinguish between them (see also Table S3). Annotation indicates whether the variant is or is in LD with (r2 > 0.8) a coding variant (Co) or known splicing or expression quantitative trait locus (Q) or is conserved (Cs), whether the variant disrupts a regulatory motif (R), whether the variant is located in the candidate gene (G) or the candidate gene is the nearest gene to the variant (N), and whether the candidate gene is supported by biological evidence in the literature (L) or by expression data obtained in this study (E). For all three regions where both trait and variant have appropriate equivalents in previous GWASs, our findings replicated with nominal significance (p < 0.05) or showed a trend in the same direction (see also Table S2). BETA, effect; Chr, chromosome; EA, effect allele; EAF, effect allele frequency; INFO, imputation quality, with 1 for directly genotyped variants; n, number of individuals with genotype and immune phenotype; NEA, non-effect allele; Pos, position in GRCh37; rsID, reference SNP identification; SE, standard error; % var, percentage of variance explained.
Figure 2Coding and Splicing Variants Involved in T Cell Memory Differentiation
(A–H) Regional association plots (A–D) and boxplots (E–H) for PTPRC variants with CD4+ effector memory (EM) T cells (A and E), CD4+ central memory (CM) T cells (B and F), CD4+ terminally differentiated (EMRA) T cells (C and G), and CD8+ EM cells (D and H). Variant rs17612648 disrupts an exonic splicing silencer for PTPRC exon 4 (CD45RA splice form) and is in LD with rs113116201 (see also Table S3).
(I and J) Regional association plot (I) and boxplot (J) for the COMMD10 region with CD8+ CM cells. The lead variant rs6886944 is in high LD with synonymous coding variant rs1129494. In regional association plots, the x axis depicts the position on the chromosome and RefSeq genes, the left y axis indicates the negative logarithm of the p value for each variant (with the horizontal line corresponding to genome-wide significance or p < 5 × 10−8), and the right y axis shows recombination rates. The lead variant is indicated with a purple diamond and text, other variants of interest are indicated in blue text, and LD of other variants with the lead variant is color-coded based on r2 in the 1000 Genomes November 2014 European (EUR) database. In boxplots, boxes indicate median and interquartile range, with whiskers extending to 1.5× the interquartile range.
Figure 3Genetic Variants Associated with T Helper Subset Differentiation and Proliferation
(A–G) Regional association (A–C) and boxplots (D–F) for T helper 2 (Th2) (A and D), T helper 17 (Th17) (B and E), and proliferating regulatory T cells (Tregs) (C and F). Legends as in Figure 2.
(G) Lead associated variant in RICTOR is predicted to disrupt the T cell transcription factor MEF2-binding site.
(H) Among three candidate genes (STON2, SEL1L, and LINC01467) within a 1-Mb interval in the chromosome 14 region, STON2 was the only gene differentially expressed in Th17 versus Th1 cells differentiated from naive CD4+ T cells. Expression of LINC01467 was undetectable and not shown. IL17A and IFNG were included as positive controls for Th17 and Th1 cells, respectively. Mean and SEM for triplicate measurements from three donors are shown; relative quantity (RQ) was normalized using a T cell housekeeping gene (RPL13A) and was log-transformed for analysis.
Figure 4Genetic Variants Associated with B Cell Differentiation
(A–G) Regional association plots (A and B) and boxplots (C and D) for sjKREC levels (A and C) and memory B cells (B and D). Treatments known to increase early B cells such as interferon-beta (IFNB) compared to untreated multiple sclerosis patients (UNT) increased KREC levels (E) and decreased LARP1B gene expression (F), with an inverse correlation between LARP1B and KREC levels (G) (simplex measurements in PBMCs from 82 individuals).
(H) Among genes in the chromosome 7 region (LINC01162, SP4, and SP8), only SP4 is highly expressed in B cell subsets. Mean and SD of gene expression levels (triplicate measurements from four donors) is depicted. Additional legend as in Figure 2.
Figure 5Genetic Control of Pro-inflammatory Cytokine Production
(A) Regional association plot for ex vivo plasma interleukin-6 levels additionally depicting known GWAS hits in this region, including a variant associated with pediatric autoimmune diseases but in weak LD (r2 = 0.055).
(B and C) Boxplot for ex vivo plasma interleukin-6 levels (B) and interleukin-6 gene expression (C) in RNA extracted from PBMCs (simplex measurements from 173 individuals) (p = 0.16). Additional legend as in Figure 2.
Genotype-Immune Phenotype Correlations for Known Immune Disease Susceptibility Loci
| Chr | Pos | rsID | EA | NEA | EAF | BETA (SE) | p value | Trait | Disease | Variants | Genes |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | 204690355 | rs231746 | C | G | 0.49 | 0.29 (0.06) | 4.68 × 10−6 | CD8+ IL2+ | RA (+) | rs231735∗T | |
| 3 | 18752031 | rs55845060 | C | T | 0.17 | 0.35 (0.08) | 8.02 × 10−5 | TNF-α | Ps (+) | rs68080462∗C | |
| 6 | 32590924 | rs3129763 | A | G | 0.26 | 0.35 (0.08) | 2.93 × 10−5 | lymphocyte | SSc (+) | rs3129763 | HLA class II |
| 6 | 32680928 | rs7765379 | T | G | 0.11 | 0.51 (0.10) | 8.90 × 10−7 | TNF-α | RA, CD, EF (+) | rs7765379 | HLA class II |
| 6 | 33037085 | HLA-DPA1∗0103 | P | A | 0.80 | 0.32 (0.08) | 6.73 × 10−5 | memory B | hepatitis B (−) | HLA-DPA1∗0103 | HLA-DPA1 |
| 6 | 135424203 | rs6930223 | G | T | 0.54 | −0.48 (0.12) | 8.39 × 10−5 | CD8+ GM-CSF+ | HL (+) | rs7745098∗G | |
| 10 | 94502244 | rs11187157 | C | T | 0.43 | 0.29 (0.07) | 4.30 × 10−5 | mDCs | IBD (+) | rs11187157∗C | |
| 10 | 94436851 | rs7911264 | T | C | 0.53 | 0.31 (0.06) | 3.98 × 10−6 | mDCs | IBD (+) | rs7911264∗C | |
| 12 | 26691549 | rs9668498 | A | G | 0.25 | 0.30 (0.08) | 6.22 × 10−5 | TNF-α | KB (+) | rs10842750∗A | |
| 15 | 78785944 | rs373948468 | ATT | AT | 0.31 | 0.54 (0.12) | 2.17 × 10−5 | plasmablast | COPD (+) | rs17484524∗G, rs7181486∗C | |
| 16 | 11223454 | rs11645657 | C | G | 0.45 | 0.59 (0.12) | 4.89 × 10−6 | transitional B | AD, HF, asthma (+) | rs2041733∗T | |
| 16 | 50756881 | rs2076756 | G | A | 0.32 | −0.31 (0.08) | 7.63 × 10−5 | CD8+ naive | CD (+) | rs2076756∗G | |
| 20 | 62347191 | rs62217799 | T | G | 0.69 | −0.29 (0.07) | 4.47 × 10−5 | memory B | CD (+) | rs4809330∗G |
Suggestive (p < 10−4) genotype-immune phenotype correlations in LD (r2 > 0.8) with established immune disease susceptibility SNPs (+, susceptibility; −, protection). See also Tables S4 and S6 for all non-HLA and HLA suggestive associations, respectively, and Table S5 for all suggestive associations in LD with a variant for any disease in the EBI GWAS catalog. BETA, effect; EA, effect allele (P indicates present and A absent for HLA allele); EAF, effect allele frequency; NEA, non-effect allele; Pos, position in GRCh37; rsID, reference SNP identification; SE, standard error. Traits and disease associations: AD, atopic dermatitis (Paternoster et al., 2015), asthma (Paternoster et al., 2015); CD, Crohn disease (Franke et al., 2010, Yamazaki et al., 2013); COPD, chronic obstructive pulmonary disorder obstruction (Lee et al., 2015, Lutz et al., 2015); EF, enteric fever (Dunstan et al., 2014), hepatitis B (Kamatani et al., 2009); HF, hay fever (Paternoster et al., 2015); HL, Hodgkin lymphoma (Frampton et al., 2013); IBD, inflammatory bowel disease (Jostins et al., 2012, Liu et al., 2015); KB, Kashin-Beck disease (Zhang et al., 2015a); mDCs, myeloid dendritic cells; Ps, psoriasis (Baurecht et al., 2015); RA, rheumatoid arthritis (Freudenberg et al., 2011, Gregersen et al., 2009); SSc, systemic sclerosis (Gorlova et al., 2011).
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| CCR7 G043H7 | Biolegend | Cat# 353205; RRID: |
| CD11c 3.9 | eBioscience | Cat# 46-0116-41; RRID: |
| CD123 6H6 | eBioscience | Cat# 25-1239-41; RRID: |
| CD14 61D3 | eBioscience | Cat# 48-0149-41; RRID: |
| CD24 ML5 | Biolegend | Cat# 311121; RRID: |
| CD27 O323 | eBioscience | Cat# 56-0279-41; RRID: |
| CD3 SK7 | eBioscience | Cat# 56-0037-42; RRID: |
| CD31 WM-59 | eBioscience | Cat# 12-0319-41; RRID: |
| CD38 HIT2 | eBioscience | Cat# 17-0389-41; RRID: |
| CD4 RPA-T4 | eBioscience | Cat# 61-0049-41; RRID: |
| CD8α RPA-T8 | eBioscience | Cat# 56-0088-41; RRID: |
| CD19 HIB19 | Biolegend | Cat# 302241; RRID: |
| CD45Ra HI100 | eBioscience | Cat# 47-0458-41; RRID: |
| CD56 5.1H11 | Biolegend | Cat# 362503; RRID: |
| CXCR5 J252D | Biolegend | Cat# 356903; RRID: |
| FOXP3 206D | Biolegend | Cat# 320113: RRID: |
| γδTCR B1.1 | eBioscience | Cat# 11-9959-41; RRID: |
| HLA-DR LN3 | eBioscience | Cat# 47-9956-41; RRID: |
| IFNγ 4S.B3 | eBioscience | Cat# 47-7319-41; RRID: |
| IgE IgE21 | eBioscience | Cat# 11-6986-41; RRID: |
| IgM MHM-88 | Biolegend | Cat# 314511; RRID: |
| IL-17 eBio64DEC17 | eBioscience | Cat# 11-7179-41; RRID: |
| IL-2 MQ1-17H12 | eBioscience | Cat# 46-7029-41; RRID: |
| IL-4 8D4-8 | eBioscience | Cat# 12-7049-41; RRID: |
| Ki67 B56 | BD | Cat# 556027; RRID: |
| CD127 eBioRDR5 | eBioscience | Cat# 13-1278-82; RRID: |
| GM-CSF BVD-21C11 | eBioscience | Cat# 502309; RRID: |
| IL-21 eBio3A3-N2 | Biolegend | Cat# 12-7219-41; RRID: |
| Vα24Jα18 6B11 | eBioscience | Cat# 12-5806-41; RRID: |
| Naive CD4+ T Cell Isolation Kit II, human | Miltenyi Biotec | 130-094-131 |
| T Cell Activation/Expansion Kit, human | Miltenyi Biotec | 130-091-441 |
| CD24 ML5 | BioLegend | Cat# 311103; RRID: |
| IgM SA-DA4 | eBioscience | Cat# 12-9998-42; RRID: |
| CD14 TuK4 | eBioscience | Cat# MHCD1418; RRID: |
| IgD IA6-2 | BioLegend | Cat# 348209; RRID: |
| CD38 HIT2 | eBioscience | Cat# 17-0389-42; RRID: |
| CD27 O323 | eBioscience | Cat# 56-0279-42: RRID: |
| CD19 HIB19 | BioLegend | Cat# 302209; RRID: |
| Serum and PBMCs of healthy volunteers | N/A | |
| DNA samples of healthy volunteers | This paper | N/A |
| PBMCs, DNA and RNA samples of healthy volunteers | This paper | N/A |
| DNA and RNA samples of multiple sclerosis patients | This paper | N/A |
| Lymphocyte separation medium | LSM, MP Biomedicals | 0850494 |
| Lymphoprep | StemCell Technologies | 07861 |
| 10% DMSO | Sigma | D2650-100ML |
| Histopaque-1077 | Sigma | 10771 |
| Fixation-permeabilization buffer | eBioscience | 00-5523-00 |
| PMA | Sigma | P8139 |
| Ionomycin | Sigma | I3909-1ML |
| GolgiStop | BD Biosciences | 554724 |
| Cytofix/Cytoperm | BD Biosciences | 55471 |
| IL-2 | Peprotech | 200-02 |
| IL-12 | R&D Systems | 219-IL-005 |
| IL-1β | Peprotech | 200-01B |
| IL-6 | Peprotech | 200-06 |
| IL-23 | Peprotech | 200-23 |
| EcoRI | New England Biolabs | R0101S |
| Trizol | Thermo Fisher | 15596026 |
| High-Capacity cDNA Reverse Transcription Kit | Thermo Fisher | 4374967 |
| BAFF Quantikine ELISA kit | R&D Systems | DBLYS0B |
| V-Plex human Proinflammatory panel MSD | Meso Scale Discovery | K15049D-2 |
| Infinium HTS assay on Global Screening Array bead-chips | Illumina | |
| Immune phenotypes of healthy volunteers | Raw Data Resource | |
| Strand file for Illumina GSA array (“GSAMD-24v1-0_20011747_A1” file) | Developer: Will Rayner | |
| 1000 Genomes reference datasets [Phase1 (2012); Phase 3 (2014)] | ||
| T1DGC reference panel for the HLA region, build 37 | ||
| ExSNP integrated eQTL database (“All processed eQTL data for each study” file) | ||
| Published dataset for splicing QTLs in whole blood (“ng.3220-S2.xlsx” file) | N/A | |
| DNA detection assays for sjKREC | Thermo Fisher | Custom made |
| DNA detection assays for sjKREC | Thermo Fisher | Custom made |
| TaqMan Copy Number Reference Assay for human RNase P | Thermo Fisher | 4403326 |
| Thermo Fisher | Hs00263833_m1 | |
| Thermo Fisher | Hs01071406_m1 | |
| Thermo Fisher | Hs04403614_m1 | |
| Thermo Fisher | Hs04194366_g1 | |
| Thermo Fisher | Hs00162095_m1 | |
| Thermo Fisher | Hs01941366_s1 | |
| Thermo Fisher | Custom made | |
| Thermo Fisher | Hs00292731_m1 | |
| Thermo Fisher | Hs00174131_m1 | |
| Thermo Fisher | Hs00608519_m1 | |
| Thermo Fisher | Hs00172187_m1 | |
| FlowJo v9 | Tree Star | |
| QuantaSoft v1.4 | Bio-Rad | |
| R programming language | ||
| GenomeStudio V2011.1 | Illumina team | |
| PLINK v1.07;v1.9 | ||
| SHAPEIT2 | ||
| IMPUTE v2.0 | ||
| SNPTEST v2 | ||
| SNP2HLA v1.0.3 | ||
| PRSice v1.25 | ||
| Locuszoom standalone v1.4 | ||
| SWISS v1.0.05b | Developer: Ryan Welch | |
| GEMINI v0.20.1 | ||
| HaploReg v4 tool | ||
| RegulomeDB v1.1 | ||