| Literature DB >> 32128391 |
Lucija Klarić1,2, Yakov A Tsepilov3,4, Chloe M Stanton2, Massimo Mangino5,6, Timo Tõnis Sikka7,8, Tõnu Esko7,9,10, Eugene Pakhomov3,4, Perttu Salo11, Joris Deelen12,13, Stuart J McGurnaghan2, Toma Keser14, Frano Vučković1, Ivo Ugrina1,15, Jasminka Krištić1, Ivan Gudelj1, Jerko Štambuk1, Rosina Plomp16, Maja Pučić-Baković1, Tamara Pavić14, Marija Vilaj1, Irena Trbojević-Akmačić1, Camilla Drake2, Paula Dobrinić17, Jelena Mlinarec17, Barbara Jelušić17, Anne Richmond2, Maria Timofeeva18, Alexander K Grishchenko3,4, Julia Dmitrieva19, Mairead L Bermingham2, Sodbo Zh Sharapov3, Susan M Farrington18, Evropi Theodoratou20,21, Hae-Won Uh16,22, Marian Beekman12, Eline P Slagboom12, Edouard Louis23, Michel Georges19, Manfred Wuhrer16, Helen M Colhoun2,24, Malcolm G Dunlop18, Markus Perola11, Krista Fischer7, Ozren Polasek25,26,27, Harry Campbell20, Igor Rudan20, James F Wilson2,20, Vlatka Zoldoš17, Veronique Vitart2, Tim Spector5, Yurii S Aulchenko3,28,29, Gordan Lauc1,14, Caroline Hayward2,30.
Abstract
Effector functions of immunoglobulin G (IgG) are regulated by the composition of a glycan moiety, thus affecting activity of the immune system. Aberrant glycosylation of IgG has been observed in many diseases, but little is understood about the underlying mechanisms. We performed a genome-wide association study of IgG N-glycosylation (N = 8090) and, using a data-driven network approach, suggested how associated loci form a functional network. We confirmed in vitro that knockdown of IKZF1 decreases the expression of fucosyltransferase FUT8, resulting in increased levels of fucosylated glycans, and suggest that RUNX1 and RUNX3, together with SMARCB1, regulate expression of glycosyltransferase MGAT3. We also show that variants affecting the expression of genes involved in the regulation of glycoenzymes colocalize with variants affecting risk for inflammatory diseases. This study provides new evidence that variation in key transcription factors coupled with regulatory variation in glycogenes modifies IgG glycosylation and has influence on inflammatory diseases.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32128391 PMCID: PMC7030929 DOI: 10.1126/sciadv.aax0301
Source DB: PubMed Journal: Sci Adv ISSN: 2375-2548 Impact factor: 14.136
Loci associated with IgG glycosylation.
Each locus is represented by the SNP with the strongest association in the region. Locus, coded by “chromosome: locus start–locus end” (GRCh37); No. of SNPs, maximum number of SNPs independently contributing to trait variation; SNP, variant with the strongest association in the locus; position, position of the SNP on the NCBI (National Center for Biotechnology Information) build 37; EA, allele for which effect estimate is reported; OA, other allele; EAF, frequency of the effect allele; R2, sample size weighted average of imputation quality for the SNP with the strongest association in the locus; glycan, glycan associated with the reported SNP; No. of glycans, number of other glycans suggestively or significantly associated with variants in given locus; β, effect estimate for the SNP and glycan with the strongest association in the locus in the discovery; P, P value for the discovery effect estimate; β UPLC repl, effect estimate for the SNP and glycan with the strongest association in the locus in replication; P UPLC repl, P value for the effect estimate in replication. The loci replicated in UPLC replication at P ≤ 0.0019 are in bold. The loci from Lauc et al. (), Shen et al. (), and Wahl et al. () are in italics.
| 7:6520676-6537913 | 1 | rs6964421 | T | C | 0.328 | 0.957 | IGP14 | 7 | 0.117 | 5.31 × 10−11 | 0.000 | 9.95 × 10−1 | |
| 8:103538266-103550211 | 1 | rs10096810 | A | G | 0.627 | 1.000 | IGP77 | 4 | −0.110 | 9.52 × 10−11 | −0.074 | 1.37 × 10−2 | |
| 16:23397113-23613191 | 1 | rs250555 | T | C | 0.860 | 0.961 | IGP26 | 4 | −0.155 | 6.76 × 10−10 | −0.044 | 2.24 × 10−1 | |
| 19:19260586-19296217 | 1 | rs7257072 | T | C | 0.515 | 0.997 | IGP9 | 12 | −0.122 | 1.59 × 10−13 | −0.068 | 1.97 × 10−2 | |
| 20:17818141-17833534 | 1 | rs2745851 | A | G | 0.376 | 0.982 | IGP38 | 10 | −0.125 | 4.61 × 10−13 | −0.094 | 8.82 × 10−3 | |
| 1:246854862-246963137 | LINC01341 | 1 | rs3795464 | A | G | 0.383 | 0.984 | IGP73 | 2 | −0.099 | 1.01 × 10−08 | ||
| 2:100636757-100805273 | AC092667.1 | 1 | rs2309748 | A | T | 0.647 | 0.993 | IGP34 | 2 | −0.105 | 3.38 × 10−09 | ||
| 17:16820099-16875636 | TBC1D27-TNFRSF13B | 1 | rs4561508 | T | C | 0.105 | 0.975 | IGP9 | 2 | 0.163 | 5.70 × 10−09 | ||
| 17:56398006-56410041 | MIR142 | 1 | rs2526378 | A | G | 0.549 | 0.935 | IGP31 | 1 | −0.100 | 4.44 × 10−09 | ||
| 19:1614910-1658699 | TCF3 | 1 | rs4807942 | T | C | 0.878 | 0.661 | IGP68 | 1 | 0.154 | 1.54 × 10−08 | ||
| 20:52170177-52212273 | ZNF217 | 1 | rs1555926 | T | C | 0.789 | 0.992 | IGP11 | 3 | −0.118 | 6.96 × 10−09 | ||
Fig. 1Gene prioritization in loci associated with IgG N-glycosylation.
The Manhattan plot was created by taking the lowest P value at every genomic position from all 77 GWAS. For simplicity, the plot was trimmed at the equivalent of P = 10−50. The lowest observed P value in this analysis was 4.65 × 10−276 at ST6GAL1. Known loci, loci detected in previous IgG N-glycosylation GWAS; replicated, UPLC replication GWAS. Bottom: Summary of support for prioritization of every gene in the Manhattan plot. DEPICT, genes from enriched gene sets; expression, genes whose expression is pleiotropic with IgG N-glycosylation; CD19, B cells; PB, peripheral blood; coding, genes for which IgG N-glycosylation–associated SNP results in a changed amino acid sequence.
Fig. 2Functional network of loci associated with IgG N-glycosylation.
Correlation estimates are computed on the basis of squared pairwise Spearman’s correlation of SNP effects. The loci are denoted with names of genes that were prioritized in regions tagged by the given lead SNP. (A) Functional network of loci associated with IgG N-glycosylation. In this network, each node represents a lead SNP in the locus, and each edge represents the squared correlation of glycome-wide effects of the two nodes. Only significant correlations after multiple testing correction (P ≤ 1.4 × 10−4) are shown. The thickness and intensity of edges depend on variation in one locus explained by the effect estimates in the second locus. Round-edged rectangular nodes denote genes that are, according to GO, involved in glycosylation; purple-edged nodes denote genes involved in immune system processes; green nodes denote loci containing genes involved in transcription regulation; orange nodes denote glycosyltransferases; and blue rectangles indicate diseases pleiotropic with IgG glycans in the given locus (Table 3). (B) Hierarchical clustering of pairwise Spearman’s locus-effect correlations.
In silico evidence for edges from functional network of IgG N-glycosylation loci.
Permutation analysis was used to assess the distribution of glycome-wide SNP effect pairwise correlations and the STRING PPI database () to search for biological validation of observed links. All reported correlations are statistically significant after Bonferroni correction. Corr, Spearman’s correlation coefficient of z scores of SNP1 and z scores of SNP2; quant, quantile of distribution at which the given ρ2 was observed in permutation analysis; STRING score, probability that the link exists obtained by combining evidence from different sources. SNPs that were replicated in UPLC replication are in bold.
| 0.967 | 0 | 0 | |||||||
| 0.805 | 3.54 × 10−3 | 7.72 × 10−3 | |||||||
| 0.711 | 1.87 × 10−2 | 3.83 × 10−3 | |||||||
| rs481080 | 0.773 | 6.00 × 10−5 | 3.01 × 10−3 | ||||||
| rs7700895 | 0.581 | 8.32 × 10−3 | 2.45 × 10−2 | ||||||
| 0.709 | 8.42 × 10−3 | 2.59 × 10−2 | |||||||
| 0.624 | 2.62 × 10−2 | 9.69 × 10−2 | |||||||
| 0.891 | 2.00 × 10−5 | 3.00 × 10−5 | 0.9 | ||||||
| 0.597 | 3.25 × 10−2 | 1.59 × 10−2 | 0.4 | ||||||
| 0.694 | 1.15 × 10−2 | 3.33 × 10−3 | |||||||
| 0.609 | 9.57 × 10−3 | 2.49 × 10−2 | 0.42 | ||||||
| 0.609 | 9.57 × 10−3 | 2.49 × 10−2 | 0.48 | ||||||
| 0.626 | 2.56 × 10−2 | 2.15 × 10−2 | 0.6 | ||||||
| rs3099844 | 0.231 | 8.82 × 10−2 | 9.86 × 10−2 | 0.93 | |||||
| 0.283 | 1.63 × 10−1 | 6.02 × 10−2 | 0.7 | ||||||
| rs7257072 | rs3099844 | 0.23 | 1.77 × 10−1 | 9.85 × 10−2 | 0.62 | ||||
Summary of the HEIDI testing for pleiotropy versus LD between variants coassociated with IgG glycan levels, complex traits, and gene expression.
All traits have significant (PSMR ≤ 9.1 × 10−4) coassociation. In these loci where the associated variant was the same in IgG glycosylation, complex traits, and gene expression, we report only glycans pleiotropic with both expression and complex traits. The HEIDI test distinguishes between pleiotropy (shared causal variant, HEIDI P ≥ 0.05) or causal variants in LD (HEIDI P < 0.05). Glycan trait descriptions: G, galactose; F, fucose; N, bisecting GlcNAc; S, sialic acid; gene expression: CD4 and CD8, helper and cytotoxic T cells; CD19, B cells; PB, peripheral blood.
| IRF1-SLC22A4 | IGP2 and | G0 | CD | + | − | |
| ORMDL3- | IGP2, IGP6, | G0 and G1 | Asthma, HDL, PBC, | + | − | |
| − | ||||||
| + | ||||||
| IGP58, IGP59, | % Fucosylation in total and | Asthma, PBC, HDL, RA, | − | + | ||
| + | ||||||
| − | ||||||
| CRHR1-SPPL2C- | IG14, IGP49, | G1FN, G2FN, % fucosylation | PD | − | ||
| RUNX3 | IGP65, IGP69, | Ratio of fucosylated | Ankylosing | |||
| IRF1-SLC22A4 | IGP2 and | G0 | Height and IBD | |||
Fig. 3Results of in vivo validation of the functional links between IKZF1 and FUT8.
(A) IKZF1 binds to a regulatory region upstream of FUT8. (B) Knockdown of IKZF1 leads to decreased expression of IKZF1 (n = 3). (C) Representative Western blot showing that IKZF1 is depleted at the protein level (protein reduced by around 50% compared with a random shRNA line or a non-IKZF1 targeting shRNA line). WB, Western blot. (D) Knockdown of IKZF1 leads to decreased expression of IKZF3 (n = 3). (E) Knockdown of IKZF1 leads to increased expression of FUT8 in the LCL (n = 3). (F) There is a small but significant increase in fucosylation of IgG secreted by the LCLs in which IKZF1 is knocked down (n = 3 lines, measured at two time points). All P values are t tests, and error bars show SD from the mean. Asterisk indicates P value <0.05.