| Literature DB >> 33227023 |
Yunpeng Wang1,2,3,4, Ron Nudel1,2, Michael E Benros1,5,6, Kristin Skogstrand1,7, Simon Fishilevich8, Doron Lancet8, Jiangming Sun1,2, David M Hougaard1,7, Ole A Andreassen3, Preben Bo Mortensen1,9, Alfonso Buil1,2, Thomas F Hansen1,2,10, Wesley K Thompson1,2,11, Thomas Werge1,2,12.
Abstract
Circulating inflammatory markers are essential to human health and disease, and they are often dysregulated or malfunctioning in cancers as well as in cardiovascular, metabolic, immunologic and neuropsychiatric disorders. However, the genetic contribution to the physiological variation of levels of circulating inflammatory markers is largely unknown. Here we report the results of a genome-wide genetic study of blood concentration of ten cytokines, including the hitherto unexplored calcium-binding protein (S100B). The study leverages a unique sample of neonatal blood spots from 9,459 Danish subjects from the iPSYCH initiative. We estimate the SNP-heritability of marker levels as ranging from essentially zero for Erythropoietin (EPO) up to 73% for S100B. We identify and replicate 16 associated genomic regions (p < 5 x 10-9), of which four are novel. We show that the associated variants map to enhancer elements, suggesting a possible transcriptional effect of genomic variants on the cytokine levels. The identification of the genetic architecture underlying the basic levels of cytokines is likely to prompt studies investigating the relationship between cytokines and complex disease. Our results also suggest that the genetic architecture of cytokines is stable from neonatal to adult life.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33227023 PMCID: PMC7721185 DOI: 10.1371/journal.pgen.1009163
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1SNP heritability of circulating protein levels.
The variation of circulating marker levels captured by a. all the genotyped SNP; b. SNPs on each autosome and c. polygenic scores computed from independent sample are shown. Cytokine SNP-heritabilities were shown in a by point estimates and standard errors. These point estimates were also shown in parentheses following the cytokine names on b. The Pearson’s correlation coefficients between polygenic scores and measured protein levels in the discovery sample are stratified by different p value thresholds (pT) of association in the discovery sample (S1, P<1x10-6; S2, P <1x10-5; S3,1x10-4; S4, P <0.001; S5, P < 0.01; S6, P < 0.1; S7, P <0.5; S8, P <1.0). The effect sizes used to compute polygenic scores are derived from Ahola-Olli et al.[17].
Fig 2Distribution of association statistics for inflammation marker level a.
The empirical cumulative distribution function of the log10(-log10(P)) for the association of SNPs (P<5x10-9) with each inflammation marker. Colors indicate different markers. b. Distribution of SNPs (P<1x10-9) in different minor allele frequency (MAF) bins is shown for each marker. Colors indicate different MAF intervals. Numbers in the figure shows the proportion of SNPs in the region.
Genome wide significant associations with blood inflammatory marker levels.
| Marker | SNP | Chr | Region | Pos | A1/A2 | Beta | Se | P | VE | INFO | Freq | P repl | Gene |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CRP | rs3091244 | 1 | 1q23.2 | 159684665 | A/G | 0.33 | 1.89E-2 | 7.47E-68 | 5.2E-2 | 1.01 | 0.32 | CRP | |
| rs112635299 | 14 | 14q32.13 | 94838142 | T/G | -0.41 | 5.61E-2 | 3.31-E-13 | 9.02E-3 | 0.95 | 0.03 | SERPINA1 | ||
| EPO | rs1130864 | 1 | 1q32.2 | 159683091 | A/G | 0.11 | 0.01 | 4.24E-23 | 1.02E-2 | 1.01 | 0.32 | CRP | |
| IgA | rs3094087 | 6 | 6p21.33 | 31061561 | T/C | 0.10 | 0.01 | 1.83E-10 | 3.12E-3 | 1.03 | 0.15 | HLA | |
| IL18 | rs10891329 | 11 | 11q23.1 | 112009892 | T/C | 0.32 | 7.4E-3 | 1E-300 | 4.0E-2 | 0.95 | 0.32 | 3.72E-37 | IL18 |
| rs10891268 | 11 | 11q23.1 | 111301044 | A/G | 6.1E-2 | 9E-3 | 1.21E-11 | 1.26E-3 | 0.98 | 0.22 | 4.46E-4 | POU2AF1 | |
| rs56195122 | 19 | 19p13.2 | 13020506 | A/G | -0.15 | 2.08E-2 | 2.4E-13 | 1.22E-3 | 0.99 | 0.03 | 6.59E-4 | SYCE2 | |
| rs9402686 | 6 | 6q23.3 | 135427817 | A/G | 5.65E-2 | 8.3E-3 | 1.51E-11 | 1.22E-3 | 1.0 | 0.26 | 0.58 | HBS1L | |
| MCP1 | rs12075 | 1 | 1q23.2 | 159175354 | A/G | 0.11 | 5.1E-3 | 1.12E-92 | 5.47E-3 | 0.98 | 0.44 | 1.84E-8 | ACKR1 |
| rs4493469 | 3 | 3p21.31 | 46177992 | T/C | -7.01E-2 | 8.5E-3 | 1.62E-16 | 9.18E-4 | 0.97 | 0.10 | 2.0E-3 | CCR3 | |
| rs2228467 | 3 | 3p22.1 | 42906116 | T/C | -8.19E-2 | 1.01E-2 | 6.22E-16 | 8.35E-4 | 1.02 | 0.07 | 1.24E-5 | ACKR2 | |
| rs60200069 | 10 | 10q22.1 | 73503994 | T/G | -3.74E-2 | 5.2E-3 | 7.82E-13 | 6.86E-4 | 0.98 | 0.43 | 8.67E-2 | CDH23 | |
| S100B | rs62224256 | 21 | 21q22.3 | 47887095 | A/G | -0.61 | 1.31E-2 | 1E-300 | 0.18 | 0.98 | 0.49 | 9.85E-38 | PCNT |
| rs28397289 | 6 | 6p21.33 | 31197407 | T/C | 0.15 | 1.72E-2 | 7.67E-19 | 8.46E-3 | 0.98 | 0.24 | rs4713462 | HLA | |
| TARC | rs115952894 | 3 | 3p24.3 | 16950359 | A/G | 0.54 | 2.46E-2 | 1E-104 | 2.67E-2 | 1.0 | 0.05 | PLCL2 | |
| rs2228467 | 3 | 3p22.1 | 42906116 | T/C | -0.41 | 2.09E-2 | 1.84E-82 | 2.05E-2 | 1.02 | 0.07 | ACKR2 | ||
| rs10886430 | 10 | 10q26.11 | 121010256 | A/G | 0.33 | 1.80E-2 | 1.23E-75 | 2.16E-2 | 0.89 | 0.11 | GRK5 | ||
| rs223896 | 16 | 16q21 | 57443146 | A/G | -0.12 | 1.08E-2 | 2.91E-29 | 7.10E-3 | 1.02 | 0.41 | CCL17 | ||
| VEGFA | rs7767396 | 6 | 6q21.1 | 43927050 | A/G | 0.22 | 6.2E-3 | 3.31E-253 | 2.36E-2 | 0.99 | 0.47 | intergenic | |
| rs11789392 | 9 | 9p24.2 | 2694914 | T/C | 0.13 | 7.0E-3 | 1.22E-73 | 8.09E-3 | 0.87 | 0.44 | intergenic |
a. from Ahola-Olli et al[17]
b. from Suhre et al[16]
c. from choi et al[20].
Fig 3Prediction of inflammation marker levels by genetic variants.
a. The distribution of the normalized S100B level in the replication sample is shown in the three genotype groups of rs62224256 (0: AA, 1, AG and 2 GG). A simple linear regression line(red) is added in the figure to show the trend. b. The Pearson’s correlation coefficients between polygenic scores and normalized S100B level in the replication sample are stratified by different p value threshold(pT) of association in the discovery sample (S1, P<1x10-6; S2, P<1x10-5; S3, P<1x10-4; S4, P <0.001; S5, P< 0.01; S6, P< 0.1; S7, P<0.5; S8, P<1.0). Standard errors are show by the error bar. Stars indicate significant correlations (P<0.00125 = 0.05/40). c. A scatter plot shows the predicted S100B level (normalized, fitted strait line) in the replication sample by SNPs with P<1x10-6 in the discovery sample.
Fine mapping of associated regions.
| CRP | rs3091244 | 93.24 | rs3091244 | 2.56 | 0.17 | 1.0 | 7.47E-68 | GH01G159751 (rs4131568,R2 = 0.82; rs12094103,R2 = 0.79) | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| rs376195567 | 2.75 | 0.24 | 0.05 | 3.66E-24 | |||||||
| rs3093059 | 2.38 | 0.11 | 0.02 | 2.26E-14 | |||||||
| rs112635299 | 10.31 | rs112635299 | 3.6 | 0.63 | 1.0 | 3.31-E-13 | |||||
| EPO | rs1130864 | 27.74 | rs1130864 | 2.59 | 0.18 | 1.0 | 4.24E-23 | GH01G159751 (rs4131568,R2 = 0.83; rs12094103,R2 = 0.83) | |||
| rs16842568 | 2.28 | 0.10 | 0.02 | 2.06E-6 | |||||||
| IL18 | rs10891329 | 362.57 | rs10891325 | 3.08 | 0.55 | 0.79 | 6.19E-263 | 1.26E-23 | |||
| rs11214126 | 9.12 | 1.0 | 0.26 | 2.03E-98 | 4.84E-14 | ||||||
| rs10891343 | 4.74 | 0.98 | 0.55 | 7.23E-287 | 8.58E-27 | ||||||
| rs10891268 | 21.25 | rs10444327 | 2.65 | 0.24 | 0.0 | 1.32E-5 | 0.25 | ||||
| rs117369151 | 2.53 | 0.29 | 0.05 | 1.62E-5 | 0.16 | ||||||
| rs79958943 | 3.70 | 0.78 | 0.12 | 4.49E-15 | 2.68E-3 | GH19G012880 | |||||
| rs56195122 | 10.44 | rs56195122 | 2.76 | 0.28 | 1.0 | 2.4E-13 | 6.59E-4 | GH19G012890 (rs2072596,R2 = 0.91); | |||
| rs9402686 | 12.10 | rs9402686 | 2.18 | 0.10 | 1.0 | 1.51E-11 | 0.58 | ||||
| rs56293029 | 2.09 | 0.09 | 0.93 | 2.39E-11 | 0.66 | ||||||
| MCP1 | rs12075 | 119.06 | rs12075 | 4.87 | 0.97 | 1.0 | 1.12E-92 | 1.84E-8 | |||
| rs13962 | 0.96 | 4.64 | 0.17 | 0.013 | 0.11 | ||||||
| rs72698561 | 3.55 | 0.65 | 0.04 | 7.85E-17 | 0.14 | ||||||
| rs4493469 | 30.68 | rs6441947 | 2.91 | 0.31 | 0.03 | 0.01 | 0.61 | ||||
| rs11923627 | 2.40 | 0.11 | 0.68 | 2.73E-15 | 2.49E-2 ( | ||||||
| rs12495098 | 2.48 | 0.14 | 0.01 | 1.46E-10 | 1.81E-4 ( | GH03G046297 | |||||
| rs2228467 | 17.78 | rs2228467 | 6.94 | 1.0 | 1.0 | 6.22E-16 | 1.23E-5 ( | ||||
| rs60200069 | 14.15 | rs10823838 | 2.85 | 0.27 | 1.0 | 8.33E-13 | 7.90E-2 | GH10G071740 | , | ||
| rs3747858 | 2.76 | 0.23 | 0.03 | 5.16E-9 | 0.28 | GH10G071745 | |||||
| S100B | rs62224256 | 657.11 | rs11910707 | 13.25 | 1.0 | 0.10 | 1.26E-205 | 1.66E-27 | GH21G046620 | ||
| rs55912899 | 5.22 | 0.99 | 0.09 | 5.89E-129 | 1.7E-6 | ||||||
| rs2839314 | 4.07 | 0.96 | 0.13 | 1.99E-240 | 4.30E-19 | GH21G046541 | |||||
| TARC | rs115952894 | 177.69 | rs115952894 | 4.34 | 0.92 | 1.0 | 1E-104 | ||||
| rs76472873 | 3.0 | 0.36 | 0.0 | 2.21E-57 | GH03G016916 | ||||||
| rs369616361 | 3.78 | 0.77 | 0.14 | 0.013 | |||||||
| rs2228467 | 91.10 | rs2228467 | 7.85 | 1.0 | 1.0 | 1.84E-82 | |||||
| rs115667394 | 2.76 | 0.31 | 0.0 | 0.02 | |||||||
| rs1427803 | 2.13 | 0.10 | 0.03 | 4.59E-26 | |||||||
| rs10886430 | 73.26 | rs10886430 | 13.76 | 1.0 | 1.0 | 1.23E-75 | GH10G119249 | ||||
| rs10886437 | 3.45 | 0.60 | 0.65 | 5.05E-51 | |||||||
| rs223896 | 53.73 | rs4396523 | 2.94 | 0.30 | 0.11 | 2.30E-23 | GH16G057409 | ||||
| rs223897 | 2.92 | 0.29 | 0.53 | 3.51E-19 | GH16G057409 | ||||||
| rs34379253 | 4.09 | 0.86 | 0.03 | 4.38E-13 | GH16G057409 | ||||||
| VEGFA | rs7767396 | 278.31 | rs9369421 | 3.70 | 0.75 | 0.0 | 0.001 | intergenic | GH06G043953 | ||
| rs73422214 | 12.65 | 1.0 | 0.07 | 1.70E-29 | intergenic | GH06G043953 | |||||
| rs4481426 | 3.74 | 0.77 | 0.77 | 4.95E-194 | intergenic | GH06G043953 | |||||
| rs11789392 | 132.60 | rs11789392 | 5.12 | 0.98 | 1.0 | 1.22E-73 | intergenic | ||||
| rs2219143 | 4.97 | 0.97 | 0.0 | 10.8E-58 | GH09G002620 | ||||||
| rs10812148 | 3.15 | 0.38 | 0.01 | 4.33E-16 |
a. from Ahola-Olli et al[17]
b. from Suhre et al[16]
c. from choi et al[20].
Fig 4Annotation of the region indexed by rs62224256 associated with S100B.
The top panel shows the regional plot. P values bellow 1x10-100 were censored at 1x10-100 for the clearness of illustration. Genes located in this region are shown in the middle panel. The sub-region contains rs662224256 is zoomed in approximately. Two enhancers are represented by the black and red bars. Genes regulated by the enhancers are underscored by red line and shaded bar when they are regulated by both enhancers. The log10 Bayesian Factor (LBF), posterior inclusion probability(PIP) of being included in the causal set and association p values (P) scales are shown in the same order as SNP rs-numbers. The genomic coordinates (build hg19) of SNPs and enhancers are shown on the lower-left panel.