| Literature DB >> 35368020 |
Bo Gong1,2,3,4, Lulin Huang2,3,4, Yongquan He1,2,3,4, Wen Xie5,6, Yi Yin2,4, Yi Shi2,3,4, Jialing Xiao2, Ling Zhong2,3,4, Yi Zhang2,3,4, Zhilin Jiang2,3,4, Fang Hao2,3, Yu Zhou2,3,4, Huan Li2, Li Jiang2,3, Xingxiang Yang7, Xiangrong Song8, Yan Kang8, Lin Tuo7, Yi Huang2,3, Ping Shuai1, Yuping Liu1, Fang Zheng9, Zhenglin Yang10,11,12.
Abstract
Critical coronavirus disease 2019 (COVID-19) is associated with high mortality and potential genetic factors have been reported to be involved in the development of critical COVID-19. We performed a genome-wide association study to identify the genetic factors responsible for developing critical COVID-19. 632 critical patients with COVID-19 and 3021 healthy controls from the Chinese population were recruited. First, we identified a genome-wide significant difference of IL-6 rs2069837 (p = 9.73 × 10-15, OR = 0.41) between 437 critical patients with COVID-19 and 2551 normal controls in the discovery cohort. When replicated these findings in a set of 195 patients with critical COVID-19 and 470 healthy controls, we detected significant association of rs2069837 with COVID-19 (p = 8.89 × 10-3, OR = 0.67). This variant surpassed the formal threshold for genome-wide significance (combined p = 4.64 × 10-16, OR = 0.49). Further analysis revealed that there was a significantly stronger expression of IL-6 in the serum from patients with critical COVID-19 than in that from patients with asymptomatic COVID-19. An in vitro assay showed that the A to G allele changes in rs2069837 within IL-6 obviously decreased the luciferase expression activity. When analyzing the effect of this variant on the IL-6 in the serum based on the rs2069837 genotype, we found that the A to G variation in rs2069837 decreased the expression of IL-6, especially in the male. Overall, we identified a genetic variant in IL-6 that protects against critical conditions with COVID-19 though decreasing IL-6 expression in the serum.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35368020 PMCID: PMC8976167 DOI: 10.1038/s41392-022-00923-1
Source DB: PubMed Journal: Signal Transduct Target Ther ISSN: 2059-3635
Characteristics of the genotyped samples used for analysis
| Cohort, group | Number of subjects | Age, mean (range) Years | ||
|---|---|---|---|---|
| Total | Male | Female | ||
| Total (Combined) | 3653 | 1778 | 1875 | |
| Discovery | 2988 | 1396 | 1592 | |
| Cases | 437 | 225 | 212 | 58.4(6–93) |
| Controls | 2551 | 1171 | 1380 | 60.7(45–89) |
| Replication | 665 | 382 | 283 | |
| Cases | 195 | 112 | 83 | 61.7(22–87) |
| Controls | 470 | 270 | 200 | 50.6(20–89) |
The age when the cases and controls were recruited
Fig. 1Summary of GWAS results. a Principal component analysis (PCA). b The quantile-quantile (Q-Q) plots of the GWAS results. c Nine loci surpassed genome-wide significance (P < 5 × 10−8) in the combined of discovery and replication results
Association results for selected SNPs in the discovery cohort and replication cohort and all cases and controls combined
| SNP | CHR | BP (hg19) | A1 | F_A | F_U | A2 | P-Discovery | OR-Discovery | F_A | F_U | P-Replication | OR-Replication | P-meta | OR-Meta | Gene | Region |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| rs12995389 | 2 | 6485953 | A | 0.31 | 0.41 | G | 8.04 × 10−8 | 0.66 | 0.38 | 0.40 | 0.464 | 0.91 | 2.35 × 10−7 | 0.72 | intergenic | |
| rs9287675 | 2 | 6494761 | A | 0.24 | 0.32 | G | 5.03 × 10−7 | 0.66 | 0.30 | 0.31 | 0.817 | 0.97 | 3.37 × 10−6 | 0.73 | intergenic | |
| rs7598285 | 2 | 6505563 | G | 0.21 | 0.32 | A | 2.70 × 10−12 | 0.55 | 0.30 | 0.30 | 0.891 | 0.98 | 1.69 × 10−10 | 0.65 | downstream | |
| rs10519086 | 2 | 67262646 | G | 0.02 | 0.08 | T | 5.61 × 10−11 | 0.18 | 0.07 | 0.09 | 0.164 | 0.72 | 6.82 × 10−11 | 0.42 | intergenic | |
| rs72809129 | 2 | 77475663 | A | 0.24 | 0.35 | C | 6.97 × 10−10 | 0.59 | 0.43 | 0.47 | 0.268 | 0.87 | 1.42 × 10−9 | 0.67 | intronic | |
| rs7422259 | 2 | 167377995 | T | 0.04 | 0.11 | G | 1.11 × 10−9 | 0.37 | 0.10 | 0.14 | 0.076 | 0.71 | 3.60 × 10−10 | 0.49 | intergenic | |
| rs4959041 | 6 | 30077967 | C | 0.25 | 0.27 | T | 6.90 × 10−6 | 0.92 | 0.23 | 0.25 | 0.279 | 0.86 | 5.88 × 10−6 | 0.91 | ncRNA_intronic | |
| rs9261445 | 6 | 30094788 | T | 0.23 | 0.26 | G | 4.37 × 10−7 | 0.86 | 0.23 | 0.25 | 0.298 | 0.86 | 5.30 × 10−7 | 0.86 | intergenic | |
| rs9261453 | 6 | 30099948 | G | 0.24 | 0.26 | A | 5.65 × 10−6 | 0.91 | 0.23 | 0.24 | 0.540 | 0.91 | 1.25 × 10−5 | 0.91 | intergenic | |
| rs9261484 | 6 | 30108683 | T | 0.26 | 0.29 | C | 7.14 × 10−6 | 0.89 | 0.24 | 0.26 | 0.492 | 0.91 | 1.33 × 10−5 | 0.89 | intronic | |
| rs9261496 | 6 | 30111014 | T | 0.25 | 0.26 | C | 7.88 × 10−6 | 0.91 | 0.22 | 0.25 | 0.410 | 0.89 | 1.11 × 10−5 | 0.91 | intronic | |
| rs9261502 | 6 | 30111610 | T | 0.24 | 0.26 | A | 4.51 × 10−6 | 0.91 | 0.22 | 0.24 | 0.568 | 0.92 | 1.11 × 10−5 | 0.91 | intronic | |
| rs9261521 | 6 | 30117773 | T | 0.22 | 0.25 | C | 2.61 × 10−8 | 0.81 | 0.24 | 0.25 | 0.860 | 0.97 | 3.17 × 10−7 | 0.85 | intergenic | |
| rs2069837 | 7 | 22768027 | G | 0.09 | 0.20 | A | 9.73 × 10−15 | 0.41 | 0.18 | 0.24 | 8.89 × 10−3 | 0.67 | 4.64 × 10−16 | 0.49 | intronic | |
| rs17158686 | 7 | 83794756 | G | 0.11 | 0.18 | T | 2.38 × 10−7 | 0.54 | 0.16 | 0.20 | 0.072 | 0.75 | 4.99 × 10−8 | 0.61 | intronic |
CHR chromosome, BP base-pair position (hg19), A1 Minor allele code, A2 major allele code, F_A frequency of this allele in cases, F_U frequency of this allele in controls, P logistic p value for association test, OR estimated odds ratio (for A1, i.e. A2 is reference), P-meta p value for METAL combined analysis
Fig. 2The association signal of the rs2069837 LD. a No loci surpassed genome-wide significance (P < 5 × 10−8) between the 437 critical and the 33 asymptomatic cases in the discovery stage of GWAS. b Linkage disequilibrium (LD) plot of 85.6 kb of rs2069837 association with COVID-19 in our imputed GWAS result. c-d r2 of LD plot of rs2069837 in 1000 Genomes Southern Han Chinese (CHS) data (c) and D’ of LD plot of rs2069837 in 1000 G CHS data (d)
Fig. 3IL-6 rs2069837 region decreased its expression in the COVID-19 patients and its sex-specific. a Serum IL-6 concentrations between severe patients (n = 9) and asymptomatic patients (n = 34). b The luciferase activity of IL-6 rs2069837 region and its variation. Representative of four independent experiments with three biological replicates. c Serum IL-6 concentrations are shown according to IL-6 rs2069837 groups, AA type (n = 480), GG type (n = 28). d Baseline of IL-6 in the male (n = 224) and female (n = 284). e IL-6 level are shown according to IL-6 rs2069837 groups in the male, AA type (n = 214), GG type (n = 10). f IL-6 level are shown according to IL-6 rs2069837 groups in the female. AA type (n = 266), GG type (n = 18). Data are mean ± SEM. Significance determined by unpair T test, *P < 0.05, **P < 0.01, ***P < 0.001
Fig. 4A proposed model for the regulatory mechanism of rs2069837 on the expression of IL6 involved in COVID-19. There was higher expression of MEF2 expression and lower GPNMB expression in IL6 rs2069837 AA genotype group, leading to IL6 up-regulation and severe COVID-19 development. IL6 rs2069837 GG genotype could decrease MEF2a binding and increase GPNMB expression, resulting in lower IL6 expression and prevention of severe COVID-19