| Literature DB >> 33949668 |
Yunlong Ma1, Yukuan Huang1, Sen Zhao2, Yinghao Yao3, Yaru Zhang1, Jia Qu1, Nan Wu2, Jianzhong Su1,3.
Abstract
The systematic identification of host genetic risk factors is essential for the understanding and treatment of coronavirus disease 2019 (COVID-19). By performing a meta-analysis of two independent genome-wide association summary datasets (N = 680 128), a novel locus at 21q22.11 was identified to be associated with COVID-19 infection (rs9976829 in IFNAR2-IL10RB, odds ratio = 1.16, 95% confidence interval = 1.09-1.23, P = 2.57 × 10-6). The rs9976829 represents a strong splicing quantitative trait locus for both IFNAR2 and IL10RB genes, especially in lung tissue (P = 1.8 × 10-24). Integrative genomics analysis of combining genome-wide association study with expression quantitative trait locus data showed the expression variations of IFNAR2 and IL10RB have prominent effects on COVID-19 in various types of tissues, especially in lung tissue. The majority of IFNAR2-expressing cells were dendritic cells (40%) and plasmacytoid dendritic cells (38.5%), and IL10RB-expressing cells were mainly nonclassical monocytes (29.6%). IFNAR2 and IL10RB are targeted by several interferons-related drugs. Together, our results uncover 21q22.11 as a novel susceptibility locus for COVID-19, in which individuals with G alleles of rs9976829 have a higher probability of COVID-19 susceptibility than those with non-G alleles.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33949668 PMCID: PMC8136003 DOI: 10.1093/hmg/ddab125
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Figure 1
Meta-analysis of GWAS summary data highlighting susceptibility loci for COVID-19. a) Manhattan plot of the meta-analysis GWAS summary statistics highlighting three susceptibility loci for COVID-19. The Manhattan plot is shown of the meta-GWAS summary statistics of meta-analyzing the COVID_I GWAS data (controlled for potential population stratification) with ANA5 GWAS data. The red horizontal line marks the genome-wide significance threshold of a P value < 5 × 10−8. b) Quantile–quantile (QQ) plot of the meta-analysis GWAS summary statistics. All 8 424 883 high-quality SNPs with a MAF ≥ 1% and imputation R2 ≥ 0.6 were used for plotting. In QQ plot the 2.5th and 97.5th centiles of the distribution under random sampling and the null hypothesis form the 95% concentration band. The genomic inflation factor lambda (λ) is 1.0075. c) Regional association plot for 21q22.11 locus of meta-GWAS summary statistics. Regional association plot is shown for 21q22.11 locus of the meta-GWAS summary statistics of meta-analyzing the COVID_I GWAS data (controlled for potential population stratification) with ANA5 GWAS data. The purple diamond marks the most strongly associated SNP of rs9976829 with COVID-19. The color illustrates LD information with rs9976829, as shown in the color legend.
Susceptibility loci associated with COVID-19 identified by meta-analysis of GWAS summary data
| SNP | CHR | Position | Loci | ALT | REF | COVID_I | ANA5 | Meta-analysis | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| OR | 95% CI |
| OR | 95% CI |
| OR | 95% CI |
| ||||||
| rs11385942 | 3 | 45 876 459 | 3p21.31 | GA | G | 1.77 | 1.49–2.11 | 1.15 | 1.50 | 1.29–1.74 | 1.10 | 1.61 | 1.43–1.80 | 2.87 |
| rs8176719 | 9 | 136 132 908 | 9q34.2 | TC | T | 1.32 | 1.19–1.46 | 9.93 | 1.10 | 1.01–1.19 | 2.10 | 1.17 | 1.10–1.25 | 4.76 |
| rs657152 | 9 | 136 139 265 | 9q34.2 | A | C | 1.33 | 1.20–1.47 | 4.95 | 1.07 | 0.99–1.15 | 7.86 | 1.15 | 1.08–1.21 | 5.53 |
| rs9976829 | 21 | 34 614 834 | 21q22.11 | G | A | 1.18 | 1.07–1.32 | 1.77 | 1.15 | 1.06–1.24 | 3.58 | 1.16 | 1.09–1.23 | 2.57 |
Note: CHR = chromosome, OR = odds ratio, 95% CI = 95% confidence interval, ALT = Altered allele, REF = Reference allele, COVID_I = COVID_I GWAS summary statistics (Dataset #1), ANA5 = ANA5 GWAS summary statistics (Dataset #2). The meta-analysis data were based on the combination of COVID_I GWAS summary data (controlled for potential population stratification) with ANA5 GWAS summary data.
Figure 2
Circus plot showed the results of gene-based association analysis. Note: The inner ring shows the 22 autosomal human chromosomes (Chr1–22) and X chromosome (Chr23). A circular symbol in the outer ring represents a gene. Color indicates the statistical significance of genes (red marks genes significantly associated with COVID-19 with FDR < 0.05, yellow indicates genes with 1.86 × 10−5 < P ≤ 1 × 10−3, green marks genes with 1 × 10−3 < P ≤ 0.05, and gray represents genes with P > 0.05).
Significant genes associated with COVID-19 identified by MAGMA gene-based association analysis
| Gene | CHR | Start position | Stop position | Loci | MAGMA on COVID_I | MAGMA on ANA5 | MAGMA on meta-analysis data | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Z score |
| Z score |
| Z score |
| FDR | |||||
|
| 3 | 45 844 808 | 45 977 216 | 3p21.31 | 5.71 | 5.61 | 4.51 | 3.29 | 7.05 | 8.83 | 1.69 |
|
| 3 | 46 042 291 | 46 088 979 | 3p21.31 | 4.82 | 7.13 | 4.14 | 1.71 | 6.03 | 8.41 | 8.03 |
|
| 3 | 45 907 996 | 45 964 667 | 3p21.31 | 4.81 | 7.70 | 3.61 | 1.54 | 5.96 | 1.28 | 8.14 |
|
| 3 | 45 939 391 | 46 057 316 | 3p21.31 | 4.28 | 9.55 | 3.32 | 4.49 | 5.50 | 1.95 | 9.31 |
|
| 3 | 45 776 941 | 45 858 039 | 3p21.31 | 4.05 | 2.60 | 2.72 | 3.26 | 5.11 | 1.58 | 6.03 |
|
| 21 | 34 582 231 | 34 656 831 | 21q22.11 | 3.53 | 2.10 | 3.50 | 2.31 | 5.02 | 2.58 | 7.58 |
|
| 3 | 45 964 973 | 46 009 845 | 3p21.31 | 3.85 | 5.99 | 3.13 | 8.83 | 5.01 | 2.78 | 7.58 |
|
| 9 | 86 562 998 | 86 615 692 | 9q21.32 | 2.02 | 2.16 | 3.18 | 7.42 | 4.18 | 1.46 | 3.48 |
|
| 9 | 86 575 321 | 86 638 989 | 9q21.32 | 2.09 | 1.84 | 3.15 | 8.29 | 4.12 | 1.86 | 3.94 |
|
| 9 | 136 110 563 | 136 170 630 | 9q34.2 | 4.85 | 6.28 | 0.10 | 0.46 | 3.21 | 6.55 | 0.43 |
Note: CHR = chromosome, FDR = False discovery rate, COVID_I = COVID_I GWAS summary statistics (Dataset #1), ANA5 = ANA5 GWAS summary statistics (Dataset #2). The meta-analysis data were based on the combination of COVID_I GWAS summary data (controlled for potential population stratification) with ANA5 GWAS summary data.
Figure 3
Functional enrichment analysis of genes associated with COVID-19. a) Pathway enrichment analysis identified 9 significant or suggestive KEGG pathways enriched by COVID-19-associated genes. b) GO enrichment analysis identified 10 significant or suggestive GO-terms enriched by COVID-19-associated genes. a and b) The green bar represents a suggestive enrichment (P < 0.05), and the orange bar represents a significant enrichment (FDR < 0.05). c) Scatter plot show the consistency of 11 risk genes identified from both MAGMA and S-MultiXcan analysis. The vertical and horizontal dotted lines represent -log10 (P = 0.05). d) In silico permutation analysis of 100 000 times of random selections. This permutation analysis was used to compare the overlapped genes between MAGMA and S-MultiXcan (see Methods). The empirical P value is less than 1 × 10−5.
Figure 4
Expression of IFNAR2 and IL10RB among 50 cellular populations from lung tissue. This plot is based on a data set as a part of the Human Lung Atlas, consisting of 50 cell populations across 4 compartments (epithelial, endothelial, stromal, and immune) of lung tissue (x axis). y axis represents the expression level with log transformed count.