| Literature DB >> 34065289 |
Sueva Cantalupo1,2, Vito Alessandro Lasorsa1,2, Roberta Russo1,2, Immacolata Andolfo1,2, Giuseppe D'Alterio2, Barbara Eleni Rosato1,2, Giulia Frisso1,2, Pasquale Abete3, Gian Marco Cassese3, Giuseppe Servillo4, Ivan Gentile5, Carmelo Piscopo6, Matteo Della Monica6, Giuseppe Fiorentino7, Giuseppe Russo8, Pellegrino Cerino9, Carlo Buonerba9, Biancamaria Pierri9,10, Massimo Zollo1,2, Achille Iolascon1,2, Mario Capasso1,2.
Abstract
Genome-wide association studies (GWAS) found locus 3p21.31 associated with severe COVID-19. CCR5 resides at the same locus and, given its known biological role in other infection diseases, we investigated if common noncoding and rare coding variants, affecting CCR5, can predispose to severe COVID-19. We combined single nucleotide polymorphisms (SNPs) that met the suggestive significance level (P ≤ 1 × 10-5) at the 3p21.31 locus in public GWAS datasets (6406 COVID-19 hospitalized patients and 902,088 controls) with gene expression data from 208 lung tissues, Hi-C, and Chip-seq data. Through whole exome sequencing (WES), we explored rare coding variants in 147 severe COVID-19 patients. We identified three SNPs (rs9845542, rs12639314, and rs35951367) associated with severe COVID-19 whose risk alleles correlated with low CCR5 expression in lung tissues. The rs35951367 resided in a CTFC binding site that interacts with CCR5 gene in lung tissues and was confirmed to be associated with severe COVID-19 in two independent datasets. We also identified a rare coding variant (rs34418657) associated with the risk of developing severe COVID-19. Our results suggest a biological role of CCR5 in the progression of COVID-19 as common and rare genetic variants can increase the risk of developing severe COVID-19 by affecting the functions of CCR5.Entities:
Keywords: CCR5; COVID-19; GWAS; SARS-CoV-2; SNP; whole exome sequencing
Year: 2021 PMID: 34065289 PMCID: PMC8161088 DOI: 10.3390/ijms22105372
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1CCR5 is highly expressed in lung and bronchus. (A) Gene expression data of 17,382 samples categorized in 30 different normal tissues obtained from GTEx database and (B) of 353 samples categorized in 69 different normal tissues obtained from Roth et al. study. The gray boxes highlight tissues from lung and bronchus. Data are plotted by using R2: Genomics Analysis and Visualization Platform (http://r2.amc.nl, accessed on 10 May 2021).
Figure 2rs9845542, rs12639314, and rs35951367 SNPs are in LD in the European population, and all the minor alleles are associated with low expression of CCR5 in lung. (A) Heatmap shows LD among the selected SNPs in the European population. (B) Violin plot showing the effect of eQTLs SNPs on CCR5 expression in lung. NES: normalized effect size.
Figure 3rs35951367 is located in the genomic region with the highest level of interaction with CCR5 in lung. For all SNPs, the level of interaction with CCR5 as Distance Normalized Interaction Frequency (DNIF) in all tissues is shown. (A) DNIF of rs9845542; (B) DNIF of rs12639314 and (C) DNIF for rs35951367. The highest levels of interaction with CCR5 in lung were found for the (C) rs35951367 SNP. Data were retrieved from the 3DIV database (http://kobic.kr/3divv1/, accessed on 10 May 2021).
Figure 4Genomic view of the region surrounding the SNPs. The figure shows the transcription factor binding sites (TFBSs) within the region surrounding the SNPs. (A) Genomic view of the SNP localization. (B) Zoom-in view showing, from top to bottom, the SNP localization, the candidate SNPs, and the TFBSs of the ENCODE catalogue (v3) belonging to the IMR-90 and NHLF lung cell lines.
Association of common regulatory variant rs35951367 with severe COVID-19 in the Italian cohort.
| Italian Cases | Italian Controls |
| OR (CI: 95%) | |||
|---|---|---|---|---|---|---|
| rs35951367 | n | % | n | % | ||
| Genotype | ||||||
| TT | 120 | 59 | 625 | 67 | - | |
| TC | 72 | 36 | 266 | 29 | 0.038 | 1.410 (1.018–1.952) |
| CC | 10 | 5 | 38 | 4 | 0.39 | 1.371 (0.665–2.826) |
| Allele | ||||||
| T | 312 | 77 | 1516 | 82 | - | |
| C | 92 | 23 | 342 | 18 | 0.043 | 1.307 (1.007–1.696) |
| Dominant | ||||||
| TT/TC | 192 | 95 | 891 | 96 | - | |
| CC | 10 | 5 | 38 | 4 | 0.39 | 1.371 (0.665–2.826) |
| Recessive | ||||||
| TT | 120 | 59 | 625 | 67 | - | |
| TC/CC | 82 | 41 | 304 | 33 | 0.032 | 1.405 (1.028–1.920) |
OR: odd ratio; CI: confidence interval.
Coding variants found in CCR5.
| Sample ID | Chr Position (hg19) | Ref | Alt | Func. Ref Gene | AA Change (CCR5: NM_000579) | SNP | CADD |
|---|---|---|---|---|---|---|---|
| R- 16 | chr3:46415061 | G | A | nonsynonymous SNV | exon3: c. G668A: p. R223Q | rs1800452 | 22.5 |
| R- 68 | chr3:46414611 | C | T | nonsynonymous SNV | exon3: c. C218T: p. A73V | rs56198941 | 31 |
| R- 135 | chr3:46415073-46415075 | AGA | - | nonframeshift deletion | exon3: c. 680_682del: p. K229del | rs774845977 | / |
| R- 30 | chr3:46414784 | G | T | nonsynonymous SNV | exon3: c. G391T: p. V131F | rs34418657 | 29.8 |
| R- 60 | chr3:46414784 | G | T | nonsynonymous SNV | exon3: c. G391T: p. V131F | rs34418657 | 29.8 |
| R- 72 | chr3:46414784 | G | T | nonsynonymous SNV | exon3: c. G391T: p. V131F | rs34418657 | 29.8 |
Chr: chromosome; Ref: reference and non-effect allele; Alt: alternative and effect allele.
Association of rare coding variant rs34418657 with severe COVID-19 in the Italian cohort.
| Italian Cases | Italian Controls |
| OR (CI: 95%) | |||
|---|---|---|---|---|---|---|
| rs34418657 | n | % | n | % | ||
| Genotype | ||||||
| GG | 217 | 98 | 1079 | 99.5 | - | |
| GT | 4 | 2 | 5 | 0.5 | 0.027 | 3.978 (1.060–14.933) |
| TT | 0 | - | 0 | - | ||
| Allele | ||||||
| G | 438 | 99 | 2163 | 99.8 | - | |
| T | 4 | 1 | 5 | 0.2 | 0.116 | 3.951 (1.057–14.771) |
| Recessive | ||||||
| GG | 217 | 98.2 | 1079 | 99.5 | - | |
| GT/TT | 4 | 1.8 | 5 | 0.5 | 0.027 | 3.978 (1.060–14.933) |
OR: odds ratio; CI: confidence interval.