| Literature DB >> 35571054 |
Nirmal Vadgama1,2, Alexander Kreymerman3,4, Jackie Campbell5, Olga Shamardina6, Christiane Brugger7, Genomics England Research Consortium8, Alexandra M Deaconescu7, Richard T Lee4, Christopher J Penkett6, Casey A Gifford2, Mark Mercola3, Jamal Nasir5, Ioannis Karakikes1.
Abstract
There is considerable variability in the susceptibility and progression for COVID-19 and it appears to be strongly correlated with age, gender, ethnicity and pre-existing health conditions. However, to our knowledge, cohort studies of COVID-19 in clinically vulnerable groups are lacking. Host genetics has also emerged as a major risk factor for COVID-19, and variation in the ACE2 receptor, which facilitates entry of the SARS-CoV-2 virus into the cell, has become a major focus of attention. Thus, we interrogated an ethnically diverse cohort of National Health Service (NHS) patients in the United Kingdom (United Kingdom) to assess the association between variants in the ACE2 locus and COVID-19 risk. We analysed whole-genome sequencing (WGS) data of 1,837 cases who were tested positive for SARS-CoV-2, and 37,207 controls who were not tested, from the UK's 100,000 Genomes Project (100KGP) for the presence of ACE2 coding variants and extract expression quantitative trait loci (eQTLs). We identified a splice site variant (rs2285666) associated with increased ACE2 expression with an overrepresentation in SARS-CoV-2 positive patients relative to 100KGP controls (p = 0.015), and in hospitalised European patients relative to outpatients in intra-ethnic comparisons (p = 0.029). We also compared the prevalence of 288 eQTLs, of which 23 were enriched in SARS-CoV-2 positive patients. The eQTL rs12006793 had the largest effect size (d = 0.91), which decreases ACE2 expression and is more prevalent in controls, thus potentially reducing the risk of COVID-19. We identified three novel nonsynonymous variants predicted to alter ACE2 function, and showed that three variants (p.K26R, p. H378R, p. Y515N) alter receptor affinity for the viral Spike (S) protein. Variant p. N720D, more prevalent in the European population (p < 0.001), potentially increases viral entry by affecting the ACE2-TMPRSS2 complex. The spectrum of genetic variants in ACE2 may inform risk stratification of COVID-19 patients and could partially explain the differences in disease susceptibility and severity among different ethnic groups.Entities:
Keywords: ACE2; COVID-19; SARS-CoV-2; eQTLs; genetics; variants
Year: 2022 PMID: 35571054 PMCID: PMC9091502 DOI: 10.3389/fgene.2022.888025
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1SARS-CoV-2 infection numbers and hospitalisation rate per ethnic group in the United Kingdom population. (A) The proportion of individuals in each population of the entire 100KGP cohort was assigned with a probability of >0.8 for any one ancestry. (B) Percentage of patients who tested positive for SARS-CoV-2 presented as positive cases in each ethnic background relative to the total positive cases in the 100KGP. (C) Percentage of hospitalised patients per total positive individuals in each ethnic group. p values are calculated using the Chi-square test.
FIGURE 2Allelic frequency of nonsynonymous ACE2 variants in different ethnicities. (A) Allelic frequency of ACE2 coding variants present in individual ethnicities, graphed as a log base 2-fold change relative to the allelic frequency in 100KGP unrelated controls (n = 37,207). (B) Allelic frequency of ACE2 coding variants in individual ethnicities that increase SARS-CoV-2 S protein binding affinity for ACE2. (C) Allelic frequency of ACE2 coding variants in separate ethnicities that decrease SARS-CoV-2 S protein binding affinity for ACE2. All heat maps are graphed with values below and above zero set relative to the dataset. Shades of red indicate increased allelic frequency and green decreased allelic frequency, with lighter shades appearing as values move closer to no appreciated change relative to the 100KGP allelic frequencies, as indicated by white. Coding variants displayed by protein location and amino acid change.
Characterisation of nonsynonymous ACE2 variants in SARS-CoV-2 patients. Intronic, intergenic and synonymous variants are considered non-pathogenic and therefore excluded from the list. The index score is based on the type of variant, the REVEL score, and the GERP++ score. An index score of 1 signifies a nonsynonymous variant with a corresponding REVEL score <0.5, suggesting low pathogenicity. An index score of 2 indicates a nonsynonymous variant with a REVEL score of >0.5, GERP++ score >2, CADD score of >20 and PolyPhen2 prediction outcome of “probably damaging”, suggesting high pathogenicity. Six variants were not reported in public databases (p.V316L, p.H345R, p.H401R, p.M455V, p.P737L, p.Y515N). Deep mutagenesis experimental data show p.K26R, p.H378R and p.Y515N altered binding affinity with S protein (Chan et al., 2020). This is consistent with our protein interaction analysis. Mutations are ranked according to their potential to affect protein stability and activity, denoted by asterisks (* = low, ** = medium, *** = high).
| Variant | rsID | AA change | Index | Protein modelling | S Protein binding affinity in RBD | Minor allele frequency within 100KGP | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Combined | EUR | SAS | AFR | AMR | EAS | ||||||
| chrX:15564123G > A | novel | p.P737L | 1 | — | NA | 1.74E-05 | 2.03E-05 | 0 | 0 | 0 | 0 |
| chrX:15564142G > A | rs147311723 | p.L731F | 1 | — | NA | 6.63E-04 | 6.09E-05 | 0 | 1.81E-02 | 0 | 0 |
| chrX:15564175T > C | rs41303171 | p.N720D | 1 | — | NA | 2.34E-02 | 2.64E-02 | 6.09E-03 | 2.58E-03 | 9.21E-03 | 1.98E-03 |
| chrX:15566293A > G | rs149039346 | p.S692P |
| — | NA | 4.06E-05 | 0 | 0 | 6.71E-03 | 0 | 0 |
| chrX:15572271C > T | rs763593286 | p.A532T | 1 | * | NA | 5.23E-05 | 0 | 5.89E-04 | 0 | 0 | 0 |
| chrX:15572322A > T | novel | p.Y515N |
| ** | Decrease | 0 | 0 | 0 | 0 | 0 | 0 |
| chrX:15573419A > G | rs961921482 | p.Y497H | 1 | * | NA | 3.49E-05 | 2.03E-05 | 0 | 0 | 0 | 1.98E-03 |
| chrX:15575745T > C | novel | p.M455V | 1 | ** | NA | 5.23E-05 | 2.03E-05 | 1.96E-04 | 5.17E-04 | 0 | 0 |
| chrX:15578124A > G | rs1352508510 | p.I421T | 1 | * | NA | 1.74E-05 | 0 | 0 | 0 | 1.84E-03 | 0 |
| chrX:15578184T > C | novel | p.H401R |
| ** | Negligible | 1.74E-05 | 0 | 0 | 5.17E-4 | 0 | 0 |
| chrX:15578253T > C | rs142984500 | p.H378R |
| *** | Increase | 1.92E-04 | 2.23E-04 | 0 | 0 | 0 | 0 |
| chrX:15581257T > C | novel | p.H345R |
| ** | Negligible | 1.74E-05 | 2.03E-05 | 0 | 0 | 0 | 0 |
| chrX:15581269T > C | rs138390800 | p.K341R | 1 | * | NA | 2.27E-04 | 0 | 0 | 6.71E-03 | 0 | 0 |
| chrX:15581345C > A | novel | p.V316L | 1 | * | NA | 1.74E-05 | 0 | 1.96E-04 | 0 | 0 | 0 |
| chrX:15587768G > A | rs200745906 | p.P263S |
| * | NA | 1.92E-04 | 2.23E-04 | 0 | 0 | 0 | 0 |
| chrX:15589385G > A | rs372272603 | p.R219C | 1 | * | NA | 9.42E-04 | 1.04E-03 | 5.89E-04 | 0 | 0 | 0 |
| chrX:15589409C > T | rs148771870 | p.G211R | 1 | * | NA | 2.14E-03 | 2.35E-03 | 1.37E-03 | 0 | 0 | 0 |
| chrX:15600835T > C | rs4646116 | p.K26R | 1 | * | Increase | 6.42E-03 | 7.10E-03 | 2.75E-03 | 1.55E-03 | 1.84E-03 | 0 |
FIGURE 3Interaction diagrams of ACE2 variants and SARS-CoV-2 S-protein. (A) Crystal structure of ACE2 and SARS-Cov-2 S protein (pdb identifier: 6LZG) with ACE2 coloured in red-orange and the RBD domain of the SARS-CoV-2 S protein in light blue. Regions of interest are highlighted and expanded in panel (B), showing mutated residues M455V, Y515N, H345R, H378R and H401R with wild-type sequences (left) and mutated sequences (right). Hydrogen bonds are shown in green, Zinc ions are shown in purple, and water molecules are shown in red.
FIGURE 4Statistically significant eQTLs in SARS-CoV-2 positive individuals. Displayed by total positive individuals (value on the left) and by ethnic background (value on the right), values are presented as log base 2-fold change relative to the allelic frequency within non-positive individuals found in the 100KGP (n = 37,207). eQTLs are also subdivided by effects on ACE2 expression, with eQTLs known to associate with increased ACE2 expression marked by a red arrow and those associated with decreased ACE2 expression marked by a green arrow. All displayed eQTLs are significant based on chi-squared test, and yellow highlighted eQTLs had Cohen’s values above 0.2. All eQTLs are referenced by rs identifiers.
FIGURE 5Schematic representation of the ACE2 gene with variants affecting risk. Genomic structure is based on the Ensembl canonical transcript (ENST00000252519.8), which has 18 exons, a transcript length of 3,339 bps, and a translation length of 805 residues. Exons are shown as blue boxes, the 5′ and 3′ UTRs are shown as grey boxes, and the horizontal black dashed line represents the introns. The eQTLs that increase and decrease ACE2 expression levels are depicted as green and red boxes, respectively. Only eQTLs with a Cohen’s d effect size >0.2 are shown. Missense and splicing variants are shown with their CDS and protein position. The locations of the variants associated with increased and decreased COVID-19 susceptibility are shown above and below the gene, respectively.