Literature DB >> 33532652

ACE2 and FURIN variants are potential predictors of SARS-CoV-2 outcome: A time to implement precision medicine against COVID-19.

Fahd Al-Mulla1, Anwar Mohammad2, Ashraf Al Madhoun3, Dania Haddad1, Hamad Ali1,4, Muthukrishnan Eaaswarkhanth1, Sumi Elsa John1, Rasheeba Nizam1, Arshad Channanath1, Mohamed Abu-Farha5, Rasheed Ahmad6, Jehad Abubaker2, Thangavel Alphonse Thanaraj1.   

Abstract

The severity of the new COVID-19 pandemic caused by the SARS-CoV-2 virus is strikingly variable in different global populations. SARS-CoV-2 uses ACE2 as a cell receptor, TMPRSS2 protease, and FURIN peptidase to invade human cells. Here, we investigated 1,378 whole-exome sequences of individuals from the Middle Eastern populations (Kuwait, Qatar, and Iran) to explore natural variations in the ACE2, TMPRSS2, and FURIN genes. We identified two activating variants (K26R and N720D) in the ACE2 gene that are more common in Europeans than in the Middle Eastern, East Asian, and African populations. We postulate that K26R can activate ACE2 and facilitate binding to S-protein RBD while N720D enhances TMPRSS2 cutting and, ultimately, viral entry. We also detected deleterious variants in FURIN that are frequent in the Middle Eastern but not in the European populations. This study highlights specific genetic variations in the ACE2 and FURIN genes that may explain SARS-CoV-2 clinical disparity. We showed structural evidence of the functionality of these activating variants that increase the SARS-CoV-2 aggressiveness. Finally, our data illustrate a significant correlation between ACE2 variants identified in people from Middle Eastern origins that can be further explored to explain the variation in COVID-19 infection and mortality rates globally.
© 2021 Published by Elsevier Ltd.

Entities:  

Keywords:  ACE2; COVID-19; FURIN; Genomic medicine; Middle Eastern populations; Precision medicine; SARS-CoV-2; TMPRSS2; Variants; eQTLs

Year:  2021        PMID: 33532652      PMCID: PMC7843038          DOI: 10.1016/j.heliyon.2021.e06133

Source DB:  PubMed          Journal:  Heliyon        ISSN: 2405-8440


Introduction

The global pandemic COVID-19 caused by SARS-CoV-2 virus is life-threatening and has become a significant concern to humanity. Notably, the severity of this disease is highly variable in different populations across the world [1, 2]. Since the outbreak, several studies have reported specific factors including age, gender, and pre-existing health conditions that could have contributed to the increased severity of the disease [3, 4, 5, 6]. The genetic susceptibility to COVID-19 has also been explored by scrutinizing Angiotensin converting enzyme 2 (ACE2) genetic variations in different populations. ACE2 is the functional receptor mediating entry of SARS-CoV-2 into the host cells [7], which is facilitated by FURIN cleavage [8, 9, 10]. Transmembrane serine protease 2 (TMPRSS2) is another candidate gene that has been linked to COVID-19 disease [10, 11, 12]. TMPRSS2 expression enhances ACE2-mediated SARS-CoV-2 cell invasion by operating as a co-receptor [12]. The increased cleavage activity of this protease was suggested to diminish viral recognition by neutralizing antibodies and by activating SARS spike (S) protein for virus-cell fusion [13] and facilitates the active binding of SARS-CoV-2 through ACE2 receptor, which is a risk factor for a more serious COVID-19 presentation [10, 11, 12]. The hallmark of the novel SARS-CoV-2, as compared to other SARS viruses, is the presence of a polybasic FURIN cleavage site. FURIN has been reported to facilitate the transport of SARS-CoV-2 into or from the host cell [8, 9, 10, 14]. Notably, a recent study has highlighted the presence of a unique functional polybasic FURIN cleavage consensus site between the two spike subunits S1 and S2 by the insertion of 12 nucleotides encoding PRRA in the S protein of SARS-CoV-2 virus [15]. The FURIN-like cleavage-site is cleaved during virus egress, which primes the S-protein providing a gain-of-function for the efficient spreading of the SARS-CoV-2 among humans [16, 17]. It is, therefore, likely that the presence of a deleterious ACE2, TMPRSS2 and FURIN gene variants may modulate viral infectivity among humans, making some people less vulnerable than others. Recent studies, assessed the genetic variations and eQTL (expression quantitative trait locus) expression profiles in the candidate genes ACE2, TMPRSS2, and FURIN to demonstrate the sex and population-wise differences that may influence the pathogenicity of SARS-CoV-2 [3, 11, 18, 19, 20, 21, 22]. It is to be noted that these studies focused only on the European and East Asian populations. Given the extremely high prevalence of obesity (80%), hypertension (28%) and diabetes (20%) of the population in the Gulf states [23, 24, 25] which are considered as risk factors for mortality from COVID-19 [26, 27], the witnessed low infectivity and mortality rates registered in this area of the world are intriguing. Even though this could be due to various factors that are not well accounted for yet such as testing, hot weather or extreme measures taken early on by some countries, it could also be due to ethnic genetic variations in the ACE2-TMPRSS2-FURIN genes that are key regulators for orchestrating SARS-CoV-2 cellular access. As a result, it is crucial to study the variation of these candidate genes ACE2, TMPRSS2 and FURIN in Middle Eastern populations to better understand possible natural genetic components that can be responsible for these differences. Here, we present a comprehensive comparative assessment of deleterious or gain of function mutations of ACE2, TMPRSS2, and FURIN that may have influenced disease progression in the Middle Eastern populations compared to European, African and East Asian populations. While these findings are preliminary and based on our genetic data and worldwide public datasets, they present a compelling perception into the role of naturally occurring genetic variants in ACE2, TMPRSS2, and FURIN in different populations. They can shed light on the reported variations in susceptibility or resistance to SARS-CoV-2 infection in different populations and can be available for other scientists to utilize them in precision medicine.

Materials and methods

Genetic data

The whole-exome sequences data of 473 Kuwaitis [28], 800 Iranians [29] and 100 Qataris [30] published previously were used in this analysis. The genetic data of non-Finnish European, East Asian and African American populations were obtained from the gnomAD repository [31], which contain data on a total of 125,748 exomes and 71,702 genomes (https://gnomad.broadinstitute.org/). The minor allele frequencies of the ACE2 variant, rs41303171 (N720D) for different countries were obtained from 1000 Genomes Project phase 3 [32] and ABraOM [33] databases.

Gene expression data

The expression data for ACE2, TMPRSS2, and FURIN were obtained from the genotype-tissue expression (GTEx) database (https://gtexportal.org/home/). The same database portal was used to extract quantitative trait loci (eQTLs) for the three genes.

Data on case fatality rate (CFR)

Country level numbers of cases and deaths were obtained from the data available from https://covid.ourworldindata.org/data/owid-covid-data.xlsx (accessed on 7 August 2020) and the CFR was calculated as stated in https://ourworldindata.org/mortality-risk-covid i.e., Case Fatality Rate (CFR in %) = (Number of deaths from disease/Number of diagnosed cases of disease) x 100. The country-wise CFR values are available online at https://www.coronatracker.com/country/kuwait/(accessed on 7 August 2020).

Statistical analysis

The missense variants were defined as deleterious when predicted to be damaging, probably damaging, disease causing and deleterious by the five algorithms applied, SIFT [34], PolyPhen-2 HumVar, PolyPhen-2 HumDiv [35], MutationTaster [36] and LRT score [37] and/or CADD (Combined Annotation-Dependent Depletion) score of more than 20 [38]. We considered only deleterious variants with minor allele frequency (MAF) less than 1% in the burden analysis. The significance of the differences in MAFs between different populations was calculated using Chi-Square test, using the R software (https://www.r-project.org/). All the p-values presented in the tables are not corrected for multiple testing. P values ≤ 0.05 were considered significant.

Structural analysis

All the identified ACE2 missense exon variants were mapped, modeled, and analyzed using Pymol modeling software (https://pymol.org/2/). DynaMut web server was used to predict the effect of genetic variants on the stability and flexibility of ACE2 receptor [39, 40].

Results

ACE2 receptor variations in the Middle East

We first examined ACE2 gene variation frequencies in Kuwait, Qatar, East Asia, and Africa where the impact of SARS-CoV-2 has been modest and compared it to Iran, which is moderately affected and then to Europe, the continent with the most deaths per population. Overall, we found human ACE2 gene variations and the probability of loss of function mutations (pLOF = 0.1, C.I. 0.04–0.25) to be low in comparison to ACE (pLOF = 0.87, C.I. 0.71–1.08), which is a gene of similar size, indicating that the ACE2 gene is highly intolerant to loss of function mutations. Additionally, we identified 19 missense variants in the ACE2 gene from Kuwait, Qatar and Iran (Table 1). All were rare variants defined by minor allele frequency (MAF) of less than 1%. The genetic variants included four novel variants from Kuwait, and six from Iran.
Table 1

ACE2 missense variants in the Middle East and gnomAD populations.

Variant IDProteinConsequenceMinor Allele Frequency
Functional Risk Prediction (Scores)
KuwaitIranQatarEuropeEast AsiaAfricaSIFTPP2HVARPP2HDIVMUTTASTERLRTCADD
rs372923812I741V000.006550.0000900.00005T (0.32)B (0.004)B (0.001)N (0.07)N (0.005)0.066
rs4646116K26R0.00209000.005870.000070.00095T (0.47)B (0.001)B (0.0)N (0.31)N (0.83)8.188
rs200540199R716H0.00105000.0001400D (0.02)B (0.277)P (0.771)N (0.03)N (0.207)11.52
rs769062069R708Q00.0018800.0000200D (0.00)D (0.935)D (0.999)D (0.99)D (0.00)24.9
rs776995986R708W0.001050.0058400.0000400D (0.00)D (0.997)D (1.00)D (1.00)D (0.00)22.9
rs765152220D494V00.0006300.0000500D (0.01)D (0.926)D (0.995)D (1.00)D (0.00)31
rs750145841Y199C00.0012500.0000200T (0.08)D (0.984)D (1.0)D (0.98)D (0.0004)26.2
rs759162332Q60R00.0012500.0000300T (0.21)B (0.009)B (0.001)N (0.20)N (0.23)15.34
rs41303171N720D0.003140.006250.002040.0252100.00269T (0.07)B (0.02)B (0.006)N (0.20)N (0.009)14.9
X:15580089I786T00.001880000T (0.18)B (0.088)B (0.297)N (0.00)U (0.001)6.519
X:15582247P737A0.0031400000D (0.01)B (0.210)P (0.489)N (0.00)U (0.953)2.44
X:15585864Q661P0.0010500000D (0.02)B (0.30)P (0.812)N (0.00)U (0.3)22.9
X:15589919F555L0.0010500000T (0.09)B (0.145)B (0.16)N (0.00)U (0.223)8.825
X:15591578V485L00.001880000T (0.10)P (0.56)B (0.028)N (0.00)U (0.63)23.7
X:15593877F452V00.000830000D (0.03)B (0.038)B (0.189)N (0.00)U (0.004)26.3
X:15596394A372G0.0020900000D (0.05)B (0.049)B (0.437)N (0.00)U (0.001)25.4
X:15599413T334M00.001880000T (0.11)B (0.026)B (0.01)N (0.00)U (0.084)12.87
X:15599422S331F00.000830000D (0.00)P (0.944)B (0.006)N (0.00)U (0.344)24.6
X:15607489D225G00.001250000T (0.37)B (0.007)B (0.002)N (0.00)U (0.03)26.1

SIFT (Sorting Intolerant From Tolerant): D = damaging, T = tolerated.

PP2HVAR (PolyPhen-2 Polymorphism Phenotyping v2 HumVar): D = probably damaging, P = possibly damaging, B = benign.

PP2HDIV (PolyPhen-2 Polymorphism Phenotyping v2 HumDiv): D = probably damaging, P = possibly damaging, B = benign.

MUTTASTER (MutationTaster): A = disease causing automatic, D = disease causing, N = polymorphism, P = polymorphism automatic.

LRT (Likelihood Ratio Test): D = deleterious, N = Neutral, U = unknown.

CADD - Combined Annotation Dependent Depletion based scores.

The missense variants were defined as deleterious when predicted to be damaging, probably damaging, disease causing and deleterious by the five algorithms applied (SIFT, PolyPhen-2 HumVar, PolyPhen-2 HumDiv, MutationTaster and LRT score) and/or CADD score of more than 20. We considered only deleterious variants with minor allele frequency less than 1% in the burden analysis.

Denotes Novel variants presented with chromosome: position (based on the human reference genome build GRCh37/hg19).

ACE2 missense variants in the Middle East and gnomAD populations. SIFT (Sorting Intolerant From Tolerant): D = damaging, T = tolerated. PP2HVAR (PolyPhen-2 Polymorphism Phenotyping v2 HumVar): D = probably damaging, P = possibly damaging, B = benign. PP2HDIV (PolyPhen-2 Polymorphism Phenotyping v2 HumDiv): D = probably damaging, P = possibly damaging, B = benign. MUTTASTER (MutationTaster): A = disease causing automatic, D = disease causing, N = polymorphism, P = polymorphism automatic. LRT (Likelihood Ratio Test): D = deleterious, N = Neutral, U = unknown. CADD - Combined Annotation Dependent Depletion based scores. The missense variants were defined as deleterious when predicted to be damaging, probably damaging, disease causing and deleterious by the five algorithms applied (SIFT, PolyPhen-2 HumVar, PolyPhen-2 HumDiv, MutationTaster and LRT score) and/or CADD score of more than 20. We considered only deleterious variants with minor allele frequency less than 1% in the burden analysis. Denotes Novel variants presented with chromosome: position (based on the human reference genome build GRCh37/hg19). We identified four deleterious variants (rs776995986, rs769062069, rs765152220, and rs750145841), causing R708W, R708Q, D494V, Y199C missense amino acid substitutions in the ACE2 gene using risk prediction tools as described in the methods section (Table 1). All the ACE2 gene deleterious variants were absent from the African, East Asian and Qatari data and were very rare in Europeans but were present at MAF of 0.063–0.5% in the Iranian population (Table 1). This suggests a more protective effect and a significant decrease in the disease burden in Iran compared to Europe (p < 0.05; Table 2). The positions of the ACE2 receptor polymorphisms on the linearized ACE2 protein model are shown in Figures 1 and 3 D-models for the same are shown in Figures S1 and S2. It is noteworthy that none of the ACE2 polymorphisms identified in this study involved the three ACE2 regions known to directly bind the SARS-CoV-2 S-Protein Receptor Binding Domain (RBD), namely amino acids 30–41, 82–84, and 353–357 (Figure 1).
Table 2

Burden of ACE2 rare variants in the Middle East and gnomAD populations.

Variant IDProteinConsequenceMinor Allele Frequency
P value
KWTIRNQAREUREASAFRKWT vs IRNKWT vs EURIRN vs EUR
rs776995986R708W1.05E-035.64E-030.003.79E-050.000.000.10840.05150.0005
rs769062069R708Q0.001.88E-030.001.46E-050.000.00____0.0005
rs765152220D494V0.006.25E-040.005.37E-050.000.00____0.0995
rs750145841Y199C0.001.25E-030.002.00E-050.000.00____0.0015

The missense variants were defined as deleterious when predicted to be damaging, probably damaging, disease causing and deleterious by the five algorithms applied (SIFT, PolyPhen-2 HumVar, PolyPhen-2 HumDiv, MutationTaster and LRT score) and/or CADD score of more than 20. We considered only deleterious variants with minor allele frequency less than 1% in the burden analysis. P values are calculated using Chi-square test.

KWT-Kuwaitis; IRN-Iranians; QAR-Qataris; EUR-Europeans (non-Finnish); EAS-East Asians; AFR-Africans.

Figure 1

The positions of the ACE2 receptor polymorphisms on the linearized ACE2 protein. The translated protein contains an N-terminal signal sequence (1–17), single catalytic domain (18–740) with zinc-binding motif (HEMGH 374–378), a transmembrane region (741–761), a small C-terminal cytosolic domain (762–805).

Figure 3

(A)ACE2 dimer complex (promoter 1 Blue), (promoter 2 white). The peptidase domain (PD) is the site where the SARS-CoV-2 S protein ribosomal domain (RBD) binds. The C-terminal collectrin-like domain (CLD) is the region where the TMPRSS2 cleaves ACE2. (B) Single domain of the ACE2 mutation N720D, the red region of the protein depicted the more flexible region of the protein due to the N720D mutation with a (ΔΔG): -0.470 kcal/mol and ΔΔS: 0.070 kcal mol−1.K−1 (increase of molecule flexibility). (C) Depicts N720 backbone NH forming a H-bond with N718 and E723 forming an NH–NH bond with S721. (D) D720 backbone NH forming a H-bond with N718 and COO- group of D720 forming a weak polar interaction with L722 backbone.

Burden of ACE2 rare variants in the Middle East and gnomAD populations. The missense variants were defined as deleterious when predicted to be damaging, probably damaging, disease causing and deleterious by the five algorithms applied (SIFT, PolyPhen-2 HumVar, PolyPhen-2 HumDiv, MutationTaster and LRT score) and/or CADD score of more than 20. We considered only deleterious variants with minor allele frequency less than 1% in the burden analysis. P values are calculated using Chi-square test. KWT-Kuwaitis; IRN-Iranians; QAR-Qataris; EUR-Europeans (non-Finnish); EAS-East Asians; AFR-Africans. The positions of the ACE2 receptor polymorphisms on the linearized ACE2 protein. The translated protein contains an N-terminal signal sequence (1–17), single catalytic domain (18–740) with zinc-binding motif (HEMGH 374–378), a transmembrane region (741–761), a small C-terminal cytosolic domain (762–805). Next, we examined whether natural ACE2 gene variations that increase the affinity of ACE2 to the S-protein or facilitate viral entry/viral load exist more frequently in high-burden compared to low-burden populations. Two such genetic variants existed in our data (Table 1). The first, rs4646116, is a missense variant that changes a lysine amino acid at position 26 to arginine (K26R) (Table 1). The K26, which is just proximal to the first region of the ACE2 receptor involved in S-protein binding, has been shown previously to bind the sterically hindering first mannose in the glycan that is linked to N90 and thus stabilizes the glycan moiety hindering the binding of S-protein RBD to ACE2 [41] (Figure 2A). The missense variant R26 creates a new hydrogen bond with D30, which is then poised to build a salt-bridge with the S-protein RBD K417 that increases the affinity of SARS-CoV-2 to the ACE2 receptor [21] (Figure 2B). Indeed, the ACE2 K26R activating variant was extremely rare in East Asian (MAF = 0.007%), Africans (MAF = 0.095%), but the second most common variant in Europeans with MAF of 0.587% (shown in green fonts in Table 1). The MAF of this variant in the Kuwaiti population was nearly half that of Europeans (MAF = 0.29%), and it was absent from the Qatari and Iranian exome data (Table 1). Our structural modeling supports the notion that K26R is an ACE2 receptor activating variant (Figure 2A, B). Consistent with these findings, using a synthetic human ACE2 mutant library, a recent study reported that the R26 variant increased S-protein binding and susceptibility to the virus significantly [42].
Figure 2

K26R polymorphism in ACE2 mapped to the structure of ACE2, green; in complex with S protein RBD, yellow (PDB ID: 6VW1). (A) In native ACE2, K26, magenta, forms two H-bonds with the mannose moiety, cyan, of ACE2 N90 which may stabilize the glycan-ACE2 complex. (B) The mutant R26, magenta, forms one H-bond with mannose. A process that could provoke the moiety-ACE2 stability and increases the affinity of the ACE2 a-helix to S protein RBD binding, where, R26 functions as a backbone and interacts with D30 which in turn dignified to build a salt-bridge with the S protein RBD K417, yellow stick.

K26R polymorphism in ACE2 mapped to the structure of ACE2, green; in complex with S protein RBD, yellow (PDB ID: 6VW1). (A) In native ACE2, K26, magenta, forms two H-bonds with the mannose moiety, cyan, of ACE2 N90 which may stabilize the glycan-ACE2 complex. (B) The mutant R26, magenta, forms one H-bond with mannose. A process that could provoke the moiety-ACE2 stability and increases the affinity of the ACE2 a-helix to S protein RBD binding, where, R26 functions as a backbone and interacts with D30 which in turn dignified to build a salt-bridge with the S protein RBD K417, yellow stick. We also subjected novel and known ACE2 variants to structural predictions that may impact the binding of SARS-CoV-2 to the host cells. These changes (A372G, Y199C, D225G, F452V and V485L) were located proximal to the protein residues that mediate its activity. A previous study indicated that the three amino acid (aa) regions 30–41, especially the residue near lysine 31 and tyrosine 41, 82–84 and 353–357 in ACE2 were essential for the binding of S-protein in coronavirus [43]. Our structural estimation based on the 19 ACE2 missense variants identified in Kuwaitis, Qataris and Iranians (Table 1), revealed the location of K26R variant near the aa residues 30–41 and the novel variants, S331F and T334M, adjacent to tyrosine at 353–357. The A372G is adjacent to the zinc-binding domain (HEMGH 374–378) critical for ACE2 enzymatic activity [44]. While Y199C, D225G, F452V, and V485L all fall within the metallocarboxypeptidase domain (aa 18–617) [45] (Figure 1).

ACE2 N720D receptor variation and structural modeling

The second activating, and by far the most common ACE2 gene variant in Europeans (MAF = 2.52%) and Italians (MAF = 1.6%; detected in 105 of 6984 exomes) [20] was rs41303171, which replaces the amino acid asparagine at position 720 to aspartic acid (N720D) (Green font in Table 1). This ACE2 variant was absent from the East Asian population (13,784 exomes) and was significantly rarer in the Middle East and Africa (Table 1). This particular variant has been reported before, but its clinical relevance was persistently dismissed because the codon 720, being far from ACE2-Spike protein interface, does not appear to be an obvious candidate for ACE2 receptor binding to the S-protein of SARS-CoV-2 [21, 46]. However, we noted that N720D is located 4 amino acids proximal to the TMPRSS2 cleavage site (aa 697–716) as shown in Figure 1. Recent studies demonstrated that TMPRSS2 cleavage of the ACE2 receptor increases SARS-CoV-2 cellular entry [47, 48]. It is not unreasonable to suggest that this activating variant may play a similar role in SARS-CoV-2, rendering people who harbor it more prone to severe infection and higher viral load. ACE2 cleavage by TMPRSS2 enhances the S-protein viral entry [7]. TMPRSS2 cleaves ACE2 between residues 697 and 716, which is the third and fourth helices in the C-terminal collectrin neck domain of the dimer interface of ACE2 [47]. To further dissect the mechanisms underlying the enhanced ACE2-TMPRSS2 accessibility, we performed structural analysis of the recently published ACE2 domain bound to B0AT1 (PDB ID:6M18) [7] and showed that N720 is located on the same interface of the loop region in close vicinity to the TMPRSS2 cleavage site (Figure 3A). Since loop region of protein are unordered and display conformational dynamics, a mutation close to the cleavage site can affect the binding affinity of TMPRSS2. Therefore, we used DynaMut web server to predict the effect of mutation N720D on the stability and flexibility of ACE2 [39,40]. Whereby, the predicted stability change was (ΔΔG): -0.470 kcal/mol, which indicated the destabilization of ACE2 receptor after the introduction of D720 (Figure 3B). The N720D mutation has resulted in an increase in entropy in the loop region (ΔΔSVib: 0.070 kcal mol−1.K−1) [39] (Figure 3B), depicting a more unstable state, which makes ACE2 more readily cleavable by TMPRSS2. (A)ACE2 dimer complex (promoter 1 Blue), (promoter 2 white). The peptidase domain (PD) is the site where the SARS-CoV-2 S protein ribosomal domain (RBD) binds. The C-terminal collectrin-like domain (CLD) is the region where the TMPRSS2 cleaves ACE2. (B) Single domain of the ACE2 mutation N720D, the red region of the protein depicted the more flexible region of the protein due to the N720D mutation with a (ΔΔG): -0.470 kcal/mol and ΔΔS: 0.070 kcal mol−1.K−1 (increase of molecule flexibility). (C) Depicts N720 backbone NH forming a H-bond with N718 and E723 forming an NH–NH bond with S721. (D) D720 backbone NH forming a H-bond with N718 and COO- group of D720 forming a weak polar interaction with L722 backbone. In addition, we modelled in the various non-covalent interactions for both N720 and mutant D720 and other amino acids in the loop region (Figure 3C, D). In Figure 3C, N720 formed a backbone hydrogen-bond with N718, this conformation also resulted in an NH–NH between E723 and S721. Whereas, with the activating variant D720 (Figure 3D) the back-bone hydrogen bond with N718 is still established, however, the conformational change has resulted in a polar interaction between the backbone D720 COO- group and backbone NH of L722, forming a weak polar interaction (through water-mediated hydrogen bond). As such, the D720 variant altered the conformation of the loop, and the NH–NH bond between E723 and S721 cannot be formed. Whereby, such intermolecular amide interactions are significant for protein stability [49]. Such a change in interatomic interactions between amino acids near the cleavage region can decrease the stability and increase the flexibility of the loop [50], which makes it easier for TMPRSS2 to cleave. To gain insights that are pathologically and clinically relevant, we then asked whether a correlation exists between the N720D MAF data and mortality rates reported in the corresponding regions. We identified a significant correlation between N720D MAF and case fatality rate reported from countries of different continents (Figure 4) with Pearson's correlation coefficient of 69.44% (p < 0.0003; C.I. 0.3953–0.8603).
Figure 4

The plot shows a significant correlation between N720D minor allele frequency (MAF) and case fatality rate reported from countries of different continents with Pearson's correlation coefficient of 69.44% (p < 0.0003; C.I. 0.3953–0.8603).

The plot shows a significant correlation between N720D minor allele frequency (MAF) and case fatality rate reported from countries of different continents with Pearson's correlation coefficient of 69.44% (p < 0.0003; C.I. 0.3953–0.8603). Next, we examined ACE2 eQTL variants in the Middle Eastern populations (Table S1). There were few variants in the Kuwaiti and Iranian populations that influenced ACE2 expression. In the Qatari population the variants modifying ACE2 gene expression were similar in frequencies to Europeans (Table S1) with most upregulating ACE2 expression in the brain and tibial nerve tissues (https://gtexportal.org/home/). Notably, the tibial nerves were affected in diabetic conditions [51], which inflicts a high number of patients in the Middle East. The most downregulating eQTL variant, rs112171234, was present in 20% and 6% of the African and Qatari populations respectively. Overall, the eQTL data pertaining to ACE2 expression were not significantly predictive nor informative.

TMPRSS2 variation

Five rare and one common deleterious variants were identified in the TMPRSS2 gene in the Middle Eastern population (Figure S3, Tables 2, and S3). Overall, there was no significant conclusion that may be withdrawn from the TMPRSS2 genetic variation data. On the expression level, we discerned four TMPRSS2 eQTL variants, which were detected only in Qataris among the Middle Eastern populations (Table S4). One of these variants, rs6517673 is intronic and downregulates the expression of TMPRSS2 in the prostate. Two of the eQTL variants, rs79391937 and rs79566442, decrease the expression of the TMPRSS2 gene in the thyroid and ovary tissues, respectively. While the variant rs11701542 upregulates the TMPRSS2 gene expression in the testis (https://gtexportal.org/home/). It is worth noting that two-third of the mortality due to COVID-19 disease affects males [18, 52].

FURIN variation

Genetic variation analysis of the FURIN gene resulted in the identification of 13 known missense variants (Table 3 and Figure S4). Like ACE2, all the identified variants were rare in the Middle Eastern populations. However, unlike the ACE2 gene, no novel variants were observed in FURIN in the Middle Eastern population (Table 3). Among the 13, we detected seven deleterious variants suggesting a possible decrease in FURIN protease function, which can potentially reduce the risk of SARS-CoV-2 in the studied populations. In this context, deleterious FURIN gene variations were observed least in East Asians, Africans, followed by Europeans then Iranians (Table 4). Interestingly, both in Qatar and Kuwait the deleterious FURIN genetic variations were more common, suggesting a possible protective effect against the SARS-CoV-2 (p < 0.05; Table 4). For example, the MAF of R37C, and R81C in Kuwait and Qatar were 0.4%/0.7% and 2.4%/3.4% respectively compared to significantly lower MAF in corresponding variants in Europeans (p < 0.05; Table 4). Together, these data support the premise that FURIN gene variants may play important roles in protection against SARS-CoV-2 in the Middle East. It is worth noting, however, that in Africa FURIN gene variants may not be a contributing factor in viral protection (Table 4). We next sought to determine the extent of FURIN expression in the Middle East. We detected 16 FURIN eQTL variants in the Middle Eastern populations, most of which were reported in the Qatar population (Table 5). We observed a high frequency of the FURIN upregulating variants, rs6226 (93%) and rs8039305 (81%) in the African populations compared to the Middle Eastern, European and East Asian populations (p < 0.05) (Table 5). In the GeneATLAS PheWAS database [53], these frequent variants were significantly related to hypertension (rs6226: P = 1.3569e-09, OR = 1.03; rs8039305: P = 1.7643e-38, OR = 1.05), which was one of the common comorbid conditions associated with the increased risk of SARS-CoV-2 infection in Chinese [54, 55] and American [56] populations.
Table 3

FURIN known missense variants in the Middle East and gnomAD populations.

Variant IDProtein ConsequenceMinor Allele Frequency
Functional Risk Prediction (Scores)
KuwaitIranQatarEuropeEast AsiaAfricaSIFTPP2HVARPP2HDIVMUTTASTERLRTCADD
rs201172453R37C0.004180.003750.007000.0000700.00016D (0.01)P (0.586)D (0.995)N (0.000)N (0.052)22.7
rs749761700R37H0.00105000.0000100T (1.00)B (0.001)B (0.001)N (0.000)N (0.052)0.496
rs16944971A43V0.006280.001880.005000.0025500.07835T (0.25)B (0.007)B (0.010)P (0.411)N (0.007)11.82
rs148110342R81C0.024060.007500.034000.0015900.00016T (0.17)P (0.586)D (0.989)D (0.995)N (0.051)23
rs751909359R86Q00.0007000.0000300T (0.31)B (0.028)B (0.244)D (0.999)D (0.000)22.3
rs116359616V109M0.00105000.0000200.02318D (0.03)B (0.010)B (0.083)D (0.635)N (0.168)4.625
rs143276283Q399R0.002090.0006300.0000600T (1.00)B (0.002)B (0.000)D (0.967)N (0.002)18.69
rs750446344V580I0.00314000.0000400T (0.23)B (0.007)B (0.010)N (0.000)N (0.835)0.211
rs752639409R637Q00.000700000.00006T (0.58)B (0.051)B (0.200)N (0.003)N (0.031)21.3
rs761541008R677W00.0007000.0000500T (0.05)B (0.332)P (0.947)N (0.046)D (0.000)27.4
rs193268286S685P0000.0000400T (0.09)D (0.994)D (0.999)D (0.999)D (0.000)26.6
rs760368022H712Y0.00105000.0000100D (0.01)B (0.089)B (0.367)N (0.016)N (0.032)19.66
rs35641241R745Q0000.001180.001020.00008T (0.13)B (0.375)D (0.989)N (0.149)N (0.021)23.4

SIFT (Sorting Intolerant From Tolerant): D = damaging, T = tolerated.

PP2HVAR (PolyPhen-2 Polymorphism Phenotyping v2 HumVar): D = probably damaging, P = possibly damaging, B = benign.

PP2HDIV (PolyPhen-2 Polymorphism Phenotyping v2 HumDiv): D = probably damaging, P = possibly damaging, B = benign.

MUTTASTER (MutationTaster): A = disease causing automatic, D = disease causing, N = polymorphism, P = polymorphism automatic.

LRT (Likelihood Ratio Test): D = deleterious, N = Neutral, U = unknown.

CADD - Combined Annotation Dependent Depletion based scores.

The missense variants were defined as deleterious when predicted to be damaging, probably damaging, disease causing and deleterious by the five algorithms applied (SIFT, PolyPhen-2 HumVar, PolyPhen-2 HumDiv, MutationTaster and LRT score) and/or CADD score of more than 20. We considered only deleterious variants with minor allele frequency less than 1% in the burden analysis.

Table 4

Burden of FURIN rare variants in the Middle East and gnomAD populations.

Variant IDProteinConsequenceMinor Allele Frequency
P value
KWTIRNQAREUREASAFRKWT vs IRNKWT vs QARKWT vs EURKWT vs AFRIRN vs QARIRN vs EURIRN vs AFRQAR vs EURQAR vs AFREUR vs EASEUR vs AFREAS vs AFR
rs201172453R37C0.0040.0040.0077.0E-0501.6E-04110.00050.000510.00050.00050.01200.0610__0.2724__
rs148110342R81C0.0240.0080.0341.6E-0301.6E-040.00150.44680.00050.00050.00550.00100.00050.00050.0005__0.0005__
rs751909359R86Q00.00103.0E-0500__________0.0645____________
rs752639409R637Q00.0010006.0E-05____________0.1709__________
rs761541008R677W00.00105.0E-0500__________0.1000____________
rs35641241R745Q0001.2E-031.0E-038.0E-05__________________0.58570.00050.0005
rs193268286S685P0004.0E-0500________________________

The missense variants were defined as deleterious when predicted to be damaging, probably damaging, disease causing and deleterious by the five algorithms applied (SIFT, PolyPhen-2 HumVar, PolyPhen-2 HumDiv, MutationTaster and LRT score) and/or CADD score of more than 20. We considered only deleterious variants with minor allele frequency less than 1% in the burden analysis. P values are calculated using Chi-square test.

KWT-Kuwaitis; IRN-Iranians; QAR-Qataris; EUR-Europeans (non-Finnish); EAS-East Asians; AFR-Africans.

Table 5

FURIN eQTL variants in the Middle East and gnomAD populations.

Variant IDGeneTissuePGTExNESGTExMinor Allele Frequency
KWTIRNQAREUREASAFR
rs117976310S2VB, LOC101926928Adrenal Gland3.0E-06-1.1NANA0.010.020.000.002
rs12904679BLM, FURINEsophagus - Mucosa3.1E-06-0.2NANA0.120.120.0030.02
rs4932373FESEsophagus - Mucosa6.6E-190.28NANA0.300.330.090.13
rs2238332BLMPituitary2.7E-06-0.17NANA0.360.360.330.70
rs8039305∗FURINEsophagus - Mucosa1.7E-170.23NA0.410.490.480.170.81
rs6226∗FURINArtery - Tibial1.6E-090.120.720.650.690.680.530.93
rs12904055CRTC3Brain - Putamen (basal ganglia)1.1E-050.2NANA0.240.220.170.31
rs80160059SV2BColon - Sigmoid9.0E-060.54NANA0.060.0010.000.11
rs2071410FURINEsophagus - Mucosa1.7E-200.28NA0.260.270.330.060.18
rs8026133LOC101926928, SLCO3A1Minor Salivary Gland1.4E-060.83NANA0.120.010.090.47
rs2227935BLMEsophagus - Mucosa2.9E-100.320.050.030.030.070.00040.07
rs16944923BLM, FURINSkin - Sun Exposed (Lower leg)9.0E-060.12NANA0.130.140.200.05
rs28385078BLMEsophagus - Mucosa5.7E-100.32NANA0.020.070.000.03
rs116449376BLM, FURINEsophagus - Mucosa1.5E-090.34NANA0.020.060.000.04
rs7495370BLM, FURINWhole Blood5.6E-070.072NANA0.640.570.780.47
rs7165790BLMHeart - Atrial Appendage4.4E-07-0.12NANA0.340.000.0010.00

P GTEx- P values reported in GTEx database; NES-normalized effect size in GTEx database; ∗P values calculated using Chi-square test.

KWT-Kuwaitis; IRN-Iranians; QAR-Qataris; EUR-Europeans (non-Finnish); EAS-East Asians; AFR-Africans. NA denotes Not Available in exome data.

FURIN known missense variants in the Middle East and gnomAD populations. SIFT (Sorting Intolerant From Tolerant): D = damaging, T = tolerated. PP2HVAR (PolyPhen-2 Polymorphism Phenotyping v2 HumVar): D = probably damaging, P = possibly damaging, B = benign. PP2HDIV (PolyPhen-2 Polymorphism Phenotyping v2 HumDiv): D = probably damaging, P = possibly damaging, B = benign. MUTTASTER (MutationTaster): A = disease causing automatic, D = disease causing, N = polymorphism, P = polymorphism automatic. LRT (Likelihood Ratio Test): D = deleterious, N = Neutral, U = unknown. CADD - Combined Annotation Dependent Depletion based scores. The missense variants were defined as deleterious when predicted to be damaging, probably damaging, disease causing and deleterious by the five algorithms applied (SIFT, PolyPhen-2 HumVar, PolyPhen-2 HumDiv, MutationTaster and LRT score) and/or CADD score of more than 20. We considered only deleterious variants with minor allele frequency less than 1% in the burden analysis. Burden of FURIN rare variants in the Middle East and gnomAD populations. The missense variants were defined as deleterious when predicted to be damaging, probably damaging, disease causing and deleterious by the five algorithms applied (SIFT, PolyPhen-2 HumVar, PolyPhen-2 HumDiv, MutationTaster and LRT score) and/or CADD score of more than 20. We considered only deleterious variants with minor allele frequency less than 1% in the burden analysis. P values are calculated using Chi-square test. KWT-Kuwaitis; IRN-Iranians; QAR-Qataris; EUR-Europeans (non-Finnish); EAS-East Asians; AFR-Africans. FURIN eQTL variants in the Middle East and gnomAD populations. P GTEx- P values reported in GTEx database; NES-normalized effect size in GTEx database; ∗P values calculated using Chi-square test. KWT-Kuwaitis; IRN-Iranians; QAR-Qataris; EUR-Europeans (non-Finnish); EAS-East Asians; AFR-Africans. NA denotes Not Available in exome data.

Discussion

At the time of revising this article during December 2020, SARS-CoV-2 has infected 77,394,940 and killed about 1,703,164 people worldwide [1]. The case fatality rates (CFR) varied significantly around the world, with the highest reported in France, United Kingdom, Italy, Belgium, and Spain reaching 15.3%, 15%, 14.1%, 13.6% and 7.9%, respectively (https://ourworldindata.org/mortality-risk-covid; accessed on 7 August 2020). Whereas, the CFRs were distinctly lower in Qatar, Kuwait, South Africa and Japan at 0.2%, 0.7%, 1.8% and 2.4%, respectively and moderately higher in Iran (5.6%) and China (5.5%) that reported the highest CFR in Asia (https://ourworldindata.org/mortality-risk-covid; accessed on 7 August 2020). The lower infection and CFRs in developing countries obviously cannot be attributed to increased number of tests or better medical services than what is already provided in Europe. Moreover, prior knowledge of the SARS-CoV-2 RNA genome revealed by whole genome sequencing of the virus, performed by us and others, has shown remarkable genetic similarities among the virus [57] circulating between Qatar, Kuwait and Europe, which was attributed to traveling and repatriation between these countries [Manuscript in preparation; GISAID [57]]. For these reasons, it is not unrealistic to propose that the differences observed in CFR among countries may be attributed to host/ethnic germline variations in genes involved in SARS-CoV-2 processing and cellular entry or exit. Therefore, we screened the genetic variations and eQTL expression of the SARS-CoV-2 candidate genes, ACE2, TMPRSS2 and FURIN in three Middle Eastern populations: Kuwaiti, Iranian, and Qatari and compared them to available MAF of the continental populations including Europeans, East Asians and Africans from the gnomAD database [31]. Through comprehensive survey, we identified novel and common missense variants at differential frequencies in the study populations. Among the detected nine known ACE2 missense variants, we observed that N720D (rs41303171) and K26R (rs4646116) were the most frequent in the European datasets [21]. Moreover, recent structural predictions by Stawiski and colleagues revealed that the K26R missense variant enhanced the affinity of ACE2 for SARS-CoV-2 whereas N720D had little involvement in the SARS-CoV-2 S-protein interaction [21]. Our structural investigation suggests that the ACE2 receptor N720D variant may enhance TMPRSS2 activation and subsequent viral entry. Interestingly, the prevalence of the most ACE2 activating variant N720D (rs41303171) was highest among Europeans (2.5%), Iranians (0.6%) when compared to Kuwaitis (0.3%), Qataris (0.2%) and other global populations (0.4%) and MAF of this variant significantly correlated with the CFRs (p < 0.0003) in the corresponding countries (Figure 4). However, further functional assessments are required to confirm our predictions. Notably, a recent Iranian population based study missed this activating variant N720D despite its prevalence in the Iranome database and claimed no association between the ACE2 variants and COVID-19 disease severity or mortality among Iranians [58]. Albeit, the study was limited to 45 patients, which is insufficient to establish significant conclusions. While another recent study highlighted the impact of the activating variant N720D (rs41303171) on epidemiological disparities, especially in Europeans when compared to other global populations [59] and stands in support of our observations. In the context of the occurrence of rare deleterious ACE2 variants, while the MAF of R708W in Kuwait is 0.105%, which may indicate again a protective role in the Kuwaiti population, the absence of this and the other four deleterious variants from Africa, East-Asia [31], Qatar [30] does not explain the lower disease burden in these countries. Similarly, we did not observe a significant difference in the burden of novel deleterious variants comparing the Kuwaiti population with Europeans (Table 2), although they may play a protective role against COVID-19 locally. The functional roles of the two Kuwait-specific and the two Iran-specific ACE2 deleterious novel variants Q661P (MAF = 0.1%), A372G (MAF = 0.2%) and V485L (MAF = 0.18%), F452V (MAF = 0.08%) respectively are yet to be experimentally determined. Further, our structural estimations of the ACE2 novel variants predicted the following, (i) Q661P is close to the aa region 652–659, which is important for cleavage by the metalloprotease ADAM17; (ii) R716H and the two deleterious variants R708W and R708Q are located within residues 697–716 essential for cleavage by TMPRSS11D and TMPRSS2. In fact, the mutation of arginine -such as R708- and lysine residues within amino acid residues 697-716 markedly reduced ACE2 cleavage by TMPRSS2 [60]. However, it should be mentioned that sequentially distant aa residues in the ACE2 receptor can be seen brought structurally proximal to each other to create active sites for catalysis [45]. We illustrated this in Figure 5 to show that the novel aa changes (colored green) are proximal to the protein residues that mediate its activity (colored blue and red). Further studies are needed to directly assess the functional aspects of the reported missense variants. We urge the international community to assess ACE2 variation differences among people with mild/asymptomatic disease versus patients presenting with severe respiratory distress syndrome.
Figure 5

ACE2 protein structure (open form, yellow; and closed form with substrate, gray). The active site residues are coded in red color. The zinc binding residues are coded in blue color. The identified novel amino acid changes in the Middle Eastern populations are coded in green color. The novel changes are proximal to the protein residues that mediate its activity.

ACE2 protein structure (open form, yellow; and closed form with substrate, gray). The active site residues are coded in red color. The zinc binding residues are coded in blue color. The identified novel amino acid changes in the Middle Eastern populations are coded in green color. The novel changes are proximal to the protein residues that mediate its activity. Interestingly, there was not a single individual in the Middle East with ACE2 receptor variations in amino acids known to be crucial for SARS-CoV-2 S-protein binding (K31, E35, D38, M82, K353) [61]. This may indicate that natural immunity conferred by the ACE2 receptor variations is lacking or extremely rare. The rareness is evident from a recent comprehensive study that scrutinized ACE2 gene variations in more than 290,000 individuals from 400 different worldwide population groups and in which only eight rare ACE2 gene missense variants (K31R, E35K, E37K, D38V, N33I, H34R, Y83H, and Q388L) with reduced binding to the S-protein of SARS-CoV-2 were reported [21]. Notably, this study has disproved the claim of Cao and colleagues on the absence of such protective ACE2 variants in human populations [18]. It is to be pointed out that there is no explicit published experimental evidence for our proposition that Lys26Arg increases virus entry and N720D increases cleavage of ACE2 which also has a positive effect on virus entry. However, there are sufficient suggestive evidences, as indicated below. A recent study [62] mined the whole-exome sequencing data of 6,930 Italian control individuals (available in the Network of Italian Genomes) from five different centers and identified the ACE2 variants, K26R and N720D along with G211R as common missense changes with a potential to interfere with protein structure and stabilization. Another recent work [18] also listed these two variants as hotspot variants that could make some people differentially susceptible to SARS-CoV-2 infection. We demonstrated through structural modelling that the K26R change creates a new hydrogen bond with D30, which is then poised to build a salt-bridge with the S-protein RBD K417 that increases the affinity of SARS-CoV-2 to the ACE2 receptor [21] (Figure 2B). The work by Stawiski and colleagues (2020) has also revealed that the K26R variant enhanced the affinity of ACE2 for SARS-CoV-2. As mentioned earlier, a recent study using a synthetic human ACE2 mutant library has reported that the R26 variant increased S-protein binding and susceptibility to the virus significantly [42]. As regards N720D change, our data presents a novel prediction that the ACE2 receptor N720D variant may enhance TMPRSS2 activation and subsequent viral entry. The amino acid substitution can make a change in interatomic interactions between amino acids near the cleavage region and subsequently can decrease the stability and increase the flexibility of the loop, which makes it easier for TMPRSS2 to cleave. Benetti and colleagues (2020) also concluded that this N720D residue is located close to the cleavage sequence of TMPRSS2 and is likely to affect the cleavage-dependent virion intake. Though there exists no direct experimental evidence, the reported novel proposition will lead to future studies dealing with TMPRSS2 enzyme activity which is important for coronavirus spread and pathogenesis in the infected host. Recently [63], examined natural amino acid replacements at ACE2 regions binding to SARS-COV-2 spike protein across 22 mammals. Though pigs and dogs, which are not effectively infected by SARS-COV-2, have only a few changes in the binding site, they exhibit relatively low levels of ACE2 in the respiratory tract. They further showed that bats and pangolins, which are now thought as confirmed carriers of viruses closely related to SARS-CoV-2, exhibit up to seven amino acid replacements in ACE2 residues binding to SARS-CoV-2 spike protein. Further examining the intermediate hosts, they reported that none of the ACE2 proteins of any of the discussed intermediate hosts seems to be especially equipped to attach to the spike protein of SARS-CoV-2. Of the two ACE2 amino acid changes that we report, the N720D was not examined in this article; though the article examines changes at the other amino acid position namely 26, the change that we report Lys26Arg has not been seen in any of the mammals – it is a more or less invariable position showing only K26N and K26Q changes in two mammals. The hypothesis, that we proposed initially, is that ethnic genetic variations in the ACE2-TMPRSS2-FURIN genes may act as key regulators for orchestrating SARS-CoV-2 cellular access. It has been reported in literature that systematic identification of the genetic determinants of COVID-19 pathway (such as ACE2 and TMPRSS2 polymorphisms) can guide personalized treatments in this pandemic and may even explain the related epidemiologic observations on susceptibility, severity, and clinical outcome, including both virus and host factors. The reported ACE2 deleterious variations are rare or less frequent and hence might not solely attribute to the overall infection rate. Genetic variation is one of the many risk factors that contribute to the overall clinical outcome; there are other contributors, such as age, gender, and pre-existing health conditions [27, 54, 55], that would act in conjunction with the genetic risk and bring about the inter-individual variability in outcome. Notable pre-existing health conditions that lead to severe outcomes in COVID-19 patients are diabetes and hypertension. Public Health England recently reported that hospital admission rates of ethnic minorities (including Indian, Black, and Arab populations) are three times higher than the majority white British population in the United Kingdom. The economists Platt and Warwick discussed the likely reasons for the ethnic inequalities in terms of disparities in age, gender and geographic profiles of ethnic groups Occupational exposure and economic vulnerability [64]. The impact of COVID-19 on a population also depends on the extent and timeliness of prevention efforts which certainly differ from country to country. There is also resistance from societies to adopt social distancing and isolation especially from countries with high socioeconomical status. However, contributions from genetic factors cannot be ruled out. Some of the eQTL variants for the considered genes in this study and missense variants are at higher frequency in African populations than at European population – for example, the FURIN eQTL variant rs8026133 (Table 5) is 47% in Africa, 12% in Qatar and is only 1% in European population. Thus, a large plethora of genetic variants from the different molecular components of COVID-19 pathway need to be considered. It is also possible that other epigenetic factors can also play a role including the socioeconomic status of those individuals amongst many others. The deleterious variations that we report in ACE2 are rare or less frequent and hence might not attribute to the overall infection or mortality rate. Genetic variation is one of the many risk factors that contribute to the overall clinical outcome; there are other contributors, such as comorbid health conditions, that would act in conjunction with the genetic risk and bring about the inter-individual variability in outcome. The other factors contributing to the clinical outcome include age, gender and pre-existing health conditions [27, 54, 55]. Notable pre-existing health conditions that lead to severe outcomes in COVID-19 patients are diabetes and hypertension. The patients with type 2 diabetes show high prevalence, severity of disease and mortality during SARS-CoV-2 infection (see the commentary by [65]. COVID-19 patients with worse glycemic control evidenced worse prognosis. Notably, as demonstrated by Marfella and colleagues (2020), COVID-19 patients with hyperglycemia could experience a reduced effect of tolicizumab therapy, that is anti-IL6 drug administered for the ICU admitted patients. Thus, hyperglycemia could result in an unfavorable effect on hospital admission, clinical outcomes and drug therapy [66, 67]. Another most common comorbidity and cause of death for COVID-19 patients is hypertension. The severe acute respiratory syndrome is caused by the binding of the virus to the target epithelial lung cells through angiotensin converting enzyme 2 (ACE2). Thus, hypertensive patients with COVID-19 could have worse prognosis. The ACE2 pathway and anti-hypertensive therapies play a central role in COVID-19 [68, 69]. Indeed, the common finding coming from these observations is that the endothelial dysfunction, that is mainly seen in COVID-19 patients with comorbidities, causes severe inflammation and endothelial damage leading to intravascular coagulopathy (DIC) and higher rate of death [70]. In case of TMPRSS2 gene, a deleterious variant rs12329760 (V197M) was frequent in Middle East and global populations from gnomAD (Table S2) and 1000 Genomes Project Phase 3 [71] datasets. Two different in silico investigations predicted the impact of this variant in viral entry into the host cell. One suggested that this variant produces a largest de novo pocket protein, which might affect the TMPRSS2 structure [72] and the other reported decreased stability of the protein in the presence of this variant [73]. However, Vargas-Alarcón and colleagues (2020) have illustrated no possible effect of this variant on the TMPRSS2 post-translational modifications. Additionally, we showed high prevalence of frequent deleterious FURIN variants in the Middle Eastern populations, especially Kuwait and Qatar, which indicated decreased FURIN protease activity that further reduces the risk of viral infection. As such, our results suggest a possible protective role of FURIN gene variants against the SARS-CoV-2 infection in the studied Middle Eastern populations. However, the data presented here add another variable that needs to be urgently addressed. It is not unreasonable to suggest that European human genetic variation presents optimal access for SARS-CoV-2 infection, whereas Iranian genetic background presents sub-optimal opportunities for the infection, and finally, Kuwaitis and Qataris have the least natural inductive genetic background in the ACE2 and FURIN proteins combined for viral entry.

Conclusions

In summary, we report the differential occurrence of gene variants in the Middle Eastern populations that enrich the knowledge on the genetic susceptibility of different global populations to the novel SARS-CoV-2 virus. Especially that in the context of a pandemic, rare variants that increase or decrease disease susceptibility become quantitatively important since millions of people may be infected. The decreased burden of rare deleterious variants and the increased MAF in variants that activate the ACE2 receptor in the European population suggest an inherent susceptibility to SARS-CoV-2 infection and vice versa in the Middle Eastern population. We understand that other confounding factors like SARS-CoV-2 testing, socioeconomical status, the availability of proper medical services and the higher burden of other diseases may be important contributors to the disparities seen in mortality rates around the world. It is imperative that more comprehensive studies should be conducted as more patient data emerges from different parts of the world. Our data are preliminary and highlight the urgent need to correlate patients' medical/infection history to their genetic variants in order to predict in a more accurate and personalized way how human genetic variations may influence this pandemic. We note that our findings are preliminary based on the computational (in silico) analyses, therefore, further experimental validations are required to understand the ramification of the reported gene variants on the entry of SARS-CoV-2 virus into the host cells.

Declarations

Author contribution statement

Hamad Ali, Mohamed Abu-Farha, Rasheed Ahmad, Jehad Abubaker: Analyzed and interpreted the data. Fahd Al-Mulla: Conceived and designed the experiments; Analyzed and interpreted the data; Wrote the paper. Anwar Mohammad, Ashraf Al Madhoun, Dania Haddad: Performed the experiments. Muthukrishnan Eaaswarkhanth: Analyzed and interpreted the data; Wrote the paper. Sumi Elsa John, Rasheeba Nizam, Arshad Channanath, Thangavel Alphonse Thanaraj: Contributed reagents, materials, analysis tools or data.

Funding statement

This work was supported by a Coronavirus emergency resilience grant from (#PN20-93MM-02).

Data availability statement

Data included in article/supplementarymaterial/referenced in article.

Declaration of interests statement

The authors declare no conflict of interest.

Additional information

No additional information is available for this paper.
  41 in total

1.  SIFT: Predicting amino acid changes that affect protein function.

Authors:  Pauline C Ng; Steven Henikoff
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  Exome sequencing identifies potential risk variants for Mendelian disorders at high prevalence in Qatar.

Authors:  Juan L Rodriguez-Flores; Khalid Fakhro; Neil R Hackett; Jacqueline Salit; Jennifer Fuller; Francisco Agosto-Perez; Maey Gharbiah; Joel A Malek; Mahmoud Zirie; Amin Jayyousi; Ramin Badii; Ajayeb Al-Nabet Al-Marri; Lotfi Chouchane; Dora J Stadler; Jason G Mezey; Ronald G Crystal
Journal:  Hum Mutat       Date:  2013-11-10       Impact factor: 4.878

3.  Iranome: A catalog of genomic variations in the Iranian population.

Authors:  Zohreh Fattahi; Maryam Beheshtian; Marzieh Mohseni; Hossein Poustchi; Erin Sellars; Sayyed Hossein Nezhadi; Amir Amini; Sanaz Arzhangi; Khadijeh Jalalvand; Peyman Jamali; Zahra Mohammadi; Behzad Davarnia; Pooneh Nikuei; Morteza Oladnabi; Akbar Mohammadzadeh; Elham Zohrehvand; Azim Nejatizadeh; Mohammad Shekari; Maryam Bagherzadeh; Ehsan Shamsi-Gooshki; Stefan Börno; Bernd Timmermann; Aliakbar Haghdoost; Reza Najafipour; Hamid Reza Khorram Khorshid; Kimia Kahrizi; Reza Malekzadeh; Mohammad R Akbari; Hossein Najmabadi
Journal:  Hum Mutat       Date:  2019-08-17       Impact factor: 4.878

4.  Amide-Amide and Amide-Water Hydrogen Bonds: Implications for Protein Folding and Stability.

Authors:  Eric S Eberhardt; Ronald T Raines
Journal:  J Am Chem Soc       Date:  1994-03-09       Impact factor: 15.419

5.  TMPRSS2 and ADAM17 cleave ACE2 differentially and only proteolysis by TMPRSS2 augments entry driven by the severe acute respiratory syndrome coronavirus spike protein.

Authors:  Adeline Heurich; Heike Hofmann-Winkler; Stefanie Gierer; Thomas Liepold; Olaf Jahn; Stefan Pöhlmann
Journal:  J Virol       Date:  2013-11-13       Impact factor: 5.103

6.  Receptor and viral determinants of SARS-coronavirus adaptation to human ACE2.

Authors:  Wenhui Li; Chengsheng Zhang; Jianhua Sui; Jens H Kuhn; Michael J Moore; Shiwen Luo; Swee-Kee Wong; I-Chueh Huang; Keming Xu; Natalya Vasilieva; Akikazu Murakami; Yaqing He; Wayne A Marasco; Yi Guan; Hyeryun Choe; Michael Farzan
Journal:  EMBO J       Date:  2005-03-24       Impact factor: 11.598

Review 7.  Host cell proteases: Critical determinants of coronavirus tropism and pathogenesis.

Authors:  Jean Kaoru Millet; Gary R Whittaker
Journal:  Virus Res       Date:  2014-11-22       Impact factor: 3.303

Review 8.  Impact of Diabetes in Patients Diagnosed With COVID-19.

Authors:  Mohamed Abu-Farha; Fahd Al-Mulla; Thangavel Alphonse Thanaraj; Sina Kavalakatt; Hamad Ali; Mohammed Abdul Ghani; Jehad Abubaker
Journal:  Front Immunol       Date:  2020-12-01       Impact factor: 7.561

9.  DynaMut: predicting the impact of mutations on protein conformation, flexibility and stability.

Authors:  Carlos Hm Rodrigues; Douglas Ev Pires; David B Ascher
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

10.  An interactive web-based dashboard to track COVID-19 in real time.

Authors:  Ensheng Dong; Hongru Du; Lauren Gardner
Journal:  Lancet Infect Dis       Date:  2020-02-19       Impact factor: 25.071

View more
  12 in total

1.  SARS-CoV-2 Susceptibility and ACE2 Gene Variations Within Diverse Ethnic Backgrounds.

Authors:  Nirmal Vadgama; Alexander Kreymerman; Jackie Campbell; Olga Shamardina; Christiane Brugger; Genomics England Research Consortium; Alexandra M Deaconescu; Richard T Lee; Christopher J Penkett; Casey A Gifford; Mark Mercola; Jamal Nasir; Ioannis Karakikes
Journal:  Front Genet       Date:  2022-04-27       Impact factor: 4.772

2.  SARS-CoV-2: Possible recombination and emergence of potentially more virulent strains.

Authors:  Dania Haddad; Sumi Elsa John; Anwar Mohammad; Maha M Hammad; Prashantha Hebbar; Arshad Channanath; Rasheeba Nizam; Sarah Al-Qabandi; Ashraf Al Madhoun; Abdullah Alshukry; Hamad Ali; Thangavel Alphonse Thanaraj; Fahd Al-Mulla
Journal:  PLoS One       Date:  2021-05-25       Impact factor: 3.240

3.  Clinical characteristics and outcomes of COVID-19 patients with diabetes mellitus in Kuwait.

Authors:  Abdullah Alshukry; Mohammad Bu Abbas; Yaseen Ali; Barrak Alahmad; Abdullah A Al-Shammari; Ghadeer Alhamar; Mohammad Abu-Farha; Jehad AbuBaker; Sriraman Devarajan; Ali A Dashti; Fahd Al-Mulla; Hamad Ali
Journal:  Heliyon       Date:  2021-04-05

4.  Protective Role of a TMPRSS2 Variant on Severe COVID-19 Outcome in Young Males and Elderly Women.

Authors:  Maria Monticelli; Bruno Hay Mele; Elisa Benetti; Chiara Fallerini; Margherita Baldassarri; Simone Furini; Elisa Frullanti; Francesca Mari; Giuseppina Andreotti; Maria Vittoria Cubellis; Alessandra Renieri
Journal:  Genes (Basel)       Date:  2021-04-19       Impact factor: 4.096

5.  Acute Severe Acute Respiratory Syndrome Coronavirus 2 Infection in Pregnancy Is Associated with Placental Angiotensin-Converting Enzyme 2 Shedding.

Authors:  Elizabeth S Taglauer; Elisha M Wachman; Lillian Juttukonda; Timothy Klouda; Jiwon Kim; Qiong Wang; Asuka Ishiyama; David J Hackam; Ke Yuan; Hongpeng Jia
Journal:  Am J Pathol       Date:  2022-01-26       Impact factor: 4.307

6.  A perspective on the applications of furin inhibitors for the treatment of SARS-CoV-2.

Authors:  Kasi Pandima Devi; Mahmoud Reza Pourkarim; Marijn Thijssen; Antoni Sureda; Maryam Khayatkashani; Cosmin Andrei Cismaru; Ioana Berindan Neagoe; Solomon Habtemariam; Soha Razmjouei; Hamid Reza Khayat Kashani
Journal:  Pharmacol Rep       Date:  2022-01-15       Impact factor: 3.919

7.  Ethnicity-Specific Features of COVID-19 Among Arabs, Africans, South Asians, East Asians, and Caucasians in the United Arab Emirates.

Authors:  Fatmah Al Zahmi; Tetiana Habuza; Rasha Awawdeh; Hossam Elshekhali; Martin Lee; Nassim Salamin; Ruhina Sajid; Dhanya Kiran; Sanjay Nihalani; Darya Smetanina; Tatsiana Talako; Klaus Neidl-Van Gorkom; Nazar Zaki; Tom Loney; Yauhen Statsenko
Journal:  Front Cell Infect Microbiol       Date:  2022-03-16       Impact factor: 5.293

Review 8.  Will Host Genetics Affect the Response to SARS-CoV-2 Vaccines? Historical Precedents.

Authors:  Maria K Smatti; Hebah A Alkhatib; Asmaa A Al Thani; Hadi M Yassine
Journal:  Front Med (Lausanne)       Date:  2022-03-11

9.  Development of a clinical risk score to predict death in patients with COVID-19.

Authors:  Ghadeer Alhamar; Ernesto Maddaloni; Abdullah Al Shukry; Salman Al-Sabah; Mohannad Al-Haddad; Sarah Al-Youha; Mohammed Jamal; Sulaiman Almazeedi; Abdullah A Al-Shammari; Mohamed Abu-Farha; Jehad Abubaker; Abdulnabi T Alattar; Ebaa AlOzairi; Francesco Alessandri; Luca D'Onofrio; Gaetano Leto; Carlo Maria Mastroianni; Carmen Mignogna; Giuseppe Pascarella; Francesco Pugliese; Hamad Ali; Fahd Al Mulla; Raffaella Buzzetti; Paolo Pozzilli
Journal:  Diabetes Metab Res Rev       Date:  2022-03-22       Impact factor: 8.128

10.  Emphasizing the link between blood types in multi-ethnic disparities and COVID-19 infection in Makkah, Saudi Arabia.

Authors:  Wesam A Nasif; Abeer S E Ali; Asim A Khogeer; Mohammed H Mukhtar; Mohamed M NourEldein; Ahmed Y Shebly; Shmukh H Alqahtani; Yahya A Alnashri; Ghidaa E Khouj; Ziyad I Gadah; Mohammad A Althubiti
Journal:  Saudi Med J       Date:  2022-02       Impact factor: 1.422

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.