| Literature DB >> 33532652 |
Fahd Al-Mulla1, Anwar Mohammad2, Ashraf Al Madhoun3, Dania Haddad1, Hamad Ali1,4, Muthukrishnan Eaaswarkhanth1, Sumi Elsa John1, Rasheeba Nizam1, Arshad Channanath1, Mohamed Abu-Farha5, Rasheed Ahmad6, Jehad Abubaker2, Thangavel Alphonse Thanaraj1.
Abstract
The severity of the new COVID-19 pandemic caused by the SARS-CoV-2 virus is strikingly variable in different global populations. SARS-CoV-2 uses ACE2 as a cell receptor, TMPRSS2 protease, and FURIN peptidase to invade human cells. Here, we investigated 1,378 whole-exome sequences of individuals from the Middle Eastern populations (Kuwait, Qatar, and Iran) to explore natural variations in the ACE2, TMPRSS2, and FURIN genes. We identified two activating variants (K26R and N720D) in the ACE2 gene that are more common in Europeans than in the Middle Eastern, East Asian, and African populations. We postulate that K26R can activate ACE2 and facilitate binding to S-protein RBD while N720D enhances TMPRSS2 cutting and, ultimately, viral entry. We also detected deleterious variants in FURIN that are frequent in the Middle Eastern but not in the European populations. This study highlights specific genetic variations in the ACE2 and FURIN genes that may explain SARS-CoV-2 clinical disparity. We showed structural evidence of the functionality of these activating variants that increase the SARS-CoV-2 aggressiveness. Finally, our data illustrate a significant correlation between ACE2 variants identified in people from Middle Eastern origins that can be further explored to explain the variation in COVID-19 infection and mortality rates globally.Entities:
Keywords: ACE2; COVID-19; FURIN; Genomic medicine; Middle Eastern populations; Precision medicine; SARS-CoV-2; TMPRSS2; Variants; eQTLs
Year: 2021 PMID: 33532652 PMCID: PMC7843038 DOI: 10.1016/j.heliyon.2021.e06133
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
ACE2 missense variants in the Middle East and gnomAD populations.
| Variant ID | Protein | Minor Allele Frequency | Functional Risk Prediction (Scores) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Kuwait | Iran | Qatar | Europe | East Asia | Africa | SIFT | PP2HVAR | PP2HDIV | MUTTASTER | LRT | CADD | ||
| rs372923812 | I741V | 0 | 0 | 0.00655 | 0.00009 | 0 | 0.00005 | T (0.32) | B (0.004) | B (0.001) | N (0.07) | N (0.005) | 0.066 |
| rs4646116 | K26R | 0.00209 | 0 | 0 | 0.00587 | 0.00007 | 0.00095 | T (0.47) | B (0.001) | B (0.0) | N (0.31) | N (0.83) | 8.188 |
| rs200540199 | R716H | 0.00105 | 0 | 0 | 0.00014 | 0 | 0 | D (0.02) | B (0.277) | P (0.771) | N (0.03) | N (0.207) | 11.52 |
| rs769062069 | R708Q | 0 | 0.00188 | 0 | 0.00002 | 0 | 0 | D (0.00) | D (0.935) | D (0.999) | D (0.99) | D (0.00) | 24.9 |
| rs776995986 | R708W | 0.00105 | 0.00584 | 0 | 0.00004 | 0 | 0 | D (0.00) | D (0.997) | D (1.00) | D (1.00) | D (0.00) | 22.9 |
| rs765152220 | D494V | 0 | 0.00063 | 0 | 0.00005 | 0 | 0 | D (0.01) | D (0.926) | D (0.995) | D (1.00) | D (0.00) | 31 |
| rs750145841 | Y199C | 0 | 0.00125 | 0 | 0.00002 | 0 | 0 | T (0.08) | D (0.984) | D (1.0) | D (0.98) | D (0.0004) | 26.2 |
| rs759162332 | Q60R | 0 | 0.00125 | 0 | 0.00003 | 0 | 0 | T (0.21) | B (0.009) | B (0.001) | N (0.20) | N (0.23) | 15.34 |
| rs41303171 | N720D | 0.00314 | 0.00625 | 0.00204 | 0.02521 | 0 | 0.00269 | T (0.07) | B (0.02) | B (0.006) | N (0.20) | N (0.009) | 14.9 |
| I786T | 0 | 0.00188 | 0 | 0 | 0 | 0 | T (0.18) | B (0.088) | B (0.297) | N (0.00) | U (0.001) | 6.519 | |
| P737A | 0.00314 | 0 | 0 | 0 | 0 | 0 | D (0.01) | B (0.210) | P (0.489) | N (0.00) | U (0.953) | 2.44 | |
| Q661P | 0.00105 | 0 | 0 | 0 | 0 | 0 | D (0.02) | B (0.30) | P (0.812) | N (0.00) | U (0.3) | 22.9 | |
| F555L | 0.00105 | 0 | 0 | 0 | 0 | 0 | T (0.09) | B (0.145) | B (0.16) | N (0.00) | U (0.223) | 8.825 | |
| V485L | 0 | 0.00188 | 0 | 0 | 0 | 0 | T (0.10) | P (0.56) | B (0.028) | N (0.00) | U (0.63) | 23.7 | |
| F452V | 0 | 0.00083 | 0 | 0 | 0 | 0 | D (0.03) | B (0.038) | B (0.189) | N (0.00) | U (0.004) | 26.3 | |
| A372G | 0.00209 | 0 | 0 | 0 | 0 | 0 | D (0.05) | B (0.049) | B (0.437) | N (0.00) | U (0.001) | 25.4 | |
| T334M | 0 | 0.00188 | 0 | 0 | 0 | 0 | T (0.11) | B (0.026) | B (0.01) | N (0.00) | U (0.084) | 12.87 | |
| S331F | 0 | 0.00083 | 0 | 0 | 0 | 0 | D (0.00) | P (0.944) | B (0.006) | N (0.00) | U (0.344) | 24.6 | |
| D225G | 0 | 0.00125 | 0 | 0 | 0 | 0 | T (0.37) | B (0.007) | B (0.002) | N (0.00) | U (0.03) | 26.1 | |
SIFT (Sorting Intolerant From Tolerant): D = damaging, T = tolerated.
PP2HVAR (PolyPhen-2 Polymorphism Phenotyping v2 HumVar): D = probably damaging, P = possibly damaging, B = benign.
PP2HDIV (PolyPhen-2 Polymorphism Phenotyping v2 HumDiv): D = probably damaging, P = possibly damaging, B = benign.
MUTTASTER (MutationTaster): A = disease causing automatic, D = disease causing, N = polymorphism, P = polymorphism automatic.
LRT (Likelihood Ratio Test): D = deleterious, N = Neutral, U = unknown.
CADD - Combined Annotation Dependent Depletion based scores.
The missense variants were defined as deleterious when predicted to be damaging, probably damaging, disease causing and deleterious by the five algorithms applied (SIFT, PolyPhen-2 HumVar, PolyPhen-2 HumDiv, MutationTaster and LRT score) and/or CADD score of more than 20. We considered only deleterious variants with minor allele frequency less than 1% in the burden analysis.
Denotes Novel variants presented with chromosome: position (based on the human reference genome build GRCh37/hg19).
Burden of ACE2 rare variants in the Middle East and gnomAD populations.
| Variant ID | Protein | Minor Allele Frequency | P value | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| KWT | IRN | QAR | EUR | EAS | AFR | KWT vs IRN | KWT vs EUR | IRN vs EUR | ||
| rs776995986 | R708W | 1.05E-03 | 5.64E-03 | 0.00 | 3.79E-05 | 0.00 | 0.00 | 0.1084 | 0.0515 | 0.0005 |
| rs769062069 | R708Q | 0.00 | 1.88E-03 | 0.00 | 1.46E-05 | 0.00 | 0.00 | __ | __ | 0.0005 |
| rs765152220 | D494V | 0.00 | 6.25E-04 | 0.00 | 5.37E-05 | 0.00 | 0.00 | __ | __ | 0.0995 |
| rs750145841 | Y199C | 0.00 | 1.25E-03 | 0.00 | 2.00E-05 | 0.00 | 0.00 | __ | __ | 0.0015 |
The missense variants were defined as deleterious when predicted to be damaging, probably damaging, disease causing and deleterious by the five algorithms applied (SIFT, PolyPhen-2 HumVar, PolyPhen-2 HumDiv, MutationTaster and LRT score) and/or CADD score of more than 20. We considered only deleterious variants with minor allele frequency less than 1% in the burden analysis. P values are calculated using Chi-square test.
KWT-Kuwaitis; IRN-Iranians; QAR-Qataris; EUR-Europeans (non-Finnish); EAS-East Asians; AFR-Africans.
Figure 1The positions of the ACE2 receptor polymorphisms on the linearized ACE2 protein. The translated protein contains an N-terminal signal sequence (1–17), single catalytic domain (18–740) with zinc-binding motif (HEMGH 374–378), a transmembrane region (741–761), a small C-terminal cytosolic domain (762–805).
Figure 3(A)ACE2 dimer complex (promoter 1 Blue), (promoter 2 white). The peptidase domain (PD) is the site where the SARS-CoV-2 S protein ribosomal domain (RBD) binds. The C-terminal collectrin-like domain (CLD) is the region where the TMPRSS2 cleaves ACE2. (B) Single domain of the ACE2 mutation N720D, the red region of the protein depicted the more flexible region of the protein due to the N720D mutation with a (ΔΔG): -0.470 kcal/mol and ΔΔS: 0.070 kcal mol−1.K−1 (increase of molecule flexibility). (C) Depicts N720 backbone NH forming a H-bond with N718 and E723 forming an NH–NH bond with S721. (D) D720 backbone NH forming a H-bond with N718 and COO- group of D720 forming a weak polar interaction with L722 backbone.
Figure 2K26R polymorphism in ACE2 mapped to the structure of ACE2, green; in complex with S protein RBD, yellow (PDB ID: 6VW1). (A) In native ACE2, K26, magenta, forms two H-bonds with the mannose moiety, cyan, of ACE2 N90 which may stabilize the glycan-ACE2 complex. (B) The mutant R26, magenta, forms one H-bond with mannose. A process that could provoke the moiety-ACE2 stability and increases the affinity of the ACE2 a-helix to S protein RBD binding, where, R26 functions as a backbone and interacts with D30 which in turn dignified to build a salt-bridge with the S protein RBD K417, yellow stick.
Figure 4The plot shows a significant correlation between N720D minor allele frequency (MAF) and case fatality rate reported from countries of different continents with Pearson's correlation coefficient of 69.44% (p < 0.0003; C.I. 0.3953–0.8603).
FURIN known missense variants in the Middle East and gnomAD populations.
| Variant ID | Protein Consequence | Minor Allele Frequency | Functional Risk Prediction (Scores) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Kuwait | Iran | Qatar | Europe | East Asia | Africa | SIFT | PP2HVAR | PP2HDIV | MUTTASTER | LRT | CADD | |||
| rs201172453 | R37C | 0.00418 | 0.00375 | 0.00700 | 0.00007 | 0 | 0.00016 | D (0.01) | P (0.586) | D (0.995) | N (0.000) | N (0.052) | 22.7 | |
| rs749761700 | R37H | 0.00105 | 0 | 0 | 0.00001 | 0 | 0 | T (1.00) | B (0.001) | B (0.001) | N (0.000) | N (0.052) | 0.496 | |
| rs16944971 | A43V | 0.00628 | 0.00188 | 0.00500 | 0.00255 | 0 | 0.07835 | T (0.25) | B (0.007) | B (0.010) | P (0.411) | N (0.007) | 11.82 | |
| rs148110342 | R81C | 0.02406 | 0.00750 | 0.03400 | 0.00159 | 0 | 0.00016 | T (0.17) | P (0.586) | D (0.989) | D (0.995) | N (0.051) | 23 | |
| rs751909359 | R86Q | 0 | 0.00070 | 0 | 0.00003 | 0 | 0 | T (0.31) | B (0.028) | B (0.244) | D (0.999) | D (0.000) | 22.3 | |
| rs116359616 | V109M | 0.00105 | 0 | 0 | 0.00002 | 0 | 0.02318 | D (0.03) | B (0.010) | B (0.083) | D (0.635) | N (0.168) | 4.625 | |
| rs143276283 | Q399R | 0.00209 | 0.00063 | 0 | 0.00006 | 0 | 0 | T (1.00) | B (0.002) | B (0.000) | D (0.967) | N (0.002) | 18.69 | |
| rs750446344 | V580I | 0.00314 | 0 | 0 | 0.00004 | 0 | 0 | T (0.23) | B (0.007) | B (0.010) | N (0.000) | N (0.835) | 0.211 | |
| rs752639409 | R637Q | 0 | 0.00070 | 0 | 0 | 0 | 0.00006 | T (0.58) | B (0.051) | B (0.200) | N (0.003) | N (0.031) | 21.3 | |
| rs761541008 | R677W | 0 | 0.00070 | 0 | 0.00005 | 0 | 0 | T (0.05) | B (0.332) | P (0.947) | N (0.046) | D (0.000) | 27.4 | |
| rs193268286 | S685P | 0 | 0 | 0 | 0.00004 | 0 | 0 | T (0.09) | D (0.994) | D (0.999) | D (0.999) | D (0.000) | 26.6 | |
| rs760368022 | H712Y | 0.00105 | 0 | 0 | 0.00001 | 0 | 0 | D (0.01) | B (0.089) | B (0.367) | N (0.016) | N (0.032) | 19.66 | |
| rs35641241 | R745Q | 0 | 0 | 0 | 0.00118 | 0.00102 | 0.00008 | T (0.13) | B (0.375) | D (0.989) | N (0.149) | N (0.021) | 23.4 | |
SIFT (Sorting Intolerant From Tolerant): D = damaging, T = tolerated.
PP2HVAR (PolyPhen-2 Polymorphism Phenotyping v2 HumVar): D = probably damaging, P = possibly damaging, B = benign.
PP2HDIV (PolyPhen-2 Polymorphism Phenotyping v2 HumDiv): D = probably damaging, P = possibly damaging, B = benign.
MUTTASTER (MutationTaster): A = disease causing automatic, D = disease causing, N = polymorphism, P = polymorphism automatic.
LRT (Likelihood Ratio Test): D = deleterious, N = Neutral, U = unknown.
CADD - Combined Annotation Dependent Depletion based scores.
The missense variants were defined as deleterious when predicted to be damaging, probably damaging, disease causing and deleterious by the five algorithms applied (SIFT, PolyPhen-2 HumVar, PolyPhen-2 HumDiv, MutationTaster and LRT score) and/or CADD score of more than 20. We considered only deleterious variants with minor allele frequency less than 1% in the burden analysis.
Burden of FURIN rare variants in the Middle East and gnomAD populations.
| Variant ID | Protein | Minor Allele Frequency | P value | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| KWT | IRN | QAR | EUR | EAS | AFR | KWT vs IRN | KWT vs QAR | KWT vs EUR | KWT vs AFR | IRN vs QAR | IRN vs EUR | IRN vs AFR | QAR vs EUR | QAR vs AFR | EUR vs EAS | EUR vs AFR | EAS vs AFR | |||
| rs201172453 | R37C | 0.004 | 0.004 | 0.007 | 7.0E-05 | 0 | 1.6E-04 | 1 | 1 | 0.0005 | 0.0005 | 1 | 0.0005 | 0.0005 | 0.0120 | 0.0610 | __ | 0.2724 | __ | |
| rs148110342 | R81C | 0.024 | 0.008 | 0.034 | 1.6E-03 | 0 | 1.6E-04 | 0.0015 | 0.4468 | 0.0005 | 0.0005 | 0.0055 | 0.0010 | 0.0005 | 0.0005 | 0.0005 | __ | 0.0005 | __ | |
| rs751909359 | R86Q | 0 | 0.001 | 0 | 3.0E-05 | 0 | 0 | __ | __ | __ | __ | __ | 0.0645 | __ | __ | __ | __ | __ | __ | |
| rs752639409 | R637Q | 0 | 0.001 | 0 | 0 | 0 | 6.0E-05 | __ | __ | __ | __ | __ | __ | 0.1709 | __ | __ | __ | __ | __ | |
| rs761541008 | R677W | 0 | 0.001 | 0 | 5.0E-05 | 0 | 0 | __ | __ | __ | __ | __ | 0.1000 | __ | __ | __ | __ | __ | __ | |
| rs35641241 | R745Q | 0 | 0 | 0 | 1.2E-03 | 1.0E-03 | 8.0E-05 | __ | __ | __ | __ | __ | __ | __ | __ | __ | 0.5857 | 0.0005 | 0.0005 | |
| rs193268286 | S685P | 0 | 0 | 0 | 4.0E-05 | 0 | 0 | __ | __ | __ | __ | __ | __ | __ | __ | __ | __ | __ | __ | |
The missense variants were defined as deleterious when predicted to be damaging, probably damaging, disease causing and deleterious by the five algorithms applied (SIFT, PolyPhen-2 HumVar, PolyPhen-2 HumDiv, MutationTaster and LRT score) and/or CADD score of more than 20. We considered only deleterious variants with minor allele frequency less than 1% in the burden analysis. P values are calculated using Chi-square test.
KWT-Kuwaitis; IRN-Iranians; QAR-Qataris; EUR-Europeans (non-Finnish); EAS-East Asians; AFR-Africans.
FURIN eQTL variants in the Middle East and gnomAD populations.
| Variant ID | Gene | Tissue | P | NES | Minor Allele Frequency | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| KWT | IRN | QAR | EUR | EAS | AFR | |||||
| rs117976310 | Adrenal Gland | 3.0E-06 | -1.1 | NA | NA | 0.01 | 0.02 | 0.00 | 0.002 | |
| rs12904679 | Esophagus - Mucosa | 3.1E-06 | -0.2 | NA | NA | 0.12 | 0.12 | 0.003 | 0.02 | |
| rs4932373 | Esophagus - Mucosa | 6.6E-19 | 0.28 | NA | NA | 0.30 | 0.33 | 0.09 | 0.13 | |
| rs2238332 | Pituitary | 2.7E-06 | -0.17 | NA | NA | 0.36 | 0.36 | 0.33 | 0.70 | |
| rs8039305∗ | Esophagus - Mucosa | 1.7E-17 | 0.23 | NA | 0.41 | 0.49 | 0.48 | 0.17 | 0.81 | |
| rs6226∗ | Artery - Tibial | 1.6E-09 | 0.12 | 0.72 | 0.65 | 0.69 | 0.68 | 0.53 | 0.93 | |
| rs12904055 | Brain - Putamen (basal ganglia) | 1.1E-05 | 0.2 | NA | NA | 0.24 | 0.22 | 0.17 | 0.31 | |
| rs80160059 | Colon - Sigmoid | 9.0E-06 | 0.54 | NA | NA | 0.06 | 0.001 | 0.00 | 0.11 | |
| rs2071410 | Esophagus - Mucosa | 1.7E-20 | 0.28 | NA | 0.26 | 0.27 | 0.33 | 0.06 | 0.18 | |
| rs8026133 | Minor Salivary Gland | 1.4E-06 | 0.83 | NA | NA | 0.12 | 0.01 | 0.09 | 0.47 | |
| rs2227935 | Esophagus - Mucosa | 2.9E-10 | 0.32 | 0.05 | 0.03 | 0.03 | 0.07 | 0.0004 | 0.07 | |
| rs16944923 | Skin - Sun Exposed (Lower leg) | 9.0E-06 | 0.12 | NA | NA | 0.13 | 0.14 | 0.20 | 0.05 | |
| rs28385078 | Esophagus - Mucosa | 5.7E-10 | 0.32 | NA | NA | 0.02 | 0.07 | 0.00 | 0.03 | |
| rs116449376 | Esophagus - Mucosa | 1.5E-09 | 0.34 | NA | NA | 0.02 | 0.06 | 0.00 | 0.04 | |
| rs7495370 | Whole Blood | 5.6E-07 | 0.072 | NA | NA | 0.64 | 0.57 | 0.78 | 0.47 | |
| rs7165790 | Heart - Atrial Appendage | 4.4E-07 | -0.12 | NA | NA | 0.34 | 0.00 | 0.001 | 0.00 | |
P GTEx- P values reported in GTEx database; NES-normalized effect size in GTEx database; ∗P values calculated using Chi-square test.
KWT-Kuwaitis; IRN-Iranians; QAR-Qataris; EUR-Europeans (non-Finnish); EAS-East Asians; AFR-Africans. NA denotes Not Available in exome data.
Figure 5ACE2 protein structure (open form, yellow; and closed form with substrate, gray). The active site residues are coded in red color. The zinc binding residues are coded in blue color. The identified novel amino acid changes in the Middle Eastern populations are coded in green color. The novel changes are proximal to the protein residues that mediate its activity.