| Literature DB >> 32681121 |
Elisa Benetti1, Rossella Tita2, Ottavia Spiga3, Andrea Ciolfi4, Giovanni Birolo5, Alessandro Bruselles6, Gabriella Doddato7, Annarita Giliberti7, Caterina Marconi8, Francesco Musacchia9, Tommaso Pippucci10, Annalaura Torella11, Alfonso Trezza3, Floriana Valentino7, Margherita Baldassarri7, Alfredo Brusco5,12, Rosanna Asselta13,14, Mirella Bruttini2,7, Simone Furini1, Marco Seri8,10, Vincenzo Nigro9,11, Giuseppe Matullo5,12, Marco Tartaglia4, Francesca Mari2,7, Alessandra Renieri15,16, Anna Maria Pinto2.
Abstract
In December 2019, an initial cluster of interstitial bilateral pneumonia emerged in Wuhan, China. A human-to-human transmission was assumed and a previously unrecognized entity, termed coronavirus disease-19 (COVID-19) due to a novel coronavirus (SARS-CoV-2) was described. The infection has rapidly spread out all over the world and Italy has been the first European country experiencing the endemic wave with unexpected clinical severity in comparison with Asian countries. It has been shown that SARS-CoV-2 utilizes angiotensin converting enzyme 2 (ACE2) as host receptor and host proteases for cell surface binding and internalization. Thus, a predisposing genetic background can give reason for interindividual disease susceptibility and/or severity. Taking advantage of the Network of Italian Genomes (NIG), here we mined whole-exome sequencing data of 6930 Italian control individuals from five different centers looking for ACE2 variants. A number of variants with a potential impact on protein stability were identified. Among these, three more common missense changes, p.(Asn720Asp), p.(Lys26Arg), and p.(Gly211Arg) were predicted to interfere with protein structure and stabilization. Rare variants likely interfering with the internalization process, namely p.(Leu351Val) and p.(Pro389His), predicted to interfere with SARS-CoV-2 spike protein binding, were also observed. Comparison of ACE2 WES data between a cohort of 131 patients and 258 controls allowed identifying a statistically significant (P value < 0.029) higher allelic variability in controls compared with patients. These findings suggest that a predisposing genetic background may contribute to the observed interindividual clinical variability associated with COVID-19, allowing an evidence-based risk assessment leading to personalized preventive measures and therapeutic options.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32681121 PMCID: PMC7366459 DOI: 10.1038/s41431-020-0691-z
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Missense, frameshift, and truncating variants identified in ACE2 gene.
| NM_021804.2 (hg19) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Genomic position | Nucleotide change | Amino acid change | CADD_phred | dbSNP | gnomAD | Hemizygous M | Heterozygous F | Homozygous F | N° of events/n° of alleles | NIG allele frequency | mean DP |
| X:15580093 | c.2353G>T | p.(Asp785Tyr) | 24.1 | rs373153165 | 0.000030 | – | 1 | – | 1/9689 | 0.00010 | 63 |
| X:15580101 | c.2345C>T | p.(Ala782Val) | 11.3 | rs147487891 | 0.000062 | 1 | – | – | 1/9689 | 0.00010 | 128 |
| X:15582235 | c.2221A>G | p.(Ile741Val) | 0.15 | rs372923812 | 0.00010 | 3 | 1 | – | 4/9689 | 0.00041 | 153 |
| X:15582298 | c.2158A>G | p.(Asn720Asp) | 15.1 | rs41303171 | 0.016 | 43 | 60 | – | 103/9689 | 0.011 | 99 |
| X:15584420 | c.2069del | p.(Asn690Metfs*28) | – | – | – | – | 4 | – | 4/9689 | 0.00041 | 125 |
| X:15584478 | c.2012G>A | p.(Arg671Gln) | 10.38 | rs753705431 | 0.000020 | 1 | 2 | – | 3/9689 | 0.00030 | 73 |
| X:15589846 | c.1738A>G | p.(Asn580Asp) | 0.69 | – | – | 1 | 1 | – | 2/9689 | 0.00020 | 221 |
| X:15590348 | c.1640C>G | p.(Ser547Cys) | 24 | rs373025684 | 0.00021 | – | 1 | – | 1/9689 | 0.00010 | 129 |
| X:15591514 | c.1517T>C | p.(Val506Ala) | 27.2 | rs775181355 | 0.0000066 | 1 | 1 | – | 2/9689 | 0.00020 | 22 |
| X:15591517 | c.1514A>G | p.(His505Arg) | 11.62 | rs1016409802 | 0.00065 | – | 1 | – | 1/9689 | 0.00010 | 96 |
| X:15591530 | c.1501G>A | p.(Ala501Thr) | 23.5 | rs140473595 | 0.000022 | – | 1 | – | 1/9689 | 0.00010 | 38 |
| X:15591539 | c.1492T>C | p.(Cys498Arg) | 28.9 | – | – | – | 2 | – | 2/9689 | 0.00020 | 74 |
| X:15591550 | c.1481A>T | p.(Asp494Val) | 28.3 | rs765152220 | 0.000051 | – | 1 | – | 1/9689 | 0.00010 | 205 |
| X:15593928 | c.1302dup | p.(Glu435Argfs*31) | – | – | – | – | 1 | – | 1/9689 | 0.00010 | 128 |
| X:15593930 | c.1300del | p.(Thr434Glnfs*3) | – | – | – | – | 1 | – | 1/9689 | 0.00010 | 127 |
| X:15596320 | c.1189A>G | p.(Asn397Asp) | 26.8 | rs1365935088 | 0.000015 | – | 1 | – | 1/9689 | 0.00010 | 131 |
| X:15596343 | c.1166C>A | p.(Pro389His) | 24.8 | rs762890235 | 0.000039 | – | 2 | – | 1/9689 | 0.00010 | 103 |
| X:15596379 | c.1130G>A | p.(Gly377Glu) | 27.5 | rs767462182 | 0.0000056 | 1 | – | – | 1/9689 | 0.00010 | 98 |
| X:15596380 | c.1129G>T | p.(Gly377Trp) | 34 | – | – | – | 1 | – | 1/9689 | 0.00010 | 76 |
| X:15599363 | c.1051C>G | p.(Leu351Val) | 22.7 | – | – | – | 2 | – | 2/9689 | 0.00020 | 93 |
| X:15599428 | c.986A>G | p.(Glu329Gly) | 12.4 | rs143936283 | 0.000034 | – | 5 | – | 5/9689 | 0.00051 | 125 |
| X:15605887 | c.791C>G | p.(Ala264Gly) | 27.6 | – | – | – | 2 | – | 2/9689 | 0.00020 | 124 |
| X:15605942 | c.736G>A | p.(Ala246Thr) | 0.28 | – | – | 2 | - | – | 2/9689 | 0.00020 | 143 |
| X:15607492 | c.670del | p.(Glu224Lysfs*21) | – | – | – | – | 1 | – | 1/9689 | 0.00010 | 169 |
| X:15607532 | c.631G>A | p.(Gly211Arg) | 14.7 | rs148771870 | 0.0013 | 4 | 8 | – | 12/9689 | 0.0012 | 64 |
| X:15607537 | c.626T>G | p.(Val209Gly) | 11.3 | – | – | – | 1 | – | 1/9689 | 0.00010 | 168 |
| X:15607546 | c.617A>G | p.(Asp206Gly) | 20.8 | rs142443432 | 0.00030 | 1 | 1 | – | 2/9689 | 0.00020 | 68 |
| X:15609885 | c.533del | p.(Pro178Hisfs*9) | – | – | – | – | 1 | – | 1/9689 | 0.00010 | 349 |
| X:15609928 | c.490del | p.(Ala164Leufs*13) | – | – | – | – | 1 | – | 1/9689 | 0.00010 | 167 |
| X:15610405 | c.385del | p.(Thr129Leufs*20) | – | – | – | 1 | – | – | 1/9689 | 0.00010 | 20 |
| X:15612979 | c.334A>G | p.(Lys112Glu) | 18.09 | – | – | – | 1 | – | 1/9689 | 0.00010 | 169 |
| X:15613008 | c.305A>C | p.(Gln102Pro) | 8.311 | rs1395878099 | 0.000011 | – | 2 | – | 2/9689 | 0.00020 | 123 |
| X:15613038 | c.275C>T | p.(Thr92Ile) | 0.031 | rs763395248 | 0.000011 | – | 2 | – | 2/9689 | 0.00020 | 133 |
| X:15613119 | c.194C>T | p.(Ala65Val) | 11.7 | – | – | – | 1 | – | 1/9689 | 0.00010 | 21 |
| X:15618872 | c.163A>G | p.(Thr55Ala) | 23.8 | rs775273812 | 0.0000057 | 1 | – | – | 1/9689 | 0.00010 | 214 |
| X:15618958 | c.77A>G | p.(Lys26Arg) | 10.5 | rs4646116 | 0.0039 | 4 | 7 | – | 11/9689 | 0.00110 | 135 |
| X:15619013 | c.22C>T | p.(Leu8Phe) | 14.2 | rs201035388 | 0.000076 | 1 | 1 | – | 2/9689 | 0.00020 | 111 |
The table reports the genomic position, the nucleotidic, and protein change of exonic ACE2 identified variants. The genomic reference sequence is NM_021804.2 (hg19). CADD_phred scores are reported for the missense variants. When available, dbSNP rs number and the genome aggregation database gnomAD allele frequency are reported. For all variants are reported the number of individuals hemizygous, heterozygous, or homozygous.
Predicted changes in ACE2 protein stability as consequence of residues changes.
DUET program results that display predicted change in folding free energy upon ACE2 missense variant (ΔΔG in kcal/mol). In the first three columns are reported single missense variants with specific position on ACE2 protein. The residues in the first column highlighted in gray are involved in N-glycosylation pattern NxT/S, therefore those missense variants determine the loss glycosylation of Asparagine 53 and 90, respectively. In the fourth column is reported ΔΔG analysis predict effects of missense variants on protein stability using an integrated computational approach. The column “Interaction Network around (5 Å)” shows for each single missense variant the residues around 5 Å. In this column, we highlight in green residues involved in spike SARS-CoV protein interaction, in yellow residues involved in Zinc coordination and finally in magenta residues of Asn involved in N-glycosylation. The last column defines the outcome of protein stability for each single missense variant. An increasing negative value for the ΔΔG is correlated with a higher destabilizing effect, while a positive value is associated with a variant predicted as stabilizing.
Fig. 1ACE2 crystal structure with PDB ID 1R42.
Surface and cartoon representations of protein in gray. In blue stick are represented each single mutated positions, cartoon region in yellow represent segment between amino acid 30–41, cartoon in green represent segment between amino acid 353–357 and cartoon in red represent segment between amino acid 82–84 that are protein regions responsible of interaction with 2019-nCOv spike glycoprotein.
Fig. 2ACE2 wild-type and variants superimposed structures after 100 ns MD simulation.
Cartoon representation of ACE2 wild type (orange) and variants (green) in blue sticks the wild-type residues while in red the corresponding variants. In cyan and pink sticks residues interacting with each specific position.
Fig. 3Structure superimposition snapshot between wild-type protein and variant proteins.
a Root mean square deviation (RMSD) trends for the backbone of ACE2 WT (black line) and some selected variants (colored lines, see legend) during 100 ns of simulation. The molecular dynamics simulation shows a good stability for all systems with exception of G211R mutants. RMSD is a parameter used to define the stability of an element. Wild type shows a steady course in the RMSD value, stabilizing at an average of 0.2 nm, while, the G211R variant shows a gradual increase in RMSD value, stabilizing at an average of 0.6 nm. b SASA graphical representation of ACE2 WT (black line) and ACE2 variants (colored lines, see legend).
Fig. 4Differences in ACE2 variants in COVID-19 patients compared with controls.
The figure shows the variants located in the ACE2 protein domains. The variants present in controls are shown in black while the variants in cases are shown in red. The number of patients carrying the variant is shown in brackets.