| Literature DB >> 32983400 |
Ratul Chowdhury1, Veda Sheersh Boorla1, Costas D Maranas1.
Abstract
SARS-CoV-2 is a novel highly virulent pathogen which gains entry to human cells by binding with the cell surface receptor - angiotensin converting enzyme (ACE2). We computationally contrasted the binding interactions between human ACE2 and coronavirus spike protein receptor binding domain (RBD) of the 2002 epidemic-causing SARS-CoV-1, SARS-CoV-2, and bat coronavirus RaTG13 using the Rosetta energy function. We find that the RBD of the spike protein of SARS-CoV-2 is highly optimized to achieve very strong binding with human ACE2 (hACE2) which is consistent with its enhanced infectivity. SARS-CoV-2 forms the most stable complex with hACE2 compared to SARS-CoV-1 (23% less stable) or RaTG13 (11% less stable). Notably, we calculate that the SARS-CoV-2 RBD lowers the binding strength of angiotensin 2 receptor type I (ATR1) which is the native binding partner of ACE2 by 44.2%. Strong binding is mediated through strong electrostatic attachments with every fourth residue on the N-terminus alpha-helix (starting from Ser19 to Asn53) as the turn of the helix makes these residues solvent accessible. By contrasting the spike protein SARS-CoV-2 Rosetta binding energy with ACE2 of different livestock and pet species we find strongest binding with bat ACE2 followed by human, feline, equine, canine and finally chicken. This is consistent with the hypothesis that bats are the viral origin and reservoir species. These results offer a computational explanation for the increased infection susceptibility by SARS-CoV-2 and allude to therapeutic modalities by identifying and rank-ordering the ACE2 residues involved in binding with the virus.Entities:
Keywords: ATR1; Biophysics; COVID 19; Human ACE2; SARS CoV-2
Year: 2020 PMID: 32983400 PMCID: PMC7500280 DOI: 10.1016/j.csbj.2020.09.019
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1SARS-CoV-2 RBD causes the greatest disruption to the original intra-residue contacts of hACE2 achieving the strongest-binding complex. Shown in the figure are the residue contact maps of the hACE2 receptor in the unbound state and when bound with the viral spike protein RBDs from SARS-CoV-1, SARS-CoV-2, and RaTG13, respectively. Filled dots (in green) represent electrostatic (i.e., circles) or hydrophobic (i.e., squares) intra-residue contacts within hACE2. Open circles and squares in the bound state of hACE2 with RBD signify the lost intra-residue contacts within hACE2 upon binding with the three spikes. Shown in yellow, pink and purple filled circles and squares are the inter-residues contacts formed upon binding with the three spike RBDs. Filled circles or squares in the light blue region show indirect/contact-map mediated interactions between hACE2 residues (region 1) and the spike RBD (region 3). SARS-CoV-2 disrupts and co-opts the most intra-hACE2 residue contacts forming the most residue contacts between hACE2 and RBD. RBD self-stabilizing contact information and weak (long-range) electrostatic interactions (between 4.5 Å and 6.0 Å) between the spike and hACE2 are not shown in the figure. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 2Leu443 present in the SARS-CoV-1 spike RBD is aligned with Phe456 present in SARS-CoV-2 and RaTG13. In SARS-CoV-2, Phe456 simultaneously interacts with hACE2 residues Thr27 and Asp30 whereas only the hydrophobic contact is observed in RatG13. In SARS-CoV-1, Leu443 is unable to establish neither the backbone electrostatic contact nor the hydrophobic stabilization of the methyl group of Thr27 present in hACE2. The thickness of the dashed lines denotes the strength of interaction.
Fig. 3Alanine scan on hACE2 binding residues of spike RBDs of SARS-CoV-2, SARS-CoV-1, and RaTG13 coronavirus. Bars represent the hACE2 Rosetta binding energies upon alanine substitution at the indicated site as a percentage of binding score prior to substitution. SARS-CoV-2 spike RBD appears to be highly optimized for binding hACE2 as the single substitution to more than 90% of the residues forming the RBD to alanine causes more than 50% reduction in binding energy. The symbol × refers to dashes (or gaps) in local sequence alignment of the corresponding RBD chains of the SARS, SARS-CoV-2, and RaTG13, respectively.
List of hydrogen-bonded contacts between the spike RBDs from (SARS-CoV-1, SARS-CoV-2, and RaTG13) and hACE2.
| Sequence ID | Spike residue | hACE2 residue | Distance (Å) |
|---|---|---|---|
| NC_004718_SARS-CoV-1 | Y450 | Q42 | 2.5 |
| Y456 | H34/D30 | 2.8/2.7 | |
| N487 | Q24 | 2.0 | |
| G496 | K353 | 1.8 | |
| T500 | Y41/D355 | 2.6/1.8 | |
| G502 | K353 | 1.9 | |
| NC_045512_SARS-CoV-2 | Y449 | Q42 | 2.0 |
| Q474 | Q24 | 2.9 | |
| Q493 | H34 | 2.8 | |
| S494 | D38 | 1.9 | |
| T500 | Y41 | 1.8 | |
| G502 | K353/Q325/G354 | 2.0/2.4/3.0 | |
| Y505 | R393 | 2.1 | |
| Q506 | Q325 | 2.0 | |
| A475 | S19 | 1.9 | |
| N487 | Q24 | 2.3 | |
| K417 | D30 | 1.9 | |
| MN996532_RaTG13 | K417 | D30 | 1.8 |
| Y473 | T27 | 2.4 | |
| N487 | Q24 | 2.1 | |
| Y493 | H34 | 2.6 | |
| Y498 | Q42 | 1.9 | |
| T500 | Y41 | 1.8 | |
| G502 | K353 | 1.9 |
Fig. 4(a). The role of tyrosine residues in SARS-CoV-2 and RaTG13 RBD is to form strong contacts with hACE2 residues while in SARS-CoV-1 they are primarily responsible for forming stabilizing contacts within the spike and are hence unavailable for hACE2 binding. (b) The role of glycine residues in both all three RBDs is to provide a xGzGx motif for binding hACE2 Lys353 using a strong electrostatic (or cation-π interaction). Here, ‘x’ is a polar residue, and ‘z’ a short chain hydrophobic residue (Ile or Val). The glycine residues along with residue ‘z’ offer a hinge to present polar residue ‘x’ for strong electrostatic interactions with hACE2 residue Lys353.
Fig. 5(a–c). hACE2 binding interfaces of the three spike proteins with six hydrogen-bonded contacts from each of them indicated. (d) Rosetta binding energies between spike RBD and hACE2 averaged from ten independent binding energy minimization trajectories. (e) RaTG13 shows the highest reduction of hACE2 solvent accessible surface area (SASA). (f and g) Even though RaTG13 recruits the highest number of interface residues, SARS-CoV-2 forms the most hydrogen-bonded contacts with hACE2. (h) The sequence alignment of the three RBDs is shown and the residues establishing hydrogen bonds with hACE2 are highlighted in cyan. The mean and median binding energies for each batch of ten RBD-hACE2 simulations are denoted with a horizontal line and a cross (×), respectively. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 6(a–d) Membrane bound conformation of hACE2-ATR1 complex with and without RBD bound to the extracellular domain of hACE2. The orange highlighted region shows the ordered, tightly bound, C-terminal helix of hACE2 that binds to ATR1 which becomes disordered and poorly bound in the presence of RBD. (e and f) The number of hydrogen bonded contacts at the hACE2-ATR1 interface reduces from 5.17 to 2.71 (averaged across a 50 ns MD trajectory) when the RBD binds to hACE2. (g) The RBD-hACE2 interface shows an average of 9 H-bonded interactions. (h–j) The ATR1-hACE2 complex energy is reduced by 120 kcal/mol when RBD is bound to the extracellular domain of hACE2. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Comparison of hACE2 mutations that show enhanced RBD binding from experiments and computational predictions.
| hACE2 residues | Point mutations to amino acids that show enhanced binding with RBD experimentally | Computationally predicted hACE2 variants that improve binding to RBD | ||
|---|---|---|---|---|
| Single | Double | Triple | ||
| S19 | P > F > Y > W > V | P* > F* > L | S19K + T20H > S19K + T20E | S19K + T20H + I21T > S19K + T20H + E22Q |
| T20 | S > E | S* > D > E* | S19K + T20H > S19K + T20E | S19K + T20H + I21T > S19K + T20H + E22Q |
| I21 | V > G > T | L > V* | none | S19K + T20H + I21T |
| E22 | T > Q > D | Q* > S | none | S19K + T20H + E22Q |
| E23 | F > C > M > Q | Y* > F > T | none | None |
| Q24 | T > P | S > T* | none | None |
| A25 | V > T > F > I | S > T* > L | none | None |
| K26 | D > I > V > A > R | D* > E > R* | none | None |
| T27 | Y > L > M > H | S > M* > W > A | T27I + F28D | None |
| F28 | Y | Y* > V | F28Y + D30E > F28D + L29E > T27I + F28D | None |
| L29 | F > E | T > F* | F28D + L29E > L29F + D30I > L29E + A36E | None |
| D30 | E > I > V > T | E* > V* > T* | F28Y + D30E > L29F + D30I | None |
| K31 | W > Y > F | R > W* | none | None |
| F32 | not reported | A > E | F32E + A36E | F32E + A36E + D37P |
| N33 | D > E > H > S | E* > D* > H* | none | None |
| H34 | V > A > P > S > W | S* > T > G | H34V + E35D | None |
| E35 | C > D > M > A | D* > M* > V | H34V + E35D | None |
| A36 | not reported | E > S | L29E + A36E > F32E + A36E | F32E + A36E + D37P > A36E + D37P + D38E |
| E37 | P | V > P* > A | none | F32E + A36E + D37P > A36E + D37P + D38E |
| D38 | E | E* > S | none | A36E + D37P + D38E |
| L39 | K > R > I > V > A | R* > K* | L39A + F40H | None |
| F40 | D > R > K > C > H | E > R* | L39A + F40H | F40E + Y41A + Q42K > D40E + Y41A + Q42K |
| Y41 | R | F > R* | Y41A + Q42K | F40E + Y41A + Q42K > D40E + Y41A + Q42K |
| Q42 | C > L > M > V > I | M* > V* > I* > L* | Y41A + Q42K | F40E + Y41A + Q42K > F40D + Y41A + Q42K |
| Q325 | P > K | K* > R | none | Q325K + G326E + A387H > Q325K + G354A + D355E |
| G326 | E | D > E* | none | Q325K + G326E + A387H |
| K353 | none | none | none | none |
| G354 | none | A | none | Q325K + G354A + D355E |
| D355 | none | E | none | Q325K + G354A + D355E |
| A387 | H | Q > H* | none | Q325K + G326E + A387H |
**log2 enrichment ratios reflect increase in experimentally binding affinity with RBD upon point mutation with respect to wild-type. ^ Increase in RBD binding energy (in kcal/mol) of an hACE2 variant with respect to WT hACE2. * RBD-binding enhancing hACE2 mutations identified from experiments which were recovered from in silico predictions as well.