| Literature DB >> 32622243 |
Wei Li1.
Abstract
One notable feature of the SARS-CoV-2 genome, the spike (S) protein of SARS-CoV-2 has a polybasic furin cleavage site (FCS) at its S1-S2 boundary through the insertion of 12 nucleotides encoding four amino acid residues PRRA. Quite intriguingly, this polybasic FCS is absent in coronaviruses of the same clade as SARS-CoV-2. Thus, with currently available experimental structural data for S protein, this short article presents a set of comprehensive structural characterization of the insertion of FCS into S protein, and argues against a hypothesis of the origin of SARS-CoV-2 from purposeful manipulation: (1), the inserted FCS is spatially located at a random coil loop region, mostly distantly solvent-exposed (instead of deeply buried), with no structural proximity to the other part of the S protein; (2), the insertion of FCS itself does not alter, neither stabilize nor de-stabilize, the three-dimensional structure of S; (3), the net result here is the insertion of a furin cleavage site into S protein, whose S1 and S2 subunits will still be strongly electrostatically bonded together from a structural and biophysical point of view, even if the polybasic FCS is actually cleaved by furin protease before or after viral cell entry.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32622243 PMCID: PMC7322478 DOI: 10.1016/j.bpc.2020.106420
Source DB: PubMed Journal: Biophys Chem ISSN: 0301-4622 Impact factor: 2.352
A summary of side chain and main chain hydrogen bonding analysis between the polybasic FCS and other part of the S protein of SARS-CoV-2. In this table, the residue naming scheme is Chain ID_residue name_residue number, ∠ADH represents the angle formed by acceptor (A), donor (D) and hydrogen (H) (∠ADH).
| Hydrogen bond No. | Acceptor (A) | Donor (D) | Hydrogen (H) | D-A (Å) | H-A (Å) | ∠ ADH () |
|---|---|---|---|---|---|---|
| 1 | O, A_THR_604 | NE, A_ARG_683 | HE, A_ARG_683 | 2.90 | 1.95 | 15.53 |
| 2 | O, C_ASN_679 | NH1, C_ARG_683 | HH11, C_ARG_683 | 2.73 | 1.73 | 8.46 |
| 3 | O, B_VAL_687 | NH1, B_ARG_685 | HH11, B_ARG_685 | 2.75 | 1.91 | 27.04 |
| 4 | O, B_VAL_687 | NE, B_ARG_685 | HE, B_ARG_685 | 2.86 | 1.99 | 25.09 |
| 5 | O, C_ARG_685 | NZ, C_LYS_310 | HZ3, C_LYS_310 | 2.70 | 1.83 | 24.75 |
Lys310-specific salt bridging analysis of the structure of the S protein. In this table, the residue naming scheme is Chain ID_residue name_residue number.
| Salt bridge | Residue A | Atom A | Residue B | Atom B | Distance (Å) |
|---|---|---|---|---|---|
| 1 | A_LYS_310 | NZ | A_ASP_663 | OD1 | 2.628 |
| 2 | A_LYS_310 | NZ | A_ASP_663 | OD2 | 2.667 |
| 3 | B_LYS_310 | NZ | B_ASP_663 | OD2 | 2.615 |
| 4 | B_LYS_310 | NZ | B_ASP_663 | OD1 | 2.691 |
| 5 | C_LYS_310 | NZ | C_ASP_663 | OD1 | 3.129 |
Lys310-specific hydrogen bonding network analysis of the structure of the S protein. In this table, the residue naming scheme is Chain ID_residue name_residue number, ∠ADH represents the angle formed by acceptor (A), donor (D) and hydrogen (H) (∠ADH).
| Hydrogen bond No. | Acceptor (A) | Donor (D) | Hydrogen (H) | D-A (Å) | H-A (Å) | ∠ ADH () |
|---|---|---|---|---|---|---|
| 1 | OD1, A_ASP_663 | NZ, A_LYS_310 | HZ2, A_LYS_310 | 2.63 | 1.74 | 22.20 |
| 2 | OD2, B_ASP_663 | NZ, B_LYS_310 | HZ3, B_LYS_310 | 2.62 | 1.67 | 16.87 |
| 3 | O, C_ARG_685 | NZ, C_LYS_310 | HZ3, C_LYS_310 | 2.70 | 1.83 | 24.75 |
| 4 | OD1, A_ASP_663 | NZ, A_LYS_310 | HZ2, A_LYS_310 | 2.63 | 1.74 | 22.20 |
| 5 | OD2, B_ASP_663 | NZ, B_LYS_310 | HZ3, B_LYS_310 | 2.62 | 1.67 | 16.87 |
Fig. 1An overall structure of the spike protein of SARS-CoV-2 in green cartoon. In this figure, the inserted FCS fragment from Chain B is coloured red. This figure is prepared using PyMol [30] with supplementary file model.pdb as an input. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
A quantitative analysis by DSSP [27] of the SASA values and the relative SASA values of the FCS residues. The relative SASAs are calculated for each amino acid in the protein by expressing the various summed residue accessible surfaces as a ratio (Value/Reference) of that observed in a ALA-X-ALA tripeptide built using the QUANTA molecular graphics software package [29].
| Residue ID | Chain ID | Residue name | Residue name | Value (Å2) | Reference (Å2) | Value/reference | Notes |
|---|---|---|---|---|---|---|---|
| 681 | A | P | PRO | 40 | 136.13 | 0.29 | Buried |
| 682 | A | R | ARG | 214 | 238.76 | 0.90 | Solvent-exposed |
| 683 | A | R | ARG | 193 | 238.76 | 0.81 | Solvent-exposed |
| 684 | A | A | ALA | 17 | 107.95 | 0.16 | Buried |
| 685 | A | R | ARG | 79 | 238.76 | 0.33 | Buried |
| 681 | B | P | PRO | 112 | 136.13 | 0.82 | Solvent-exposed |
| 682 | B | R | ARG | 175 | 238.76 | 0.73 | Solvent-exposed |
| 683 | B | R | ARG | 214 | 238.76 | 0.90 | Solvent-exposed |
| 684 | B | A | ALA | 59 | 107.95 | 0.55 | Solvent-exposed |
| 685 | B | R | ARG | 60 | 238.76 | 0.25 | Buried |
| 681 | C | P | PRO | 135 | 136.13 | 0.99 | Solvent-exposed |
| 682 | C | R | ARG | 93 | 238.76 | 0.39 | Buried |
| 683 | C | R | ARG | 168 | 238.76 | 0.70 | Solvent-exposed |
| 684 | C | A | ALA | 21 | 107.95 | 0.19 | Buried |
| 685 | C | R | ARG | 149 | 238.76 | 0.62 | Solvent-exposed |
FCS-specific inter-part salt bridging analysis of the S protein of SARS-CoV-2 with FCS at the boundary of its S1 and S2 units. In this table, the residue naming scheme is Chain ID_residue name_residue number. In this table, for residues A and B, one of them is located upstream (residue ID smaller than 681) of the polybasic FCS, while the other is located downstream (residue ID larger than 685) of the polybasic FCS.
| PDB File name | Residue A | Atom A | Residue B | Atom B | Distance (Å) |
|---|---|---|---|---|---|
| Model.pdb | A_ARG_319 | NH1 | B_ASP_737 | OD2 | 2.735 |
| Model.pdb | A_ARG_319 | NH1 | B_ASP_745 | OD1 | 3.587 |
| Model.pdb | A_ARG_319 | NH2 | B_ASP_737 | OD2 | 3.982 |
| Model.pdb | B_ARG_319 | NH1 | C_ASP_745 | OD1 | 2.606 |
| Model.pdb | B_ARG_319 | NH1 | C_ASP_745 | OD2 | 3.565 |
| Model.pdb | B_ARG_319 | NH2 | C_ASP_745 | OD1 | 3.325 |
| Model.pdb | B_ARG_319 | NH2 | C_ASP_745 | OD2 | 2.636 |
| Model.pdb | B_LYS_854 | NZ | A_ASP_614 | OD1 | 3.487 |
| Model.pdb | B_LYS_986 | NZ | C_ASP_427 | OD2 | 3.985 |
| Model.pdb | C_ARG_319 | NH1 | A_ASP_745 | OD1 | 3.819 |
| Model.pdb | C_ARG_646 | NH1 | A_ASP_848 | OD2 | 3.643 |
| Model.pdb | C_ARG_646 | NH2 | A_ASP_848 | OD1 | 3.695 |
| Model.pdb | C_ARG_646 | NH2 | A_ASP_848 | OD2 | 2.627 |
| Model.pdb | C_ARG_847 | NH1 | B_GLU_619 | OE1 | 3.034 |
| Model.pdb | C_LYS_854 | NZ | B_ASP_614 | OD1 | 2.686 |
A summary of side chain and main chain hydrogen bonding analysis between the two structural fragments generated by the hypothesized cleavage at the polybasic FCS of the S protein of SARS-CoV-2. In this table, the residue naming scheme is Chain ID_residue name_residue number, ∠ADH represents the angle formed by acceptor (A), donor (D) and hydrogen (H) (∠ADH). In this table, for residues A and B, one of them is to be located upstream (residue ID smaller than 681) of the polybasic FCS, while the other is to be located downstream (residue ID larger than 685) of the polybasic FCS.
| PDB file name | Acceptor (A) | Donor (D) | Hydrogen (H) | D-A (Å) | H-A (Å) | ∠ ADH () |
|---|---|---|---|---|---|---|
| Model.pdb | OD2, B_ASP_737 | NH1, A_ARG_319 | HH12, A_ARG_319 | 2.74 | 1.78 | 15.71 |
| Model.pdb | O, A_ILE_692 | N, A_GLU_654 | H, A_GLU_654 | 2.86 | 1.86 | 5.94 |
| Model.pdb | O, A_THR_696 | OG, A_SER_659 | HG, A_SER_659 | 2.72 | 1.91 | 26.75 |
| Model.pdb | O, B_PRO_863 | N, A_ALA_668 | H, A_ALA_668 | 2.78 | 1.79 | 9.02 |
| Model.pdb | O, B_LEU_864 | N, A_GLY_669 | H, A_GLY_669 | 2.82 | 1.95 | 24.85 |
| Model.pdb | O, A_GLU_654 | N, A_ALA_694 | H, A_ALA_694 | 2.80 | 1.83 | 12.75 |
| Model.pdb | O, A_GLU_661 | OH, A_TYR_695 | HH, A_TYR_695 | 2.62 | 1.68 | 9.06 |
| Model.pdb | O, A_GLY_669 | N, A_MET_697 | H, A_MET_697 | 2.86 | 1.87 | 9.86 |
| Model.pdb | O, A_TYR_660 | N, A_SER_698 | H, A_SER_698 | 2.75 | 1.74 | 2.22 |
| Model.pdb | O, C_ASP_614 | NE, A_ARG_847 | HE, A_ARG_847 | 2.97 | 2.12 | 27.07 |
| Model.pdb | O, C_ASP_614 | NH1, A_ARG_847 | HH11, A_ARG_847 | 2.71 | 1.79 | 19.16 |
| Model.pdb | OD2, C_ASP_614 | OG1, A_THR_859 | HG1, A_THR_859 | 2.83 | 1.95 | 19.70 |
| Model.pdb | O, C_THR_547 | ND2, A_ASN_978 | HD21, A_ASN_978 | 2.99 | 2.12 | 24.76 |
| Model.pdb | O, C_LEU_984 | NZ, B_LYS_386 | HZ3, B_LYS_386 | 2.72 | 1.89 | 28.40 |
| Model.pdb | O, B_ILE_692 | N, B_GLU_654 | H, B_GLU_654 | 2.81 | 1.82 | 9.58 |
| Model.pdb | O, B_ALA_694 | N, B_VAL_656 | H, B_VAL_656 | 2.96 | 2.02 | 17.75 |
| Model.pdb | O, B_THR_696 | OG, B_SER_659 | HG, B_SER_659 | 2.72 | 1.93 | 28.63 |
| Model.pdb | O, C_PRO_863 | N, B_ALA_668 | H, B_ALA_668 | 2.82 | 1.84 | 9.55 |
| Model.pdb | O, C_LEU_864 | N, B_GLY_669 | H, B_GLY_669 | 2.81 | 1.90 | 20.41 |
| Model.pdb | OH, B_TYR_695 | N, B_SER_673 | H, B_SER_673 | 2.96 | 2.09 | 24.82 |
| Model.pdb | O, B_GLU_654 | N, B_ALA_694 | H, B_ALA_694 | 2.77 | 1.83 | 16.90 |
| Model.pdb | O, B_GLY_669 | N, B_MET_697 | H, B_MET_697 | 2.77 | 1.77 | 5.38 |
| Model.pdb | O, B_TYR_660 | N, B_SER_698 | H, B_SER_698 | 2.65 | 1.65 | 3.25 |
| Model.pdb | OD2, A_ASP_614 | OG1, B_THR_859 | HG1, B_THR_859 | 2.98 | 2.16 | 26.02 |
| Model.pdb | O, A_ARG_983 | N, C_SER_383 | H, C_SER_383 | 2.91 | 1.90 | 2.08 |
| Model.pdb | OD2, A_ASP_745 | OG1, C_THR_549 | HG1, C_THR_549 | 2.94 | 2.12 | 25.69 |
| Model.pdb | OD2, A_ASP_848 | NH2, C_ARG_646 | HH22, C_ARG_646 | 2.63 | 1.79 | 27.55 |
| Model.pdb | O, C_ILE_692 | N, C_GLU_654 | H, C_GLU_654 | 2.81 | 1.89 | 19.55 |
| Model.pdb | O, C_THR_696 | OG, C_SER_659 | HG, C_SER_659 | 2.86 | 2.07 | 28.50 |
| Model.pdb | O, A_PRO_863 | N, C_ALA_668 | H, C_ALA_668 | 2.82 | 1.86 | 14.51 |
| Model.pdb | O, A_LEU_864 | N, C_GLY_669 | H, C_GLY_669 | 2.73 | 1.84 | 22.00 |
| Model.pdb | O, C_TYR_695 | N, C_CYS_671 | H, C_CYS_671 | 2.88 | 1.91 | 13.00 |
| Model.pdb | OG, C_SER_689 | OG1, C_THR_676 | HG1, C_THR_676 | 2.79 | 1.86 | 11.33 |
| Model.pdb | O, C_CYS_671 | N, C_TYR_695 | H, C_TYR_695 | 2.78 | 1.80 | 9.89 |
| Model.pdb | O, C_GLU_661 | OH, C_TYR_695 | HH, C_TYR_695 | 2.57 | 1.63 | 8.69 |
| Model.pdb | O, C_GLY_669 | N, C_MET_697 | H, C_MET_697 | 2.91 | 1.93 | 11.90 |
| Model.pdb | OD1, B_ASP_614 | NZ, C_LYS_854 | HZ2, C_LYS_854 | 2.69 | 1.71 | 10.87 |
| Model.pdb | O, B_THR_547 | ND2, C_ASN_978 | HD21, C_ASN_978 | 2.84 | 1.92 | 19.82 |
A summary of side chain hydrogen bonding analysis between the two structural fragments generated by the hypothesized cleavage at the polybasic FCS of the S protein of SARS-CoV-2. In this table, the residue naming scheme is Chain ID_residue name_residue number, ∠ADH represents the angle formed by acceptor (A), donor (D) and hydrogen (H) (∠ADH). In this table, for residues A and B, one of them is to be located upstream (residue ID scriptsizeer than 681) of the polybasic FCS, while the other is to be located downstream (residue ID larger than 685) of the polybasic FCS.
| PDB file name | Acceptor (A) | Donor (D) | Hydrogen (H) | D-A (Å) | H-A (Å) | ∠ ADH () |
|---|---|---|---|---|---|---|
| Model.pdb | OD2, B_ASP_737 | NH1, A_ARG_319 | HH12, A_ARG_319 | 2.74 | 1.78 | 15.71 |
| Model.pdb | OD2, C_ASP_614 | OG1, A_THR_859 | HG1, A_THR_859 | 2.83 | 1.95 | 19.70 |
| Model.pdb | OD2, A_ASP_614 | OG1, B_THR_859 | HG1, B_THR_859 | 2.98 | 2.16 | 26.02 |
| Model.pdb | OD2, A_ASP_745 | OG1, C_THR_549 | HG1, C_THR_549 | 2.94 | 2.12 | 25.69 |
| Model.pdb | OD2, A_ASP_848 | NH2, C_ARG_646 | HH22, C_ARG_646 | 2.63 | 1.79 | 27.55 |
| Model.pdb | OG, C_SER_689 | OG1, C_THR_676 | HG1, C_THR_676 | 2.79 | 1.86 | 11.33 |
| Model.pdb | OD1, B_ASP_614 | NZ, C_LYS_854 | HZ2, C_LYS_854 | 2.69 | 1.71 | 10.87 |
Fig. 2Four inter-chain salt bridges formed between basic residue pair B_ARG_319 and C_ASP_745 (Table 5). This figure is prepared using PyMol [30] with supplementary file model.pdb as an input. In this figure, the three (A, B and C) chains of the spike protein of SARS-CoV-2 is shown as green, cyan and purple cartoons, the residue naming scheme is Chain ID, residue name, residue number, the four salt bridges are shown here with four dotted yellow lines, with the lengths of the four inter-chain salt bridges being 2.6, 2.6, 3.3 and 3.5 Å, respectively. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)