| Literature DB >> 33276271 |
B Robson1.
Abstract
The aim of the present study is to discuss the design of peptide vaccines and peptidomimetics against SARS-COV-2, to develop and apply a method of protein structure analysis that is particularly appropriate to applying and discussing such design, and also to use that method to summarize some important features of the SARS-COV-2 spike protein sequence. A tool for assessing sidechain exposure in the SARS-CoV-2 spike glycoprotein is described. It extends to assessing accessibility of sidechains by considering several different three-dimensional structure determinations of SARS-CoV-2 and SARS-CoV-1 spike protein. The method is designed to be insensitive to a distance limit for counting neighboring atoms and the results are in good agreement with the physical chemical properties and exposure trends of the 20 naturally occurring sidechains. The spike protein sequence is analyzed with comment regarding exposable character. It includes studies of complexes with antibody elements and ACE2. These indicate changes in exposure at sites remote to those at which the antibody binds. They are of interest concerning design of synthetic peptide vaccines, and for peptidomimetics as a basis of drug discovery. The method was also developed in order to provide linear (one-dimensional) information that can be used along with other bioinformatics data of this kind in data mining and machine learning, potentially as genomic data regarding protein polymorphisms to be combined with more traditional clinical data.Entities:
Keywords: Accessibility; COVID-19; Conformation; Coronavirus; Disorder; Exposure; Glycosylation; SARS-CoV-2; Spike glycoprotein
Mesh:
Substances:
Year: 2020 PMID: 33276271 PMCID: PMC7679524 DOI: 10.1016/j.compbiomed.2020.104124
Source DB: PubMed Journal: Comput Biol Med ISSN: 0010-4825 Impact factor: 4.589
Fig. 16VXX SARS-C0V-2 Spike protein Closed State Chain B (dotted lines indicate disordered part of chain).
Distribution by Percentage of accessibility scores for sidechains in SARS-CoV-2 spike glycoprotein PDB 6ZP5.
| Rmax = | 3 | 4 | 5 | 6 | 6.5 | 7 | 8 | 9 | 10 | 15 | 20 | 25 | 50 | 75 | 100 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2 | 1 | 2 | 2 | 5 | 4 | 2 | 2 | 5 | 2 | 4 | 4 | 2 | 9 | |
| 1 | 0 | 2 | 3 | 7 | 13 | 14 | 11 | 13 | 13 | 7 | 6 | 6 | 4 | 10 | 20 |
| 2 | 1 | 6 | 7 | 15 | 15 | 15 | 14 | 14 | 13 | 12 | 7 | 9 | 13 | 13 | |
| 3 | 2 | 13 | 13 | 15 | 15 | 10 | 12 | 11 | |||||||
| 4 | 5 | 17 | 17 | 15 | 14 | 15 | 17 | 14 | 13 | 14 | 14 | 6 | |||
| 5 | 18 | 18 | 14 | 13 | 13 | 14 | 13 | 11 | 14 | 14 | 15 | 15 | 13 | 5 | |
| 6 | 19 | 15 | 12 | 11 | 11 | 11 | 10 | 10 | 11 | 13 | 10 | 2 | |||
| 7 | 17 | 12 | 10 | 8 | 7 | 6 | 8 | 9 | 8 | 9 | 9 | 14 | 13 | 7 | 2 |
| 8 | 19 | 5 | 7 | 4 | 4 | 4 | 5 | 5 | 5 | 6 | 6 | 8 | 10 | 6 | 1 |
| 9 | 9 | 1 | 3 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 4 | 4 | 6 | 4 | 0 |
| 10 (X) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Distribution of accessibility scores for sidechains. In SARS-CoV-2 spike glycoprotein.
| Number of scores of this rank | maximum separation of atoms = 5.5 Å (between centers) in spike (6ZP5) | maximum separation of atoms = 5.5 Å (between centers) in spike + BD23 antibody 7BYR) | maximum separation of atoms = 6.5 Å (between centers) in spike (6ZP5) | maximum separation of atoms = 6.5 Å (between centers) in spike + BD23 antibody (7BYR) | maximum separation of atoms = 7.5 Å (between centers) in spike (6ZP5) | maximum separation of atoms = 7.5 Å (between centers) in spike + BD23 antibody (7BYR) |
|---|---|---|---|---|---|---|
| 0 | 20 | 13 | 45 | 56 | 23 | 39 |
| 1 | 43 | 21 | 123 | 117 | 120 | 122 |
| 2 | 110 | 85 | 152 | 173 | 125 | 184 |
| 3 | 158 | 164 | 146 | 173 | 178 | 174 |
| 4 | 181 | 219 | 137 | 141 | 132 | 121 |
| 5 | 145 | 172 | 121 | 123 | 136 | 138 |
| 6 | 126 | 163 | 106 | 111 | 99 | 118 |
| 7 | 86 | 93 | 62 | 58 | 73 | 58 |
| 8 | 49 | 55 | 36 | 35 | 44 | 32 |
| 9 | 25 | 13 | 15 | 12 | 13 | 12 |
| 10 (X) | 0 | 0 | 0 | 0 | 0 | 0 |
Comparison of exposure measures by other methods.
| Standard view | Phys 1 | Phys 2 | Phys 3 | Exposed Rmax = 6.5 Å | Exposed | Exposed | Not in contact with hydro-phobic core | (Moel-bert et al. | Semanta et al. [ | (Semanta et al. [ | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| LYS | +/− | +1.8 | −1.5 | −3.0 | 5.9 | 5.6 | 7.3 | +2.0 | 0.61 | 9.8 | 85 |
| GLU | +/− | +1.6 | −0.74 | −2.5 | 5.4 | 5.5 | 6.0 | +1.9 | 0.59 | 10.6 | 64 |
| ARG | +/− | +1.0 | −2.5 | −3.0 | 5.1 | 4.6 | 6.5 | +0.9 | 0.54 | 16.3 | 77 |
| ASP | +/− | +2.4 | −0.1 | −2.5 | 5.1 | 5.3 | 5.1 | +2.1 | 0.61 | 11.0 | 48 |
| ASN | P | +0.4 | −0.8 | 0 | 5.1 | 5.7 | 5.4 | −0.3 | 0.57 | 12.0 | 48 |
| HIS | +/− P | +1.4 | −0.4 | 0 | 4.5 | 4.5 | 5.8 | −0.05 | 0.42 | 15.2 | 47 |
| GLN | P | +0.2 | −0.8 | 0 | 4.4 | 4.3 | 5.8 | −0.3 | 0.57 | 12.8 | 60 |
| PRO | H | −0.3 | +0.12 | +1.4 | 4.2 | 4.4 | 4.9 | −1.8 | 0.50 | 10.6 | 43 |
| THR | P | +0.1 | 0 | +0.4 | 4.0 | 4.4 | 4.3 | −0.5 | 0.48 | 11.3 | 36 |
| SER | P | +0.3 | −0.2 | −0.3 | 3.8 | 4.5 | 4.1 | −0.2 | 0.57 | 9.3 | 28 |
| TYR | P,H | +0.2 | 0 | +2.3 | 3.6 | 3.2 | 5.1 | −1.2 | 0.32 | 20.6 | 40 |
| TRP | H | −0.2 | +0.8 | +3.4 | 3.2 | 3.0 | 4.5 | −1.4 | 0.28 | 24.8 | 38 |
| GLY | – | +1.1 | +0.5 | 0 | 3.2 | 3.5 | 4.2 | 0 | 0.59 | – | – |
| LEU | H | −0.7 | +1.5 | +1.8 | 3.2 | 3.0 | 5.6 | −1.2 | 0.32 | 14.8 | 23 |
| VAL | H | −0.5 | +1.8 | +1.5 | 2.7 | 3.1 | 4.2 | −1.0 | 0.31 | 12.8 | 19 |
| ILE | H | −0.8 | +1.4 | +1.8 | 2.7 | 2.6 | 4.5 | −1.25 | 0.27 | 15.3 | 21 |
| PHE | H | −0.6 | +1.2 | +2.5 | 2.7 | 2.3 | 5.0 | −1.3 | 0.29 | 20.0 | 25 |
| ALA | H | +0.3 | +0.6 | +0.5 | 2.6 | 3.3 | 3.8 | −0.6 | 0.40 | 7.2 | 18 |
| MET | H | −0.4 | +0.6 | +1.3 | 2.3 | 1.6 | 5.5 | −1.0 | 0.37 | 16.5 | 29 |
| CYS | H | +0.2 | +0.3 | +1.0 | 2.1 | 2.7 | 3.5 | −0.5 | 0.27 | 13.4 | 10 |
Fig. 2Location of loop GSTPCNGVEGFNCYF in the Spike Glycoprotein.
Fig. 3The reflect-complement-reflect method.