| Literature DB >> 35669291 |
Renukaradhya K Math1, Nayana Mudennavar1, Palaksha Kanive Javaregowda1, Ambuja Savanur1.
Abstract
Background: A recent global outbreak of COVID-19 caused by the severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) created a pandemic and emerged as a potential threat to humanity. The analysis of virus genetic composition has revealed that the spike protein, one of the major structural proteins, facilitates the entry of the virus to host cells. Objective: The spike protein has become the main target for prophylactics and therapeutics studies. Here, we compared the spike proteins of SARS-CoV-2 variants using bioinformatics tools.Entities:
Keywords: 2D map; COVID therapy; COVID-19; NCBI; SARS-CoV-2 treatment; bioinformatics; bioinformatics tools; genomic; isoelectric point; pI; phylogenetic tree; prophylactic; protein; sequence analysis; spike protein; spike protein variants; therapeutic; viral protein
Year: 2022 PMID: 35669291 PMCID: PMC9158474 DOI: 10.2196/37391
Source DB: PubMed Journal: JMIR Bioinform Biotech ISSN: 2563-3570
List of SARS-CoV-2 spike proteins and the comparison of mutational analysis data of the wild type and its variants.
| SARS-CoV-2 variants | First identified | NCBI accession | Linear SeqVrl | Pango lineage | Mutation in spike protein variants sequence | Specific mutation in spike protein variants |
| Wild type | Wuhan, China | YP_009724390 | —a | — | — | — |
| D614G | United States | QTA38988 | 21-Mar-21 | D614G | D614G | D614G |
| Alpha (B.1.1.7) | United Kingdom | P0DTC2 | 02-Jun-21 | B.1.1.7 | E484K, A570D, D614G, P681H | N501Y |
| Beta (B.1.351) | South Africa | 7LYQ_C | 09-Jul-21 | B.1.351 | K417N, E484K, N501Y, D614G | R246I |
| Delta (B.1.617.2) | India | QWP92316 | 12-Jun-21 | B.1.617.2 | E156, K417N, D614G, N501Y | T19R, L452R, T478K, P581R, P681R, D950N |
| Gamma (P.1) | Brazil | 7M8K_C | 28-May-21 | P.1 | K417N, E484K, N501Y, D614G | T20N, P26S, D138Y, R190S, T1027I |
| Omicron (B.1.1.529) | South Africa | 7QO7_C | 19-Jan-2022 | B.1.1.529 | N501Y, Y505H, T547K, D614G, H655Y, etc | A67V, del69-70, T95I, del142-144, Y145D, etc |
aNot available.
Comparison of predicted physicochemical property values of the spike protein of SARS-CoV-2 and its variants.
| SARS-CoV-2 variants | Total amino acids | MWa | pIb | Extinction coefficient (M-1 cm-1) | EC/Ac | Half-life (h) | Instability index | Classification of protein | Aliphatic index | GRAVYd |
| Wild type | 1273 | 141178 | 6.24 | 148960 | 1.055 | 30 | 33.01 | stable | 84.67 | –0.079 |
| D614G | 1252 | 138712 | 6.49 | 147345 | 1.062 | 30 | 32.06 | stable | 85.01 | –0.067 |
| Alpha (B.1.1.7) | 1273 | 141178 | 6.24 | 148960 | 1.055 | 30 | 33.01 | stable | 84.67 | –0.079 |
| Beta (B.1.351) | 1288 | 142201 | 6.38 | 148335 | 1.043 | 30 | 31.29 | stable | 82.03 | –0.142 |
| Delta (B.1.617.2) | 1271 | 140895 | 7.34 | 148960 | 1.057 | 30 | 32.58 | stable | 84.65 | –0.08 |
| Gamma (P.1) | 1257 | 139207 | 6.18 | 140315 | 1.008 | 30 | 31.15 | stable | 83.43 | –0.127 |
| Omicron (B.1.1.529) | 1285 | 142424 | 6.63 | 146845 | 1.018 | 30 | 33.10 | stable | 81.84 | –0.164 |
aMW: molecular weight.
bpI: isoelectric point.
cEC/A: extinction coefficient/absorbance for 1% solutions.
dGRAVY: grand average of hydropathicity.
Comparison of phosphorylation, N-glycosylation, and disulfide bonds values of the spike protein of SARS CoV-2 and its variants.
| SARS-CoV-2 variants | Phosphorylation | N-glycosylation | Disulfide bonds | ||||||
|
| Phosphorylation sites | Predicted Sera, Thrb, and Tyrc sites | Predicted number | N-glycosylation sites | Predicted number | Total number of cysteine | Predicted number | ||
| Wild type | Ser, Thr, Tyr | Ser-64, Thr-44, Tyr-22 | 133 | Asnd-Ser/Thr | 17 | 40 | 15 | ||
| D614G | Ser, Thr, Tyr | Ser-64, Thr-45, Tyr-22 | 131 | Asn-Ser/Thr | 17 | 38 | 15 | ||
| Alpha (B.1.1.7) | Ser, Thr, Tyr | Ser-67, Thr-44, Yyr-22, | 133 | Asn-Ser/Thr | 17 | 40 | 15 | ||
| Beta (B.1.351) | Ser, Thr, Tyr | Ser-71, Thr-43, Tyr-24 | 136 | Asn-Ser/Thr | 17 | 30 | 14 | ||
| Delta (B.1.617.2) | Ser, Thr, Tyr | Ser-65, Thr-43, Tyr-22 | 130 | Asn-Ser/Thr | 16 | 40 | 15 | ||
| Gamma (P.1) | Ser, Thr, Tyr | Ser-66, Thr-43, Tyr-24 | 133 | Asn-Ser/Thr | 20 | 30 | 14 | ||
| Omicron (B.1.1.529) | Ser, Thr, Tyr | Ser-68, Thr-43, Tyr-21 | 132 | Asn-Ser/Thr | 18 | 30 | 14 | ||
aSer: serine.
bThr: threonine.
cTyr: tyrosine.
dAsn: asparagine.
Comparison of the immunological properties of the spike protein of SARS-CoV-2 and its variants.
| SARS-CoV-2 variants | Exposed B-cell epitopes | B-cell epitopes | Protective antigen prediction score | Predicted probability of antigenicity score | Number of epitopes identified in CTLa | Number of strong binders in T-cell | Predicted epitopes in T-cell | Immunogenicity predication score |
| Wild type | 40 | QTQTNSPRRARSV | 0.4646 | 0.717053 | 37 | 21 | IGINITRFQ | 0.3751 |
| D614G | 40 | YHKNNKS | 0.4583 | 0.741478 | 35 | 22 | ITRFQ | 0.96257 |
| Alpha (B.1.1.7) | 40 | QTQTNSPRRARSV | 0.4646 | 0.717053 | 37 | 21 | NGTHWFVTQRNFYEP | 0.3019 |
| Beta (B.1.351) | 40 | HPQFEKGGGSGGGGSG | 0.4542 | 0.643558 | 38 | 23 | QPYRVVVLSFE | 1.23216 |
| Delta (B.1.617.2) | 38 | SLGAENSVAYSN | 0.4709 | 0.744007 | 35 | 22 | IRAAEIRASAN | 0.0304 |
| Gamma (P.1) | 41 | QTQTNSPRRARSV | 0.4583 | 0.596261 | 36 | 22 | IGINITRFQ | 1.07515 |
| Omicron (B.1.1.529) | 33 | QTQTKSHGSASSVA | 0.4646 | 0.717053 | 31 | 20 | IGINITRFQTLLALH | 0.49637 |
aCTL: cytotoxic T-lymphocyte.
The P values of isoelectric point, antigenicity, and immunogenicity of the spike protein of SARS-CoV-2 and its variants.
| SARS-CoV-2 variants | Wild type | D614G | Alpha | Beta | Delta | Gamma | Omicron | |
|
| ||||||||
|
|
| —a | 0.996 | 1.000 | 0.989 | 0.999 | 0.992 | 1.000 |
|
| — | .06 | .007 | .09 | .02 | .08 | .01 | |
|
| ||||||||
|
|
| 0.996 | — | 0.995 | 0.998 | 0.992 | 0.999 | 0.998 |
|
| .06 | — | .06 | .04 | .08 | .03 | .04 | |
|
| ||||||||
|
|
| 1.000 | 0.995 | — | 0.988 | 1.000 | 0.990 | 1.000 |
|
| .007 | .06 | — | .10 | .02 | .09 | .02 | |
|
| ||||||||
|
|
| 0.989 | 0.998 | 0.988 | — | 0.983 | 1.000 | 0.992 |
|
| .09 | .04 | .10 | — | .12 | .01 | .08 | |
|
| ||||||||
|
|
| 0.999 | 0.992 | 1.000 | 0.983 | — | 0.986 | 0.998 |
|
| .02 | .08 | .02 | .12 | — | .11 | .04 | |
|
| ||||||||
|
|
| 0.992 | 0.999 | 0.990 | 1.000 | 0.986 | — | 0.994 |
|
| .08 | .03 | .09 | .01 | .11 | — | .07 | |
|
| ||||||||
|
|
| 1.000 | 0.998 | 1.000 | 0.992 | 0.998 | 0.994 | — |
|
| .01 | .04 | .02 | .08 | .04 | .07 | — | |
aNot applicable.
Figure 1Graphical illustration of the predicted percentages of the secondary and tertiary structure and disorder of the spike proteins of SARS-CoV-2 wild type and its variants. GMQE: global model quality estimate; TM: transmembrane.
Figure 2The 2D gel map of SARS-CoV-2 and its variants.
Figure 3The phylogenetic tree of SARS-CoV-2 and its variants represented in circular cladogram. Protein sequences of MERS-CoV and Rousettus bat coronavirus were used as nearest relatives.