| Literature DB >> 35707000 |
Boon Zhan Sia1, Wan Xin Boon1, Yoke Yee Yap1, Shalini Kumar1, Chong Han Ng1.
Abstract
Background: SARS-CoV-2 virus is a highly transmissible pathogen that causes COVID-19. The outbreak originated in Wuhan, China in December 2019. A number of nonsynonymous mutations located at different SARS-CoV-2 proteins have been reported by multiple studies. However, there are limited computational studies on the biological impacts of these mutations on the structure and function of the proteins.Entities:
Keywords: COVID-19; SARS-CoV-2; co-mutation; nonsynonymous mutation
Mesh:
Substances:
Year: 2022 PMID: 35707000 PMCID: PMC9184924 DOI: 10.12688/f1000research.72904.2
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
SARS-CoV-2 protein structures used in this study.
| Protein | Nucleotide changes | Amino acid changes | Template structure (PDB ID) |
|---|---|---|---|
| ORF1a nsp5 | C10376T | P108S | 7KPH |
| ORF1b nsp12 | C14408T | P323L | 6YYT |
| C14708T | A423V | 6YYT | |
| S | A23063T | N501Y | 7A92 |
| A23403G | D614G | 7A92 | |
| ORF3a | G25563T | Q57H | 6XDC |
| N | C28725T | P151L | 6VYO |
| G28881A | R203K | QHD43423 | |
| G28882A | R203K | QHD43423 | |
| G28883C | G204R | QHD43423 |
Concurrence ratio of top 10 nonsynonymous mutations in SARS-CoV-2 proteins.
| Coding region and amino acid change | ORF1a nsp5 P108S | ORF1b nsp12 P323L | ORF1b nsp12 A423V | S protein N501Y | S protein D614G | ORF3a Q57H | N protein P151L | N protein R203K | N protein R203K | N protein G204R | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Coding region and amino acid change | Nucleotide change | C10376T | C14408T | C14708T | A23063T | A23403G | G25563T | C28725T | G28881A | G28882A | G28883C |
|
|
| 0 | 99.9 | 98.3 | 0.2 | 99.9 | 4.6 | 97.2 | 91.6 | 91.6 | 91.7 |
|
|
| 99.9 | 0 | 99.9 | 98.8 | 99.8 | 99.7 | 99.9 | 99.7 | 99.8 | 99.8 |
|
|
| 98.3 | 99.9 | 0 | 0.1 | 99.9 | 0.8 | 98.3 | 98.6 | 98.6 | 98.6 |
|
|
| 0.2 | 98.8 | 0.1 | 0 | 99.2 | 25.7 | 0.2 | 67.3 | 65.6 | 66.7 |
|
|
| 99.9 | 99.8 | 99.9 | 99.2 | 0 | 99.9 | 100 | 99.6 | 99.7 | 99.7 |
|
|
| 4.6 | 99.7 | 0.8 | 25.7 | 99.9 | 0 | 1.1 | 0.4 | 0.3 | 0.2 |
|
|
| 97.2 | 99.9 | 98.3 | 0.2 | 100 | 1.1 | 0 | 98.1 | 98.1 | 98.1 |
|
|
| 91.6 | 99.7 | 98.6 | 67.3 | 99.6 | 0.4 | 98.1 | 0 | 99.9 | 99.9 |
|
|
| 91.6 | 99.8 | 98.6 | 65.6 | 99.7 | 0.3 | 98.1 | 99.9 | 0 | 99.9 |
|
|
| 91.7 | 99.8 | 98.6 | 66.7 | 99.7 | 0.2 | 98.1 | 99.9 | 99.9 | 0 |
Figure 1. The numbers of nonsynonymous mutations in 11 coding sequences of SARS-CoV-2 proteins.
Top 10 nonsynonymous mutations of SARS-CoV-2 proteins and their frequency percentage in the primary lineages of VOCs.
| Protein | Nucleotide changes | Amino acid changes | Frequency | Lineage (VOC) | ||||
|---|---|---|---|---|---|---|---|---|
| B.1.1.7
| B.1.351
| P.1
| B.1.617.2 (δ) | BA.1 (ο) | ||||
| ORF1a nsp5 | C10376T | P108S | 4024 | - | - | - | - | - |
| ORF1b nsp12 | C14408T | P323L | 27953 | 100% | 90% | 99% | 100% | 100% |
| C14708T | A423V | 3988 | - | - | - | - | - | |
| S | A23063T | N501Y | 4218 | 99% | 91% | 97% | 0% | 94% |
| A23403G | D614G | 28022 | 100% | 100% | 100% | 100% | 100% | |
| ORF3a | G25563T | Q57H | 5274 | 0% | 98% | 0% | 0% | 0% |
| N | C28725T | P151L | 4007 | - | - | - | - | - |
| G28881A | R203K | 18116 | 99% | 0% | 97% | 0% | 100% | |
| G28882A | R203K | 18092 | 99% | 0% | 97% | 0% | 100% | |
| G28883C | G204R | 18090 | 91% | 0% | 97% | 0% | 99% | |
Figure 2. Visualization of co-mutation in top 10 nonsynonymous mutations in SARS-CoV-2 proteins.
Prediction of nonsynonymous mutation effect on SARS-CoV-2 proteins stability.
| Protein | Mutation | ΔΔG (kcal/mol) | Prediction outcome | ΔΔS Vib ENCoM (kcal.mol −1.K −1) | Molecule flexibility |
|---|---|---|---|---|---|
| ORF1a nsp5 | P108S | −0.288 | Destabilizing | −0.208 | Decrease |
| ORF1b nsp12 | P323L | 1.784 | Stabilizing | −0.432 | Decrease |
| A423V | 0.776 | Stabilizing | −0.348 | Decrease | |
| S | N501Y | 0.013 | Stabilizing | −0.088 | Decrease |
| D614G | −0.072 | Destabilizing | 0.523 | Increase | |
| ORF3a | Q57H | 0.275 | Stabilizing | −0.160 | Decrease |
| N | P151L | 1.111 | Stabilizing | −0.325 | Decrease |
| R203K | 0.749 | Stabilizing | −0.107 | Decrease | |
| G204R | 1.064 | Stabilizing | −2.522 | Decrease |
Prediction of nonsynonymous mutation effect on SARS-CoV-2 proteins function.
| Protein | Mutation | SIFT 4G | Provean |
|---|---|---|---|
| ORF1a nsp5 | P108S | 0.00 (deleterious) | −3.71 (deleterious) |
| ORF1b nsp12 | P323L | - | −0.91 (neutral) |
| A423V | - | 1.21 (neutral) | |
| S | N501Y | - | −0.09 (neutral) |
| D614G | 1.00 (tolerated) | 0.60 (neutral) | |
| ORF3a | Q57H | 0.61 (tolerated) | −3.29 (deleterious) |
| N | P151L | - | −4.93 (deleterious) |
| R203K | 0.11 (tolerated) | −1.60 (neutral) | |
| G204R | 0.08 (tolerated) | −1.66 (neutral) |