| Literature DB >> 32881907 |
Mohd Imran Khan1, Zainul A Khan2, Mohammad Hassan Baig3, Irfan Ahmad4,5, Abd-ElAziem Farouk6, Young Goo Song7, Jae-Jun Dong3.
Abstract
A novel severe acute respiratory syndrome-related coronavirus-2 (SARS-CoV-2) causing COVID-19 pandemic in humans, recently emerged and has exported in more than 200 countries as a result of rapid spread. In this study, we have made an attempt to investigate the SARS-CoV-2 genome reported from 13 different countries, identification of mutations in major coronavirus proteins of these different SARS-CoV-2 genomes and compared with SARS-CoV. These thirteen complete genome sequences of SARS-CoV-2 showed high identity (>99%) to each other, while they shared 82% identity with SARS-CoV. Here, we performed a very systematic mutational analysis of SARS-CoV-2 genomes from different geographical locations, which enabled us to identify numerous unique features of this viral genome. This includes several important country-specific unique mutations in the major proteins of SARS-CoV-2 namely, replicase polyprotein, spike glycoprotein, envelope protein and nucleocapsid protein. Indian strain showed mutation in spike glycoprotein at R408I and in replicase polyprotein at I671T, P2144S and A2798V,. While the spike protein of Spain & South Korea carried F797C and S221W mutation, respectively. Likewise, several important country specific mutations were analyzed. The effect of mutations of these major proteins were also investigated using various in silico approaches. Main protease (Mpro), the therapeutic target protein of SARS with maximum reported inhibitors, was thoroughly investigated and the effect of mutation on the binding affinity and structural dynamics of Mpro was studied. It was found that the R60C mutation in Mpro affects the protein dynamics, thereby, affecting the binding of inhibitor within its active site. The implications of mutation on structural characteristics were determined. The information provided in this manuscript holds great potential in further scientific research towards the design of potential vaccine candidates/small molecular inhibitor against COVID19.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32881907 PMCID: PMC7470274 DOI: 10.1371/journal.pone.0238344
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Schematic representation of genome organization of SARS-CoV-2.
Fig 2Phylogenetic dendogram showing the relationship of SARS-CoV-2 complete sequence (nucleotide) from different geographical locations (13 no.) with SARS-CoV (A) and amino acid sequence of replicase polyprotein of 13 SARS-CoV-2 with SARS-CoV (B).
The evolutionary history was inferred using the Neighbor-Joining method.
Amino acid variation in replicase polyprotein of SARS-CoV-2 strains of 13 different countries.
| Amino acid | India | China | South Korea | Sweden | Vietnam | Brazil | Taiwan | USA | Australia | Japan | Finland | Nepal | Italy |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 671 | T | I | I | I | I | I | I | I | I | I | I | I | I |
| 818 | G | G | G | S | G | G | G | G | G | G | G | G | G |
| 902 | M | M | I | M | M | M | M | M | M | M | M | M | M |
| 2144 | S | P | P | P | P | P | P | P | P | P | P | P | P |
| 2708 | N | S | N | N | N | N | N | N | N | N | N | N | N |
| 2908 | F | I | F | F | F | F | F | F | F | F | F | F | F |
| 3323 | R | R | R | R | C | R | R | R | R | R | R | R | R |
| 3606 | L | L | L | L | L | F | L | L | L | L | L | L | X |
| 4321 | F | F | F | L | F | F | F | F | F | F | F | F | F |
| 4798 | V | A | A | A | A | A | A | A | A | A | A | A | A |
| 6891 | T | T | M | T | T | T | T | T | T | T | T | T | T |
Fig 3Alignment of SARS-CoV-2 major proteins (A) main protease, (B) RNA-dependent RNA polymerase, (C) spike proteins, (D) envelope proteins and (E) nucleocapsid proteins from different countries.
Amino acid variation in spike protein of SARS-CoV-2 strains of 13 different countries.
| Amino acid | QHS34546 India | QHU79173 Finland | QHR84449 Australia | QHZ00379 South Korea | QIG55994 Brazil | QIK50438 Vietnam | QIK50417 Taiwan | QIA98554 Italy | QHU36824 China | QHO60594 USA | BCB15090 Japan | QIC53204 Sweden | QIB84673 Nepal |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 49 | H | Y | H | H | H | H | H | H | H | H | H | H | H |
| 145 | - | Y | Y | Y | Y | Y | Y | Y | Y | Y | Y | Y | Y |
| 221 | S | S | S | W | S | S | S | S | S | S | S | S | S |
| 247 | S | S | R | S | S | S | S | S | S | S | S | S | S |
| 408 | I | R | R | R | R | R | R | R | R | R | R | R | R |
| 797 | F | F | F | F | F | F | F | F | F | F | F | C | F |
Amino acid variation in ORF 3 encoded protein of SARS-CoV-2 strains of 13 different countries.
| Position Amino acid | QHS34546 India | QHU79173 Finland | QHR84449 Australia | QHZ00379 South Korea | QIG55994 Brazil | QIK50438 Vietnam | QIK50417 Taiwan | QIA98554 Italy | QHU36824 China | QHO60594 USA | BCB15090 Japan | QIC53204 Sweden | QIB84673 Nepal |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 128 | W | W | W | L | W | W | W | W | W | W | W | W | W |
| 140 | L | L | L | L | L | L | L | L | L | L | V | L | L |
| 251 | G | G | V | V | V | G | G | V | G | G | G | V | G |
Amino acid variation in envelop protein of SARS-CoV-2 strains of 13 different countries.
| Position Amino acid | QHS34546 India | QHU79173 Finland | QHR84449 Australia | QHZ00379 South Korea | QIG55994 Brazil | QIK50438 Vietnam | QIK50417 Taiwan | QIA98554 Italy | QHU36824 China | QHO60594 USA | BCB15090 Japan | QIC53204 Sweden | QIB84673 Nepal |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 37 | L | L | L | H | L | L | L | L | L | L | L | L | L |
Amino acid variation in nucleocapsid protein of SARS-CoV-2 strains of 13 different countries.
| Position Amino acid | QHS34546 India | QHU79173 Finland | QHR84449 Australia | QHZ00379 South Korea | QIG55994 Brazil | QIK50438 Vietnam | QIK50417 Taiwan | QIA98554 Italy | QHU36824 China | QHO60594 USA | BCB15090 Japan | QIC53204 Sweden | QIB84673 Nepal |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 344 | P | P | P | P | P | P | P | P | P | P | S | P | P |
Mutation in SARS-CoV-2 proteins from different geographical locations and their predicted effect on protein stability.
| Protein Name | Mutation | Country | Stability effect (MUPRO) |
|---|---|---|---|
| 3C-like proteinase (3CLpro) | R60C | Vietnam | DECREASE stability (ΔΔG -1.0163868) |
| Envelope Protein | L37H | South Korea | DECREASE stability (ΔΔG -2.4215632) |
| ORF3a | W128L | South Korea | DECREASE stability (ΔΔG -0.39593766) |
| L140V | Japan | DECREASE stability (ΔΔG -0.90740107) | |
| G251V | Australia, South Korea, Brazil, Italy, Sweden | DECREASE stability (ΔΔG -0.45128408) | |
| Spike Protein | H49Y | Finland | DECREASE stability (ΔΔG -0.20900128) |
| S221W | Brazil | DECREASE stability (ΔΔG -0.45085799) | |
| S247R | Australia | DECREASE stability (ΔΔG -1.3464875) | |
| R408I | India | INCREASE stability (ΔΔG 0.49732107) | |
| F797C | Sweden | DECREASE stability (ΔΔG -1.501262) | |
| Nucleocapsid | P344S | Japan | DECREASE stability (ΔΔG -1.2252261) |
| RNA-dependent RNA polymerase | A406V | India | DECREASE stability (ΔΔG -0.76907034) |
Fig 4Molecular dynamics of complexed SARS-CoV-2 Mpro and the R60C mutant.
Green color indicates the SARS-CoV-2 Mpro while the cyan color indicates the R60C mutant Mpro. (A) Backbone RMSDs of Mpro and its mutated form (B) Rg of Cα atoms (C) Change in Solvent accessible surface area (D) RMSF of the backbone atoms (E) The Lennard–Jones short-range (LJ-SR) and Coulombic short-range (Coul-SR) potential energies.
The Lennard–Jones short-range (LJ-SR) and Coulombic short-range (Coul-SR) potential energies calculated throughout the course of 50 ns of MD simulation.
| Complex | Average (kJ/mol) | Total drift (kJ/mol) |
|---|---|---|
| LJ-SR:Mpro(WT)- Boceprevir | -158.90 | 18.54 |
| LJ-SR:Mpro(R60C)- Boceprevir | -134.43 | -19.01 |
| Coul-SR: Mpro(WT)- Boceprevir | -65.37 | 17.45 |
| Coul-SR: Mpro(R60C)- Boceprevir | -59.56 | 6.49 |
Fig 5(A) The hydrogen bond network of the Mpro (WT) and R60C mutant. (B) The Structure of WT and R60C mutant Mpro.