| Literature DB >> 33221119 |
Md Sorwer Alam Parvez1, Mohammad Mahfujur Rahman2, Md Niaz Morshed2, Dolilur Rahman2, Saeed Anwar3, Mohammad Jakir Hosen4.
Abstract
As the coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), rages across the world, killing hundreds of thousands and infecting millions, researchers are racing against time to elucidate the viral genome. Some Bangladeshi institutes are also in this race, sequenced a few isolates of the virus collected from Bangladesh. Here, we present a genomic analysis of these isolates. The analysis revealed that SARS-CoV-2 isolates sequenced from Dhaka and Chittagong were the lineage of Europe and India, respectively. Our analysis identified a total of 42 mutations, including three large deletions, half of which were synonymous. Most of the missense mutations in Bangladeshi isolates found to have weak effects on the pathogenesis. Some mutations may lead the virus to be less pathogenic than the other countries. Molecular docking analysis to evaluate the effect of the mutations on the interaction between the viral spike proteins and the human ACE2 receptor, though no significant difference was observed. This study provides some preliminary insights into the origin of Bangladeshi SARS-CoV-2 isolates, mutation spectrum and its possible pathomechanism, which may give an essential clue for designing therapeutics and management of COVID-19 in Bangladesh.Entities:
Keywords: ACE2 receptor; Bangladeshi isolates; COVID-19; Mutation; SARS-CoV-2; Spike protein
Mesh:
Substances:
Year: 2020 PMID: 33221119 PMCID: PMC7641529 DOI: 10.1016/j.compbiolchem.2020.107413
Source DB: PubMed Journal: Comput Biol Chem ISSN: 1476-9271 Impact factor: 2.877
Fig. 1Maximum likelihood phylogenetic tree reconstructed with the sequences of all Bangladeshi isolates and other countries. The value in the nodes represents the bootstrap value of the branches where the branch length represents the evolutionary distance.
Predicted number of genes and identity compared to the reference strain. (Legends: S1: EPI_ISL_437912; S2: EPI_ISL_445213; S3: EPI_ISL_445214; S4: EPI_ISL_445215; S5: EPI_ISL_445216; S6: EPI_ISL_445217; S7: EPI_ISL_445244; S8: EPI_ISL_450339; S9: EPI_ISL_450340; S10: EPI_ISL_4503441; S11: EPI_ISL_450342; S12: EPI_ISL_450343; S13: EPI_ISL_450344; S14: EPI_ISL_450345; M: Missing).
| No | Protein | S1 | S2 | S3 | S4 | S5 | S6 | S7 | S8 | S9 | S11 | S11 | S12 | S13 | S14 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | ORF1a Polyprotein | 99.98 | 99.93 | 99.95 | 99.95 | 100 | 99.95 | 100 | 99.95 | 99.98 | 99.98 | 99.95 | 99.98 | 99.98 | 99.98 |
| 2 | ORF1b Polyprotein | 99.96 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 99.96 | 100 | 100 | 100 |
| 3 | Surface Glycoprotein | 99.92 | 100 | 99.84 | 99.92 | 99.92 | 99.92 | 99.92 | 100 | 100 | 100 | 100 | 99.92 | 99.92 | 100 |
| 4 | ORF3a protein | 100 | 99.64 | 100 | 99.64 | 100 | 99.64 | 100 | 100 | 100 | 100 | 100 | 99.27 | 99.64 | 99.64 |
| 5 | envelope protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| 6 | Membrane Glycoprotein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| 7 | ORF6 protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 99.36 | 100 | 100 |
| 8 | ORF7a protein | 100 | M | 100 | 100 | 100 | M | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| 9 | ORF7b | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| 10 | ORF8 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 99.17 | 100 | 99.17 | 99.17 | M | M | 99.35 |
| 11 | Neucleocapsid phospoprotein | 99.52 | 99.76 | 99.28 | 99.28 | 100 | 99.28 | 100 | 99.76 | 99.52 | 99.52 | 99.76 | 99.76 | 99.76 | 99.76 |
| 12 | ORF10 | 100 | M | 100 | 100 | 100 | M | 100 | 100 | 100 | 100 | M | 100 | 100 | 100 |
Fig. 2Variations Plot of SARS-CoV-2 in Bangladeshi isolates.
All mutations found in the coding regions of the 14 isolates compared to the reference strain. (Legends: S1: EPI_ISL_437912; S2: EPI_ISL_445213; S3: EPI_ISL_445214; S4: EPI_ISL_445215; S5: EPI_ISL_445216; S6: EPI_ISL_445217; S7: EPI_ISL_445244; S8: EPI_ISL_450339; S9: EPI_ISL_450340; S10: EPI_ISL_4503441; S11: EPI_ISL_450342; S12: EPI_ISL_450343; S13: EPI_ISL_450344; S14: EPI_ISL_450345).
| Strain | Mutation | Protein | Amino Acid Changes | Mutation Types |
|---|---|---|---|---|
| S11, 14 | 283:C > T | ORF1a Polyprotein | No change | Synonymous |
| S9, 10 | 602:C > T | ORF1a Polyprotein | No Change | Synonymous |
| S1,2,3, 4,6 | 1164:A > T | ORF1a Polyprotein | I300F | Missense |
| S1,2,3, 4, 5, 6, 7, 12, 13 | 3038:C > T | ORF1a Polyprotein | No Change | Synonymous |
| S5 | 3689:C > T | ORF1a Polyprotein | No Change | Synonymous |
| S2,3, 4, 6 | 4445:G > T | ORF1a Polyprotein | No Change | Synonymous |
| S8 | 6730:A > G | ORF1a Polyprotein | N2155S | Missense |
| S2, 3, 4, 6 | 8372:G > T | ORF1a Polyprotein | Q2702H | Missense |
| S8, 9, 10, 11, 14 | 8783:C > T | ORF1a Polyprotein | No change | Synonymous |
| S8, 9, 10, 11 | 10330:A > G | ORF1a Polyprotein | D3355G | Missense |
| S14 | 10871:G > T | ORF1a Polyprotein | K3353R | Missense |
| S2 | 10980:G > A | ORF1a Polyprotein | V3572M | Missense |
| S11 | 12120:C > T | ORF1a Polyprotein | P3952S | Missense |
| S8 | 12485:C > T | ORF1a Polyprotein | No Change | Synonymous |
| S1, 2, 3, 4, 5, 6, 7, 12, 13 | 14409:C > T | ORF1ab Polyprotein | P214L | Missense |
| S5, 8, 9, 10, 11, 14 | 15325:C > T | ORF1ab Polyprotein | No Change | Synonymous |
| S8 | 15739:C > T | ORF1ab Polyprotein | No change | Synonymous |
| S4 | 15896:C > T | ORF1ab Polyprotein | No Change | Synonymous |
| S1 | 17020:G > T | ORF1ab Polyprotein | E1084D | Missense |
| S12, 13 | 18878:C > T | ORF1ab Polyprotein | No Change | Synonymous |
| S11 | 19405:G > A | ORF1ab Polyprotein | V1883T | Missense |
| S12, 13 | 22445:C > T | Surface Glycoprotein | No change | Synonymous |
| S14 | 23321:C > T | Surface Glycoprotein | No change | Synonymous |
| S8, 9, 10, 11, 14 | 22469:G > T | Surface Glycoprotein | No change | Synonymous |
| S1,2, 3, 4, 5, 6, 7, 12, 13 | 23404:A > G | Surface Glycoprotein | D623G | Missense |
| S3 | 24488:T > C | Surface Glycoprotein | F1118L | Missense |
| S12, 13 | 25495:G > T | ORF3a protein | No change | Synonymous |
| S14 | 25506:A > T | ORF3a protein | Q38L | Missense |
| S12 | 25512:C > T | ORF3a protein | S40L | Missense |
| S12, 13 | 25564:G > T | ORF3a protein | Q57H | Missense |
| S2, 4, 6 | 25907:G > T | ORF3a protein | G172C | Missense |
| S12, 13 | 26736:C > T | Membrane Glycoprotein | No Change | Synonymous |
| S12 | 27282:G > T | ORF6 protein | W27L | Missense |
| S2 | 27432−27651:DEL | ORF7a protein | Whole protein deletion | Deletion |
| S6 | 27486−27613:DEL | ORF7a protein | Whole protein deletion | Deletion |
| S12, 13 | 27911−28254:DEL | ORF8 | Whole protein deletion | Deletion |
| S14 | 28098:C > T | ORF8 | A65V | Missense |
| S8, 9, 10, 11, 14 | 28145:T > C | ORF8 | L84S | Missense |
| S8, 9, 10, 11, 14 | 28879:G > A | Neucleocapsid phospoprotein | S202N | Missense |
| S1,2,3, 4, 6 | 28882:G > A | Neucleocapsid phospoprotein | R203K | Missense |
| S1,2,3, 4, 6 | 28883:G > A | Neucleocapsid phospoprotein | R203K | Missense |
| S1,2,3, 4, 6 | 28884:G > C | Neucleocapsid phospoprotein | G204R | Missense |
| S9, 10 | 29293:G > T | Neucleocapsid phospoprotein | K373N | Missense |
| S2,3, 4, 6 | 29404:A > G | Neucleocapsid phospoprotein | D377G | Missense |
| S8, 9, 10, 11, 14 | 29643:G > A | ORF10 | No Change | Synonymous |
Prediction of the mutational effects on the structural stability.
| Protein | Amino Acid Changes | SVM2 Prediction Effect | DDG Value (kcal/mol) |
|---|---|---|---|
| ORF1a Polyprotein | I300F | Decrease | −1.79 |
| ORF1a Polyprotein | N2155S | Decrease | −0.60 |
| ORF1a Polyprotein | Q2702H | Decrease | −0.68 |
| ORF1a Polyprotein | D3355G | Decrease | −0.95 |
| ORF1a Polyprotein | K3353R | Increase | −0.13 |
| ORF1a Polyprotein | V3572M | Decrease | −0.88 |
| ORF1a Polyprotein | P3952S | Decrease | −1.21 |
| ORF1b Polyprotein | P214L | Decrease | −0.83 |
| ORF1b Polyprotein | E1084D | Decrease | −0.75 |
| ORF1b Polyprotein | V1883T | Decrease | −1.46 |
| Surface Glycoprotein | D623G | Decrease | −0.93 |
| Surface Glycoprotein | F1118L | Decrease | −0.81 |
| ORF3a protein | Q38L | Increase | 0.12 |
| ORF3a protein | S40L | Increase | 0.40 |
| ORF3a protein | Q57H | Decrease | −0.90 |
| ORF3a protein | G172C | Decrease | −0.83 |
| ORF6 protein | W27L | Decrease | −0.96 |
| ORF8 | A65V | Increase | 0.02 |
| ORF8 | L84S | Decrease | −2.29 |
| Neucleocapsid phospoprotein | S202N | Increase | −0.78 |
| Neucleocapsid phospoprotein | R203K | Decrease | −0.93 |
| Neucleocapsid phospoprotein | G204R | Decrease | −0.52 |
| Neucleocapsid phospoprotein | K373N | Increase | −0.10 |
| Neucleocapsid phospoprotein | D377G | Decrease | −0.44 |
Prediction of the mutational effects on the molecular consequences.
| Protein Name | Mutation | Effects |
|---|---|---|
| Surface Glycoprotein | F1118L | Altered Ordered interface |
| Altered Disordered interface | ||
| Altered DNA binding | ||
| Loss of Sulfation at Y1119 | ||
| Altered Metal binding | ||
| ORF3a | G172C | Loss of O-linked glycosylation at S171 |
| Gain of Disulfide linkage at G172 | ||
| Loss of Intrinsic disorder | ||
| Altered Transmembrane protein | ||
| Altered Ordered interface | ||
| Gain of Loop | ||
| Loss of Proteolytic cleavage at D173 | ||
| ORF6 | W27L | Altered Ordered interface |
| Altered Disordered interface | ||
| Loss of Strand | ||
| Gain of Helix | ||
| Loss of Allosteric site at F22 | ||
| Gain of Sulfation at Y31 | ||
| Altered DNA binding | ||
| Altered Transmembrane protein |
Fig. 3Homology model of the spike proteins; (A) wildtype (B) Model with one mutation: D623 G (C) Model with two mutations: D623 G and F1118 L (D) Superimpose of all models. Here, in B and C, red dot represents the mutation site. In D, purple color represents the wildtype model; the cyan represents a model with one mutation, and the green represents a model with two mutations.
Model Validation assessment score.
| Structures | Rampage Score | ERRAT Score | |
|---|---|---|---|
| Favoured Region | Allowed Region | ||
| Template | 95.8 % | 4.1 % | 76 % |
| Wild type | 92.9 % | 5.7 % | 83 % |
| Mutant Model 1 | 92.6% | 5.3 % | 84.69 % |
| Mutant Model 2 | 92.8% | 5.3 % | 83.78 % |
Molecular docking results of human ACE2 receptor against wild-type and muatant spike protein of SARS-CoV-2.
| Models | Variations | HDOCK Score |
|---|---|---|
| Model 1 | Wild type | −244.42 |
| Model 2 | D623G | −244.42 |
| Model 3 | D623 G, F1118L | −244.42 |
Fig. 4Interaction of Spike protein with ACE2: (A) carton model and (B) Surface model. Here, green represents the receptor binding domain (RBD) of spike protein, and cyan represents human ACE2.