| Literature DB >> 35493250 |
Mohammad Uzzal Hossain1,2, Ishtiaque Ahammad2, Arittra Bhattacharjee2, Zeshan Mahmud Chowdhury2, Md Tabassum Hossain Emon3, Keshob Chandra Das4, Chaman Ara Keya5, Md Salimullah4.
Abstract
Coronavirus disease-19 (COVID-19) caused by SARS-CoV-2 has already killed more than one million people worldwide. Since novel coronavirus is a new virus, mining its genome sequence is of crucial importance for drug/vaccine(s) development. Whole genome sequencing is a helpful tool in identifying genetic changes that occur in a virus when it spreads through the population. In this study, we performed complete genome sequencing of SARS-CoV-2 to unveil the genomic variation and indel, if present. We discovered thirteen (13) mutations in Orf1ab, S and N gene where seven (7) of them turned out to be novel mutations from our sequenced isolate. Besides, we found one (1) insertion and seven (7) deletions from the indel analysis among the 323 Bangladeshi isolates. However, the indel did not show any effect on proteins. Our energy minimization analysis showed both stabilizing and destabilizing impact on viral proteins depending on the mutation. Interestingly, all the variants were located in the binding site of the proteins. Furthermore, drug binding analysis revealed marked difference in interacting residues in mutants when compared to the wild type. Our analysis also suggested that eleven (11) mutations could exert damaging effects on their corresponding protein structures. This journal is © The Royal Society of Chemistry.Entities:
Year: 2021 PMID: 35493250 PMCID: PMC9044322 DOI: 10.1039/d1ra05327b
Source DB: PubMed Journal: RSC Adv ISSN: 2046-2069 Impact factor: 4.036
Fig. 1Workflow of the study.
NIB_01 polymorphisms against reference sequence and their mutational effect
| S. no. | Query (reference) position | Position in subject (NIB-01) sequence | Amino acid | Query → subject | Gene | Mutants observed | Mutants observed in no. in virus | Impact on proteins | Protein ID of mutants | First date of detection in GISAID |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 241st | 93 | — |
| 5′UTR | US, UK | 15 042 | Upstream variant | QHD43415.1 | Not available |
| 2 | 627th | 479 | Val121 |
| Orf1ab | — | — | — | — | Not available |
| 3 | 629th | 481 | Leu122 |
| Orf1ab | — | — | — | — | Not available |
| 4 | 1163rd | 1015 | Ile120 |
| Orf1ab | — | — | Coding variant | — | March, 2020 in hCoV-19/France/IDF_HB_112003047702/2020 |
| 5 | 5246th | 5098 | Val843 |
| Orf1ab | — | — | Coding variant | QHD43415.1 | May 2020 in hCoV-19/Bangladesh/NIB-01/2020 |
| 6 | 5385th | 5237 | Ala889 | C → T | Orf1ab | — | — | Coding variant | QHD43415.1 | May 2020 in hCoV-19/Bangladesh/NIB-01/2020 |
| 7 | 7790th | 5642 | Gly1691 |
| Orf1ab | — | — | Coding variant | QHD43415.1 | First found in May 2020 from hCoV-19/Bangladesh/NIB-01/2020 |
| From Later Update/submission: in April 2020, from hCoV-19/Canada/AB-50997/2020 | ||||||||||
| 8 | 8171st | 8023 | Ala1818 |
| Orf1ab | — | — | Coding variant | QHD43415.1 | March 2020 in hCoV-19/England/GSTT-265DF65/2020. |
| 9 | 23 403rd | 23 255 | Asp614 |
| “S” | US, UK | 15 118 | Coding variant | QHD43416.1 | January 2020 in hCoV-19/Japan/20200409-129/2020 |
| 10 | 28 882nd | 28 734 | Arg203 |
| “N” | UK | 4542 | Coding variant | QHD43423.2 | January 2020 in hCoV-19/Japan/20200409-129/2020 |
| 11 | 28 883rd | 28 735 | Gly204 |
| “N” | UK | 4537 | Coding variant | QHD43423.2 | January 2020 in hCoV-19/Japan/20200409-129/2020. |
Fig. 2Novel mutations in the protein sequence in NIB_01 whole genome. In total, seven novel mutations was identified.
Indel profile of Bangladeshi whole genome
| Serial no. | Sequence identifier | Indel type | Region |
|---|---|---|---|
| 1 | hCoV-19/Bangladesh/BCSIR-NILMRC-071/2020|EPI_ISL_466692|2020-05-26 | Insertion | ORF8 (27 910–27 985) |
| 2 | hCoV-19/Bangladesh/BARJ-CVASU-CTG-511/2020|EPI_ISL_450343|2020-05-09 | Deletion | ORF8 (27 913–28 254) |
| 3 | hCoV-19/Bangladesh/BARJ-CVASU-CTG- 517/2020|EPI_ISL_450344|2020-05-03 | Deletion | ORF8 (27 913–28 254) |
| 4 | hCoV-19/Bangladesh/CHRF-0006/2020|EPI_ISL_468074|2020-05-17 | Deletion | ORF8 (27 913–28 254) |
| 5 | hCoV-19/Bangladesh/NGRI-NSTU-31/2020|EPI_ISL_514614|2020-07-21 | Deletion | ORF8 (27 913–28 254) |
| 6 | hCoV-19/Bangladesh/DNAS-CPH-467/2020|EPI_ISL_445213|2020-04-28 | Deletion | ORF7 (27 476–27 668) |
| 7 | hCoV-19/Bangladesh/DNAS-CPH-436/2020|EPI_ISL_445217|2020-04-28 | Deletion | ORF7 (27 476–27 668) |
| 8 | hCoV-19/Bangladesh/DU-50761/2020|EPI_ISL_450842|2020-05-06 | Deletion | ORF7 (27 476–27 668) |
Fig. 3Deletion of sequences in complete genome of the Bangladesh strain. Indel experimentation result from comparative genome browsing against reference genome in Artemis window. (a) Mobile element insertion from n to ORF8 in EPI_ISL_466692. Yellow bar representing homologous sequence moiety between these two regions. (b) EPI_ISL_450343, EPI_ISL_450344, EPI_ISL_468074, EPI_ISL_514614 are sharing the same deletion pattern in ORF8. Majority portion of ORF8, lost in the deletion event. (c) EPI_ISL_445213, EPI_ISL_445217 and EPI_ISL_450842 are also sharing the same deletion pattern in Orf7A. Majority portion of Orf7A, lost in the deletion event.
Fig. 4Location of the wild type and the mutant residues on the tertiary structure of SARS-CoV-2 proteins.
Energy minimization of the wild type and mutant model
| Serial number | Variant in NIB_01 genome position | Energy minimization score | Stability of protein |
|---|---|---|---|
| 1 | V121 (wild type) | −4714.146 kJ mol−1 | Increase |
| 2 | V121D (mutant type) | −4718.002 kJ mol−1 | |
| 3 | L122 (wild type) | −4710.914 kJ mol−1 | Decrease |
| 4 | L122I (mutant type) | −4382.711 kJ mol−1 | |
| 5 | I120 (wild type) | −14 247.945 kJ mol−1 | Increase |
| 6 | I120F (mutant type) | −14 678.767 kJ mol−1 | |
| 7 | V843 (wild type) | −10 989.878 kJ mol−1 | Decrease |
| 8 | V843F (mutant) | −10 847.174 kJ mol−1 | |
| 9 | A889 (wild type) | −10 989.688 kJ mol−1 | Decrease |
| 10 | A889V (mutant type) | −10 731.945 kJ mol−1 | |
| 11 | G1691 (wild type) | −23 276.789 kJ mol−1 | Decrease |
| 12 | G1691C (mutant type) | −22 377.976 kJ mol−1 | |
| 13 | A1818 (wild type) | −23 276.789 kJ mol−1 | Decrease |
| 14 | A1818T (mutant type) | −23 051.338 kJ mol−1 | |
| 15 | D614 (wild type) | −9876.789 kJ mol−1 | Decrease |
| 16 | D614G (mutant type) | −9308.898 kJ mol−1 | |
| 19 | R203 (wild type) | −8789.976 kJ mol−1 | Decrease |
| 20 | R203K (mutant type) | −8356.435 kJ mol−1 | |
| 21 | G204 (wild type) | −8789.976 kJ mol−1 | Decrease |
| 22 | G204R (mutant type) | −8089.172 kJ mol−1 |
Fig. 5Mutation in the ligand binding site of the SARS-CoV-2 proteins.
Drug binding analysis of variant. Red labeled residues denotes the mutated residues
| Variant position in subject (NIB-01) sequence and protein coding region | Drug name | Binding energy |
|---|---|---|
| 479 (wild type) V121 NSP1 | Ivermectin | −7.9 kcal mol−1 |
| 479 (mutant type) V121D NSP1 | Ivermectin | −8.2 kcal mol−1 |
| 481(wild type) L122 NSP1 | Ivermectin | −8.7 kcal mol−1 |
| 481 (mutant type) L122I NSP1 | Ivermectin | −7.7 kcal mol−1 |
| 1015 (wild type) I120 NSP2 | Ivermectin | −7.9 kcal mol−1 |
| 1015 (mutant type) I120F NSP2 | Ivermectin | −6.9 kcal mol−1 |
| 5098 (wild type) V843 NSP3 | Ivermectin | −8.8 kcal mol−1 |
| 5098 (mutant) V843F NSP3 | Ivermectin | −8.0 kcal mol−1 |
| 5237 (wild type) A889 NSP3 | Ivermectin | −7.6 kcal mol−1 |
| 5237 (mutant type) A889V NSP3 | Ivermectin | −7.1 kcal mol−1 |
| 8023 (wild type) A1818 NSP3 | Ivermectin | −6.6 kcal mol−1 |
| 8023 (mutant type) A1818T NSP3 | Ivermectin | −7.5 kcal mol−1 |
| 23 255 (wild type) D614 spike | Remdisivir | −8.7 kcal mol−1 |
| 23 255 (mutant type) D614G spike | Remdisivir | −8.3 kcal mol−1 |
| 28 734 (wild type) R203 N | Remdisivir | −6.8 kcal mol−1 |
| 28 734 (mutant type) R203K N | Remdisivir | −7.8 kcal mol−1 |
| 28 735 (wild type) G204 N | Remdisivir | −7.1 kcal mol−1 |
| 28 735 (mutant type) G204R N | Remdisivir | −8.4 kcal mol−1 |