| Literature DB >> 33737154 |
Md Mahbub Hasan1, Rasel Das2, Md Rasheduzzaman3, Md Hamed Hussain4, Nazmul Hasan Muzahid4, Asma Salauddin3, Meheadi Hasan Rumi3, S M Mahbubur Rashid5, Amam Zonaed Siddiki6, Adnan Mannan7.
Abstract
Coronavirus Disease 2019 (COVID-19) warrants comprehensive investigations of publicly available Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) genomes to gain new insight about their epidemiology, mutations, and pathogenesis. Nearly 0.4 million mutations have been identified so far among the ∼60,000 SARS-CoV-2 genomic sequences. In this study, we compared a total of 371 SARS-CoV-2 published whole genomes reported from different parts of Bangladesh with 467 sequences reported globally to understand the origin of viruses, possible patterns of mutations, and availability of unique mutations. Phylogenetic analyses indicated that SARS-CoV-2 viruses might have transmitted through infected travelers from European countries, and the GR clade was found as predominant in Bangladesh. Our analyses revealed 4604 mutations at the RNA level including 2862 missense mutations, 1192 synonymous mutations, 25 insertions and deletions and 525 other types of mutation. In line with the global trend, D614G mutation in spike glycoprotein was predominantly high (98 %) in Bangladeshi isolates. Interestingly, we found the average number of mutations in ORF1ab, S, ORF3a, M, and N were significantly higher (p < 0.001) for sequences containing the G614 variant compared to those having D614. Previously reported frequent mutations, such as R203K, D614G, G204R, P4715L and I300F at protein levels were also prevalent in Bangladeshi isolates. Additionally, 34 unique amino acid changes were revealed and categorized as originating from different cities. These analyses may increase our understanding of variations in SARS-CoV-2 virus genomes, circulating in Bangladesh and elsewhere.Entities:
Keywords: Bangladesh; COVID-19; Mutation; SARS-CoV-2; Whole-genome sequence
Year: 2021 PMID: 33737154 PMCID: PMC7959702 DOI: 10.1016/j.virusres.2021.198390
Source DB: PubMed Journal: Virus Res ISSN: 0168-1702 Impact factor: 3.303
Fig. 1Phylogenetic analysis of 371 Bangladeshi SARS-CoV-2 genomes with 467 representative sequences from 41 different countries worldwide. Sequence having the original D614 in Spike glycoprotein mutation is ‘shaded cyan’ while G614 in yellow. Outside the main tree and labels, there are three different annotation panels sequentially to show the sequences from Bangladesh (Orange Star), followed by the burden of mutation in different ORFs of SARS-CoV-2 as ‘heatmap’ and finally, GISAID clades as a ‘color strip’. The reference sequence, Wuhan-Hu-1 (Accession number MN908947), is indicated with .
Fig. 2District-wise distribution of GISAID clades among circulating SARS-CoV-2 strains in Bangladesh. Please be noted that district information of 72 sequences out of 371 deposited from Bangladesh is not available from respective metadata. Districts from where samples were collected for sequencing are shaded pink and the number of the sequences from each district is proportional to the circumference of the piecharts. The spatial map was created using layers downloaded from GeoDASH (The Bangladesh Geospatial Data Sharing Platform; https://geodash.gov.bd/) website on ArcGIS Desktop (Esri Inc.,109 Redlands, California, United States) licensed to King’s College London.
Number of common gene variants among 371 SARS-CoV-2 genomes from Bangladesh.
| Genome segment | Missense mutation | Synonymous mutation | Insertion Deletion | Others | Total |
|---|---|---|---|---|---|
| ORF1ab | 1141 | 898 | 1 | 4 | 2044 |
| S | 493 | 118 | 3 | 1 | 615 |
| ORF3a | 89 | 49 | 7 | 0 | 145 |
| E | 3 | 1 | 0 | 0 | 4 |
| M | 20 | 55 | 0 | 1 | 76 |
| ORF6 | 7 | 2 | 0 | 0 | 9 |
| ORF7a | 11 | 7 | 0 | 0 | 18 |
| ORF7b | 7 | 1 | 0 | 0 | 8 |
| ORF8 | 30 | 16 | 2 | 0 | 48 |
| ORF10 | 10 | 5 | 0 | 0 | 15 |
| N | 1051 | 40 | 2 | 1 | 1094 |
| Intergenic | 0 | 0 | 2 | 13 | 15 |
| 5′ UTR | 0 | 0 | 4 | 414 | 418 |
| 3' UTR | 0 | 0 | 4 | 91 | 95 |
| Total | 2862 | 1192 | 25 | 525 | 4604 |
Fig. 3Frequency of common mutations observed in Bangladesh over time. The x-axis of the plot shows the timeline between April 2020 and October 2020 whereas, the Y-axis depicts the frequency of mutations reported from the sequences of the same collection date.
Correlation of the average number of mutations per genome among different genomic segments with D614 G mutation.
| Genome segment | Average number of mutations per genome | Change (%) | ||
|---|---|---|---|---|
| D614 (wild; n = 212) | G614 (n = 626) | |||
| 5′ UTR | 0.34 | 1.12 | 70 | <0.001 |
| ORF1ab | 3.20 | 4.75 | 33 | <0.001 |
| S | 0.51 | 1.50 | 66 | <0.001 |
| ORF3a | 0.40 | 0.41 | 2 | 0.794 |
| E | 0.00 | 0.01 | 100 | 0.501 |
| M | 0.07 | 0.17 | 59 | <0.001 |
| ORF6 | 0.01 | 0.03 | 67 | 0.134 |
| ORF7a | 0.02 | 0.04 | 50 | 0.175 |
| ORF7b | 0.01 | 0.01 | 0 | 0.585 |
| ORF8 | 0.45 | 0.10 | −350 | <0.001 |
| N | 0.58 | 2.24 | 74 | <0.001 |
| ORF10 | 0.01 | 0.03 | 67 | 0.092 |
| 3' UTR | 0.52 | 0.22 | −136 | <0.001 |
| Total | 6.12 | 10.68 | 43 | |
Unique mutations in 371 SARS-CoV-2 genomes from Bangladesh.
| UM | Gene/ORF | District | Remarks |
|---|---|---|---|
| E194Q | ns3 | Barisal | Negatively charged to Glu neutral Gln |
| Q62E | ns7a | Barisal | Neutral Gln to negatively charged Glu |
| N377D | nsp2 | Chandpur | Polar Asn to acidic Asp |
| G773A | nsp3 | Chandpur | Same type of change |
| Y660F | Spike | Chandpur | Aromatic Tyr to to aromatic Phe |
| N11D | n | Chittagong | Polar Asn to acidic Asp |
| N39Y | ns6 | Chittagong | Polar aliphatic Asn to aromatic Tyr |
| A602S | nsp3 | Chittagong | Hydrophobic, and non-polar Ala to hydrophilic Ser |
| V56A | nsp1 | Dhaka | Hydrophobic Val to simple, and non-polar Ala |
| D85E | nsp4 | Dhaka | Both are same type of amino acid |
| N133B | nsp5 | Dhaka | Ambigious |
| L22I | nsp6 | Jessore | Both are same type of amino acids |
| K84T | nsp9 | Jessore | Positively charged ε-amino group containing Lys to hydroxyl group containing Thr |
| T80I | ns8 | Moulvibazar | Hydroxyl group containing Thr to hydrophobic Ile |
| E242A | ns3 | Mymensingh | Acidic Glu to hydrophobic Ala |
| G42V | ns7a | Rangpur | Glycine convert to hydrophobic valine |
| P38R | NS8 | Pabna, Chandpur, Brahmanbaria | Phenylalanine convert to Arginine which contains positively charged group. |
| D179N | nsp16 | Khulna, Rajshahi | Acidic Asp to polar aliphatic Asn |
| V469A | nsp2 | Mymensingh, Barisal, Chandpur | Hydrophobic Val to another hydrophobic, non-polar Ala |
| L373M | nsp3 | Brahmanbaria, Moulvibazar, Rajshahi | Hydrophobic side chain containing Leu to sulfur containing Met |
| V120L | nsp6 | Habiganj, Pabna | Valine to hydrophobic side chain containing Leu |
| H145N | n | NG | Polar, basic His to polar Asn |
| P104R | ns3 | NG | Phenylalanine convert to positively charged Arg |
| V121D | nsp1 | NG | Hydrophobic Val to acidic Asp |
| V31E | nsp12 | NG | Hydrophobic Val to negatively charged Glu |
| F217I | nsp14 | NG | Aromatic Phe to hydrophobic Ile |
| D1774H | nsp3 | NG | Acidic Asp to polar, basic His |
| E120K | nsp3 | NG | Negatively charged Glu to positively charged ε-amino group containing Lys |
| D1108N | nsp3 | NG | Acidic Asp to polar Asn |
| V843F | nsp3 | NG | Hydrophobic Val to aromatic Phe |
| A889V | nsp3 | NG | Hydrophobic, and non-polar Ala to another hydrophobic Val |
| G1691C | nsp3 | NG | Simple Gly to sulfur-containing, acidic Cys |
| E42A | nsp4 | NG | Negatively charged Glu to hydrophobic, non-polar Ala |
| I569S | Spike | NG | Hydrophobic side chain containing amino acid Ile to polar Ser |