Literature DB >> 33936746

Extensive genetic diversity with novel mutations in spike glycoprotein of severe acute respiratory syndrome coronavirus 2, Bangladesh in late 2020.

S Z Afrin1, S K Paul2, J A Begum3, S A Nasreen1, S Ahmed1, F U Ahmad4, M A Aziz5, R Parvin3, M S Aung6, N Kobayashi6.   

Abstract

In Bangladesh, coronavirus disease 2019 (COVID-19) has been highly prevalent during late 2020, with nearly 500 000 confirmed cases. In the present study, the spike (S) protein of severe acute respiratory coronavirus 2 (SARS-CoV-2) circulating in Bangladesh was genetically investigated to elucidate the diversity of mutations and their prevalence. The nucleotide sequence of the S protein gene was determined for 15 SARS-CoV-2 samples collected from eight divisions in Bangladesh, and analysed for mutations compared with the reference strain (hCoV-19/Wuhan/WIV04/2019). All the SARS-CoV-2 S genes were assigned to B.1 lineage in G clade, and individual S proteins had 1-25 mutations causing amino acid substitution/deletion. A total of 133 mutations were detected in 15 samples, with D614G being present in all the samples; 53 were novel mutations as of January 2021. On the receptor-binding domain, 21 substitutions including ten novel mutations were identified. Other novel mutations were located on the N-terminal domain (S1 subunit) and dispersed sites in the S2 subunit, including two substitutions that remove potential N-glycosylation sites. A P681R substitution adjacent to the furin cleavage site was detected in one sample. All the mutations detected were located on positions that are functionally linked to host transition, antigenic drift, host surface receptor binding or antibody recognition sites, and viral oligomerization interfaces, which presumably related to viral transmission and pathogenic capacity.
© 2021 The Author(s).

Entities:  

Keywords:  Bangladesh; mutation; severe acute respiratory syndrome coronavirus 2; spike glycoprotein

Year:  2021        PMID: 33936746      PMCID: PMC8065242          DOI: 10.1016/j.nmni.2021.100889

Source DB:  PubMed          Journal:  New Microbes New Infect        ISSN: 2052-2975


Introduction

Coronavirus disease 2019 (COVID-19) pandemic is caused by a novel species of coronavirus, i.e. severe acute respiratory syndrome coronavirus type 2 (SARS-CoV-2) [1]. The virus was first recognized in Wuhan, China, in December 2019 and since then its worldwide spread has had a detrimental effect on global health and economy [2]. In Bangladesh, the first case of COVID-19 was identified on 8 March 2020 (https://corona.gov.bd/). Then, to prevent further human-to-human transmission, measures were scheduled quickly by the Government of Bangladesh including initial lockdown, social distancing, movement restrictions, closure of public and private facilities, and reduction of domestic and international air transport. Nevertheless, despite these strict measures, Bangladesh has been vulnerable to the current COVID-19 pandemic because it is one of the world's most densely populated countries. As a consequence, until the end of January 2021, a total of 535 230 cases with 8102 death have been recorded (https://corona.gov.bd/?gclid) officially in the country. Given the evolving nature of the SARS-CoV-2 genome, drug and vaccine developers continue to be vigilant for the emergence of new variants or sub-strains of the virus [3]. Knowledge of the molecular background of circulating strains is vital for the appropriate selection of protective vaccine targets and to maintain molecular diagnostic tools for Bangladesh. Although structural proteins of SARS-CoV-2 show extremely high conservation, mutations have been identified, most frequently in the spike (S) protein as well as in the nucleocapsid protein, among global genomic data of this virus [4]. The S protein of coronaviruses is a trimeric glycoprotein that contains two subunits (S1 and S2) that mediate attachment and fusion of viral and cellular membranes, respectively [5,6]. In the process of viral entry into a cell, the receptor-binding domain (RBD) of the S1 subunit attaches to the host cell's receptor angiotensin-converting enzyme 2, while the S2 subunit mediates fusion of viral and cellular membranes [7]. For the process of virus–cell fusion, the S protein requires priming by host proteases in cleavage sites with a polybasic (furin) cleavage motif (RRAR) at the S1/S2 boundary [7,8]. Mutations of the S protein have been considered important because they are associated with alteration of infectivity and antigenicity of SARS-CoV-2 [9,10]. In Bangladesh, the SARS-CoV-2 genome has been analysed in several reports, revealing variation and frequency of mutations in all the viral proteins [[11], [12], [13], [14], [15], [16], [17]]. However, mutations on the S protein have not yet been well investigated and sufficient information is not available. In the present study, we analysed variation and frequency of missense mutations in the S protein of SARS-CoV-2 from geographically wide areas in Bangladesh. The results revealed extensive diversification of the S protein associated with a wide spectrum of mutations that may affect viral transmission and pathogenicity.

Materials and methods

Viral RNA samples from COVID-19 patients were collected from laboratories in eight divisions of Bangladesh (Barisal, Chittagong, Dhaka, Khulna, Mymensingh, Rajshahi, Rangpur, Sylhet) between July and December 2020. Initial detection of SARS-CoV-2 in nasopharyngeal swab samples was performed by quantitative RT-PCR targeting the ORF1ab and N genes using a commercial kit (Novel Coronavirus Nucleic Acid Diagnostic Kit, Sansure Biotech, Changsha, China). Among 229 samples collected, 15 samples with higher viral load, presumed by the Cq value of the ORF1ab and N genes, were selected for further analysis (see Supplementary material, Table S1). Full-length of S gene was amplified by conventional RT-PCR with newly designed primer sets (see Supplementary material, Table S2). PCR products were purified and were subjected to Sanger-sequencing. The obtained sequences were assembled, edited and aligned using BioEdit software. Sequence data obtained in this study were deposited in the GISAID database, from which accession numbers were assigned (see Supplementary material, Table S3). A phylogenetic tree was generated using MEGA X software [18] applying the maximum likelihood method and the Tamura–Nei model [19]. Deduced amino acid sequences of S proteins were aligned and compared with the reference strain (hCoV-19/Wuhan/WIV04/2019). The major amino acid changes at the receptor binding site, antigenic site and cleavage site of the S protein were represented in a three-dimensional model using Swiss-Model (https://swissmodel.expasy.org/) software. This study was approved by the institutional review board of Mymensingh Medical College, Bangladesh (MMC/IRB/2020/290).

Results

Nucleotide sequences of 15 SARS-CoV-2 S genes were assigned to clade G (GISAID nomenclature) and further clustered with the B.1 lineage [20] with 98.1%–99.9% identity to the reference strain hCoV-19/Wuhan/WIV04/2019. Phylogenetically, these 15 S protein genes were scattered over several sub-branches together with viruses of this group distributed globally (see Supplementary material, Fig. S1), showing the highest nucleotide sequence identity (99.21%–100%) to strains from the USA, India, Bangladesh, the Netherlands, Colombia, Egypt, Australia, South Africa and Canada (see Supplementary material, Table S4). A total of 189 single nucleotide polymorphisms by comparison with the S gene sequence of the reference strain were detected in 15 SARS-CoV-2 in Bangladesh (Table 1). Deduced amino acid sequences of 15 samples exhibited a total of 133 replacements (substitution, deletion) with 110 different types of amino acid substitution (see Supplementary material, Table S5). Individual S proteins had 1–25 amino acid replacements (average 8.9 mutations). Among the amino acid replacements identified in the present study, 80 were previously known and 53 were novel mutations as of January 2021.
Table 1

Mutations identified in spike protein of 15 SARS-CoV-2 samples in Bangladesh compared with hCoV-19/Wuhan/WIV04/2019

Sample IDNo. of single nucleotide polymorphismsNo.of amino acid replacementsAmino acid replacement (substitution/deletion)
KnownNovela
D3327D614G, P681R, I1081VP491H, P499A, S514C, N1098D
D81025R158I, S162T, Q173H, V213L, Q218H, L244V, G257C, A260T, R273T, D614G, I664K, T676N, A845S, Q1201RY144del, R237T, N282D, K300T, A668G, I670K, Y1215F, G1219R, I1221K, A1222T, K1266N
C22112D614G, Q1180H, E1182D, N1187YC662W, I670K, C671G, V976G, D979E, V987L, V991M, K1266N
C33020D614G, Q779H, V785F, T883A, F888S, G889A, G891V, A892V, Q895R, G908C, S974L, D1260NV781N, A893T, F898Y, M900K, Q901P, M902K, N907T, H1058Q
R2129L517P, A522S, P527L, T531N, D614GP521H, S530Y, V991M, H1058Q
R511D614GNone
B42718V3D, L18S, P272S, K378R, Y380F, V382L, S477I, T500I, L517I, D614G, K1073NL5T, P295T, G381A, G496W, V512G, L518V, Q920P
B954V11I, D614G, V620IK1266N
S3125K304R, T307S, E309Q, D614GH1058Q
S794R158K, E298K, D614G, V620INone
T1621D614GNone
T1766P561L, D614G, S758I, N764S, A771SS359I
M1172D614GK1266N
M77317L141I, M153I, A163V, N544Y, D574Y, R577S, P579L, D614G, Q675H, P809SY144del, K537T N556H, K557T, R567I, D568Y, G1035A
K622D614G, Q675HNone
Total1891338053

del, deletion.

Analysis includes the sequences available up to 15 January 2021.

Mutations identified in spike protein of 15 SARS-CoV-2 samples in Bangladesh compared with hCoV-19/Wuhan/WIV04/2019 del, deletion. Analysis includes the sequences available up to 15 January 2021. The S1 subunit contained more amino acid substitutions than S2, almost equally distributed in the N-terminal domain and RBD, and others in the subdomains and furin cleavage site (FCS). Among the 15 Bangladeshi SARS-CoV-2 samples, previously reported substitutions in the S1 subunit were located mostly in the N-terminal domain and in the vicinity of the FCS (48 among 58 substitutions). In contrast, novel mutations in the S1 subunit (total 26 substitutions) were identified mainly in the RBD (ten substitutions) and subdomain–FCS (nine substitutions). On the S protein RBD, 21 mutations were identified. Among the ten novel substitutions on RBD, three substitutions (P491H, G496W, P499A) were located on the receptor-binding motif along with two known substitutions. Substitution D614G was detected in all samples, among which two samples had mutation Q675H. Substitution P681R, located adjacent to the FCS motif, was detected in one sample. Two novel substitutions, N282D and N1098D, were presumed to remove a potential N-glycosylation site. As depicted in Fig. 1, the positions of representative novel substitutions in RBD are mapped near the outermost edge of S protein, whereas N1098D is located in the root of the S2 subunit close to the viral envelope. The missense mutations identified in S proteins were predicted to affect antigenicity, receptor binding ability, and viral oligomerization depending on the functional domains in S protein (see Supplementary material, Table S6).
Fig. 1

Structural representation of spike glycoprotein of SARS-CoV-2. Positions of some novel and established substitutions are indicated with red points. The crystal structure shown in chain view was created using the online based modelling tool Swiss-Model (https://swissmodel.expasy.org).

Structural representation of spike glycoprotein of SARS-CoV-2. Positions of some novel and established substitutions are indicated with red points. The crystal structure shown in chain view was created using the online based modelling tool Swiss-Model (https://swissmodel.expasy.org).

Discussion

Mutation rate of SARS-CoV-2 has been estimated to be 6 × 10−4 nucleotide/genome/year, which is lower than other RNA viruses [21,22]. However, minimal mutations in the genome, represented by a single D614G mutation in the S protein, have the capacity to alter the traits of a protein, affecting virus infectivity, clinical outcomes, and also epidemiology [23,24]. Accordingly, it is important to monitor genetic diversity and mutations of SARS-CoV-2 on a global scale during the ongoing pandemic. In Bangladesh, almost solely the D614G substitution in SARS-CoV-2 S protein was noted and its predominance was revealed by genomic analysis [11,[13], [14], [15]], as observed in the present study. However, other mutations in the S protein have not yet been analysed in detail, and only a few have been described [11,16,17]. In a recent study by Hasan et al. [17], a missense mutation in the S protein was reported to be the third most frequent among viral proteins, following ORF1ab and N protein, by analysis of 371 viral genomes. In contrast, lower mutation rates of the S protein have been described elsewhere [14]. Although mutations in other viral proteins were not analysed in the present study, we identified high genetic diversity in the S protein, with missense mutation ranging from only one to 25 per sample. Therefore, associated with a nationwide pandemic in Bangladesh since late 2020, SARS-CoV-2 is suggested to have undergone rapid genetic evolution, acquiring various mutations. The presence of functional domains in the S protein underscores the importance of analysing mutations in this protein. The D614G mutation in the S protein has been globally increasing since the middle of 2020 [4]. In Bangladesh, it was already dominant early in the pandemic (April–June 2020), accounting for almost 90% of the sequences analysed [14]. In late 2020, this mutation appeared to have overwhelmed the country, as observed in the present study. The D614G substitution changes the physicochemical characteristics of the S protein to cause more efficient binding to angiotensin-converting enzyme 2 receptors, leading to increased infectivity of this variant [24,25]. A number of characteristic variants combined with the D614G substitution in the S glycoprotein have been identified previously and described as modifying the pattern of infection [9]. Among them, a D614G+Q675H variant was found in two samples in the present study. In the present study, three novel substitutions were identified on the receptor-binding motif in RBD, suggesting their effect on alteration of viral infectivity. Although more evidence remains to be determined, the novel S protein mutations in SARS-CoV-2 in Bangladesh give rise to the possibility of a fitness advantage to target cells, which is likely to allow more successful person-to-person transmission. Another notable substitution was P681R, which is located adjacent to the FCS motif, which is highly conserved and important for pathogenesis of the virus [26,27]. On the other hand, loss of the FCS motif is related to attenuation of replication in cells and disease in vivo [26]. The P681R mutation was reported in only one study in Bangladesh [16], so appears to be still rare. Further surveillance may be necessary to monitor this mutation regarding its epidemiological and clinical impact. Mutations at N-glycosylation sites (amino acid 282 and 1098) were identified in the present study. Such glycosylation mutants had been reported previously and considered to be implicated in alteration of antigenic epitopes [9]. In conclusion, the present study revealed diverse nucleotide polymorphisms and amino acid substitutions in the S glycoprotein of SARS-CoV-2 in Bangladesh, suggesting rapid genetic evolution during the pandemic phase in late 2020. The epidemic of COVID-19 is still ongoing in Bangladesh, and the UK variant B.1.1.7 of SARS-CoV-2 was first identified in January 2021 [28], so continuous monitoring of genetic variation is necessary.

Funding

This research was supported by the project funded by The World Academy of Science (), grant number 20-284 RG/BIO/AS_G–FR3240314166 and NST (National Science and Technology) fellowship from the , Bangladesh (no. 39.00.0000.012.002.05.20-04).

Conflict of interest

The authors declare no conflicts of interest.
  28 in total

1.  The coronavirus is mutating - does it matter?

Authors:  Ewen Callaway
Journal:  Nature       Date:  2020-09       Impact factor: 49.962

2.  Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees.

Authors:  K Tamura; M Nei
Journal:  Mol Biol Evol       Date:  1993-05       Impact factor: 16.240

3.  Attenuated SARS-CoV-2 variants with deletions at the S1/S2 junction.

Authors:  Siu-Ying Lau; Pui Wang; Bobo Wing-Yee Mok; Anna Jinxia Zhang; Hin Chu; Andrew Chak-Yiu Lee; Shaofeng Deng; Pin Chen; Kwok-Hung Chan; Wenjun Song; Zhiwei Chen; Kelvin Kai-Wang To; Jasper Fuk-Woo Chan; Kwok-Yung Yuen; Honglin Chen
Journal:  Emerg Microbes Infect       Date:  2020-12       Impact factor: 7.163

4.  A novel coronavirus outbreak of global health concern.

Authors:  Chen Wang; Peter W Horby; Frederick G Hayden; George F Gao
Journal:  Lancet       Date:  2020-01-24       Impact factor: 79.321

5.  Furin cleavage of SARS-CoV-2 Spike promotes but is not essential for infection and cell-cell fusion.

Authors:  Guido Papa; Donna L Mallery; Anna Albecka; Lawrence G Welch; Jérôme Cattin-Ortolá; Jakub Luptak; David Paul; Harvey T McMahon; Ian G Goodfellow; Andrew Carter; Sean Munro; Leo C James
Journal:  PLoS Pathog       Date:  2021-01-25       Impact factor: 6.823

6.  Coding-Complete Genome Sequences and Mutation Profiles of Nine SARS-CoV-2 Strains Detected from COVID-19 Patients in Bangladesh.

Authors:  Nihad Adnan; Mohib Ullah Khondoker; M Shaminur Rahman; M Firoz Ahmed; Shahana Sharmin; Nadim Sharif; Nafisa Azmuda; Salma Akter; Shamsun Nahar; Taslin Jahan Mou; Mahfuza Marzan; Syeda Moriam Liza; Nowshin Jahan; Tamanna Ali; Shahad Saif Khandker; Maha Jamiruddin; Mousumi Akter Chaity; Laura Grace Caller; M Ahsanul Haq; Mohd Raeed Jamiruddin
Journal:  Microbiol Resour Announc       Date:  2021-03-11

7.  A rapid and cost-effective multiplex ARMS-PCR method for the simultaneous genotyping of the circulating SARS-CoV-2 phylogenetic clades.

Authors:  Mohammad Tanvir Islam; Asm Rubayet Ul Alam; Najmuj Sakib; Mohammad Shazid Hasan; Tanay Chakrovarty; Mohammad Tawyabur; Ovinu Kibria Islam; Hassan M Al-Emran; Mohammad Iqbal Kabir Jahid; Mohammad Anwar Hossain
Journal:  J Med Virol       Date:  2021-02-01       Impact factor: 20.693

8.  The Impact of Mutations in SARS-CoV-2 Spike on Viral Infectivity and Antigenicity.

Authors:  Qianqian Li; Jiajing Wu; Jianhui Nie; Li Zhang; Huan Hao; Shuo Liu; Chenyan Zhao; Qi Zhang; Huan Liu; Lingling Nie; Haiyang Qin; Meng Wang; Qiong Lu; Xiaoyu Li; Qiyu Sun; Junkai Liu; Linqi Zhang; Xuguang Li; Weijin Huang; Youchun Wang
Journal:  Cell       Date:  2020-07-17       Impact factor: 41.582

9.  The SARS, MERS and novel coronavirus (COVID-19) epidemics, the newest and biggest global health threats: what lessons have we learned?

Authors:  Noah C Peeri; Nistha Shrestha; Md Siddikur Rahman; Rafdzah Zaki; Zhengqi Tan; Saana Bibi; Mahdi Baghbanzadeh; Nasrin Aghamohammadi; Wenyi Zhang; Ubydul Haque
Journal:  Int J Epidemiol       Date:  2020-06-01       Impact factor: 7.196

10.  Structural and Functional Analysis of the D614G SARS-CoV-2 Spike Protein Variant.

Authors:  Leonid Yurkovetskiy; Xue Wang; Kristen E Pascal; Christopher Tomkins-Tinch; Thomas P Nyalile; Yetao Wang; Alina Baum; William E Diehl; Ann Dauphin; Claudia Carbone; Kristen Veinotte; Shawn B Egri; Stephen F Schaffner; Jacob E Lemieux; James B Munro; Ashique Rafique; Abhi Barve; Pardis C Sabeti; Christos A Kyratsous; Natalya V Dudkina; Kuang Shen; Jeremy Luban
Journal:  Cell       Date:  2020-09-15       Impact factor: 66.850

View more
  4 in total

1.  Relative Consolidation of the Kappa Variant Pre-Dates the Massive Second Wave of COVID-19 in India.

Authors:  Jitendra Singh; Anvita Gupta Malhotra; Debasis Biswas; Prem Shankar; Leena Lokhande; Ashvini Kumar Yadav; Arun Raghuvanshi; Dipesh Kale; Shashwati Nema; Saurabh Saigal; Sarman Singh
Journal:  Genes (Basel)       Date:  2021-11-16       Impact factor: 4.096

Review 2.  A Detailed Overview of Immune Escape, Antibody Escape, Partial Vaccine Escape of SARS-CoV-2 and Their Emerging Variants With Escape Mutations.

Authors:  Chiranjib Chakraborty; Ashish Ranjan Sharma; Manojit Bhattacharya; Sang-Soo Lee
Journal:  Front Immunol       Date:  2022-02-09       Impact factor: 7.561

3.  Comparative genomics, evolutionary epidemiology, and RBD-hACE2 receptor binding pattern in B.1.1.7 (Alpha) and B.1.617.2 (Delta) related to their pandemic response in UK and India.

Authors:  Chiranjib Chakraborty; Ashish Ranjan Sharma; Manojit Bhattacharya; Bidyut Mallik; Shyam Sundar Nandi; Sang-Soo Lee
Journal:  Infect Genet Evol       Date:  2022-04-13       Impact factor: 4.393

4.  Co-circulation of Multiple SARS-CoV-2 Variants of Concern in Dhaka, Bangladesh, during the Second Wave of the COVID-19 Pandemic.

Authors:  M Abdullah Omar Nasif; Nahida Sultana; Raad Rahmat; Tahmina Akther; Afzalun Nessa; Munira Jahan
Journal:  Microbiol Resour Announc       Date:  2022-07-11
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.