Literature DB >> 32701194

Emergence of European and North American mutant variants of SARS-CoV-2 in South-East Asia.

Ovinu Kibria Islam¹, Hassan M Al-Emran¹, Md Shazid Hasan¹, Azraf Anwar², Md Iqbal Kabir Jahid¹, Md Anwar Hossain^1,3.

Abstract

The SARS-CoV-2 coronavirus is responsible for the current COVID-19 pandemic, with an ongoing toll of over 5 million infections and 333 thousand deaths worldwide within the first 5 months. Insight into the phylodynamics and mutation variants of this virus is vital to understanding the nature of its spread in different climate conditions. The incidence rate of COVID-19 is increasing at an alarming pace within subtropical South-East Asian nations with high temperatures and humidity. To understand this spread, we analysed 444 genome sequences of SARS-CoV-2 available on the GISAID platform from six South-East Asian countries. Multiple sequence alignments and maximum-likelihood phylogenetic analyses were performed to analyse and characterize the non-synonymous (NS) mutant variants circulating in this region. Global mutation distribution analysis showed that the majority of the mutations found in this region are also prevalent in Europe and North America, and the concurrent presence of these mutations at a high frequency in other countries indicates possible transmission routes. Unique spike protein and non-structural protein mutations were observed circulating within confined area of a given country. We divided the circulating viral strains into four major groups and three subgroups on the basis of the most frequent NS mutations. Strains with a unique set of four co-evolving mutations were found to be circulating at a high frequency within India, specifically. Group 2 strains characterized by two co-evolving NS mutants which alter in RdRp (P323L) and spike (S) protein (D614G) were found to be common in Europe and North America. These European and North American variants have rapidly emerged as dominant strains within South-East Asia, increasing from a 0% prevalence in January to an 81% by May 2020. These variants may have an evolutionary advantage over their ancestral types and could present a large threat to South-East Asia for the coming winter.

Entities: Chemical

Keywords: COVID-19; SARS-CoV-2; genome sequence; non-synonymous mutation; phylogeny

Mesh：

Year: 2020 PMID： 32701194 PMCID： PMC7405211 DOI： 10.1111/tbed.13748

Source DB: PubMed Journal: Transbound Emerg Dis ISSN： 1865-1674 Impact factor: 4.521

INTRODUCTION

The SARS‐CoV‐2 ssRNA virus was initially detected in December 2019 within China's Hubei province and its associated COVID‐19 pandemic is currently ongoing, with a global toll of over 5 million infections and 333 thousand deaths so far (WHO coronavirus disease [COVID‐19] situation report‐124, published on 23 May 2020). While China encountered the pandemic first, North American and European nations have also suffered crippling blows from this pandemic. Moreover, the infection numbers within South‐East Asian, South American and African nations are growing. The number of infections in resource‐poor, low‐income countries may be underestimated due to their limited testing capacity. A number of earlier studies indicated that high temperatures and high humidity could decrease the spread of the virus (Demongeot, Flet‐Berliac, & Seligmann, 2020; Sajadi et al., 2020; Wang, Tang, Feng, & Lv, 2020). Despite having such climate conditions, COVID‐19 cases are increasing at an alarming rate in relatively hot and humid subtropical South‐East Asian countries. As of 22 May, a total of 182,278 cases of SARS‐CoV‐2‐infected cases were identified in South‐East Asia, with a death toll of over 5,500 (WHO coronavirus disease [COVID‐19] situation report‐124, published on 23 May 2020). Scientists from all over the world are making an unprecedented effort to expose the genomic profiles and characterize the mutation variants of the circulating virus to get insight into its evolutionary patterns and driving force. Tang et al. analysed 103 genomic sequences and indicated that the circulating SARS‐CoV‐2 strains have two major lineages, one with a synonymous mutation (NSP4_S75S) and the other with a non‐synonymous (NS) mutation (NS8_L84S; Tang et al., 2020). In another study, Forster, Peter et al. found 3 central variants by analysing 160 sequences and claimed that B‐type viruses (with amino acid [aa] substitution, NS8_L84S) were common in East Asia, whereas A type (ancestral lineage) and C type (NS3_G251V variant) were prevalent in Europe and North America (Forster, Forster, Renfrew, & Forster, 2020). Changchuan Yin analysed 558 genome sequences and found 15 high‐frequency single SNP genotypes (Yin, 2020). He suggested four major groups with either single or co‐evolving mutations: group 1 with NSP6_L37F, group 2 with NS3_G251V, group 3 with co‐evolving mutation at two sites (NSP4_S75S and NS8_L84S) and group 4 with co‐evolving mutations at four sites (241C>T‐leader sequence, NSP3_F105F, NSP12_P323L and S_D614G). In addition, GISAID differentiated COVID‐19 into three major clades: Clade S (prevalent in North America), Clade V (prevalent in Asia and Europe) and Clade G (prevalent in Europe), having NS mutations with aa substitutions at NS8_L84S, NS3_G251V and S_D614G, respectively (Fuertes et al., 2020). Later on, they divided the G clade into GR clade and GH clade that contain N_203‐204: RG>KR and NS3_Q57H aa substitutions, respectively. Finally, based on genomic data available from the GISAID, Nextstrain's SARS‐CoV‐2 global genomic epidemiology analysis shows 10 major clades for this virus. In this study, we investigated 444 sequenced genomes of SARS‐CoV‐2 available from South‐East Asia to provide an insight into the genomic divergence, phylogeny and recurrent mutations in this region.

METHODS AND MATERIALS

Data collection

In South‐East Asia, a total of 444 high‐coverage SARS‐CoV‐2 genomes from India (n = 327), Bangladesh (n = 20), Indonesia (n = 8), Thailand (n = 83), Sri Lanka (n = 4), Nepal (n = 1) and Myanmar (n = 1) have been submitted to GISAID (www.gisaid.org) as of 23 May 2020. This study analysed all 444 genome sequences using hCoV19/Wuhan/WIV04/2019 (Accession: EPI_ISL_402124) as a reference genome for phylogenetic and mutation analysis.

Mutation analyses

Genome sequences of all 444 SARS‐CoV‐2 were aligned using the MAFFT (multiple alignment using fast Fourier transform) algorithm (Katoh, Misawa, Kuma, & Miyata, 2002), and the alignment was visualized in Jalview 2.11.0. A subset of 60 genome sequences were selected from India (n = 30), Bangladesh (n = 10), Indonesia (n = 8), Thailand (n = 7), Sri Lanka (n = 4) and Nepal (n = 1) for intensive analysis of unique and co‐evolving mutations and their phylogeny. The number and ratio of sequence by country were determined based on the available number of sequences and the total number of COVID‐19 cases up to 23 May 2020. The selection of sequences analysed was based on genome quality, discreteness of their location and random sample collection dates. Accession ID, collection dates and locations of this subset are provided in Supporting Information S1. Non‐synonymous mutations with their specific mutation sites in the 60 genomes were determined using ‘CoVsurver enabled by GISAID’ in GISAID’s EpiCoV database. Additionally, the frequencies of those recurrent NS mutations in all 444 genomes of South‐East Asia were determined using Spyder 4.0.1 application under Anaconda Navigator, a free and open‐source distribution of the python programming language (Van Rossum & Drake, 2009).

Spike protein analysis

The S protein sequences were compared with the reference S protein to identify NS mutation sites. The 3D structure of the S protein was constructed using Phyre2 (Kelley, Mezulis, Yates, Wass, & Sternberg, 2015), and the mutation sites were visualized with PyMOL(TM) 2.4.0 software (Schrodinger, 2017).

Phylogenetic and evolutionary analysis

The aligned sequences were used for the construction of a phylogenetic tree using the maximum‐likelihood method in Molecular Evolutionary Genetics Analysis (MEGA X) software (Kumar, Stecher, Li, Knyaz, & Tamura, 2018). Interactive Tree Of Life v5 (Letunic & Bork, 2019) was used to adjust the branch and label colour of the phylogenetic tree.

Mutation cluster‐time plot

A mutation‐time plot for South‐East Asia was prepared by analysing all 444 sequences available in GISAID. A proportion of mutation‐based clusters were plotted monthly from 1 January until 23 May 2020. Additionally, the numbers of COVID‐19 infections and deaths were added in the same time plot collected from WHO situation reports.

Transmission route analysis

Transmission patterns (performed on 23 May 2020) were observed in South‐East Asia using country filters on Nextstrain, an online‐based real‐time pathogen evolution tracking tool (Hadfield et al., 2018). Nextstrain server filtered 329 sequences from 444 available sequences to create the transmission map. The frequencies of NS mutations found after in depth analysis of 60 genomes were analysed globally using GISAID‐based SARS‐CoV‐2 mutation statistics server. Countries with a high frequency of those specific mutations were marked on a geographic heat map using the online tool Maptive.

RESULTS

Preliminary mutation analysis with 60 of 444 SARS‐CoV‐2 sequences from South‐East Asia revealed 78 NS mutations. Most of them (n = 52) were found in non‐structural proteins. Other mutations with aa substitutions were present in S protein (n = 13), N protein (n = 9), M protein (n = 3) and E protein (n = 1). The Nepal SARS‐CoV‐2 genome, which was sequenced early, had no NS mutations compared with our reference sequence (Table 1).

TABLE 1

The frequency of SARS‐CoV‐2 cases identified in South‐East Asia region by country and their possessed mutations among selected 60 strains. Case reports were until 23 May 2020

Country	India	Bangladesh	Indonesia	Thailand	Sri Lanka	Nepal	Overall
Selected sequences for detailed study	30	10	8	7	4	1	60
Collection dates	31 Jan−6 May	18 Apr−13 May	17 Mar−14 Apr	22 Jan−3 Apr	4–31 Mar	13 Jan	13 January–13 May
Non‐synonymous (N‐S) mutations	39	22	12	8	13	0	78
Unique N‐S mutations	11	4	2	3	1	0	21
Spike protein N‐S mutations	8	2	4	2	1	0	13
E protein N‐S mutation	1	0	0	0	0	0	1
M protein N‐S mutations	1	0	0	1	1	0	3
N protein N‐S mutations	4	6	1	2	3	0	9
NS3 N‐S mutations	1	3	1	1	2	0	6
NS7b N‐S mutations	0	0	0	1	0	0	1
NS8 N‐S mutations	1	2	0	1	0	0	2
NSP2 N‐S mutations	8	1	0	1	1	0	10
NSP3 N‐S mutations	5	2	1	2	1	0	11
NSP4 N‐S mutations	2	0	0	0	0	0	2
NSP5 N‐S mutations	0	2	0	0	0	0	2
NSP6 N‐S mutations	1	0	1	0	1	0	1
NSP8 N‐S mutations	0	1	0	0	0	0	1
NSP12 N‐S mutations	3	1	4	1	1	0	6
NSP13 N‐S mutations	2	1	0	0	0	0	3
NSP14 N‐S mutations	2	1	0	1	1	0	5
NSP15 N‐S mutations	0	0	0	1	1	0	2

The frequency of SARS‐CoV‐2 cases identified in South‐East Asia region by country and their possessed mutations among selected 60 strains. Case reports were until 23 May 2020 This subset of data identified 21 NS mutations including four aa alterations in S proteins that solely observed in South‐East Asia (Table 2). The majority of these unique mutations (n = 15) have arisen only once to date. The remaining six were present more than once, but each of these variants circulated in a specific country or region. Moreover, we found 13 mutations with aa replacement in S protein. We also identified the 10 most frequent mutations with aa substitution (N_R203K, N_G204R, N_P13L, NS3_Q57H, NS3_Q57H, NS8_L84S, NSP12_A97V, NSP12_P323L, NSP3_T1198K, NSP6_L37F and S_D614G) and separated the variants into four major groups and three subgroups accordingly (Figure 1). Group 1 consists of 11 sequences, including the reference sequence hCoV19/Wuhan/WIV04/2019 (Accession: EPI_ISL_402124). All sequences within this group were observed earlier in this year (13 January–1 April) and that do not possess any of those predominant mutations. Group 2 consists of the co‐evolving mutations NSP12_P323L and S_D614G. Most of the sequences (n = 24) belong to this group and their isolation dates range from 10 March to 4 May 2020. We further separated this group into two subgroups: those in 2a have additional N_R203K and N_G204R (28881–28883: GGG>AAC) trinucleotide mutations and those in 2b have mutations with aa substitution at NS3 (Q57H). Group 3 is characterized by NSP6_L37F and three other mutations (NSP12_A97V, N_P13L, NSP3_T1198K) co‐evolved with this mutation in 12 Indian sequences and form a unique subgroup 3a. Group 4 consists of eight variants from Bangladesh, India and Thailand with a common aa substitution at NS8 (L84S).

TABLE 2

Unique mutations with amino acid substitutions observed solely in South‐East Asia until 23 May 2020

Mutation sites	Country	No. of virus (study)	No. of virus (total)
E_L65M	India	1	1
M_L29J	India	1	1
N_K347N	Indonesia	2	2
NS3_I263M	Sri Lanka	1	1
NSP2_G212D	Thailand	1	1
NSP2_I120F	Bangladesh	4	9
NSP2_L266I	India	1	1
NSP3_L1553J	India	1	1
NSP3_N1337S	Bangladesh	1	1
NSP3_S1485Y	India	1	1
NSP5_D92G	Bangladesh	3	4
NSP12_V880I	India	1	4
NSP13_K40R	India	1	2
NSP13_T214I	India	1	1
NSP14_D415G	Thailand	1	1
NSP14_S434N	India	1	1
NSP14_V459I	Bangladesh	1	1
Spike_A829T	Thailand	3	31
Spike_A930V	India	1	1
Spike_E471Q	India	1	1
Spike_S116C	Indonesia	1	1

FIGURE 1

Phylogenetic relations of selected 60 SARS‐CoV‐2 strain sequences separated into four clusters

Unique mutations with amino acid substitutions observed solely in South‐East Asia until 23 May 2020 Phylogenetic relations of selected 60 SARS‐CoV‐2 strain sequences separated into four clusters The predominant mutations found from the primary analysis were also observed in 444 genomes and the frequency percentage of those mutations are shown country wise in Figure 2a. Analysis with 444 sequences also shows that 40 sequences belong to group 1, 203 to group 2, 145 to group 3 and 45 to group 4 (Figure 2b). Moreover, subgroups 2a, 2b and 3b contain 35, 93 and 109 genomes, respectively.

FIGURE 2

(a) The frequencies of 10 most recurrent mutations in countries of South‐East Asia compared with worldwide frequencies in percentage. (b) Number of sequences separated in different group and subgroup S protein analysis of 444 sequences revealed 49 sites for aa substitution (Supporting Information S2). Only four of the aa substitution sites were found in receptor‐binding domain (R408I, I434K, E471Q and S494P), two of them resided in critical receptor‐binding motif. All the aa substitution sites were marked in 3D structure of S protein (Figure 3).In the cluster‐based time plot, 444 available sequences were studied from January until May 2020. The group 2 cluster was not observed until February (0%, n = 0), but was 36% (n = 58) in March, 48% (n = 84) in April and 81% (n = 61) in May. The group 3 cluster was 0% to 7% in those months, with a sudden increase of more than 40% in March and April. The infections increased from 17 cases in January up to 128,257 in May 2020 (Figure 4). During those months, the number of deaths increased up to 3,468 cases by May.

FIGURE 3

FIGURE 4

The cluster‐based time plot of 444 complete genome sequences available in GISAID from January to May 2020. Bar charts indicate the frequency of mutants identified in the respective months. Line graph indicates the number of COVID‐19 infection and death cases

Mutation sites with amino acid substitution in spike proteins found in South‐East Asia. Receptor‐binding domain (RBD) chain is marked with green. Most abundant mutation D614G is marked with yellow and mutations reside in RBD are marked with green balls The cluster‐based time plot of 444 complete genome sequences available in GISAID from January to May 2020. Bar charts indicate the frequency of mutants identified in the respective months. Line graph indicates the number of COVID‐19 infection and death cases A geographic heat map (Figure 5) revealed that most of the 78 NS mutations found from 60 sequences were also common in Europe and North America (according to global mutation statistics database; Supporting Information S3). The transmission map also revealed that group 2 sequences (A2 clade of Nextstrain) from this study were found to be dominant among viruses circulating in South‐East Asia (Figure 6). This clade contains characteristic S D614G mutation also found to be prevalent in Europe and North America (Cao et al., 2020; Gonzalez‐Reiche et al., 2020).

FIGURE 5

FIGURE 6

Transmission map constructed by Nextstrain with 329 sequences showing possible transmission routes into six South‐East Asian countries. This map is also separated the sequences of different clades

A total of 78 non‐synonymous mutations found in 60 sequences of South‐East Asia. Markers showing the number of these mutations present in relatively high frequency in respective countries (Supporting Information S3) Transmission map constructed by Nextstrain with 329 sequences showing possible transmission routes into six South‐East Asian countries. This map is also separated the sequences of different clades

DISCUSSION

The SARS‐CoV‐2 pandemic has cost more than 333 thousand lives already, with many more deaths being reported each day. During this period, the global community has yet to predict its virulence, seasonal variation, transmission properties and immunity. However, it is apparent that the fatality rate varies by region and that the degree of virulence varies from person to person (Khafaie & Rahim, 2020; Yao et al., 2020). Some regions in Europe and North America were affected the more than others, while most of Asia, Africa and Australia remain less affected. Analysis of the ssRNA genome is now crucial to understand the pathogenesis and transmission of the virus. This study has characterized the SARS‐CoV‐2 virus circulating in South‐East Asia into four major groups and three subgroups by studying common NS mutations. Group 1 consists of 40 strains that do not have the recurrent mutations. Those strains are mostly found in January and February in this subcontinent (Figure 2a,b). Group 2 involves 46% (n = 203) of the variants in this study. Strains belonging to this group co‐evolved with characteristic NS mutation, NSP12_P323L and S_D614G. These variants were initially prevalent in Europe and North America, and now constitute 68% of the virus all over the world. A recent study analysed 95 sequences and also found NSP12_P323L variants to be at a higher frequency, and reported that this variant was mostly found outside of Wuhan, China (Khailany, Safdar, & Ozaslan, 2020). Another study suggests that RNA‐dependent RNA‐polymerase (RdRp) aa substitution at the 323rd position (NSP12_P323L) causes RdRp fidelity, which, in turn, increases the number of mutations within the virus and causes co‐evolved mutations (Pachetti et al., 2020). NSP12_P323L was co‐evolved with S_D614G; this particular non‐silent S protein mutation generates an additional elastase cleavage site near the S1–S2 junction and thus facilitates fusion and cell entry (Koyama, Weeraratne, Snowdon, & Parida, 2020). This variant (S_D614G) was first observed in 28 January 2020 and was initially prevalent in Europe. Within 4 months, this variant has now rapidly outcompeted its ancestral subtype all over the world (Bhattacharyya et al., 2020). These studies explain the frequency of group 2 variants in South‐East Asia and why these variants have subdivided into additional subgroups involving co‐evolving mutations. We differentiated group 2 into two subgroups, 2a and 2b, which involve N_203‐204: RG>R and NS3_Q57H aa substitutions, respectively, along with NSP12_P323L and S_D614G. Several studies (Ayub, 2020; Lorusso et al., 2020; Yin, 2020) mention trinucleotide block mutations in nucleotides (28881–28883: GGG>AAC) which resulted in 2 aa changes (N_203‐204: RG>KR) that affect the serine–arginine‐rich motif of N protein. This trinucleotide block mutations were found in 35 sequences, 10 of them were from Dhaka, Bangladesh. NS3_Q57H mutation variants have been commonly found in the USA (Mercatelli & Giorgi, 2020) and Europe and are predicted to be deleterious (Issa, Merhi, Panossian, Salloum, & Tokajian, 2020). Subgroup 2a strains are predominant in Bangladesh (50%) while 2b strains are common in India (25%) (Figure 2a). Unlike the others, group 3a was unique, with four co‐evolving mutations. Of these, the NSP6_L37F mutation variant was common (Mercatelli & Giorgi, 2020); this mutation variant has also been frequently found in the UK, USA, Australia and India. The other three mutations are relatively less common and found mostly in India and Australia. Group 4, on the other hand, consists of a characteristic NS8_L84S mutation variant, which was declared as S type by Tang et al. (Tang et al., 2020). This mutation was later reported as C type by another group (Forster, Forster, Renfrew, & Forster, 2020) and was clustered as S clade by GISAID (Fuertes et al., 2020). Group 4 included 45 variants prevalent mostly in Thailand (43%). The analysis of S proteins revealed 49 aa substitution sites. Among them, four sites reside in receptor‐binding domain. The most abundant aa substitution D614G was found 203 times in all countries except Nepal. The effect of D614G mutation is described earlier, and the effect of other S protein mutations is yet to be evaluated. A recent study conducted with 10,014 sequences identified 13 frequent NS mutations (Mercatelli & Giorgi, 2020), while we found only seven of them, along with three less common mutation, at high frequency in this region (Figure 2a). Most of the S protein mutations identified in this region (Supporting Information S2) were also observed in Europe and North America according to the GISAID global mutation database. S protein mutations with aa substitution at 614 position, found in 46% of the studied strains in this region, were also prevalent in Europe and North America. On the contrary, the aa substitutions found at the 1109th position of the S protein found in one Bangladeshi strain was found in another strain from Switzerland. We observed another aa substitution in the S protein at the 76th (S_T76I) position in an Indonesian strain, which was also found in two strains from West Bengal, India (data not shown). This specific aa substitution was identified on 55 occasions according to the global database. Among them, 49 were from Australia, suggesting that this variant might have been transmitted from Australia. Additionally, global mutation distribution statistics showed that S_A829T mutation was observed in 31 sequences, all of them from Thailand (Table 2; Supporting Information S3). NSP2_I120F mutation was found in nine of the 12 cases from Dhaka, Bangladesh, and NSP2_D92G mutation was present in four out of the five sequences from Chittagong, Bangladesh (data not shown). These cities are separated by a distance of 250 km, suggesting that those viruses carrying novel mutations were circulating in an area‐specific manner. Non‐synonymous mutation and phylogenetic analysis conducted through the Nextstrain database was particularly useful in getting a closer look at mutation variants and their possible routes of transmission. We found a common N_203‐204: RG>KR aa substitution (nine out of 12 strains) in Dhaka, Bangladesh. However, instead of the common N_203‐204: RG>KR aa substitution, a less common aa substitution was observed at the 202nd position of N protein (N_S202N) among the most (five out of seven) strains of Chittagong. The mutation distribution database showed that strains having trinucleotide block mutation in N protein were prevalent in Europe and that the N_S202N mutant was found more commonly in recent strains of Saudi Arabia. Phylogenetic analysis by Nextstrain also revealed that the Chittagong strains (belongs to Nextstrain B4 clade and group 4 of our study) have close relationship with the Saudi Arabian strains, while Dhaka strains (A2 clade in Nextstrain, group 2 in this study) are similar to the European ones. The geographical heat map (Figure 6) of these NS mutations indicate that most of these mutations were also frequently found in the UK, USA, Australia, Saudi Arabia and other European countries, revealing possible transmission routes to South‐East Asia. Phylogenetic analysis by Nextstrain produced a similar transmission route map (Figure 5). This study also confirmed, through phylogenetic and mutation analysis, that a high percentage of group 2 strains are linked to European and North American strains (A2 clade in Nextstrain analysis) in India and Bangladesh. We could not analyse the strains from Maldives, Bhutan and Timor‐Leste because they do not have whole genome sequence data of the virus at the time of our analysis. Among the seven countries with available genome sequences, only India, Bangladesh and Indonesia have reported a higher number of SARS‐CoV‐2 infections. The frequencies of infection have increased exponentially from mid‐April, 2020. In our study, it was also shown that group 2 variants containing NSP12_P323L and S_D614G mutations were not found earlier than March. The time plot data (Figure 4) delineate that this group 2 cluster is emerging rapidly from 0% in January and February, to 81% in May 2020, suggesting that the European and North American strains are the most recent predominant strains in this subcontinent. A study conducted in early March reported that NSP12_P323L (14408C>T) and S_D614G (23403A>G) mutations were recurrent in Europe and had not been detected in Asia until then, supporting our statement (Pachetti et al., 2020). Along with other co‐evolving mutations, NSP12_P323L and S_D614G probably provide variants with an evolutionary advantage over their ancestral types, allowing them to survive and circulate in this densely populated region. Time plot is also indicating that the infection and death cases increased along with the increased presence of group 2 strains in this region. Manuel Becerra‐Flores and Timothy Cardozo recently reported that prevalence of D614G mutant correlates with higher case fatality rate in different states of the USA (Becerra‐Flores & Cardozo, 2020). In our study, we focused on the NS mutations found in SARS‐CoV‐2 variants in South‐East Asia. More experimental work is required to determine the biological effect of these variants on transmission and pathogenesis of this virus. A number of earlier studies hypothesized that high temperatures and high humidity could result in reduced SARS‐CoV‐2 transmission. A recent study dealt with combined genomic and climatic approach to provide crucial information about the pathogenesis and spread of the virus (Bajaj & Arya, 2020). However, the seasonality of SARS‐CoV‐2 has not been established yet and despite of having comparatively hot and humid climate, the infection rate of SARS‐CoV‐2 is increasing in South‐East Asia. Our study shows that the European and North American mutant variant containing aa substitutions in S protein (D614G) and Rdrp (P323L) have recently become the dominant variant in this region. With the arrival of this variant in this region, the infection and death cases are also increasing. Given that this variant is emerging rapidly and that winter is approaching, the next wave of SARS‐CoV‐2 may take place in South‐East Asia.

CONFLICT OF INTEREST

All authors declared that they do not have any conflicts of interests or relationship with this study financially or otherwise.

ETHICS STATEMENT

The authors confirm that the ethical policies of the journal, as noted on the journal's author guidelines page, have been adhered to. No ethical approval was required as this study did not collect any samples or questionnaires from animals or humans. Supplementary Material Click here for additional data file. Supplementary Material Click here for additional data file. Supplementary Material Click here for additional data file. Supplementary Material Click here for additional data file.

21 in total

1. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms.

Authors: Sudhir Kumar; Glen Stecher; Michael Li; Christina Knyaz; Koichiro Tamura
Journal: Mol Biol Evol Date: 2018-06-01 Impact factor: 16.240

2. Temperature, Humidity, and Latitude Analysis to Estimate Potential Spread and Seasonality of Coronavirus Disease 2019 (COVID-19).

Authors: Mohammad M Sajadi; Parham Habibzadeh; Augustin Vintzileos; Shervin Shokouhi; Fernando Miralles-Wilhelm; Anthony Amoroso
Journal: JAMA Netw Open Date: 2020-06-01

3. Cross-Country Comparison of Case Fatality Rates of COVID-19/SARS-COV-2.

Authors: Morteza Abdullatif Khafaie; Fakher Rahim
Journal: Osong Public Health Res Perspect Date: 2020-04

4. Introductions and early spread of SARS-CoV-2 in the New York City area.

Authors: Ana S Gonzalez-Reiche; Matthew M Hernandez; Emilia Mia Sordillo; Viviana Simon; Harm van Bakel; Mitchell J Sullivan; Brianne Ciferri; Hala Alshammary; Ajay Obla; Shelcie Fabre; Giulio Kleiner; Jose Polanco; Zenab Khan; Bremy Alburquerque; Adriana van de Guchte; Jayeeta Dutta; Nancy Francoeur; Betsaida Salom Melo; Irina Oussenko; Gintaras Deikus; Juan Soto; Shwetha Hara Sridhar; Ying-Chih Wang; Kathryn Twyman; Andrew Kasarskis; Deena R Altman; Melissa Smith; Robert Sebra; Judith Aberg; Florian Krammer; Adolfo García-Sastre; Marta Luksza; Gopi Patel; Alberto Paniz-Mondolfi; Melissa Gitman
Journal: Science Date: 2020-05-29 Impact factor: 47.728

5. A "One-Health" approach for diagnosis and molecular characterization of SARS-CoV-2 in Italy.

Authors: Alessio Lorusso; Paolo Calistri; Maria Teresa Mercante; Federica Monaco; Ottavio Portanti; Maurilia Marcacci; Cesare Cammà; Antonio Rinaldi; Iolanda Mangone; Adriano Di Pasquale; Marino Iommarini; Maria Mattucci; Paolo Fazii; Pierluigi Tarquini; Rinalda Mariani; Alessandro Grimaldi; Daniela Morelli; Giacomo Migliorati; Giovanni Savini; Silvio Borrello; Nicola D'Alterio
Journal: One Health Date: 2020-04-19

6. The Phyre2 web portal for protein modeling, prediction and analysis.

Authors: Lawrence A Kelley; Stefans Mezulis; Christopher M Yates; Mark N Wass; Michael J E Sternberg
Journal: Nat Protoc Date: 2015-05-07 Impact factor: 13.491

7. Temperature Decreases Spread Parameters of the New Covid-19 Case Dynamics.

Authors: Jacques Demongeot; Yannis Flet-Berliac; Hervé Seligmann
Journal: Biology (Basel) Date: 2020-05-03

8. Phylogenetic network analysis of SARS-CoV-2 genomes.

Authors: Peter Forster; Lucy Forster; Colin Renfrew; Michael Forster
Journal: Proc Natl Acad Sci U S A Date: 2020-04-08 Impact factor: 11.205

9. Genomic characterization of a novel SARS-CoV-2.

Authors: Rozhgar A Khailany; Muhamad Safdar; Mehmet Ozaslan
Journal: Gene Rep Date: 2020-04-16

21 in total

Review 1. Role of the Microbiome in the Pathogenesis of COVID-19.

Authors: Rituparna De; Shanta Dutta
Journal: Front Cell Infect Microbiol Date: 2022-03-31 Impact factor: 5.293

Review 2. Characterization of SARS-CoV-2 different variants and related morbidity and mortality: a systematic review.

Authors: SeyedAhmad SeyedAlinaghi; Pegah Mirzapour; Omid Dadras; Zahra Pashaei; Amirali Karimi; Mehrzad MohsseniPour; Mahdi Soleymanzadeh; Alireza Barzegary; Amir Masoud Afsahi; Farzin Vahedi; Ahmadreza Shamsabadi; Farzane Behnezhad; Solmaz Saeidi; Esmaeil Mehraeen
Journal: Eur J Med Res Date: 2021-06-08 Impact factor: 2.175

3. Effect of mutations in the SARS-CoV-2 spike protein on protein stability, cleavage, and cell-cell fusion function.

Authors: Chelsea T Barrett; Hadley E Neal; Kearstin Edmonds; Carole L Moncman; Rachel Thompson; Jean M Branttie; Kerri Beth Boggs; Cheng-Yu Wu; Daisy W Leung; Rebecca E Dutch
Journal: bioRxiv Date: 2021-01-25

4. SARS-CoV-2 Early Screening at the Point of Entry: Travelers From Bangladesh to Italy-July 2020.

Authors: Martina Rueca; Antonino Di Caro; Cesare Ernesto Maria Gruber; Francesco Messina; Emanuela Giombini; Maria Beatrice Valli; Eleonora Lalle; Simone Lanini; Francesco Vairo; Maria Rosaria Capobianchi; Barbara Bartolini
Journal: Front Genet Date: 2021-02-09 Impact factor: 4.599

5. Gastroenterology manifestations and COVID-19 outcomes: A meta-analysis of 25,252 cohorts among the first and second waves.

Authors: Rami M Elshazli; Adam Kline; Abdelaziz Elgaml; Mohamed H Aboutaleb; Mohamed M Salim; Mahmoud Omar; Ruhul Munshi; Nicholas Mankowski; Mohammad H Hussein; Abdallah S Attia; Eman A Toraih; Ahmad Settin; Mary Killackey; Manal S Fawzy; Emad Kandil
Journal: J Med Virol Date: 2021-02-23 Impact factor: 20.693

6. A rapid and cost-effective multiplex ARMS-PCR method for the simultaneous genotyping of the circulating SARS-CoV-2 phylogenetic clades.

Authors: Mohammad Tanvir Islam; Asm Rubayet Ul Alam; Najmuj Sakib; Mohammad Shazid Hasan; Tanay Chakrovarty; Mohammad Tawyabur; Ovinu Kibria Islam; Hassan M Al-Emran; Mohammad Iqbal Kabir Jahid; Mohammad Anwar Hossain
Journal: J Med Virol Date: 2021-02-01 Impact factor: 20.693

Review 7. Molecular Evolution of Severe Acute Respiratory Syndrome Coronavirus 2: Hazardous and More Hazardous Strains Behind the Coronavirus Disease 2019 Pandemic and Their Targeting by Drugs and Vaccines.

Authors: Hardeep Singh Tuli; Katrin Sak; Poonam Aggarwal; Ashif Iqubal; Sushil K Upadhaya; Jagjit Kaur; Ginpreet Kaur; Diwakar Aggarwal
Journal: Front Cell Infect Microbiol Date: 2021-12-14 Impact factor: 5.293

8. Rapid, inexpensive methods for exploring SARS CoV-2 D614G mutation.

Authors: Sirwan M A Al-Jaf; Sherko S Niranji; Zana H Mahmood
Journal: Meta Gene Date: 2021-07-18

9. Emergence of European and North American mutant variants of SARS-CoV-2 in South-East Asia.

Authors: Ovinu Kibria Islam; Hassan M Al-Emran; Md Shazid Hasan; Azraf Anwar; Md Iqbal Kabir Jahid; Md Anwar Hossain
Journal: Transbound Emerg Dis Date: 2020-08-09 Impact factor: 4.521

10. D614G Spike Mutation Increases SARS CoV-2 Susceptibility to Neutralization.

Authors: Drew Weissman; Mohamad-Gabriel Alameh; Thushan de Silva; Paul Collini; Hailey Hornsby; Rebecca Brown; Celia C LaBranche; Robert J Edwards; Laura Sutherland; Sampa Santra; Katayoun Mansouri; Sophie Gobeil; Charlene McDanal; Norbert Pardi; Nick Hengartner; Paulo J C Lin; Ying Tam; Pamela A Shaw; Mark G Lewis; Carsten Boesler; Uğur Şahin; Priyamvada Acharya; Barton F Haynes; Bette Korber; David C Montefiori
Journal: Cell Host Microbe Date: 2020-12-01 Impact factor: 31.316