Literature DB >> 33558859

Genetic diversity and genomic epidemiology of SARS-CoV-2 in Morocco.

Bouabid Badaoui1, Khalid Sadki1, Chouhra Talbi1, Driss Salah1, Lina Tazi1.   

Abstract

Coronavirus disease 2019 (COVID-19) is an infectious disease caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), declared as a pandemic due to its rapid spread worldwide. In this study, we investigate the genetic diversity and genomic epidemiology of SARS-CoV-2, using 22 virus genome sequences reported by three different laboratories in Morocco till June 7,2020, as well as 40,366 virus genomes from all around the world. The SARS-CoV-2 genomes from Moroccan patients revealed 62 mutations, of which 30 were mis-sense mutations. The mutations Spike_D614G and NSP12_P323L were present in all the 22 analyzed sequences, followed by N_G204R and N_R203K, which occurred in 9 among the 22 sequences. The mutations NSP10_R134S, NSP15_D335N, NSP16_I169L, NSP3_L431H, NSP3_P1292L and Spike_V6F occurred once in Moroccan sequences, with no record in other sequences worldwide. Phylogenetic analyses revealed that Moroccan SARS-CoV-2 genomes included 9 viruses belonging to Clade 20A, 9 to Clade 20B and 2 to Clade 20C, suggesting that the epidemic spread in Morocco did not display a predominant SARS-CoV-2 route. Therefore, multiple and unrelated introductions of SARS-CoV-2 into Morocco through different routes have occurred, giving rise to the diversity of virus genomes in the country. Further, in all probability, the SARS-CoV-2 circulated in a cryptic way in Morocco, starting from January 15, 2020 before the first case was officially discovered on March 2, 2020.
© 2021 Chinese Medical Association Publishing House. Published by Elsevier B.V.

Entities:  

Keywords:  Genetic diversity; Genomic epidemiology; Morocco; SARS-CoV-2

Year:  2021        PMID: 33558859      PMCID: PMC7857134          DOI: 10.1016/j.bsheal.2021.01.003

Source DB:  PubMed          Journal:  Biosaf Health        ISSN: 2590-0536


Introduction

An outbreak of respiratory illness, named coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), was reported, for the first time, in Wuhan (Hubei Province, China) in early December 2019 and subsequently many cases were reported in other countries worldwide. On 30 January, 2020, WHO declared that the corona-virus disease constituted a Public Health Emergency of International Concern. This pandemic has infected more than 7,725,457 people globally and caused more than 427,683 deaths as of June 7, 2020 (https://www.worldometers.info/coronavirus/). This disease was confirmed to have spread to Morocco on 2 March, 2020, when the first COVID-19 case was confirmed. Until June 7, 2020, Morocco reported 8,692 confirmed cases and 212 deaths. The most important feature of this disease is expressed by a high-level of inflammatory response, including pro-inflammatory cytokines in an especially severe form which cause pneumonia and severe acute respiratory syndrome [1]. SARS-CoV-2 genome has a size of 29.8–29.9 kb, harboring a long 5′ end that contains orf1ab. The 3′ end codes for the structural proteins: small envelope (E) protein, matrix (M) protein, spike (S) protein and nucleocapsid (N) protein. Further, the SARS-CoV-2 genome contains six accessory proteins, encoded by ORF3a, ORF6, ORF7a, ORF7b, and ORF8 genes [2]. Compared to SARS-CoV, SARS-CoV-2 has high transmission and less pathogenicity [3] due to hitherto unknown reasons. Virus evolution in nature arises through manifold mechanisms, of which nucleotide substitution is of utmost importance [4]. Genomic epidemiology of emerging viruses is of major relevance for capturing virus evolution and spread [5,6]. This approach has been proved to be very efficient during the Ebola virus epidemic in West Africa [7] and the Zika virus spread in Brazil [8]. To investigate the mutations underlying the evolution of SARS-CoV-2 in Morocco, 40,390 genome sequences of SARS-CoV-2, of which 22 were from Moroccan patients, were collected from GISAID (https://www.gisaid.org/) to study the diversity and genomic epidemiology of SARS-CoV-2 in Morocco.

Materials and methods

To assess the genetic variation of SARS-CoV-2 in Morocco, a total of 40,390 complete genomes of SARS-CoV-2 and their corresponding metadata were retrieved from GISAID database [9] and analyzed using the Nextstrain bioinformatics pipeline [10]. The platform Augur was used to perform a multi-alignment using all the genomes through MAFFT [11] and ‘Wuhan-Hu-1/2019’, and ‘Wuhan/WH01/2019’ as reference genomes. The phylogeny was built by maximum likelihood using IQTREE [12]. For comparative analysis of SARS-CoV-2 genome sequences, we used the following protocol: i) Collection of SARS-CoV-2 sequences and the corresponding metadata from the GISAID database. ii) Filtering the SARS-CoV-2 sequences to exclude inaccurate sequences based on missing bases and sequence length and to set a fixed number of samples per group according to their similarities. iii) Performing a multi-sequence alignment via MAFFT. vi) Inferring a phylogenetic tree from the multi-sequence alignment, getting a time-resolved tree through TreeTime and inferring ancestral traits sequences, and v) Identifying the mutations.

Results

Genetic diversity and evolution of SARS-CoV-2 in Morocco

In this study, the analysis of 22 genomes from Moroccan patients revealed 62 mutations (Supplementary Table 1) of which 30 were mis-sense mutations (Supplementary Table 2). Among these, Spike_D614G and NSP12_P323L were present in all the 22 analyzed sequences, followed by N_G204R and N_R203K occurring in 9 of the 22 sequences (Table 1 ).
Table 1

Most frequent non-synonymous mutations in SARS-CoV-2 isolates from Moroccan patients.

SNP mutationProtein mutationFrequency
14408C > TRNA pol (nsp12, P323L)22
23403A > Gspike glycoprotein (S protein), D614G22
28881G > A; 28882G > Anucleocapsid phosphoprotein (R203K)9
28883G > Cnucleocapsid phosphoprotein (G204R)9
Most frequent non-synonymous mutations in SARS-CoV-2 isolates from Moroccan patients. Until June 7, 2020, the mutations Spike_D614G and NSP12_P323L had occurred 23,612 and 23,543 times respectively throughout 75 countries (https://www.gisaid.org/). Further, the mutations N R203K and N G204R have occurred, till now, 8,744 and 8,715 times respectively in 65 countries (https://www.gisaid.org/). The mutation Spike M1237I, found just once in our sequences, had occurred earlier, 11 times worldwide in six countries. The mutations NSP12_M196I, NSP3_A1819V, M_L13F, NSP14_D324A, NSP14_T75I and NSP5_V125I were found once in our sequences and were reported in only one sequence in the GISAID database. Therefore, NSP12_M196I, NSP3_A1819V, M_L13F, NSP14_D324A, NSP14_T75I and NSP5_V125I occurred once in India (hCoV-19/India/GBRC46/2020), Poland (hCoV-19/Poland/PL_P15/2020), Switzerland/100144/2020; United Kingdom / England EPI_ISL_423899, Australia, EPI_ISL_426742 and South America EPI_ISL_445334. The mutations NSP10_R134S, NSP15_D335N, NSP16_I169L, NSP3_L431H, NSP3_P1292L and Spike_V6F were highlighted once in Moroccan sequences, with no record in other sequences around the world.

Genomic epidemiology of SARS-CoV-2 in Morocco

We now report the putative history of SARS-CoV-2 transmission in Morocco, as revealed by genomic epidemiology (Fig. 1 and Fig. S1). The virus entered and started circulating in Morocco, around the beginning of February, more likely from Belgium, Spain and France (Clade 20C was the first one to be introduced into Morocco); around March 4, 2020, new infected cases came from Belgium to Morocco (Clade 20A), around March 12, 2020, other infected persons entered Morocco from France and passing through Spain (Clade 20A). Three days later (March 14, 2020), infected cases reached Morocco from Switzerland (Clade A). Around March 17, 2020, other SARS-CoV-2 strains entered Morocco from USA (Clade 20A and 20B) and Germany (Clade 20B). Around March 22, 2020, travel- associated cases landed in Morocco from China (Clade A). Other infected arrivals were registered between the 16 and 19 April, 2020 from USA and Vietnam, more likely through sea trades that continued during the lockdown.
Fig. 1

Phylogeny of 40,390 SARS-CoV-2 genomes collected from GISAID database. The viruses collected from Morocco are highlighted, in colorful dots, according to the clades to which they belong. Clustering of related viruses indicates community transmission after an introduction event. Morocco witnessed three major transmission events.

Phylogeny of 40,390 SARS-CoV-2 genomes collected from GISAID database. The viruses collected from Morocco are highlighted, in colorful dots, according to the clades to which they belong. Clustering of related viruses indicates community transmission after an introduction event. Morocco witnessed three major transmission events. The virus genomes from Moroccan patients held between 4 and 16 mutations relative to the common ancestor [Wuhan-Hu-1/2019′, ‘Wuhan/WH01/2019’] (Fig. 2 ). This pattern got a molecular evolution rate of 25.331 substitutions per year, consistent with what was reported before [13].
Fig. 2

Clock Diagram showing the evolution of mutations number in SARS-CoV-2 strains. The viruses collected in Morocco, colored in orange, got between 4 and 16 mutations compared to the reference sequence.

Clock Diagram showing the evolution of mutations number in SARS-CoV-2 strains. The viruses collected in Morocco, colored in orange, got between 4 and 16 mutations compared to the reference sequence. Phylogenetic analysis (Fig. 1) shows that the SARS-CoV-2 genomes from Moroccan patients are dispersed across the evolutionary tree of SARS-CoV-2, estimated from 40,390 genomes available on GISAID as of June 7, 2020. These included 9 viruses from Clade 20A, 9 from Clade 20B and 2 from Clade 20C (Fig. 3 ). This reveals that the epidemic spread in Morocco did not show a predominant SARS-CoV-2 lineage. However, it is more likely that the virus circulated covertly around the beginning of February, before the official discovery of the first case on March 2.
Fig. 3

Radial tree for the Phylogeny of 40,390 SARS-CoV-2 genomes collected from GISAID database. The viruses collected from Morocco, highlighted, in orange dots, belong to three clades 20A, 20B and 20C.

Radial tree for the Phylogeny of 40,390 SARS-CoV-2 genomes collected from GISAID database. The viruses collected from Morocco, highlighted, in orange dots, belong to three clades 20A, 20B and 20C.

Discussion

Among the missense mutations, Spike_D614G, NSP12_P323L, N_R203K and N_G204R occurred with high frequency worldwide. It is very likely that this contributes to increase the SARS-CoV-2 transmissibility. Throughout its evolution within the host, the virus seeks to proliferate efficiently, while concurrently circumventing host morbidity to set out a maximum transmission [14]. This is in concordance with the concomitant reduced pathogenicity that accompanies its transmission increase. The Spike_D614G mutation manifested a major effect on the efficiency of the virus to infect hosts [15] and showed a high ability to hinder the immune systems of hosts that already dealt with version of SARS-CoV-2 without the Spike_D614G mutation [16]. This aspect should be emphasized for future vaccine researches. Though the mutation NSP12_P323L aroused the substitution of proline that plays a prominent role in protein folding and aggregation, neither increased the SARS-CoV-2's infectivity or its fitness regarding natural selection [17]. This might be because the change from proline to leucine amino-acid (P323L) did not change the protein function, as both amino-acids pertain to the non-polar aliphatic R groups. Other missense mutations occurred either with small frequency like Spike_M1237I that was reported 11 times all over the world or those reported only once worldwide, like NSP12_M196I, NSP3_A1819V, M_L13F, NSP14_D324A, NSP14_T75I and NSP5_V125I. These mutations seem to be rare and may represent the virus adaptation in the face of specific genetic backgrounds of the host, climate conditions or other unknown factors. Concerning the mutation Spike_M1237I, it's noteworthy that the spike surface glycoprotein is crucial for the virus binding to receptors on the host cell [18]. This glycoprotein is also the major target of neutralizing antibodies [19]. Hence, mutations in the spike surface glycoprotein could provoke change in the antigenicity of SARS-CoV-2. The mutations NSP10_R134S, NSP15_D335N, NSP16_I169L, NSP3_L431H, NSP3_P1292L and Spike_V6F that occurred in the Moroccan sequences with no record in other sequences worldwide, should get careful attention and should be investigated to figure out their potential effects on the SARS-CoV-2 virulence, as well as their association with immunological and clinical symptoms. Special focus should be afforded to the mutation Spike_V6F, as the structural protein spike has been proven to be essential for the virus' ability to infect the hosts and was used as an important target for vaccine development [20]. Genomic epidemiology, using Nextstrain, applied to SARS-CoV-2 transmission in Morocco, revealed many aspects of the epidemic already known to the authorities, like the introduction of SARS-CoV-2 strains into Morocco from Belgium, Spain and France at the beginning of the epidemic. However, this analysis highlighted many other so far unknown events, like the arrival of strains from USA and Vietnam after the lockdown, possibly through sea trades. SARS-CoV-2 genomes from Moroccan patients are dispersed across the evolutionary tree of SARS-CoV-2, with 9 viruses belonging to Clade 20A, 9 to Clade 20B and 2 to Clade 20C. This suggests that multiple and unrelated introductions of COVID-19 into Morocco have occurred through different routes, giving rise to the diversity of virus lineages reported in this study. This finding suggests that different SARS-CoV-2 strains, with different mutation patterns, coexist in Morocco. The contribution of each of those mutations needs to be investigated, in order to ascertain possible drug resistance and eventually different SARS-CoV-2 mortality rates related to these mutations. It is tempting to speculate that the specific evolutionary profiles of the SARS-CoV-2 in Morocco might be the product of the interaction between its evolutionary routes before reaching the country and its adaptation to the Moroccan genetic background. This genomic survey of SARS-CoV-2 in Morocco has at least two limitations. First, the sample size is relatively small, which might hinder a compelling interpretation of the epidemiological situation in the country. Albeit samples come from geographically diverse regions in Morocco, under-sampling of virus genomes makes it hard to acquire a compelling picture of SARS-CoV-2 transmission. Deep genomic surveillance of SARS-CoV-2, with an extensive sampling, reveals the circulating lineages of the virus, their changes over time, as well as how the interventions are affecting virus evolution. However, the limited diagnostic and sequencing infrastructure in the country has seriously hindered the epidemic management and unquestionably, genomic and bioinformatics infrastructure has to be deployed to handle future outbreaks more effectively. It is paramount to learn the lessons from COVID-19 and develop the necessary tools for the next pandemic preparedness. Second, all the samples analyzed, except one, were obtained from public health laboratories and thus may not be representative of the general population.

Conclusions

The genetic analysis of the SARS-CoV-2 genomes from Moroccan patients revealed some new mutations, with no record in other sequences worldwide. These mutations should be investigated to ascertain their potential effects on the SARS-CoV-2 virulence and to evaluate their impacts on the immune response. The phylogenetic analyses revealed that the COVID-19 spread occurred through multiple and unrelated introductions of COVID-19 into Morocco through different routes. SARS-CoV-2 might have circulated in a covert way in Morocco around the beginning of February, before the discovery of the first case on March 2, 2020.

Conflict of interest statement

The authors declare that there are no conflicts of interest.

Author contributions

Bouabid Badaoui: Conceptualization, Formal Analysis, Methodology, Writing - Original Draft. Khalid Sadki: Writing - Review & Editing. Chouhra Talbi: Writing - Review & Editing. Driss Salah: Data Curation, Writing - Original Draft. Lina Tazi: Conceptualization, Writing - Review & Editing.
  19 in total

Review 1.  Human Coronavirus: Host-Pathogen Interaction.

Authors:  To Sing Fung; Ding Xiang Liu
Journal:  Annu Rev Microbiol       Date:  2019-06-21       Impact factor: 15.500

2.  Virulence evolution and the trade-off hypothesis: history, current state of affairs and the future.

Authors:  S Alizon; A Hurford; N Mideo; M Van Baalen
Journal:  J Evol Biol       Date:  2009-02       Impact factor: 2.411

3.  Establishment and cryptic transmission of Zika virus in Brazil and the Americas.

Authors:  N R Faria; J Quick; I M Claro; J Thézé; J G de Jesus; M Giovanetti; M U G Kraemer; S C Hill; A Black; A C da Costa; L C Franco; S P Silva; C-H Wu; J Raghwani; S Cauchemez; L du Plessis; M P Verotti; W K de Oliveira; E H Carmo; G E Coelho; A C F S Santelli; L C Vinhal; C M Henriques; J T Simpson; M Loose; K G Andersen; N D Grubaugh; S Somasekar; C Y Chiu; J E Muñoz-Medina; C R Gonzalez-Bonilla; C F Arias; L L Lewis-Ximenez; S A Baylis; A O Chieppe; S F Aguiar; C A Fernandes; P S Lemos; B L S Nascimento; H A O Monteiro; I C Siqueira; M G de Queiroz; T R de Souza; J F Bezerra; M R Lemos; G F Pereira; D Loudal; L C Moura; R Dhalia; R F França; T Magalhães; E T Marques; T Jaenisch; G L Wallau; M C de Lima; V Nascimento; E M de Cerqueira; M M de Lima; D L Mascarenhas; J P Moura Neto; A S Levin; T R Tozetto-Mendoza; S N Fonseca; M C Mendes-Correa; F P Milagres; A Segurado; E C Holmes; A Rambaut; T Bedford; M R T Nunes; E C Sabino; L C J Alcantara; N J Loman; O G Pybus
Journal:  Nature       Date:  2017-05-24       Impact factor: 49.962

Review 4.  Quasispecies theory and the behavior of RNA viruses.

Authors:  Adam S Lauring; Raul Andino
Journal:  PLoS Pathog       Date:  2010-07-22       Impact factor: 6.823

5.  IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies.

Authors:  Lam-Tung Nguyen; Heiko A Schmidt; Arndt von Haeseler; Bui Quang Minh
Journal:  Mol Biol Evol       Date:  2014-11-03       Impact factor: 16.240

6.  Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study.

Authors:  Nanshan Chen; Min Zhou; Xuan Dong; Jieming Qu; Fengyun Gong; Yang Han; Yang Qiu; Jingli Wang; Ying Liu; Yuan Wei; Jia'an Xia; Ting Yu; Xinxin Zhang; Li Zhang
Journal:  Lancet       Date:  2020-01-30       Impact factor: 79.321

Review 7.  Towards a genomics-informed, real-time, global pathogen surveillance system.

Authors:  Jennifer L Gardy; Nicholas J Loman
Journal:  Nat Rev Genet       Date:  2017-11-13       Impact factor: 53.242

8.  SARS-CoV-2 spike-protein D614G mutation increases virion spike density and infectivity.

Authors:  Lizhou Zhang; Cody B Jackson; Huihui Mou; Amrita Ojha; Haiyong Peng; Brian D Quinlan; Erumbi S Rangarajan; Andi Pan; Abigail Vanderheiden; Mehul S Suthar; Wenhui Li; Tina Izard; Christoph Rader; Michael Farzan; Hyeryun Choe
Journal:  Nat Commun       Date:  2020-11-26       Impact factor: 14.919

9.  The proximal origin of SARS-CoV-2.

Authors:  Kristian G Andersen; Andrew Rambaut; W Ian Lipkin; Edward C Holmes; Robert F Garry
Journal:  Nat Med       Date:  2020-04       Impact factor: 87.241

10.  Mutations in SARS-CoV-2 viral RNA identified in Eastern India: Possible implications for the ongoing outbreak in India and impact on viral structure and host susceptibility.

Authors:  Arindam Maitra; Mamta Chawla Sarkar; Harsha Raheja; Nidhan K Biswas; Sohini Chakraborti; Animesh Kumar Singh; Shekhar Ghosh; Sumanta Sarkar; Subrata Patra; Rajiv Kumar Mondal; Trinath Ghosh; Ananya Chatterjee; Hasina Banu; Agniva Majumdar; Sreedhar Chinnaswamy; Narayanaswamy Srinivasan; Shanta Dutta; Saumitra DAS
Journal:  J Biosci       Date:  2020       Impact factor: 1.826

View more
  6 in total

1.  Unlocking capacities of genomics for the COVID-19 response and future pandemics.

Authors:  Sergey Knyazev; Karishma Chhugani; Varuni Sarwal; Ram Ayyala; Harman Singh; Smruthi Karthikeyan; Dhrithi Deshpande; Pelin Icer Baykal; Zoia Comarova; Angela Lu; Yuri Porozov; Tetyana I Vasylyeva; Joel O Wertheim; Braden T Tierney; Charles Y Chiu; Ren Sun; Aiping Wu; Malak S Abedalthagafi; Victoria M Pak; Shivashankar H Nagaraj; Adam L Smith; Pavel Skums; Bogdan Pasaniuc; Andrey Komissarov; Christopher E Mason; Eric Bortz; Philippe Lemey; Fyodor Kondrashov; Niko Beerenwinkel; Tommy Tsan-Yuk Lam; Nicholas C Wu; Alex Zelikovsky; Rob Knight; Keith A Crandall; Serghei Mangul
Journal:  Nat Methods       Date:  2022-04       Impact factor: 47.990

2.  Phylogenetic and genome-wide mutational analysis of SARS-CoV-2 strains circulating in Nigeria: no implications for attenuated COVID-19 outcomes.

Authors:  Daniel B Kolawole; Malachy I Okeke
Journal:  Osong Public Health Res Perspect       Date:  2022-04-22

3.  Genomic Diversity of SARS-CoV-2 in Algeria and North African Countries: What We Know So Far and What We Expect?

Authors:  Taha Menasria; Margarita Aguilera
Journal:  Microorganisms       Date:  2022-02-18

4.  Using genomic epidemiology of SARS-CoV-2 to support contact tracing and public health surveillance in rural Humboldt County, California.

Authors:  Gunnar Stoddard; Allison Black; Patrick Ayscue; Dan Lu; Jack Kamm; Karan Bhatt; Lienna Chan; Amy L Kistler; Joshua Batson; Angela Detweiler; Michelle Tan; Norma Neff; Joseph L DeRisi; Jeremy Corrigan
Journal:  BMC Public Health       Date:  2022-03-07       Impact factor: 3.295

5.  Detection of SARS-CoV-2 in a dog with hemorrhagic diarrhea.

Authors:  Miguel Padilla-Blanco; Santiago Vega; Luis Enjuanes; Alfonso Morey; Teresa Lorenzo; Clara Marín; Carmen Ivorra; Elisa Maiques; Vicente Rubio; Consuelo Rubio-Guerri
Journal:  BMC Vet Res       Date:  2022-10-12       Impact factor: 2.792

6.  Molecular Analysis of SARS-CoV-2 Circulating in Bangladesh during 2020 Revealed Lineage Diversity and Potential Mutations.

Authors:  Rokshana Parvin; Sultana Zahura Afrin; Jahan Ara Begum; Salma Ahmed; Mohammed Nooruzzaman; Emdadul Haque Chowdhury; Anne Pohlmann; Shyamal Kumar Paul
Journal:  Microorganisms       Date:  2021-05-12
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.