| Literature DB >> 33170902 |
Meriem Laamarti1, Tarek Alouane1, Souad Kartti1, M W Chemao-Elfihri1, Mohammed Hakmi1, Abdelomunim Essabbar1, Mohamed Laamarti1, Haitam Hlali1, Houda Bendani1, Nassma Boumajdi1, Oussama Benhrif1, Loubna Allam1, Naima El Hafidi1, Rachid El Jaoudi1, Imane Allali2, Nabila Marchoudi3, Jamal Fekkak3, Houda Benrahma4, Chakib Nejjari5, Saaid Amzazi2, Lahcen Belyamani6, Azeddine Ibrahimi1.
Abstract
In late December 2019, an emerging viral infection COVID-19 was identified in Wuhan, China, and became a global pandemic. Characterization of the genetic variants of SARS-CoV-2 is crucial in following and evaluating it spread across countries. In this study, we collected and analyzed 3,067 SARS-CoV-2 genomes isolated from 55 countries during the first three months after the onset of this virus. Using comparative genomics analysis, we traced the profiles of the whole-genome mutations and compared the frequency of each mutation in the studied population. The accumulation of mutations during the epidemic period with their geographic locations was also monitored. The results showed 782 variants sites, of which 512 (65.47%) had a non-synonymous effect. Frequencies of mutated alleles revealed the presence of 68 recurrent mutations, including ten hotspot non-synonymous mutations with a prevalence higher than 0.10 in this population and distributed in six SARS-CoV-2 genes. The distribution of these recurrent mutations on the world map revealed that certain genotypes are specific to geographic locations. We also identified co-occurring mutations resulting in the presence of several haplotypes. Moreover, evolution over time has shown a mechanism of mutation co-accumulation which might affect the severity and spread of the SARS-CoV-2. The phylogentic analysis identified two major Clades C1 and C2 harboring mutations L3606F and G614D, respectively and both emerging for the first time in China. On the other hand, analysis of the selective pressure revealed the presence of negatively selected residues that could be taken into considerations as therapeutic targets. We have also created an inclusive unified database (http://covid-19.medbiotech.ma) that lists all of the genetic variants of the SARS-CoV-2 genomes found in this study with phylogeographic analysis around the world.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33170902 PMCID: PMC7654798 DOI: 10.1371/journal.pone.0240345
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 5Phylogenetic analysis of 3067 SARS-CoV 2 genomes grouped according to the country of origin.
The length of the branches represents the distance in time.
Selective pressure analysis on the spike and orf1ab genes of SARS-CoV-2.
| Genes | Ω | FEL method | MEME method | SLAC method | FUBAR method | |||
|---|---|---|---|---|---|---|---|---|
| 0.571391 | ||||||||
| Codons 215, 474, 809, 820, 921, 1044 | Codon 5 | Codons 215, | ||||||
| 474, 541, 809, 820, 921, 1044 | ||||||||
| 0.75951 | ||||||||
| Codon 2244 | Codons | Codon2244 | Codons 1473, | |||||
| 1171, 2923, 3003, 3715, | ||||||||
| 2244, 3090 | ||||||||
| 5221, 5704, | ||||||||
| 6267, 6961 | ||||||||