| Literature DB >> 33262895 |
Abdel-Rahman N Zekri1, Khaled Easa Amer2, Mohammed M Hafez1, Zeinab K Hassan1, Ola S Ahmed1, Hany K Soliman1, Abeer A Bahnasy3, Wael Abdel Hamid4, Ahmad Gad4, Mahmoud Ali2, Wael Hassan2, Mahmoud Samir2, Ahmad Raouf2, Ayman A Khattab2, Mona Salah El Din Hamdy5, May Sherif Soliman5, Maha Hamdi El Sissy5, Sara Mohamed El Khateeb5, Moushira Hosny Ezzelarab5, Lamiaa A Fathalla6, Mohamed Abouelhoda7.
Abstract
Introduction: The novel coronavirus severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has spread throughout the globe, causing a pandemic. In Egypt over 96,108 individuals were infected so far. Objective: In the present study, the objective is to perform a complete genome sequence of SAR-CoV2 isolated from Egyptian coronavirus disease (COVID-19) patients.Entities:
Keywords: Next generation sequencing; Sars-CoV2; real time PCR
Year: 2020 PMID: 33262895 PMCID: PMC7688418 DOI: 10.1016/j.jare.2020.11.012
Source DB: PubMed Journal: J Adv Res ISSN: 2090-1224 Impact factor: 10.479
High frequency mutations in SARS-CoV-2 sequences of Egypt and the world. The table shows the most frequent variations in the Egyptian population. The table also includes the frequency of these variations in different populations. The numbers in bold indicate that this frequency is significantly different from the Egyptian frequency using Chi-Square/Fisher Exact test (P < 0.05). Supplementary Table S5 includes comparisons among other populations.
| Genome Change | Position | Gene | Protein Change | Mutation type | Egypt count (n = 61) | EgyFreq |
|---|---|---|---|---|---|---|
| c.1841A > G | 23,403 | S | p.Asp614Gly | Missense | 60 | 0.98361 |
| c.-25C > T | 241 | ORF1ab | Non-coding | Upstream | 59 | 0.96721 |
| c.2772C > T | 3037 | ORF1ab | p.Phe924Phe | Synonymous | 57 | 0.93443 |
| c.14144C > T | 14,408 | ORF1ab | p.Pro4715Leu | Missense | 56 | 0.91803 |
| c.171G > T | 25,563 | ORF3a | p.Gln57His | Missense | 30 | 0.4918 |
| c.18613C > T | 18,877 | ORF1ab | p.Leu6205Leu | Synonymous | 17 | 0.27869 |
| c.3108C > A | 3373 | ORF1ab | p.Asp1036Glu | Missense | 13 | 0.21311 |
| c.12269C > T | 12,534 | ORF1ab | p.Thr4090Ile | Missense | 10 | 0.16393 |
| c.2169C > T | 23,731 | S | p.Thr723Thr | Synonymous | 10 | 0.16393 |
| c.3737C > T | 4002 | ORF1ab | p.Thr1246Ile | Missense | 10 | 0.16393 |
Distribution of variations in different genes. Total world Samples in June and October are 46,612 and 89,632, respectively. The VarFreq is the number of variations in the gene divided by the total number of samples. The relative frequency is the number of variations divided. Freq norm is the frequency divided by the total number of variations in each group. N/S is the ration between the number of non-synonymous and synonymous variations in the same gene.
| Gene | Len | Var Count World June | Var Count World June | Var Count | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| M | 669 | 464 | 515 | 2 | ||||||
| N | 908 | 1107 | 1371 | 23 | ||||||
| E | 228 | 266 | 320 | 1 | ||||||
| ORF1ab | 21,290 | 14,779 | 16,221 | 131 | ||||||
| ORF3a | 828 | 809 | 1003 | 6 | ||||||
| ORF6 | 186 | 191 | 230 | 1 | ||||||
| ORF7a | 498 | 427 | 524 | 6 | ||||||
| ORF8 | 193 | 366 | 439 | 4 | ||||||
| S | 3822 | 3405 | 3146 | 30 | ||||||
| M | 669 | 0.022 | 0.032 | 1.03 | 0.022 | 0.033 | 1.14 | 0.010 | 0.015 | 1 |
| N | 908 | 0.051 | 0.057 | 1.88 | 0.058 | 0.064 | 2.14 | 0.113 | 0.125 | 1.75 |
| E | 228 | 0.012 | 0.054 | 2.02 | 0.014 | 0.060 | 3.2 | 0.005 | 0.021 | 0 |
| ORF1ab | 21,290 | 0.686 | 0.032 | 1.24 | 0.692 | 0.032 | 1.25 | 0.645 | 0.030 | 1.48 |
| ORF3a | 828 | 0.038 | 0.045 | 2.38 | 0.043 | 0.052 | 2.5 | 0.030 | 0.036 | 4 |
| ORF6 | 186 | 0.009 | 0.048 | 1.91 | 0.010 | 0.053 | 2.17 | 0.005 | 0.026 | 0 |
| ORF7a | 498 | 0.020 | 0.040 | 1.92 | 0.022 | 0.045 | 2.08 | 0.030 | 0.059 | 2 |
| ORF8 | 193 | 0.017 | 0.088 | 1.9 | 0.019 | 0.097 | 1.88 | 0.020 | 0.102 | 3 |
| S | 3822 | 0.158 | 0.041 | 1.31 | 0.134 | 0.035 | 1.16 | 0.148 | 0.039 | 2 |
Fig. 1Variant frequencies in SARS-Cov2 isolate in Egypt in comparison to world population. Part (a) frequency of all variants. Part (b), frequency of variants in ORF genes.
Number of gene variations in SARS-CoV2 genomes. E: envelope protein; M: membrane glycoprotein; N: nucleocapsid phosphoprotein; ORF: open reading frame; S: spike glycoprotein; SARS-CoV-2: severe acute respiratory syndrome coronavirus 2. Note: We compared 61 whole genomes to the NC_045512.2 genome sequence.
| Genome segment | Missense mutation | Synonymous mutation | Non-coding region | Other mutation | Frameshift deletion/in frame del | Stop-gained | Total | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Mutation | Deletion | Insertion | Upstream | downstream | ||||||
| ORF1ab | 74 | 50 | 0 | 0 | 0 | 5 | 0 | 2(1,1) | 0 | 131 |
| S | 14 | 7 | 0 | 0 | 0 | 0 | 7 | 2 (1,1) | 0 | 30 |
| ORF3a | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 6 |
| E | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| M | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| ORF6 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ORF7 | 4 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| ORF8 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| N | 14 | 8 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 23 |
| Total | ||||||||||
Fig. 2Phylogenetic analysis of the Egyptian sequences plus 250 sequences sub-sampled from the Nextstrain dataset. The tree is annotated with GISAID clade information. (High resolution plot is in Supplementary File S6). Each sequence is named as follows: “Type:GISAID_ID:Country:Year:Month”.
Fig. 3Phylogenetic analysis of Egyptian sequences and extended neighbor set composed of 786 genome sequences. The tree is annotated with GISAID clade information. The black dots show location of the Egyptian sequences. (High resolution plot is in Supplementary File S6). Each sequence is named as follows: “Type:GISAID_ID:Country:Year:Month”.