| Literature DB >> 35605880 |
Nancy Fayad1, Walid Abi Habib1, Rabeh El-Shesheny2, Ahmed Kandeil3, Youmna Mourad4, Jacques Mokhbat5, Ghazi Kayali6, Jimi Goldstein7, Jad Abdallah8.
Abstract
The COVID-19 pandemic continues to pose a global health concern, despite the ongoing vaccination campaigns, due to the emergence and rapid spread of new variants of the causative agent SARS-CoV-2. These variants are identified and tracked via the marker mutations they carry, and the classification system put in place following tremendous sequencing efforts. In this study, the genomes of 1,230 Lebanese SARS-CoV-2 strains collected throughout 2 years of the outbreak in Lebanon were analyzed, 115 of which sequenced within this project. Strains were classified into seven GISAID clades, the major one being GRY, and 36 Pango lineages, with three variants of concern identified: alpha, delta and omicron. A time course distribution of GISAID clades allowed the visualization of change throughout the two years of the Lebanese outbreak, in conjunction with major events and measures in the country. Subsequent phylogenetic analysis showed the clustering of strains belonging to the same clades. In addition, a mutational survey showed the presence of mutations in the structural, non-structural and accessory proteins. Twenty five (25) mutations were labeled as major, i.e. present in more than 30% of the strains, such as the common Spike_D614G and NSP3_T183I. Whereas 635 were labeled as uncommon, i.e. found in very few of the analyzed strains as well as GISAID records, such as NSP2_I349V. Distribution of these mutations differed between 2020, and the first and the second half of 2021. In summary, this study highlights key genomic aspects of the Lebanese SARS-CoV-2 strains collected in 2020, the first year of the outbreak in Lebanon, versus those collected in 2021, the second year of COVID-19 in Lebanon.Entities:
Keywords: COVID-19; Clades; GISAID; Lebanon; Mutations; Phylogenetic relationship; Two years
Mesh:
Substances:
Year: 2022 PMID: 35605880 PMCID: PMC9121641 DOI: 10.1016/j.virusres.2022.198824
Source DB: PubMed Journal: Virus Res ISSN: 0168-1702 Impact factor: 6.286
COVID-19 in Lebanon: number of cases, related deaths and strains sequenced.
| 2020 | 2021 | 2021/2022 | |
|---|---|---|---|
| Confirmed COVID-19 cases | 181,503 | 358,885 | 306,994 |
| Number of COVID-19 related deaths | 1,455 | 6,274 | 1,710 |
| Number of sequenced genomes (In this study) | 161 (96) | 923 (21) | 86 |
| Genome length range (bp) | 28603 - 29905 | 29148 - 29904 | 29295 - 29776 |
| Prevalent GISAID clades | G (65.8%) | GRY (87.97%) | GK (98.76%) |
| Prevalent Pango Lineage | B.1.398 (61.5%) | B.1.1.7 (80.9%) | AY.4 (60.49%) |
As reported by the Lebanese Ministry of Public Health.
Percentage of the total analyzed strains.
Fig. 1Sunburst graph showing the distribution of Pango Lineages among the 1,230 Lebanese SARS-CoV-2 strains. Percentage of the strains belonging to each lineage or group is indicated.
Fig. 2Percentage pie chart distribution of (A) GISAID clades among the 1,230 Lebanese SARS-CoV-2 strains, and (B) strains collected in 2019, 2020, 2021 or 2022 for each of the GISAID clades. Legend keys are shown under the charts.
Fig. 3(A) Time course distribution of the GISAID clades since the start of the Lebanese outbreak in February 2020 until late January 2022. Legend keys are shown under the chart. (B) Number of confirmed COVID-19 cases per month and key events related to the pandemic. The asterisk sign indicated months during which no strains were collected and sequenced.
Fig. 4Unrooted circular phylogenetic tree representing the relationship between the 1,230 Lebanese SARS-CoV-2 strains: genome were aligned with MAFFT then a tree was inferred using FastTree v2.10.1 algorithm under the general time reversible (GTR) model with a bootstrap value of 1,000. The resulting tree was then annotated with iToL (22)(22)(22)(22). Legend key is shown under the tree for the tree tips – VOC; inner-most circles – GISAID clades for, one, the reference strains, and two, the analyzed Lebanese SARS-CoV-2 strains; middle circle - strains sequenced within this study, indicated with a black strip; and the outer-most circle – collection year.
The most common mutations found in the encoded proteins of strains collected in 2020 vs in 2021.
| 2020 (n = 161) | 2021-2022 (n = 1068) | ||||||
|---|---|---|---|---|---|---|---|
| February - December | January - May (n= 983) | June 2021 - January 2022 (n=85) | |||||
| N_R203K | 90.64% | N_D63G | 94.12% | ||||
| Major Mutations | N_S194L | 56.52% | N_G204R | 90.34% | N_R203M | 91.76% | |
| June -December | |||||||
| N_T391I | 40.37% | N_D3L | 89.93% | N_G215C | 80.00% | ||
| June -December | N_S235F | 89.42% | N_D377Y | 56.47% | |||
| Other noteworthy Mutations | N_G204R | 12.42% | NA | NA | NA | NA | |
| N_R203K | |||||||
| Major Mutations | NA | NA | NA | NA | M_I82T | 94.12% | |
| Other noteworthy Mutations | NA | NA | NA | NA | NA | NA | |
| Major Mutations | S_D614G | 99.90% | |||||
| S_A570D | 89.93% | ||||||
| S_P681H | 89.93% | S_D614G | 100% | ||||
| S_D614G | 98.14% | S_T716I | 89.83% | S_L452R | 94.12% | ||
| S_T95I | 60.25% | S_D1118H | 89.73% | S_T478K | 96.47% | ||
| June -December | |||||||
| S_S982A | 89.32% | S_D950N | 90.59% | ||||
| S_V70del | 89.32% | S_P681R | 90.59% | ||||
| S_H69del | 89.22% | Spike_T19R | 82.35% | ||||
| S_Y144del | 86.98% | ||||||
| S_N501Y | 59.41% | ||||||
| Other noteworthy Mutations | NA | NA | NA | NA | Spike_T250I | 17.64% | |
| Spike_S680T | 15.29% | ||||||
| Spike_T29A | 14.11% | ||||||
| Spike_E156G | 10.58% | ||||||
| Spike_F157del | 10.58% | ||||||
| Spike_R158del | 10.58% | ||||||
| Major Mutations | NSP12_P323L | 98.82% | |||||
| NSP12_G671S | 94.12% | ||||||
| NSP12_P323L | 99.19% | NSP13_P77L | 92.94% | ||||
| NSP12_P323L | 98.14% | NSP3_T183I | 90.13% | NSP4_T492I | 80.00% | ||
| NSP3_V473I | 63.98% | NSP6_F108del | 89.73% | NSP14_A394V | 78.82% | ||
| June - December | |||||||
| NSP3_S1717L | 47.83% | NSP6_G107del | 89.73% | NSP3_A488S | 78.82% | ||
| July - December | |||||||
| NSP6_S106del | 89.73% | NSP3_P1469S | 78.82% | ||||
| NSP3_A890D | 89.62% | NSP4_V167L | 78.82% | ||||
| NSP3_I1412T | 84.64% | NSP6_T77A | 78.82% | ||||
| NSP3_P1228L | 77.65% | ||||||
| Other noteworthy Mutations | NSP12_G823S | 13.66% | NA | NA | NSP2_K81N | 21.17% | |
| NSP12_A311S | 16.47% | ||||||
| NSP14_A119V | 16.47% | ||||||
| NSP3_A579V | 16.47% | ||||||
| NSP4_A446V | 14.11% | ||||||
| NSP6_V149A | 14.11% | ||||||
| Other noteworthy Mutations | NSP12_G823S | 13.66% | NA | NA | NA | NA | |
| June - December | |||||||
| Major Mutations | NA | NA | NS3_W131C | 78.33% | |||
| NS8_Y73C | 89.17% | NS3_S26L | 98.55% | ||||
| NS8_Q27stop | 87.87% | NS7b_T40I | 42.40% | ||||
| NS8_R52I | 87.54% | ||||||
| NS8_K68stop | 81.80% | ||||||
| Other noteworthy Mutations | NS3_Q57H | 19.88% | NA | NA | NS3_D238Y | 16.47% | |
| NS8_D63Y | 19.25% | ||||||
| July - December | |||||||
For each mutation, we indicate the percentage of strains that present it, as well as the timeframe during which it was a major or a noteworthy one.
Major mutations: found in more than 30% of the strains.
Noteworthy mutations found in more than 10% of the strains.
Common mutations found in 2020, January-May 2021, and June-July 2021.
Different mutations occurring at the same position in the protein;