| Literature DB >> 35156006 |
Olayinka Sunday Okoh1, Nicholas Israel Nii-Trebi2, Abdulrokeeb Jakkari3, Tosin Titus Olaniran4,5, Tosin Yetunde Senbadejo6, Anna Aba Kafintu-Kwashie7, Emmanuel Oluwatobi Dairo5,8, Tajudeen Oladunni Ganiyu6, Ifiokakaninyene Ekpo Akaninyene4,5, Louis Odinakaose Ezediuno9, Idowu Jesulayomi Adeosun10,11, Michael Asebake Ockiya12, Esther Moradeyo Jimah5,13, David J Spiro14, Elijah Kolawole Oladipo5,10, Nídia S Trovão14.
Abstract
There is a dearth of information on COVID-19 disease dynamics in Africa. To fill this gap, we investigated the epidemiology and genetic diversity of SARS-CoV-2 lineages circulating in the continent. We retrieved 5229 complete genomes collected in 33 African countries from the GISAID database. We investigated the circulating diversity, reconstructed the viral evolutionary divergence and history, and studied the case and death trends in the continent. Almost a fifth (144/782, 18.4%) of Pango lineages found worldwide circulated in Africa, with five different lineages dominating over time. Phylogenetic analysis revealed that African viruses cluster more closely with those from Europe. We also identified two motifs that could function as integrin-binding sites and N-glycosylation domains. These results shed light on the epidemiological and evolutionary dynamics of the circulating viral diversity in Africa. They also emphasize the need to expand surveillance efforts in Africa to help inform and implement better public health measures.Entities:
Keywords: Genomics; Phylogenetics; Virology
Year: 2022 PMID: 35156006 PMCID: PMC8817759 DOI: 10.1016/j.isci.2022.103880
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1Number of COVID-19 reported cases and deaths per 100,000 population in the different continents
The red line represents the average absolute number of COVID-19 cases worldwide per 100,000 population in the world (i.e., (global COVID-19 cases/world population) x 100,000). The orange points represent the average number of deaths per 100,000 population in the different continents with its scale on the left y axis. The orange line represents the number of deaths globally per 100,000 world population.
Figure 2COVID-19 cases reported in African countries
Absolute number of cases (left). Number of cases per 100,000 population (right).
Figure 3Reported deaths from COVID-19 in Africa
Absolute number of total deaths (left) per country. Number of deaths per 1,000 reported cases (right).
Figure 4Number of SARS-CoV-2 tests conducted in African countries
Absolute number of tests (left). Number of tests per 100,000 population (right).
Figure 5COVID-19 positivity rate in Africa
Number of positive patients out of every 1000 COVID-19 tests. Gray shades represent countries for which data is not available.
Figure 6Sequences from African countries submitted to GISAID
Absolute number of sequences from Africa submitted to GISAID (left). Sequences available in GISAID per 1,000 SARS-CoV-2 cases in Africa (right). Gray shade represents countries for which no sequences were available (n = 27/57 (47.4%) countries).
Figure 7The incidence of the top five Pango lineages circulating in Africa between March 1, 2020 and January 7, 2021.
Figure 8Maximum likelihood tree colored by continent
Phylogenetic tree inferred for a dataset with genetic sequences from all continents.
Figure 9Evolutionary divergence of SARS-CoV-2 across continents
Violin plots represent the distribution of pairwise genetic distances between all sequences for isolates in each continent. Vertical lines depict the mean pairwise genetic distance between all samples in each continent.
Repeat patterns and motifs in SARS-CoV-2 genomes from Africa
| ID | Motif | Accession number | Length | Functional site class | p-value | e-value |
|---|---|---|---|---|---|---|
| 1 | ILRKGGR | ELM: ELME000129 (LIG_IBS_1) | 50 | Integrin binding sites | 3.43E-04 | 5.63E-02 |
| N-glycosylation site | 4.70e-03 | 7.71E-01 | ||||
| 2 | ILRKGGR | ELM: ELME000316 (LIG_Integrin_isoDGR_1) | 50 | Integrin binding sites | 2.68E-02 | 2.68E-02 |
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| R | ||
| Biostrings R package | ||
| maptools R package | ||
| RColorBrewer R package | ||
| maps R package | ||
| mapdata R package | ||
| readxl R package | ||
| ggplot2 R package | ||
| dplyr R package | ||
| gridExtra R package | ||
| ggcorrplot R package | ||
| ggpubr R package | ||
| ggmap R package | ||
| mapproj R package | ||
| rio R package | ||
| tidyverse R package | ||
| readr R package | ||
| graphics R package | ||
| sm R package | ||
| Lubridate | ||
| Aweek | ||
| vioplot R package | ||
| MAFFT | ||
| Aliview | ||
| Figtree | Not applicable | |
| TempEst | ||
| IQTree | ||
| GLAM2 | ||
| Tomtom | Shobhit Gupta, JA Stamatoyannopolous, Timothy Bailey and William Stafford Noble, 2007 | |
| Data and code | This study | |
| Sequence data from GISAID | ||
| GISAID database authors and laboratories | This study | |