Literature DB >> 35317482

Study of the SARS-CoV-2 genomic data generation to evaluate the introduction of genomics in epidemiological surveillance and public health decision making.

Tiatou Souho1,2, Lallepak Lamboni1,2, Bianza Moise Bakadia2, Essodolom Taale1,3, Koffi Kibalou Palanga3, Sabiba Kou'santa Amouzou1.   

Abstract

Introduction: the limited number of equipped laboratories and the lack of expertise left Africa lagging behind in terms of contribution in genomic data generation. The COVID-19 pandemic has drawn the attention of all public health stakeholders so that it can be used as a marker of the efforts that public health systems can produced. The main purpose of the present analytical study was to evaluate the contribution of the African continent in the genomic surveillance of SARS-CoV-2.
Methods: data from the two most popular genomic databases on SARS-CoV-2 (GISAID EpiCov and NCBI Virus) were extracted and analyzed. Comparisons were made using the sequencing ratio which represents the number of genomic sequence published over one thousands confirmed cases.
Results: considering continental blocks, the Africa occupied the fourth place after Oceania, Europe and North America based on sequencing ratios. However, when the considered comparison parameter is the number of sequences, the African continent was the fifth contributor after Europe, North America, Asia and South America.
Conclusion: the study showed that African countries have effectively integrated the genomic data generation in the public health response strategies but the effective use of these data for a perfect surveillance is not clearly established. There is a need for capacity building in genomic data analyses for a better response to public health threats in Africa. Copyright: Tiatou Souho et al.

Entities:  

Keywords:  Africa; COVID-19; Genomic surveillance; SARS-CoV-2

Mesh:

Year:  2022        PMID: 35317482      PMCID: PMC8917452          DOI: 10.11604/pamj.2022.41.55.32344

Source DB:  PubMed          Journal:  Pan Afr Med J


Introduction

Since its first detection in November 2019 in China, the COVID-19 has continued to spread around the world, leading to the declaration of a pandemic by the World Health Organization (WHO) in March 2020 [1]. As of August 27th 2021, the disease already infected more than 214,468,601 persons and caused more than 4,470,969 deaths in the world [2]. Beyond these impressive numbers of cases and deaths, the COVID-19 presents the particularity of being regarded as a serious threat by authorities in all countries including in Africa where implementation of response programmes when dealing with public health issues is usually preceded by long advocacy periods [3-5]. Actually, Africa has long been a continent with insufficient resources as to public healthcare in terms of facilities, equipment, qualified personnel, and expertise for scientific research, which in the beginning of the pandemic raised concerns in the scientific community about the resilience of the continent towards this health crisis [6,7]. Luckily, technical, societal and economic measures have been enabled in order to fight the disease [8,9]. Such efficiency is probably the result of experience acquired during the Ebola virus outbreaks and HIV/AIDS management that likely enhanced the preparedness and response capacity in the continent especially in sub-Saharan countries, though there is still a long way to go for the response at the research level which needs to go much more faster [10,11]. Among noticeable efforts in Africa, molecular diagnostic tools have been reinforced for the virus detection and human resources have been deployed. In addition, some dedicated facilities have been built to sustain the response to the pandemic. Genomic data discovery and sharing are determinant steps in the design of appropriate programmes against public health threats related to infectious agents [12,13]. One of the added values of genomic data collection and studies is the possibility to understand genomic dynamics. Indeed, viral mutations are responsible for the spread of different variants and hamper the effectiveness of public health interventions including diagnosis, vaccination and treatment [14,15]. Building capacities in Africa to gather genomic data on the SARS-CoV-2 and perform studies on these genomes is therefore an important approach to react in response to the pandemic. Several countries in Africa are doing their best to sequence some viral strains isolated from patients, animals or environment. Overall, African countries really afforded scientific partnerships in order to substantially contribute to global efforts for genomic studies of a virus. The purpose of the present study was to evaluate the extent at which the African scientific community participate in the SARS-CoV-2 genomic studies more than one year after the first confirmation of the disease in the continent.

Methods

Study design: the present study was mainly based on data collection and analyses. Nucleotides´ sequences and sequences metadata were collected from two principal platforms: the GISAID Initiative platform [16] and the NCBI Virus platform [17]. Epidemiologic data were collected from the WHO Coronavirus (COVID-19) Dashboard [2]. All these data were extracted on August 10th 2021. Data analysis: metadata from both platforms were mainly used to study the geographic sources of sequences as well as the hosts from where the viral materiel was obtained for sequencing. In the present analyses, the raw number of genomic sequences (partial or complete coverage) was used to gauge the position of the African continent and its countries. The regional organisation of countries in genomic databases does not match the subdivisions in the WHO Coronavirus Dashboard, epidemiologic data were considered by countries and reorganized to allow continental comparisons. The comparison was made using a “sequencing ratio” calculated by dividing the number of published sequences of the virus isolated from human beings, environment or animals by the number of confirmed cases.

Results

The number of sequences in both GISAID and NCBI Virus platforms is continuously growing. On August 10th, the precise number of sequences concerning the SARS-CoV-2 was 1064504 and 2716522 in NCBI Virus and GISAID initiative platforms, respectively. The number of records for every continent and source of virus from both platforms is presented in Table 1. The table also shows that genomic surveillance in animals and the environment is achieved in all continents except in Oceania. However, the number of sequences from animals or the environment is reduced in comparison to sequences from human hosts.
Table 1

number of sequences retrieved from GISAID and NCBI Virus platforms

Data sourceContinentNumber of all sequencesNumber of Complete genomesSource of biological material
HumansAnimalsEnvironment
GISAID initiative Europe159199515476481590080921994
North America837020832669836474375171
Asia1851551793991850365564
South America5031446983502801915
Africa33071325633306173
Oceania2361924562361900
Total27211742662718271855013771247
NCBI Virus Europe6225771798486225174020
North America4178312158934176721563
Oceania13939103011393900
Asia645436276424300
Africa21611290215470
South America92754091278
Total1063889411499106361824031
number of sequences retrieved from GISAID and NCBI Virus platforms The sequencing ratio expressed as the number of sequencing for a thousand of confirmed cases was used to evaluate the implication of genomic data generation in public health response strategies. The ratios by geographic region are presented in Table 2. Oceania scores the highest sequencing ratio; when considering the GISAID, for 1000 confirmed cases in Oceania, around 185 patients are subjected to virus isolation, virus genome sequencing and sequence submission to GISAID. In the decreasing order of sequencing ratios, Oceania is followed by Europe, North America, Africa, Asia and South America. With data from the NCBI Virus database, the order of continent is the same except that North America comes before Europe.
Table 2

sequencing ratios for every continent considering data from the GISAID and the NCBI Virus databases

Data sourceContinentConfirmed casesNumber of sequences from human sourceSequencing ratio (‰)
GISAID Oceania12773123,619184.91
Europe61564901159008025.83
North America3788285183647422.08
Africa7114221330614.65
Asia555625561850363.33
South America41042146502801.23
NCBI Oceania12773113939109.13
North America3788285141767211.03
Europe6156490162251710.11
Africa711422121540.3
Asia5556255664240.12
South America410421469120.02
sequencing ratios for every continent considering data from the GISAID and the NCBI Virus databases In order to evaluate the homogeneity in the contribution of different African countries in data generation, we analyzed the number of sequences and calculated sequencing ratios for every country and results are presented in Table 3, Table 3 (suite) and Table 4. The top three countries with highest sequencing ratios are Gambia, Reunion and Mauritius, considering the GISAID database. Several African countries did not publish their sequences in the NCBI database. From those present in the NCBI Virus database, the highest ratios were obtained for Djibouti, Sierra Leone and Egypt. Egypt was the only African country with sequences from animal hosts. Environment-isolated virus sequences were reported for Malawi and Morocco.
Table 3

sequencing ratios of African countries considering data from the GISAID database

CountryConfirmed casesNumber of submissions on GISAIDComplete genome coverageSource of biological materialsequencing ratio (‰)
HumanAnimalEnvironment
Gambia876368768268778.40
Reunion4024524692469246961.35
Mauritius521924924924947.71
Equatorial Guinea895119119019121.34
Burkina Faso1362626426426419.37
Angola4374778478378417.92
Congo1329322722722717.08
DR Congo5198587086987016.74
Djibouti1166313913913911.92
Kenya21257323922392239211.25
Ghana10822611131105111310.28
Guinea-Bissau478848484810.03
Guinea271122502502509.22
Central African Republic11174100991008.95
Benin90658080808.83
Gabon25487219992198.59
Senegal680125225165227.68
Togo169461291291297.61
Malawi5613536936936816.56
Uganda958756006006006.26
Rwanda766454744744746.18
Botswana1307717627627625.83
Table 3 (suite)

sequencing ratios of African countries considering data from the GISAID database

CountryConfirmed casesNumber of submissions on GISAIDComplete genome coverageSource of biological materialsequencing ratio (‰)
HumanAnimalEnvironment
Sierra Leone63153636365.70
Nigeria1785089949749945.57
South Africa25402221372113,584137215.40
South Sudan111396060605.39
Zimbabwe1168535615585614.80
Niger56822424244.22
Mozambique1331774894894893.67
Egypt2847891016994100973.54
Zambia2002016925566923.46
Madagascar427811221221222.85
Côte d’Ivoire513991451451452.82
Cameroon820642172122172.64
Mali146483636362.46
Somalia159393333332.07
Namibia1212032312302311.91
Chad49809991.81
Comoros40316661.49
Lesotho138431818181.30
Cabo Verde340784040401.17
Eswatini317383333331.04
Morocco70132533031932820.47
Tunisia6136281321251320.22
Algeria1823683524350.19
Ethiopia2845312525250.09
Libya2698472222220.08
sequencing ratios of African countries considering data from the GISAID database sequencing ratios of African countries considering data from the GISAID database sequencing ratios of African countries considering data from the NCBI Virus database

Discussion

The ongoing pandemic is generated by the spread of a previously unknown virus. The lack of information on this pathogen has lead to the multiplicity of treatment and preventive solutions that have been thus far proposed from several laboratories around de world [18]. Up to now there is no standard treatment and available vaccines still require to be well presented to populations to increase their acceptability in some regions [19]. In this context, it is important to gather maximum data about the virus in order to provide appropriate tools for the design of effective treatment and preventive approaches. One of the most useful data that should be obtained about the virus is its genome. Given the worldwide spread of the virus, it is important that every part of the world contributes to data generation. In the present study, we investigated metadata from the most popular genomic data platforms in order to determine the level of implication of the African continent in gathering these data. Viral genomic data search platforms are accessible worldwide. In the study, we focused on the two most popular platforms (GISAID and NCBI). GISAID is the most popular database for SARS-CoV-2 sequence submissions and provides a rapid data sharing system [20]. Thus, data from this platform are mainly used to evaluate the potential for data generation in the present study. On the other hand, NCBI Virus, the most used genomic database in Africa, was explored to extract data that could give an insight on the real capacity of African institutions to actually produce and work through the whole process of genomic data generation, annotation and publication. During analyses, metadata from both platforms were considered separately because genomic data can be submitted to several databases. As shown in Table 1, in the GISAID database, the continent that contributes with the highest number of sequences is Europe, followed by North America and Asia. When considering data from NCBI Virus, Europe and North American continents remain the major contributors. In all cases, the highest contributors are high-income countries, whereas the African continent occupies the fifth position. Hence, based on these raw data, the number of submissions seems to reflect the availability of sequencing equipments, financial resources, and qualified human resources. In order to realize a more equitable comparison, we introduced the sequencing ratio which can be considered as an index that links genomic data generation and sharing with disease burden which is represented by the number of confirmed cases (Table 2). This indicator of regional efforts to genomic data production shows that Oceania produced much more efforts with almost 185 virus isolate sequencing for every thousand confirmed cases. In Europe and North America, viral isolation and sequencing are performed 25 and 22 times for every thousand confirmed cases. In Africa, for every thousand confirmed cases, around 5 patients undergo virus isolation, sequencing and data submission to the GISAID platform. Several African countries did not submit any sequence to the NCBI Virus database. This shows that the GISAID database is their preferred platform for genomic data submission and perhaps for further genomic explorations as well. The sequencing ratio in Africa is five folds lower than the European one and four times lower than the ratio in North America. This may be explained by the cost of the analysis, since genomic data acquisition is still expensive even with several methods having been developed for direct sequencing from clinical samples [21]. Indeed, among all the constraints that could impede the development of genomic explorations in Africa, the reduced financial resources represents the most important one. It conditions the building of scientific facilities, equipment acquisition and capacity building. The gross domestic product per capita in Africa ranges from 1660 USD in the sub-Saharan region to 3640 USD in North Africa, whereas in Europe, it ranges from 12280 USD in Eastern Europe to 46280 USD in the Western Europe [22]. Therefore, with around a ten-fold low Gross domestic product per capita, the African continent managed to perform a sequencing ratio which is only 5 times lower than the one in Europe. This underlines the investment at countries level in responding to this public health threat. Moreover, there is solidarity in generating genomic data because several African countries don´t possess DNA analysers and therefore have to send their viral isolates to laboratories in other countries for sequencing. For an appropriate control of the pandemic, it is important to perform animal host surveillance and genomic data from viruses in animals should be produced [23]. From whole African continent, sequences of virus isolated from animals are reported only from Egypt. These sequences were obtained from Felis catus and Canin lupus familliaris. Environment-isolated virus sequences were only provided by two countries: Malawi and Morocco. It seems that the epidemiologic surveillance of the pandemic, at least at the genomic level, is centred in patients. This strategy could be improved by including surveillance of animal hosts for a better understanding of the virus genomic dynamics and the place of animals in the transmission and the rise of new variants. Overall analyses performed in the course of the present study show that the COVID-19 pandemic acted as a stimulator that accelerated the genomic revolution in Africa. The continent has faced several simultaneous infectious public health threats but the genomic investigations on these infectious agents did not reach the level at which SARS-CoV-2 genomic data were generated and published. As a comparison, in less than 2 years, there are 1290 SARS-CoV-2 complete genome sequences whereas the number of complete genome sequences is 1428 for HIV-1; 11 for HIV-2 and 584 for Ebolavirus in NCBI Virus database [17]. The rapid spread of the SARS-CoV-2 and the emergence of many variants have prompted the African continent to the genomic era.

Conclusion

The present study was mainly focused on the potential for genomic data generation. Studies on these data for the design of new diagnostic, treatment and/or preventive approaches in Africa are rare. There is a need for national, regional or even continental facilities for genomic surveillance of infectious agents and the capacity building for the development of a pool of experts that can be involved in genomic data generation as well as studying genomic data for evidence-based public health decision making.

What is known about this topic

Lack of information in the potential of African countries to produce genomic data; Lack of genomic data on infectious agents.

What this study adds

COVID-19 pandemic has accelerated investments and capacity building in viruses genomic data production in Africa; African countries invested a lot in SARS-CoV-2 genomic data generation; We found that animal surveillance is an aspect that should be reinforced.
Table 4

sequencing ratios of African countries considering data from the NCBI Virus database

CountryConfirmed casesNumber of sequences on NCBIComplete genome coverageHuman source sequencesSequencing ratio
Djibouti1166312312312310.55
Sierra Leone631566576610.45
Egypt2847898257638182.87
Ghana1082262971852972.74
Kenya212573503145032.37
Benin90651212121.32
Zimbabwe116853980980.84
Gambia87636660.68
Gabon2548717170.67
Togo16946115110.65
Guinea27112132130.48
Somalia159395550.31
Uganda958752814280.29
Libya2698474617460.17
Mali146482220.14
Tunisia6136287758770.13
Ethiopia2845317670.02
Nigeria1785084240.02
Morocco7013251514150.02
Cameroon82064110.01
Zambia2002011110.005
South Africa25402224440.002
  19 in total

Review 1.  Vaccinology in the genome era.

Authors:  C Daniela Rinaudo; John L Telford; Rino Rappuoli; Kate L Seib
Journal:  J Clin Invest       Date:  2009-09       Impact factor: 14.808

2.  Energy poverty in healthcare facilities: a "silent barrier" to improved healthcare in sub-Saharan Africa.

Authors:  Nadia S Ouedraogo; Caroline Schimanski
Journal:  J Public Health Policy       Date:  2018-08       Impact factor: 2.222

Review 3.  New approaches and omics tools for mining of vaccine candidates against vector-borne diseases.

Authors:  Josipa Kuleš; Anita Horvatić; Nicolas Guillemin; Asier Galan; Vladimir Mrljak; Mangesh Bhide
Journal:  Mol Biosyst       Date:  2016-08-16

4.  COVID-19 in Africa.

Authors:  Charlotte Payne
Journal:  Nat Hum Behav       Date:  2020-05

5.  Genomics and Infectious Diseases: Expert Perspectives on Public Health Considerations regarding Actionability and Privacy.

Authors:  Alexis Walker; Angie Boyce; Priya Duggal; Chloe L Thio; Gail Geller
Journal:  Ethics Hum Res       Date:  2020-05

6.  When silence goes viral, Africa sneezes! A perspective on Africa's subdued research response to COVID-19 and a call for local scientific evidence.

Authors:  Willis Gwenzi; Piotr Rzymski
Journal:  Environ Res       Date:  2020-12-26       Impact factor: 6.498

7.  Can an infectious disease genomics project predict and prevent the next pandemic?

Authors:  Rajesh Gupta; Mark H Michalski; Frank R Rijsberman
Journal:  PLoS Biol       Date:  2009-10-26       Impact factor: 8.029

8.  Lessons learned from Evidence-Informed Decision-Making in Nutrition & Health (EVIDENT) in Africa: a project evaluation.

Authors:  Pankti Motani; Anais Van de Walle; Richmond Aryeetey; Roosmarijn Verstraeten
Journal:  Health Res Policy Syst       Date:  2019-01-31

9.  A global survey of potential acceptance of a COVID-19 vaccine.

Authors:  Jeffrey V Lazarus; Scott C Ratzan; Adam Palayew; Lawrence O Gostin; Heidi J Larson; Kenneth Rabin; Spencer Kimball; Ayman El-Mohandes
Journal:  Nat Med       Date:  2020-10-20       Impact factor: 53.440

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.