| Literature DB >> 34462754 |
Anderson F Brito1,2, Elizaveta Semenova1,3, Gytis Dudas1,4, Gabriel W Hassler5, Chaney C Kalinich1,6, Moritz U G Kraemer7, Joses Ho8,9, Houriiyah Tegally10, George Githinji11,12, Charles N Agoti11, Lucy E Matkin7, Charles Whittaker13,14, Benjamin P Howden15, Vitali Sintchenko16,17, Neta S Zuckerman18, Orna Mor18, Heather M Blankenship19, Tulio de Oliveira10,20,21,22, Raymond T P Lin23, Marilda Mendonça Siqueira24, Paola Cristina Resende24, Ana Tereza R Vasconcelos25, Fernando R Spilki26, Renato Santana Aguiar27,28, Ivailo Alexiev29, Ivan N Ivanov29, Ivva Philipova29, Christine V F Carrington30, Nikita S D Sahadeo30, Céline Gurry8, Sebastian Maurer-Stroh8,9,23, Dhamari Naidoo31, Karin J von Eije32,33, Mark D Perkins33, Maria van Kerkhove33, Sarah C Hill34, Ester C Sabino35, Oliver G Pybus7,34, Christopher Dye7, Samir Bhatt13,14,36, Seth Flaxman37, Marc A Suchard5,38,39, Nathan D Grubaugh1,40, Guy Baele41, Nuno R Faria7,13,16,35.
Abstract
Genomic sequencing provides critical information to track the evolution and spread of SARS-CoV-2, optimize molecular tests, treatments and vaccines, and guide public health responses. To investigate the spatiotemporal heterogeneity in the global SARS-CoV-2 genomic surveillance, we estimated the impact of sequencing intensity and turnaround times (TAT) on variant detection in 167 countries. Most countries submit genomes >21 days after sample collection, and 77% of low and middle income countries sequenced <0.5% of their cases. We found that sequencing at least 0.5% of the cases, with a TAT <21 days, could be a benchmark for SARS-CoV-2 genomic surveillance efforts. Socioeconomic inequalities substantially impact our ability to quickly detect SARS-CoV-2 variants, and undermine the global pandemic preparedness.Entities:
Year: 2021 PMID: 34462754 PMCID: PMC8404891 DOI: 10.1101/2021.08.21.21262393
Source DB: PubMed Journal: medRxiv
Fig. 1.Disparities in SARS-CoV-2 global genomic surveillance.
(A) Percentage of reported cases that were sequenced per country, per epidemiological week (EW), between February 23rd, 2020 and March 27th, 2021 (based on metadata submitted to GISAID up to May 30th, 2021). Updated numbers on sequence submissions and proportion of sequenced cases are available on the GISAID Submissions Dashboard at gisaid.org. (B) Frequency and overall percentage of sequenced cases per country. This plot summarizes the data shown in (A), where the x-axis shows the percentage of EWs with sequenced cases, and the y-axis displays the overall percentage of cases shown in the rightmost column of panel (A). (C) Percentage of cases sequenced per EW per country, per geographic region (classified according to the UNSD geoscheme). Each circle represents an EW with at least one sequenced case, and their diameters highlight the incidence (cases per 100,000 habitants), e.g. “ISL-EW38–2020” shows data from week 38 in 2020, in Iceland. (D) Distribution of turnaround times of genomes collected in different geographic regions, in 2020 and 2021. Countries are highlighted in panels of this figure using the ISO 3166–1 nomenclature.
Figure 2.Detection of SARS-CoV-2 lineages under different genomic surveillance scenarios.
(A) The probability of detecting at least one genome of a rare lineage under different sequencing regimes. (B) Relative importance of decreasing genome sequencing turnaround time (TAT) versus increasing sequencing percentage, measured as probability that a lineage found in simulated datasets was detected before it had reached 100 cases (described in Fig. S6). (C-G) Probability of lineage detection considering TATs of 7, 14, 21, 28 and 35 days.
Empirical country sequencing capacities at different income levels and lines of inquiry enabled at each level.
Countries at each income level have markedly different sequencing capacities, allowing for different degrees of epidemic resolution and lines of inquiry. Characteristics of each income class are shown in Table S4.
| Income class | Median weekly genomes (when sequencing at all) | Mean weekly genomes (when sequencing at all) | Probability of detecting a lineage at 5% prevalence under mean weekly sequencing regime | Maximum probable prevalence of an undetected lineage under mean weekly sequencing regime | Lines of inquiry available |
|---|---|---|---|---|---|
| Low income countries (LICs) | 4 | 8.64 | 0.351 | 0.262 | Presence/absence of prevalent lineages |
| Lower middle income countries (LMCs) | 5 | 25.97 | 0.727 | 0.095 | + Quantification of lineage prevalence with some error; identification of preliminary patterns of geographic spread |
| Upper middle income countries (UMCs) | 7 | 33.16 | 0.810 | 0.073 | |
| High income countries (HICs) | 38 | 524.80 | 1.000 | 0.005 | + Investigations of lineage dynamics, and transmissibility; high precision lineage tracking (molecular evolution and geographic spread) |
Figure 3.Case sequencing percentages and socioeconomic covariates.
Covariates that show the highest correlation with the overall percentage of COVID-19 sequenced cases (during the period shown in Fig. 1A). (A) Expenditure on R&D per capita; (B) GDP per capita; (C) Socio-demographic index; (D) Overall percentage of influenza virus sequenced cases in 2019 (HA segment). For correlations between covariates and turnaround time, see Fig. S7. The colour scheme is the same as in Figure 1 and 2. Solid line shows the linear fit. *PPP = purchasing power parity, USD = US dollar 2005.