Juliet R C Pulliam1, Cari van Schalkwyk1, Nevashan Govender2, Anne von Gottberg2,3, Cheryl Cohen2,4, Michelle J Groome2,3, Jonathan Dushoff1,5, Koleka Mlisana6,7,8, Harry Moultrie2,3. 1. South African DSI-NRF Centre of Excellence in Epidemiological Modelling and Analysis (SACEMA), Stellenbosch University, Stellenbosch, South Africa. 2. National Institute for Communicable Diseases, Division of the National Health Laboratory Service, Johannesburg, South Africa. 3. School of Pathology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa. 4. School of Public Health, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa. 5. McMaster University, Hamilton, Ontario, Canada. 6. National Health Laboratory Service, Johannesburg, South Africa. 7. School of Laboratory Medicine and Medical Sciences, University of KwaZulu-Natal, Durban, South Africa. 8. Centre for the AIDS Programme of Research in South Africa (CAPRISA), Durban, South Africa.
Abstract
We provide two methods for monitoring reinfection trends in routine surveillance data to identify signatures of changes in reinfection risk and apply these approaches to data from South Africa's severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) epidemic to date. Although we found no evidence of increased reinfection risk associated with circulation of the Beta (B.1.351) or Delta (B.1.617.2) variants, we did find clear, population-level evidence to suggest immune evasion by the Omicron (B.1.1.529) variant in previously infected individuals in South Africa. Reinfections occurring between 1 November 2021 and 31 January 2022 were detected in individuals infected in all three previous waves, and there has been an increase in the risk of having a third infection since mid-November 2021.
We provide two methods for monitoring reinfection trends in routine surveillance data to identify signatures of changes in reinfection risk and apply these approaches to data from South Africa's severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) epidemic to date. Although we found no evidence of increased reinfection risk associated with circulation of the Beta (B.1.351) or Delta (B.1.617.2) variants, we did find clear, population-level evidence to suggest immune evasion by the Omicron (B.1.1.529) variant in previously infected individuals in South Africa. Reinfections occurring between 1 November 2021 and 31 January 2022 were detected in individuals infected in all three previous waves, and there has been an increase in the risk of having a third infection since mid-November 2021.
As of 31 January 2022, South Africa had more than 3.6 million cumulative laboratory-confirmed cases of SARS-CoV-2, concentrated in four waves of infection (Fig. 1). The first case was detected in early March 2020 and was followed by a wave that peaked in July 2020 and ended in September. The second wave, which peaked in January 2021 and ended in February, was driven by the Beta (B.1.351 / 501Y.V2 / 20H) variant, which was first detected in South Africa in October 2020 (
). The third wave, which peaked in July and ended in September 2021, was dominated by the Delta (B.1.617.2 / 478K.V1 / 21A) variant (
). In late November 2021, the Omicron (B.1.1.529 / 21K) variant was detected in Gauteng Province, the smallest yet most populous province, and associated with rapidly increasing case numbers (
). The estimated effective reproduction number in Gauteng based on PCR-confirmed cases was 2.3 as of 18 November 2021, which was as high as had been seen at any point during the prior three waves, and peaked above 3 in late November (
,
). The proportion of positive PCR tests with S-gene target failure (SGTF), a marker of the BA.1 sublineage of the Omicron variant, subsequently increased across all provinces (
).
Fig. 1.
Daily numbers of detected primary infections, individuals eligible to be considered for reinfection, and suspected reinfections in South Africa.
(A) Time series of detected primary infections. Black line indicates 7-day moving average; black points are daily values. Colored bands represent wave periods, defined as the period for which the 7-day moving average of cases (detected infections and reinfections) was at least 15% of the corresponding wave peak (purple = wave 1, pink = wave 2, orange = wave 3, turquoise = wave 4). (B) Population at risk for reinfection (individuals who tested positive at least 90 days ago and have not yet had a suspected reinfection). (C) Time series of suspected second infections. Blue line indicates 7-day moving average; blue points are daily values.
Daily numbers of detected primary infections, individuals eligible to be considered for reinfection, and suspected reinfections in South Africa.
(A) Time series of detected primary infections. Black line indicates 7-day moving average; black points are daily values. Colored bands represent wave periods, defined as the period for which the 7-day moving average of cases (detected infections and reinfections) was at least 15% of the corresponding wave peak (purple = wave 1, pink = wave 2, orange = wave 3, turquoise = wave 4). (B) Population at risk for reinfection (individuals who tested positive at least 90 days ago and have not yet had a suspected reinfection). (C) Time series of suspected second infections. Blue line indicates 7-day moving average; blue points are daily values.Following emergence of three variants of concern in South Africa, a key question remained in late 2021 of whether there was epidemiologic evidence of increased risk of SARS-CoV-2 reinfection with these variants (i.e., immune escape from natural infection). Laboratory-based studies suggest that convalescent serum has a reduced neutralizing effect on the Beta, Delta, and Omicron variants compared to wild type virus in vitro (
–
); however, this finding does not necessarily translate into immune evasion at the population level.To examine whether reinfection risk has changed through time, it is essential to account for potential confounding factors affecting the incidence of reinfection: namely, the changing force of infection experienced by all individuals in the population and the growing number of individuals eligible for reinfection through time. These factors are tightly linked to the timing of epidemic waves. We examine reinfection trends in South Africa using two approaches that account for these factors to address the question of whether circulation of variants of concern has been associated with increased reinfection risk, as would be expected if their emergence was driven or facilitated by immune evasion.
Identification of and characterization of reinfections
We define a suspected reinfection as a positive SARS-CoV-2 test in an individual with at least one previous positive test and whose most recent positive test occurred at least 90 days earlier. Based on routinely collected line list data maintained by the National Institute for Communicable Diseases (NICD) with specimen receipt dates between 04 March 2020 and 31 January 2022, we identified 105,323 individuals with at least two suspected infections, 1,778 individuals with at least three suspected infections, and 18 individuals with four suspected infections.
Time between successive positive tests
The distribution of times between successive positive tests for individuals’ first and second infections has peaks near 170, 350, and 520 days (Fig. 2A). The shape of the distribution is strongly influenced by the timing of South Africa’s epidemic waves, which have been spaced roughly six months apart. The first peak corresponds mainly to individuals whose primary infection and second infection occurred in consecutive waves (e.g., initially infected in wave 1 and reinfected in wave 2, initially infected in wave 2 and reinfected in wave 3, or initially infected in wave 3 and reinfected in wave 4), while the second peak corresponds mainly to individuals initially infected in wave 1 and reinfected in wave 3 or initially infected in wave 2 and reinfected in wave 4. The third peak corresponds to individuals initially infected in wave 1 and reinfected in wave 4.
Fig. 2.
Time between consecutive infections, based on the time between successive positive tests.
Note that the time since the previous positive test must be at least 90 days to be considered a reinfection. (A) Time in days between the last positive test of the putative first infection and the first positive test of the suspected second infection. (B) Time in days between the last positive test of the putative second infection and the first positive tests of the suspected third infection. Colors represent suspected reinfections diagnosed on or after 01 November 2021. In both panels, bars for these individuals are colored by the wave during which the previous infection occurred (purple = wave 1, pink = wave 2, orange = wave 3, light grey = inter-wave).
Time between consecutive infections, based on the time between successive positive tests.
Note that the time since the previous positive test must be at least 90 days to be considered a reinfection. (A) Time in days between the last positive test of the putative first infection and the first positive test of the suspected second infection. (B) Time in days between the last positive test of the putative second infection and the first positive tests of the suspected third infection. Colors represent suspected reinfections diagnosed on or after 01 November 2021. In both panels, bars for these individuals are colored by the wave during which the previous infection occurred (purple = wave 1, pink = wave 2, orange = wave 3, light grey = inter-wave).Almost all suspected third infections occurred after 31 October 2021, i.e., during the period of Omicron circulation. The distribution of times between successive positive tests for individuals’ second and third infections has peaks corresponding to those whose second infections occurred in the second and third waves.
Individuals with multiple suspected reinfections
1,778 individuals were identified who had three or more suspected infections. Prior to the emergence of Omicron, most of these individuals initially tested positive during the first wave, with suspected reinfections associated with waves two and three; however, 1,492 individuals with multiple reinfections (83.9%) experienced their third infection after 31 October 2021, which suggests that most third infections are associated with transmission of the Omicron variant (Fig. 3).
Fig. 3.
Timing of infections for individuals with multiple suspected reinfections.
Circles represent the first positive test of the first detected infection; triangles represent the first positive test of the suspected second infection; squares represent the first positive test of the suspected third infection; crosses represent the first positive test of the suspected fourth infection. Colored bands represent wave periods, defined as the period for which the 7-day moving average of cases was at least 15% of the corresponding wave peak (purple = wave 1, pink = wave 2, orange = wave 3, turquoise = wave 4).
Timing of infections for individuals with multiple suspected reinfections.
Circles represent the first positive test of the first detected infection; triangles represent the first positive test of the suspected second infection; squares represent the first positive test of the suspected third infection; crosses represent the first positive test of the suspected fourth infection. Colored bands represent wave periods, defined as the period for which the 7-day moving average of cases was at least 15% of the corresponding wave peak (purple = wave 1, pink = wave 2, orange = wave 3, turquoise = wave 4).
Population-level reinfection trends in South Africa
The population at risk of reinfection has risen monotonically since the beginning of the epidemic, with relatively rapid increases associated with each wave (delayed by 90 days because of our definition of reinfection; Fig. 1B). No suspected reinfections were detected until 23 June 2020, after which the number of suspected reinfections increased gradually. The 7-day moving average of suspected second infections reached a peak of approximately 160 during the second epidemic wave and 350 during the third wave (Fig. 1). Following the third wave, the number of reinfections began to increase dramatically in mid-November 2021. During the fourth wave, the 7-day moving average of suspected second infections reached nearly 2,700 and the 7-day moving average of all suspected reinfections (including second, third, and fourth infections) reached approximately 2,750.
Comparison of data to projections from a null model
We developed a catalytic model to project the expected number of reinfections through time under the assumption of a constant reinfection hazard coefficient (i.e., a null model of no change in reinfection risk). The model assumes the reinfection hazard is proportional to the 7-day moving average of the total number of diagnosed infections (primary infections and reinfections). During our early monitoring of reinfection risk, we fitted the reinfection hazard coefficient to data from 02 June 2020 to 30 September 2020 to parameterize the null model of no change in the reinfection hazard coefficient through time, and projected the number of reinfections through 30 June 2021. Based on this, we concluded there was no population-level evidence of immune escape and recommended on-going monitoring of reinfection trends (
).Given that there was no evidence of divergence from the null projection during the second wave, and to improve convergence of the MCMC fitting algorithm, for the present analysis, we repeated the fitting process using a window of 02 June 2020 to 28 February 2021 (representing the end of the month in which the second wave ended). This led to good convergence with regard to estimation of both the negative binomial dispersion parameter and the reinfection hazard coefficient (fig. S4) and allowed us to fit the model to all nine provinces. The 7-day moving average of observed reinfections and most individual daily values fall within the projection interval from the beginning of the projection period though the end of the third wave (Fig. 4). From early November 2021, however, the 7-day moving average of observed reinfections reached the upper bound of the projection interval, with many individual daily numbers falling well above the projection interval, both nationally and in Gauteng (Fig. 4). This observed deviation from the projection under the null model is a signature of immune evasion, and the timing of this deviation suggests it is associated with the emergence of the Omicron variant. A similar pattern has now been seen across all provinces (figs. S5 to S7).
Fig. 4.
Observed and expected temporal trends in reinfection numbers.
Blue lines (points) represent the 7-day moving average (daily values) of suspected reinfections. Grey lines (bands) represent median predictions (95% projection intervals) from the null model. (A and C) The null model was fit to data on suspected reinfections through 28 February 2021. (B and D) Comparison of data to projections from the null model over the projection period. The divergence observed reinfections from the projection interval in November suggests immune escape. (A) and (B): National. (C) and (D) Gauteng.
Observed and expected temporal trends in reinfection numbers.
Blue lines (points) represent the 7-day moving average (daily values) of suspected reinfections. Grey lines (bands) represent median predictions (95% projection intervals) from the null model. (A and C) The null model was fit to data on suspected reinfections through 28 February 2021. (B and D) Comparison of data to projections from the null model over the projection period. The divergence observed reinfections from the projection interval in November suggests immune escape. (A) and (B): National. (C) and (D) Gauteng.
Estimation of time-varying infection and reinfection hazards
We also examined changes in the reinfection risk via a method that relies on reconstruction of the numbers of observed and unobserved first and second infections through time (see Materials and Methods for details). Based on this approach, the estimated hazard coefficient for primary infection increased steadily through the end of the third wave, as expected under a combination of relaxing of restrictions, behavioral fatigue, and introduction of variants with increased transmissibility (Beta and Delta). The estimated hazard coefficient for reinfection, in contrast, remained relatively constant throughout this period, with the exception of an initial spike in mid-2020 (Fig. 5). Because both reinfection numbers and the population eligible for reinfection were very low at the time, this increase may be an artifact of intense follow-up of the earliest cases or simply noise due to small numbers. The estimated mean ratio of reinfection hazard to primary infection hazard decreased slightly from 0.15 in wave 1 to 0.12 in wave 2 and 0.09 in wave 3. The absolute values of the hazard coefficients and hazard ratio are sensitive to assumed observation probabilities for primary infections and reinfections; however, the temporal trends are robust (fig. S8).
Fig. 5.
Estimates of infection and reinfection hazards.
(A) Estimated time-varying hazard coefficients for primary infection (black) and second infections (blue). Colored bands represent wave periods, defined as the period for which the 7-day moving average of cases was at least 15% of the corresponding wave peak (purple = wave 1, pink = wave 2, orange = wave 3, turquoise = wave 4). (B) Ratio of the estimated hazard for reinfections to the estimated hazard for primary infections.
Estimates of infection and reinfection hazards.
(A) Estimated time-varying hazard coefficients for primary infection (black) and second infections (blue). Colored bands represent wave periods, defined as the period for which the 7-day moving average of cases was at least 15% of the corresponding wave peak (purple = wave 1, pink = wave 2, orange = wave 3, turquoise = wave 4). (B) Ratio of the estimated hazard for reinfections to the estimated hazard for primary infections.The picture changed following the end of the third wave. Although there is substantial uncertainty in the estimated hazard coefficient for primary infection, it appeared to decrease from early October 2021, with a simultaneous increase in the estimated reinfection hazard coefficient (Fig. 5). This change became more marked from the beginning of November, with the mean ratio of reinfection hazard to primary infection hazard for the period from 01 November 2021 to the beginning of the fourth wave increasing to 0.25, and a mean ratio during the fourth wave of 0.27.These findings are consistent with the estimates from a generalized linear mixed model based on the reconstructed data set. In this analysis, the relative hazard ratio for wave 2 versus wave 1 was 0.71 (CI95: 0.60–0.85) and for wave 3 versus wave 1 was 0.54 (CI95: 0.45–0.64). The relative hazard ratio for the period of Omicron emergence (01 November 2021 to the start of the fourth wave) versus wave 1 was 1.75 (CI95: 1.48–2.10), and for wave 4 versus wave 1 was 1.70 (CI95: 1.44–2.04).
Discussion and limitations
Our analyses suggest that the cumulative number of reinfections observed through the end of wave 3 was consistent with the null model of no change in reinfection risk through time. Furthermore, our findings suggest that the relative hazard of reinfection versus primary infection decreased with each subsequent wave of infections through September 2021, as would be expected if the risk of primary infection increased without a corresponding increase in reinfection risk. Thus, our analyses show no population-level evidence of immune escape associated with emergence of the Beta or Delta variants. In contrast, in November 2021, the number of daily new reinfections spiked and exceeded the 95% projection interval from the null model, accompanied by a notable increase in the hazard ratio for reinfection versus primary infection. The timing of these changes strongly suggests that they were driven by the emergence of the Omicron variant. This finding has now been supported by analyses of reinfection with Omicron in the United Kingdom and Qatar (
–
).Differences in the time-varying force of infection, original and subsequent circulating lineages, testing strategies, and vaccine coverage limit the usefulness of direct comparisons of rates of reinfections across countries or studies. Pre-Omicron reinfection does however appear to be relatively uncommon. The PCR-confirmed reinfection rate ranged from 0% – 1.1% across eleven studies included in a systematic review (
). While none of the studies included in the systematic review reported increasing risk of reinfection over time, the duration of follow-up was less than a year and most studies were completed prior to the identification of variants of concern; furthermore, all studies predated the emergence of Omicron. Our findings for the period prior to the emergence of Omicron are consistent with results from the PHIRST-C community cohort study conducted in two locations in South Africa, which found that infection prior to the second wave provided 84% protection against reinfection during the second (Beta) wave (
), comparable to estimates of the level of protection against reinfection for wild type virus from the SIREN study in the UK (
).A preliminary analysis of reinfection trends in England suggested that the Delta variant may have a higher risk of reinfection compared to the Alpha variant (
); however, this analysis did not take into account the temporal trend in the population at risk for reinfection, which may have biased the findings. Because the Alpha variant never dominated transmission in South Africa, we are unable to analyze the relative risk of reinfection for the Alpha and Delta variants in this context; however, data from Qatar suggest that protection provided by prior infection is similar for Alpha and Delta (
).Our findings regarding the Beta and Delta variants are somewhat at odds with in vitro neutralization studies. Both the Beta and Delta variants are associated with decreased neutralization by some anti-receptor binding-domain (anti-RBD) and anti-N-terminal domain (anti-NTD) monoclonal antibodies though both Beta and Delta each remain responsive to at least one anti-RBD (
,
,
). In addition, Beta and Delta are relatively poorly neutralized by convalescent sera obtained from unvaccinated individuals infected with non-VOC virus (
–
,
). Lastly sera obtained from individuals after both one and two doses of the BNT162b2 (Pfizer) or ChAdOx1 (AstraZeneca) vaccines displayed lower neutralization of the Beta and Delta variants when compared to non-VOC and Alpha variant (
); although this does not have direct bearing on reinfection risk, it is an important consideration for evaluating immune escape more broadly. Non-neutralizing antibodies and T-cell responses could explain the apparent disjuncture between our findings and the in vitro immune evasion demonstrated by both Beta and Delta.
Strengths of this study
Our study has three major strengths. First, we analyzed a large routine national data set comprising all confirmed cases in the country, allowing a comprehensive analysis of suspected reinfections in the country. Second, we found consistent results using two different analytical methods, both of which accounted for the changing force of infection and increasing numbers of individuals at risk for reinfection. Third, our real-time routine monitoring was sufficient to detect a population-level signal of immune evasion during the initial period of emergence of the Omicron variant in South Africa, prior to results from laboratory-based neutralization tests, providing timely information of importance to global public health planning.
Limitations of this study
The primary limitation of this study is that changes in testing practices, health-seeking behavior, or access to care have not been accounted for in these analyses. Estimates based on serological data from blood donors suggest substantial geographic variability in detection rates (
), which may contribute to the observed differences in reinfection patterns by province (fig. S1). Detection rates likely also vary through time and by other factors affecting access to testing, which may include occupation, age, and socioeconomic status. In particular, rapid antigen tests, which were introduced in South Africa in late 2020, may be under-reported despite mandatory reporting requirements. Although we have incorporated adjustments that account for late reporting of antigen tests, if under-reporting of antigen tests was substantial and time-varying it could still influence our findings. However, comparing temporal trends in infection risk among those eligible for reinfection with the rest of the population, as in approach 2, mitigates against potential failure to detect a substantial increase in risk.Civil unrest during July 2021 severely disrupted testing in Gauteng and KwaZulu-Natal, the two most populous provinces in the country. Case data are unreliable during the period of unrest and a key assumption of our models - that the force of infection is proportional to the number of positive tests - was violated during this period, resulting in increased misclassification of individuals regarding their status as to whether they are at risk of primary or re-infection. The effect of this misclassification on the signal of immune escape during the period of Omicron’s emergence would likely be small and would be expected to bias subsequent reinfection hazard estimates downwards.The purpose of our analysis is to detect changes in the relative reinfection risk through time, rather than to precisely estimate what the reinfection risk is at any particular point in time. While issues related to underdetection of both primary infections and reinfections are likely to affect the projection intervals against which we compare observed reinfections, we believe that our assessment of changes in the reinfection hazard are fairly robust to these detection issues. In effect, Approach 1 follows an open cohort of individuals who have had a first detected infection. Through time, this may include an increasing number of individuals whose first true infection was missed and whose first diagnosed infection is in fact a reinfection. These individuals would presumably be at a reduced risk of acquiring a new infection relative to those whose first detected infection was their first true infection. Two other factors would bias the results in the same direction: undetected reinfections in the cohort of individuals having had a first detected infection and deaths within this cohort, which are not accounted for due to not having a mortality line list that can be linked to the positive test data. All three factors artificially inflate the estimated denominator of individuals at risk for a second detected infection, thereby reducing the apparent reinfection risk. These factors may explain the slightly lower observed than projected number of reinfections throughout the Delta wave but did not have a substantial enough effect to prevent detection of the increased reinfection risk associated with the Omicron variant.The other main limitation of this study is that reinfections were not confirmed by sequencing or by requiring a negative test between putative infections. Nevertheless, the 90-day window period between consecutive positive tests reduces the possibility that suspected reinfections were predominantly the result of prolonged viral shedding. Furthermore, due to data limitations, we were unable to examine whether symptoms and severity in primary episodes correlate with protection against subsequent reinfection.Lastly, while vaccination may increase protection in previously infected individuals (
–
), vaccination coverage in South Africa was very low during much of the study period, with 22.5% of the population fully vaccinated by 30 November 2021 (
). Nevertheless, increasing vaccination uptake may reduce the risks of both primary infection and reinfection. The vaccination status of individuals with suspected reinfections identified in this study was unknown. Application of our approach to other locations with higher vaccine coverage would require a more nuanced consideration of the potential effect of vaccination. Further areas for future methodological development include accounting for potential waning of natural and vaccine-derived immunity, as well as methods to track changes in the risk of multiple (three or more) infections.Given the limitations outlined above, estimates of the extent of immune evasion based on our approach, which aims to detect changing trends rather than make precise estimates, should be treated with caution.
Conclusion
We find evidence of a substantial increase in the risk of reinfection that is temporally consistent with the timing of the emergence of the Omicron variant in South Africa, suggesting that Omicron’s selection advantage is at least partially driven by an increased ability to infect previously infected individuals.In contrast, we find no evidence that reinfection risk increased as a result of the emergence of Beta or Delta variants, suggesting that the selective advantage that allowed these variants to spread derived primarily from increased transmissibility, rather than immune evasion. The discrepancy between the population-level evidence presented here and expectations based on laboratory-based neutralization assays for Beta and Delta highlights the need to identify better correlates of immunity for assessing immune escape in vitro.Immune evasion from prior infection has important implications for public health globally. As new variants emerge, methods to quantify the extent of immune evasion for both natural and vaccine-derived immunity, as well as changes in transmissibility and disease severity, will be urgent priorities to inform facility readiness planning and other public health operations.
Methods
Data sources
Data analyzed in this study came from two sources maintained by the National Institute for Communicable Diseases (NICD): the outbreak response component of the Notifiable Medical Conditions Surveillance System (NMC-SS) deduplicated case list and the line list of repeated SARS-CoV-2 tests. All positive tests conducted in South Africa appear in the combined data set, regardless of the reason for testing or type of test (PCR or antigen detection), and include the large number of positive tests that were retrospectively added to the data set on 23 November 2021 (
). We note that, of the 18,585 cases reported on 23 November, 93% had a specimen receipt date before 01 November 2021, and 6% had specimen receipt dates on or after 21 November 2021.A combination of deterministic (national identity number, names, dates of birth) and probabilistic linkage methods were utilized to identify repeated tests conducted on the same person. In addition, provincial COVID-19 contact tracing teams identify and report repeated SARS-CoV-2 positive tests to the NICD, whether detected via PCR or antigen tests. The unique COVID-19 case identifier which links all tests from the same person was used to merge the two datasets. Irreversibly hashed case IDs were generated for each individual in the merged data set.Primary infections and suspected repeat infections were identified using the merged data set. Repeated case IDs in the line list were identified and used to calculate the time between consecutive positive tests for each individual, using specimen receipt dates. If the time between sequential positive tests was at least 90 days, the more recent positive test was considered to indicate a suspected new infection. We present a descriptive analysis of suspected third and fourth infections, although only suspected second infections were considered in the analyses of temporal trends. Incidence time series for primary infections and reinfections are calculated by specimen receipt date of the first positive test associated with the infection, and total observed incidence is calculated as the sum of first infections and reinfections. The specimen receipt date was chosen as the reference point for analysis because it is complete within the data set; however, problems have been identified with accuracy of specimen receipt dates for tests associated with substantially delayed reporting from some laboratories. For these tests, which had equivalent entries for specimen receipt date and specimen report date that were more than 7 days after the sample collection date, the specimen receipt date was adjusted to be 1 day after the sample collection date, reflecting the median delay across all tests.All analyses were conducted in the R statistical programming language (R version 4.0.5 (2021-03-31)).
Timing of reinfections
We calculated the time between successive infections as the number of days between the last positive test associated with an individual’s first identified infection (i.e., within 89 days of a previous positive test, if any) and the first positive test associated with their suspected subsequent infection (i.e., at least 90 days after the most recent positive test). We analyzed the distribution of these times for all second and third infections, and for the subset of second and third infections occurring since 01 November 2021.
Statistical analysis of reinfection trends
We analyzed the NICD national SARS-CoV-2 routine surveillance data to evaluate whether reinfection risk has changed since emergence of variants of concern in South Africa. We evaluated the daily numbers of suspected reinfections using two approaches. First, we constructed a simple null model based on the assumption that the reinfection hazard experienced by previously diagnosed individuals is proportional to the incidence of detected infections and fit this model to the pattern of suspected second infections observed through 28 February 2021. The null model assumes no change in the reinfection hazard coefficient through time. We then compared observed reinfections after the fitting period to expected reinfections under projections from the null model.Second, we evaluated whether there has been a change in the relative hazard of reinfection versus primary infection, to distinguish between increased overall transmissibility of the variants and any additional risk of reinfection due to potential immune escape. To do this, we calculated a hazard coefficient at each time point for primary and second infections and compared their relative values through time.
Approach 1: Catalytic model assuming a constant reinfection hazard coefficient
Model description
For a case testing positive on day
(by specimen receipt date), we assumed the reinfection hazard is
for each day from
to
and
for each day
, where
is the 7-day moving average of the detected case incidence (first infections and reinfections) for day
. The probability of a case testing positive on day
having a diagnosed reinfection by day
is thus
, and the expected number of cases testing positive on day
that have had a diagnosed reinfection by day
is
, where
is the detected case incidence (putative first infections only) for day
. Thus, the expected cumulative number of reinfections by day
is
, and the expected daily incidence of reinfections on day
is
.
Model fitting
The model was fitted to observed reinfection incidence through 28 February 2021 assuming data are negative binomially distributed with mean
. The reinfection hazard coefficient (
) and the inverse of the negative binomial dispersion parameter (
) were fitted to the data using a Metropolis-Hastings Monte Carlo Markov Chain (MCMC) estimation procedure implemented in the R Statistical Programming Language. We ran 4 MCMC chains with random starting values for a total of 10,000 iterations per chain, discarding the first 1,000 iterations (burn-in). Convergence was assessed using the Gelman-Rubin diagnostic (
).
Model-based projection
We used 1,500 samples from the joint posterior distribution of fitted model parameters to simulate possible reinfection time series under the null model, generating 100 stochastic realizations per parameter set. We then calculated projection intervals as the middle 95% of daily reinfection numbers across these simulations.We applied this approach at the national and provincial levels.
Approach 2: Estimation of time-varying infection and reinfection hazards
We estimated the time-varying empirical hazard of infection as the daily incidence per susceptible individual. This approach requires reconstruction of the number of susceptible individuals through time. We distinguish between three “susceptible” groups: naive individuals who have not yet been infected (
), previously infected individuals who had undetected infections at least 90 days ago and have not yet had a second infection (
), and previously infected individuals who had a prior positive test at least 90 days ago and have not yet had a second infection (
). We estimate the numbers of individuals in each of these categories on day
as follows:
where
is the total population size and
is the total number of primary infections on day
, of which
were observed and
were missed.
where
is the number of new infections among individuals whose first infection was missed. These individuals are assumed to experience the same infection hazard as individuals whose primary infection was diagnosed and who have not yet been reinfected, estimated as
. Because individuals are not eligible for reinfection until at least 90 days after their primary infection, we set
when
.
where
is the probability of detection for individuals who have had a previously identified infection, and
is the number of individuals with a second detected infection on day
. Only the possibility of second infections are accounted for in the model, which was developed to monitor reinfection risk against a background in which reinfections were rare.This setup allows recursive calculation of
and therefore
, where
is the probability of a second infection being observed in an individual whose first infection was missed, and
, where
is the number of individuals with their first positive test on day
(i.e., detected cases). The daily hazard of infection for previously uninfected individuals is then estimated as
.If we assume that the hazard of infection is proportional to the 7-day moving average of infection incidence (
), we can then examine the infectiousness of the virus through time as
and
. We constructed uncertainty intervals around
,
, and their ratio, taking into account both measurement noise and uncertainty in the observation parameters (see the supplementary materials for details).We also used this approach to construct a data set with the daily numbers of individuals eligible to have a primary infection (
) or suspected second infection (
) by wave. Wave periods were defined as the time surrounding the wave peak for which the 7-day moving average of case numbers was above 15% of the wave peak. We then analyzed these data using a generalized linear mixed model to estimate the relative hazard of infection in the population eligible for suspected second infection, compared to the hazard in the population not eligible for suspected second infection. For this analysis, we assume
and
, which falls within the plausible range of observation probabilities (fig. S8).Our primary regression model was a Poisson model with a log link function,
:
The outcome variable (
) was the reconstructed daily number of observed infections in the two groups (
and
). Our main interest for this analysis was in whether the relative hazard was higher in the second wave, third wave, pre-wave period in which Omicron emerged, and/or fourth wave, relative to during the first wave, thus potentially indicating immune evasion. This effect is measured by the interaction term between group and wave. The offset term is used to ensure that the estimated coefficients can be appropriately interpreted as per capita rates. We used day as a proxy for force of infection and reporting patterns and examined models where day was represented as a random effect (to reflect that observed days can be thought of as samples from a theoretical population) and as a fixed effect (to better match the Poisson assumptions). As focal estimates from the two models were indistinguishable, we present only the results based on the random effect assumption.
Authors: Pengfei Wang; Manoj S Nair; Lihong Liu; Sho Iketani; Yang Luo; Yicheng Guo; Maple Wang; Jian Yu; Baoshan Zhang; Peter D Kwong; Barney S Graham; John R Mascola; Jennifer Y Chang; Michael T Yin; Magdalena Sobieszczyk; Christos A Kyratsous; Lawrence Shapiro; Zizhang Sheng; Yaoxing Huang; David D Ho Journal: Nature Date: 2021-03-08 Impact factor: 69.504
Authors: Leonidas Stamatatos; Julie Czartoski; Yu-Hsin Wan; Leah J Homad; Vanessa Rubin; Hayley Glantz; Moni Neradilek; Emilie Seydoux; Madeleine F Jennewein; Anna J MacCamy; Junli Feng; Gregory Mize; Stephen C De Rosa; Andrés Finzi; Maria P Lemos; Kristen W Cohen; Zoe Moodie; M Juliana McElrath; Andrew T McGuire Journal: Science Date: 2021-03-25 Impact factor: 63.714
Authors: Heba N Altarawneh; Hiam Chemaitelly; Mohammad R Hasan; Houssein H Ayoub; Suelen Qassim; Sawsan AlMukdad; Peter Coyle; Hadi M Yassine; Hebah A Al-Khatib; Fatiha M Benslimane; Zaina Al-Kanaani; Einas Al-Kuwari; Andrew Jeremijenko; Anvar H Kaleeckal; Ali N Latif; Riyazuddin M Shaik; Hanan F Abdul-Rahim; Gheyath K Nasrallah; Mohamed G Al-Kuwari; Adeel A Butt; Hamad E Al-Romaihi; Mohamed H Al-Thani; Abdullatif Al-Khal; Roberto Bertollini; Patrick Tang; Laith J Abu-Raddad Journal: N Engl J Med Date: 2022-02-09 Impact factor: 91.245
Authors: Eamon O Murchu; Paula Byrne; Paul G Carty; Cillian De Gascun; Mary Keogan; Michelle O'Neill; Patricia Harrington; Máirín Ryan Journal: Rev Med Virol Date: 2021-05-27 Impact factor: 11.043
Authors: Florian Krammer; Komal Srivastava; Hala Alshammary; Angela A Amoako; Mahmoud H Awawda; Katherine F Beach; Maria C Bermúdez-González; Dominika A Bielak; Juan M Carreño; Rachel L Chernet; Lily Q Eaker; Emily D Ferreri; Daniel L Floda; Charles R Gleason; Joshua Z Hamburger; Kaijun Jiang; Giulio Kleiner; Denise Jurczyszak; Julia C Matthews; Wanni A Mendez; Ismail Nabeel; Lubbertus C F Mulder; Ariel J Raskin; Kayla T Russo; Ashley-Beathrese T Salimbangon; Miti Saksena; Amber S Shin; Gagandeep Singh; Levy A Sominsky; Daniel Stadlbauer; Ania Wajnberg; Viviana Simon Journal: N Engl J Med Date: 2021-03-10 Impact factor: 91.245
Authors: Raquel Viana; Sikhulile Moyo; Daniel G Amoako; Houriiyah Tegally; Cathrine Scheepers; Christian L Althaus; Ugochukwu J Anyaneji; Phillip A Bester; Maciej F Boni; Mohammed Chand; Wonderful T Choga; Rachel Colquhoun; Michaela Davids; Koen Deforche; Deelan Doolabh; Louis du Plessis; Susan Engelbrecht; Josie Everatt; Jennifer Giandhari; Marta Giovanetti; Diana Hardie; Verity Hill; Nei-Yuan Hsiao; Arash Iranzadeh; Arshad Ismail; Charity Joseph; Rageema Joseph; Legodile Koopile; Sergei L Kosakovsky Pond; Moritz U G Kraemer; Lesego Kuate-Lere; Oluwakemi Laguda-Akingba; Onalethatha Lesetedi-Mafoko; Richard J Lessells; Shahin Lockman; Alexander G Lucaci; Arisha Maharaj; Boitshoko Mahlangu; Tongai Maponga; Kamela Mahlakwane; Zinhle Makatini; Gert Marais; Dorcas Maruapula; Kereng Masupu; Mogomotsi Matshaba; Simnikiwe Mayaphi; Nokuzola Mbhele; Mpaphi B Mbulawa; Adriano Mendes; Koleka Mlisana; Anele Mnguni; Thabo Mohale; Monika Moir; Kgomotso Moruisi; Mosepele Mosepele; Gerald Motsatsi; Modisa S Motswaledi; Thongbotho Mphoyakgosi; Nokukhanya Msomi; Peter N Mwangi; Yeshnee Naidoo; Noxolo Ntuli; Martin Nyaga; Lucier Olubayo; Sureshnee Pillay; Botshelo Radibe; Yajna Ramphal; Upasana Ramphal; James E San; Lesley Scott; Roger Shapiro; Lavanya Singh; Pamela Smith-Lawrence; Wendy Stevens; Amy Strydom; Kathleen Subramoney; Naume Tebeila; Derek Tshiabuila; Joseph Tsui; Stephanie van Wyk; Steven Weaver; Constantinos K Wibmer; Eduan Wilkinson; Nicole Wolter; Alexander E Zarebski; Boitumelo Zuze; Dominique Goedhals; Wolfgang Preiser; Florette Treurnicht; Marietje Venter; Carolyn Williamson; Oliver G Pybus; Jinal Bhiman; Allison Glass; Darren P Martin; Andrew Rambaut; Simani Gaseitsiwe; Anne von Gottberg; Tulio de Oliveira Journal: Nature Date: 2022-01-07 Impact factor: 49.962
Authors: Chang Liu; Helen M Ginn; Wanwisa Dejnirattisai; Piyada Supasa; Beibei Wang; Aekkachai Tuekprakhon; Rungtiwa Nutalai; Daming Zhou; Alexander J Mentzer; Yuguang Zhao; Helen M E Duyvesteyn; César López-Camacho; Jose Slon-Campos; Thomas S Walter; Donal Skelly; Sile Ann Johnson; Thomas G Ritter; Chris Mason; Sue Ann Costa Clemens; Felipe Gomes Naveca; Valdinete Nascimento; Fernanda Nascimento; Cristiano Fernandes da Costa; Paola Cristina Resende; Alex Pauvolid-Correa; Marilda M Siqueira; Christina Dold; Nigel Temperton; Tao Dong; Andrew J Pollard; Julian C Knight; Derrick Crook; Teresa Lambe; Elizabeth Clutterbuck; Sagida Bibi; Amy Flaxman; Mustapha Bittaye; Sandra Belij-Rammerstorfer; Sarah C Gilbert; Tariq Malik; Miles W Carroll; Paul Klenerman; Eleanor Barnes; Susanna J Dunachie; Vicky Baillie; Natali Serafin; Zanele Ditse; Kelly Da Silva; Neil G Paterson; Mark A Williams; David R Hall; Shabir Madhi; Marta C Nunes; Philip Goulder; Elizabeth E Fry; Juthathip Mongkolsapaya; Jingshan Ren; David I Stuart; Gavin R Screaton Journal: Cell Date: 2021-06-17 Impact factor: 41.582
Authors: Elrazi A Ali; Mohammed A Alamin; Mohammad Abu-Tineh; Khalid Ahmed; Awni Alshurafa; Waail Rozi; Mohamed A Yassin Journal: Cureus Date: 2022-05-21
Authors: Sylvia Rothenberger; Daniel L Hurdiss; Marcel Walser; Francesca Malvezzi; Jennifer Mayor; Sarah Ryter; Hector Moreno; Nicole Liechti; Andreas Bosshart; Chloé Iss; Valérie Calabro; Andreas Cornelius; Tanja Hospodarsch; Alexandra Neculcea; Thamar Looser; Anja Schlegel; Simon Fontaine; Denis Villemagne; Maria Paladino; Dieter Schiegg; Susanne Mangold; Christian Reichen; Filip Radom; Yvonne Kaufmann; Doris Schaible; Iris Schlegel; Christof Zitt; Gabriel Sigrist; Marcel Straumann; Julia Wolter; Marco Comby; Feyza Sacarcelik; Ieva Drulyte; Heyrhyoung Lyoo; Chunyan Wang; Wentao Li; Wenjuan Du; H Kaspar Binz; Rachel Herrup; Sabrina Lusvarghi; Sabari Nath Neerukonda; Russell Vassell; Wei Wang; Julia M Adler; Kathrin Eschke; Mariana Nascimento; Azza Abdelgawad; Achim D Gruber; Judith Bushe; Olivia Kershaw; Charles G Knutson; Kamal K Balavenkatraman; Krishnan Ramanathan; Emanuel Wyler; Luiz Gustavo Teixeira Alves; Seth Lewis; Randall Watson; Micha A Haeuptle; Alexander Zürcher; Keith M Dawson; Daniel Steiner; Carol D Weiss; Patrick Amstutz; Frank J M van Kuppeveld; Michael T Stumpp; Berend-Jan Bosch; Olivier Engler; Jakob Trimpert Journal: Nat Biotechnol Date: 2022-07-21 Impact factor: 68.164
Authors: Joseph A Lewnard; Vennis X Hong; Manish M Patel; Rebecca Kahn; Marc Lipsitch; Sara Y Tartof Journal: Nat Med Date: 2022-06-08 Impact factor: 87.241
Authors: Diana Erazo; Maria F Vincenti-Gonzalez; Joris A F van Loenhout; Pierre Hubin; Mathil Vandromme; Piet Maes; Maxime Taquet; Johan Van Weyenbergh; Lucy Catteau; Simon Dellicour Journal: Viruses Date: 2022-06-16 Impact factor: 5.818