Literature DB >> 32556265

Reconstruction of Transmission Pairs for Novel Coronavirus Disease 2019 (COVID-19) in Mainland China: Estimation of Superspreading Events, Serial Interval, and Hazard of Infection.

Xiao-Ke Xu1, Xiao Fan Liu2, Ye Wu3,4, Sheikh Taslim Ali5, Zhanwei Du6, Paolo Bosetti7, Eric H Y Lau5, Benjamin J Cowling5, Lin Wang7,8.   

Abstract

BACKGROUND: Knowledge on the epidemiological features and transmission patterns of novel coronavirus disease (COVID-19) is accumulating. Detailed line-list data with household settings can advance the understanding of COVID-19 transmission dynamics.
METHODS: A unique database with detailed demographic characteristics, travel history, social relationships, and epidemiological timelines for 1407 transmission pairs that formed 643 transmission clusters in mainland China was reconstructed from 9120 COVID-19 confirmed cases reported during 15 January-29 February 2020. Statistical model fittings were used to identify the superspreading events and estimate serial interval distributions. Age- and sex-stratified hazards of infection were estimated for household vs nonhousehold transmissions.
RESULTS: There were 34 primary cases identified as superspreaders, with 5 superspreading events occurred within households. Mean and standard deviation of serial intervals were estimated as 5.0 (95% credible interval [CrI], 4.4-5.5) days and 5.2 (95% CrI, 4.9-5.7) days for household transmissions and 5.2 (95% CrI, 4.6-5.8) and 5.3 (95% CrI, 4.9-5.7) days for nonhousehold transmissions, respectively. The hazard of being infected outside of households is higher for people aged 18-64 years, whereas hazard of being infected within households is higher for young and old people.
CONCLUSIONS: Nonnegligible frequency of superspreading events, short serial intervals, and a higher risk of being infected outside of households for male people of working age indicate a significant barrier to the identification and management of COVID-19 cases, which requires enhanced nonpharmaceutical interventions to mitigate this pandemic.
© The Author(s) 2020. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  COVID-19; hazard of infection; serial interval; superspreading event; transmission

Mesh:

Year:  2020        PMID: 32556265      PMCID: PMC7337632          DOI: 10.1093/cid/ciaa790

Source DB:  PubMed          Journal:  Clin Infect Dis        ISSN: 1058-4838            Impact factor:   9.079


In December 2019, a novel coronavirus disease (COVID-19) emerged in Wuhan of Hubei Province, China. The World Health Organization announced a public health emergency of international significance on 30 January 2020 [1] and classified the threat as a global pandemic on 11 March 2020 [2]. More than 8 million confirmed cases and 440 290 deaths have been reported from >200 countries and territories as of 17 June 2020 [3]. On 23 January 2020, China raised the national emergency response to the highest level, which triggered an unprecedented travel ban starting from the lockdown of Wuhan on 23 January, 14 cities in Hubei province on 24 January, and >30 provinces thereafter. Despite this unprecedented intervention, we estimated that COVID-19 cases had been introduced into 130 (95% credible interval [CrI], 190–369) cities in mainland China prior to the lockdown of Wuhan on 23 January 2020 [4]. Similar findings on the rapid geographic expansion of COVID-19 have also been reported in several recent studies [5-8]. Starting from the last week of January 2020, >260 Chinese cities have implemented intensive social distancing and confinement policies, which brought the epidemic under control [7-10]. However, the epidemic has still caused >10 000 confirmed cases in China outside Hubei Province within a month. To enhance public health preparedness and awareness, Chinese health authorities have publicly reported detailed records of confirmed COVID-19 cases since mid-January. This provides a unique resource for studying the transmission patterns, routes, and risk factors of COVID-19.

METHODS

Data Collection

In mainland China, 27 provincial and 264 urban health commissions have publicly posted 9120 confirmed case reports online during 15 January–29 February 2020, which accounts for 72% of all cases confirmed in mainland China outside Hubei province. We compiled a unique line-list database using these reports, which contains detailed information about demographic features, social relationships, travel history, and key epidemiological timelines (eg, dates of symptom onset, confirmation, and hospitalization). In contrast to several published COVID-19 data repositories [11-16], which focus on describing information about individual cases, our database allows the reconstruction of transmission pairs and clusters by inferring potential causal associations among different cases. The entire dataset of transmission pairs is available at our GitHub (https://github.com/linwangidd/covid19_transmissionPairs_China). See the Supplementary Materials for more details.

Statistical Analysis

We reconstructed 1407 transmission pairs using the epidemiological evidence among reported cases. The section “Reconstruction of Transmission Pairs” in the Supplementary Materials specifies how we identified a pair or a group of confirmed cases using information about their close contacts, stratified transmission pairs into household and nonhousehold settings using information about familial relationships, and determined the direction of transmission between infector and infectee using information about travel histories. For each transmission pair, we term the infector the “primary case” and the infectee the “secondary case.” We also consider connected chains of confirmed cases, in which we term the original case the “index” and the entire chain of cases, including the index, the “transmission cluster” (Figure 1A).
Figure 1.

COVID-19 transmission clusters. A, Six hundred forty-three transmission clusters, stratified by the size of cluster n. Red, green, and blue nodes denote primary cases, household secondary cases, and nonhousehold secondary cases, respectively. B, Empirical offspring distribution for the number of secondary cases infected by each of the 809 primary cases. C, Empirical and estimated serial interval distributions for household or nonhousehold transmissions. Pink and blue bars indicate the empirical distributions of serial interval data for household and nonhousehold transmissions, respectively. Red curve indicates the estimated serial interval distributions for household transmissions, based on the posterior median estimates of the mean and standard deviation (SD) of normal distribution. Blue curve indicates the estimated serial interval distributions for nonhousehold transmissions, based on the posterior median estimates of the mean and SD of normal distribution. Fitting results with alternative distributions (eg, Gumbel or Logistic distribution) are available in Supplementary Tables 3–5.

COVID-19 transmission clusters. A, Six hundred forty-three transmission clusters, stratified by the size of cluster n. Red, green, and blue nodes denote primary cases, household secondary cases, and nonhousehold secondary cases, respectively. B, Empirical offspring distribution for the number of secondary cases infected by each of the 809 primary cases. C, Empirical and estimated serial interval distributions for household or nonhousehold transmissions. Pink and blue bars indicate the empirical distributions of serial interval data for household and nonhousehold transmissions, respectively. Red curve indicates the estimated serial interval distributions for household transmissions, based on the posterior median estimates of the mean and standard deviation (SD) of normal distribution. Blue curve indicates the estimated serial interval distributions for nonhousehold transmissions, based on the posterior median estimates of the mean and SD of normal distribution. Fitting results with alternative distributions (eg, Gumbel or Logistic distribution) are available in Supplementary Tables 3–5. We categorized each transmission pair by the social relationship between primary and secondary cases (eg, familial members of the same household, nonhousehold relatives, colleagues, classmates, friends, and other face-to-face contacts). Considering that during the Spring Festival travel season (10 January–18 February 2020), several billion human movements can occur because of the tradition of Chinese New Year (to visit and live with their original families), we considered any transmission pair with immediate familial relationships (eg, a person’s spouse, parents, and children) as a household transmission pair, and with other familial relationships (eg, a person’s siblings with age >17 years) or close contacts with no familial information (eg, classmates, colleagues) as a nonhousehold transmission pair. The numbers of household (662) and nonhousehold (745) transmission pairs are almost even. Following Lloyd-Smith et al [17], we defined the threshold of observing superspreading events (SSEs) as the 99th percentile of the offspring distribution for the number of secondary cases caused by a primary case. Household and nonhousehold transmissions were combined together for computing the offspring distribution. To estimate the threshold of observing SSE, we used a Poisson, exponential, and power-law distribution to fit the empirical offspring distribution via the Distribution Fitter App in Matlab R2020a [18]. Since the power-law distribution gives the smallest Akaike information criterion compared to the Poisson and exponential distributions (Supplementary Table 2), the threshold of observing SSEs is set as 3.78, which indicates the occurrence of an SSE if 4 or more secondary cases were infected by a single primary case. For each transmission pair with known symptom onset times for both primary and secondary cases, we computed the empirical serial interval as the number of days between the symptom onset dates of the primary case and of the secondary case [19]. Due to the presence of negative-valued serial intervals and the skewness of the empirical distribution (Figure 1B), we estimated the serial interval distributions by fitting a normal distribution via the Markov chain Monte Carlo (MCMC) method with Gibbs sampling and noninformative flat prior. We confirmed the convergence of MCMC chains via trace plot and diagnosis, and obtained the posterior estimates of parameters by running 100 000 iterations with a burn-in of 40 000 iterations and a thinning interval of 10. Fitting serial interval data with a Gumbel or logistic distribution gives similar estimates (Supplementary Tables 3–5). We estimated the age-stratified hazard of infection for household vs nonhousehold transmissions by the ratio between the probability that a secondary case of age group b was infected by a primary case of age group a within the same household and the probability that a secondary case of age group b was infected by a primary case of age group a outside of households, that is, . If , then the infection within households has a higher risk than the infection outside of households for secondary cases of age group b being infected by primary cases of age group a. We estimated the sex-specific hazard of infection for household vs nonhousehold transmissions by the ratio between the probability that a secondary case of gender b was infected by a primary case of gender a within the same household and the probability that a secondary case of gender b was infected by a primary case of gender a via nonhousehold transmission.

RESULTS

We in total reconstructed 643 transmission clusters formed by 1407 transmission pairs (Figure 1A). The size is <5 for 587 transmission clusters, whereas the size exceeds 20 for the largest cluster. We identified 34 primary cases as the superspreaders, with 5 SSEs occurring within households. Stratification by household setting demonstrates that 356, 92, and 34 primary cases infected 1, 2, and ≥3 secondary cases within households, respectively, and 276, 78, and 75 primary cases infected 1, 2, and ≥3 secondary cases outside of households, respectively (Figure 1B). Fitting a normal distribution to serial interval data for household transmissions estimates the mean and standard deviation (SD) of serial intervals as 5.0 (95% CrI, 4.4–5.5) days and 5.2 (95% CrI, 4.9–5. 7) days, respectively. Given the posterior median estimates of the mean and SD, the median serial interval distribution is estimated to be 5.0 (interquartile range [IQR], 1.5–8.5) days for household transmissions. Fitting a normal distribution to serial interval data for nonhousehold transmissions estimates the mean and SD of serial intervals as 5.2 (95% CrI, 4.6–5.8) days and 5.3 (95% CrI, 4.9–5.7) days, respectively. Given the posterior median estimates of the mean and SD, the median serial interval distribution is estimated to be 5.2 (IQR, 1.6–8.8) days for nonhousehold transmissions. See Supplementary Table 3 for more results. Notably, 25 of 339 household and 25 of 340 nonhousehold transmission pairs reported negative-valued serial intervals, implying presymptomatic transmission. We performed several sensitivity analyses on estimating serial interval distributions (Supplementary Tables 4 and 5), such as the stratification of transmission pairs by the location of primary cases (imported vs local), and estimating transmission pairs with more clear epidemiological evidence (eg, primary case linked only to a single secondary case). Results of these sensitivity analyses are consistent with those estimated with all transmission pairs. Hazard of being infected within households was higher for young (<18 years) and elderly (>65 years) people, whereas the hazard of being infected outside of households was higher for the age group 18–64 years (Table 1). Primary cases of elderly (>65 years) people were more prone to cause household infections. Hazard of infection between different sexes was higher for household than for nonhousehold transmission (Table 2).
Table 1

Age-stratified Hazard of Infection for Household Versus Nonhousehold Transmissions

Secondary Cases, Age, y
0–1718–4950–64≥65Total
Primary Cases, Age, y0–170.00.80.81.10.7
18–496.30.70.92.01.1
50–641.70.90.70.60.8
≥652.31.40.62.11.3
Total3.50.80.81.41.0
Table 2.

Sex-specific Hazard of Infection for Household Versus Nonhousehold Transmissions

Secondary Cases
MaleFemaleTotal
Primary CasesMale0.61.61.0
Female1.20.70.9
Total0.81.21.0
Age-stratified Hazard of Infection for Household Versus Nonhousehold Transmissions Sex-specific Hazard of Infection for Household Versus Nonhousehold Transmissions

DISCUSSION

We have built a line-list database with detailed demographic information, travel history, epidemiological timelines, and social relationships for 1407 transmission pairs that formed 643 transmission clusters in mainland China outside Hubei province. We identified 34 primary cases as superspreaders. The majority of SSEs were observed for nonhousehold transmissions, which is consistent with a recent study [21] on transmission settings of COVID-19 (eg, hospitals, residential care, prisons, boarding schools, cruise ships). This indicates the importance of nonpharmaceutical interventions (eg, isolation, quarantine, social distancing, and confinement [7, 22–24]) in mitigating the COVID-19 epidemic. Household studies are helpful to identify risk factors for certain demographic groups [25, 26]. The analysis of the age-stratified and sex-specific hazard of infection suggests a higher risk of infection within households for young (<18 years of age), elderly (>65 years of age), and female people. The higher risk of being infected outside of households for male people aged between 18 and 64 years may indicate their role in driving household secondary infections, perhaps because these were travelers of working age from Wuhan. We identified 50 transmission pairs (~3.5%) with a secondary case reporting symptom onset earlier than the primary case (ie, negative-valued serial intervals), which is consistent with recent clinical reports [27, 28] and epidemiological studies [29, 30]. We estimated that the mean serial interval is around 5 days for both household and nonhousehold infections, which is considerably shorter than the mean serial interval estimated for severe acute respiratory syndrome (eg, 8.4 days [31]) and Middle East respiratory syndrome (eg, 7.6 days [32]). Our findings have several limitations. First, the household sizes and primary cases with no secondary infections were not provided from the original public case reports. This may give rise to biased estimates if we estimate the household reproduction number and secondary attack rate from the raw data. Field surveys will be helpful to adjust such biases. Second, the information on nosocomial infections and public gathering settings was not available from original case reports, so the observation of SSEs may be less common from our dataset. Third, caution is needed when attempting to generalize the age-stratified hazard of infection to other demographic settings. For example, in our study (Table 1), the fact that children (<18 years of age) never acquired COVID-19 from other children at home may be more a reflection of the usual household composition in Chinese cities (single child living with parents) than the transmission characteristics of the virus. China had a lower proportion of households with multiple children [33], which may reduce the risk of transmission between children, especially during lockdown and school closure. In sum, the notable threat of SSEs, short serial intervals, and a higher risk of being infected outside of households for adult men of working age (18–64 years) indicate a significant barrier to the identification and management of COVID-19 cases; enhanced nonpharmaceutical interventions will be required to mitigate this pandemic.

Supplementary Data

Supplementary materials are available at Clinical Infectious Diseases online. Consisting of data provided by the authors to benefit the reader, the posted materials are not copyedited and are the sole responsibility of the authors, so questions or comments should be addressed to the corresponding author. Click here for additional data file.
  26 in total

Review 1.  Serial intervals of respiratory infectious diseases: a systematic review and analysis.

Authors:  Margaretha Annelie Vink; Martinus Christoffel Jozef Bootsma; Jacco Wallinga
Journal:  Am J Epidemiol       Date:  2014-10-07       Impact factor: 4.897

2.  Early epidemiological analysis of the coronavirus disease 2019 outbreak based on crowdsourced data: a population-level observational study.

Authors:  Kaiyuan Sun; Jenny Chen; Cécile Viboud
Journal:  Lancet Digit Health       Date:  2020-02-20

3.  Impact assessment of non-pharmaceutical interventions against coronavirus disease 2019 and influenza in Hong Kong: an observational study.

Authors:  Benjamin J Cowling; Sheikh Taslim Ali; Tiffany W Y Ng; Tim K Tsang; Julian C M Li; Min Whui Fong; Qiuyan Liao; Mike Yw Kwan; So Lun Lee; Susan S Chiu; Joseph T Wu; Peng Wu; Gabriel M Leung
Journal:  Lancet Public Health       Date:  2020-04-17

4.  Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study.

Authors:  Joseph T Wu; Kathy Leung; Gabriel M Leung
Journal:  Lancet       Date:  2020-01-31       Impact factor: 79.321

5.  Superspreading and the effect of individual variation on disease emergence.

Authors:  J O Lloyd-Smith; S J Schreiber; P E Kopp; W M Getz
Journal:  Nature       Date:  2005-11-17       Impact factor: 49.962

6.  What settings have been linked to SARS-CoV-2 transmission clusters?

Authors:  Quentin J Leclerc; Naomi M Fuller; Lisa E Knight; Sebastian Funk; Gwenan M Knight
Journal:  Wellcome Open Res       Date:  2020-06-05

7.  An investigation of transmission control measures during the first 50 days of the COVID-19 epidemic in China.

Authors:  Huaiyu Tian; Yonghong Liu; Yidan Li; Chieh-Hsi Wu; Bin Chen; Moritz U G Kraemer; Bingying Li; Jun Cai; Bo Xu; Qiqi Yang; Ben Wang; Peng Yang; Yujun Cui; Yimeng Song; Pai Zheng; Quanyi Wang; Ottar N Bjornstad; Ruifu Yang; Bryan T Grenfell; Oliver G Pybus; Christopher Dye
Journal:  Science       Date:  2020-03-31       Impact factor: 47.728

8.  Secondary attack rate and superspreading events for SARS-CoV-2.

Authors:  Yang Liu; Rosalind M Eggo; Adam J Kucharski
Journal:  Lancet       Date:  2020-02-27       Impact factor: 79.321

9.  An interactive web-based dashboard to track COVID-19 in real time.

Authors:  Ensheng Dong; Hongru Du; Lauren Gardner
Journal:  Lancet Infect Dis       Date:  2020-02-19       Impact factor: 25.071

10.  The effect of human mobility and control measures on the COVID-19 epidemic in China.

Authors:  Moritz U G Kraemer; Chia-Hung Yang; Bernardo Gutierrez; Chieh-Hsi Wu; Brennan Klein; David M Pigott; Louis du Plessis; Nuno R Faria; Ruoran Li; William P Hanage; John S Brownstein; Maylis Layan; Alessandro Vespignani; Huaiyu Tian; Christopher Dye; Oliver G Pybus; Samuel V Scarpino
Journal:  Science       Date:  2020-03-25       Impact factor: 47.728

View more
  34 in total

Review 1.  Transmission of SARS-CoV-2 by Children.

Authors:  Joanna Merckx; Jeremy A Labrecque; Jay S Kaufman
Journal:  Dtsch Arztebl Int       Date:  2020-08-17       Impact factor: 5.594

2.  Modelling COVID-19 outbreak on the Diamond Princess ship using the public surveillance data.

Authors:  Shi Zhao; Peihua Cao; Daozhou Gao; Zian Zhuang; Weiming Wang; Jinjun Ran; Kai Wang; Lin Yang; Mohammad R Einollahi; Yijun Lou; Daihai He; Maggie H Wang
Journal:  Infect Dis Model       Date:  2022-05-26

Review 3.  Rapid review of available evidence on the serial interval and generation time of COVID-19.

Authors:  John Griffin; Miriam Casey; Áine Collins; Kevin Hunt; David McEvoy; Andrew Byrne; Conor McAloon; Ann Barber; Elizabeth Ann Lane; SImon More
Journal:  BMJ Open       Date:  2020-11-23       Impact factor: 2.692

4.  Clinical Characteristics and Outcomes of COVID-19 in West Virginia.

Authors:  Sijin Wen; Apoorv Prasad; Kerri Freeland; Sanjiti Podury; Jenil Patel; Roshan Subedi; Erum Khan; Medha Tandon; Saurabh Kataria; Wesley Kimble; Shitiz Sriwastava
Journal:  Viruses       Date:  2021-05-05       Impact factor: 5.048

5.  A hospital cluster of COVID-19 associated with a SARS-CoV-2 superspreading event.

Authors:  Po-Yen Huang; Ting-Shu Wu; Chun-Wen Cheng; Chih-Jung Chen; Chung-Guei Huang; Kuo-Chien Tsao; Chun-Sui Lin; Ting-Ying Chung; Chi-Chun Lai; Cheng-Ta Yang; Yi-Ching Chen; Cheng-Hsun Chiu
Journal:  J Microbiol Immunol Infect       Date:  2021-07-21       Impact factor: 10.273

6.  Preliminary Assessment of Chinese Strategy in Controlling Reemergent Local Outbreak of COVID-19.

Authors:  Zhouhan Wang; Yanqi Jin; Xi Jin; Yingfeng Lu; Xiaopeng Yu; Lanjuan Li; Yimin Zhang
Journal:  Front Public Health       Date:  2021-07-02

7.  Serial interval of SARS-CoV-2 was shortened over time by nonpharmaceutical interventions.

Authors:  Sheikh Taslim Ali; Lin Wang; Eric H Y Lau; Xiao-Ke Xu; Zhanwei Du; Ye Wu; Gabriel M Leung; Benjamin J Cowling
Journal:  Science       Date:  2020-07-21       Impact factor: 47.728

8.  SARS-CoV-2 outbreak investigation in a German meat processing plant.

Authors:  Thomas Günther; Adam Grundhoff; Manja Czech-Sioli; Daniela Indenbirken; Alexis Robitaille; Peter Tenhaken; Martin Exner; Matthias Ottinger; Nicole Fischer; Melanie M Brinkmann
Journal:  EMBO Mol Med       Date:  2020-10-27       Impact factor: 12.137

9.  Real-time, interactive website for US-county level Covid-19 event risk assessment.

Authors:  Aroon Chande; Seolha Lee; Mallory Harris; Troy Hilley; Clio Andris; Joshua S Weitz
Journal:  medRxiv       Date:  2020-08-29

Review 10.  Pathophysiological Basis and Rationale for Early Outpatient Treatment of SARS-CoV-2 (COVID-19) Infection.

Authors:  Peter A McCullough; Ronan J Kelly; Gaetano Ruocco; Edgar Lerma; James Tumlin; Kevin R Wheelan; Nevin Katz; Norman E Lepor; Kris Vijay; Harvey Carter; Bhupinder Singh; Sean P McCullough; Brijesh K Bhambi; Alberto Palazzuoli; Gaetano M De Ferrari; Gregory P Milligan; Taimur Safder; Kristen M Tecson; Dee Dee Wang; John E McKinnon; William W O'Neill; Marcus Zervos; Harvey A Risch
Journal:  Am J Med       Date:  2020-08-07       Impact factor: 4.965

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.