| Literature DB >> 34238978 |
Cécile Kremer1, Andrea Torneri2, Sien Boesmans3, Hanne Meuwissen3, Selina Verdonschot3, Koen Vanden Driessche4,5, Christian L Althaus6, Christel Faes7, Niel Hens7,2.
Abstract
The number of secondary cases, i.e. the number of new infections generated by an infectious individual, is an important parameter for the control of infectious diseases. When individual variation in disease transmission is present, like for COVID-19, the distribution of the number of secondary cases is skewed and often modeled using a negative binomial distribution. However, this may not always be the best distribution to describe the underlying transmission process. We propose the use of three other offspring distributions to quantify heterogeneity in transmission, and we assess the possible bias in estimates of the mean and variance of this distribution when the data generating distribution is different from the one used for inference. We also analyze COVID-19 data from Hong Kong, India, and Rwanda, and quantify the proportion of cases responsible for 80% of transmission, [Formula: see text], while acknowledging the variation arising from the assumed offspring distribution. In a simulation study, we find that variance estimates may be biased when there is a substantial amount of heterogeneity, and that selection of the most accurate distribution from a set of distributions is important. In addition we find that the number of secondary cases for two of the three COVID-19 datasets is better described by a Poisson-lognormal distribution.Entities:
Year: 2021 PMID: 34238978 PMCID: PMC8266910 DOI: 10.1038/s41598-021-93578-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The top panel shows the expected proportion of all transmission that is (a) due to the 20% most infectious cases for different levels of overdispersion and different distributions, with the offspring mean R fixed at 0.8; and (b) due to a given proportion of infectious cases, where cases are ranked by their transmission potential, for (dotted), (dashed), (full), and the different distributions, with R fixed at 0.8. The lower panel shows the realized proportion of all transmission that is (c) due to the 20% most infectious cases, shaded vertical bars show the range surrounding the proportions at (see Supplementary Methods); and (d) due to a given proportion of infectious cases.
Estimates of the offspring mean R and its standard deviation (), in addition to the coefficient of variation (CV), using the different mixture distributions, and their AIC value and corresponding Akaike weights (), for three COVID-19 datasets.
| Dataset | Distribution | AIC | ||||
|---|---|---|---|---|---|---|
| Hong Kong[ | NB | 0.583 (0.448–0.718) | 1.175 (0.944–1.490) | 2.016 (1.816–2.395) | 593.925 | 0.078 |
| POLN | 0.587 (0.456–0.779) | 1.413 (0.969–2.442) | 2.409 (1.869–3.426) | 590.009 | 0.551 | |
| POWB | 0.580 (0.445–0.745) | 1.218 (0.970–1.734) | 2.101 (1.841–2.560) | 591.747 | 0.231 | |
| POGG | 0.580 (0.3789–0.724) | 1.258 (0.923–1.550) | 2.171 (2.059–2.466) | 592.738 | 0.141 | |
| India[ | NB | 0.484 (0.480–0.494) | 0.973 (0.962–0.985) | 2.009 (1.994–2.024) | 163974.5 | 0.000 |
| POLN | 0.484 (0.477–0.491) | 1.077 (1.055–1.101) | 2.226 (2.195–2.258) | 162980.6 | 0.000 | |
| POWB | 0.483 (0.476–0.489) | 0.997 (0.984–1.011) | 2.067 (2.049–2.084) | 163530.8 | 1.000 | |
| POGG | 0.484 (0.477–0.490) | 1.012 (1.000–1.024) | 2.094 (2.086–2.102) | 163286.5 | 0.000 | |
| Rwanda | NB | 0.259 (0.216–0.302) | 0.623 (0.547–0.731) | 2.406 (2.223–2.743) | 1015.261 | 0.157 |
| POLN | 0.260 (0.219–0.311) | 0.657 (0.560–0.820) | 2.528 (2.267–2.892) | 1013.073 | 0.468 | |
| POWB | 0.259 (0.217–0.311) | 0.631 (0.557–0.783) | 2.436 (2.225–2.750) | 1014.350 | 0.247 | |
| POGG | 0.259 (0.216–0.301) | 0.634 (0.561–0.706) | 2.451 (2.340–2.603) | 1015.667 | 0.128 |
Estimates of the proportion of cases responsible for 80% of transmission (, following Eqs. (1) and (2) using the different mixture distributions, for three COVID-19 datasets. Estimates based on the negative binomial distribution correspond to those reported in the literature for the two published datasets.
| Dataset | Distribution | ||
|---|---|---|---|
| Eq. ( | Eq. ( | ||
| Hong Kong[ | NB | 0.288 (0.208–0.345) | 0.191 (0.145–0.223) |
| POLN | 0.332 (0.236–0.438) | 0.195 (0.153–0.242) | |
| POWB | 0.294 (0.223–0.358) | 0.189 (0.148–0.221) | |
| POGG | 0.303 (0.279–0.325) | 0.190 (0.145–0.203) | |
| India[ | NB | 0.319 (0.314–0.324) | 0.191 (0.189–0.194) |
| POLN | 0.373 (0.367–0.379) | 0.195 (0.193–0.199) | |
| POWB | 0.322 (0.318–0.327) | 0.189 (0.187–0.191) | |
| POGG | 0.333 (0.332–0.335) | 0.191 (0.189–0.192) | |
| Rwanda | NB | 0.323 (0.223–0.390) | 0.138 (0.114–0.157) |
| POLN | 0.389 (0.318–0.459) | 0.139 (0.120–0.157) | |
| POWB | 0.331 (0.241–0.394) | 0.137 (0.117–0.157) | |
| POGG | 0.344 (0.337–0.350) | 0.137 (0.122–0.151) | |
Figure 2Proportion of most infectious cases responsible for a certain proportion of transmission, based on estimates from (a,b) Hong Kong, (c,d) India, and (e,f) Rwanda. Proportions are obtained based on the distribution of the individual reproduction number (left), and based on the complete offspring distribution (right). The shaded areas in the right panels represent the range surrounding specific proportions when considering the discrete nature of the realized offspring distributions (see Supplementary Methods).
Different mixture distributions, assuming a Poisson distribution for the effective contact process.
| Distribution for | Offspring distribution | ||
|---|---|---|---|