| Literature DB >> 35831702 |
Kin Yau Wong1, Qingning Zhou2, Tao Hu3.
Abstract
The incubation period is a key characteristic of an infectious disease. In the outbreak of a novel infectious disease, accurate evaluation of the incubation period distribution is critical for designing effective prevention and control measures . Estimation of the incubation period distribution based on limited information from retrospective inspection of infected cases is highly challenging due to censoring and truncation. In this paper, we consider a semiparametric regression model for the incubation period and propose a sieve maximum likelihood approach for estimation based on the symptom onset time, travel history, and basic demographics of reported cases. The approach properly accounts for the pandemic growth and selection bias in data collection. We also develop an efficient computation method and establish the asymptotic properties of the proposed estimators. We demonstrate the feasibility and advantages of the proposed methods through extensive simulation studies and provide an application to a dataset on the outbreak of COVID-19.Entities:
Keywords: COVID-19; Cox proportional hazards model; Sieve estimation; Survival analysis; Truncated data
Year: 2022 PMID: 35831702 PMCID: PMC9281361 DOI: 10.1007/s10985-022-09567-3
Source DB: PubMed Journal: Lifetime Data Anal ISSN: 1380-7870 Impact factor: 1.429
Mean times to infection and truncation rates in the simulation studies
| Mean time to infection | Truncation Rate | ||
|---|---|---|---|
| 0.2 | 99.2 | ||
| 0.3 | 65.8 | ||
| 0.01 | 0.2 | 9.0 | 0.75 |
| 0.3 | 5.9 | 0.56 | |
| 0.1 | 0.2 | 2.6 | 0.25 |
| 0.3 | 1.7 | 0.17 |
Simulation results under the correct model with
| Setting | Parameter | True | Bias | ESD | SEE | Cov |
|---|---|---|---|---|---|---|
| 1 | 0.500 | 0.007 | 0.062 | 0.060 | 0.94 | |
| 0.111 | 0.110 | 0.95 | ||||
| 0.010 | 0.004 | 0.031 | 0.019 | 0.94 | ||
| 0.200 | 0.013 | 0.104 | 0.107 | 0.91 | ||
| 2 | 0.500 | 0.006 | 0.060 | 0.060 | 0.94 | |
| 0.108 | 0.109 | 0.96 | ||||
| 0.010 | 0.005 | 0.022 | 0.016 | 0.93 | ||
| 0.300 | 0.019 | 0.110 | 0.100 | 0.93 | ||
| 3 | 0.500 | 0.005 | 0.059 | 0.058 | 0.95 | |
| 0.109 | 0.107 | 0.94 | ||||
| 0.100 | 0.018 | 0.144 | 0.095 | 0.95 | ||
| 0.200 | 0.051 | 0.149 | 0.111 | 0.91 | ||
| 4 | 0.500 | 0.006 | 0.060 | 0.058 | 0.94 | |
| 0.109 | 0.106 | 0.95 | ||||
| 0.100 | 0.017 | 0.168 | 0.113 | 0.96 | ||
| 0.300 | 0.074 | 0.191 | 0.154 | 0.90 |
* The true value and bias for are 1.609 and 0.004
** The true value and bias for are 1.204 and 0.041
“True” stands for the true parameter value, “ESD” stands for the empirical standard deviation of the estimated values, “SEE” stands for the mean standard error estimates, and “Cov” stands for the empirical coverage of the 95% confidence interval. The number of replicates in which one or more parameter estimates are at the boundary for Settings 1–4 are 50, 18, 4, and 3, respectively. Replicates with estimates at the boundary are discarded
Simulation results under the correct model with
| Setting | Parameter | True | Bias | ESD | SEE | Cov |
|---|---|---|---|---|---|---|
| 1 | 0.500 | 0.003 | 0.043 | 0.042 | 0.96 | |
| 0.078 | 0.078 | 0.94 | ||||
| 0.010 | 0.001 | 0.012 | 0.010 | 0.95 | ||
| 0.200 | 0.002 | 0.077 | 0.079 | 0.93 | ||
| 2 | 0.500 | 0.002 | 0.042 | 0.042 | 0.95 | |
| 0.077 | 0.077 | 0.96 | ||||
| 0.010 | 0.003 | 0.013 | 0.009 | 0.94 | ||
| 0.300 | 0.008 | 0.069 | 0.068 | 0.95 | ||
| 3 | 0.500 | 0.001 | 0.041 | 0.041 | 0.95 | |
| 0.076 | 0.076 | 0.95 | ||||
| 0.100 | 0.010 | 0.066 | 0.058 | 0.95 | ||
| 0.200 | 0.018 | 0.079 | 0.070 | 0.93 | ||
| 4 | 0.500 | 0.002 | 0.040 | 0.040 | 0.95 | |
| 0.075 | 0.075 | 0.96 | ||||
| 0.100 | 0.006 | 0.062 | 0.057 | 0.97 | ||
| 0.300 | 0.036 | 0.127 | 0.105 | 0.93 |
* The true value and bias for are 1.609 and 0.017
** The true value and bias for are 1.204 and 0.050
See NOTE to Table 2. The number of replicates in which one or more parameter estimates are at the boundary for Settings 1–4 are 17, 1, 0, and 0, respectively. Replicates with estimates at the boundary are discarded
Fig. 1Estimated cumulative baseline hazard functions under the correct model with
Simulation results under the correct model with and a large censoring proportion
| Setting | Parameter | True | Bias | ESD | SEE | Cov |
|---|---|---|---|---|---|---|
| 1 | 0.500 | 0.007 | 0.072 | 0.071 | 0.95 | |
| 0.134 | 0.133 | 0.95 | ||||
| 0.010 | 0.004 | 0.025 | 0.019 | 0.95 | ||
| 0.200 | 0.008 | 0.104 | 0.105 | 0.92 | ||
| 2 | 0.500 | 0.006 | 0.068 | 0.070 | 0.96 | |
| 0.131 | 0.130 | 0.95 | ||||
| 0.010 | 0.006 | 0.026 | 0.017 | 0.93 | ||
| 0.300 | 0.018 | 0.109 | 0.100 | 0.93 | ||
| 3 | 0.500 | 0.005 | 0.067 | 0.065 | 0.94 | |
| 0.124 | 0.122 | 0.96 | ||||
| 0.100 | 0.023 | 0.214 | 0.108 | 0.95 | ||
| 0.200 | 0.050 | 0.148 | 0.112 | 0.91 | ||
| 4 | 0.500 | 0.005 | 0.066 | 0.063 | 0.94 | |
| 0.121 | 0.118 | 0.94 | ||||
| 0.100 | 0.016 | 0.151 | 0.099 | 0.96 | ||
| 0.300 | 0.061 | 0.181 | 0.151 | 0.91 |
* The true value and bias for are 1.609 and 0.012
** The true value and bias for are 1.204 and 0.011
See NOTE to Table 2. The number of replicates in which one or more parameter estimates are at the boundary for Settings 1–4 are 27, 4, 3, and 0, respectively. Replicates with estimates at the boundary are discarded
Simulation results under misspecified infection time distributions
| Setting | Parameter | True | Bias | ESD | SEE | Cov |
|---|---|---|---|---|---|---|
| Weibull | 0.500 | 0.008 | 0.041 | 0.042 | 0.95 | |
| 0.079 | 0.076 | 0.94 | ||||
| Lognormal | 0.500 | 0.001 | 0.040 | 0.041 | 0.96 | |
| 0.077 | 0.075 | 0.94 |
See NOTE to Table 2. The number of replicates in which in which one or more parameter estimates are at the boundary for Setting Weibull and Setting Lognormal are 154 and 40, respectively. Replicates with estimates at the boundary are discarded
Fig. 2Estimated cumulative baseline hazard functions under misspecified infection time distributions
Simulation results under alternative simple methods
| Setting | Midpoint Imputation | Interval-Censored Data Regression | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Parameter | True | Bias | ESD | SEE | Cov | Bias | ESD | SEE | Cov | ||
| 0.01 | 0.2 | 0.500 | 0.04 | 0.039 | 0.89 | 0.003 | 0.047 | 0.043 | 0.93 | ||
| 0.025 | 0.073 | 0.073 | 0.94 | 0.085 | 0.078 | 0.93 | |||||
| 0.01 | 0.3 | 0.500 | 0.039 | 0.039 | 0.90 | 0.001 | 0.045 | 0.043 | 0.93 | ||
| 0.024 | 0.074 | 0.073 | 0.93 | 0.082 | 0.078 | 0.94 | |||||
| 0.1 | 0.2 | 0.500 | 0.038 | 0.039 | 0.92 | 0.045 | 0.042 | 0.93 | |||
| 0.016 | 0.073 | 0.073 | 0.94 | 0.002 | 0.083 | 0.077 | 0.93 | ||||
| 0.1 | 0.3 | 0.500 | 0.038 | 0.039 | 0.93 | 0.044 | 0.041 | 0.92 | |||
| 0.014 | 0.074 | 0.073 | 0.95 | 0.003 | 0.082 | 0.076 | 0.94 | ||||
See NOTE to Table 2
Fig. 3Estimated cumulative baseline hazard functions using alternative simple methods
Simulation results under
| Setting | Parameter | True | Bias | ESD | SEE | Cov | |
|---|---|---|---|---|---|---|---|
| 1 | 400 | 0.500 | 0.005 | 0.060 | 0.060 | 0.96 | |
| 0.112 | 0.110 | 0.95 | |||||
| 0.200 | 0.007 | 0.093 | 0.091 | 0.91 | |||
| 2 | 400 | 0.500 | 0.006 | 0.060 | 0.060 | 0.95 | |
| 0.113 | 0.110 | 0.94 | |||||
| 0.300 | 0.007 | 0.098 | 0.096 | 0.92 | |||
| 3 | 800 | 0.500 | 0.003 | 0.042 | 0.042 | 0.95 | |
| 0.079 | 0.078 | 0.95 | |||||
| 0.200 | 0.000 | 0.067 | 0.066 | 0.94 | |||
| 4 | 800 | 0.500 | 0.004 | 0.043 | 0.042 | 0.96 | |
| 0.077 | 0.078 | 0.97 | |||||
| 0.300 | 0.000 | 0.067 | 0.070 | 0.95 |
See NOTE to Table 2.The number of replicates in which one or more parameter estimates are at the boundary for Settings 1–4 are 45, 57, 7, and 9, respectively. Replicates with estimates at the boundary are discarded
Fig. 4Estimated survival functions for different subgroups
Fig. 5Conditional CDF of S