| Literature DB >> 26733471 |
Ruth H Keogh1, Punam Mangtani2, Laura Rodrigues3, Patrick Nguipdop Djomo4.
Abstract
BACKGROUND: Traditional analyses of standard case-control studies using logistic regression do not allow estimation of time-varying associations between exposures and the outcome. We present two approaches which allow this. The motivation is a study of vaccine efficacy as a function of time since vaccination.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26733471 PMCID: PMC4702367 DOI: 10.1186/s12874-015-0104-0
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1Diagram illustrating a case-cohort study showing survival times (black circles) and censoring times (lines not ending in a circle) in a prospective cohort, also showing the subcohort and the controls used for each case in the case-cohort analysis (grey circles). The full risk set for each case is indicated by the dotted lines
Simulation study results
| True HR | True log HR | OR or HR | Log OR or Log HR | Difference from true log HR | Emp SD | Model SE | Cov | RE | |
|---|---|---|---|---|---|---|---|---|---|
| 1. Logistic regression analysis using controls in multiple time periods, controls definition (i) | |||||||||
| Age 0-4 | 0.25 | −1.386 | 0.246 | −1.403 | −0.017 | 0.260 | 0.258 | 0.943 | - |
| Age 5-9 | 0.34 | −1.079 | 0.335 | −1.093 | −0.015 | 0.238 | 0.244 | 0.962 | - |
| Age 10–14 | 0.46 | −0.777 | 0.458 | −0.781 | −0.004 | 0.235 | 0.234 | 0.947 | - |
| Age 15-19 | 0.62 | −0.478 | 0.619 | −0.480 | −0.002 | 0.205 | 0.197 | 0.943 | - |
| 1. Logistic regression analysis using controls in multiple time periods, controls definition (ii) | |||||||||
| Age 0-4 | 0.25 | −1.386 | 0.244 | −1.412 | −0.026 | 0.290 | 0.293 | 0.947 | 80 |
| Age 5-9 | 0.34 | −1.079 | 0.332 | −1.104 | −0.025 | 0.267 | 0.270 | 0.952 | 79 |
| Age 10–14 | 0.46 | −0.777 | 0.456 | −0.785 | −0.008 | 0.260 | 0.257 | 0.948 | 82 |
| Age 15-19 | 0.62 | −0.478 | 0.617 | −0.483 | −0.005 | 0.230 | 0.226 | 0.940 | 79 |
| 1. Logistic regression analysis using controls in multiple time periods, controls definition (iii) | |||||||||
| Age 0-4 | 0.25 | −1.386 | 0.246 | −1.403 | −0.017 | 0.301 | 0.293 | 0.948 | 75 |
| Age 5-9 | 0.34 | −1.079 | 0.334 | −1.098 | −0.020 | 0.260 | 0.266 | 0.954 | 84 |
| Age 10–14 | 0.46 | −0.777 | 0.460 | −0.776 | 0.000 | 0.246 | 0.246 | 0.954 | 91 |
| Age 15-19 | 0.62 | −0.478 | 0.619 | −0.480 | −0.002 | 0.217 | 0.211 | 0.942 | 89 |
| 1. Logistic regression analysis using controls in multiple time periods, controls definition (iv) | |||||||||
| Age 0-4 | 0.25 | −1.386 | 0.243 | −1.416 | −0.030 | 0.351 | 0.346 | 0.951 | 55 |
| Age 5-9 | 0.34 | −1.079 | 0.328 | −1.114 | −0.035 | 0.299 | 0.301 | 0.953 | 63 |
| Age 10–14 | 0.46 | −0.777 | 0.458 | −0.780 | −0.003 | 0.273 | 0.273 | 0.952 | 74 |
| Age 15-19 | 0.62 | −0.478 | 0.616 | −0.484 | −0.006 | 0.248 | 0.247 | 0.951 | 68 |
| 2. Logistic regression analysis, not using controls across multiple time periods | |||||||||
| Age 0-4 | 0.25 | −1.386 | 0.244 | −1.411 | −0.025 | 0.318 | 0.307 | 0.954 | 67 |
| Age 5-9 | 0.34 | −1.079 | 0.327 | −1.118 | −0.039 | 0.317 | 0.315 | 0.946 | 56 |
| Age 10–14 | 0.46 | −0.777 | 0.454 | −0.789 | −0.012 | 0.305 | 0.301 | 0.950 | 59 |
| Age 15-19 | 0.62 | −0.478 | 0.616 | −0.485 | −0.007 | 0.218 | 0.214 | 0.939 | 88 |
| 3. Case-cohort analysis | |||||||||
| Age 0-4 | 0.25 | −1.386 | 0.249 | −1.390 | −0.004 | 0.277 | 0.267 | 0.944 | - |
| Age 5-9 | 0.34 | −1.079 | 0.337 | −1.087 | −0.008 | 0.240 | 0.245 | 0.957 | - |
| Age 10–14 | 0.46 | −0.777 | 0.461 | −0.775 | 0.002 | 0.236 | 0.233 | 0.942 | - |
| Age 15-19 | 0.62 | −0.478 | 0.623 | −0.474 | 0.004 | 0.206 | 0.198 | 0.939 | - |
OR or HR: Exponential of the mean estimated log OR (logistic analyses) or log HR (case-cohort analysis) across 1000 simulations.
Log OR or log HR: Mean of the estimated log OR (logistic analyses) or log HR (case-cohort analysis) across 1000 simulations.
Difference from true log HR: Mean difference between the estimate of the log HR or log OR and the true log HR across the 1000 simulations.
Emp SD: Empirical standard deviation of the estimates of the log HRs or log ORs across the 1000 simulations.
Model SE: The mean of the model-based standard errors for the estimates of the log HRs or log ORs across the 1000 simulations.
Cov (Coverage): The proportion of the 1000 95 % confidence intervals for each of the log HRs or log ORs ratios which contain the true log HR.
RE (Relative efficiency): percentage efficiency relative to the logistic analysis using controls definition (i). The relative efficiency is the ratio of the squared empirical standard deviation for the reference method (i) to the squared empirical standard deviation for the comparison method (control definitions (ii), (iii), (iv), and not reusing controls), expressed as a percentage