| Literature DB >> 19371431 |
Abstract
BACKGROUND: The spatial scan statistic is a widely used statistical method for the automatic detection of disease clusters from syndromic data. Recent work in the disease surveillance community has proposed many variants of Kulldorff's original spatial scan statistic, including expectation-based Poisson and Gaussian statistics, and incorporates a variety of time series analysis methods to obtain expected counts. We evaluate the detection performance of twelve variants of spatial scan, using synthetic outbreaks injected into four real-world public health datasets.Entities:
Mesh:
Year: 2009 PMID: 19371431 PMCID: PMC2691403 DOI: 10.1186/1476-072X-8-20
Source DB: PubMed Journal: Int J Health Geogr ISSN: 1476-072X Impact factor: 3.918
Dataset description
| dataset | minimum | maximum | mean | standard deviation |
| ED | 5 | 62 | 34.40 | 8.34 |
| TH | 4 | 99 | 41.44 | 17.96 |
| CC | 338 | 5474 | 2428.46 | 923.47 |
| AF | 83 | 2875 | 1321.70 | 279.88 |
Minimum, maximum, mean, and standard deviation of daily counts for each of the four public health datasets (respiratory ED visits, OTC thermometer sales, OTC cough/cold medication sales, and OTC anti-fever medication sales).
Figure 1Aggregate time series of counts for four public health datasets. For the ED dataset, each daily count represents the total number of Emergency Department visits with respiratory chief complaints. For the three OTC datasets, each daily count represents the total number of sales of medication/medical supplies in the given product category.
Comparison of detection power on ED and TH datasets, for varying outbreak sizes
| method | ED large | ED medium | ED small | TH large | TH medium | TH small |
| KULL MA | 6.05 (35.4%) | 3.11 (98.4%) | 3.67 (84.9%) | 6.58 (12.8%) | 4.36 (92.8%) | 4.06 (93.1%) |
| KULL MA-DOW | 5.74 (39.7%) | 3.04 (98.3%) | 3.60 (85.1%) | 6.54 (12.2%) | 4.36 (92.4%) | 4.06 (92.2%) |
| KULL MA-WK | 6.01 (37.0%) | 3.11 (98.5%) | 3.65 (85.0%) | 6.57 (13.1%) | 4.36 (92.8%) | 4.06 (93.1%) |
| KULL MA-WK-DOW | 5.74 (40.0%) | 3.04 (98.3%) | 3.61 (85.1%) | 6.54 (12.3%) | 4.36 (92.4%) | 4.06 (92.2%) |
| EBP MA | ||||||
| EBP MA-DOW | 3.79 (96.5%) | |||||
| EBP MA-WK | 3.53 (96.3%) | |||||
| EBP MA-WK-DOW | 3.64 (93.2%) | 3.56 (97.5%) | 3.81 (94.7%) | |||
| EBG MA | 2.98 (100%) | 2.77 (99.9%) | 3.37 (88.4%) | 4.46 (88.8%) | 4.34 (90.2%) | 4.27 (86.6%) |
| EBG MA-DOW | 3.04 (100%) | 2.90 (99.4%) | 3.41 (88.3%) | 5.00 (78.4%) | 4.94 (77.9%) | 4.76 (74.5%) |
| EBG MA-WK | 2.99 (99.9%) | 2.78 (99.8%) | 4.71 (79.8%) | 4.42 (88.2%) | 4.31 (85.1%) | |
| EBG MA-WK-DOW | 3.15 (99.7%) | 2.97 (98.8%) | 3.40 (87.9%) | 5.24 (63.2%) | 5.02 (73.1%) | 4.76 (73.6%) |
Average days to detection, and percentage of outbreaks detected, at 1 false positive per month. Methods in bold are not significantly different (in terms of days to detect, at α = .05) from the best-performing method.
Comparison of detection power on CC and AF datasets, for varying outbreak sizes
| method | CC large | CC medium | CC small | AF large | AF medium | AF small |
| KULL MA | 6.43 (14.8%) | 2.53 (100%) | 2.33 (99.5%) | 6.64 (10.6%) | 3.20 (100%) | 3.00 (99.3%) |
| KULL MA-DOW | 5.69 (60.8%) | 6.44 (23.9%) | ||||
| KULL MA-WK | 6.43 (14.8%) | 2.53 (100%) | 2.33 (99.5%) | 6.64 (10.6%) | 3.20 (100%) | 3.00 (99.3%) |
| KULL MA-WK-DOW | 5.69 (60.8%) | 6.44 (23.9%) | ||||
| EBP MA | 4.61 (87.5%) | 4.07 (96.7%) | 4.13 (94.0%) | 3.95 (99.9%) | 3.87 (98.0%) | |
| EBP MA-DOW | 4.59 (88.6%) | 4.06 (95.8%) | 4.10 (94.4%) | 4.70 (97.5%) | 4.37 (99.0%) | 4.32 (95.9%) |
| EBP MA-WK | 3.30 (98.2%) | 2.76 (100%) | 2.83 (99.2%) | 4.64 (82.5%) | 4.07 (98.2%) | 3.87 (96.7%) |
| EBP MA-WK-DOW | 2.58 (99.9%) | 2.57 (99.3%) | 4.65 (83.8%) | 4.01 (98.5%) | 3.89 (97.0%) | |
| EBG MA | 4.73 (80.7%) | 4.30 (89.5%) | 4.43 (76.5%) | 4.80 (91.4%) | 4.56 (94.5%) | 4.47 (84.1%) |
| EBG MA-DOW | 4.81 (80.5%) | 4.36 (89.8%) | 4.54 (75.0%) | 4.96 (89.2%) | 4.68 (93.2%) | 4.70 (78.9%) |
| EBG MA-WK | 3.73 (91.5%) | 3.07 (99.4%) | 3.12 (95.4%) | 4.93 (75.7%) | 4.47 (93.3%) | 4.27 (85.9%) |
| EBG MA-WK-DOW | 3.68 (92.4%) | 3.03 (99.5%) | 3.06 (96.3%) | 5.04 (74.0%) | 4.54 (92.0%) | 4.36 (84.2%) |
Average days to detection, and percentage of outbreaks detected, at 1 false positive per month. Methods in bold are not significantly different (in terms of days to detect, at α = .05) from the best-performing method.
Figure 2Detectability results for four public health datasets. Number of injected cases needed for detection of 90% of outbreaks at 1 false positive per month, as a function of outbreak size. The x-axis of each graph represents the number of Allegheny County zip codes affected by the outbreak, out of 88 monitored zip codes for the ED dataset and 58 monitored zip codes for the three OTC datasets.
False positive rates with randomization testing
| method | ED dataset | TH dataset | CC dataset | AF dataset |
| KULL MA | .046 | .141 | .544 | .358 |
| KULL MA-DOW | .050 | .146 | .284 | .202 |
| KULL MA-WK | .050 | .114 | .568 | .337 |
| KULL MA-WK-DOW | .053 | .130 | .289 | .186 |
| EBP MA | .068 | .162 | .517 | .403 |
| EBP MA-DOW | .071 | .141 | .409 | .340 |
| EBP MA-WK | .064 | .159 | .520 | .422 |
| EBP MA-WK-DOW | .071 | .149 | .348 | .332 |
| EBG MA | .334 | .398 | .244 | .204 |
| EBG MA-DOW | .473 | .496 | .268 | .249 |
| EBG MA-WK | .349 | .390 | .218 | .226 |
| EBG MA-WK-DOW | .466 | .485 | .252 | .249 |
Proportion of days significant at α = 0.05, for each of the four public health datasets with no outbreaks injected.
Detection power with and without randomization testing
| method | ED dataset | TH dataset | CC dataset | AF dataset |
| KULL MA | 3.23/3.23 | 4.41/4.23 | 5.40/ | 4.52/ |
| KULL MA-DOW | 3.45/3.04 | 4.95/ | 3.73/ | 3.65/ |
| KULL MA-WK | 3.31/3.23 | 5.26/ | 6.04/ | 5.30/ |
| KULL MA-WK-DOW | 3.19/3.04 | 5.20/ | 3.57/ | 3.99/ |
| EBP MA | 2.54/2.50 | 3.95/ | 6.36/ | 5.89/ |
| EBP MA-DOW | 2.65/2.53 | 3.51/3.44 | 4.59/4.10 | 5.62/ |
| EBP MA-WK | 2.74/2.50 | 5.04/ | 5.84/ | 5.11/ |
| EBP MA-WK-DOW | 2.92/2.59 | 4.31/ | 5.05/ | 5.30/ |
| EBG MA | 4.50/ | 5.90/ | 4.94/4.43 | 4.92/4.63 |
| EBG MA-DOW | 5.48/ | 5.15/4.66 | 5.61/ | 5.00/4.79 |
| EBG MA-WK | 4.87/ | 5.92/ | 3.82/ | 4.58/4.43 |
| EBG MA-WK-DOW | 5.53/ | 5.92/ | 4.90/ | 4.53/4.56 |
Average days to detection at 1 false positive per month for "medium-sized" outbreaks injected into each dataset, using empirically determined thresholds on p-value (computed by randomization testing) and score (without randomization testing) respectively. If there is a significant difference between the detection times with and without randomization, the better-performing method is marked in bold.
Detection power with and without randomization testing, using empirical/asymptotic p-values
| method | ED dataset | TH dataset | CC dataset | AF dataset |
| KULL MA | 3.17/3.23 | 4.24/4.23 | 2.60/2.55 | 3.18/3.19 |
| KULL MA-DOW | 3.26/3.04 | 4.23/4.09 | 2.26/2.26 | 2.83/2.80 |
| KULL MA-WK | 3.21/3.23 | 4.02/4.23 | 2.58/2.55 | 3.08/3.19 |
| KULL MA-WK-DOW | 3.21/3.04 | 4.03/4.09 | 2.41/2.26 | 2.82/2.80 |
| EBP MA | 2.48/2.50 | 3.28/3.29 | 3.90/3.99 | |
| EBP MA-DOW | 2.49/2.53 | 3.57/3.44 | 4.20/4.36 | |
| EBP MA-WK | 2.67/2.50 | 3.64/3.40 | 3.08/ | 4.62/ |
| EBP MA-WK-DOW | 2.92/2.59 | 4.02/3.75 | 2.90/ | 4.53/ |
| EBG MA | 2.84/2.91 | 4.00/4.19 | 4.52/4.43 | 4.35/4.63 |
| EBG MA-DOW | 3.05/3.01 | 4.92/4.66 | 4.67/4.50 | 4.83/4.79 |
| EBG MA-WK | 2.91/2.87 | 4.20/4.24 | 3.25/3.16 | 4.51/4.43 |
| EBG MA-WK-DOW | 2.95/3.04 | 4.99/4.73 | 3.08/2.96 | 4.46/4.56 |
Average days to detection at 1 false positive per month for "medium-sized" outbreaks injected into each dataset, using empirically determined thresholds on p-value (computed by randomization testing, using empirical/asymptotic p-values [37]) and score (without randomization testing) respectively. If there is a significant difference between the detection times with and without randomization, the better-performing method is marked in bold.
Score and p-value thresholds corresponding to one false positive per month
| method | ED dataset | TH dataset | CC dataset | AF dataset |
| KULL MA | 7.3/0.029 | 10.6/3.7 × 10-3 | 25.0/2.4 × 10-7 | 18.6/9.0 × 10-6 |
| KULL MA-DOW | 6.0/0.034 | 8.7/4.6 × 10-3 | 15.4/4.8 × 10-5 | 12.0/5.9 × 10-4 |
| KULL MA-WK | 7.3/0.025 | 10.6/5.3 × 10-3 | 25.0/2.0 × 10-7 | 18.6/1.0 × 10-5 |
| KULL MA-WK-DOW | 6.0/0.033 | 8.7/5.6 × 10-3 | 15.4/6.0 × 10-5 | 12.0/3.2 × 10-4 |
| EBP MA | 6.7/0.025 | 10.3/2.6 × 10-3 | 68.6/3.7 × 10-13 | 31.4/1.3 × 10-11 |
| EBP MA-DOW | 6.0/0.029 | 9.2/1.9 × 10-3 | 57.8/2.6 × 10-11 | 30.9/1.7 × 10-9 |
| EBP MA-WK | 6.4/0.030 | 10.2/2.8 × 10-3 | 34.9/1.4 × 10-12 | 33.9/3.0 × 10-14 |
| EBP MA-WK-DOW | 6.0/0.019 | 9.1/3.2 × 10-3 | 26.8/5.3 × 10-11 | 29.4/6.3 × 10-10 |
| EBG MA | 13.9/4.5 × 10-5 | 20.9/2.1 × 10-7 | 28.6/8.2 × 10-11 | 19.9/6.1 × 10-7 |
| EBG MA-DOW | 15.9/2.7 × 10-6 | 27.9/3.4 × 10-11 | 30.8/3.5 × 10-13 | 23.7/2.1 × 10-8 |
| EBG MA-WK | 13.5/6.1 × 10-5 | 21.0/3.1 × 10-7 | 16.7/1.2 × 10-6 | 17.5/2.1 × 10-6 |
| EBG MA-WK-DOW | 16.0/2.2 × 10-6 | 26.8/1.6 × 10-11 | 19.4/7.1 × 10-8 | 20.9/6.0 × 10-8 |
Score threshold (computed without randomization testing) and p-value threshold (computed by randomization testing, using empirical/asymptotic p-values) corresponding to an observed false positive rate of 0.0329, i.e. one false positive per month.