| Literature DB >> 15904524 |
Toshiro Tango1, Kunihiko Takahashi.
Abstract
BACKGROUND: The spatial scan statistic proposed by Kulldorff has been applied to a wide variety of epidemiological studies for cluster detection. This scan statistic, however, uses a circular window to define the potential cluster areas and thus has difficulty in correctly detecting actual noncircular clusters. A recent proposal by Duczmal and Assunção for detecting noncircular clusters is shown to detect a cluster of very irregular shape that is much larger than the true cluster in our experiences.Entities:
Year: 2005 PMID: 15904524 PMCID: PMC1173134 DOI: 10.1186/1476-072X-4-11
Source DB: PubMed Journal: Int J Health Geogr ISSN: 1476-072X Impact factor: 3.918
Figure 1An entire study population for simulation studies. The 113 regions comprising wards, cities and villages in the area of Tokyo Metropolis and Kanagawa prefecture in Japan. The region number used in the text is shown. Especially, The region numbers of four hot-spot clusters A-D are A = {14, 15, 20}, B = {14, 15, 20, 26}, C = {14, 15, 26, 27}, and D = {73, 74, 75, 76, 78}, respectively.
Figure 2A random sample from cluster model C. Dots describe the centroids of regions with some cases. Circles are drawn only for the regions whose standardized risk ratios are statistically significantly larger than 1 at α = 0.05 and the region number is placed in stead of dot. The radius is set inversely proportional to the tail probability.
Figure 3The most likely cluster detected by the circular and the flexible spatial scan statistic. (a) Detected by the circular spatial scan statistic for both K = 15 and K = 20 and (b) by the flexible spatial scan statistic for both K = 15 and K = 20, when applied to a random sample from the cluster model C = {14, 15, 26, 27}.
Figure 4The most likely cluster detected by the Duczmal and Assunção's scan statistic. (a) Detected for K = 15 and (b) for K = 20, when applied to a random sample from the cluster model C = {14, 15, 26, 27}.
Regions detected as the most likely cluster by three procedures. Regions detected as the most likely cluster by the circular scan, the flexible scan and Duczmal and Assunção's scan, with the maximum length of cluster set to be K = 15 for the simulated random sample from the cluster model C where the hot spot cluster is assumed to be the set of connected four regions {14, 15, 26, 27} with the assumed relative risk θ = 3.0. For details, see text.
| region no. | population | observed no. cases | expected no. cases | relative risk estimated (true) | Log likelihood ratio (LLR) and estimated relative risk | ||
| Circular | Flexible | Duczmal et al. | |||||
| 14 | 319,687 | 14 | 3.794 | 3.69 (3.0) | * | * | * |
| 15 | 529,485 | 21 | 6.283 | 3.34 (3.0) | * | * | * |
| LLR = 20.1 | |||||||
| 26 | 139,077 | 6 | 1.650 | 3.64 (3.0) | * | * | |
| 27 | 165,564 | 6 | 1.964 | 3.05 (3.0) | * | * | |
| 33 | 105,899 | 4 | 1.257 | 3.18 (1.0) | * | * | |
| LLR = 29.7 | |||||||
| 24 | 466,347 | 8 | 5.534 | 1.44 (1.0) | * | ||
| 31 | 197,677 | 3 | 2.346 | 1.27 (1.0) | * | ||
| 32 | 349,050 | 5 | 4.142 | 1.20 (1.0) | * | ||
| 48 | 58,635 | 1 | 0.696 | 1.43 (1.0) | * | ||
| 54 | 3,808 | 1 | 0.045 | 22.12(1.0) | * | ||
| 69 | 119,575 | 3 | 1.419 | 2.11 (1.0) | * | ||
| 77 | 177,742 | 5 | 2.109 | 2.37 (1.0) | * | ||
| 78 | 125,127 | 2 | 1.485 | 1.34 (1.0) | * | ||
| 90 | 194,866 | 5 | 2.312 | 2.16 (1.0) | * | ||
| 110 | 21,535 | 1 | 0.256 | 3.91 (1.0) | * | ||
| LLR = 31.8 | |||||||
Comparison of the circular and the flexible spatial scan statistic for the cluster model A. Comparison of bivariate power distribution P(l, s) × 1000 between the circular spatial scan statistic and the flexible spatial scan statistic for the hot-spot cluster A = {14, 15, 20}. Nominal α-level is set as 0.05 and 1000 trials are carried out. For more details, see text.
| Flexible ( | Circular ( | ||||||||||
| Length | Include | Total | Length | Include | Total | ||||||
| 0 | 1 | 2 | 3 | 0 | 1 | 2 | 3 | ||||
| 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | ||||
| 2 | 0 | 0 | 0 | 0 | 2 | 1 | 0 | 0 | 1 | ||
| 3 | 0 | 0 | 0 | 142 | 142 | 3 | 0 | 0 | 0 | 738 | 738 |
| 4 | 0 | 0 | 0 | 116 | 116 | 4 | 0 | 0 | 0 | 134 | 134 |
| 5 | 0 | 0 | 0 | 137 | 137 | 5 | 0 | 0 | 0 | 39 | 39 |
| 6 | 0 | 0 | 0 | 149 | 149 | 6 | 0 | 0 | 0 | 12 | 12 |
| 7 | 0 | 0 | 0 | 165 | 165 | 7 | 0 | 0 | 0 | 9 | 9 |
| 8 | 0 | 0 | 0 | 131 | 131 | 8 | 0 | 0 | 0 | 1 | 1 |
| 9 | 0 | 0 | 0 | 84 | 84 | 9 | 0 | 0 | 2 | 3 | 5 |
| 10 | 0 | 0 | 0 | 27 | 27 | 10 | 0 | 0 | 0 | 2 | 2 |
| 11 | 0 | 0 | 0 | 11 | 11 | 11 | 0 | 0 | 0 | 4 | 4 |
| 12 | 0 | 0 | 0 | 2 | 2 | 12 | 0 | 0 | 0 | 12 | 12 |
| 13 | 0 | 0 | 0 | 0 | 0 | 13 | 0 | 0 | 0 | 14 | 14 |
| 14 | 0 | 0 | 0 | 0 | 0 | 14 | 0 | 0 | 0 | 3 | 3 |
| 15 | 0 | 0 | 0 | 0 | 0 | 15 | 0 | 0 | 0 | 6 | 6 |
| Total | 0 | 0 | 0 | 964 | 964 | Total | 1 | 0 | 2 | 977 | 980 |
| usual power = 0.964 | usual power = 0.980 | ||||||||||
Comparison of the circular and the flexible spatial scan statistic for the cluster model B. Comparison of bivariate power distribution P(l, s) × 1000 between the circular spatial scan statistic and the flexible spatial scan statistic for the hot-spot cluster B = {14, 15, 20, 26}. Nominal α-level is set as 0.05 and 1000 trials are carried out. For more details, see text.
| Flexible ( | Circular ( | ||||||||||||
| Length | Include | Total | Length | Include | Total | ||||||||
| 0 | 1 | 2 | 3 | 4 | 0 | 1 | 2 | 3 | 4 | ||||
| 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | ||||||
| 2 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | ||||
| 3 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 523 | 523 | ||
| 4 | 0 | 0 | 0 | 0 | 127 | 127 | 4 | 0 | 0 | 0 | 65 | 0 | 65 |
| 5 | 1 | 0 | 0 | 0 | 157 | 158 | 5 | 0 | 0 | 0 | 23 | 0 | 23 |
| 6 | 0 | 0 | 0 | 0 | 205 | 205 | 6 | 0 | 0 | 0 | 7 | 66 | 73 |
| 7 | 0 | 0 | 0 | 2 | 198 | 200 | 7 | 0 | 0 | 0 | 0 | 15 | 15 |
| 8 | 0 | 0 | 0 | 1 | 151 | 152 | 8 | 0 | 0 | 0 | 0 | 32 | 32 |
| 9 | 0 | 0 | 0 | 5 | 85 | 90 | 9 | 0 | 0 | 0 | 1 | 15 | 16 |
| 10 | 0 | 0 | 0 | 1 | 24 | 25 | 10 | 0 | 0 | 0 | 0 | 7 | 7 |
| 11 | 0 | 0 | 0 | 0 | 17 | 17 | 11 | 0 | 0 | 0 | 2 | 3 | 5 |
| 12 | 0 | 0 | 0 | 0 | 5 | 5 | 12 | 0 | 0 | 0 | 2 | 63 | 65 |
| 13 | 0 | 0 | 0 | 0 | 0 | 0 | 13 | 0 | 0 | 0 | 0 | 96 | 96 |
| 14 | 0 | 0 | 0 | 0 | 0 | 0 | 14 | 0 | 0 | 0 | 0 | 30 | 30 |
| 15 | 0 | 0 | 0 | 0 | 0 | 0 | 15 | 0 | 0 | 0 | 0 | 22 | 22 |
| Total | 1 | 0 | 0 | 9 | 969 | 979 | Total | 0 | 0 | 0 | 623 | 349 | 972 |
| usual power = 0.979 | usual power = 0.972 | ||||||||||||
Comparison of the circular and the flexible spatial scan statistic for the cluster model C. Comparison of bivariate power distribution P(l, s) × 1000 between the circular spatial scan statistic and the flexible spatial scan statistic for the hot-spot cluster C = {14, 15, 26, 27}. Nominal α-level is set as 0.05 and 1000 trials are carried out. For more details, see text.
| Flexible ( | Circular ( | ||||||||||||
| Length | Include | Total | Length | Include | Total | ||||||||
| 0 | 1 | 2 | 3 | 4 | 0 | 1 | 2 | 3 | 4 | ||||
| 1 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | ||||||
| 2 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 351 | 351 | ||||
| 3 | 0 | 0 | 0 | 0 | 0 | 3 | 2 | 0 | 4 | 0 | 6 | ||
| 4 | 0 | 0 | 0 | 0 | 138 | 138 | 4 | 0 | 0 | 3 | 0 | 0 | 3 |
| 5 | 0 | 0 | 0 | 3 | 147 | 150 | 5 | 2 | 0 | 2 | 0 | 0 | 4 |
| 6 | 1 | 0 | 0 | 2 | 200 | 203 | 6 | 1 | 0 | 0 | 0 | 0 | 1 |
| 7 | 0 | 1 | 0 | 4 | 147 | 152 | 7 | 0 | 0 | 0 | 81 | 0 | 81 |
| 8 | 0 | 0 | 2 | 9 | 107 | 118 | 8 | 0 | 0 | 10 | 18 | 38 | 66 |
| 9 | 0 | 0 | 0 | 10 | 71 | 81 | 9 | 0 | 0 | 2 | 0 | 26 | 28 |
| 10 | 1 | 0 | 2 | 5 | 28 | 36 | 10 | 0 | 0 | 0 | 29 | 3 | 32 |
| 11 | 0 | 0 | 0 | 0 | 10 | 10 | 11 | 0 | 0 | 1 | 13 | 1 | 15 |
| 12 | 0 | 0 | 0 | 0 | 2 | 2 | 12 | 0 | 0 | 2 | 4 | 60 | 66 |
| 13 | 0 | 0 | 0 | 0 | 0 | 0 | 13 | 0 | 0 | 0 | 5 | 62 | 67 |
| 14 | 0 | 0 | 0 | 0 | 0 | 0 | 14 | 0 | 0 | 0 | 10 | 27 | 37 |
| 15 | 0 | 0 | 0 | 0 | 0 | 0 | 15 | 0 | 0 | 0 | 6 | 37 | 43 |
| Total | 2 | 1 | 4 | 33 | 850 | 890 | Total | 6 | 0 | 375 | 166 | 254 | 801 |
| usual power = 0.890 | usual power = 0.801 | ||||||||||||
Comparison of the circular and the flexible spatial scan statistic for the cluster model D. Comparison of bivariate power distribution P(l, s) × 1000 between the circular spatial scan statistic and the flexible spatial scan statistic for the hot-spot cluster D = {73, 74, 75, 76, 78}. Nominal α-level is set as 0.05 and 1000 trials are carried out. For more details, see text.
| Flexible ( | Circular ( | ||||||||||||||
| Length | Include | Total | Length | Include | Total | ||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 0 | 1 | 2 | 3 | 4 | 5 | ||||
| 1 | 0 | 0 | 0 | 1 | 6 | 0 | 6 | ||||||||
| 2 | 1 | 0 | 0 | 1 | 2 | 3 | 5 | 0 | 8 | ||||||
| 3 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 14 | 14 | ||||
| 4 | 1 | 0 | 0 | 1 | 0 | 2 | 4 | 1 | 0 | 4 | 5 | 0 | 10 | ||
| 5 | 0 | 1 | 0 | 3 | 1 | 242 | 247 | 5 | 0 | 0 | 2 | 1 | 0 | 0 | 3 |
| 6 | 1 | 0 | 0 | 1 | 2 | 162 | 166 | 6 | 1 | 0 | 0 | 1 | 363 | 0 | 365 |
| 7 | 2 | 3 | 0 | 5 | 5 | 93 | 108 | 7 | 0 | 0 | 1 | 0 | 56 | 0 | 57 |
| 8 | 1 | 2 | 1 | 6 | 7 | 53 | 70 | 8 | 0 | 0 | 2 | 2 | 28 | 0 | 32 |
| 9 | 0 | 2 | 0 | 1 | 5 | 38 | 46 | 9 | 0 | 0 | 2 | 2 | 10 | 0 | 14 |
| 10 | 0 | 2 | 0 | 1 | 1 | 18 | 22 | 10 | 1 | 0 | 0 | 3 | 3 | 0 | 7 |
| 11 | 0 | 0 | 0 | 2 | 2 | 5 | 9 | 11 | 0 | 0 | 0 | 0 | 3 | 11 | 14 |
| 12 | 0 | 0 | 1 | 0 | 0 | 1 | 2 | 12 | 0 | 0 | 0 | 2 | 3 | 8 | 13 |
| 13 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 | 0 | 0 | 0 | 1 | 1 | 16 | 18 |
| 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 | 0 | 0 | 1 | 0 | 0 | 5 | 6 |
| 15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 | 0 | 1 | 0 | 0 | 1 | 7 | 9 |
| Total | 6 | 10 | 2 | 20 | 23 | 612 | 673 | Total | 12 | 6 | 12 | 31 | 468 | 47 | 576 |
| usual power = 0.673 | usual power = 0.576 | ||||||||||||||
Cost comparison Expected number of undetected regions included in the true cluster E(s* - S), expected number of detected regions not in the true cluster E(L - S) and the ratio of costs C/C2 (r = 1, 2) incurred by incomplete identification of the true cluster. The spatial scan statistic with low values is better.
| Hot-spot Cluster | Scan statistic | the ratio | |||
| Flexible (K = 15) | 0.108 | 2.951 | 3.059 | 3.167 | |
| Circular (K = 15) | 0.065 | 0.722 | 0.787 | 0.852 | |
| Flexible (K = 15) | 0.097 | 2.548 | 2.645 | 2.742 | |
| Circular (K = 15) | 0.735 | 2.525 | 3.260 | 3.995 | |
| Flexible (K = 15) | 0.492 | 2.243 | 2.735 | 3.227 | |
| Circular (K = 15) | 1.736 | 3.153 | 4.889 | 6.625 | |
| Flexible (K = 15) | 1.774 | 1.088 | 2.862 | 4.636 | |
| Circular (K = 15) | 2.770 | 1.709 | 4.479 | 7.249 | |