| Literature DB >> 23785424 |
Xing Zhao1, Xiao-Hua Zhou, Zijian Feng, Pengfei Guo, Hongyan He, Tao Zhang, Lei Duan, Xiaosong Li.
Abstract
As a useful tool for geographical cluster detection of events, the spatial scan statistic is widely applied in many fields and plays an increasingly important role. The classic version of the spatial scan statistic for the binary outcome is developed by Kulldorff, based on the Bernoulli or the Poisson probability model. In this paper, we apply the Hypergeometric probability model to construct the likelihood function under the null hypothesis. Compared with existing methods, the likelihood function under the null hypothesis is an alternative and indirect method to identify the potential cluster, and the test statistic is the extreme value of the likelihood function. Similar with Kulldorff's methods, we adopt Monte Carlo test for the test of significance. Both methods are applied for detecting spatial clusters of Japanese encephalitis in Sichuan province, China, in 2009, and the detected clusters are identical. Through a simulation to independent benchmark data, it is indicated that the test statistic based on the Hypergeometric model outweighs Kulldorff's statistics for clusters of high population density or large size; otherwise Kulldorff's statistics are superior.Entities:
Mesh:
Year: 2013 PMID: 23785424 PMCID: PMC3681795 DOI: 10.1371/journal.pone.0065419
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Choropleth map of empirical Bayes estimates of relative risk of Japanese encephalitis in Sichuan province in 2009.
Figure 2Detected clusters of Japanese encephalitis in Sichuan province in 2009.
On a significance level of 0.05, the test statistics based on Poisson and Hypergeometric models obtained almost the same results. They detected two significant clusters, the most likely cluster in the southeast with 18 counties and one secondary cluster in the northeast with 12 counties, both with P value of 0.0001. They differs slightly on statistically insignificant clusters.
SEN and PPV of test statistics based on the Poisson and the Hypergeometric models.
| county | population | ||||||
| SEN (%) | PPV (%) | SEN (%) | PPV (%) | ||||
| rural | ◊ | 1 | H | 99.19 | 97.30 | 99.19 | 97.08 |
| P | 99.81 | 97.73 | 99.81 | 97.37 | |||
| ◊ | 2 | H | 87.37 | 96.11 | 95.98 | 95.93 | |
| P | 87.58 | 96.58 | 96.36 | 96.37 | |||
| ◊ | 4 | H | 93.38 | 82.36 | 93.86 | 87.00 | |
| P | 93.49 | 83.23 | 93.97 | 87.87 | |||
| 8 | H | 86.71 | 83.84 | 88.87 | 85.88 | ||
| P | 86.70 | 85.12 | 88.94 | 87.18 | |||
| 16 | H | 82.74 | 88.02 | 84.80 | 86.17 | ||
| P | 82.20 | 88.89 | 84.35 | 87.17 | |||
| urban | ◊ | 1 | H | 90.22 | 80.96 | 90.22 | 83.19 |
| P | 91.97 | 83.85 | 91.97 | 86.11 | |||
| 2 | H | 86.70 | 81.36 | 88.09 | 81.94 | ||
| P | 86.20 | 82.53 | 87.79 | 83.23 | |||
|
| 4 | H | 86.09 | 76.66 | 86.21 | 79.83 | |
| P | 84.00 | 76.39 | 84.08 | 79.30 | |||
|
| 8 | H | 83.47 | 73.48 | 86.40 | 81.26 | |
| P | 81.07 | 72.69 | 84.00 | 80.03 | |||
|
| 16 | H | 82.71 | 73.49 | 85.78 | 84.53 | |
| P | 80.48 | 72.47 | 83.55 | 83.02 | |||
| rural and mixed | ◊ | 1 | H | 94.10 | 74.24 | 89.32 | 87.54 |
| P | 94.70 | 75.46 | 89.77 | 88.32 | |||
| ◊ | 2 | H | 84.22 | 77.69 | 88.43 | 89.61 | |
| P | 84.23 | 78.88 | 88.47 | 90.44 | |||
| 4 | H | 84.27 | 71.79 | 84.50 | 87.53 | ||
| P | 84.22 | 72.93 | 84.08 | 88.29 | |||
| 8 | H | 78.31 | 77.39 | 82.21 | 89.10 | ||
| P | 77.87 | 78.57 | 81.57 | 89.94 | |||
| 16 | H | 74.12 | 84.43 | 80.22 | 90.21 | ||
| P | 73.27 | 85.29 | 79.20 | 90.88 | |||
| mixed and urban | ◊ | 1 | H | 84.63 | 71.37 | 84.54 | 88.29 |
| P | 84.94 | 73.37 | 84.86 | 89.56 | |||
| 2 | H | 78.43 | 75.57 | 81.70 | 89.37 | ||
| P | 78.32 | 77.04 | 81.64 | 90.50 | |||
| 4 | H | 70.80 | 73.12 | 72.37 | 88.02 | ||
| P | 68.80 | 73.97 | 69.80 | 88.34 | |||
| 8 | H | 62.29 | 75.54 | 67.72 | 87.88 | ||
| P | 59.69 | 75.66 | 64.14 | 87.67 | |||
|
| 16 | H | 53.75 | 78.60 | 58.20 | 88.52 | |
| P | 50.83 | 78.32 | 54.21 | 87.69 | |||
P: Denote the test statistic based on the Poisson probability model.
H: Denote the test statistic based on the Hypergeometric probability model.
◊: Denote that the test statistic based on the Poisson model performs better.
: Denote that the test statistic based on the Hypergeometric model performs better.