Hesam Izakian, Witold Pedrycz.
Abstract
The spatial and spatio-temporal scan statistics proposed by Kulldorff have been applied to a number of geographical disease cluster detection problems. Because the scanning window used in these methods is circular or elliptic, they cannot find irregularly shaped clusters, such as clusters occurring along river valleys or in cases where disease transmission is linked to the road network. In this study, we propose a more flexible geometric structure to be used as a spatial or spatio-temporal scanning window. Particle swarm optimization (PSO) is used to optimize the scanning window and thereby determine disease clusters. We evaluated the proposed method on a number of spatial and spatio-temporal datasets (breast cancer mortality in the Northeastern US, 1988–1992, and different types of cancer in New Mexico, 1982–2007). Experimental results demonstrate that the introduced approach surpasses the circular and elliptic scan statistics in terms of efficiency, especially when dealing with irregularly shaped clusters.
Keywords: Particle swarm optimization; Scan statistics; Scanning window
Year: 2012; PMID: 32288990; PMCID: PMC7104009; DOI: 10.1016/j.swevo.2012.02.001
Source DB: PubMed; Journal: Swarm Evol Comput; ISSN: 2210-6502; Impact factor: 7.177
Fig. 1. Ring topology for particle neighborhood formed on the basis of particle indices.
Fig. 2. A circle with eight sectors (a), and its corresponding irregular shape (b).
Fig. 3. Particle structure for spatio-temporal scan statistics.
Fig. 4. Selecting regions within the scanning window. (a) The circular scanner, (b) the regions selected when the whole region must lie inside the scanner, (c) the regions selected when any part of a region lies inside the scanner, (d) the regions selected when the centroid of a region lies inside the scanner.
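The sectored window of Fig. 2 and the centroid rule of Fig. 4(d) can be illustrated with a minimal sketch. It assumes that the window is described by a center and one radius per equal-angle sector, and that coordinates are planar; the function names `in_window` and `select_regions` and the input layout are hypothetical and not taken from the paper.

```python
import math

def in_window(point, center, radii):
    """Return True if `point` lies inside a sectored window.

    Assumption: the window is split into len(radii) equal-angle sectors
    around `center`, and sector k (counted counter-clockwise from the
    positive x-axis) extends out to distance radii[k].
    """
    dx, dy = point[0] - center[0], point[1] - center[1]
    angle = math.atan2(dy, dx) % (2 * math.pi)  # angle in [0, 2*pi)
    # The final modulo guards against rounding that lands exactly on 2*pi.
    k = int(angle / (2 * math.pi / len(radii))) % len(radii)
    return math.hypot(dx, dy) <= radii[k]

def select_regions(centroids, center, radii):
    """Select the regions whose centroid is inside the window (Fig. 4(d) rule)."""
    return [name for name, xy in centroids.items()
            if in_window(xy, center, radii)]

# Illustrative (made-up) centroids and a four-sector window.
centroids = {"A": (1.0, 0.5), "B": (-2.0, 0.5), "C": (1.0, -3.0)}
selected = select_regions(centroids, (0.0, 0.0), [2.0, 1.0, 1.0, 4.0])
```

With a single sector the window reduces to the circular scanner, so the number of sectors controls how irregular the shape can become.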
Checking the connectivity of the regions selected by the scanner.
| … |
| the selected regions are disconnected; |
| the selected regions are connected; |
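Since only the two outcomes of the connectivity check survive in this record, the following is a generic sketch of how such a check could be done with a breadth-first search over a region adjacency map. The `adjacency` input format and the function name `is_connected` are assumptions made for illustration; this is not the authors' procedure.

```python
from collections import deque

def is_connected(selected, adjacency):
    """Return True if the selected regions form one connected group.

    `adjacency` maps each region to the set of regions that share a
    border with it (a hypothetical input format).
    """
    selected = set(selected)
    if not selected:
        return False  # assumption: an empty selection counts as disconnected
    start = next(iter(selected))
    seen = {start}
    queue = deque([start])
    while queue:
        region = queue.popleft()
        for neighbor in adjacency.get(region, ()):
            # Only walk through regions that are part of the selection.
            if neighbor in selected and neighbor not in seen:
                seen.add(neighbor)
                queue.append(neighbor)
    return seen == selected

# Illustrative (made-up) adjacency: A-B-C form a chain, D is isolated.
adjacency = {"A": {"B"}, "B": {"A", "C"}, "C": {"B"}, "D": set()}
```

Here `is_connected(["A", "B", "C"], adjacency)` holds, while `is_connected(["A", "C"], adjacency)` does not, because the only path between A and C runs through the unselected region B.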
Pseudo-code of the proposed PSO approach.
| Create and initialize a swarm with |
| … |
| update the velocity using |
| update the position using |
| … |
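The update equations referenced in the pseudo-code are not preserved in this record. As a rough illustration only, the sketch below uses the standard inertia-weight PSO update together with the index-based ring neighborhood of Fig. 1. The parameter values (`w`, `c1`, `c2`), the clamping of positions to the bounds, and all function names are assumptions, not the settings used in the paper.

```python
import random

def ring_best(pbest_val, i):
    """Index of the best personal best among particle i and its two ring neighbors."""
    n = len(pbest_val)
    return max(((i - 1) % n, i, (i + 1) % n), key=lambda j: pbest_val[j])

def pso_maximize(fitness, dim, bounds, n_particles=20, iters=100,
                 w=0.72, c1=1.49, c2=1.49, seed=0):
    """Maximize `fitness` with a ring-topology (local best) PSO."""
    rng = random.Random(seed)
    lo, hi = bounds
    x = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    v = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in x]
    pbest_val = [fitness(p) for p in x]
    for _ in range(iters):
        for i in range(n_particles):
            lbest = pbest[ring_best(pbest_val, i)]
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Standard velocity update: inertia + cognitive + social terms.
                v[i][d] = (w * v[i][d]
                           + c1 * r1 * (pbest[i][d] - x[i][d])
                           + c2 * r2 * (lbest[d] - x[i][d]))
                # Position update, clamped to the search bounds.
                x[i][d] = min(hi, max(lo, x[i][d] + v[i][d]))
            val = fitness(x[i])
            if val > pbest_val[i]:
                pbest[i], pbest_val[i] = x[i][:], val
    best = max(range(n_particles), key=lambda j: pbest_val[j])
    return pbest[best], pbest_val[best]

# Toy objective with its maximum at (1, 1); a stand-in for the likelihood ratio.
best_x, best_val = pso_maximize(lambda p: -sum((xi - 1.0) ** 2 for xi in p),
                                dim=2, bounds=(-5.0, 5.0))
```

In the setting of the paper, the fitness would be the likelihood ratio of the regions selected by the window that a particle encodes; the toy objective above only demonstrates the mechanics of the swarm.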
Effect of the “number of sectors” on the likelihood ratio in the PSO optimization. For each entry of the table, the first number is the best result among 10 independent runs, followed by the average and the standard deviation.
| Dataset | Statistic | 1 sector | 2 sectors | 4 sectors | 8 sectors | 16 sectors | 32 sectors | 64 sectors |
|---|---|---|---|---|---|---|---|---|
| Mortality | Best | 48.05 | 63.23 | 67.03 | 94.20 | 108.39 | 121.89 | 121.89 |
| | Average | 48.05 | 61.27 | 67.03 | 89.99 | 105.59 | 109.37 | 109.50 |
| | Std. dev. | 0.00 | 1.69 | 0.00 | 4.09 | 1.82 | 7.90 | 5.65 |
| Breast | Best | 345.39 | 345.39 | 345.39 | 345.39 | 345.39 | 345.39 | 345.39 |
| | Average | 345.39 | 345.39 | 345.39 | 345.39 | 345.39 | 345.39 | 345.39 |
| | Std. dev. | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Liver | Best | 82.41 | 82.41 | 89.21 | 92.71 | 95.43 | 95.43 | 95.43 |
| | Average | 82.41 | 82.41 | 89.21 | 92.22 | 92.97 | 93.37 | 94.65 |
| | Std. dev. | 0.00 | 0.00 | 0.00 | 1.53 | 2.01 | 0.85 | 1.41 |
| Lung | Best | 122.98 | 122.98 | 128.15 | 149.11 | 156.06 | 163.67 | 166.05 |
| | Average | 122.98 | 122.98 | 124.10 | 142.06 | 153.71 | 157.45 | 160.07 |
| | Std. dev. | 0.00 | 0.00 | 2.49 | 3.37 | 1.66 | 4.94 | 4.38 |
| Lymphoma | Best | 24.82 | 24.82 | 27.01 | 27.01 | 27.74 | 27.74 | 27.74 |
| | Average | 24.82 | 24.82 | 27.01 | 27.01 | 27.15 | 27.63 | 27.71 |
| | Std. dev. | 0.00 | 0.00 | 0.00 | 0.00 | 0.29 | 0.24 | 0.02 |
| Prostate | Best | 222.25 | 222.25 | 234.03 | 239.62 | 240.00 | 244.21 | 244.21 |
| | Average | 222.25 | 222.25 | 226.82 | 229.30 | 239.73 | 240.53 | 240.19 |
| | Std. dev. | 0.00 | 0.00 | 4.24 | 5.70 | 0.19 | 2.75 | 1.42 |
| Stomach | Best | 12.33 | 18.20 | 21.46 | 21.46 | 21.46 | 21.46 | 21.46 |
| | Average | 12.33 | 18.20 | 21.46 | 21.46 | 21.46 | 21.46 | 21.46 |
| | Std. dev. | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Skin | Best | 25.11 | 25.72 | 27.58 | 28.26 | 28.26 | 28.26 | 28.26 |
| | Average | 25.11 | 25.72 | 27.58 | 28.05 | 28.26 | 28.26 | 28.26 |
| | Std. dev. | 0.00 | 0.00 | 0.00 | 0.33 | 0.00 | 0.00 | 0.00 |
| Thyroid | Best | 171.40 | 178.43 | 181.67 | 181.67 | 181.67 | 182.33 | 186.70 |
| | Average | 171.40 | 175.64 | 181.40 | 180.48 | 181.67 | 181.46 | 182.57 |
| | Std. dev. | 0.00 | 2.85 | 0.85 | 1.33 | 0.00 | 0.64 | 2.19 |
Effect of “selectable population at risk” on likelihood ratio. The first number in each cell of the table is the best result obtained for 10 independent runs, which is followed by the average and the standard deviation.
| Dataset | Statistic | 10% | 20% | 30% | 40% | 50% |
|---|---|---|---|---|---|---|
| Mortality | Best | 73.28 | 104.66 | 121.89 | 121.89 | 121.89 |
| | Average | 69.86 | 96.59 | 112.46 | 110.08 | 109.50 |
| | Std. dev. | 3.24 | 5.23 | 10.45 | 9.01 | 5.65 |
| Breast | Best | 58.75 | 90.83 | 90.83 | 281.26 | 345.39 |
| | Average | 58.75 | 90.83 | 90.83 | 281.26 | 345.39 |
| | Std. dev. | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Liver | Best | 16.38 | 24.43 | 35.68 | 76.27 | 95.43 |
| | Average | 16.34 | 24.41 | 33.91 | 73.69 | 94.65 |
| | Std. dev. | 0.05 | 0.03 | 2.73 | 1.509 | 1.41 |
| Lung | Best | 104.195 | 137.901 | 149.41 | 163.48 | 166.05 |
| | Average | 104.195 | 134.813 | 149.41 | 160.11 | 160.07 |
| | Std. dev. | 0.00 | 2.879 | 0.00 | 3.12 | 4.38 |
| Lymphoma | Best | 7.58 | 8.96 | 14.10 | 21.63 | 27.74 |
| | Average | 7.58 | 8.74 | 13.84 | 21.01 | 27.71 |
| | Std. dev. | 0.00 | 0.27 | 0.302 | 0.84 | 0.02 |
| Prostate | Best | 63.45 | 86.48 | 100.05 | 180.53 | 244.21 |
| | Average | 63.45 | 82.92 | 96.06 | 178.91 | 240.19 |
| | Std. dev. | 0.00 | 3.07 | 2.34 | 2.50 | 1.42 |
| Stomach | Best | 21.46 | 21.46 | 21.46 | 21.46 | 21.46 |
| | Average | 21.46 | 21.46 | 21.46 | 21.46 | 21.46 |
| | Std. dev. | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Skin | Best | 5.75 | 8.01 | 8.57 | 17.98 | 28.26 |
| | Average | 5.75 | 7.60 | 7.90 | 17.62 | 28.26 |
| | Std. dev. | 0.00 | 0.32 | 0.56 | 0.39 | 0.00 |
| Thyroid | Best | 38.70 | 57.79 | 104.84 | 152.19 | 186.70 |
| | Average | 38.02 | 56.71 | 99.33 | 151.68 | 182.57 |
| | Std. dev. | 0.70 | 1.18 | 13.48 | 0.37 | 2.19 |
Comparison of the circular, elliptic, and PSO-based scan statistics for spatial and spatio-temporal datasets. The mortality dataset is spatial; the others are spatio-temporal.
| Dataset | Method | Counties | Cluster period | Cases | Expected | LLR | p-value |
|---|---|---|---|---|---|---|---|
| Mortality | Circular | PADelaware, PAMontgomery, PAPhiladelphia | NA | 3507 | 2 972.10 | 48.05 | 0.001 |
| | Elliptic | NJBergen, NJEssex, NJSomerset, NJUnion, NYWestchester, PADelaware | NA | 4502 | 3 936.79 | 78.90 | 0.001 |
| | PSO | NJAtlantic, NJBergen, NJBurlington, NJEssex, NJMercer, NJMiddlesex, NJMonmouth, NJOcean, NJUnion, NYNassau, NYWestchester, PADelaware, PAMontgomery, PAPhiladelphia | NA | 11 562 | 10 107.30 | 121.89 | 0.001 |
| Breast | Circular | Bernalillo, LosAlamos, Sandoval, SantaFe | 1987–2007 | 12 766 | 10 634.01 | 345.39 | 0.001 |
| | Elliptic | Bernalillo, LosAlamos, Sandoval, SantaFe | 1987–2007 | 12 766 | 10 634.01 | 345.39 | 0.001 |
| | PSO | Bernalillo, LosAlamos, Sandoval, SantaFe | 1987–2007 | 12 766 | 10 634.01 | 345.39 | 0.001 |
| Liver | Circular | Bernalillo, Cibola, McKinley, Sandoval, Socorro, Torrance, Valencia | 1992–2007 | 979 | 692.91 | 82.41 | 0.001 |
| | Elliptic | Bernalillo, Cibola, Guadalupe, McKinley, Sandoval, Socorro, Torrance, Valencia | 1992–2007 | 988 | 697.06 | 84.98 | 0.001 |
| | PSO | Bernalillo, Guadalupe, McKinley, RioArriba, Sandoval, Taos, Torrance, Valencia | 1992–2007 | 1029 | 718.62 | 95.43 | 0.001 |
| Lung | Circular | Chaves, Curry, DeBaca, Eddy, Lea, Quay, Roosevelt | 1987–2007 | 3336 | 2 568.93 | 122.98 | 0.001 |
| | Elliptic | Chaves, Curry, DeBaca, Eddy, Lea, Quay, Roosevelt | 1987–2007 | 3336 | 2 568.93 | 122.98 | 0.001 |
| | PSO | Chaves, Cibola, Curry, DeBaca, Eddy, Guadalupe, Lea, Otero, Quay, Roosevelt, Sandoval, SanJuan, Sierra, Torrance, Valencia | 1982–2007 | 7872 | 6 662.39 | 166.05 | 0.001 |
| Lymphoma | Circular | Bernalillo, LosAlamos, Sandoval, SantaFe | 1990–2007 | 2473 | 2 201.75 | 24.82 | 0.001 |
| | Elliptic | Bernalillo, LosAlamos, Sandoval, SanJuan, SantaFe | 1990–2007 | 2723 | 2 434.20 | 27.01 | 0.001 |
| | PSO | Bernalillo, Curry, DeBaca, Harding, Lincoln, LosAlamos, Mora, Quay, Roosevelt, Sandoval, SantaFe, Union | 1990–2007 | 2821 | 2 526.51 | 27.74 | 0.001 |
| Prostate | Circular | Bernalillo, LosAlamos, Sandoval, SantaFe, Valencia | 1990–2007 | 10 779 | 9 138.03 | 222.25 | 0.001 |
| | Elliptic | Bernalillo, Chaves, Lincoln, LosAlamos, Sandoval, SantaFe, Torrance | 1990–2007 | 11 496 | 9 771.17 | 239.61 | 0.001 |
| | PSO | Bernalillo, Chaves, Cibola, Lincoln, LosAlamos, Sandoval, SantaFe, Socorro | 1990–2007 | 11 786 | 10 037.4 | 244.21 | 0.001 |
| Stomach | Circular | Colfax, Harding, LosAlamos, Mora, RioArriba, SanMiguel, Taos | 1982–2007 | 347 | 266.137 | 12.33 | 0.068 |
| | Elliptic | Guadalupe, Mora, RioArriba, SanMiguel, Taos | 1982–2007 | 297 | 201.73 | 21.15 | 0.032 |
| | PSO | Guadalupe, Harding, Mora, Quay, RioArriba, SanMiguel, Taos | 1982–2007 | 335 | 232.90 | 21.46 | 0.049 |
| Skin | Circular | Bernalillo, Catron, Cibola, Sandoval, SanJuan, Socorro, Valencia | 1986–2007 | 329 | 243.404 | 25.11 | 0.017 |
| | Elliptic | Bernalillo, Lincoln, Sandoval, SanJuan, Socorro, Valencia | 1986–2007 | 331 | 242.85 | 26.63 | 0.021 |
| | PSO | Bernalillo, Lincoln, LosAlamos, Sandoval, SanJuan, Socorro, Valencia | 1986–2007 | 340 | 249.07 | 28.26 | 0.006 |
| Thyroid | Circular | Bernalillo, Cibola, LosAlamos, RioArriba, Sandoval, SantaFe | 2000–2007 | 1070 | 620.908 | 171.40 | 0.001 |
| | Elliptic | Bernalillo, Catron, Grant, Hidalgo, LosAlamos, RioArriba, Sandoval, Socorro, Valencia | 2000–2007 | 1035 | 589.11 | 174.54 | 0.001 |
| | PSO | Bernalillo, Cibola, DeBaca, Guadalupe, Lincoln, LosAlamos, RioArriba, Roosevelt, Sandoval, SantaFe, Socorro | 1998–2007 | 1 325 | 816.015 | 186.70 | 0.001 |
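The LLR column relates the observed cases in a window to the expected count. As a hedged illustration, the sketch below computes the log-likelihood ratio of Kulldorff's scan statistic under the Poisson model. This record does not state which probability model the study used, so the Poisson form is an assumption, as is the function name `poisson_llr`. The total number of cases in the study area is not listed in the table, so it is left as a parameter, and the numbers in the example are synthetic.

```python
import math

def poisson_llr(cases, expected, total_cases):
    """Poisson log-likelihood ratio for one candidate window.

    cases       -- observed cases inside the window
    expected    -- expected cases inside the window under the null hypothesis
    total_cases -- observed cases in the whole study area
    """
    if cases <= expected:
        return 0.0  # only windows with an excess of cases are scored
    inside = cases * math.log(cases / expected)
    outside_cases = total_cases - cases
    outside = 0.0
    if outside_cases > 0:
        outside = outside_cases * math.log(
            outside_cases / (total_cases - expected))
    return inside + outside

# Synthetic example: 30 observed vs. 20 expected cases out of 100 in total.
example_llr = poisson_llr(30, 20.0, 100)
```

A window with more cases for the same expected count receives a higher score, which is why enlarging or reshaping the window can raise the LLR only when the added regions carry a genuine excess of cases.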
Comparison between the circular, elliptic, and PSO methods for different increase amounts (5%, 10%, 20%; not applicable to PSO). In each cell, the first number is the run time and the second number is the LLR value.
| Dataset | Circular, 5% | Circular, 10% | Circular, 20% | Elliptic, 5% | Elliptic, 10% | Elliptic, 20% | PSO |
|---|---|---|---|---|---|---|---|
| Mortality | 0.017, 48.05 | 0.005, 39.97 | 0.003, 30.99 | 21.78, 68.33 | 0.73, 53.47 | 0.036, 47.62 | 62.71, 121.89 |
| Breast | 1.33, 345.39 | 0.174, 345.39 | 0.032, 306.51 | 116.13, 345.39 | 4.59, 345.39 | 0.233, 341.35 | 27.45, 345.39 |
| Liver | 1.34, 82.41 | 0.164, 82.41 | 0.033, 75.39 | 120.35, 84.98 | 4.58, 84.82 | 0.21, 80.59 | 47.85, 95.43 |
| Lung | 1.48, 122.98 | 0.162, 122.98 | 0.041, 122.98 | 128.86, 122.98 | 4.50, 122.98 | 0.23, 122.98 | 44.40, 166.05 |
| Lymphoma | 1.39, 24.82 | 0.178, 24.82 | 0.032, 21.56 | 122.97, 27.01 | 4.73, 26.61 | 0.23, 26.61 | 28.57, 27.74 |
| Prostate | 1.32, 222.25 | 0.165, 222.25 | 0.034, 209.48 | 118.19, 239.61 | 4.18, 224.64 | 0.21, 213.86 | 25.93, 244.21 |
| Stomach | 1.37, 12.33 | 0.169, 12.17 | 0.035, 12.17 | 125.93, 21.15 | 4.52, 21.14 | 0.20, 19.35 | 24.03, 21.46 |
| Skin | 1.35, 22.03 | 0.168, 20.04 | 0.036, 18.74 | 140.94, 26.63 | 5.02, 26.38 | 0.23, 24.07 | 29.59, 28.26 |
| Thyroid | 1.51, 170.86 | 0.182, 170.86 | 0.039, 170.86 | 149.87, 174.54 | 4.59, 174.54 | 0.23, 170.86 | 40.01, 186.70 |