Stéphanie Goujon-Bellec, Claire Demoury, Aurélie Guyot-Goubin, Denis Hémon, Jacqueline Clavel.
Abstract
BACKGROUND: For many years, the detection of clusters has been of great public health interest. Several detection methods have been developed, the best known of which is the circular scan method. The present study, conducted in the context of a rare disease distributed over a large territory (7675 cases registered over 17 years and located in 1895 units), aimed to evaluate the performance of several of these methods in realistic hot-spot cluster situations.
Year: 2011 PMID: 21970516 PMCID: PMC3204219 DOI: 10.1186/1476-072X-10-53
Source DB: PubMed Journal: Int J Health Geogr ISSN: 1476-072X Impact factor: 3.918
Number of communes, area and population of the 1895 living zones (LZ) in France.
| | Mean | Minimum | Q1 | Median | Q3 | Maximum |
|---|---|---|---|---|---|---|
| Number of communes | 19.1 | 1 | 7 | 13 | 24 | 556 |
| Area (km²) | 282.3 | 0.4 | 108.8 | 193 | 340.2 | 3863.2 |
| Total population (1999) | 30542.0 | 270 | 6219.8 | 9754.5 | 17968 | 9802327 |
| Population 0-14 years (1999) | 5453.7 | 71 | 1068 | 1725 | 3298 | 1823195 |
| AL incidence rate | 40.8 | 0.0 | 0.0 | 36.5 | 58.6 | 545.9 |
| Expected cases of AL per LZ (1990-2006) | 4.1 | 0.1 | 0.8 | 1.3 | 2.4 | 1388.3 |
AL: childhood acute leukemia. Q1: first quartile. Q3: third quartile
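As a rough illustration of the "Expected cases of AL per LZ" row, the sketch below allocates a total number of cases across units in proportion to the population at risk. This is a simplifying assumption: the source does not say how expected counts were computed, and an actual analysis may also adjust for age and calendar period. The function name and the example populations are hypothetical. As a sanity check on the table, 7675 cases spread over 1895 units gives about 4.05 cases per unit, in line with the reported mean of 4.1.

```python
def expected_cases(populations, total_cases):
    """Allocate total_cases across units in proportion to population at risk.

    Assumption: homogeneous risk, with no age or period adjustment.
    """
    total_pop = sum(populations)
    if total_pop <= 0:
        raise ValueError("total population must be positive")
    return [total_cases * p / total_pop for p in populations]


# Hypothetical populations aged 0-14 for three units (illustrative values only).
pops = [1000, 3000, 6000]
print(expected_cases(pops, 20))  # [2.0, 6.0, 12.0]

# Average expected count per unit implied by the abstract's figures.
print(round(7675 / 1895, 2))  # 4.05
```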
Figure 1. Illustration of five cluster candidate areas in the flexible scan method [3]. (a) Neighborhood of the cross unit. (b)-(f) Five particular cluster candidate areas (in yellow) included in the neighborhood of the cross unit.
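The idea behind Figure 1, that candidate areas are connected sets of units lying inside the neighborhood of a starting unit, can be sketched with a brute-force enumeration. This is only an illustration under stated assumptions: the adjacency map, unit names and function names are hypothetical, and the enumeration is exponential in the neighborhood size, so it is not meant to represent how the flexible scan method of [3] is implemented.

```python
from itertools import combinations


def is_connected(units, adjacency):
    """Return True if `units` form a single connected component."""
    units = set(units)
    if not units:
        return False
    start = next(iter(units))
    seen = {start}
    stack = [start]
    while stack:
        u = stack.pop()
        for v in adjacency.get(u, ()):
            if v in units and v not in seen:
                seen.add(v)
                stack.append(v)
    return seen == units


def candidate_areas(center, neighborhood, adjacency):
    """Yield every connected subset of `neighborhood` that contains `center`."""
    others = sorted(set(neighborhood) - {center})
    for size in range(len(others) + 1):
        for extra in combinations(others, size):
            area = frozenset((center,) + extra)
            if is_connected(area, adjacency):
                yield area


# Hypothetical 4-unit map: A touches B and C, and C touches D.
adjacency = {"A": {"B", "C"}, "B": {"A"}, "C": {"A", "D"}, "D": {"C"}}
areas = list(candidate_areas("A", ["A", "B", "C", "D"], adjacency))
print(len(areas))  # 6
```

In this toy map, the sets {A, D} and {A, B, D} are rejected because D only touches C, which shows how the connectivity constraint prunes candidates that a purely distance-based window would accept.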
Figure 2. Illustration of the genetic algorithm method: offspring resulting from the cross between two parent areas. (a)-(b) The two parent areas. (c) Cross between the two parent areas, with units in parent 'a' and parent 'b' coded positively and negatively, respectively, and the intersection coded 0 (in green). (d)-(f) Three offspring created using the genetic algorithm procedure (see [7]).
Description of the simulated alternative cluster scenarios
| | #1 | #2 | #3 | #4 | #5 | #6 | #7 | #8 | #9 |
|---|---|---|---|---|---|---|---|---|---|
| | Linear | U-shaped | Compact | Linear | U-shaped | Compact | Linear | U-shaped | Compact |
| | No. LZ = 6 | No. LZ = 10 | No. LZ = 8 | No. LZ = 7 | No. LZ = 7 | No. LZ = 11 | No. LZ = 12 | No. LZ = 16 | No. LZ = 13 |
| Total | | | | | | | | | |
| Mean | 24632.3 | 15306.7 | 17088.0 | 51536.0 | 50155.6 | 36743.3 | 79046.4 | 59653.8 | 76085.8 |
| SD | 35843.1 | 15350.6 | 16754.8 | 100719.4 | 101328.7 | 81488.4 | 223805.2 | 194750.3 | 224749.6 |
| Min | 6519 | 3963 | 6519 | 8267 | 7177 | 2949 | 5202 | 2259 | 4051 |
| Max | 97315 | 57355 | 57355 | 279841 | 279841 | 279841 | 788887 | 788887 | 788887 |
No. AL: number of childhood acute leukemia cases; No. LZ: number of living zones in the cluster.
1 "small", "moderate" and "large" clusters are clusters with about 20, 45 and 115 cases of childhood acute leukemia over the period 1990-2006, respectively.
Figure 3. The 9 clusters under study (3 cluster shapes and 3 cluster locations). The nine scenarios combined 3 cluster shapes (linear, U-shaped and compact) with 3 locations corresponding to 3 population sizes (20 expected cases for clusters 1-3, 45 expected cases for clusters 4-6 and 115 expected cases for clusters 7-9).
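One common way to generate a replicated dataset for a hot-spot scenario like these is to fix the total number of cases and distribute them multinomially, with weights equal to the expected counts multiplied by the relative risk inside the true cluster. The source does not describe its generating mechanism, so the sketch below is an assumption rather than the study's procedure; the function name, unit labels and expected counts are hypothetical.

```python
import random


def simulate_cases(expected, cluster, rr, total_cases, seed=None):
    """Draw one dataset with risk multiplied by `rr` inside `cluster`.

    Assumption: cases are allocated multinomially, conditional on
    `total_cases`, with weights proportional to the expected counts
    (inflated by `rr` for units in the cluster).
    """
    rng = random.Random(seed)
    units = list(expected)
    weights = [expected[u] * (rr if u in cluster else 1.0) for u in units]
    counts = dict.fromkeys(units, 0)
    for u in rng.choices(units, weights=weights, k=total_cases):
        counts[u] += 1
    return counts


# Hypothetical expected counts for five units; "c" and "d" form the cluster.
expected = {"a": 4.0, "b": 2.0, "c": 1.0, "d": 3.0, "e": 10.0}
counts = simulate_cases(expected, cluster={"c", "d"}, rr=2.0, total_cases=20, seed=1)
print(sum(counts.values()))  # 20
```

Conditioning on the total keeps every replication comparable to the observed data, and repeating the draw with different seeds yields the set of replications on which power is estimated.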
Performance of cluster detection methods on one replicated dataset.
| | Scan-c | Scan-e0 | FleX | GA-1 | Double | Mlink |
|---|---|---|---|---|---|---|
| p-value | < 0.0001 | < 0.0001 | < 0.0001 | < 0.0001 | < 0.0001 | < 0.0001 |
| No. LZ | 18 | 18 | 13 | 13 | 8 | 13 |
| True positive LZ¹ | 10 | 11 | 8 | 9 | 6 | 7 |
| Sensitivity² | 0.91 | 1.00 | 0.73 | 0.82 | 0.55 | 0.64 |
| PPV³ | 0.56 | 0.61 | 0.62 | 0.69 | 0.75 | 0.54 |
| Cost⁴ | 9 | 7 | 8 | 6 | 7 | 10 |
Results for the sixth cluster scenario (compact cluster, E = 50.1 cases, covering 11 LZ), with a relative risk of 2.
Results based on 250 Monte Carlo replications. Scan-c: circular scan method, Scan-e0: elliptic scan method with no penalty, FleX: unrestricted flexible scan method, GA-1: strongly penalized genetic algorithm, Double and Mlink: dynamic minimum spanning tree method with double and maximum link connections, respectively. No. LZ: number of living zones in the detected cluster. ¹ Number of living zones in the intersection of the true and detected clusters. ² Sensitivity: proportion of living zones in the true cluster that are correctly detected. ³ PPV (positive predictive value): proportion of living zones in the detected cluster that are in the "true" cluster. ⁴ Cost: number of living zones that are either missed or erroneously detected.
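The three accuracy measures defined in the footnotes can be written directly as set operations. The sketch below is illustrative: the function name and the unit identifiers are hypothetical, and the example sets are built only to match the counts reported for Scan-c in the table (11 LZ in the true cluster, 18 detected, 10 in common).

```python
def detection_metrics(true_cluster, detected_cluster):
    """Return (sensitivity, PPV, cost) for a detected cluster.

    sensitivity: share of true-cluster units that were detected
    PPV:         share of detected units that lie in the true cluster
    cost:        units missed plus units erroneously detected
    """
    true_set, detected = set(true_cluster), set(detected_cluster)
    if not true_set or not detected:
        raise ValueError("both clusters must contain at least one unit")
    true_positives = len(true_set & detected)
    sensitivity = true_positives / len(true_set)
    ppv = true_positives / len(detected)
    cost = len(true_set ^ detected)  # symmetric difference
    return sensitivity, ppv, cost


# Hypothetical unit ids: 11 true units, 18 detected units, 10 shared.
true_units = range(11)
detected_units = list(range(1, 11)) + list(range(100, 108))
sens, ppv, cost = detection_metrics(true_units, detected_units)
print(round(sens, 2), round(ppv, 2), cost)  # 0.91 0.56 9
```

The same arithmetic reproduces the other columns, for example 6 shared units out of 8 detected for Double gives a sensitivity of 0.55, a PPV of 0.75 and a cost of 7.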
Usual power of the cluster detection methods
| | | #1 | #2 | #3 | #4 | #5 | #6 | #7 | #8 | #9 |
|---|---|---|---|---|---|---|---|---|---|---|
| | | Linear | U-shaped | Compact | Linear | U-shaped | Compact | Linear | U-shaped | Compact |
| | | No. LZ = 6 | No. LZ = 10 | No. LZ = 8 | No. LZ = 7 | No. LZ = 7 | No. LZ = 11 | No. LZ = 12 | No. LZ = 16 | No. LZ = 13 |
| | Scan-c | 0.18 | 0.16 | 0.14 | 0.30 | 0.32 | 0.40 | | | |
| | Scan-e0 | 0.11 | 0.11 | 0.11 | 0.26 | 0.26 | 0.36 | 0.79 | | |
| | FleX | 0.12 | 0.15 | 0.16 | 0.25 | 0.29 | 0.36 | 0.78 | 0.74 | 0.76 |
| | GA-1 | 0.14 | 0.11 | 0.10 | 0.33 | 0.30 | 0.44 | | | |
| | Double | 0.14 | 0.13 | 0.13 | 0.26 | 0.25 | 0.39 | 0.79 | 0.78 | |
| | Mlink | 0.65 | 0.60 | 0.60 | 0.71 | 0.73 | 0.79 | | | |
| | Scan-c | 0.46 | 0.49 | 0.52 | | | | | | |
| | Scan-e0 | 0.51 | 0.51 | 0.50 | | | | | | |
| | FleX | 0.52 | 0.57 | 0.52 | | | | | | |
| | GA-1 | 0.32 | 0.41 | 0.43 | | | | | | |
| | Double | 0.44 | 0.42 | 0.48 | | | | | | |
| | Mlink | | | | | | | | | |
| | Scan-c | | | | | | | | | |
| | Scan-e0 | | | | | | | | | |
| | FleX | | | | | | | | | |
| | GA-1 | | | | | | | | | |
| | Double | | | | | | | | | |
| | Mlink | | | | | | | | | |
Scan-c: circular scan method, Scan-e0: standard elliptic scan method, FleX: unrestricted flexible scan method, GA-1: strongly penalized genetic algorithm, Double and Mlink: dynamic minimum spanning tree method with double and maximum link connections, respectively. No. LZ: size of the cluster (number of living zones); RR: relative risk in the true cluster. Results based on 250 Monte Carlo replications.
1 "small", "moderate" and "large" clusters are clusters with about 20, 45 and 115 cases of childhood acute leukemia over the period 1990-2006, respectively.
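Usual power is conventionally estimated as the share of replicated datasets in which the most likely cluster is declared significant. The sketch below assumes a 5% significance level, which the source does not state, and also reports the binomial standard error of the estimate, which is useful for judging differences between methods when only 250 replications are available. The function name and the p-values are hypothetical.

```python
import math


def estimate_power(p_values, alpha=0.05):
    """Return (power, standard error) from one p-value per replication.

    Assumption: a replication counts as a detection when its p-value
    is at most `alpha` (0.05 here, not stated in the source).
    """
    n = len(p_values)
    if n == 0:
        raise ValueError("at least one replication is required")
    power = sum(p <= alpha for p in p_values) / n
    std_error = math.sqrt(power * (1 - power) / n)
    return power, std_error


# Hypothetical run: 100 significant replications out of 250.
p_values = [0.01] * 100 + [0.50] * 150
power, std_error = estimate_power(p_values)
print(power, round(std_error, 3))  # 0.4 0.031
```

With 250 replications, an estimated power of 0.40 carries a standard error of about 0.03, so differences of a few hundredths between methods are within simulation noise.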
Power to detect at least one LZ in the true cluster.
| | | #1 | #2 | #3 | #4 | #5 | #6 | #7 | #8 | #9 |
|---|---|---|---|---|---|---|---|---|---|---|
| | | Linear | U-shaped | Compact | Linear | U-shaped | Compact | Linear | U-shaped | Compact |
| | | No. LZ = 6 | No. LZ = 10 | No. LZ = 8 | No. LZ = 7 | No. LZ = 7 | No. LZ = 11 | No. LZ = 12 | No. LZ = 16 | No. LZ = 13 |
| | Scan-c | 0.04 | 0.07 | 0.05 | 0.22 | 0.21 | 0.34 | | | |
| | Scan-e0 | 0.03 | 0.02 | 0.05 | 0.19 | 0.19 | 0.31 | 0.76 | | |
| | FleX | 0.04 | 0.07 | 0.07 | 0.17 | 0.19 | 0.28 | 0.76 | 0.72 | 0.72 |
| | GA-1 | 0.02 | 0.05 | 0.02 | 0.24 | 0.23 | 0.36 | | | |
| | Double | 0.03 | 0.06 | 0.05 | 0.17 | 0.16 | 0.32 | 0.76 | 0.74 | |
| | Mlink | 0.11 | 0.13 | 0.11 | 0.34 | 0.34 | 0.50 | | | |
| | Scan-c | 0.38 | 0.41 | 0.44 | | | | | | |
| | Scan-e0 | 0.43 | 0.39 | 0.46 | | | | | | |
| | FleX | 0.44 | 0.54 | 0.47 | | | | | | |
| | GA-1 | 0.25 | 0.37 | 0.34 | | | | | | |
| | Double | 0.38 | 0.36 | 0.42 | | | | | | |
| | Mlink | 0.59 | 0.57 | 0.58 | | | | | | |
| | Scan-c | | | | | | | | | |
| | Scan-e0 | | | | | | | | | |
| | FleX | | | | | | | | | |
| | GA-1 | | | | | | | | | |
| | Double | | | | | | | | | |
| | Mlink | | | | | | | | | |
Scan-c: circular scan method, Scan-e0: standard elliptic scan method, FleX: unrestricted flexible scan method, GA-1: strongly penalized genetic algorithm, Double and Mlink: dynamic minimum spanning tree method with double and maximum link connections, respectively. No. LZ: size of the cluster (number of living zones); RR: relative risk in the true cluster. Results based on 250 Monte Carlo replications.
1 "small", "moderate" and "large" clusters are clusters with about 20, 45 and 115 cases of childhood acute leukemia over the period 1990-2006, respectively.
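The stricter criterion used in this table counts a replication as a success only when the detected cluster is significant and shares at least one living zone with the true cluster. The sketch below illustrates that rule; the 5% threshold, the function name and the toy replications are assumptions, not values from the study.

```python
def power_at_least_one(replications, true_cluster, alpha=0.05):
    """Share of replications with a significant cluster overlapping the truth.

    `replications` is a sequence of (p_value, detected_units) pairs.
    Assumption: significance is judged at `alpha` (0.05 here).
    """
    true_set = set(true_cluster)
    reps = list(replications)
    if not reps:
        raise ValueError("at least one replication is required")
    hits = sum(
        1 for p_value, detected in reps
        if p_value <= alpha and true_set & set(detected)
    )
    return hits / len(reps)


# Hypothetical replications for a true cluster made of units 1, 2 and 3.
reps = [
    (0.01, {1, 2}),   # significant and overlapping: counted
    (0.01, {9}),      # significant but located elsewhere: not counted
    (0.20, {1}),      # overlapping but not significant: not counted
    (0.03, {3, 8}),   # significant and overlapping: counted
]
print(power_at_least_one(reps, true_cluster={1, 2, 3}))  # 0.5
```

In this toy example three of the four replications are significant, so usual power would be 0.75 while the location-aware power is 0.5. This is why each value in this table can be no larger than the corresponding usual power.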
Average sensitivity, PPV and cost of cluster detection methods
| | | #1 | #2 | #3 | #4 | #5 | #6 | #7 | #8 | #9 |
|---|---|---|---|---|---|---|---|---|---|---|
| | | Linear | U-shaped | Compact | Linear | U-shaped | Compact | Linear | U-shaped | Compact |
| | | 6 LZ | 10 LZ | 8 LZ | 7 LZ | 7 LZ | 11 LZ | 12 LZ | 16 LZ | 13 LZ |
| Sensitivity | | | | | | | | | | |
| RR = 1.5 | Scan-c | 0.52 | 0.40 | 0.68 | | | | | | |
| | Scan-e0 | 0.52 | 0.38 | 0.61 | | | | | | |
| | FleX | 0.43 | 0.35 | 0.49 | | | | | | |
| | GA-1 | 0.37 | 0.30 | 0.47 | | | | | | |
| | Double | 0.42 | 0.31 | 0.39 | | | | | | |
| | Mlink | 0.49 | 0.38 | 0.46 | | | | | | |
| RR = 2.0 | Scan-c | 0.49 | 0.63 | 0.81 | 0.59 | 0.49 | 0.82 | | | |
| | Scan-e0 | 0.64 | 0.46 | 0.75 | 0.71 | 0.45 | 0.78 | | | |
| | FleX | 0.57 | 0.68 | 0.64 | 0.50 | 0.43 | 0.64 | | | |
| | GA-1 | 0.54 | 0.54 | 0.78 | 0.45 | 0.35 | 0.60 | | | |
| | Double | 0.48 | 0.48 | 0.63 | 0.50 | 0.37 | 0.51 | | | |
| | Mlink | 0.50 | 0.54 | 0.69 | 0.58 | 0.46 | 0.57 | | | |
| RR = 3.0 | Scan-c | 0.48 | 0.43 | 0.79 | 0.51 | 0.75 | 0.93 | 0.63 | 0.54 | 0.90 |
| | Scan-e0 | 0.82 | 0.18 | 0.82 | 0.85 | 0.50 | 0.92 | 0.89 | 0.49 | 0.88 |
| | FleX | 0.68 | 0.70 | 0.82 | 0.69 | 0.86 | 0.78 | 0.61 | 0.51 | 0.75 |
| | GA-1 | 0.65 | 0.60 | 0.73 | 0.53 | 0.57 | 0.86 | 0.57 | 0.39 | 0.70 |
| | Double | 0.43 | 0.39 | 0.81 | 0.52 | 0.57 | 0.79 | 0.60 | 0.44 | 0.59 |
| | Mlink | 0.44 | 0.44 | 0.66 | 0.55 | 0.63 | 0.81 | 0.66 | 0.49 | 0.62 |
| PPV | | | | | | | | | | |
| RR = 1.5 | Scan-c | 0.46 | 0.50 | 0.69 | | | | | | |
| | Scan-e0 | 0.40 | 0.39 | 0.53 | | | | | | |
| | FleX | 0.45 | 0.49 | 0.57 | | | | | | |
| | GA-1 | 0.47 | 0.52 | 0.58 | | | | | | |
| | Double | 0.54 | 0.54 | 0.62 | | | | | | |
| | Mlink | 0.43 | 0.46 | 0.51 | | | | | | |
| RR = 2.0 | Scan-c | 0.39 | 0.46 | 0.81 | 0.57 | 0.57 | 0.84 | | | |
| | Scan-e0 | 0.47 | 0.33 | 0.65 | 0.61 | 0.52 | 0.73 | | | |
| | FleX | 0.45 | 0.49 | 0.67 | 0.57 | 0.61 | 0.74 | | | |
| | GA-1 | 0.42 | 0.41 | 0.77 | 0.68 | 0.69 | 0.76 | | | |
| | Double | 0.56 | 0.55 | 0.77 | 0.70 | 0.67 | 0.80 | | | |
| | Mlink | 0.50 | 0.51 | 0.75 | 0.63 | 0.59 | 0.74 | | | |
| RR = 3.0 | Scan-c | 0.51 | 0.68 | 0.73 | 0.43 | 0.59 | 0.90 | 0.68 | 0.63 | 0.92 |
| | Scan-e0 | 0.73 | 0.25 | 0.73 | 0.73 | 0.44 | 0.90 | 0.73 | 0.53 | 0.89 |
| | FleX | 0.57 | 0.76 | 0.77 | 0.65 | 0.70 | 0.83 | 0.75 | 0.79 | 0.88 |
| | GA-1 | 0.55 | 0.79 | 0.82 | 0.52 | 0.55 | 0.89 | 0.88 | 0.86 | 0.86 |
| | Double | 0.72 | 0.73 | 0.88 | 0.70 | 0.73 | 0.92 | 0.85 | 0.81 | 0.89 |
| | Mlink | 0.65 | 0.66 | 0.80 | 0.68 | 0.72 | 0.91 | 0.82 | 0.70 | 0.91 |
| Cost | | | | | | | | | | |
| RR = 1.5 | Scan-c | 14.00 | 16.70 | 8.60 | | | | | | |
| | Scan-e0 | 15.50 | 19.80 | 12.50 | | | | | | |
| | FleX | 13.20 | 16.20 | 11.30 | | | | | | |
| | GA-1 | 13.30 | 16.10 | 11.40 | | | | | | |
| | Double | 11.60 | 15.60 | 11.40 | | | | | | |
| | Mlink | 14.40 | 17.40 | 13.60 | | | | | | |
| RR = 2.0 | Scan-c | 10.30 | 8.40 | 4.80 | 11.10 | 14.60 | 4.80 | | | |
| | Scan-e0 | 9.20 | 11.70 | 8.10 | 9.30 | 16.00 | 6.90 | | | |
| | FleX | 8.30 | 7.40 | 7.60 | 10.70 | 13.60 | 7.70 | | | |
| | GA-1 | 9.20 | 9.50 | 5.30 | 9.60 | 13.40 | 7.70 | | | |
| | Double | 7.00 | 6.90 | 6.60 | 9.10 | 13.40 | 8.40 | | | |
| | Mlink | 8.50 | 8.00 | 6.60 | 10.10 | 14.80 | 9.10 | | | |
| RR = 3.0 | Scan-c | 8.00 | 9.10 | 4.50 | 9.10 | 5.70 | 2.10 | 8.70 | 12.70 | 2.40 |
| | Scan-e0 | 3.60 | 13.80 | 4.60 | 3.70 | 8.50 | 2.30 | 5.50 | 15.40 | 3.10 |
| | FleX | 5.50 | 5.50 | 3.80 | 5.20 | 3.80 | 4.30 | 7.20 | 10.10 | 4.70 |
| | GA-1 | 6.00 | 6.10 | 3.70 | 7.20 | 6.60 | 2.90 | 6.20 | 11.00 | 5.50 |
| | Double | 5.10 | 8.10 | 2.80 | 5.20 | 4.60 | 3.30 | 6.40 | 11.00 | 6.40 |
| | Mlink | 5.90 | 9.10 | 4.80 | 5.30 | 4.60 | 3.20 | 6.40 | 12.60 | 6.10 |
Scan-c: circular scan method, Scan-e0: standard elliptic scan method, FleX: unrestricted flexible scan method, GA-1: strongly penalized genetic algorithm, Double and Mlink: dynamic minimum spanning tree method with double and maximum link connections, respectively. Results based on 250 Monte Carlo replications. 1 "small", "moderate" and "large" clusters are clusters with about 20, 45 and 115 cases of childhood acute leukemia over the period 1990-2006, respectively.