| Literature DB >> 31419252 |
Abstract
Spatial scan statistics are widely used for cluster detection analysis in geographical disease surveillance. While this method has been developed for various types of data such as binary, count, and continuous data, spatial scan statistics for matched case-control data, which often arise in spatial epidemiology, have not been considered. We propose spatial scan statistics for matched case-control data. The proposed test statistics consider the correlations between matched pairs. We evaluate the statistical power and cluster detection accuracy of the proposed methods through simulations compared to the Bernoulli-based method. We illustrate the proposed methods using a real data example. The simulation study clearly revealed that the proposed methods had higher power and higher accuracy for detecting spatial clusters for matched case-control data than the Bernoulli-based spatial scan statistic. The cluster detection result of the real data example also appeared to reflect a higher power of the proposed methods. The proposed methods are very useful for spatial cluster detection for matched case-control data.Entities:
Mesh:
Year: 2019 PMID: 31419252 PMCID: PMC6697355 DOI: 10.1371/journal.pone.0221225
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Probability (data) structure for the matched case-control data with respect to belonging to window z (in) or not (out).
| For a given | Control | |||
|---|---|---|---|---|
| in | out | |||
| Case | in | |||
| out | ||||
| 1 ( | ||||
Fig 1A true cluster created for the simulation study.
Five different scenarios assumed for the probability structure in the simulation study.
| ( | ||
|---|---|---|
| (0.25, 0.25, 0.25, 0.25) | 1 | 1 |
| (0.05, 0.25, 0.15, 0.55) | 1.67 | 1.71 |
| (0.10, 0.20, 0.10, 0.60) | 2 | 1.71 |
| (0.15, 0.15, 0.05, 0.65) | 3 | 1.71 |
| (0.18, 0.12, 0.02, 0.68) | 6 | 1.71 |
Estimated power, sensitivity, and PPV with the number of matched pairs = 100 (highest value across three methods is shown in bold).
| ( | Bernoulli-based | |||
|---|---|---|---|---|
| (0.05, 0.25, 0.15, 0.55) | Power | 0.153 | 0.113 | |
| Sensitivity | 0.799 | 0.720 | ||
| PPV | 0.623 | 0.603 | ||
| (0.10, 0.20, 0.10, 0.60) | Power | 0.190 | 0.110 | |
| Sensitivity | 0.820 | 0.691 | ||
| PPV | 0.633 | 0.676 | ||
| (0.15, 0.15, 0.05, 0.65) | Power | 0.187 | 0.085 | |
| Sensitivity | 0.865 | 0.664 | ||
| PPV | 0.605 | 0.596 | ||
| (0.18, 0.12, 0.02, 0.68) | Power | 0.207 | 0.078 | |
| Sensitivity | 0.886 | 0.562 | ||
| PPV | 0.567 | 0.586 |
Estimated power, sensitivity, and PPV with the number of matched pairs = 400 (highest value across three methods is shown in bold).
| ( | Bernoulli-based | |||
|---|---|---|---|---|
| (0.05, 0.25, 0.15, 0.55) | Power | 0.585 | 0.586 | |
| Sensitivity | 0.886 | 0.882 | ||
| PPV | 0.811 | |||
| (0.10, 0.20, 0.10, 0.60) | Power | 0.737 | 0.648 | |
| Sensitivity | 0.923 | 0.878 | ||
| PPV | 0.848 | 0.821 | ||
| (0.15, 0.15, 0.05, 0.65) | Power | 0.914 | 0.689 | |
| Sensitivity | 0.966 | 0.865 | ||
| PPV | 0.927 | 0.837 | ||
| (0.18, 0.12, 0.02, 0.68) | Power | 0.995 | 0.684 | |
| Sensitivity | 0.983 | 0.842 | ||
| PPV | 0.943 | 0.820 |
Fig 2The most likely cluster detected by the two proposed methods.
Estimated power, sensitivity, and PPV with the number of matched pairs = 200 (highest value across three methods is shown in bold).
| ( | Bernoulli-based | |||
|---|---|---|---|---|
| (0.05, 0.25, 0.15, 0.55) | Power | 0.289 | 0.184 | |
| Sensitivity | 0.845 | 0.837 | ||
| PPV | 0.732 | 0.715 | ||
| (0.10, 0.20, 0.10, 0.60) | Power | 0.372 | 0.232 | |
| Sensitivity | 0.864 | 0.799 | ||
| PPV | 0.744 | 0.734 | ||
| (0.15, 0.15, 0.05, 0.65) | Power | 0.548 | 0.210 | |
| Sensitivity | 0.923 | 0.766 | ||
| PPV | 0.817 | 0.684 | ||
| (0.18, 0.12, 0.02, 0.68) | Power | 0.678 | 0.158 | |
| Sensitivity | 0.925 | 0.747 | ||
| PPV | 0.803 | 0.683 |