| Literature DB >> 36085072 |
Abstract
BACKGROUND: In public health and epidemiology, spatial scan statistics can be used to identify spatial cluster patterns of health-related outcomes from population-based health survey data. Although it is appropriate to consider the complex sample design and sampling weight when analyzing complex sample survey data, the observed survey responses without these considerations are often used in many studies related to spatial cluster detection.Entities:
Keywords: Geographic surveillance; Health survey; Sampling design; Sampling weight; Spatial cluster detection
Mesh:
Year: 2022 PMID: 36085072 PMCID: PMC9463844 DOI: 10.1186/s12942-022-00311-6
Source DB: PubMed Journal: Int J Health Geogr ISSN: 1476-072X Impact factor: 5.310
Fig. 1Significant spatial clusters detected with high diabetes diagnosis rates of male adults using the KCHS 2018 data. A Individual-level data (frequency). B Individual-level data (weighted frequency). C Aggregate-level data (crude rate)
Fig. 2Significant spatial clusters detected with high diabetes diagnosis rates of male adults with age adjustment using the KCHS 2018 data. A Individual-level data (frequency with age adjustment). B Individual-level data (weighted frequency with age adjustment). C Aggregate-level data (age standardized rate)
The number of significant clusters detected with high diabetes diagnosis rates of male adults at optimized value of the MRCS when using different types of data from the KCHS 2018 data (see Fig. 1)
| Type of data | MRCS | Number of significant clusters |
|---|---|---|
| Individual-level data | ||
| Frequency | 20 | 3 |
| Weighted frequency | 4 | 31 |
| Aggregate-level data | ||
| Crude rate | 15 | 1 |
The number of significant clusters detected with high diabetes diagnosis rates of male adults with age adjustment at optimized value of the MRCS when using different types of data from the KCHS 2018 data (see Fig. 2)
| Type of data | MRCS | Number of significant clusters |
|---|---|---|
| Individual-level data | ||
| Frequency | 10 | 1 |
| Weighted frequency | 2 | 53 |
| Aggregate-level data | ||
| Age-standardized rate | 10 | 1 |
Fig. 3The true simulated cluster models among the 250 districts of South Korea. True cluster model (A). True cluster model (B)
Three different sampling proportions used in the simulation
| Sampling proportion scenarios | Sex | Age groups | |||
|---|---|---|---|---|---|
| 20‒34 years | 35‒49 years | 50‒64 years | Over 65 years | ||
| (1) | Male | SRS | SRS | SRS | SRS |
| Female | SRS | SRS | SRS | SRS | |
| (2) | Male | 0.07 | 0.11 | 0.14 | 0.13 |
| Female | 0.08 | 0.12 | 0.16 | 0.19 | |
| (3) | Male | 0.03 | 0.15 | 0.18 | 0.09 |
| Female | 0.04 | 0.16 | 0.20 | 0.15 | |
SRS simple random sampling
The average sensitivity and PPV (standard deviation in parentheses) over 100 iterations for each data type used for spatial cluster detection under six simulation scenarios
| Type of data | Individual-level data | Aggregate-level data | |
|---|---|---|---|
| Frequency | Weighted frequency | Crude rate | |
| Scenario 1 [true cluster (A) + sampling proportion (1)] | |||
| Sensitivity | 0.9214 (0.0425) | 0.9381 (0.0536) | |
| PPV | 0.8071 (0.1312) | 0.1966 (0.0187) | |
| Scenario 2 [true cluster (A) + sampling proportion (2)] | |||
| Sensitivity | 0.9320 (0.0406) | 0.9286 (0.0607) | |
| PPV | 0.7861 (0.1255) | 0.1889 (0.0163) | |
| Scenario 3 [true cluster (A) + sampling proportion (3)] | |||
| Sensitivity | 0.9203 (0.0379) | 0.9280 (0.0549) | |
| PPV | 0.7999 (0.1186) | 0.1852 (0.0193) | |
| Scenario 4 [true cluster (B) + sampling proportion (1)] | |||
| Sensitivity | 0.9163 (0.0350) | 0.9192 (0.0218) | |
| PPV | 0.8734 (0.0752) | 0.3865 (0.0370) | |
| Scenario 5 [true cluster (B) + sampling proportion (2)] | |||
| Sensitivity | 0.9151 (0.0479) | 0.9189 (0.0202) | |
| PPV | 0.8749 (0.0766) | 0.3640 (0.0419) | |
| Scenario 6 [true cluster (B) + sampling proportion (3)] | |||
| Sensitivity | 0.9183 (0.0339) | 0.9244 (0.0476) | |
| PPV | 0.8678 (0.0765) | 0.3509 (0.0379) | |
The largest values of average sensitivity and PPV for each scenario are in bold