| Literature DB >> 21219638 |
Kirsten M M Beyer1, Audrey F Saftlas, Anne B Wallis, Corinne Peek-Asa, Gerard Rushton.
Abstract
BACKGROUND: The need to estimate the distance from an individual to a service provider is common in public health research. However, estimated distances are often imprecise and, we suspect, biased due to a lack of specific residential location data. In many cases, to protect subject confidentiality, data sets contain only a ZIP Code or a county.Entities:
Mesh:
Year: 2011 PMID: 21219638 PMCID: PMC3024211 DOI: 10.1186/1476-072X-10-4
Source DB: PubMed Journal: Int J Health Geogr ISSN: 1476-072X Impact factor: 3.918
Figure 1Multiple selection increases with proportion of ZCTA population of women of respondent's age in the census block.
Figure 2The effects of residential geocode on distance estimation and determination of closest facility. PANEL 1 shows that in some cases, the ZIP Code center is much closer to representing the geography of the sampled locations than the ZCTA centroid, although the effect on distance estimation may be negligible. PANEL 2 illustrates that respondents may be comparably close to multiple service centers. Use of ZCTA centroid or ZIP Code center geocode alone may result in different determinations of which facility is closest. A probabilistic sampling approach includes all facilities to the degree that they are relevant.
Measures of central tendency for estimated distances (in miles) between respondent locations and IPV service centers using three methods
| PSM (median) | PSM (mean) | ZCTA centroid as geocode† | ZIP Code population center as geocode | |
|---|---|---|---|---|
| Range | 0.39 - 66.4 | 0.53 - 65.6 | 0.09 - 67.2 | 0.00* - 66.6 |
| Median | 5.2 | 5.4 | 5.4 | 6.0 |
| Mean (SD) | 10.1 (11.2) | 10.2 (11.2) | 10.5 (11.1) | 9.9 (11.2) |
| 1st Quartile | 2.9 | 3.1 | 3.6 | 3.4 |
| 3rd Quartile | 12.2 | 12.3 | 12.1 | 11.7 |
The table compares distance estimates obtained using the Probabilistic Sampling Method (PSM), ZCTA centroid, and ZIP Code population center geocoding approaches (n = 1467). * The value of 0 reflects the distance (e.g., 0 miles) between the ZIP Code center point and itself when it is used as a geocode for both the subject's residence and the location of the IPV service center. † Some distances calculated using the ZCTA centroid as the geocode for the subject's residence reflect the distance between the ZCTA centroid and the ZIP Code center point, when IPV service centers were geocoded to ZIP Code centers.
Figure 3Comparison of the PSM distances with two commonly used alternative methods for estimating distances between residence and service location.
Figure 4Width of the 93% confidence interval increases with median distance to closest IPV resource (PSM method).
Figure 5Predicting the PSM median distance with ZCTA centroid and ZIP Code center distances.
U.S. examples for using the Probabilistic Sampling Method (PSM) for distance estimation
| Region of known residence (p) | Smaller areas within the region for which demographic data is available (q) | Typical ratio (q/p) |
|---|---|---|
| county | Zip Code, census tract, township, block group, census block | Can vary widely from 36:1 for rural counties to 1000:1 for metropolitan counties. |
| Zip Code | Census tract, census block group, census block | From 10:1 to 10,000:1 |
| Census tract | Census block group, census block | Variable; 50:1 |
| Census tract | 94 Meter grid population data for the U.S. (LandScan U.S.A.) | Highly variable: 1000:1 |