| Literature DB >> 28235407 |
Elodie Faure1, Aurélie M N Danjou1,2, Françoise Clavel-Chapelon3,4,5, Marie-Christine Boutron-Ruault6,7,8, Laure Dossus3,4, Béatrice Fervers1,2.
Abstract
BACKGROUND: Environmental exposure assessment based on Geographic Information Systems (GIS) and study participants' residential proximity to environmental exposure sources relies on the positional accuracy of subjects' residences to avoid misclassification bias. Our study compared the positional accuracy of two automatic geocoding methods to a manual reference method.Entities:
Keywords: Environmental epidemiology; Epidemiology; GIS; Geocoding; Geographic information system; Residential history
Mesh:
Year: 2017 PMID: 28235407 PMCID: PMC5324215 DOI: 10.1186/s12940-017-0217-5
Source DB: PubMed Journal: Environ Health ISSN: 1476-069X Impact factor: 5.984
Fig. 1Illustration of address locations in urban and rural areas with the three distinct methods. a example of residence located in urban area; b example of residence located in rural area (circle: ArcGIS online location for method R (a); triangle: manually improved location with method R used as reference; cross: location with method A; square: location with method B; dashed lines representing the distances between addresses located with ArcGIS online for method R and method R (a); methods A and R (a, b) and methods B and R (a, b))
Comparison of positional errorsa (in meter) of addresses located by two automatic geocoding methods
| Method of geocoding | Level of accuracy (accuracy code) |
| Median distance to the reference method location (IQR) in meters | Min-max, in meters | Distance in meters, | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| [0–25] | [26–50] | [51–100] | [101–400] | [401–800] | >800 | |||||
| Method Ab (Batch Geocoder) | Not found (0) | 8 (0.4) | 5919.5 (815.4–5639076.5) | 117.7–5679417.1 | – | – | – | 2 (25.0) | – | 6 (75.0) |
| City (4) or Postal code (5) | 405 (18.2) | 108.2 (0.0–787.3) | 0.0–35204.7 | 160 (39.5) | 13 (3.2) | 26 (6.4) | 60 (14.8) | 48 (11.9) | 98 (24.2) | |
| Street segment (6) | 363 (16.3) | 0.0 (0.0–108.1) | 0.0–519630.8 | 226 (62.3) | 18 (5.0) | 25 (6.9) | 41 (11.3) | 10 (2.8) | 43 (11.8) | |
| Address (8) or Point of interest (9) | 1448 (65.1) | 0.0 (0.0–0.0) | 0.0–292249.4 | 1241 (85.7) | 58 (4.0) | 66 (4.6) | 56 (3.9) | 7 (0.5) | 20 (1.4) | |
| Total | 2224 (100.0) | 0.0 (0.0–37.2) | 0.0–5679417.1 | 1627 (73.2) | 89 (4.0) | 117 (5.3) | 159 (7.1) | 65 (2.9) | 167 (7.5) | |
| Method B (ArcGis Locator) | Postal code (6) or Town Hall (5) | 377 (15.5) | 788.9 (114.0–1845.1) | 0.3–8545.1 | 53 (14.1) | 11 (2.9) | 25 (6.6) | 44 (11.7) | 56 (14.9) | 188 (49.9) |
| Locality (4) or Street segment (3) | 558 (23.0) | 110.3 (26.7–320.6) | 0.0–14477.6 | 132 (23.7) | 55 (9.9) | 82 (14.7) | 174 (31.2) | 47 (8.4) | 68 (12.2) | |
| Interpolated address (2) or Address (1) | 1490 (61.4) | 12.5 (6.0–35.6) | 0.0–3951.9 | 1000 (67.1) | 237 (15.9) | 133 (8.9) | 100 (6.7) | 5 (0.3) | 15 (1.0) | |
| Total | 2425 (100.0) | 26.5 (8.0–134.8) | 0.0–14477.6 | 1185 (48.9) | 303 (12.5) | 240 (9.9) | 318 (13.1) | 108 (4.5) | 271 (11.2) | |
apositional error was determined by calculating Euclide an distance (in meter) between addresses located by each automatic method (method A and method B) of geocoding regarding to a reference method (method R)
bthe administrative division of territories in France did not allow obtaining addresses geocoded to district and state levels (level 2), county (level 3) and intersection of streets (level 7). No addresses have been geocoded to the country level
Agreement in accuracy level (Cohen’s Kappa coefficient) for methods A and B (automatic geocoding) in comparison with method R (manual reference method)
| Method R | Method A | Method B | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| City | Street segment | Address | Total | Cohen's kappa coefficient | Postal code | Street segment (N addresses) | Address | Total | Cohen's kappa coefficient | ||
| Overall | Town Hall | 147 | 20 | 14 | 181 | 0.60 | 133 | 49 | 5 | 187 | 0.61 |
| Street segment | 212 | 309 | 140 | 661 | 119 | 439 | 131 | 689 | |||
| Address | 46 | 34 | 1294 | 1374 | 125 | 70 | 1354 | 1549 | |||
| Total | 405 | 363 | 1448 | 2216 | 377 | 558 | 1490 | 2425 | |||
| For urban addressesa | Town Hall | 62 | 10 | 11 | 83 | 0.56 | 54 | 27 | 5 | 86 | 0.52 |
| Street segment | 94 | 166 | 130 | 390 | 64 | 226 | 126 | 416 | |||
| Address | 44 | 26 | 1265 | 1335 | 114 | 63 | 1331 | 1508 | |||
| Total | 200 | 202 | 1406 | 1808 | 232 | 316 | 1462 | 2010 | |||
| For rural addressesa | Town Hall | 85 | 10 | 3 | 98 | 0.39 | 79 | 22 | 0 | 101 | 0.54 |
| Street segment | 118 | 143 | 10 | 271 | 55 | 213 | 5 | 273 | |||
| Address | 2 | 8 | 29 | 39 | 11 | 7 | 23 | 41 | |||
| Total | 205 | 161 | 42 | 408 | 145 | 242 | 28 | 415 | |||
| For 1990–2000 period addresses | Town Hall | 133 | 17 | 13 | 163 | 0.61 | 125 | 45 | 5 | 175 | 0.60 |
| Street segment | 182 | 278 | 120 | 580 | 112 | 396 | 117 | 625 | |||
| Address | 45 | 29 | 1141 | 1215 | 121 | 67 | 1216 | 1404 | |||
| Total | 360 | 324 | 1274 | 1958 | 358 | 508 | 1338 | 2204 | |||
| For 2000–2008 period addresses | Town Hall | 14 | 3 | 1 | 18 | 0.56 | 8 | 4 | 0 | 12 | 0.70 |
| Street segment | 30 | 31 | 20 | 81 | 7 | 43 | 14 | 64 | |||
| Address | 1 | 5 | 153 | 159 | 4 | 3 | 138 | 145 | |||
| Total | 45 | 39 | 174 | 258 | 19 | 50 | 152 | 221 | |||
aAssignment of urban and rural status of addresses was based on definitions established by the French national institute for statistics and economic studies (INSEE)
Fig. 2Accuracy level of addresses (located with method R) of the study population and their distribution according to urban unit in the Rhône-Alpes region and the city of Lyon