| Literature DB >> 31546670 |
Julia Díez1, Alba Cebrecos2, Iñaki Galán3,4, Hugo Pérez-Freixo5, Manuel Franco6,7, Usama Bilal8,9.
Abstract
Previous studies have suggested that European settings face unique food environment issues; however, retail food environments (RFE) outside Anglo-Saxon contexts remain understudied. We assessed the completeness and accuracy of an administrative dataset against ground truthing, using the example of Madrid (Spain). Further, we tested whether its completeness differed by its area-level socioeconomic status (SES) and population density. First, we collected data on the RFE through the ground truthing of 42 census tracts. Second, we retrieved data on the RFE from an administrative dataset covering the entire city (n = 2412 census tracts), and matched outlets using location matching and location/name matching. Third, we validated the administrative dataset against the gold standard of ground truthing. Using location matching, the administrative dataset had a high sensitivity (0.95; [95% CI = 0.89, 0.98]) and positive predictive values (PPV) (0.79; [95% CI = 0.70, 0.85]), while these values were substantially lower using location/name matching (0.55 and 0.45, respectively). Accuracy was slightly higher using location/name matching (k = 0.71 vs 0.62). We found some evidence for systematic differences in PPV by area-level SES using location matching, and in both sensitivity and PPV by population density using location/name matching. Administrative datasets may offer a reliable and cost-effective source to measure retail food access; however, their accuracy needs to be evaluated before using them for research purposes.Entities:
Keywords: Spain; differential exposure; food outlets; ground-truthing; retail food environment; secondary data; validity
Mesh:
Year: 2019 PMID: 31546670 PMCID: PMC6801710 DOI: 10.3390/ijerph16193538
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Classification of food outlets.
| Food Outlet Category | Characteristics |
|---|---|
|
| |
| Supermarkets | Full-line, self-service food outlets that allow the supply of a wide variety of products of daily consumption, food and non-food, without the intermediation of a person employed to serve the buyers (unless requested). This category includes both large chain, small and discount supermarkets. |
| Small grocers | Neighborhood stores, self-service outlets selling a variety of products and which are neither a specialized food store, a convenience store, nor a supermarket. |
| Convenience food stores | Outlets with a diversified product offering including food, drinks, snacks, or magazines. They usually open more than 18 hours a day, have two or fewer cash registers, and are often associated (in Spain) with gas stations. |
|
| |
| Fruit & Vegetables stores | Specialized food outlet with retail sale of fresh, prepared or preserved fruits and vegetables. |
| Butcheries | Specialized food outlet with retail sale of fresh, frozen, or cured meat and meat products, including poultry and the retail sale of dairy products and eggs |
| Fishmongers | Specialized food outlet with retail sale of fresh, frozen, or cured fish and other seafood products |
| Bakeries | Specialized food outlet with retail sale of bread, cakes, flour confectionery and sugar confectionery |
| Other specialized food stores | Specialized food outlet that does not fit into any other category (e.g., gourmet food stores) |
Figure 1Classification algorithm used to classify food outlets based on a declared code of economic activity and outlet name. We used name recognition to differentiate supermarkets from small grocers, with a list of 60 supermarket names that we obtained from the Yellow Pages® [3].
Figure 2Flowchart illustrating data matching process using both (1) a liberal matching strategy (only by location); and (2) a strict matching strategy (both by location and outlet name).
Completeness of the administrative dataset by outlet matching strategy.
| Measure | Liberal Matching ( | Strict Matching ( | ||
|---|---|---|---|---|
| Est. 1 | 95% CI 2 | Est. 1 | 95% CI 2 | |
| Sensitivity | 0.95 | [0.89, 0.98] | 0.55 | [0.44, 0.64] |
| Positive Predictive Value | 0.79 | [0.70, 0.85] | 0.45 | [0.37, 0.54] |
1 Est., Validity Statistic Estimate. 2 CI, Confidence Interval.
Assessment of the systematic bias of the administrative data according to area-level socioeconomic status and population density.
| Area-Level Characteristic | Liberal Matching ( | Strict Matching ( | ||
|---|---|---|---|---|
| Sens. 1 | PPV 2 | Sens. 1 | PPV 2 | |
| Socioeconomic status | ||||
| Low | 0.93 [0.86, 0.99] | 0.92 [0.83, 1.00] | 0.44 [0.26, 0.63] | 0.44 [0.25, 0.63] |
| Middle | 0.98 [0.90, 1.00] | 0.63 [0.49, 0.77] | 0.69 [0.40, 0.98] | 0.69 [0.40, 0.98] |
| High | 0.98 [0.91, 1.00] | 0.70 [0.62, 0.78] | 0.78 [0.57, 0.99] | 0.78 [0.57, 0.99] |
| Population density | ||||
| Low | 0.97 [0.92, 1.00] | 0.79 [0.70, 0.88] | 0.71 [0.56, 0.86] | 0.71 [0.56, 0.86] |
| Middle | 0.91 [0.83, 0.99] | 0.78 [0.61, 0.95] | 0.35 [0.24, 0.49] | 0.37 [0.24, 0.49] |
| High | 0.95 [0.89, 1.00] | 0.78 [0.64, 0.92] | 0.51 [0.33, 0.69] | 0.51 [0.33, 0.69] |
1 Sens., Sensitivity; 2 PPV, Positive Predictive Value. 3 p-values test the null hypothesis that sensitivity or PPV are the same in areas of low, middle and high socioeconomic status (SES) or population density.
Accuracy of the algorithm to classify correctly the type of outlet in the administrative dataset, by outlet matching strategy.
| Measure | Liberal Matching ( | Strict Matching ( | ||
|---|---|---|---|---|
| Est. 1 | 95% CI 2 | Est. 1 | 95% CI 2 | |
| Percent Agreement | 0.71 | [0.62, 0.80] | 0.77 | [0.66, 0.88] |
| Cohen’s Kappa | 0.62 | [0.57, 0.66] | 0.71 | [0.56, 0.85] |
1 Est., Validity Statistic Estimate; 2 CI, Confidence Interval.