| Literature DB >> 35262348 |
Jules Kerckhoffs1, Jibran Khan2,3, Gerard Hoek1, Zhendong Yuan1, Thomas Ellermann2, Ole Hertel4, Matthias Ketzel2,5, Steen Solvang Jensen2, Kees Meliefste1, Roel Vermeulen1,6.
Abstract
High-resolution air quality (AQ) maps based on street-by-street measurements have become possible through large-scale mobile measurement campaigns. Such campaigns have produced data-only maps and have been used to produce empirical models [i.e., land use regression (LUR) models]. Assuming that all road segments are measured, we developed a mixed model framework that predicts concentrations by an LUR model, while allowing road segments to deviate from the LUR prediction based on between-segment variation as a random effect. We used Google Street View cars, equipped with high-quality AQ instruments, and measured the concentration of NO2 on every street in Amsterdam (n = 46.664) and Copenhagen (n = 28.499) on average seven times over the course of 9 and 16 months, respectively. We compared the data-only mapping, LUR, and mixed model estimates with measurements from passive samplers (n = 82) and predictions from dispersion models in the same time window as mobile monitoring. In Amsterdam, mixed model estimates correlated rs (Spearman correlation) = 0.85 with external measurements, whereas the data-only approach and LUR model estimates correlated rs = 0.74 and 0.75, respectively. Mixed model estimates also correlated higher rs = 0.65 with the deterministic model predictions compared to the data-only (rs = 0.50) and LUR model (rs = 0.61). In Copenhagen, mixed model estimates correlated rs = 0.51 with external model predictions compared to rs = 0.45 and rs = 0.50 for data-only and LUR model, respectively. Correlation increased for 97 locations (rs = 0.65) with more detailed traffic information. This means that the mixed model approach is able to combine the strength of data-only mapping (to show hyperlocal variation) and LUR models by shrinking uncertain concentrations toward the model output.Entities:
Keywords: Google Street View; LUR; NO2 measurements; hyperlocal variation; mixed-effect model
Mesh:
Substances:
Year: 2022 PMID: 35262348 PMCID: PMC9178915 DOI: 10.1021/acs.est.1c05806
Source DB: PubMed Journal: Environ Sci Technol ISSN: 0013-936X Impact factor: 11.357
Figure 1Study sites: (a) City of Amsterdam and (b) Copenhagen metropolitan area containing three municipalities, Copenhagen, Frederiksberg, and Tårnby. The bold black lines show the border of the study sites. Background maps ESRI.
Overview of GSV Data and Comparison Data Sets in Amsterdam and Copenhagen
| city | data | number of sites | year | name |
|---|---|---|---|---|
| AMS | Amsterdam Air View data (data-only, LUR, mixed model) | 46,664 | 2019–2020 | AAV |
| Palmes tubes measurements[ | 82 | 2019–2020 | Palmes | |
| model predictions by the National Institute
for Public Health
and the Environment[ | 7004 | 2019 | NSL | |
| Dutch National Air Quality Monitoring Programme | 7 | 2019 | LML | |
| CPH | Copenhagen Air View data (data-only, LUR, mixed model) | 28,499 | 2018–2020 | CAV |
| AirGIS model predictions (2019)[ | 58,234 | 2019 | LPDV | |
| AirGIS model predictions along streets | 97 | 2019 | CPH-97 | |
| Danish National Air Quality
Monitoring Programme[ | 3 | 2018–2020 | NOVANA |
Matches exact time window of GSV measurements. AMS: Amsterdam; CPH: Copenhagen.
Figure 2Maps of measurements, predictions, and variance in Amsterdam. (a) Data-only map, (b) standard error of the mean, (c) LUR model (fixed effects), (d) mixed-effect model, (e) random components, and (f) distribution of NO2 measurements and predictions. Note: Boxes represent the IQR; the horizontal line is the median; vertical lines extend to IQR times 1.5 (limited to data); dots are individual outliers; the black squared dot is the mean. Full size maps in the Supporting Information (Figure A1).
Summary Statistics, Correlation, and Bias Scores for all Comparisons in AMSa
| summary
statistics | correlation
and bias | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| min | Q1 | med | Q3 | max | RMSE | mean bias | mean relative bias (%) | ||
| Comparison to
NSL Predictions, | |||||||||
| NSL | 12 | 23 | 26 | 28 | 41 | ||||
| data-only | 7 | 23 | 29 | 37 | 116 | 0.50 | 11.35 | 5.2 | 20 |
| LUR model | 16 | 27 | 31 | 34 | 84 | 0.61 | 6.76 | 5.1 | 20 |
| mixed model | 15 | 27 | 30 | 35 | 69 | 0.65 | 7.47 | 5.5 | 21 |
| Comparison to
Palmes Measurements, | |||||||||
| Palmes | 17 | 23 | 28 | 32 | 44 | ||||
| data-only | 13 | 23 | 34 | 42 | 58 | 0.74 | 10.23 | 6.3 | 23 |
| LUR model | 20 | 28 | 32 | 35 | 52 | 0.75 | 6.49 | 4.5 | 16 |
| mixed model | 17 | 28 | 34 | 38 | 57 | 0.85 | 7.67 | 6.1 | 22 |
| Comparison to
Palmes Measurements with Overlapping NSL Sites, | |||||||||
| Palmes | 19 | 26 | 31 | 33 | 41 | ||||
| NSL | 23 | 29 | 33 | 35 | 40 | 0.54 | 4.84 | 2.1 | 7 |
| data-only | 16 | 32 | 36 | 43 | 54 | 0.74 | 9.95 | 6.5 | 22 |
| LUR model | 24 | 32 | 35 | 37 | 50 | 0.45 | 6.93 | 4.7 | 16 |
| mixed model | 23 | 33 | 35 | 41 | 51 | 0.72 | 8.04 | 6.4 | 21 |
Summary statistics, RMSE, and mean bias in μg/m3. Min = minimum, Q1 = the 25th percentile, med = median or the 50th percentile, Q3 = the 75th percentile, max = maximum, rs = Spearman’s rank correlation, RMSE = root-mean square error, mean bias calculated as mean [(ref – test), mean relative bias calculated as mean bias/ref] where: ref = NSL, Palmes, and test = AAV data.
Figure 3Maps of measurements, predictions, and variance in Copenhagen. (a) Data-only map, (b) standard error of the mean, (c) LUR model (fixed effects), (d) mixed-effect model, (e) random components, and (f) distribution of NO2 measurements and predictions. Note: Boxes represent the IQR; the horizontal line is the median; vertical lines extend to IQR times 1.5 (limited to data); dots are individual outliers; the black squared dot is the mean. Full size maps in the Supporting Information (Figure B1).
Summary Statistics, Correlation, and Bias Scores for all Comparisons in CPHa
| summary
statistics | correlation
and bias | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| min | Q1 | med | Q3 | max | RMSE | mean bias | mean relative bias | ||
| Comparison to
LPDV Model Predictions ( | |||||||||
| LPDV | 11 | 14 | 15 | 18 | 47 | ||||
| data-only | 5 | 10 | 13 | 17 | 128 | 0.45 | 7.18 | –1.87 | –11 |
| LUR model | 8 | 12 | 15 | 18 | 50 | 0.50 | 4.06 | –1.08 | –6 |
| mixed model | 8 | 12 | 15 | 18 | 53 | 0.51 | 4.15 | –1.14 | –7 |
| Comparison to CPH-97 Model
Predictions ( | |||||||||
| CPH-97 | 17 | 22 | 25 | 28 | 39 | ||||
| data-only | 12 | 19 | 25 | 31 | 53 | 0.67 | 7.75 | 0.81 | 3 |
| LUR model | 16 | 24 | 27 | 30 | 50 | 0.55 | 5.16 | 1.78 | 7 |
| mixed model | 16 | 23 | 27 | 30 | 52 | 0.65 | 5.94 | 2.04 | 8 |
Summary statistics, RMSE and mean bias in μg/m3. min = minimum, Q1 = the 25th percentile, med = median or the 50th percentile, Q3 = the 75th percentile, max = maximum, rs = Spearman’s rank correlation, RMSE = root-mean square error, mean bias calculated as mean [(ref – test), mean relative bias calculated as mean bias/ref] where: ref = LPDV, CPH-97, and test = CAV data.