| Literature DB >> 30179234 |
Stefan Leyk1, Johannes H Uhl1.
Abstract
Human settlement plays a key role in understanding social processes such as urbanization and interactions between human and environmental systems but not much is known about the landscape evolution before the era of operational remote sensing technology. In this study, housing and property databases are used to create new gridded settlement layers describing human settlement processes at fine spatial and temporal resolution in the conterminous United States between 1810 and 2015. The main products are a raster composite layer representing the year of first settlement, and a raster time series of built-up intensity representing the sum of building areas in a pixel. Several accompanying uncertainty surfaces are provided to ensure the user is informed about inherent spatial, temporal and thematic uncertainty in the data. A validation study using high quality reference data confirms high levels of accuracy of the resulting data products. These settlement data will be of great interest in disciplines in which the long-term evolution of human settlement represents crucial information to explore novel research questions.Entities:
Mesh:
Year: 2018 PMID: 30179234 PMCID: PMC6122163 DOI: 10.1038/sdata.2018.175
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1Workflow for the production of the main data products, FBUY and BUI, and the associated uncertainty layers.
This figure illustrates the complete workflow including the data processing and integration steps, the creation of the historical settlement layers and the accompanying uncertainty layers.
Figure 2The main settlement data products, BUI and FBUY, their associated uncertainty layers and methodological details for positional uncertainty computation.
(a) FBUY composite raster layer for the conterminous U.S. with counties where no data is available shown in grey outlines, (b) FBUY composite raster layer for greater Dallas-Fort Worth (Texas), (c) Selected BUI layers for greater Boston (Massachusetts) for 1810, 1860, 1910, 1960 and 2010, (d) IPU_BUI for the year 2015, (e) OPU_BUI for the year 2015, (f) TPixMiss, (g) APixMiss for the year 2015, all (d–g) for South Boulder (Colorado), (h) GeoMiss at county level, (i) IPU calculation, and OPU calculation accompanying text descriptions.
Descriptions and characteristics of input, output and reference data used in this study.
| Name/Category | Short Description | Data type | Temporal resolution | Spatial resolution |
|---|---|---|---|---|
| | Housing and property database provided by Zillow, with geolocation and temporal information | CSV | annual | point locations |
| | Sum of gross indoor area (in m2) of all structures located within a given cell at a given year (cumulative) 1810-2015 | Geo-TIFF, raster series, Integer | Semi-decadal | 250 m |
| | First recorded year a structure has been built within a given cell extent; first year of settlement | Geo-TIFF, raster composite, Integer | Semi-decadal | 250 m |
| | For each cell in BUI rasters, the uncertainty that a point located inside could be outside; accompanying BUI raster series | Geo-TIFF, raster series, Float | Semi-decadal | 250 m |
| | For each cell in BUI rasters, the uncertainty that a point located in adjacent cells could be inside; accompanying BUI raster series | Geo-TIFF, raster series, Float | Semi-decadal | 250 m |
| | For each cell in BUI rasters, the average positional uncertainty, accompanying BUI raster series | Geo-TIFF, raster series, Float | Semi-decadal | 250 m |
| | For each cell in FBUY, the uncertainty that a point located inside could be outside, limited to unique locations of structures recorded at the time; accompanying FBUY raster composite | Geo-TIFF, raster composite, Float | Semi-decadal | 250 m |
| | For each cell in FBUY, the uncertainty that a point located in adjacent cell could be inside, limited to unique locations of structures recorded at the time; accompanying FBUY raster composite | Geo-TIFF, raster composite, Float | Semi-decadal | 250 m |
| | For each cell in FBUY, the average positional uncertainty, limited to unique locations of structures recorded at the time; accompanying FBUY raster composite | Geo-TIFF, raster composite, Float | Semi-decadal | 250 m |
| | For each cell, the number of missing built-year records; accompanying FBUY raster composite | Geo-TIFF, raster layer, Integer | N/A | 250 m |
| | For each cell, the number of missing records for building indoor area for each point in time; accompanying BUI raster series | Geo-TIFF, raster series, Integer | Semi-decadal | 250 m |
| | Vector data, delineating county boundaries in 2010 with various uncertainty measures in the attribute table for mapping uncertainty ` | ESRI shapefile | N/A | County |
| -> TMiss: | The sum of missing built year records in a county | Attribute, Integer | N/A | County |
| -> GeoMiss: | The sum of missing or invalid geographic coordinates in a county | Attribute, Integer | N/A | County |
| -> LUMiss: | The sum of records without landuse in a county | Attribute, Integer | N/A | County |
| -> AMiss: | The sum of missing records for building indoor area over all years in a county | Attribute, Integer | N/A | County |
| Reference-Built-up intensity (BUIRef) | For each cell, the sum of building footprint area (in m2) of all structures located inside at a given year (cumulative) 1810-2015 | Geo-TIFF, raster series, Integer | Semi-decadal | 250 m |
| Reference-First Built-Up Year (FBUYRef) | For each cell, the earliest built-up year reported used for validation of FBUY | Geo-TIFF, raster composite, Integer | Semi-decadal | 250 m |
Figure 3Results from the evaluation experiments for FBUY and BUI using the reference data.
(a) Confusion matrices for the FBUY classes at 5-year (upper row) and 10-year (lower row) temporal aggregation, each for spatial aggregations of 250 m, 750 m, and 1,250 m (from left to right). (b) Accuracy measures (precision, recall, G-mean, F-measure and Kappa) of semi-decadal FBUY layers plotted over time, for spatial aggregations of 250 m, 750 m, and 1,250 m (from left to right). (c) Scatterplots to show the association between BUI and BUIref values, color-coded for different time periods, based on spatial aggregations of 250 m, 750 m, and 1,250 m (from left to right); regression lines shown are estimated for the regression of BUI to estimate BUIref in 2015. (d) Slope, intercept and R-squared (from left to right) for linear regression analyses of BUI to estimate BUIref for all points in time between 1810 and 2015, at spatial aggregations of 250 m (blue), 750 m (red), and 1,250 m (green).