Caitlín Mc Shane, Johannes H Uhl, Stefan Leyk.
Abstract
Multiple aspects of our society are reflected in how we have transformed land through time. However, the limited availability of historical spatial data at fine granularity has hindered our understanding of how land was developed over the long term. Using a proprietary, national housing and property database, which is the result of large-scale, industry-fuelled data harmonization efforts, we created publicly available sequences of gridded surfaces that describe built land use progression in the conterminous United States at fine spatial (i.e., 250 m × 250 m) and temporal resolution (i.e., 1 year to 5 years) between 1940 and 2015. Six land use classes are represented in the data product: agricultural, commercial, industrial, residential-owned, residential-income, and recreational facilities. Complementary uncertainty layers inform users about quantifiable components of data uncertainty. The datasets are part of the Historical Settlement Data Compilation for the U.S. (HISDAC-US) and enable the creation of new knowledge of long-term land use dynamics, opening novel avenues of inquiry across multiple fields of study.
Year: 2022 PMID: 35963932 PMCID: PMC9376068 DOI: 10.1038/s41597-022-01591-0
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
The cumulative number of ZTRAX property records per land use theme and year.
| Land Use Type | 1940 | 1985 | 2015 |
|---|---|---|---|
| Agriculture | 170,378 | 371,886 | 6,238,359 |
| Commercial | 586,298 | 1,986,790 | 5,135,934 |
| Industrial | 61,256 | 368,772 | 938,317 |
| Recreational | 11,761 | 51,729 | 246,247 |
| Residential-Income | 1,656,334 | 2,980,961 | 4,117,566 |
| Residential-Owned | 11,535,823 | 51,435,548 | 100,062,915 |
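The counts in this table lend themselves to two simple derived quantities: the share of each land use class in a given year and the growth factor between two years. The following is a minimal sketch using only the values printed above; the function names `share_by_class` and `growth_factor` are illustrative and not part of the data product.

```python
# Cumulative ZTRAX record counts per land use theme, copied from the table above.
COUNTS = {
    "Agriculture": {1940: 170_378, 1985: 371_886, 2015: 6_238_359},
    "Commercial": {1940: 586_298, 1985: 1_986_790, 2015: 5_135_934},
    "Industrial": {1940: 61_256, 1985: 368_772, 2015: 938_317},
    "Recreational": {1940: 11_761, 1985: 51_729, 2015: 246_247},
    "Residential-Income": {1940: 1_656_334, 1985: 2_980_961, 2015: 4_117_566},
    "Residential-Owned": {1940: 11_535_823, 1985: 51_435_548, 2015: 100_062_915},
}


def share_by_class(year):
    """Return each class's percentage share of all records in `year`."""
    total = sum(by_year[year] for by_year in COUNTS.values())
    return {name: 100.0 * by_year[year] / total for name, by_year in COUNTS.items()}


def growth_factor(name, start, end):
    """Return the ratio of cumulative counts between two tabulated years."""
    return COUNTS[name][end] / COUNTS[name][start]


if __name__ == "__main__":
    for name, share in share_by_class(2015).items():
        print(f"{name}: {share:.1f}% of records, "
              f"x{growth_factor(name, 1940, 2015):.1f} since 1940")
```

Because the counts are cumulative, a growth factor here describes how much larger the recorded stock is in the later year, not the number of structures built in between.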
Fig. 1 Land use-specific property counts in 1945, 1985, and 2015, for Houston, Texas. The top 3 rows display theme-specific counts. The bottom row displays the aggregated counts of the agricultural, industrial, and recreational land use classes.
Technical specifications and access information for the created historical land use datasets.
| File name | Description | Temporal resolution | Temporal coverage | Spatial resolution | File format | DOI |
|---|---|---|---|---|---|---|
| LU_ThemeMaj_YYYY.tif | Annual gridded surfaces depicting the majority land use class per grid cell | 1 year | 1940–2015 | 250 m × 250 m | GeoTIFF | 10.7910/DVN/LNBJIO |
| LU_ThemeCount_A_YYYY_to_YYYY.tif | Semi-decadal gridded surface showing the cumulative count of agricultural structures | 5 years | 1940–2015 | 250 m × 250 m | GeoTIFF | 10.7910/DVN/I30REZ |
| LU_ThemeCount_C_YYYY_to_YYYY.tif | Semi-decadal gridded surface showing the cumulative count of commercial structures | 5 years | 1940–2015 | 250 m × 250 m | GeoTIFF | 10.7910/DVN/I30REZ |
| LU_ThemeCount_I_YYYY_to_YYYY.tif | Semi-decadal gridded surface showing the cumulative count of industrial structures | 5 years | 1940–2015 | 250 m × 250 m | GeoTIFF | 10.7910/DVN/I30REZ |
| LU_ThemeCount_RC_YYYY_to_YYYY.tif | Semi-decadal gridded surface showing the cumulative count of recreational structures | 5 years | 1940–2015 | 250 m × 250 m | GeoTIFF | 10.7910/DVN/I30REZ |
| LU_ThemeCount_RI_YYYY_to_YYYY.tif | Semi-decadal gridded surface showing the cumulative count of residential-income structures | 5 years | 1940–2015 | 250 m × 250 m | GeoTIFF | 10.7910/DVN/I30REZ |
| LU_ThemeCount_RO_YYYY_to_YYYY.tif | Semi-decadal gridded surface showing the cumulative count of residential-owned structures | 5 years | 1940–2015 | 250 m × 250 m | GeoTIFF | 10.7910/DVN/I30REZ |
| LuUncert_County_YYYY_to_YYYY.shp | Decadal shapefile surfaces describing the attribute missingness for land use and built year for all records | 10 years | 1940–2015 | County | ESRI Shapefile | 10.7910/DVN/JXJ5WH |
| LuUncert_County_2016.shp | Shapefile surface that describes the attribute missingness using all records missing one or both (land use, built year) attributes | — | 1940–2015 | County | ESRI Shapefile | 10.7910/DVN/JXJ5WH |
| LU_UncertPix_YYYY_to_YYYY.tif | Decadal gridded surfaces describing the land use attribute missingness for all georeferenced records | 10 years | 1940–2015 | 250 m × 250 m | GeoTIFF | 10.7910/DVN/JXJ5WH |
| LU_UncertPix_2016s.tif | Gridded surface showing the attribute missingness for both land use and built year | — | 2015 | 250 m × 250 m | GeoTIFF | 10.7910/DVN/JXJ5WH |
| Uncert_ExcldLU_YYYY_to_YYYY.tif | Gridded surface showing cumulative counts of structures represented in ZTRAX and excluded from the land use data | 10 years | 1940–2015 | 250 m × 250 m | GeoTIFF | 10.7910/DVN/JXJ5WH |
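The file naming scheme and DOIs in this table can be turned into a small lookup helper. The sketch below assumes that `YYYY` stands for a four-digit year and that each DOI resolves through the standard `https://doi.org/` resolver; the helper names `majority_file_name` and `doi_url` are hypothetical, and the start and end years of the `YYYY_to_YYYY` files are not stated in the table, so they are deliberately not generated here.

```python
# File-name prefixes and DOIs as listed in the table above.
DOI_BY_PREFIX = {
    "LU_ThemeMaj": "10.7910/DVN/LNBJIO",
    "LU_ThemeCount": "10.7910/DVN/I30REZ",
    "LuUncert_County": "10.7910/DVN/JXJ5WH",
    "LU_UncertPix": "10.7910/DVN/JXJ5WH",
    "Uncert_ExcldLU": "10.7910/DVN/JXJ5WH",
}

FIRST_YEAR, LAST_YEAR = 1940, 2015  # temporal coverage stated in the table


def majority_file_name(year):
    """Build the annual majority-class file name (assumes YYYY = 4-digit year)."""
    if not FIRST_YEAR <= year <= LAST_YEAR:
        raise ValueError(f"year {year} is outside {FIRST_YEAR}-{LAST_YEAR}")
    return f"LU_ThemeMaj_{year}.tif"


def doi_url(file_name):
    """Return the DOI resolver URL of the dataset that holds `file_name`."""
    for prefix, doi in DOI_BY_PREFIX.items():
        if file_name.startswith(prefix):
            return f"https://doi.org/{doi}"
    raise ValueError(f"unknown file name prefix: {file_name}")


if __name__ == "__main__":
    name = majority_file_name(1985)
    print(name, "->", doi_url(name))
```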
Cross-tabulation of land use (lu) and year built (by) completeness; “n” indicates missingness, e.g., nby = “no built year”.
| | nby [N] | by [N] | sum [N] | nby [%] | by [%] | sum [%] |
|---|---|---|---|---|---|---|
| nlu | 2187645 | 77796 | 2265441 | 1.76 | 0.06 | 1.82 |
| lu | 28744633 | 93272917 | 1.22E+08 | 23.13 | 75.05 | 98.18 |
| sum | 30932278 | 93350713 | 1.24E+08 | 24.9 | 75.11 | 100 |
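The marginal sums and percentages in this cross-tabulation follow directly from the four cell counts. As a minimal sketch, the arithmetic can be reproduced as follows; the variable and function names are illustrative only.

```python
# The four cell counts of the cross-tabulation above.
# Keys are (land use status, built year status); "n" prefixes mean "missing".
CELLS = {
    ("nlu", "nby"): 2_187_645,
    ("nlu", "by"): 77_796,
    ("lu", "nby"): 28_744_633,
    ("lu", "by"): 93_272_917,
}

TOTAL = sum(CELLS.values())


def percent(count):
    """Express a record count as a percentage of all records."""
    return 100.0 * count / TOTAL


def row_sum(lu_status):
    """Sum over built-year status for one land use status ('lu' or 'nlu')."""
    return sum(n for (lu, _), n in CELLS.items() if lu == lu_status)


def col_sum(by_status):
    """Sum over land use status for one built-year status ('by' or 'nby')."""
    return sum(n for (_, by), n in CELLS.items() if by == by_status)


if __name__ == "__main__":
    for key, n in CELLS.items():
        print(key, n, f"{percent(n):.2f}%")
    print("records with land use:", row_sum("lu"), f"{percent(row_sum('lu')):.2f}%")
    print("records with built year:", col_sum("by"), f"{percent(col_sum('by')):.2f}%")
    print("all records:", TOTAL)
```

Only records that have both attributes (the `lu`/`by` cell, about three quarters of all records) can be placed in both a land use class and a year.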
Fig. 2 Attribute completeness in ZTRAX: Percentage of records per county with a valid (a) land use attribute, (b) location attribute (i.e., latitude and longitude), and (c) year built attribute.
OSM-based agreement assessment using correlations and recall measures across the rural-urban continuum (RUC).
| RUCC | Description | Spearman correlation: Residential | Spearman correlation: Commercial | Spearman correlation: Industrial | Recall: Residential | Recall: Commercial | Recall: Industrial |
|---|---|---|---|---|---|---|---|
| 1 | Pop ≥ 1 M | 0.663 | 0.368 | 0.585 | 0.883 | 0.652 | 0.432 |
| 2 | Pop ≥ 250 K & pop < 1 M | 0.621 | 0.254 | 0.26 | 0.707 | 0.576 | 0.233 |
| 3 | Pop < 250 K | 0.601 | 0.292 | 0.261 | 0.569 | 0.534 | 0.195 |
| 4 | Pop ≥ 20 K adjacent to metro area | 0.652 | 0.296 | 0.15 | 0.696 | 0.526 | 0.194 |
| 5 | Pop ≥ 20 K & not adjacent to metro area | 0.575 | 0.252 | 0.162 | 0.682 | 0.579 | 0.148 |
| 6 | Pop ≥ 2,500 & pop ≤ 19,999 adjacent to metro | 0.557 | 0.297 | 0.31 | 0.504 | 0.437 | 0.103 |
| 7 | Pop ≥ 2,500 & pop ≤ 19,999 not adjacent to metro | 0.586 | 0.284 | 0.033 | 0.339 | 0.347 | 0.062 |
| 8 | Pop ≤ 2,500 adjacent to metro area | 0.494 | 0.337 | −0.065 | 0.36 | 0.344 | 0.047 |
| 9 | Pop ≤ 2,500 not adjacent to metro area | 0.523 | 0.184 | −0.087 | 0.337 | 0.256 | 0.039 |
| CONUS | | 0.676 | 0.334 | 0.546 | 0.77 | 0.608 | 0.336 |
A brief description of each RUCC (1 = urban, 9 = rural) is provided in terms of population (pop).
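For readers unfamiliar with the two agreement measures in this table, the following is a minimal sketch of how Spearman's rank correlation and recall are commonly computed. It is not the authors' implementation: the pairing of grid-cell counts, the choice of OpenStreetMap as the reference for recall, and all function names are assumptions made for illustration.

```python
def average_ranks(values):
    """Rank values from 1..n, assigning tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current group of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (var_x * var_y) ** 0.5


def recall(reference, predicted, target):
    """Share of reference items of class `target` that are predicted as `target`."""
    hits = [p == target for r, p in zip(reference, predicted) if r == target]
    if not hits:
        raise ValueError(f"no reference items of class {target!r}")
    return sum(hits) / len(hits)


if __name__ == "__main__":
    # Hypothetical per-cell building counts from two sources.
    osm_counts = [0, 3, 5, 5, 12]
    ztrax_counts = [1, 2, 6, 4, 15]
    print("Spearman:", round(spearman(osm_counts, ztrax_counts), 3))
    # Hypothetical building-level labels (reference first, then prediction).
    print("Recall:", recall(["res", "res", "com"], ["res", "com", "com"], "res"))
```

Under this reading, a low recall with a moderate correlation, as reported for industrial records in rural counties, would mean that few reference buildings are matched even where the spatial pattern of counts is loosely similar.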
Fig. 3 Comparison to building-level land use classes from OpenStreetMap. (a) Distribution of Spearman’s correlation coefficient based on 250 m × 250 m grid cell counts of residential, commercial, and industrial records, and (b) distribution of county-level recall values; panels (c) and (d) show the distributions of county-level correlation and recall, disaggregated for each rural-urban continuum code.
Fig. 4 Record-level comparison of ZTRAX land use classes and LULC land use categories, carried out for the full sample, for rural counties (RUCC 6–9) and urban counties (RUCC 1–5). Values are shown in % of the sample of N = 486,000 ZTRAX records used.
Grid-cell-level comparison of ZTRAX land use classes and NLCD 2001 and 2016 land cover classes.
| Reference | Mode-based resampling | | | | 1-hot encoding | | | |
|---|---|---|---|---|---|---|---|---|
| | NLCD 2001 | | NLCD 2016 | | NLCD 2001 | | NLCD 2016 | |
| | not covered by HISDAC | covered by HISDAC | not covered by HISDAC | covered by HISDAC | not covered by HISDAC | covered by HISDAC | not covered by HISDAC | covered by HISDAC |
| Proportions of NLCD class | | | | | | | | |
| Open Water | 98.18 | 1.82 | 97.26 | 2.74 | 98.11 | 1.89 | 97.17 | 2.83 |
| Perennial Ice/Snow | 100 | 0 | 100 | 0 | 100 | 0 | 100 | 0 |
| Developed, Open Space | 44.67 | 55.33 | 32.96 | 67.04 | 66.21 | 33.79 | 55.94 | 44.06 |
| Developed, Low Intensity | 29.27 | 70.73 | 17.8 | 82.2 | 47.21 | 52.79 | 35.86 | 64.14 |
| Developed, Medium Intensity | 28.12 | 71.88 | 20.22 | 79.78 | 35.09 | 64.91 | 28.16 | 71.84 |
| Developed, High Intensity | 36.57 | 63.43 | 28.21 | 71.79 | 37.96 | 62.04 | 30.47 | 69.53 |
| Barren Land | 98.86 | 1.14 | 98.47 | 1.53 | 98.55 | 1.45 | 98.13 | 1.87 |
| Deciduous Forest | 89.6 | 10.4 | 84.44 | 15.56 | 89.67 | 10.33 | 84.65 | 15.35 |
| Evergreen Forest | 96.7 | 3.3 | 95.03 | 4.97 | 96.65 | 3.35 | 95.02 | 4.98 |
| Mixed Forest | 88.86 | 11.14 | 83.12 | 16.88 | 88.91 | 11.09 | 83.44 | 16.56 |
| Shrub/Scrub | 98.95 | 1.05 | 98.38 | 1.62 | 98.93 | 1.07 | 98.35 | 1.65 |
| Grassland/Herbaceous | 98.02 | 1.98 | 96.95 | 3.05 | 98.09 | 1.91 | 97.02 | 2.98 |
| Pasture/Hay | 84.07 | 15.93 | 76.01 | 23.99 | 85.3 | 14.7 | 77.89 | 22.11 |
| Cultivated Crops | 95.35 | 4.65 | 91.13 | 8.87 | 95.34 | 4.66 | 91.27 | 8.73 |
| Woody Wetlands | 95.01 | 4.99 | 91.3 | 8.7 | 94.69 | 5.31 | 90.96 | 9.04 |
| Emergent Herbaceous Wetlands | 97.48 | 2.52 | 95.66 | 4.34 | 97.03 | 2.97 | 94.98 | 5.02 |
| Proportions of HISDAC class | | | | | | | | |
| Open Water | 5.57 | 0.1 | 5.5 | 0.15 | 5.52 | 0.11 | 5.52 | 0.11 |
| Perennial Ice/Snow | 0.01 | 0 | 0.01 | 0 | 0.01 | 0 | 0.01 | 0 |
| Developed, Open Space | 0.67 | 0.84 | 0.52 | 1.07 | 1.9 | 0.97 | 1.9 | 0.97 |
| Developed, Low Intensity | 0.36 | 0.87 | 0.24 | 1.09 | 0.81 | 0.9 | 0.81 | 0.9 |
| Developed, Medium Intensity | 0.2 | 0.51 | 0.18 | 0.72 | 0.29 | 0.54 | 0.29 | 0.54 |
| Developed, High Intensity | 0.1 | 0.17 | 0.09 | 0.23 | 0.11 | 0.19 | 0.11 | 0.19 |
| Barren Land | 1.04 | 0.01 | 1.04 | 0.02 | 1.06 | 0.02 | 1.06 | 0.02 |
| Deciduous Forest | 9.81 | 1.14 | 8.92 | 1.64 | 9.21 | 1.06 | 9.21 | 1.06 |
| Evergreen Forest | 12.87 | 0.44 | 12.2 | 0.64 | 12.35 | 0.43 | 12.35 | 0.43 |
| Mixed Forest | 2.9 | 0.36 | 2.67 | 0.54 | 3.27 | 0.41 | 3.27 | 0.41 |
| Shrub/Scrub | 23.29 | 0.25 | 23.19 | 0.38 | 22.95 | 0.25 | 22.95 | 0.25 |
| Grassland/Herbaceous | 13.98 | 0.28 | 14.25 | 0.45 | 14.07 | 0.27 | 14.07 | 0.27 |
| Pasture/Hay | 6.51 | 1.23 | 5.37 | 1.7 | 6.26 | 1.08 | 6.26 | 1.08 |
| Cultivated Crops | 16.78 | 0.82 | 16.68 | 1.62 | 16.05 | 0.78 | 16.05 | 0.78 |
| Woody Wetlands | 4.51 | 0.24 | 4.36 | 0.42 | 4.54 | 0.25 | 4.54 | 0.25 |
| Emergent Herbaceous Wetlands | 1.39 | 0.04 | 1.36 | 0.06 | 1.59 | 0.05 | 1.59 | 0.05 |
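The "mode-based resampling" columns compare a finer land cover grid to the coarser HISDAC grid after assigning each coarse cell its most frequent fine-grid class. The sketch below illustrates that general technique on a toy grid. It is an assumption-laden simplification rather than the authors' procedure: it uses an integer block factor (the real 30 m and 250 m grids do not nest evenly), arbitrary integer class codes, and a tie-breaking rule that the source does not specify. The 1-hot encoding variant is not sketched because the source does not describe it.

```python
from collections import Counter


def mode_resample(grid, factor):
    """Aggregate a 2D class grid by taking the most frequent class per block.

    `grid` is a list of equal-length rows; `factor` is the (integer) number of
    fine cells per coarse cell along each axis. Ties are resolved by whatever
    `Counter.most_common` returns first, which is an arbitrary choice here.
    """
    rows, cols = len(grid), len(grid[0])
    if rows % factor or cols % factor:
        raise ValueError("grid dimensions must be multiples of factor")
    coarse = []
    for r in range(0, rows, factor):
        coarse_row = []
        for c in range(0, cols, factor):
            block = [grid[rr][cc]
                     for rr in range(r, r + factor)
                     for cc in range(c, c + factor)]
            coarse_row.append(Counter(block).most_common(1)[0][0])
        coarse.append(coarse_row)
    return coarse


def percent_covered(classes, covered, target):
    """Percentage of coarse cells of class `target` that are flagged as covered."""
    flags = [cov for cls_row, cov_row in zip(classes, covered)
             for cls, cov in zip(cls_row, cov_row) if cls == target]
    if not flags:
        raise ValueError(f"no cells of class {target!r}")
    return 100.0 * sum(flags) / len(flags)


if __name__ == "__main__":
    fine = [
        [11, 11, 21, 21],
        [11, 42, 21, 22],
        [42, 42, 82, 82],
        [42, 41, 82, 81],
    ]
    coarse = mode_resample(fine, 2)
    covered = [[False, True], [False, True]]  # hypothetical HISDAC coverage flags
    print(coarse)
    print(percent_covered(coarse, covered, 21))
```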
Grid-cell-level comparison of ZTRAX land use classes and LCZ 2016–2018 urban land use categories.
| LCZ class | % of LCZ class | | | | % of HISDAC class | | | |
|---|---|---|---|---|---|---|---|---|
| | Mode-based resampling | | 1-hot encoding | | Mode-based resampling | | 1-hot encoding | |
| | not covered by HISDAC | covered by HISDAC | not covered by HISDAC | covered by HISDAC | not covered by HISDAC | covered by HISDAC | not covered by HISDAC | covered by HISDAC |
| Compact highrise | 43.98 | 56.02 | 66.29 | 33.71 | 0 | 0 | 0 | 0 |
| Compact midrise | 11.57 | 88.43 | 17.07 | 82.93 | 0 | 0.01 | 0 | 0.01 |
| Compact lowrise | 0.41 | 99.59 | 1.32 | 98.68 | 0 | 0 | 0 | 0.01 |
| Open highrise | 21.68 | 78.32 | 34.11 | 65.89 | 0 | 0 | 0 | 0 |
| Open midrise | 64.99 | 35.01 | 67.93 | 32.07 | 0 | 0 | 0.01 | 0 |
| Open lowrise | 27.42 | 72.58 | 35.96 | 64.04 | 1.12 | 2.97 | 1.98 | 3.53 |
| Lightweight low-rise | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Large lowrise | 38.15 | 61.85 | 41.51 | 58.49 | 0.03 | 0.05 | 0.06 | 0.08 |
| Sparsely built | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Heavy Industry | 89.09 | 10.91 | 87.71 | 12.29 | 0.01 | 0 | 0.03 | 0 |
| Dense trees | 87.77 | 12.23 | 85.54 | 14.46 | 22.78 | 3.17 | 28.6 | 4.83 |
| Scattered trees | 91.92 | 8.08 | 87.96 | 12.04 | 15.7 | 1.38 | 28.41 | 3.89 |
| Bush, scrub | 99.01 | 0.99 | 98.75 | 1.25 | 15.1 | 0.15 | 20.01 | 0.25 |
| Low plants | 89.82 | 10.18 | 88.21 | 11.79 | 29.74 | 3.37 | 37.83 | 5.05 |
| Bare rock or paved | 99.04 | 0.96 | 98.54 | 1.46 | 1.39 | 0.01 | 2.04 | 0.03 |
| Bare soil or sand | 99.34 | 0.66 | 99.02 | 0.98 | 10.2 | 0.07 | 14.2 | 0.14 |
| Water | 97.49 | 2.51 | 93.01 | 6.99 | 3.92 | 0.1 | 5.75 | 0.43 |
Fig. 5 Cross-comparison of ZTRAX records and building demolition records in Colorado. The bar charts show the proportions of demolished buildings in different categories established by comparing demolition year and the year built on record in ZTRAX, separately for ZTRAX records reported as vacant and non-vacant. Urban counties have RUCC 1–5, rural counties have RUCC 6–9.
Estimated likelihoods of land use transitions over time.
| Initial land use type (rows) / new land use type (columns) | RES-INCOME | RES-OWNED | COM | IND | AG | REC |
|---|---|---|---|---|---|---|
| RES-INCOME | | possible | possible | unlikely | unlikely | unlikely |
| RES-OWNED | possible | | possible | unlikely | unlikely | unlikely |
| COM | possible | possible | | unlikely | unlikely | unlikely |
| IND | possible | possible | possible | | unlikely | possible |
| AG | possible | possible | unlikely | unlikely | | possible |
| REC | unlikely | unlikely | unlikely | unlikely | unlikely | |
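The transition likelihoods can be encoded as a lookup and used to flag implausible class changes in a per-cell time series. The sketch below assumes that each row of the table lists its five values in column order with the initial class itself skipped (that is, the diagonal is empty); the names `TRANSITIONS` and `flag_unlikely` are hypothetical, and this screening step is an illustration, not a procedure described in the source.

```python
CLASSES = ["RES-INCOME", "RES-OWNED", "COM", "IND", "AG", "REC"]

# One list per initial class, in column order, skipping the class itself
# (assumption: the table leaves the diagonal empty).
ROWS = {
    "RES-INCOME": ["possible", "possible", "unlikely", "unlikely", "unlikely"],
    "RES-OWNED": ["possible", "possible", "unlikely", "unlikely", "unlikely"],
    "COM": ["possible", "possible", "unlikely", "unlikely", "unlikely"],
    "IND": ["possible", "possible", "possible", "unlikely", "possible"],
    "AG": ["possible", "possible", "unlikely", "unlikely", "possible"],
    "REC": ["unlikely", "unlikely", "unlikely", "unlikely", "unlikely"],
}


def build_transitions():
    """Map (initial class, new class) pairs to 'possible' or 'unlikely'."""
    lookup = {}
    for initial, values in ROWS.items():
        targets = [c for c in CLASSES if c != initial]
        for new, likelihood in zip(targets, values):
            lookup[(initial, new)] = likelihood
    return lookup


TRANSITIONS = build_transitions()


def flag_unlikely(sequence):
    """Return (index, from, to) for each class change rated 'unlikely'."""
    flagged = []
    for i in range(1, len(sequence)):
        before, after = sequence[i - 1], sequence[i]
        if before != after and TRANSITIONS[(before, after)] == "unlikely":
            flagged.append((i, before, after))
    return flagged


if __name__ == "__main__":
    # Hypothetical majority class of one grid cell at four time steps.
    print(flag_unlikely(["AG", "AG", "REC", "COM"]))
```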
Fig. 6 Visual assessment of Bing overhead imagery collected at the locations of a stratified random sample of ZTRAX records for the six land use classes used herein: (a) residential (owned), (b) residential (income), (c) commercial, (d) industrial, (e) agricultural, and (f) recreational. The images collected at each location per land use class are arranged based on their color similarity, using color moments and t-distributed stochastic neighbour embedding (t-SNE). The small patches to the right of each mosaic show exemplary enlargements, providing further detail on the building characteristics at each ZTRAX location. The yellow rectangles show the locations of the enlargements (the upper enlargement corresponds to the upper of the two rectangles per land use class).
| Measurement(s) | Structural Land Use |
| Technology Type(s) | Python |
| Sample Characteristic - Organism | Structures |
| Sample Characteristic - Environment | Built Environment |
| Sample Characteristic - Location | United States |