| Literature DB >> 35761991 |
Johannes H Uhl1,2,3, Stefan Leyk2,3.
Abstract
Despite abundant data on the spatial distribution of contemporary human settlements, historical datasets on the long-term evolution of human settlements at fine spatial and temporal granularity are scarce, limiting our quantitative understanding of long-term changes of built-up areas. This is because commonly used large-scale mapping methods (e.g., computer vision) and suitable data sources (i.e., aerial imagery, remote sensing data, LiDAR data) have only been available in recent decades. However, there are alternative data sources such as cadastral records that are digitally available, containing relevant information such as building construction dates, allowing for an approximate, digital reconstruction of past building distributions. We conducted a non-exhaustive search of open and publicly available data resources from administrative institutions in the United States and gathered, integrated, and harmonized cadastral parcel data, tax assessment data, and building footprint data for 33 counties, wherever building footprint geometries and building construction year information was available. The result of this effort is a unique dataset that we call the Multi-Temporal Building Footprint Dataset for 33 U.S. Counties (MTBF-33). MTBF-33 contains over 6.2 million building footprints including their construction year, and can be used to derive retrospective depictions of built-up areas from 1900 to 2015, at fine spatial and temporal grain. Moreover, MTBF-33 can be employed for data validation purposes, or to train statistical learning models aiming to extract historical information on human settlements from remote sensing data, historical maps, or similar data sources.Entities:
Keywords: Building stock; Built environment; Change detection; Historical GIS; Human settlements; Long-term land use change; Remote sensing; Urbanization; historical spatial data
Year: 2022 PMID: 35761991 PMCID: PMC9233215 DOI: 10.1016/j.dib.2022.108369
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1MTBF-33 multi-temporal building footprint data examples, shown for (a) Boulder (Colorado) (b) Sarasota (Florida), (c) Boston (Massachusetts), and (d) Minneapolis (Minnesota).
Fig. 2MTBF-33 multi-temporal building footprint dataset, showing examples for most of the 33 counties covered in MTBF-33. Water mask in grey derived from GHS-BUILT R2018A (epoch 2014, [2]). Counties are sorted by their FIPS in the same order as shown in Table 1 (upper left: Boulder County, lower right: Mecklenburg county). Parts of New York County and Kings County (New York City) are jointly shown as “Manhattan”. Not shown are Queens and Richmond counties (New York City).
Overview and statistics of the 33 counties covered in MTBF-33.
| County | County | Buildings w/ | Total | Percent | Year built | Year built | Year built | Year built | |
|---|---|---|---|---|---|---|---|---|---|
| FIPS | Name | State | valid year built | buildings | complete | minimum | maximum | mean | median |
| 08013 | Boulder | Colorado | 76,929 | 80,255 | 95.9 | 1858 | 2014 | 1968 | 1971 |
| 12057 | Hillsborough | Florida | 410,076 | 421,046 | 97.4 | 1842 | 2014 | 1981 | 1984 |
| 12081 | Manatee | Florida | 154,416 | 173,173 | 89.2 | 1870 | 2013 | 1980 | 1982 |
| 12115 | Sarasota | Florida | 194,043 | 198,685 | 97.7 | 1877 | 2013 | 1981 | 1981 |
| 18163 | Vanderburgh | Indiana | 93,797 | 108,798 | 86.2 | 1810 | 2015 | 1951 | 1950 |
| 24005 | Baltimore | Maryland | 281,216 | 308,933 | 91.0 | 1676 | 2015 | 1958 | 1958 |
| 25001 | Barnstable | Massachusetts | 172,542 | 185,818 | 92.9 | 1626 | 2014 | 1961 | 1971 |
| 25003 | Berkshire | Massachusetts | 82,036 | 89,790 | 91.4 | 1650 | 2015 | 1937 | 1950 |
| 25005 | Bristol | Massachusetts | 225,156 | 233,704 | 96.3 | 1500 | 2013 | 1948 | 1957 |
| 25007 | Dukes | Massachusetts | 18,758 | 23,524 | 79.7 | 1660 | 2014 | 1963 | 1980 |
| 25009 | Essex | Massachusetts | 224,351 | 270,398 | 83.0 | 1600 | 2014 | 1937 | 1947 |
| 25011 | Franklin | Massachusetts | 43,436 | 50,209 | 86.5 | 1666 | 2013 | 1936 | 1951 |
| 25013 | Hampden | Massachusetts | 192,281 | 207,195 | 92.8 | 1600 | 2015 | 1947 | 1953 |
| 25015 | Hampshire | Massachusetts | 69,505 | 77,982 | 89.1 | 1629 | 2014 | 1947 | 1960 |
| 25017 | Middlesex | Massachusetts | 460,722 | 500,047 | 92.1 | 1600 | 2015 | 1942 | 1950 |
| 25019 | Nantucket | Massachusetts | 13,547 | 13,971 | 97.0 | 1640 | 2011 | 1962 | 1983 |
| 25021 | Norfolk | Massachusetts | 216,150 | 242,631 | 89.1 | 1500 | 2015 | 1944 | 1951 |
| 25023 | Plymouth | Massachusetts | 207,264 | 230,788 | 89.8 | 1600 | 2015 | 1950 | 1962 |
| 25025 | Suffolk | Massachusetts | 106,037 | 109,876 | 96.5 | 1637 | 2015 | 1924 | 1920 |
| 25027 | Worcester | Massachusetts | 317,302 | 344,307 | 92.2 | 1650 | 2015 | 1948 | 1957 |
| 27003 | Anoka | Minnesota | 128,498 | 135,307 | 95.0 | 1852 | 2015 | 1977 | 1979 |
| 27019 | Carver | Minnesota | 40,488 | 41,768 | 96.9 | 1816 | 2015 | 1969 | 1984 |
| 27037 | Dakota | Minnesota | 145,903 | 163,179 | 89.4 | 1832 | 2014 | 1973 | 1978 |
| 27053 | Hennepin | Minnesota | 380,301 | 387,856 | 98.1 | 1843 | 2010 | 1955 | 1956 |
| 27123 | Ramsey | Minnesota | 239,544 | 245,279 | 97.7 | 1850 | 2015 | 1946 | 1951 |
| 27163 | Washington | Minnesota | 86,216 | 95,014 | 90.7 | 1742 | 2015 | 1973 | 1983 |
| 34025 | Monmouth | New Jersey | 206,624 | 212,951 | 97.0 | 1684 | 2015 | 1961 | 1963 |
| 36005 | Bronx | New York | 102,658 | 103,865 | 98.8 | 1780 | 2015 | 1941 | 1931 |
| 36047 | Kings | New York | 329,283 | 331,813 | 99.2 | 1800 | 2015 | 1931 | 1925 |
| 36061 | New York | New York | 45,322 | 46,209 | 98.1 | 1765 | 2014 | 1921 | 1910 |
| 36081 | Queens | New York | 454,506 | 457,628 | 99.3 | 1661 | 2015 | 1939 | 1935 |
| 36085 | Richmond | New York | 138,609 | 140,050 | 99.0 | 1665 | 2014 | 1962 | 1969 |
| 37119 | Mecklenburg | North Carolina | 402,242 | 418,056 | 96.2 | 1792 | 2015 | 1980 | 1984 |
Results of the agreement assessment between MTBF-33 (in 2015) and Microsoft building footprint data.
| Agreement measure | All counties | Higher-density counties | Lower-density counties |
|---|---|---|---|
| Overall accuracy | 0.933 | 0.969 | 0.919 |
| Precision | 0.956 | 0.960 | 0.954 |
| Recall | 0.900 | 0.990 | 0.853 |
| F-measure | 0.928 | 0.974 | 0.901 |
| Kappa index | 0.865 | 0.935 | 0.833 |
Fig. 3Comparison of retrospective building footprint distributions to historical maps for Boulder, Colorado, in 1904 (scale 1:62,500), 1957 (scale 1:62,500), and 1984 (scale 1:100,000). Map source: USGS-HTMC.
| Subject | Geography |
| Specific subject area | Urban change detection, long-term land development, built environment, human settlements |
| Type of data | Geospatial vector data |
| How the data were acquired | Data were collected, integrated, and harmonized from web-based, open and publicly available sources published by local or regional governmental organizations, such as county or state governments. |
| Data format | Raw: ESRI Shapefile, ESRI File Geodatabase, Excel spreadsheets, CSV files. Filtered: ESRI Shapefile |
| Description of data collection | We identified U.S. counties or states that provide building footprint data and cadastral parcel data attributed with building construction year information. In a non-exhaustive search we identified 33 U.S. counties where these criteria were met. We integrated and harmonized these data to create geospatial vector datasets holding over 6.2 million building footprints attributed with their construction year. |
| Data source location | Source data was collected in 2016 from the following resources: |
| Data accessibility | Repository name: Mendeley Data |
| Related research article | S. Leyk, J. H. Uhl, D. Balk, B. Jones, Assessing the accuracy of multi-temporal built-up land layers across rural-urban trajectories in the United States, Remote Sens. Environ. 204 (2018). |