| Literature DB >> 35058477 |
Linda See1, Ivelina Georgieva2, Martina Duerauer2, Thomas Kemper3, Christina Corbane3, Luca Maffenini3, Javier Gallego3, Martino Pesaresi3, Flavius Sirbu4, Rekib Ahmed5, Kateryna Blyshchyk6, Brigitte Magori4, Volodymyr Blyshchyk7, Oleksandr Melnyk7, Roman Zadorozhniuk7, Marian-Traian Mandici8, Yuan-Fong Su9,10, Ahmed Harb Rabia11, Ana Pérez-Hoyos3, Roman Vasylyshyn7, Chandra Kant Pawe12, Svitlana Bilous7,13, Serhii B Kovalevskyi7, Sergii S Kovalevskyi7, Kusumbor Bordoloi5, Andrii Bilous7, Kripal Panging5, Valentyn Bilous7, Reinhard Prestele14, Dhrubajyoti Sahariah5, Anjan Deka5, Nityaranjan Nath5, Rui Neves15, Viktor Myroniuk7, Mathias Karner2, Steffen Fritz2.
Abstract
Several global high-resolution built-up surface products have emerged over the last five years, taking full advantage of open sources of satellite data such as Landsat and Sentinel. However, these data sets require validation that is independent of the producers of these products. To fill this gap, we designed a validation sample set of 50 K locations using a stratified sampling approach independent of any existing global built-up surface products. We launched a crowdsourcing campaign using Geo-Wiki ( https://www.geo-wiki.org/ ) to visually interpret this sample set for built-up surfaces using very high-resolution satellite images as a source of reference data for labelling the samples, with a minimum of five validations per sample location. Data were collected for 10 m sub-pixels in an 80 × 80 m grid to allow for geo-registration errors as well as the application of different validation modes including exact pixel matching to majority or percentage agreement. The data set presented in this paper is suitable for the validation and inter-comparison of multiple products of built-up areas.Entities:
Year: 2022 PMID: 35058477 PMCID: PMC8776881 DOI: 10.1038/s41597-021-01105-4
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Screenshot from the Geo-Wiki Global Built-up Surface Validation branch showing an example of the data collection screen for built-up surfaces.
Fig. 2Screenshots of the (a) left hand and (b) right hand panels of the Geo-Wiki Global Built-up Surface Validation branch.
Fig. 3An example of the feedback provided on a control point.
Fig. 4Schematic showing how the items in the data record are organized by grid and sub-pixel.
The attributes and descriptions associated with the first two items in the data record.
| Attribute | Description |
|---|---|
| SubmissionID | Unique ID of the submission |
| PointID | Unique ID of the grid from 1 to 50000 |
| UserID | Unique ID of the user |
| Timestamp | Date and time at which the validation was made |
| X_Centroid | The longitude of the centroid of the 8 × 8 grid |
| Y_Centroid | The latitude of the centroid of the 8 × 8 grid |
| BingImageDate | Date of the Microsoft Bing image |
| GoogleImageDate | Date of the Google Maps image |
| *ControlPoint | Yes |
| No | |
| Legend | Change |
| LegendItemID | 4080 = Yes |
| 4081 = No | |
| 4082 = Not sure | |
| LegendItemName | Yes |
| No | |
| Not sure | |
| Null (if skipped) | |
| *QualityofChange | 100 = agreement with majority (if non-control point) or the control/includes not sure as majority (and yes/no answers) |
| 50 = majority was split between change and no-change (if non-control point) | |
| 0 = no agreement with the majority (if non-control point) or the control | |
| Null = not applicable if a non-built-up area or skipped | |
| BuiltupCells | The number of cells that are built-up (from 0 to 64) or null if skipped |
| NonBuiltupCells | The number of cells that are not built-up (from 0 to 64) or null if skipped |
| DoNotKnowCells | The number of cells that are marked as ‘I don’t know’ (from 0 to 64) or null if skipped |
| SkipReason | 0 = not skipped |
| 1 = skipped because no Google imagery, low resolution or clouds | |
| 2 = skipped because too difficult | |
| *QualityofAnswer | 100 = agreement with majority (if non-control point) or the control over built-up/non-built-up |
| 50 = majority was split between built-up/non-built up | |
| 0 = no agreement with majority (if non-control point) or the control over built-up/non-built-up | |
| Null = skipped | |
| Comment | Text-based comment about the location if present or null (as this was optional) |
Attributes marked with an asterisk apply only to the first item (Geo-WikiBuilt-upCentroidsAll.csv).
The attributes and descriptions associated with items 3 and 4 in the data record.
| Attribute | Description |
|---|---|
| SubmissionID | Unique ID for the submission associated with a grid |
| SubmissionItemID | Unique ID for the submission item, i.e., unique for each individual cell |
| UserID | Unique ID of the user |
| PointID | Unique ID of the grid from 1 to 50000 |
| SubpixelID | Unique ID of the individual cell in a grid |
| Legend | Built-up |
| LegendItemID | 4002 = Built-up |
| 4006 = Not built-up | |
| 4089 = I don’t know | |
| LegendItemName | Built-up |
| Not built-up | |
| I don’t know | |
| SubpixelX_Centroid | The longitude of the centroid of the individual cell in the grid |
| SubpixelY_Centroid | The latitude of the centroid of the individual cell in the grid |
The attributes and descriptions associated with item 6 in the data record.
| Attribute | Description |
|---|---|
| PointID | Unique ID of the grid from 1 to 50000 |
| ControlPoint | Yes |
| No | |
| Legend | Change |
| LegendItemID | 4080 = Yes |
| 4081 = No | |
| 4082 = Not sure | |
| LegendItemName | Yes |
| No | |
| Not sure | |
| Null (if skipped) | |
| SkipReasonMajority | 0 = all not skipped |
| 1 = majority not skipped | |
| 2 = majority skipped | |
| 3 = no majority |
The attributes and descriptions associated with item 7 in the data record.
| Attribute | Description |
|---|---|
| PointID | Unique ID of the grid from 1 to 50000 |
| SubpixelID | Unique ID of the individual cell in a grid |
| ControlPoint | Yes |
| No | |
| Legend | Built-up |
| LegendItemID | 4002 = Built-up |
| 4006 = Not built-up | |
| 4089 = I don’t know | |
| LegendItemName | Built-up |
| Not built-up | |
| I don’t know | |
| SubpixelX_Centroid | The longitude of the centroid of the individual cell in the grid |
| SubpixelY_Centroid | The latitude of the centroid of the individual cell in the grid |
Fig. 5Global distribution of the sample points displayed as the total per 100 km2 pixels.
Fig. 6Global distribution of the built-up sample points displayed as the total by 100 km2 pixels.
Fig. 7Global distribution of the sample points where imagery is missing, low resolution or cloud-covered, displayed as the total by 100 km2 pixels.
Fig. 8Global distribution of the built-up sample points with change information, displayed as the total by 100 km2 pixels.
The comparison of expert data and all participant data summarized by categories of built-up.
| Expert control points | Class agreement (%) | ||||||
|---|---|---|---|---|---|---|---|
| Non-built-up | <25% | 25–50% | 50–75% | >75% | |||
| Participant points | Non-built-up | 0 | 670 | 585 | 592 | 372 | 0.0 |
| <25% | 0 | 6694 | 761 | 36 | 15 | 89.2 | |
| 25–50% | 0 | 694 | 7540 | 1314 | 40 | 78.6 | |
| 50–75% | 0 | 17 | 933 | 6240 | 730 | 78.8 | |
| >75% | 0 | 35 | 24 | 753 | 4035 | 83.3 | |
| Class agreement (%) | N/A | 82.5 | 76.6 | 69.8 | 77.7 | OA = 76.4% | |
Notes: OA is overall agreement; N/A is not applicable since control points had only built-up.
The comparison of expert data with the majority answer from participants, summarized by categories of built-up.
| Expert control points | Class agreement (%) | ||||||
|---|---|---|---|---|---|---|---|
| Non-built-up | <25% | 25–50% | 50–75% | >75% | |||
| Participant points | Non-built-up | 0 | 3 | 0 | 1 | 0 | 0 |
| <25% | 0 | 376 | 35 | 1 | 0 | 91.3 | |
| 25–50% | 0 | 32 | 498 | 79 | 2 | 81.5 | |
| 50–75% | 0 | 1 | 45 | 434 | 51 | 81.7 | |
| >75% | 0 | 0 | 1 | 37 | 261 | 84.5 | |
| Class agreement (%) | N/A | 91.3 | 86.0 | 78.6 | 83.1 | OA = 84.5% | |
Notes: OA is overall agreement; N/A is not applicable since control points had only built-up.
The comparison of expert data with answers from participants for 4, 10 and 20 classes of percentage built-up, by full agreement, percentage of classifications 1 class higher and 1 class lower than the experts, and the percentage remaining.
| Agreement (%) | One class higher than experts (%) | One class lower than experts (%) | Remaining (%) | |||||
|---|---|---|---|---|---|---|---|---|
| All data | Majority | All data | Majority | All data | Majority | All data | Majority | |
| 4 classes (25%) | 76.4 | 84.5 | 7.4 | 6.1 | 10.8 | 9.0 | 5.3 | 0.3 |
| 10 classes (10%) | 56.3 | 68.1 | 13.9 | 10.9 | 17.4 | 17.3 | 12.4 | 3.7 |
| 20 classes (5%) | 39.4 | 48.9 | 15.8 | 14.1 | 17.9 | 22.8 | 26.9 | 14.2 |
The total number of locations where there was full agreement, majority agreement and an equal split of built-up (BU) and non-built-up (NBU) without and including locations that were skipped.
| Skipping | Full agreement | Majority agreement | Equal BU & NBU | Total | ||||
|---|---|---|---|---|---|---|---|---|
| BU | NBU | BU + NBU | BU | NBU | BU + NBU | |||
| No locations skipped | 565 [1.56%) | 35564 [98.44%] | 36129 (93.75%) | 1472 [61.28%] | 930 [38.72%] | 2402 (6.23%) | 6 (0.02%) | 38537 |
| Including locations skipped | 842 [1.97%) | 41986 [98.03%] | 42828 (93.79%) | 1735 [63.83%] | 983 [36.17%] | 2718 (5.95%) | 118 (0.26%) | 45664 |
| Locations completely skipped | N/A | N/A | N/A | N/A | N/A | N/A | N/A | 1177 |
N/A = not applicable.
The numbers in round brackets indicate the percentage of full and majority agreement and equal BU/NBU from the total, while numbers in square brackets indicate the percentage of BU and NBU within each category of agreement.
Fig. 9Validation modes can be applied to individual sub-pixels or blocks of sub-pixels as follows: A: 2 × 2 or 4 sub-pixels, B: 4 × 4 or 16 sub-pixels, C: 6 × 6 or 36 sub-pixels or the full grid D: 8 × 8 sub-pixels or 64 sub-pixels.
| Measurement(s) | built-up areas |
| Technology Type(s) | visual interpretation of satellite imagery |
| Factor Type(s) | geographic location |
| Sample Characteristic - Environment | land • area of developed space |
| Sample Characteristic - Location | global |