| Literature DB >> 28949323 |
Juan Carlos Laso Bayas1, Myroslava Lesiv1, François Waldner2, Anne Schucknecht3,4, Martina Duerauer1, Linda See1, Steffen Fritz1, Dilek Fraisl1, Inian Moorthy1, Ian McCallum1, Christoph Perger1, Olha Danylo1, Pierre Defourny2, Javier Gallego3, Sven Gilliams5, Ibrar Ul Hassan Akhtar6,7, Swarup Jyoti Baishya8, Mrinal Baruah8, Khangsembou Bungnamei8, Alfredo Campos9,10, Trishna Changkakati8, Anna Cipriani11,12, Krishna Das8, Keemee Das8, Inamani Das8, Kyle Frankel Davis13,14, Purabi Hazarika8, Brian Alan Johnson15, Ziga Malek16, Monia Elisa Molinari17, Kripal Panging8, Chandra Kant Pawe8, Ana Pérez-Hoyos3, Parag Kumar Sahariah18, Dhrubajyoti Sahariah8, Anup Saikia8, Meghna Saikia19, Peter Schlesinger20,21, Elena Seidacaru22, Kuleswar Singha8, John W Wilson23.
Abstract
A global reference data set on cropland was collected through a crowdsourcing campaign using the Geo-Wiki crowdsourcing tool. The campaign lasted three weeks, with over 80 participants from around the world reviewing almost 36,000 sample units, focussing on cropland identification. For quality assessment purposes, two additional data sets are provided. The first is a control set of 1,793 sample locations validated by students trained in satellite image interpretation. This data set was used to assess the quality of the crowd as the campaign progressed. The second data set contains 60 expert validations for additional evaluation of the quality of the contributions. All data sets are split into two parts: the first part shows all areas classified as cropland and the second part shows cropland average per location and user. After further processing, the data presented here might be suitable to validate and compare medium and high resolution cropland maps generated using remote sensing. These could also be used to train classification algorithms for developing new maps of land cover and cropland extent.Entities:
Year: 2017 PMID: 28949323 PMCID: PMC5613736 DOI: 10.1038/sdata.2017.136
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1Schematic representation of the design and implementation of the crowdsourcing campaign to collect reference samples designed for cropland map validation, implemented using the Geo-Wiki (http://www.geo-wiki.org/) crowdsourcing tool.
Strata, sample distribution and strata sizes in the cropland validation campaign.
| Calculated weights needed for computing accuracy indexes are also shown. | ||||
|---|---|---|---|---|
| 1 (0%) | 500 | 1.39 | 84.01 | 168026 |
| 2 (0–25%) | 10960 | 30.56 | 18.76 | 1712 |
| 3 (25–75%) | 15984 | 44.57 | 14.66 | 917 |
| 4 (>75%) | 8422 | 23.48 | 16.47 | 1955 |
Figure 2The Geo-Wiki interface (http://www.geo-wiki.org) for collecting cropland information based on image interpretation.
(a) is the sub-grid of pixels that users must classify; (b) is the Submit button that users must press once they have completed their interpretation; (c) allows the user to change the background imagery; (d) shows the ‘View in Google Earth’ button, which users can press to be shown the location in Google Earth so that that they can view historical imagery; and (e) shows the NDVI profiles that can be viewed when the user clicks on a location.
Figure 3Definition and examples of cropland (yellow shading) and areas of non-cropland as shown in a gallery of examples on Geo-Wiki (http://www.geo-wiki.org/Application/modules/sigma_validation/sigma_gallery.html).
Quality score calculation per location. Units for agreement are in number of grid cells/sub-pixels per 300 m×300 m location.
| 25 | 25 | 12 | −1 |
| 24 | 23 | 11 | −3 |
| 23 | 21 | 10 | −5 |
| 22 | 19 | 9 | −7 |
| 21 | 17 | 8 | −9 |
| 20 | 15 | 7 | −11 |
| 19 | 13 | 6 | −13 |
| 18 | 11 | 5 | −15 |
| 17 | 9 | 4 | −17 |
| 16 | 7 | 3 | −19 |
| 15 | 5 | 2 | −21 |
| 14 | 3 | 1 | −23 |
| 13 | 1 | 0 | −25 |
Financial rewards offered according to the final ranking of the participants.
| 1 | € 750 |
| 2 | € 500 |
| 3 | € 300 |
| 4 | € 100 |
| 5 | € 85 |
| 6 | € 65 |
| 7–9 | € 50 |
| 10–30 | € 25 |
The format and field descriptions of data records containing all grid cells marked as cropland.
| location_id | Numeric, continuous | Unique number identifying each location in the campaign. | 47286 |
| userid | Numeric, continuous | Numeric field used to uniquely identify participants/users | 11182 |
| sub_id | Numeric, continuous | Sequentially assigned number identifying every submission done in the system | 383725 |
| comment | Text | Comments entered by the participant | Apparent pastures |
| timestamp | Date and time | Exact time and date when the submission was entered into the system | 2016-09-16 13:20:19 |
| used_gmaps | Yes=‘t’No=‘f’ | Registers whether the participant was viewing the Google background imagery when the submission was done | t |
| viewed_ge | Yes=‘t’No=‘f’ | Registers whether the participant pressed the button labelled View in Google Earth | f |
| skip_reason | Numeric, categorical | Registers whether the participant did not skip the point ( | 0 |
| sub_item_id | Numeric, continuous | Unique identifier of each grid cell classified as cropland at a given location by a given user | 10579829 |
| sub_item_x | Numeric, continuous | Longitude of each grid cell centroid inside a frame/location (decimal degrees) | 30.95357144 |
| sub_item_y | Numeric, continuous | Latitude of each grid cell centroid inside a frame/location (decimal degrees) | −20.75119048 |
Format and field descriptions of data records containing average (mean) cropland per frame/location and user.
| location_id | Numeric, continuous | Unique number identifying each location in the campaign. | 47286 |
| userid | Numeric, continuous | Numeric field used to uniquely identify the participants/users. | 5 |
| sumcrop | Numeric, continuous | Average (mean) cropland at a given location in percentage | 80 |
| loc_cent_X | Numeric, continuous | Longitude of a frame/location centroid (decimal degrees) | −39.75 |
| loc_cent_Y | Numeric, continuous | Latitude of a frame/location centroid (decimal degrees) | −8.047619048 |
Figure 4Geographical location, previous knowledge and general information from the participants who filled in the survey at the end of the cropland validation campaign (n=50).
Figure 5Cropland validation campaign and worldwide spatial distribution of cropland.
The (a) presents cropland data collected during the cropland validation campaign, showing the mean cropland percentage per location and on the (b) the IIASA-IFPRI hybrid cropland map is shown for comparison. The third (c) shows the number of validations at each location during the campaign.