| Literature DB >> 32330167 |
Daniel Runfola1,2, Austin Anderson2, Heather Baier2, Matt Crittenden2, Elizabeth Dowker2, Sydney Fuhrig2, Seth Goodman1,2, Grace Grimsley2, Rachel Layko2, Graham Melville2, Maddy Mulder2, Rachel Oberman3, Joshua Panganiban2, Andrew Peck2, Leigh Seitz4, Sylvia Shea2, Hannah Slevin2, Rebecca Youngerman5, Lauren Hobbs6.
Abstract
We present the geoBoundaries Global Administrative Database (geoBoundaries): an online, open license resource of the geographic boundaries of political administrative divisions (i.e., state, county). Contrasted to other resources geoBoundaries (1) provides detailed information on the legal open license for every boundary in the repository, and (2) focuses on provisioning highly precise boundary data to support accurate, replicable scientific inquiry. Further, all data is released in a structured form, allowing for the integration of geoBoundaries with large-scale computational workflows. Our database has records for every country around the world, with up to 5 levels of administrative hierarchy. The database is accessible at http://www.geoboundaries.org, and a static version is archived on the Harvard Dataverse.Entities:
Mesh:
Year: 2020 PMID: 32330167 PMCID: PMC7182183 DOI: 10.1371/journal.pone.0231866
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Current state of the geoBoundaries database.
All countries are shaded to indicate the depth of hierarchy of the administrative zones collected. Higher numbers indicate deeper hierarchies are available.
Minimal data schema for geoBoundaries files.
All fields noted in this table must be collected and validated for inclusion in a release. *URLs provided as exemplars only; within the database, full paths to exact landing pages from which data was retrieved are included.
| Field Name | Description | Type | Example | Unique Vals. |
|---|---|---|---|---|
| boundaryID | A unique ID created for every boundary in the geoBoundaries database by concatenating ISO 3166-1 3 letter country code, boundary level, geoBoundaries version, and an incremental ID. | String | ‘AFG-ADM1-2-0-0-G1’ | 632 |
| boundaryISO | The ISO 3166-1 3-letter country codes for each boundary. | String | ‘ARM’ | 198 |
| boundaryYear | The year for which a boundary is representative. | Integer | 2018 | 18 |
| boundaryType | The type of boundary defined. | String | ‘ADM 1’ | 10 |
| boundarySource-K | The name of the | String | ‘Government of Armenia’ | 189 |
| boundaryLicense | The specific license the data is released under. | String | ‘Creative Commons Attribution 4.0 International (CC BY 4.0)’ | 24 |
| licenseDetail | Any details necessary for the interpretation or use of the license noted. | String | ‘Free to adapt and redistribute’ | 58 |
| licenseSource* | A resolvable URL (checked at the time of data release) declaring the license under which a data product is made available. | String | ‘ | 145 |
| boundarySourceURL* | A resolvable URL (checked at the time of data release) from which source data was retrieved. | String | ‘ | 136 |
| boundaryUpdate | A date encoded following ISO 8601 (Year-Month-Date) describing the last date this boundary was updated, for use in programmatic updating based on new releases. | String | ‘2019-12-23’ | 23 |
| downloadURL | A URL from which the geoBoundary can be downloaded. | String | See | 632 |
A summary of license types currently included in the geoBoundaries dataset.
Explicit detail on the license for every boundary is provided in the metadata.
| License Name (Source) | Count of Boundary Sets |
|---|---|
| Open Data Commons Open Database License 1.0 | 402 |
| Creative Commons Attribution (Various Versions) | 61 |
| Public Domain | 12 |
| Open Government Licence (v3.0 and v1.0) | 7 |
| MIMU Data License (MIMU) | 4 |
| Other License Types | 131 |
Data schema for individual shapes in geoBoundaries.
Fields denoted with a * must be populated for inclusion into the database; other fields are considered optional. Some fields are replicated from the data schema for geoBoundaries files, so that users do not need to join different files for common use cases.
| Field Name | Description | Type | Example |
|---|---|---|---|
| shapeID* | The boundary ID, followed by the letter ‘B’ and a unique integer for each shape which is a member of that boundary. | String | ‘AFG-ADM1-2-0-0-B1’ |
| shapeName | The identified name for a given shape. ‘None’ if not identified. | String | ‘Atome’ |
| shapeGroup* | The country or similar organizational group that a shape belongs to, in ISO 3166-1 where relevant. | String | ‘AFG’ |
| shapeType* | The type of boundary represented by the shape. | String | ‘ADM 1’ |
| shapeISO | ISO codes for individual administrative districts, where available. Where possible, these conform to ISO 3166-2, but this is not guaranteed in all cases. ‘None’ if not identified. | String | ‘AF-SAM’ |
Fig 2Example file structure of the geoBoundaries data product.
This structure can be used to construct a download URL for any file in the database—for example, https://geoboundaries.org/data/geoBoundaries-2_0_0/JPN/ADM0/geoBoundaries-2_0_0-JPN-ADM0-shp.zip can be used to download the shapefile for the specified country and ADM level.