| Literature DB >> 23794864 |
Abstract
Occurrence records for named, native Australian millipedes from the Global Biodiversity Information Facility (GBIF) and the Atlas of Living Australia (ALA) were compared with the same records from the Millipedes of Australia (MoA) website, compiled independently by the author. The comparison revealed some previously unnoticed errors in MoA, and a much larger number of errors and other problems in the aggregated datasets. Errors have been corrected in MoA and in some data providers' databases, but will remain in GBIF and ALA until data providers have supplied updates to these aggregators. An audit by a specialist volunteer, as reported here, is not a common occurrence. It is suggested that aggregators should do more, or more effective, data checking and should query data providers when possible errors are detected, rather than simply disclaim responsibility for aggregated content.Entities:
Keywords: ALA; Australia; Diplopoda; GBIF; Millipede; data cleaning; data quality; occurrence records
Year: 2013 PMID: 23794864 PMCID: PMC3677402 DOI: 10.3897/zookeys.293.5111
Source DB: PubMed Journal: Zookeys ISSN: 1313-2970 Impact factor: 1.546
Figure 1.Illustration of ‘uncertainty’, ‘distance’ and ‘offset’. In MoA, spatial uncertainty is defined using the point-radius method, where a site is assumed to be at the centre of a circle whose radius is the uncertainty u. In both diagrams, d is the Euclidean distance between the MoA estimate of the site’s location (blue cross) and the aggregator estimate (red square). The offset o is the distance d minus the uncertainty u. In diagram 1 the aggregator site is within the circle of uncertainty surrounding the MoA site, and the offset is negative. In diagram 2 the aggregator site is outside the circle of uncertainty and the offset is positive
Figure 2.Exclusions from the GBIF and ALA datasets (see text for details). A not identified to species or only tentatively identified to species B undescribed species C non-native species D no latitude and longitude E manuscript names F miscellaneous duplicates G Penicillata H ALA preliminary exclusions (not in Australia, images, provider K observations). D and F categories do not include records already excluded for taxonomic reasons