| Literature DB >> 22859890 |
Andrew Hill1, Robert Guralnick, Arfon Smith, Andrew Sallans, Michael Denslow, Joyce Gross, Zack Murrell, Peter Oboyski, Joan Ball, Andrea Thomer, Robert Prys-Jones, Javier de Torre, Patrick Kociolek, Lucy Fortson.
Abstract
Legacy data from natural history collections contain invaluable and irreplaceable information about biodiversity in the recent past, providing a baseline for detecting change and forecasting the future of biodiversity on a human-dominated planet. However, these data are often not available in formats that facilitate use and synthesis. New approaches are needed to enhance the rates of digitization and data quality improvement. Notes from Nature provides one such novel approach by asking citizen scientists to help with transcription tasks. The initial web-based prototype of Notes from Nature is soon widely available and was developed collaboratively by biodiversity scientists, natural history collections staff, and experts in citizen science project development, programming and visualization. This project brings together digital images representing different types of biodiversity records including ledgers , herbarium sheets and pinned insects from multiple projects and natural history collections. Experts in developing web-based citizen science applications then designed and built a platform for transcribing textual data and metadata from these images. The end product is a fully open source web transcription tool built using the latest web technologies. The platform keeps volunteers engaged by initially explaining the scientific importance of the work via a short orientation, and then providing transcription "missions" of well defined scope, along with dynamic feedback, interactivity and rewards. Transcribed records, along with record-level and process metadata, are provided back to the institutions. While the tool is being developed with new users in mind, it can serve a broad range of needs from novice to trained museum specialist. Notes from Nature has the potential to speed the rate of biodiversity data being made available to a broad community of users.Entities:
Keywords: Biodiversity; Citizen Science; Digitization; Museum Collections; Natural History Museums; Open Source; Transcription
Year: 2012 PMID: 22859890 PMCID: PMC3406478 DOI: 10.3897/zookeys.209.3472
Source DB: PubMed Journal: Zookeys ISSN: 1313-2970 Impact factor: 1.546
Figure 1.Organization of the Notes from Nature platform.
Figure 2.Example biocollections source images showing (a) The Natural History Museum, London bird specimen ledger; (b) The Southeast Regional Network of Expertise and Collections herbarium sheet label; (c) Calbug specimen and label image.
Figure 3.The Notes from Nature transcription tool for NHMUK museum ledgers. The tool gives users basic methods to navigate through a page of collections records while transcribing each major component of the record, viewing help dialogs, or skipping difficult to transcribe record entries. For help dialogs, we provide more than one example for each record element. The record outline is a movable window and, during transcription, the image and the tool location on that image is also captured as metadata, so that data managers can return quickly return to the source material for any record.
Figure 4.The simplified transcription replication and validation step. Following three independent transcriptions of a record, data is reconciled and returned to the original data provider. Records sent back to the provider can be fully complete, partially complete, of fully incomplete. Fully complete records are those where all three citizen scientist volunteers (CS) agree on every field of the record. Partial records include only those fields where CS agree. Fully incomplete records indicate that volunteers were largely unable to transcribe the record consistently. Data collected that does not become part of the final record is still made available for further review by the data provider.