| Literature DB >> 22859876 |
Gil Nelson1, Deborah Paul, Gregory Riccardi, Austin R Mast.
Abstract
This paper describes and illustrates five major clusters of related tasks (herein referred to as task clusters) that are common to efficient and effective practices in the digitization of biological specimen data and media. Examples of these clusters come from the observation of diverse digitization processes. The staff of iDigBio (The U.S. National Science Foundation's National Resource for Advancing Digitization of Biological Collections) visited active biological and paleontological collections digitization programs for the purpose of documenting and assessing current digitization practices and tools. These observations identified five task clusters that comprise the digitization process leading up to data publication: (1) pre-digitization curation and staging, (2) specimen image capture, (3) specimen image processing, (4) electronic data capture, and (5) georeferencing locality descriptions. While not all institutions are completing each of these task clusters for each specimen, these clusters describe a composite picture of digitization of biological and paleontological specimens across the programs that were observed. We describe these clusters, three workflow patterns that dominate the implemention of these clusters, and offer a set of workflow recommendations for digitization programs.Entities:
Keywords: ADBC; Biological specimen collections; biodiversity informatics; curation; digitization; iDigBio; imaging; paleontological specimen collections; task cluster; workflow
Year: 2012 PMID: 22859876 PMCID: PMC3406464 DOI: 10.3897/zookeys.209.3135
Source DB: PubMed Journal: Zookeys ISSN: 1313-2970 Impact factor: 1.546
Summary List of Collections Visited.
| Yale Peabody Museum (YPM) | >1000000 | KE EMu | Proprietary | |
| Invertebrate Zoology | 3000000 | |||
| 350000 lots | ||||
| Vascular Plants | 350000 | |||
| Global Plants Initiative | ||||
| Connecticut Plants Survey | ||||
| Harvard Museum of Compartive Zoology (MCZ) | several hundred thousand | MCZbase (Arctos) | Oracle | |
| MCZ, Entomology ( | 1 million pinned | |||
| Harvard University Herbaria (HUH) | > 5 million | Specify 6, custom | MySQL | |
| HUH, California Plants | ||||
| American Museum of Natural History (AMNH) | Division of Invertebrate Zoology | > 24000000 | Planetary Biodiversity Inventory (PBI) for Plant Bugs custom database | MySQL |
| American Museum of Natural History (AMNH) | Ornithology | > 1000000 | KE EMu Microsoft Access | Proprietary |
| New York Botanical Garden (NYBG) | > 7000000 | KE EMu | Proprietary | |
| Barnaby Legume Monographs | ||||
| Caribbean Project (ledgers & notebooks) | ||||
| Amazon Project | ||||
| Kohlmeyer Marine Fungus Collection | ||||
| University of Kansas (KU) | Biodiversity Institute, | > 4.8 million pinned | Specify 6 | MySQL |
| Botanical Research Institute of Texas (BRIT) | > 1000000 | Apiary | MySQL | |
| Valdosta State University Herbarium (VSC) | > 60,000 | Specify 6 | MySQL | |
| Tall Timbers Research Station (TTRS) | 11,000 | custom database | MySQL Microsoft Access | |
| 1200 | ||||
| 4000 | ||||
| 1000 | ||||
| Robert K. Godfrey Herbarium (FSU) | > 200,000 | custom database | MySQL |
† indicates where observers saw the actual digitization process in action.
‡ number of specimens (unless otherwise stated).
Figure 2.Specimen image capture. Fossil specimen imaging, specimen label imaging. Two very different imaging set-ups. Yale Peabody Museum, University of Kansas - Entomology.
Figure 6.Dominant Digitization Workflows Observed.
Figure 3.Custom specimen holder. Museum of Compartive Zoology (MCZ) Rhopalocera (Lepidoptera) Rapid Digitization Project.