| Literature DB >> 33177945 |
Joaquim Santos1, Paulo Rupino da Cunha2, Fátima Sales1.
Abstract
The present work is a contribution towards accelerating the digitisation process of natural history collections, usually a slow process. A two-stage process was developed at the herbarium of the University of Coimbra: (i) a new workflow was established to automatically create records in the herbarium master database with minimum information, while capturing digital images; (ii) these records are then used to populate a web-based crowdsourcing platform where citizens are involved in the transcription of specimen labels from the digital images. This approach simplifies and accelerates databasing, reduces specimen manipulation and promotes the involvement of citizens in the scientific goals of the herbarium. The novel features of this process are: (i) the validation method of the crowdsourcing contribution that ensures quality control, enabling the data to integrate the master database directly and (ii) the field-by-field integration in the master database enables immediate corrections to any record in the catalogue. Joaquim Santos, Paulo Rupino da Cunha, Fátima Sales.Entities:
Keywords: accelerating herbarium digitisation; automate databasing processes; crowdsourcing platform
Year: 2020 PMID: 33177945 PMCID: PMC7599201 DOI: 10.3897/BDJ.8.e55959
Source DB: PubMed Journal: Biodivers Data J ISSN: 1314-2828
Figure 1.Create taxon name process. This process is called by the main process (Fig. 2) if the full taxon name is not found. A taxon name can have several infraspecific ranks, which are processed using the same routine in the final loop.
Figure 2.Process to create records in the database from specimen images. If there is the need to create a taxon in the database, the "create taxon name" process is called (Fig. 1). One specimen can have multiple barcodes, which are transcribed in the image filename. Each barcode will correspond to one record in the database, with one determination.
Collaborative application: user categories and roles. Validation of submitted data considers user’s proficiency as a criterion of confidence.
|
|
|
|
|
|
| CONTRIBUTOR | Basic | First time user. Fields displayed are restricted. | 0 | 10 |
| Beginner | More fields are displayed, but some are restricted. | 10 | 20 | |
| Competent | More fields are displayed, but some are restricted. | 50 | 30 | |
| Advanced | More fields are displayed, but some are restricted. | 100 | 40 | |
| Expert | Can submit all fields. | 500 | 50 | |
| ADMINISTRATOR | Administrator | Can perform all tasks above, data management (submission approval) | - | 60 |
| ROOT | Root | Can perform all tasks above, administrator management, specimen management. | - | 60 |
Figure 3.Proposed data flow. Local database (Specify) is managed by its own software interface. Records can also be created automatically from image processing (described in Stage 1) and edited with the crowdsourcing platform. Data is made available in the online catalogue. Any data in the online catalogue can be edited using the crowdsourcing platform.