Ed Baker, Simon Rycroft, Vincent S Smith.
Abstract
We describe an implementation of the Darwin Core Archive (DwC-A) standard that allows for the exchange of biodiversity information contained within the Scratchpads virtual research environment with external collaborators. Using this single archive file, Scratchpad users can expose taxonomies, specimen records, species descriptions and a range of other data to a variety of third-party aggregators and tools (currently Encyclopedia of Life, eMonocot Portal, CartoDB, and the Common Data Model) for secondary use. This paper describes our technical approach to dynamically building and validating Darwin Core Archives for the 600+ Scratchpad user communities; these archives can be used to serve the diverse data needs of all of our content partners.
Keywords: Darwin Core; Darwin Core Archive; Scratchpads; biodiversity; biodiversity informatics; taxonomy; virtual research environment
Year: 2014 PMID: 24723785 PMCID: PMC3964728 DOI: 10.3897/BDJ.2.e1039
Source DB: PubMed Journal: Biodivers Data J ISSN: 1314-2828
Figure 1. The star schema showing the relation of Darwin Core Archive extension files to the core file.
Figure 2. Data flow from Scratchpad to Darwin Core Archive. The dwca_export module (orange) defines a number of custom Drupal Views (queries) that collect the data required for archive generation from the Scratchpad (blue) and combines them with the meta.xml file, which describes the information in the archive. For eMonocot Scratchpads, the emonocot_dwca module (green) provides an intermediary function that replaces the Scratchpads internal unique identifiers with those used throughout the eMonocot project (see eMonocot modifications section).
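The star schema in Figure 1 implies that every row in an extension file points back to a row in the core file through a shared taxon identifier. A minimal sketch of that check follows, assuming tab-delimited files with a header row; the column name `taxonID` and the sample data are hypothetical and are not taken from the dwca_export module.

```python
import csv
import io


def orphaned_rows(core_text, extension_text, id_column="taxonID"):
    """Return extension rows whose identifier is missing from the core file.

    Both arguments are the text of tab-delimited files with a header row.
    The column name is an assumption made for this illustration.
    """
    core_reader = csv.DictReader(io.StringIO(core_text), delimiter="\t")
    core_ids = {row[id_column] for row in core_reader}

    extension_reader = csv.DictReader(io.StringIO(extension_text), delimiter="\t")
    return [row for row in extension_reader if row[id_column] not in core_ids]


# Hypothetical core file (classification) and one extension file.
core = "taxonID\tscientificName\nt1\tAus bus\nt2\tAus cus\n"
vernacular = "taxonID\tvernacularName\nt1\tcommon bus\nt9\tunknown\n"

for row in orphaned_rows(core, vernacular):
    print("No core record for", row["taxonID"])
```

An empty result means every extension row can be joined to the core, which is the property the star schema relies on.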
Mapping from the Scratchpads taxonomy model to the GBIF Darwin Core taxonomy model
| Scratchpads: Usage | Scratchpads: Unacceptability Reason | DwC: taxonomicStatus | DwC: nomenclaturalStatus |
|---|---|---|---|
| accepted/valid | accepted | ||
| valid | valid | ||
| not accepted / invalid | -None- | ||
| not accepted / invalid | synonym | synonym | |
| not accepted / invalid | homotypic (nomenclatural) synonym | homotypicSynonym | |
| not accepted / invalid | heterotypic (taxonomic) synonym | heterotypicSynonym | |
| not accepted / invalid | homonym (illegitimate) | heterotypicSynonym | illegitimum |
| not accepted / invalid | superfluous renaming (illegitimate) | homotypicSynonym | superfluum |
| not accepted / invalid | rejected name | synonym | rejiciendum |
| not accepted / invalid | invalidly published, nomen nudum | synonym | nudum |
| not accepted / invalid | invalidly published, other | synonym | invalidum |
| not accepted / invalid | misapplied | misapplied | |
| not accepted / invalid | pro parte | proParteSynonym | |
| not accepted / invalid | horticultural | ||
| not accepted / invalid | database artifact | ||
| not accepted / invalid | orthographic Variant (misspelling) | synonym | orthographia |
| not accepted / invalid | other | ||
| not accepted / invalid | junior synonym | ||
| not accepted / invalid | objective synonym | ||
| not accepted / invalid | subjective synonym | ||
| not accepted / invalid | original name/combination | ||
| not accepted / invalid | subsequent name/combination | | combinatio |
| not accepted / invalid | junior homonym | synonym | illegitimum |
| not accepted / invalid | homonym & junior synonym | synonym | |
| not accepted / invalid | unavailable, database artifact | ||
| not accepted / invalid | unavailable, literature misspelling | synonym | orthographia |
| not accepted / invalid | unavailable, incorrect original spelling | | negatum |
| not accepted / invalid | unavailable, suppressed by ruling | | oppressa |
| not accepted / invalid | unavailable, nomen nudum | synonym | nudum |
| not accepted / invalid | unavailable, other | ||
| not accepted / invalid | unjustified emendation | ||
| not accepted / invalid | unnecessary replacement | synonym | superfluum |
| not accepted / invalid | nomen oblitum | ||
| not accepted / invalid | misapplied | ||
| not accepted / invalid | pro parte | proParteSynonym | |
| not accepted / invalid | nomen dubium | synonym | dubium |
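
One way to apply a mapping like the one in the table above is a plain lookup keyed on the Scratchpads unacceptability reason. The sketch below copies a handful of rows from the table; the names `DwcStatus`, `REASON_TO_DWC` and `map_reason` are hypothetical, and returning empty values for unlisted reasons is an assumption of this sketch rather than documented behaviour.

```python
from typing import NamedTuple, Optional


class DwcStatus(NamedTuple):
    """Pair of Darwin Core values produced for one Scratchpads name."""
    taxonomic_status: Optional[str]
    nomenclatural_status: Optional[str]


# A subset of rows copied from the mapping table (not the full mapping).
REASON_TO_DWC = {
    "synonym": DwcStatus("synonym", None),
    "homotypic (nomenclatural) synonym": DwcStatus("homotypicSynonym", None),
    "heterotypic (taxonomic) synonym": DwcStatus("heterotypicSynonym", None),
    "homonym (illegitimate)": DwcStatus("heterotypicSynonym", "illegitimum"),
    "rejected name": DwcStatus("synonym", "rejiciendum"),
    "nomen dubium": DwcStatus("synonym", "dubium"),
}


def map_reason(reason):
    """Look up the Darwin Core statuses for an unacceptability reason.

    Reasons without an entry yield empty values (an assumption here).
    """
    return REASON_TO_DWC.get(reason, DwcStatus(None, None))


status = map_reason("rejected name")
print(status.taxonomic_status, status.nomenclatural_status)
```

Keeping the mapping in a single table-like structure makes it straightforward to compare against the published table row by row.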
The fields used in our classification.txt – the core of our DwC-A star schema.
| Field | Description | Term |
|---|---|---|
| Taxon ID | A universally unique identifier (UUID) of this name (World Checklist of Monocots [WCM] identifier for eMonocot Scratchpads) | |
| Taxon Name | The taxon name – made by concatenating the unit names and unit indicators (Scratchpads stores all parts of the scientific name, and indicators such as sp. & spp., in separate fields) | |
| Taxonomic Status | See Table | |
| Taxonomic Rank | e.g. species, genus, family | |
| Taxon Author(s) | Plain text names of the author(s) of this taxon | |
| Reference | Citation of the reference containing the description of this taxon | |
| Reference ID | URL to the reference containing the description of this taxon within the Scratchpad | |
| Taxonomic Parent | The parent of this name in the classification, if this name is accepted. | |
| Nomenclatural Status | See Table | |
| Accepted Name | The UUID of the associated accepted name, if this name is not accepted |
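
Two of the rules in the table above lend themselves to a short illustration: the Taxon Name is assembled from separately stored parts, and a row carries either a Taxonomic Parent (accepted names) or an Accepted Name (names that are not accepted). The sketch below is illustrative only; the function names, the dictionary keys and the assumption that parts arrive already in display order are all hypothetical.

```python
def build_taxon_name(parts):
    """Join the non-empty name parts, in the order given, with single spaces.

    `parts` is assumed to hold unit names and unit indicators already
    arranged in display order; empty or blank parts are skipped.
    """
    return " ".join(part.strip() for part in parts if part and part.strip())


def classification_links(is_accepted, parent_id, accepted_id):
    """Fill Taxonomic Parent for accepted names, Accepted Name otherwise."""
    if is_accepted:
        return {"taxonomic_parent": parent_id, "accepted_name": ""}
    return {"taxonomic_parent": "", "accepted_name": accepted_id}


# Hypothetical name stored as separate parts, with one part left empty.
name = build_taxon_name(["Aus", "", "bus", "subsp.", "cus"])
links = classification_links(False, parent_id="uuid-parent", accepted_id="uuid-accepted")
print(name, links)
```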
Figure 3. A single Taxon Description node on a Scratchpad corresponds to one or more rows in the description.txt file.
Mapping schema of Scratchpads textual descriptions of taxa to the GBIF (as used by eMonocot) and EoL extensions in the Darwin Core Archive.
| Field | Description | GBIF term | EoL term |
|---|---|---|---|
| Taxon ID | The Scratchpads universally unique identifier for the taxonomic name (WCM identifier for eMonocot Scratchpads) | ||
| Type | The type of the textual description: "general", "ecology", "behaviour", etc. | ||
| Text | The textual description. E.g. if Type is "general" this field will contain a general description of the taxon | ||
| Rights | Textual description of the rights associated with this content, e.g. "All Rights Reserved" | ||
| AccessURI | The URL of the Scratchpad node containing this description | ||
| Source | Scratchpad URL of the bibliographic reference this description is from (only applies to eMonocot sites) | ||
| Copyright Owner | The copyright owner of the description | ||
| Language | The language of the description | ||
| CV Term | Image keywords | ||
| Format | URL of the SPM type of the description, e.g. | ||
| Type | MIME type of the textual content, generally: text/html | ||
| agentID | UUID of the author who contributed the content to the site | ||
| License | URL of the license used to release the content, if any | ||
| Identifier | A unique identifier for this particular textual description. Formed by concatenating the universally unique identifier of the description node, the # character, and the field name. |
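
As Figure 3 notes, one Taxon Description node can yield several rows, and the Identifier row above describes how each row is named: node UUID, the # character, then the field name. A minimal sketch of that expansion follows; the function name, the dictionary keys and the choice to skip empty fields are assumptions made for illustration.

```python
def description_rows(node_uuid, taxon_id, fields):
    """Expand one description node into one row per populated field.

    `fields` maps a field name (e.g. "general", "ecology") to its text.
    Empty fields are skipped, which is an assumption of this sketch.
    """
    rows = []
    for field_name, text in fields.items():
        if not text:
            continue
        rows.append({
            # Identifier: node UUID + "#" + field name, as described above.
            "identifier": f"{node_uuid}#{field_name}",
            "taxon_id": taxon_id,
            "type": field_name,
            "text": text,
        })
    return rows


# Hypothetical node with two populated fields and one empty field.
rows = description_rows(
    "node-uuid-1",
    "taxon-uuid-1",
    {"general": "A small beetle.", "ecology": "", "behaviour": "Nocturnal."},
)
for row in rows:
    print(row["identifier"])
```

Because the field name is part of the identifier, rows derived from the same node stay distinct while still tracing back to that node.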
eol_agents
| Field | Description | Term |
|---|---|---|
| User ID | The universally unique identifier of the user | |
| Family Name | The user's last/family name | |
| First Names | The user's given/first names | |
| Full Name | Full name of the user – concatenation, with a space, of the two fields above | |
| Organisation | The organisation the user works for, if any | |
| Username | The user's username on this Scratchpad |
Structure of comments.txt – the non-standard extension for synchronising comments between a Scratchpad and the eMonocot portal
| Field | Description | Term |
|---|---|---|
| CommentID | URL of the comment | |
| Target | URL of the node the comment was made on | |
| Title | Title of the comment | |
| Body | The comment itself | |
| Created | Date and time the comment was created | |
| Modified | Date and time the comment was last edited |
references
| Field | Description | GBIF / PURL Term | EoL Term |
|---|---|---|---|
| Taxon ID | UUID of taxa in the publication | ||
| Identifier | UUID of the reference in the Scratchpad site | ||
| DOI | Digital Object Identifier | ||
| ISBN | International Standard Book Number | ||
| ISSN | International Standard Serial Number | ||
| Citation | Plain text citation of the work | ||
| Title | |||
| Node URL | |||
| Language | |||
| Indicates if the publication is the original description of a taxon | |||
| Publication Date | |||
| Created Date | Date the reference was added to the Scratchpad | ||
| Modified Date | Date the reference was last modified in the Scratchpad | ||
specimens.txt
| Field | Description | Term |
|---|---|---|
| Taxon ID | ||
| Type Status | e.g. Holotype | |
| Institution Code | e.g. BMNH for Natural History Museum, London | |
| Collection Code | e.g. E for Entomology | |
| Catalogue Number | Unique specimen identifier | |
| Latitude | Decimal latitude | |
| Longitude | Decimal longitude |
vernacular_names.txt
| Field | Description | Term |
|---|---|---|
| Taxon ID | ||
| Vernacular Name | The vernacular (common) name | |
| Language | The language of the vernacular name | |
| Locality | Where the vernacular name is used | |
| Remarks | Other information |