| Literature DB >> 22371687 |
David Remsen1, Sandra Knapp, Teodor Georgiev, Pavel Stoev, Lyubomir Penev.
Abstract
The paper describes a pilot project to convert a conventional floristic checklist, written in a standard word processing program, into structured data in the Darwin Core Archive format. After peer-review and editorial acceptance, the final revised version of the checklist was converted into Darwin Core Archive by means of regular expressions and published thereafter in both human-readable form as traditional botanical publication and Darwin Core Archive data files. The data were published and indexed through the Global Biodiversity Information Facility (GBIF) Integrated Publishing Toolkit (IPT) and significant portions of the text of the paper were used to describe the metadata on IPT. After publication, the data will become available through the GBIF infrastructure and can be re-used on their own or collated with other data.Entities:
Keywords: Darwin Core Archive; Data mining; taxonomic checklists
Year: 2012 PMID: 22371687 PMCID: PMC3281575 DOI: 10.3897/phytokeys.9.2770
Source DB: PubMed Journal: PhytoKeys ISSN: 1314-2003 Impact factor: 1.635
Figure 1.A typical species record from the checklist.
Taxon table - One row per name.
| ID | A unique identifier for each name in the database including synonyms, higher taxa and the accepted name itself. This is the key element for the data interoperability. All related data refer to this identifier |
| Scientific Name | Stores the full (with authorship) scientific name of the taxon |
| Taxonomic Status | Identifies whether it is an accepted name or a synonym |
| Rank | Indicates the rank of the taxon |
| Accepted name ID | In case of synonym, this field indicates the ID of the accepted name |
| Parent ID | In case of accepted name, this field indicates the ID of its taxonomic parent |
| Nomen. Status | For synonyms containing also nomenclatural comments (e.g., “comb. Superfl.”) |
| Original Publication | Indicates the original publication where the taxon was published |
| Remarks | Stores additional remarks regarding the taxon |
Distribution table - One row per geographic region per taxon.
| ID | The ID of the taxon being referenced |
| Country | Stores Paraguay as a constant value |
| Locality | Stores the department name(s) |
Description table - One row per description per taxon.
| ID | The ID of the taxon being referenced |
| Type | The type of description being recorded (e.g., Habit, Genetic, Reproduction) |
| Description | The descriptive text itself |
Figure 2.Taxon records imported into a database.
Figure 3.An updated database with final column titles and unique identifier added for each record.
Figure 4.Data correctly aligned with columns.
Figure 5.Example of a scientific name entry.
Figure 6.Synonym records (highlighted) added and linked.