Abstract
PREMISE: The standardization of plant names is a critical step in various fields of biology, including biodiversity, biogeography, and vegetation research. The WorldFlora package is introduced here to help achieve this goal by matching lists of plant names with a static copy from World Flora Online (WFO), an ongoing global effort to complete an online flora of all known vascular plants and bryophytes by 2020.
Keywords: biodiversity informatics; online flora; plant name identification; species name matching; spelling errors; taxonomic databases
Year: 2020; PMID: 33014632; PMCID: PMC7526431; DOI: 10.1002/aps3.11388
Source DB: PubMed; Journal: Appl Plant Sci; ISSN: 2168-0450; Impact factor: 1.936
Results of matching plant names from four data sets. The difference between correct and credible matches is clarified in the text.
| Plant taxa counts | GTS | CTTS | Wagner | SALVIAS |
|---|---|---|---|---|
| Taxa | 1000 | 1741 | 600 | 1000 |
| Unique direct matches | 930 | 1513 | 0 | 694 |
| Multiple direct matches | 27 | 125 | 0 | 40 |
| Unique fuzzy matches | 41 | 100 | 491 | 232 |
| Multiple fuzzy matches | 2 | 3 | 41 | 22 |
| No match | 0 | 0 | 68 | 12 |
| Correct matches | 957 | 1728 | 500 | 951 |
| Credible matches | 965 | 1740 | 568 | 975 |
CTTS = commercial timber tree species data set; GTS = Global Tree Search data set; SALVIAS = testing data set from Boyle et al. (2013); Wagner = combined testing data set from Wagner (2016).
Details about access, data manipulation, outputs, and R scripts are available in Appendices S1 (GTS data set), S2 (CTTS data set), S3 (Wagner data set), and S4 (SALVIAS data set).
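The counts in the results table can be cross-checked and turned into match rates with a few lines of arithmetic. The sketch below is only an illustration: the numbers are transcribed from the table, while the dictionary layout and the `summarize` helper are hypothetical names introduced here, not part of the WorldFlora package or its appendices.

```python
# Counts transcribed from the results table; key names are illustrative only.
results = {
    "GTS":     {"taxa": 1000, "unique_direct": 930,  "multiple_direct": 27,
                "unique_fuzzy": 41,  "multiple_fuzzy": 2,  "no_match": 0,
                "correct": 957,  "credible": 965},
    "CTTS":    {"taxa": 1741, "unique_direct": 1513, "multiple_direct": 125,
                "unique_fuzzy": 100, "multiple_fuzzy": 3,  "no_match": 0,
                "correct": 1728, "credible": 1740},
    "Wagner":  {"taxa": 600,  "unique_direct": 0,    "multiple_direct": 0,
                "unique_fuzzy": 491, "multiple_fuzzy": 41, "no_match": 68,
                "correct": 500,  "credible": 568},
    "SALVIAS": {"taxa": 1000, "unique_direct": 694,  "multiple_direct": 40,
                "unique_fuzzy": 232, "multiple_fuzzy": 22, "no_match": 12,
                "correct": 951,  "credible": 975},
}

MATCH_CATEGORIES = ("unique_direct", "multiple_direct",
                    "unique_fuzzy", "multiple_fuzzy", "no_match")


def summarize(counts):
    """Check that the match categories partition the taxa and compute rates."""
    total = sum(counts[key] for key in MATCH_CATEGORIES)
    return {
        "categories_sum_to_taxa": total == counts["taxa"],
        "correct_pct": round(100 * counts["correct"] / counts["taxa"], 1),
        "credible_pct": round(100 * counts["credible"] / counts["taxa"], 1),
    }


summary = {name: summarize(counts) for name, counts in results.items()}

for name, row in summary.items():
    print(f"{name}: correct {row['correct_pct']}%, credible {row['credible_pct']}%")
```

In every data set the five match categories add up to the number of taxa, so each submitted name falls into exactly one category. The Wagner data set, which has no direct matches, depends entirely on fuzzy matching.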
Overview of some of the WorldFlora::WFO.match arguments.
| Argument | Details |
|---|---|
| Authorship | If this variable is found in the submitted data, the result will include an “Auth.dist” column giving the Levenshtein distance (LD) between the submitted and matched author names. |
| acceptedNameUsageID.match | In the default setting, when the WFO includes an acceptedNameUsageID for the matched name (typically indicating the accepted name for a synonym), the matched details in the results will show the accepted name, and the columns “Old.status”, “Old.ID”, and “Old.name” will show those of the first match. |
| Fuzzy.max | The maximum number of names matched by the fuzzy algorithm. |
| Fuzzy.min | In the default setting (TRUE), only the matches with the smallest LD are retained. |
| Fuzzy.two | Flags (TRUE/FALSE) whether the match was obtained for only the first two words of the submitted name. |
| Fuzzy.one | Flags (TRUE/FALSE) whether the match was obtained for only the first word of the submitted name. |
| spec.name.nonumber | “Number.detected” flags (TRUE/FALSE) whether the submitted name contained a number. If that was the case, a match was searched only for the first word of the submitted name. |
| spec.name.tolower | Converts all characters of the submitted name to lowercase, except the first character. |
| spec.name.nobrackets | “Brackets.detected” flags (TRUE/FALSE) whether the submitted name contained a bracket. If that was the case, the function searched only for the part of the submitted name before the bracket. |
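The behaviour described in the table can be made concrete with a small sketch. This is a Python illustration of the ideas only, not the package's R implementation: the function names, the `max_distance` cut-off, the tiny reference list, and the choice to treat only round brackets as brackets are assumptions made for the example. It shows name cleaning in the spirit of spec.name.nobrackets, spec.name.nonumber, and spec.name.tolower, followed by retention of only the closest fuzzy matches as described for Fuzzy.min.

```python
import re


def levenshtein(a, b):
    """Edit distance counting insertions, deletions, and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def clean_name(name, tolower=False, nonumber=True, nobrackets=True):
    """Hypothetical analogue of the spec.name.* options in the table."""
    flags = {"Brackets.detected": False, "Number.detected": False}
    cleaned = name.strip()
    if nobrackets and "(" in cleaned:
        flags["Brackets.detected"] = True
        cleaned = cleaned.split("(", 1)[0].strip()  # keep the part before the bracket
    if nonumber and re.search(r"\d", cleaned):
        flags["Number.detected"] = True
        cleaned = cleaned.split()[0]                # keep only the first word
    if tolower and cleaned:
        cleaned = cleaned[0] + cleaned[1:].lower()  # first character left unchanged
    return cleaned, flags


def fuzzy_candidates(submitted, reference_names, max_distance=2, fuzzy_min=True):
    """Return (name, LD) pairs; with fuzzy_min only the smallest LD is kept.

    max_distance is an assumption of this sketch, not a package argument.
    """
    scored = [(name, levenshtein(submitted, name)) for name in reference_names]
    scored = [(name, dist) for name, dist in scored if dist <= max_distance]
    if fuzzy_min and scored:
        best = min(dist for _, dist in scored)
        scored = [(name, dist) for name, dist in scored if dist == best]
    return scored


# Made-up reference list standing in for the static WFO copy.
reference = ["Acacia nilotica", "Acacia nilotina", "Albizia lebbeck"]
cleaned, flags = clean_name("Acacia nilotca (L.) Delile")
print(cleaned, flags)
print(fuzzy_candidates(cleaned, reference))
```

With `fuzzy_min=False`, every candidate within the cut-off would be returned, which is the situation in which a cap on the number of fuzzy matches, such as Fuzzy.max, becomes relevant.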