| Literature DB >> 30455216 |
Eimear Nic Lughadha1, Barnaby E Walker2, Cátia Canteiro2, Helen Chadburn2, Aaron P Davis2, Serene Hargreaves2, Eve J Lucas2, André Schuiteman2, Emma Williams2, Steven P Bachman2, David Baines2,3, Amy Barker2, Andrew P Budden2, Julia Carretero2, James J Clarkson2, Alexandra Roberts2, Malin C Rivers4.
Abstract
Herbarium specimens provide verifiable and citable evidence of the occurrence of particular plants at particular points in space and time, and are vital resources for assessing extinction risk in the tropics, where plant diversity and threats to plants are greatest. We reviewed approaches to assessing extinction risk in response to the Convention on Biological Diversity's Global Strategy for Plant Conservation Target 2: an assessment of the conservation status of all known plant species by 2020. We tested five alternative approaches, using herbarium-derived data for trees, shrubs and herbs in five different plant groups from temperate and tropical regions. All species were previously fully assessed for the IUCN Red List. We found significant variation in the accuracy with which different approaches classified species as threatened or not threatened. Accuracy was highest for the machine learning model (90%) but the least data-intensive approach also performed well (82%). Despite concerns about spatial, temporal and taxonomic biases and uncertainties in herbarium data, when specimens represent the best available evidence for particular species, their use as a basis for extinction risk assessment is appropriate, necessary and urgent. Resourcing herbaria to maintain, increase and disseminate their specimen data is essential to guide and focus conservation action.This article is part of the theme issue 'Biological collections for understanding biodiversity in the Anthropocene'.Entities:
Keywords: IUCN Red List; conservation assessment; digitization; extent of occurrence; machine learning; natural history collections
Mesh:
Year: 2018 PMID: 30455216 PMCID: PMC6282085 DOI: 10.1098/rstb.2017.0402
Source DB: PubMed Journal: Philos Trans R Soc Lond B Biol Sci ISSN: 0962-8436 Impact factor: 6.237
Qualitative comparison of the alternative approaches to identifying threatened species for which quantitative comparisons are a focus of this study.
| method/approach | general purpose as stated by authors | more detailed statement of purpose | accuracy (or other metric) reported by authors | data required | skills required | most appropriate metric for evaluating success |
|---|---|---|---|---|---|---|
| ConR [ | Enable practitioners to conduct preliminary assessments for large numbers of species efficiently | Generate preliminary conservation assessments based on IUCN criterion B that are both reliable and informative | 71% of African palm species correctly classified as threatened or not threatened | georeferenced specimen coordinates | basic data handling and knowledge of R | accuracy |
| Random Forests [ | Predict the conservation status of Data Deficient species | Determine the true conservation status of DD species without the need for focused field surveys | 90% accuracy with 94% sensitivity for a model trained on a global dataset of terrestrial mammals | georeferenced specimen coordinates | geospatial analysis and statistical modelling, knowledge of a programming language | accuracy, sensitivity |
| rCAT [ | To calculate EOO and AOO for use in criterion B assessments | Allow practitioners to generate preliminary conservation assessments based on part of IUCN criterion B | unreported | georeferenced specimen coordinates | basic data handling and knowledge of R | accuracy |
| Specimen Count [ | Construct a sample of herbarium material as rich as possible in species of conservation concern | Originally proposed as a way of concentrating specimen digitization effort on species most likely to be of conservation concern | 82%a sensitivity for a threshold of 10 specimens, tested on endemic legumes of Madagascar | number of herbarium specimens known for a species | basic data handling | accuracy, sensitivity |
| US Method [ | Provide a preliminary assessment of the conservation status of plants using data from herbarium specimens | To quickly and accurately identify those species that are | 95.1% sensitivity based on plant species of Hawaii, used for calibration; reported numbers correspond to an accuracy between 52.5% and 96.3% | basic specimen information—number of specimens and collection year and/or locality if available | basic data handling and statistical calculations e.g. calculate median | sensitivity |
aSensitivity reported is for species of conservation concern, including Near Threatened species as well as Threatened species.
Groups included in our dataset, with number of species and their distribution.
| group | species (specimens) included | total known species | distribution |
|---|---|---|---|
| Coffea | 105 (4352) | 124 | global (confined to Old World) |
| Legumes | 837 (166532) | 22347 | global |
| Myrcia (sect. | 97 (3239) | 124 | Neotropical |
| MadPalms | 176 (1997) | 203 | Madagascar |
| OrchidsNG | 96 (1001) | 3136 | New Guinea endemics |
| total | 1311 |
List of predictors used for each of the compared approaches.
| threat assessment approach | |||||||
|---|---|---|---|---|---|---|---|
| predictor | rCAT | ConR | US | Specimen Count | Random Forests | short name | type |
| collection year | ✓ | — | collection-related | ||||
| locality | ✓ | ✓ | ✓ | — | collection-related | ||
| number of specimens | ✓ | ✓ | — | collection-related | |||
| genus | ✓ | genus | taxonomic | ||||
| family | ✓ | family | taxonomic | ||||
| order | ✓ | order | taxonomic | ||||
| number of habitats | ✓ | n_habitats | geographical | ||||
| biogeographic realm | ✓ | realm_value | geographical | ||||
| extent of occurrence (EOO) | ✓ | range_eoo | geographical | ||||
| maximum elevation | ✓ | elevation_max | geographical | ||||
| minimum elevation | ✓ | elevation_min | geographical | ||||
| latitude of range centroid | ✓ | latitude_centroid | geographical | ||||
| mean annual temperature | ✓ | av_temp | climatic | ||||
| mean temperature seasonality | ✓ | season_temp | climatic | ||||
| mean annual precipitation | ✓ | av_precip | climatic | ||||
| mean precipitation seasonality | ✓ | season_precip | climatic | ||||
| external threat index | ✓ | eti | threat-related | ||||
| mean GDP | ✓ | mean_gdp | threat-related | ||||
| mean human population density | ✓ | mean_hpd | threat-related | ||||
| minimum human population density | ✓ | min_hpd | threat-related | ||||
| mean human footprint | ✓ | mean_hfi | threat-related | ||||
Figure 1.Comparison of each approach (a) on the whole dataset by accuracy (correct prediction of threat status), sensitivity (correct prediction of species as threatened), and specificity (correct prediction of species as not threatened) and (b) on each group by accuracy. Stars indicate a significant difference from the default accuracy.
Summary of results for each method applied to the test sets overall and to each plant group. Italicized accuracies are significantly better than the default accuracy. Superscript letters indicate significantly better performance than Random Forests (RF), rCAT (RC), ConR (CO), Specimen Count (SC), or the US Method (US).
| group | number of species | accuracy/% | default accuracy/% | sensitivity | specificity | |
|---|---|---|---|---|---|---|
| Random Forests | all | 326 | 72 | 0.85SC | 0.91CO,SC,US | |
| Coffea | 32 | 75 | 62 | 0.80 | 0.67 | |
| Legumes | 204 | 94 | 93 | 0.64 | 0.96 | |
| MadPalms | 52 | 88 | 85 | 0.98 | 0.38 | |
| Myrcia | 19 | 74 | 63 | 0.67 | 0.86 | |
| OrchidsNG | 19 | 84 | 89 | 1.00 | 0.82 | |
| rCAT | all | 1311 | 72 | 0.83SC | 0.92CO,SC,US | |
| Coffea | 105 | 68 | 0.86 | 0.88 | ||
| Legumes | 837 | 89 | 0.78 | 0.94 | ||
| MadPalms | 176 | 84 | 83 | 0.85 | 0.8 | |
| Myrcia | 97 | 62 | 0.7 | 0.88 | ||
| OrchidsNG | 96 | 82 | 76 | 0.96 | 0.78 | |
| ConR | all | 1303 | 72 | 0.87SC | 0.80US | |
| Coffea | 105 | 68 | 0.94 | 0.91 | ||
| Legumes | 829 | 83 | 89 | 0.73 | 0.85 | |
| MadPalms | 176 | 83 | 0.92 | 0.83 | ||
| Myrcia | 97 | 67 | 62 | 0.76 | 0.62 | |
| OrchidsNG | 96 | 51 | 76 | 1.00 | 0.36 | |
| Specimen Count | all | 1311 | 72 | 0.69 | 0.87CO,US | |
| Coffea | 105 | 73 | 68 | 0.62 | 0.97 | |
| Legumes | 837 | 87 | 89 | 0.45 | 0.93 | |
| MadPalms | 176 | 84 | 83 | 0.84 | 0.87 | |
| Myrcia | 97 | 65 | 62 | 0.65 | 0.65 | |
| OrchidsNG | 96 | 53 | 76 | 1.00 | 0.38 | |
| US Method | all | 1311 | 72 | 0.86SC | 0.74 | |
| Coffea | 105 | 68 | 0.73 | 0.88 | ||
| Legumes | 837 | 78 | 89 | 0.75 | 0.78 | |
| MadPalms | 176 | 83 | 0.99 | 0.53 | ||
| Myrcia | 97 | 64 | 62 | 0.78 | 0.55 | |
| OrchidsNG | 96 | 58 | 76 | 1.00 | 0.45 |
Figure 2.Random Forest classification results: predictor importance measured as mean decrease in accuracy by permutation for (a) all predictors and (b) the five most important predictors for each group, accompanied by (c) the probability of threat by IUCN Red List category predicted by the Random Forest classifier, with the dashed line showing the threshold for classification as threatened. (See table 3 for abbreviations).
Figure 3.Diagram showing the proportions of species following different pathways towards being classified as potentially threatened or not threatened at each step in the US Method for all plant groups in our study.