| Literature DB >> 32551077 |
Jenny L McCune1,2, Hanna Rosner-Katz1, Joseph R Bennett1, Richard Schuster1, Heather M Kharouba3.
Abstract
Species distribution models (SDMs) are used to test ecological theory and to direct targeted surveys for species of conservation concern. Several studies have tested for an influence of species traits on the predictive accuracy of SDMs. However, most used the same set of environmental predictors for all species and/or did not use truly independent data to test SDM accuracy. We built eight SDMs for each of 24 plant species of conservation concern, varying the environmental predictors included in each SDM version. We then measured the accuracy of each SDM using independent presence and absence data to calculate area under the receiver operating characteristic curve (AUC) and true positive rate (TPR). We used generalized linear mixed models to test for a relationship between species traits and SDM accuracy, while accounting for variation in SDM performance that might be introduced by different predictor sets. All traits affected one or both SDM accuracy measures. Species with lighter seeds, animal-dispersed seeds, and a higher density of occurrences had higher AUC and TPR than other species, all else being equal. Long-lived woody species had higher AUC than herbaceous species, but lower TPR. These results support the hypothesis that the strength of species-environment correlations is affected by characteristics of species or their geographic distributions. However, because each species has multiple traits, and because AUC and TPR can be affected differently, there is no straightforward way to determine a priori which species will yield useful SDMs based on their traits. Most species yielded at least one useful SDM. Therefore, it is worthwhile to build and test SDMs for the purpose of finding new populations of plant species of conservation concern, regardless of these species' traits.Entities:
Keywords: dispersal; generalist; lifespan; niche models; range size; specialist
Year: 2020 PMID: 32551077 PMCID: PMC7297770 DOI: 10.1002/ece3.6254
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Characteristics of plant species or their geographic distributions (“traits”) hypothesized or shown to affect SDM accuracy. Starred traits were retained as candidate predictors after removing highly collinear variables
| Trait | Type | Description | Hypothesized influence | References |
|---|---|---|---|---|
| Woodiness* | Lifespan | Woody or not woody. Woody plants tend to be longer lived | Longer‐lived species have more accurate SDMs because they are more likely to persist long‐term in suitable habitat, whereas herbaceous species are more likely to respond to transient habitat features (e.g., light gaps) which are not easy to model on the scale of SDMs; woody species are more conspicuous and less likely to be overlooked, leading to more accurate presence/absence data |
Syphard & Franklin, ( Hanspach et al., ( |
| Dispersal type* | Dispersal‐related |
Mechanism for seed dispersal. None = no known mechanism (gravity) Animals = seeds dispersed by mammals or birds Wind/none = very tiny seeds that may possibly float on air currents, but may not go far in low‐wind conditions in forest understories Winged = seeds have morphological adaptations for wind dispersal | Species with adaptations for long‐distance seed dispersal are better able to reach all suitable habitat, leading to more accurate SDMs; alternatively, species with shorter dispersal distances may have more accurate SDMs due to adaptations for survival in place rather than dispersal |
Graham et al., ( Hanspach et al., ( Pulliam, ( Syphard & Franklin, ( |
| Seed weight* | Dispersal‐related | The average total weight (in grams) of 1,000 seeds/spores. Compiled mainly from the Kew Seed Information Database (Royal Botanic Gardens Kew[, | Species with lighter propagules have more accurate SDMs because they can travel farther on wind currents leading to greater ability to reach all suitable habitat (but see above) | |
| Soil type diversity* | Degree of edaphic specialization | Simpson's diversity index calculated for the soil texture type at all mapped occurrences. Soil texture categorized into 24 types according to the Soil Survey Complex of the Ontario Ministry of Agriculture (see Table | More specialized species have more accurate SDMs because their range of suitable habitat is limited and easier to differentiate compared with generalists |
Brotons et al., ( Franklin, ( Hernandez et al., ( McPherson and Jetz, ( Seoane et al., ( |
| Geological diversity | Simpson's diversity index calculated for the surficial geology type at all mapped occurrences. Surficial geology categorized into 40 types according to the Ontario Geological Survey ( | |||
| Mean latitude* | Geographic distribution | Mean latitude of all occurrences | Species with more southerly distributions have more accurate SDMs because they are restricted to a small subset of climatic conditions within the study area | Luoto et al., ( |
| Maximum range extent* | Geographic distribution | The largest distance between two occurrences (km) in the study region | Widespread species have less accurate SDMs because local adaptation that varies across the range introduces prediction error |
Hernandez et al., ( McPherson and Jetz, ( Stockwell and Peterson, ( Syphard & Franklin, ( |
| Range area | Geographic distribution | The total area of a convex hull enclosing all known occurrences in the study region (ha) | ||
| Total number of occurrences | Prevalence | The total number of occurrences known in the study area | Species with lower prevalence in the study area or sparser records within the species’ range have more accurate SDMs (but the response may be nonlinear) |
Luoto et al., ( McPherson, Jetz & Rogers,( Tessarolo et al., ( |
| Occurrence density* | Prevalence | The total number of occurrences divided by the species’ range within the study area |
Figure 1The study area (shaded) in southern Ontario. Occurrence records used to build SDMs for Panax quinquefolius (wild ginseng, circles) and Castanea dentata (American chestnut, triangles) are shown as examples. Polygons are convex hulls enclosing the extent of the records of each species in the study area
Plant species for which we built SDMs, and values for each tested trait. For explanation of traits, see Table 1. Only species for which we obtained at least 10 independent presence records are shown
| Species | Family | Woodiness | Dispersal type | Seed weight (grams/1,000 seeds) | Soil type diversity (lower = more specialized) | Mean latitude (UTM 17N) | Maximum range extent (km) | Occurrence density (occurrences per 100 square km) |
|---|---|---|---|---|---|---|---|---|
|
| Orchidaceae | Not woody | Wind/none | 0.0015 | 0.5391 | 4,823,988 | 332 | 0.08 |
|
| Araceae | Not woody | Animal | 50.1150 | 0.5919 | 4,763,055 | 354 | 0.59 |
|
| Annonaceae | Woody | Animal | 847.0000 | 0.6914 | 4,723,402 | 342 | 0.35 |
|
| Aspleniaceae | Not woody | Wind/none | 0.0001 | 0.5585 | 4,930,190 | 215 | 2.62 |
|
| Fagaceae | Woody | Animal | 3,467.3000 | 0.6024 | 4,744,171 | 340 | 2.28 |
|
| Ulmaceae | Woody | Animal | 93.6544 | 0.4772 | 4,754,197 | 532 | 0.32 |
|
| Cornaceae | Woody | Animal | 102.0000 | 0.6338 | 4,761,902 | 348 | 1.43 |
|
| Orchidaceae | Not woody | Wind/none | 0.0019 | 0.7490 | 4,964,400 | 622 | 0.12 |
|
| Ranunculaceae | Not woody | None | 2.7672 | 0.3788 | 4,755,022 | 70 | 4.29 |
|
| Apiaceae | Not woody | None | 2.2570 | 0.6397 | 4,756,502 | 323 | 0.22 |
|
| Asteraceae | Not woody | Wind/none | 0.6443 | 0.2364 | 4,769,268 | 66 | 4.27 |
|
| Gentianaceae | Not woody | None | 9.4498 | 0.4790 | 4,782,447 | 102 | 1.67 |
|
| Oleaceae | Woody | Winged | 70.1800 | 0.7258 | 4,663,494 | 322 | 0.99 |
|
| Fabaceae | Woody | None | 1843.0000 | 0.7204 | 4,658,654 | 345 | 0.85 |
|
| Saxifragaceae | Not woody | Wind/none | 0.0252 | 0.5692 | 4,638,165 | 65 | 6.81 |
|
| Ranunculaceae | Not woody | Animal | 10.9036 | 0.5668 | 4,732,533 | 328 | 0.34 |
|
| Juglandaceae | Woody | Animal | 14,026.0000 | 0.4979 | 4,955,907 | 790 | 1.92 |
|
| Orchidaceae | Not woody | Wind/none | 0.0040 | 0.4182 | 4,702,675 | 595 | 0.23 |
|
| Boraginaceae | Not woody | None | 21.7810 | 0.5518 | 4,779,621 | 189 | 0.30 |
|
| Magnoliaceae | Woody | Animal | 88.5100 | 0.6343 | 4,743,870 | 251 | 0.37 |
|
| Boraginaceae | Not woody | None | 2.9223 | 0.4152 | 4,744,859 | 257 | 0.27 |
|
| Nyssaceae | Woody | Animal | 140.0000 | 0.6526 | 4,731,319 | 346 | 0.38 |
|
| Araliaceae | Not woody | Animal | 27.7540 | 0.3724 | 4,941,323 | 701 | 0.66 |
|
| Thelypteridaceae | Not woody | Wind/none | 0.0001 | 0.5329 | 4,790,511 | 636 | 0.12 |
SDM, species distribution models.
The relative importance of species’ traits predicting (a) AUC and (b) TPR. Model terms are listed in order of decreasing influence as measured by the difference in AICc between a model without the variable and the full model
| Model | Estimate (SE) |
| dAICc | Chisq |
|
|---|---|---|---|---|---|
| (a) AUC | |||||
| Full | 12 | 0 | |||
| Occurrence density | 0.45 (0.06) | 11 | 50.3 | 52.58 | <.001 |
| Woodiness | 1.38 (0.19) | 11 | 44.3 | 46.56 | <.001 |
| Log (seed weight) | −1.06 (0.15) | 11 | 42.4 | 44.64 | <.001 |
| Log (seed weight)2 | −0.61 (0.07) | 11 | 42.4 | 44.64 | <.001 |
| Dispersal type | NA | 9 | 12.7 | 19.5 | <.001 |
| Soil type diversity | −0.10 (0.06) | 11 | 1.0 | 3.23 | .07 |
| (b) TPR | |||||
| Full | 12 | 0 | |||
| Dispersal type | NA | 9 | 329.2 | 336.0 | <.0001 |
| Woodiness | −1.27 (0.1) | 11 | 153.3 | 155.6 | <.0001 |
| Occurrence density | 0.63 (0.05) | 11 | 145.9 | 148.2 | <.0001 |
| Latitude | −0.46 (0.04) | 11 | 121.4 | 123.7 | <.0001 |
| Log (seed weight) | −0.48 (0.1) | 11 | 22.0 | 24.3 | <.0001 |
| Maximum range extent | 0.22 (0.06) | 11 | 13.2 | 15.5 | <.0001 |
| Soil type diversity | 0.10 (0.04) | 11 | 3.1 | 5.4 | .02 |
AUC, area under the receiver operating characteristic curve; TPR, true positive rate.
Figure 2Partial residual plots showing the effect of each variable in the final model on the AUC (area under the receiver operating characteristic curve) as determined by independent field surveys. Note that for each variable, all other variables are held constant at their median (or the most common category, for categorical variables)
Figure 3Partial residual plots showing the effect of each variable in the final model on the TPR (true positive rate) as determined by independent field surveys. Note that for each variable, all other variables are held constant at their median (or the most common category, for categorical variables)