| Literature DB >> 28827833 |
Jana Michaelis1, Martin R Diekmann1.
Abstract
The study aimed to examine the effects of different numbers of presences and frequencies (proportions) of occurrences of species in a plot data set of forest vegetation on the species response curves and their niche attributes, based on Huisman-Olff-Fresco models (HOF). We modeled responses of 72 to 105 herbaceous forest species along a pH gradient under 14 different random sampling scenarios by varying the number of presences and absences used for model fitting. Mean niche attributes were calculated from 100 repetitive runs for each scenario and species. Re-prediction success of HOF models among the repetitive runs was highest when the total number of plots was high and the frequency of occurrences was low. With low plot numbers and high frequencies, less complicated model types (no response or monotonically increasing/decreasing responses) predominate. Measures of species niche boundaries (limits & borders) and niche width were strongly influenced by changes in sampling characteristics. With an increasing number of presences and an increasing frequency, limits and borders shifted to more extreme values, leading to wider niches. In contrast, species optima showed almost no change between the scenarios. Thus, the detected ecological response of a species often depends on the size of the data set and the relation between presences and absences of a species. In general, high data quantities are required for reliable response curve modeling with HOF models, which prevents the assessment of the responses of many rare species. To avoid undesired bias by differing sampling characteristics when comparing niches between different species or between data sets, the data basis used for model fitting should be adjusted according to the niche attribute in question, for example by keeping the frequency of the species constant.Entities:
Mesh:
Year: 2017 PMID: 28827833 PMCID: PMC5565184 DOI: 10.1371/journal.pone.0183152
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Examples of HOF models (types II-VII) showing the responses of species along a pH gradient.
A species with model type I shows no response along the gradient (b shows Chrysosplenium alternifolium). Thick vertical solid lines describe the position of the optima, thin vertical solid lines denote the upper and lower central borders. The dotted grey line corresponds to a probability of occurrence of y = 0.05, and its intersection(s) with the response curve marks the lower and / or upper limit.
Modeling set-up with 14 different combinations of presence and frequency based on random sampling.
Four different presence scenarios (number of randomly selected presences being 10, 25, 50 or 100) combined with four different frequency scenarios (by varying the number of absences) were modeled. Two combinations could not be applied due to model restrictions or data paucity.
| Frequency | ||||
|---|---|---|---|---|
| Presence | 0.068 | 0.116 | 0.5 | 0.714 |
| 10 | 10 / 138 | 10 / 76 | 10 / 10 | - |
| 25 | 25 / 345 | 25 / 190 | 25 / 25 | 25 / 10 |
| 50 | 50 / 689 | 50 / 380 | 50 / 50 | 50 / 20 |
| 100 | - | 100 / 762 | 100 / 100 | 100 / 40 |
| Presence / Absence | ||||
For each data combination (14) and each species (105/72), model fitting was repeated 100 times, resulting in a total number of 137,100 HOF models. From the 100 repetitions, mean niche parameters, a model stability index and the probability of getting a certain niche parameter were calculated for every species—data combination. Moreover, differences in model choice (which model types were chosen) were evaluated visually.
Fig 2Frequency distribution of model types chosen in the different Pre:Fre scenarios.
Dps gives the number of overall data points used for model fitting.
Fig 3Spearman-correlation matrices for all Pre:Fre scenarios.
Given are: a) model type (chosen most often from 100 repetitive model fittings for each species), b) optimum, c) lower limit and d) upper limits. Central numbers show the correlation coefficients. In a) axes are sorted based on the number of data points used for fitting. For b)–d) axes are sorted by frequency and presence numbers.
Fig 4Results of the linear mixed model identifying the trends of niche parameters.
Results for a) Index of Qualitative Variation (IQV), b) optimumany, c) LowLimany and d) UppLimany along the frequency gradient for all four Pre scenarios are presented. Colored lines show the regressions for every single species, whereas the black line reflects the mean across species (the population trend).