| Literature DB >> 31695869 |
Soumen Dey1, Mohan Delampady1, Arjun M Gopalaswamy1.
Abstract
A vast amount of ecological knowledge generated over the past two decades has hinged upon the ability of model selection methods to discriminate among various ecological hypotheses. The last decade has seen the rise of Bayesian hierarchical models in ecology. Consequently, commonly used tools, such as the AIC, become largely inapplicable and there appears to be no consensus about a particular model selection tool that can be universally applied. We focus on a specific class of competing Bayesian spatial capture-recapture (SCR) models and apply and evaluate some of the recommended Bayesian model selection tools: (1) Bayes Factor-using (a) Gelfand-Dey and (b) harmonic mean methods, (2) Deviance Information Criterion (DIC), (3) Watanabe-Akaike's Information Criterion (WAIC) and (4) posterior predictive loss criterion. In all, we evaluate 25 variants of model selection tools in our study. We evaluate these model selection tools from the standpoint of selecting the "true" model and parameter estimation. In all, we generate 120 simulated data sets using the true model and assess the frequency with which the true model is selected and how well the tool estimates N (population size), a parameter of much importance to ecologists. We find that when information content is low in the data, no particular model selection tool can be recommended to help realize, simultaneously, both the goals of model selection and parameter estimation. But, in general (when we consider both the objectives together), we recommend the use of our application of the Bayes Factor (Gelfand-Dey with MAP approximation) for Bayesian SCR models. Our study highlights the point that although new model selection tools are emerging (e.g., WAIC) in the applied statistics literature, those tools based on sound theory even under approximation may still perform much better.Entities:
Keywords: Bayes factors; Bayesian inference; DIC; WAIC; hierarchical models; posterior predictive loss
Year: 2019 PMID: 31695869 PMCID: PMC6822056 DOI: 10.1002/ece3.5551
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Notations of variables and parameters used in this article
| Variables and parameters | Definition |
|---|---|
|
| A bounded geographic region of scientific or operational relevance where a population of individuals of certain species reside |
|
| Population size of the superpopulation, that is, the number of individuals within |
|
|
Maximum number of individuals within the state space This is a fixed quantity defined by the investigator |
|
| Proportion of individuals that are real and present within |
|
| Probability that an individual is male |
|
| Number of trap stations in |
|
| Number of sampling occasions |
|
| Maximum permissible value of movement range for each individual during the survey |
|
| Baseline trap entry probability in the models |
|
| Baseline detection probability in the models |
|
|
|
|
| Euclidean distance between points |
|
| Probability that an individual |
|
| Probability that an individual |
|
| Probability that an individual |
Bold symbols represent collections (vectors).
Notations of latent variables and data used in this article
| Latent variables | Definition |
|---|---|
|
| Locations of the activity centers of |
|
| Location of individual |
|
| A vector of Bernoulli variables, |
|
| A vector of Bernoulli variables, |
|
| Vector of “missing” binary observations on sexes of the list of |
|
| Each |
Bold symbols represent collections (vectors).
An example of detection histories for two fully identified individuals and partially identified individuals is presented. The circled 1s indicate the simultaneous captures of an individual by the detectors 1 and 2
| Occasion trap | Detectors 1 | Occasion trap | Detectors 2 | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 1 | 2 | 3 | 4 | |||
| Fully identified individual 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 1 | 0 |
| 2 | 1 | 0 | 0 | ① | 2 | 0 | 0 | 0 | ① | |
| 3 | 0 | 0 | 1 | 1 | 3 | 1 | 0 | 0 | 0 | |
| Fully identified individual 2 | 1 | ① | 0 | 0 | 0 | 1 | ① | 1 | 0 | 0 |
| 2 | 0 | 0 | 0 | 1 | 2 | 0 | 0 | 0 | 0 | |
| 3 | 1 | 1 | 0 | 0 | 3 | 0 | 0 | 1 | 0 | |
| Partially identified individual | 1 | 1 | 0 | 0 | 1 | 1 | — | — | — | — |
| 2 | 0 | 0 | 1 | 0 | 2 | — | — | — | — | |
| 3 | 0 | 0 | 0 | 0 | 3 | — | — | — | — | |
| Partially identified individual | 1 | — | — | — | — | 1 | 0 | 0 | 1 | 0 |
| 2 | — | — | — | — | 2 | 1 | 0 | 0 | 0 | |
| 3 | — | — | — | — | 3 | 0 | 0 | 1 | 0 | |
Specification differences in the four competing models
| Model | Trap entry and detection parameter separated? | Sex‐specific |
|---|---|---|
|
| Yes | Yes |
|
| No | Yes |
|
| Yes | No |
|
| No | No |
Bayesian model selection methods used in this study
| Sl. no. | Model selection method | Variant | Approximation method | Choices of tuning density | Eq. No. |
|---|---|---|---|---|---|
| 1. | Bayes factor | Gelfand‐Dey estimator | MAP | Multivariate normal density, multivar‐ iate‐ | ( |
| 2. | Bayes factor | Gelfand‐Dey estimator | IL | ‐Do‐ | ( |
| 3. | Bayes factor | Harmonic mean estimator | — | — | ( |
| 4. | DIC |
| MAP | — | ( |
| 5. | DIC |
| MAP | — | ( |
| 6. | WAIC |
| — | — | ( |
| 7. | WAIC |
| — | — | ( |
| 8. | WAIC |
| — | — | ( |
| 9. | Posterior predictive loss | — | — | — | ( |
Parameter specifications corresponding to different simulation scenarios
| Scenario |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| 1 | 400 | 100 | 40 | 0.01 | 0.3 | 0.3 | 0.15 |
| 2 | 400 | 100 | 40 | 0.01 | 0.9 | 0.3 | 0.15 |
| 3 | 400 | 100 | 40 | 0.01 | 0.3 | 0.4 | 0.20 |
| 4 | 400 | 100 | 40 | 0.01 | 0.9 | 0.4 | 0.20 |
| 5 | 400 | 100 | 40 | 0.03 | 0.8 | 0.3 | 0.15 |
| 6 | 400 | 100 | 40 | 0.03 | 0.8 | 0.4 | 0.20 |
| 7 | 400 | 100 | 40 | 0.05 | 0.3 | 0.3 | 0.15 |
| 8 | 400 | 100 | 40 | 0.05 | 0.5 | 0.3 | 0.15 |
| 9 | 400 | 100 | 40 | 0.05 | 0.9 | 0.3 | 0.15 |
| 10 | 400 | 100 | 40 | 0.05 | 0.3 | 0.4 | 0.20 |
| 11 | 400 | 100 | 40 | 0.05 | 0.5 | 0.4 | 0.20 |
| 12 | 400 | 100 | 40 | 0.05 | 0.9 | 0.4 | 0.20 |
Figure 1Array of trap locations (denoted by “+”) within the state space (0.5) × (0.7)
Figure 2Plot (a): The proportion of times Gelfand‐Dey estimator of Bayes factor favors any particular model using the MAP approximation approach and a multivariate normal density for g. Plot (b): The proportion of times Gelfand‐Dey estimator of Bayes factor favors any particular model using the integrated likelihood approximation approach and a multivariate normal density for g. Plot (c): The proportion of times harmonic mean estimator of Bayes factor using the favors any particular model. Plots (d)–(i) correspond to WAIC1, WAIC2, WAIC3, DIC1, DIC2, and posterior predictive loss, respectively
Figure 3Plot of average RMSE estimates of N over different simulation scenarios
Contrasting the performance of model selection tools based on the intended purpose and perceived applicability with findings from our specific study. In the table, we provide answers to the following questions: (a) Does this approach select the true model? (b) Does this approach favor models providing reliable estimates of parameters (specifically for N)? (c) How difficult is the approach to implement in practice? Comments in bold draw attention to the noticeable differences between the expected performance of a tool and its performance in our particular study
| Model selection tool | Intended purpose and applicability | Reference | Findings from our specific study |
|---|---|---|---|
| Bayes factor (by Gelfand‐Dey estimator) |
(a) Yes (b) Yes (c) Difficult |
(a) Yes, very often (MAP) Yes, quite often (IL) (b) Yes, very often (MAP) Yes, quite often (IL) (c) Moderately difficult (MAP) Moderately difficult (IL) | |
| Bayes factor (by harmonic mean estimator) |
(a) (b) Yes (c) Easy | Newton and Raftery ( |
(a) (b) Yes, very often (c) Easy |
| DIC1 |
(a) No (b) (c) | Spiegelhalter et al. ( |
(a) No (b) (c) |
| DIC2 |
(a) No (b) (c) | Spiegelhalter et al. ( |
(a) No (b) (c) |
| WAIC1 |
(a) No (b) (c) Easy | Watanabe ( |
(a) No (b) (c) Easy |
| WAIC2 |
(a) (b) (c) Easy | Watanabe ( |
(a) (b) (c) Easy |
| WAIC3 |
(a) No (b) (c) Easy |
(a) No (b) (c) Easy | |
| Posterior predictive loss |
(a) (b) (c) Moderately difficult | Gelfand and Ghosh ( |
(a) (b) (c) Moderately difficult |