| Literature DB >> 29988441 |
Rebecca Fisher1,2, Shaun K Wilson2,3, Tsai M Sin4, Ai C Lee4, Tim J Langlois2.
Abstract
Full-subsets information theoretic approaches are becoming an increasingly popular tool for exploring predictive power and variable importance where a wide range of candidate predictors are being considered. Here, we describe a simple function in the statistical programming language R that can be used to construct, fit, and compare a complete model set of possible ecological or environmental predictors, given a response variable of interest and a starting generalized additive (mixed) model fit. Main advantages include not requiring a complete model to be fit as the starting point for candidate model set construction (meaning that a greater number of predictors can potentially be explored than might be available through functions such as dredge); model sets that include interactions between factors and continuous nonlinear predictors; and automatic removal of models with correlated predictors (based on a user defined criterion for exclusion). The function takes continuous predictors, which are fitted using smoothers via either gam, gamm (mgcv) or gamm4, as well as factor variables which are included on their own or as two-level interaction terms within the gam smooth (via use of the "by" argument), or with themselves. The function allows any model to be constructed and used as a null model, and takes a range of arguments that allow control over the model set being constructed, including specifying cyclic and linear continuous predictors, specification of the smoothing algorithm used, and the maximum complexity allowed for smooth terms. The use of the function is demonstrated via case studies that highlight how appropriate model sets can be easily constructed and the broader utility of the approach for exploratory ecology.Entities:
Keywords: collinearity; complete‐subsets modeling; gam; generalized additive models; information theoretic approaches; multimodel inference; multiple regression
Year: 2018 PMID: 29988441 PMCID: PMC6024142 DOI: 10.1002/ece3.4134
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Variable importance scores from a full‐subsets analyses exploring the influence of habitat variables and management zoning on the abundance and biomass of four functional fish feeding guilds (see Appendix S3). Habitat variables included a visual assessment of complexity (complexity); the square root of rugosity (sqrt.rug), cover of low complexity (sqrtLC), and high complexity (sqrtHC) corals and macroalgae cover (sqrtMacro); the first and second axis scores from a principle components analysis (SCORE1 and SCORE2); and management zone (ZONE)
Figure 2Variable importance scores from full‐subsets analyses of the abundance of Dosinia subrosea, Myadora striata, and Pagurus novizelandiae, with variables within the most parsimonious model for each taxon indicated (X, see Table A4.2 in Appendix S4)
Figure 3Gonadosomatic index (GSI) as a function of lunar date (left‐hand plots) and month of the year (right‐hand plots) for Monodonta labio (upper) and Patelloida saccharina (lower), with colors indicating sex (male and female). Fitted gam curves (solid lines) and 95% confidence bands (dashed lines) are also shown