A working guide to boosted regression trees.

J Elith, J R Leathwick, T Hastie.

Abstract

1. Ecologists use statistical models for both explanation and prediction, and need techniques that are flexible enough to express typical features of their data, such as nonlinearities and interactions.

2. This study provides a working guide to boosted regression trees (BRT), an ensemble method for fitting statistical models that differs fundamentally from conventional techniques that aim to fit a single parsimonious model. Boosted regression trees combine the strengths of two algorithms: regression trees (models that relate a response to their predictors by recursive binary splits) and boosting (an adaptive method for combining many simple models to give improved predictive performance). The final BRT model can be understood as an additive regression model in which individual terms are simple trees, fitted in a forward, stagewise fashion.

3. Boosted regression trees incorporate important advantages of tree-based methods, handling different types of predictor variables and accommodating missing data. They have no need for prior data transformation or elimination of outliers, can fit complex nonlinear relationships, and automatically handle interaction effects between predictors. Fitting multiple trees in BRT overcomes the biggest drawback of single tree models: their relatively poor predictive performance. Although BRT models are complex, they can be summarized in ways that give powerful ecological insight, and their predictive performance is superior to most traditional modelling methods.

4. The unique features of BRT raise a number of practical issues in model fitting. We demonstrate the practicalities and advantages of using BRT through a distributional analysis of the short-finned eel (Anguilla australis Richardson), a native freshwater fish of New Zealand. We use a data set of over 13 000 sites to illustrate effects of several settings, and then fit and interpret a model using a subset of the data.
We provide code and a tutorial to enable the wider use of BRT by ecologists.
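
The forward, stagewise fitting described in point 2 can be illustrated with a minimal sketch. This is not the authors' code or tutorial; it is an assumption-laden toy in plain Python that uses a single predictor, squared-error loss, one-split trees ("stumps"), and synthetic data. The names `fit_stump`, `boost`, `predict`, and `mse` are hypothetical helpers introduced here for illustration only.

```python
import math
import random


def fit_stump(x, residuals):
    """Return (threshold, left_mean, right_mean) for the best single binary split."""
    order = sorted(range(len(x)), key=lambda i: x[i])
    xs = [x[i] for i in order]
    rs = [residuals[i] for i in order]
    n = len(xs)
    total = sum(rs)
    overall_mean = total / n
    # Fallback when no split is possible (all x identical): a constant "tree".
    best_score, best = -math.inf, (math.inf, overall_mean, overall_mean)
    left_sum = 0.0
    for k in range(1, n):
        left_sum += rs[k - 1]
        if xs[k] == xs[k - 1]:
            continue  # cannot split between tied values
        right_sum = total - left_sum
        # Maximising this score is equivalent to minimising squared error.
        score = left_sum ** 2 / k + right_sum ** 2 / (n - k)
        if score > best_score:
            best_score = score
            best = ((xs[k - 1] + xs[k]) / 2, left_sum / k, right_sum / (n - k))
    return best


def boost(x, y, n_trees=200, learning_rate=0.1):
    """Forward stagewise fit: each stump is fitted to the current residuals."""
    intercept = sum(y) / len(y)
    pred = [intercept] * len(y)
    stumps = []
    for _ in range(n_trees):
        residuals = [yi - pi for yi, pi in zip(y, pred)]
        thr, left, right = fit_stump(x, residuals)
        stumps.append((thr, left, right))
        # Earlier stumps are left unchanged; only a shrunken new term is added.
        pred = [p + learning_rate * (left if xi <= thr else right)
                for p, xi in zip(pred, x)]
    return intercept, learning_rate, stumps


def predict(model, xi):
    """Additive prediction: intercept plus the shrunken sum of all stumps."""
    intercept, learning_rate, stumps = model
    return intercept + learning_rate * sum(
        left if xi <= thr else right for thr, left, right in stumps
    )


def mse(model, x, y):
    """Mean squared error of the model on (x, y)."""
    return sum((yi - predict(model, xi)) ** 2 for xi, yi in zip(x, y)) / len(y)


# Synthetic nonlinear data (an assumption for illustration, not the eel data).
random.seed(0)
x = [random.uniform(0.0, 6.0) for _ in range(300)]
y = [math.sin(xi) + random.gauss(0.0, 0.2) for xi in x]

mean_only = boost(x, y, n_trees=0)
single_tree = boost(x, y, n_trees=1, learning_rate=1.0)
boosted = boost(x, y, n_trees=200, learning_rate=0.1)

mse_mean = mse(mean_only, x, y)
mse_stump = mse(single_tree, x, y)
mse_boosted = mse(boosted, x, y)
print(mse_mean, mse_stump, mse_boosted)
```

On the training data, the boosted sum of stumps fits the curve more closely than a single stump, which in turn improves on the mean alone. The learning rate shrinks each tree's contribution, so many small steps build up the fitted function gradually. Practical BRT software adds settings this sketch omits, such as deeper trees for interactions and cross-validation to choose the number of trees.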

Year:  2008        PMID: 18397250     DOI: 10.1111/j.1365-2656.2008.01390.x

Source DB:  PubMed          Journal:  J Anim Ecol        ISSN: 0021-8790            Impact factor:   5.091


  560 in total

1.  Unravelling the limits to tree height: a major role for water and nutrient trade-offs.

Authors:  Michael D Cramer
Journal:  Oecologia       Date:  2011-10-30       Impact factor: 3.225

2.  Density-regulated population dynamics and conditional dispersal alter the fate of mutations occurring at the front of an expanding population.

Authors:  T Münkemüller; M J Travis; O J Burton; K Schiffers; K Johst
Journal:  Heredity (Edinb)       Date:  2010-08-18       Impact factor: 3.821

3.  Regional variation exaggerates ecological divergence in niche models.

Authors:  William Godsoe
Journal:  Syst Biol       Date:  2010-02-26       Impact factor: 15.683

4.  Buckley-James boosting for survival analysis with high-dimensional biomarker data.

Authors:  Zhu Wang; C Y Wang
Journal:  Stat Appl Genet Mol Biol       Date:  2010-06-08

5.  Ecological modeling of the spatial distribution of wild waterbirds to identify the main areas where avian influenza viruses are circulating in the Inner Niger Delta, Mali.

Authors:  Julien Cappelle; Olivier Girard; Bouba Fofana; Nicolas Gaidet; Marius Gilbert
Journal:  Ecohealth       Date:  2010-09-24       Impact factor: 3.184

6.  The ecology of microscopic life in household dust.

Authors:  Albert Barberán; Robert R Dunn; Brian J Reich; Krishna Pacifici; Eric B Laber; Holly L Menninger; James M Morton; Jessica B Henley; Jonathan W Leff; Shelly L Miller; Noah Fierer
Journal:  Proc Biol Sci       Date:  2015-09-07       Impact factor: 5.349

7.  Fishing, fast growth and climate variability increase the risk of collapse.

Authors:  Malin L Pinsky; David Byler
Journal:  Proc Biol Sci       Date:  2015-08-22       Impact factor: 5.349

8.  GIS-based groundwater potential mapping using boosted regression tree, classification and regression tree, and random forest machine learning models in Iran.

Authors:  Seyed Amir Naghibi; Hamid Reza Pourghasemi; Barnali Dixon
Journal:  Environ Monit Assess       Date:  2015-12-19       Impact factor: 2.513

9.  Modelling spatial patterns of urban growth in Africa.

Authors:  Catherine Linard; Andrew J Tatem; Marius Gilbert
Journal:  Appl Geogr       Date:  2013-10

10.  Improving propensity score weighting using machine learning.

Authors:  Brian K Lee; Justin Lessler; Elizabeth A Stuart
Journal:  Stat Med       Date:  2010-02-10       Impact factor: 2.373
