A Mayr1, H Binder, O Gefeller, M Schmid. 1. Andreas Mayr, Institut für Medizininformatik, Biometrie und Epidemiologie, Friedrich-Alexander-Universität Erlangen- Nürnberg (FAU), Waldstr. 6, 91054 Erlangen, Germany, E-mail: andreas.mayr@fau.de.
Abstract
BACKGROUND: The concept of boosting emerged from the field of machine learning. The basic idea is to boost the accuracy of a weak classifying tool by combining various instances into a more accurate prediction. This general concept was later adapted to the field of statistical modelling. Nowadays, boosting algorithms are often applied to estimate and select predictor effects in statistical regression models. OBJECTIVES: This review article attempts to highlight the evolution of boosting algorithms from machine learning to statistical modelling. METHODS: We describe the AdaBoost algorithm for classification as well as the two most prominent statistical boosting approaches, gradient boosting and likelihood-based boosting for statistical modelling. We highlight the methodological background and present the most common software implementations. RESULTS: Although gradient boosting and likelihood-based boosting are typically treated separately in the literature, they share the same methodological roots and follow the same fundamental concepts. Compared to the initial machine learning algorithms, which must be seen as black-box prediction schemes, they result in statistical models with a straight-forward interpretation. CONCLUSIONS: Statistical boosting algorithms have gained substantial interest during the last decade and offer a variety of options to address important research questions in modern biomedicine.
BACKGROUND: The concept of boosting emerged from the field of machine learning. The basic idea is to boost the accuracy of a weak classifying tool by combining various instances into a more accurate prediction. This general concept was later adapted to the field of statistical modelling. Nowadays, boosting algorithms are often applied to estimate and select predictor effects in statistical regression models. OBJECTIVES: This review article attempts to highlight the evolution of boosting algorithms from machine learning to statistical modelling. METHODS: We describe the AdaBoost algorithm for classification as well as the two most prominent statistical boosting approaches, gradient boosting and likelihood-based boosting for statistical modelling. We highlight the methodological background and present the most common software implementations. RESULTS: Although gradient boosting and likelihood-based boosting are typically treated separately in the literature, they share the same methodological roots and follow the same fundamental concepts. Compared to the initial machine learning algorithms, which must be seen as black-box prediction schemes, they result in statistical models with a straight-forward interpretation. CONCLUSIONS: Statistical boosting algorithms have gained substantial interest during the last decade and offer a variety of options to address important research questions in modern biomedicine.
Authors: Damini Dey; Piotr J Slomka; Paul Leeson; Dorin Comaniciu; Sirish Shrestha; Partho P Sengupta; Thomas H Marwick Journal: J Am Coll Cardiol Date: 2019-03-26 Impact factor: 24.094
Authors: Paulino José García Nieto; Esperanza García-Gonzalo; Fernando Sánchez Lasheras; José Ramón Alonso Fernández; Cristina Díaz Muñiz; Francisco Javier de Cos Juez Journal: Environ Sci Pollut Res Int Date: 2018-05-30 Impact factor: 4.223
Authors: Stefanie Hieke; Axel Benner; Richard F Schlenk; Martin Schumacher; Lars Bullinger; Harald Binder Journal: PLoS One Date: 2016-05-09 Impact factor: 3.240
Authors: Andreas Mayr; Benjamin Hofner; Elisabeth Waldmann; Tobias Hepp; Sebastian Meyer; Olaf Gefeller Journal: Comput Math Methods Med Date: 2017-08-02 Impact factor: 2.238