| Literature DB >> 29846899 |
Paulino José García Nieto1, Esperanza García-Gonzalo2, Fernando Sánchez Lasheras3, José Ramón Alonso Fernández4, Cristina Díaz Muñiz4, Francisco Javier de Cos Juez5.
Abstract
Cyanotoxins are a type of cyanobacteria that is poisonous and poses a health threat in waters that could be used for drinking or recreational purposes. Thus, it is necessary to predict their presence to avoid risks. This paper presents a nonparametric machine learning approach using a gradient boosted regression tree model (GBRT) for prediction of cyanotoxin contents from cyanobacterial concentrations determined experimentally in a reservoir located in the north of Spain. GBRT models seek and obtain good predictions in highly nonlinear problems, like the one treated here, where the studied variable presents low concentrations of cyanotoxins mixed with high concentration peaks. Two types of results have been obtained: firstly, the model allows the ranking or the dependent variables according to its importance in the model. Finally, the high performance and the simplicity of the model make the gradient boosted tree method attractive compared to conventional forecasting techniques.Entities:
Keywords: Cyanobacteria; Cyanotoxins; Gradient boosting; Harmful algal blooms (HABs); Regression trees; Statistical machine learning techniques
Mesh:
Substances:
Year: 2018 PMID: 29846899 DOI: 10.1007/s11356-018-2219-4
Source DB: PubMed Journal: Environ Sci Pollut Res Int ISSN: 0944-1344 Impact factor: 4.223