Literature DB >> 29730758

Application of feature selection and regression models for chlorophyll-a prediction in a shallow lake.

Xue Li1, Jian Sha1, Zhong-Liang Wang2.   

Abstract

As a representative index of the algal bloom, the concentration of chlorophyll-a (Chl-a) is a key parameter of concern for environmental managers. The relationships between environmental variables and Chl-a are complex and difficult to establish. Two machine learning methods, including support vector machine for regression (SVR) and random forest (RF), were used in this study to predict Chl-a concentration based on multiple variables. To improve the model accuracy and reduce the input number, two feature selection methods, including minimum redundancy and maximum relevance method (mRMR) and RF, were integrated with regression models. The results showed that the RF model had a higher predictive ability than the SVR model. Furthermore, the less computational time cost and unnecessary prior data transformation also indicated a better applicability of the RF model. The comparison between ensemble models of mRMR-RF and RF-RF showed that the RF-RF yielded a better performance with fewer variables. Seven variables selected from the candidate predictors could interpret most information, and their potential implications to Chl-a were discussed based on the level of importance. Overall, the RF-RF ensemble model can be considered as a useful approach to determine the significant stressors and achieve satisfactory prediction of Chl-a concentration.

Entities:  

Keywords:  Feature selection; Minimum redundancy and maximum relevance; Random forest; Support vector machine

Mesh:

Substances:

Year:  2018        PMID: 29730758     DOI: 10.1007/s11356-018-2147-3

Source DB:  PubMed          Journal:  Environ Sci Pollut Res Int        ISSN: 0944-1344            Impact factor:   4.223


  8 in total

1.  Data integration and network reconstruction with ~omics data using Random Forest regression in potato.

Authors:  Animesh Acharjee; Bjorn Kloosterman; Ric C H de Vos; Jeroen S Werij; Christian W B Bachem; Richard G F Visser; Chris Maliepaard
Journal:  Anal Chim Acta       Date:  2011-04-13       Impact factor: 6.558

2.  Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy.

Authors:  Hanchuan Peng; Fuhui Long; Chris Ding
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2005-08       Impact factor: 6.226

3.  Climate change: links to global expansion of harmful cyanobacteria.

Authors:  Hans W Paerl; Valerie J Paul
Journal:  Water Res       Date:  2011-08-18       Impact factor: 11.236

4.  Relationships among nutrients, chlorophyll-a, and dissolved oxygen in agricultural streams in Illinois.

Authors:  Allyson M Morgan; Todd V Royer; Mark B David; Lowell E Gentry
Journal:  J Environ Qual       Date:  2006-05-31       Impact factor: 2.751

5.  Prediction of active sites of enzymes by maximum relevance minimum redundancy (mRMR) feature selection.

Authors:  Yu-Fei Gao; Bi-Qing Li; Yu-Dong Cai; Kai-Yan Feng; Zhan-Dong Li; Yang Jiang
Journal:  Mol Biosyst       Date:  2012-11-02

6.  Novel speech signal processing algorithms for high-accuracy classification of Parkinson's disease.

Authors:  Athanasios Tsanas; Max A Little; Patrick E McSharry; Jennifer Spielman; Lorraine O Ramig
Journal:  IEEE Trans Biomed Eng       Date:  2012-01-09       Impact factor: 4.538

7.  A random forest classifier for the prediction of energy expenditure and type of physical activity from wrist and hip accelerometers.

Authors:  Katherine Ellis; Jacqueline Kerr; Suneeta Godbole; Gert Lanckriet; David Wing; Simon Marshall
Journal:  Physiol Meas       Date:  2014-10-23       Impact factor: 2.833

8.  Development of early-warning protocol for predicting chlorophyll-a concentration using machine learning models in freshwater and estuarine reservoirs, Korea.

Authors:  Yongeun Park; Kyung Hwa Cho; Jihwan Park; Sung Min Cha; Joon Ha Kim
Journal:  Sci Total Environ       Date:  2014-09-19       Impact factor: 7.963

  8 in total
  1 in total

1.  Inland harmful cyanobacterial bloom prediction in the eutrophic Tri An Reservoir using satellite band ratio and machine learning approaches.

Authors:  Hao-Quang Nguyen; Nam-Thang Ha; Thanh-Luu Pham
Journal:  Environ Sci Pollut Res Int       Date:  2020-01-08       Impact factor: 4.223

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.