Literature DB >> 29272835

Feature selection approaches for predictive modelling of groundwater nitrate pollution: An evaluation of filters, embedded and wrapper methods.

V F Rodriguez-Galiano1, J A Luque-Espinar2, M Chica-Olmo3, M P Mendes4.   

Abstract

Recognising the various sources of nitrate pollution and understanding system dynamics are fundamental to tackle groundwater quality problems. A comprehensive GIS database of twenty parameters regarding hydrogeological and hydrological features and driving forces were used as inputs for predictive models of nitrate pollution. Additionally, key variables extracted from remotely sensed Normalised Difference Vegetation Index time-series (NDVI) were included in database to provide indications of agroecosystem dynamics. Many approaches can be used to evaluate feature importance related to groundwater pollution caused by nitrates. Filters, wrappers and embedded methods are used to rank feature importance according to the probability of occurrence of nitrates above a threshold value in groundwater. Machine learning algorithms (MLA) such as Classification and Regression Trees (CART), Random Forest (RF) and Support Vector Machines (SVM) are used as wrappers considering four different sequential search approaches: the sequential backward selection (SBS), the sequential forward selection (SFS), the sequential forward floating selection (SFFS) and sequential backward floating selection (SBFS). Feature importance obtained from RF and CART was used as an embedded approach. RF with SFFS had the best performance (mmce=0.12 and AUC=0.92) and good interpretability, where three features related to groundwater polluted areas were selected: i) industries and facilities rating according to their production capacity and total nitrogen emissions to water within a 3km buffer, ii) livestock farms rating by manure production within a 5km buffer and, iii) cumulated NDVI for the post-maximum month, being used as a proxy of vegetation productivity and crop yield.
Copyright © 2017 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Embedded methods; Feature selection; Groundwater; Machine learning algorithms; Nitrates; Wrapper methods

Year:  2017        PMID: 29272835     DOI: 10.1016/j.scitotenv.2017.12.152

Source DB:  PubMed          Journal:  Sci Total Environ        ISSN: 0048-9697            Impact factor:   7.963


  7 in total

1.  Enhanced Evolutionary Feature Selection and Ensemble Method for Cardiovascular Disease Prediction.

Authors:  V Jothi Prakash; N K Karthikeyan
Journal:  Interdiscip Sci       Date:  2021-05-14       Impact factor: 2.233

Review 2.  The application of artificial intelligence and radiomics in lung cancer.

Authors:  Yaojie Zhou; Xiuyuan Xu; Lujia Song; Chengdi Wang; Jixiang Guo; Zhang Yi; Weimin Li
Journal:  Precis Clin Med       Date:  2020-08-24

3.  Tumor classification and biomarker discovery based on the 5'isomiR expression level.

Authors:  Shengqin Wang; Zhihong Zheng; Peichao Chen; Mingjiang Wu
Journal:  BMC Cancer       Date:  2019-02-07       Impact factor: 4.430

4.  A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification.

Authors:  Lin Sun; Xianglin Kong; Jiucheng Xu; Zhan'ao Xue; Ruibing Zhai; Shiguang Zhang
Journal:  Sci Rep       Date:  2019-06-20       Impact factor: 4.379

5.  A stacking-based model for predicting 30-day all-cause hospital readmissions of patients with acute myocardial infarction.

Authors:  Zhen Zhang; Hang Qiu; Weihao Li; Yucheng Chen
Journal:  BMC Med Inform Decis Mak       Date:  2020-12-14       Impact factor: 2.796

6.  Machine Learning Algorithms for the Prediction of Central Lymph Node Metastasis in Patients With Papillary Thyroid Cancer.

Authors:  Yijun Wu; Ke Rao; Jianghao Liu; Chang Han; Liang Gong; Yuming Chong; Ziwen Liu; Xiequn Xu
Journal:  Front Endocrinol (Lausanne)       Date:  2020-10-21       Impact factor: 5.555

7.  Uncovering Pathways Highly Correlated to NUE through a Combined Metabolomics and Transcriptomics Approach in Eggplant.

Authors:  Antonio Mauceri; Meriem Miyassa Aci; Laura Toppino; Sayantan Panda; Sagit Meir; Francesco Mercati; Fabrizio Araniti; Antonio Lupini; Maria Rosaria Panuccio; Giuseppe Leonardo Rotino; Asaph Aharoni; Maria Rosa Abenavoli; Francesco Sunseri
Journal:  Plants (Basel)       Date:  2022-03-04
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.