| Literature DB >> 31667387 |
Nur Zahidah Shafii1, Ahmad Shakir Mohd Saudi1, Jyh Chyang Pang1, Izuddin Fahmy Abu1, Norzahir Sapawe2, Mohd Khairul Amri Kamarudin3, Hammad Farhi Mohd Saudi4.
Abstract
There has been a growing concern on the rising of environmental issues in Malaysia over the last decade. Many environmental studies conducted in this country began to utilise the chemometrics techniques to overcome the limitation in the environmental monitoring studies. Chemometrics becomes an important tool in environmental fields to evaluate the relationship of various environmental variables particularly in a large and complex database. The review aimed to analyse and summarize the current evidences and limitations on the application of chemometrics techniques in the environmental studies in Malaysia. The study performed a comprehensive review of relevant scientific journals concerning on the major environmental issues in the country, published between 2013 and 2017. A total of 29 papers which focused on the environmental issues were reviewed. Available evidences suggested that chemometrics techniques have a greater accuracy, flexibility and efficiency to be applied in environmental modelling. It also reported that chemometrics techniques are more practical for cost effective and time management in sampling and monitoring purposes. However, chemometrics is relatively new in environmental field in Malaysia and various scopes need to be considered in the future as the current studies focused on very limited number of major environmental issues. Overall, chemometrics techniques have a lot of advantages in solving environmental problems. The development of chemometrics in environmental studies in the country is necessary to advance understanding, thus able to produce more significant impacts towards the effective environmental management.Entities:
Keywords: Chemometrics; Environment; Environmental science; Flood; Malaysia; Pollution; Review
Year: 2019 PMID: 31667387 PMCID: PMC6812457 DOI: 10.1016/j.heliyon.2019.e02534
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Fig. 1Research strategy for the review.
Classification of different chemometrics employed in the reviewed articles.
| Chemometrics techniques | |
|---|---|
Cluster analysis (CA) Artificial neural network (ANN) | Multiple linear regression (MLR) |
Discriminant analysis (DA) Artificial neural network (ANN) | Statistical process control (SPC) |
Factor analysis (FA) Principal component analysis (PCA) | |
Application of chemometrics technique in air quality and pollution.
| No. | Research aims | Chemometrics technique | Key findings | References |
|---|---|---|---|---|
| 1. | Identification of spatial and temporal air quality pattern recognition at three selected Malaysian air monitoring stations based on 7 years database (January 2000–December 2010). | DA, HACA, PCA and ANN | The usefulness of chemometrics and ANN modelling techniques in evaluating and interpreting large air quality datasets in order to enable to get better information about the air quality pattern. | |
| 2. | Application of Feed-Forward Artificial Neural Network Model to predict the API within the seven selected Malaysian air monitoring stations in the southern region of Peninsular Malaysia based on 7 years database (2005–2011). | PCA and ANN | Rotated PC scores are more efficient and effective in reducing the predictor variables without losing important information. | |
| 3. | Prediction of the level of air pollution using Principal Component Analysis and Artificial Neural Network techniques at ten monitoring stations in Malaysia for 7 years (2005–2011) | PCA and ANN | A significant reduction of input data using rotated PCA scores with a good predictive power similar to the standard model. | |
| 4. | Identification of spatial variations on API at 8 monitoring stations in the southern region of peninsular Malaysia for 3 years (2005–2007). | HACA, DA, PCA-FA, ANN and MLR | ANN model shows better prediction compared to the MLR. | |
| 5. | Identification of potential source apportionment of air pollution in ten Malaysian monitoring stations for 7 years (2006–2012). | PCA | Application of the PCA method can be applied for the source apportionment purpose for the future and effective management of air quality. | |
| 6. | Assessment of the spatial variation and source apportionment of air pollution at five selected monitoring stations in Peninsular Malaysia covering 2007–2011 (5years). | HACA, PCA and MLR | Cut down the cost of equipment, reducing cost and time of monitoring redundant stations and pollutants. | |
| 7. | Recognition of the pollution sources and identification the most significant pollutant at 14 monitoring stations around Peninsular Malaysia from January to December 2007. | HACA, DA, PCA-FA and MLR | Provide meaningful information on the spatial variability of a large and complex air quality data. | |
| 8. | Selection of the most significant variables of air pollutants using sensitivity analysis at ten selected Malaysian monitoring stations based on database for a 7 year period (2006–2012). | ANN-API-AP, ANN-API-LCO, ANN-API-LO3, ANN-API-LPM10, ANN-API-LSO2, ANN-API-LNO2, ANN-API-LCH4, ANN-API-LNmHC, ANN-API-LTHC, ANN-API-LO, and ANN-API-DOE | ANN-API-LO model was the best predictor model as only two parameters utilized as input for API prediction. | |
| 9. | Identification of the spatial variation of air pollutants and its pattern at seven air monitoring stations in the northern part of Peninsular Malaysia for 4 years (2008–2011). | HACA, DA and ANN | The predictive ability of chemometrics is at least as good as the standard model. | |
| 10. | Determination of air quality pattern at Putrajaya monitoring station based on three years observation (2011–2013). | PCA, FA and SPC | PCA and FA model identified five pollutants that affected air quality. | |
| 11. | Assessment of ambient air pollution pattern in Shah Alam, Malaysia for 5 years (2009–2013). | PCA and SPC | The level of air quality was manipulated by the climate condition, gas, non-gas and secondary air pollutants. |
ANN = Artificial Neural Network; API = air-pollutant index; AP = all parameters; DA = discriminant analysis; DOE = Department of Environment; FA = factor analysis; HACA = hierarchical agglomerative cluster analysis; LCO = leave carbon monoxide; LO3 = leave ground-level ozone; LPM10 = leave particulate matter; LSO2 = leave sulphur dioxide; LNO2 = leave nitrogen dioxide; LCH4 = leave methane; LNmHC = leave non-methane hydrocarbon; LTHC = leave total hydrocarbon; LO = leave-out; MLR = multiple linear regression; PCA = principal components analysis; SO2 = sulphur dioxide; SPC = statistical process analysis.
Application of chemometrics technique in water quality and pollution.
| No. | Research aims | Chemometrics technique | Key findings | References |
|---|---|---|---|---|
| 1. | Identification of the surface water pollution at 13 monitoring stations of Terengganu River Basin from 5 years (2003–2007) | CA, DA and PCA-FA | The usefulness for analysis and interpretation of complex databases. | |
| 2. | Analysis the relationship between the physicochemical levels and the drinking water quality in water treatment plant at Kuala Kubu Bharu, Malaysia (January to December 2011). | DA and PCA-FA | The drinking water quality was within the national standards. | |
| 3. | Assessment of water quality from 50 selected monitoring stations of Johor River for 5 years (2003–2007). | HACA, DA and PCA | One monitoring station of each cluster is sufficient to represent a reasonably accurate spatial water quality assessment for the entire river, thus reduce the needs for numerous sampling stations. | |
| 4. | Spatial assessment of water quality affected by the land-use changes in the Kuantan River Basin for 6 years (2003–2008). | MLR, HACA, DA and PCA | A good and efficient prediction of missing data using MLR model. | |
| 5. | Classification of water quality at nine monitoring stations in the Muda River Basin for 10 years data set (1998–2007) | CA, PCA and DA | Chemometric techniques were effective for river water classification as it exhibited a correct classification efficiency of 100%. | |
| 6. | Analysis of surface water pollution in the eight monitoring sites along Kinta River for 8 years (2006–2013). | CA, DA, PCA and MLR | A reduction in the number of monitoring stations and parameters for a cost effective and time management in the monitoring processes. | |
| 7. | Determination of water quality status for six monitoring stations at Linggi River based on 6 years observation (1997–2012). | CA and PCA | Reliability of surface water classification facilitate local authorities to reduce cost of monitoring cost by reducing the monitoring station and guide for better decision making. | |
| 8. | Spatial characterization of water quality data and identification of pollutants sources from 12 sampling stations at Terengganu River for 5 years (2006–2010). | HACA and PCA | The application disclosed important information on the spatial variability of large and complex river water quality data sets to control pollution sources. | |
| 9. | Variation of physicochemical sources for drinking water quality at 28 water treatment plants in Klang Valley for a 4 year period (2009–2012). | DA and PCA | The significant input on the spatial variability of a large and multifaceted drinking water quality data. | |
| 10. | Determination of the spatial variation and source identification of heavy metal pollution in 8 monitoring stations along the Straits of Malacca for 5 years (2006–2010). | CA, DA and PCA | Monitoring of heavy metals could be optimal through a single state, each representing the northern and southern region. | |
| 11. | Assessment of river water quality assessment at 39 monitoring stations at Johor River Basin for 5 years (2003–2007). | PCA, APCS-MLR and SPC | The most significant parameters which contributed to the river pollution should be used as reference for authority as it could save time and money budget in water quality sampling and lab analysis of the redundant parameters. | |
| 12. | Detection of control limit for source apportionment in 11 monitoring stations Perlis River Basin for 5 years (2003–2007). | PCA, APCS-MLR and SPC | The significant parameters that impacted the water quality could be used as a reference by the government agencies for monitoring purposes. |
ANN = Artificial Neural Network; APCS = absolute principal component scores; CA = cluster analysis; DA = discriminant analysis; HACA = hierarchical agglomerative cluster analysis; FA = factor analysis; MLR = multiple linear regression; PCA = principal components analysis; SPC = statistical process analysis.
Application of chemometrics technique in flood.
| No. | Research aims | Chemometrics technique | Key findings | References |
|---|---|---|---|---|
| 1. | Identification and recognition of flood risk pattern in four monitoring stations at Kuantan River Basin for 30 years (1982–2012). | FA, Time series analysis and ANN | The prediction of the risk class for flood would construct proper mitigating measure more efficiently for flood occurrence, thus able to cut down cost of destruction and save life. | |
| 2. | Recognition of flood risk pattern in four monitoring stations at Muda River Basin for 30 years (1982–2012). | FA, Time series analysis and ANN | The prediction was accurate and could be used for future prediction in the risk assessment for flood occurrence. | |
| 3. | Assessment of flood risk index in four monitoring stations at Johor River Basin for 30 years (1982–2012). | FA, SPC, ANN and FRI | Water level was the most practicable variable to be used for the warning alert system. | |
| 4. | Prediction of hydrological modelling for flood risk in four monitoring stations at Langat River Basin covered a 30 year period (1982–2012). | ANN, FA and SPC | The prediction of risk in Risk Class was accurate and relevant to be taken into consideration for future flood prediction. | |
| 5. | Development of new flood risk index in tropical area, Muda River Basin for 30 years (1982–2012). | PCA, SPC, ANN and FRI | The control limit of water level allowed a continuous monitoring system by local authorities to execute early preventive measures. | |
| 6. | Analysis of the relationship of rainfall pattern and water level on major flood in Pahang River Basin in 2014. | PCA and SPC | High relationship between rainfall and water level. |
ANN = artificial neural network; DA = discriminant analysis; FA = factor analysis; FRI = flood risk index; HACA = hierarchical agglomerative cluster analysis; MLR = multiple linear regression; PCA = principal components analysis; SPC = statistical process analysis.
Fig. 2The advantages of the chemometrics.