| Literature DB >> 35022442 |
Abstract
Fluorescence spectroscopy can provide high-level chemical characterization and quantification that is suitable for use in online process monitoring and control. However, the high-dimensionality of excitation-emission matrices and superposition of underlying signals is a major challenge to implementation. Herein the use of Convolutional Neural Networks (CNNs) is investigated to interpret fluorescence spectra and predict the formation of disinfection by-products during drinking water treatment. Using deep CNNs, mean absolute prediction error on a test set of data for total trihalomethanes, total haloacetic acids, and the major individual species were all < 6 µg/L and represent a significant difference improved by 39-62% compared to multi-layer perceptron type networks. Heat maps that identify spectral areas of importance for prediction showed unique humic-like and protein-like regions for individual disinfection by-product species that can be used to validate models and provide insight into precursor characteristics. The use of fluorescence spectroscopy coupled with deep CNNs shows promise to be used for rapid estimation of DBP formation potentials without the need for extensive data pre-processing or dimensionality reduction. Knowledge of DBP formation potentials in near real-time can enable tighter treatment controls and management efforts to minimize the exposure of the public to DBPs.Entities:
Year: 2022 PMID: 35022442 PMCID: PMC8755818 DOI: 10.1038/s41598-021-03881-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Mean absolute error (MAE) of prediction for a test set (n = 28) of THM concentrations and HAA concentrations using a MLP. (a) The effect of number of layers on performance, (b) the effect of number of nodes in 1 hidden layer on performance. Error bars represent 95% confidence intervals based on 8 random initializations of the network weights.
Mean absolute error (MAE) of predictions on tests set for several model types. Range or error (±) is calculated as the 95% confidence interval, where applicable. Bolded numbers represent the optimal model for each species as determined by t tests at 95% confidence levels.
| Disinfection by-product species | Range of DBP concentrations (μg L−1) | Mean absolute error (μg L−1) | ||||||
|---|---|---|---|---|---|---|---|---|
| MLP (1 layer) | CNN (1 layer) | CNN (4 pooling layers, 1 convolutional layer) | CNN (4 pooling layers, 4–5 convolutional layers) | PARAFAC-MLP | PCA-MLP | 3-way PLS | ||
| Total THMs | 26.5–208.2 | 12.3 ± 0.2 | 6.6 ± 0.1 | 6.1 ± 0.2 | 18.7 ± 0.6 | 15.4 ± 0.5 | 15.9 | |
| Trichloromethane | 24.0–174.3 | 8.8 ± 0.2 | 7.0 ± 0.6 | 4.9 ± 0.4 | 16.6 ± 0.9 | 13.8 ± 0.5 | 12.6 | |
| Bromodichloromethane | 13.6–62.6 | 6.3 ± 0.4 | 4.7 ± 0.2 | 4.1 ± 0.2 | 6.5 ± 0.3 | 7.4 ± 1.2 | 6.4 | |
| Total HAAs | 28.1–139.5 | 10.0 ± 0.2 | 4.5 ± 0.2 | 12.5 ± 0.2 | 12.4 ± 0.3 | 11.1 | ||
| Dichloroacetic acid | 17.7–85.8 | 7.8 ± 0.3 | 6.1 ± 0.7 | 4.8 ± 0.2 | 6.1 ± 0.3 | 8.0 ± 0.3 | 8.9 | |
| Trichloroacetic acid | 10.4–81.4 | 8.4 ± 0.2 | 5.9 ± 0.6 | 4.6 ± 0.1 | 7.0 ± 0.3 | 6.8 ± 1.2 | 5.3 | |
Figure 2Impact of CNN structure and depth on MAE of test set predictions. (a) Varies the number of convolutional layers without any max pooling layers, (b) varies the number of layer sets with 1 convolutional layer followed by max pooling, (c) varies the number of convolutional layers between max pooling layers (4 max pooling layers in total), (d) varies the number of convolutional filters for 1 convolutional layer without max pooling, (e) varies the size of the receptive field for 1 convolutional layer without max pooling. Red boxes delineate areas of optimal performance, where all models within the box performed similarly based on t tests at 95% confidence levels.
Figure 3Feature maps of convolutional filters (24) from 4 convolutional layers chosen between max pooling layers. (a) First convolutional layer, (b) after the first max pooling, (c) after the second max pooling, (d) after the third max pooling. All max pooling was carried out over a (2, 2) window.
Figure 4Heat maps from random occlusion of variable importance for CNN prediction of (a) total THMs, (b) trichloromethane, (c) bromodichloromethane, (d) total HAAs, (e) trichloroacetic acid, (f) dichloroacetic acid.
Figure 5Heat maps from random occlusion of variable importance for MLP prediction of (a) total THMs, (b) trichloromethane, (c) bromodichloromethane, (d) total HAAs, (e) trichloroacetic acid, (f) dichloroacetic acid.
Figure 6General schematic of convolutional neural network structure. The number of convolutional layers as well as the number of layers (convolution + max pooling) can be varied.
Descriptions of the general model types used. n refers to the number of layers and is varied based on model depth.
| Model description | Structure |
|---|---|
| Multilayer perceptron | Input; dimensions = |
| 1. Dense ( | |
| 2. Batch normalization | |
| 3. Activation ( | |
| Dense ( | |
| CNN, no max pooling | Input; dimensions = |
| 1. Convolution 2D ( | |
| 2. Batch normalization | |
| 3. Activation ( | |
| Flatten | |
| Dense ( | |
| CNN, with max pooling | Input; dimensions = |
| 1. Convolution 2D ( | |
| 2. Batch normalization | |
| 3. Activation ( | |
| 4. Max pooling 2D ( | |
| Flatten | |
| Dense ( | |
| CNN | Input; dimensions = |
| | |
| 1. Convolution 2D ( | |
| 2. Batch normalization | |
| 3. Activation ( | |
| Max pooling 2D (2, 2) | |
| Flatten | |
| Dense ( |
Description of the occlusion method used to identify spectral heat maps or areas of importance.
| Occlusion method | |
|---|---|
| 1: | Train a network using original training data ( |
| 2: | Predict outputs ( |
| 3: | |
| 4: | Randomly select EEM patch from |
| 5: | Set patch = 0 to create corrupted test set, |
| 6: | Predict outputs ( |
| 7: | Average error calculated over all test data. |
| 8: | Check all excitation/emission were included in random selection |
| 9: | Calculate average error over all iterations for each excitation/emission pair |