Literature DB >> 35521194

Identification of mine water inrush using laser-induced fluorescence spectroscopy combined with one-dimensional convolutional neural network.

Feng Hu¹, Mengran Zhou¹, Pengcheng Yan¹, Datong Li¹, Wenhao Lai¹, Kai Bian¹, Rongying Dai¹.

Abstract

The application of laser-induced fluorescence (LIF) combined with machine learning methods can make up for the shortcomings of traditional hydrochemical methods in the accurate and rapid identification of mine water inrush in coal mines. However, almost all of these methods require preprocessing such as principal component analysis (PCA) or drawing the spectral map as an essential step. Here, we provide our solution for the classification of mine water inrush, in which a one-dimensional convolutional neural network (1D CNN) is trained to automatically identify mine water inrush according to the LIF spectroscopy without the need for preprocessing. First, the architecture and parameters of the model were optimized and the 1D CNN model containing two convolutional blocks was determined to be the best model for the identification of mine water inrush. Then, we evaluated the performance of the 1D CNN model using the LIF spectral dataset of mine water inrush containing 540 training samples and 135 test samples, and we found that all 675 samples could be accurately identified. Finally, superior classification performance was demonstrated by comparing with a traditional machine learning algorithm (genetic algorithm-support vector machine) and a deep learning algorithm (two-dimensional convolutional neural network). The results show that LIF spectroscopy combined with 1D CNN can be used for the fast and accurate identification of mine water inrush without the need for complex pretreatments. This journal is © The Royal Society of Chemistry.

Entities: Chemical

Year: 2019 PMID： 35521194 PMCID： PMC9061159 DOI： 10.1039/c9ra00805e

Source DB: PubMed Journal: RSC Adv ISSN： 2046-2069 Impact factor: 4.036

Introductions

There are five common types of disasters in coal mines: gas,[1] water,[2] fire,[3] coal dust[4] and roof;[5] water disaster has now become the second largest disaster in coal mines.[6] The coal mine site not only needs to provide the early warning of water inrush,[7] but also needs to correctly identify the type of water source the first time after the occurrence of water inrush disaster[8]. Only in this way can we take the most effective control measures against water inrush in a targeted manner. For the identification of water inrush in the coal mine, the traditional methods are mostly based on hydrochemistry,[9] through which we obtain pH, ion concentration, conductivity and other parameters to establish the classification model of mine water inrush.[10] However, hydrochemical methods have some limitations such as being time-consuming[11] and requiring specialized instruments and skilled operators, which restrict their applications in online warning of mine water inrush. To overcome these shortcomings, great focus has been put on developing fast, online and reliable methods for the identification of mine water inrush. In recent years, laser-induced fluorescence[12,13] (LIF) technology has been applied to the identification of coal mine water inrush and combined with machine learning[14] to achieve the rapid and accurate identification of water inrush. Yan et al.[15] obtained the fluorescence spectra of water samples and then realized the accurate identification of five different types of mine water inrush by means of noise reduction pretreatment, principal component analysis (PCA) and soft independent modeling of class analogy (SIMCA). Wang et al.[16] employed PCA to extract feature information and then used an extreme learning machine (ELM) algorithm to establish a multivariate classification learning model, which greatly reduced the learning time of the classification. However, these methods for water source identification need to go through numerous tedious treatments such as noise reduction pretreatment and dimensional reduction treatment. Recent research has shown that even for relatively simple problems, the majority of ‘reasonable’ preprocessing methods as well as their respective parameter settings may actually decrease the performance of the final model.[17] With this in mind, Zhou et al.[18] proposed a convolutional neural network (CNN) combined with LIF spectral images to construct an identification model for mine water inrush. This method uses the original spectral data to draw the spectral graph and then realizes the water source identification with the help of CNN, avoiding complex pretreatment methods. Nevertheless, there are some shortcomings in this method, that is, the spectral graph should be drawn before recognition and the original spectral data cannot be directly identified. A one-dimensional convolutional neural network[19,20] (1D CNN), an important branch of CNN, has many advantages of ordinary CNN (usually refers to two-dimensional CNN (2DCNN)). 1D CNN has a wide range of applications in structural damage detection,[21] human activity recognition,[22] and classification of electroencephalography (EEG).[23,24] In particular, 1D CNN has many successful cases in the field of spectral analysis. Acquarelli et al.[25] used 1D CNN to classify vibrational spectroscopic data and identify important spectral regions. Malek et al.[26] proposed an alternative, 1D-CNN model obtained by particle swarm optimization (PSO) training for the regression analysis of the spectroscopic signal. Liu et al.[27] described a unified solution for the identification of chemical species, in which a 1D CNN model was trained to identify substances according to their Raman spectroscopy without the need of preprocessing. The aim of this research was to explore whether 1D CNN is suitable for the identification of LIF spectra in water inrush. In the following text, materials and methods are introduced first. The construction and optimization of the 1D CNN model are described next. A support vector machine[28] (SVM) is a kind of traditional classification model with wide applications and good results; genetic algorithm[29] (GA) is a common parameter optimization method of SVM. Then, the optimized 1D CNN model is compared with the genetic algorithm-support vector machine (GA-SVM) model and the 2D CNN model. At the same time, the performance of the model is compared according to the parameters of recognition rate, model complexity, training time, etc. Finally, the paper concludes with a summary of this study.

Materials and methods

Instrumentation

A schematic diagram of the experimental setup used for the LIF system is shown in Fig. 1; it mainly consists of a laser, a spectrometer and a computer system. The laser is a blue-violet semiconductor laser (Beijing Huayuan Tuoda Laser Technology Co., Ltd., Beijing, China) with the optical wavelength set at 405 nm and the maximum energy output of the laser is 150 mW. The laser is collimated by a focusing lens (focal distance 50 mm) and focused on the sample surface. The laser-induced plasma light signal is collected by using an optical fiber and is transmitted to the spectrometer (USB2000+; Ocean Optics Co., Largo, FL, USA) equipped with a 2048-pixel linear CCD array (Sony ILX511; Sony Corporation, Tokyo, Japan). The spectral range of the spectrometer is from 200 nm to 1100 nm.

Fig. 1

The schematic diagram of the LIF system.

In this experiment, the relevant parameters were optimized as follows: the energy of the laser was 120 mW, the spectral range of spectrometer was 340–1021 nm, and the integration time was 1 ms. For the purpose of avoiding the influence of background light and other human factors on the experimental results, the experiment was carried out in a dark room. Fluorescence spectral data for all samples were collected and recorded using Spectra Suite software on a computer.[30]

Materials and samples

Considering that goaf water is the most harmful and most common source of water inrush, goaf water, sandstone water and their mixture were selected as the research object of this experiment. The experimental material was collected in the Huainan mining area of the Anhui Province on July 18, 2018. The goaf water and sandstone water were mixed in different volume ratios; 75 samples of each type were selected to form the following sample set: (1) Group A: single goaf water. (2) Group B: the mixture of goaf water and sandstone water has a volume ratio of 4 : 1. (3) Group C: the mixture of goaf water and sandstone water has a volume ratio of 3 : 1. (4) Group D: the mixture of goaf water and sandstone water has a volume ratio of 2 : 1. (5) Group E: the mixture of goaf water and sandstone water has a volume ratio of 1 : 1. (6) Group F: the mixture of goaf water and sandstone water has a volume ratio of 1 : 2. (7) Group G: the mixture of goaf water and sandstone water has a volume ratio of 1 : 3. (8) Group H: the mixture of goaf water and sandstone water has a volume ratio of 1 : 4. (9) Group I: single sandstone water. All samples were kept from light and sealed to ensure that the experimental data were more realistic and reliable. Also, 60 samples per group were randomly selected and a total of 540 fluorescent spectral data were used as training sets. The remaining 25 samples for each group and a total of 135 fluorescent spectral data were used as the test sets.

1D CNN for LIF spectral data classification

1D CNN, as an important branch of CNN, usually includes convolutional layers, activation layers, and pooling layers. According to the literature,[27,31] we designed two typical structures of 1D CNN, as shown in Fig. 2. In structure A, each convolutional block (Conv Block) contains a batch normalization[32] (BN) layer. However, the BN layer is not included in the Conv Block in structure B, and the BN layer is placed behind the input layer. In structure A as well as in structure B, it is easy to build a deep 1D CNN model by increasing the number of Conv Block. In addition, in order to choose the best structure, we have compared the performances of the 1D CNN model of these two structures in Section 3.1.

Fig. 2

Two typical structures of 1D CNN.

Each component of 1D CNN is explained in detail as follows: The input to 1D CNN for LIF spectroscopy classification is one dimensional and it contains the entire spectrum (input size is 1 × 2048). In the Conv Block, the 1D convolutional layer is used to extract feature maps and different numbers of 1D convolutional filters of the same size are applied in each layer (stride of 1). As Conv Block goes deeper, the number of convolutional filters is doubled (starting from 16). The 1D convolutional layer can be expressed bywhere x and y denote the ith input map and the jth output map, respectively, k is a convolutional kernel between the maps i and j, (*) denotes vector convolution, and b is the bias of the jth map. BN is a technique for improving the performance and stability of neural networks, which can prevent gradient vanishing and over-fitting, as expressed by the following equation:Here, ε is a random noise (for stability), μB represents the mini-batch mean, σ is the mini-batch variance, α represents a scale parameter, and β is a shift parameter. Both α and β are trainable and updated in an epoch-wise manner. Rectified linear units[33] (ReLU) are used as a function of the activation layer that usually follows a convolutional layer or a fully connected layer. The ReLU can be described in eqn (3).Here, x denotes the feature maps of the convolutional layer, w represents the weight factor, and b is the bias. Max pooling is the most commonly used pooling strategy, which is mainly used to reduce the number of dimensions in the feature maps and network parameters. Both structure A and structure B use max pooling with the size of 4. The upper layers of the CNN are fully connected by the dense layer, followed by the softmax layer. In the two structures of Fig. 1, two dense layers are used. The first dense layer has 18 outputs, and the second dense layer has the same number of outputs as the sample category (9). The softmax layer produces values in the range [0, 1] as classification confidence scores. Classification loss is calculated by comparing the confidence scores and true labels of samples. The softmax function and loss function are expressed as follows:Here, z denotes the input of softmax layer, m denotes a sample, n denotes a category and K denotes the total number of categories.

1D CNN training

Two typical structures of the 1D CNN model proposed above were initially developed on Keras (v 2.2.4), a lightweight library used to build and train deep-learning algorithms, with a TensorFlow (v 1.10.0) background. The models were built on a workstation with a GeForce RTX 2070 graphics processing unit (GPU) with 288 Tensor cores and 2304 GPU cores and 8 GB of GDDR6 memory in a Windows 10 environment. The training of the 1D CNN was performed using the Adam algorithm, which is a variant of stochastic gradient descent, for 1000 epochs with learning rate equal to 0.001, β1 = 0.9, β2 = 0.99, and ε = 1 × 10−8. For optimization, during training and testing, the batch size was set to 128.

Results and discussion

Original spectral data

The spectral data of water samples were obtained by the LIF system; the spectral curves of all samples are shown in Fig. 3. It can be seen from the graph that the single goaf water and the single sandstone water are recognizable. However, it becomes indistinguishable when the two kinds of water are mixed together. There is a slight overlap of mixed water samples; thus, it is necessary to classify and identify with the help of chemometrics.

Fig. 3

Original LIF spectroscopy of water samples.

Comparison of the two 1D CNN structures

To test the two 1D CNN structures proposed in Section 2.3, the 1D CNN model with a Conv Block is constructed based on Fig. 2, and the classification performances of CNN models with two structures on the LIF spectra of mine water inrush are compared. By changing the size of the convolution kernel (from 3 to 21), accuracy and loss of 1D CNN models with two structures were recorded over 10 tests. The average accuracy and loss of 1D CNN models with two structures were calculated, as shown in Fig. 4. With the increase in the convolution kernel size, the accuracy of both structures shows a decreasing trend, and their loss tends to increase. This shows that the smaller the convolution kernel size, the easier it is to extract the appropriate features, which is helpful to improve the accuracy of classification. As can be seen from Fig. 4(a), the accuracy of the 1D CNN model of structure B is higher than that of structure A under the same convolution kernel size, which indicates that the 1D CNN model of structure B is more suitable for the LIF spectral classification of mine water inrush. Based on Fig. 4(a) and (b), we can determine that the 1D CNN of structure B should be used to construct an LIF spectral classification model for mine water inrush, and the kernel size of all convolutional layers in the model is set to 3.

Fig. 4

Results of the 1D CNN model of the two structures under different convolution kernel sizes.

Optimization of the 1D CNN architecture

According to Fig. 2(b), 1D CNN models of different depths are constructed by concatenating different numbers of Conv Blocks. The model was trained and tested using LIF spectral data from mine water samples, and ten replicates were performed at each depth. The trainable parameters, training time, accuracy and loss were considered in the selection of the optimal 1D CNN model. The average results of the 10 trials are shown in Table 1. First, we compared the accuracy and found that the accuracy rate of the 1D CNN model with one or two Conv Blocks can reach a maximum of 100.00%. Then, comparing the loss of these two depth 1D CNN models, we found that the loss of the model that contains two Conv Blocks is smaller and the model is more stable. Next, we note that the number of trainable parameters of the 1D CNN model with two Conv Blocks is almost half that of the 1D CNN model with one Conv Block. In addition, as the model goes deeper, the training time gradually increases. Taking these indicators into account, the 1D CNN model with two Conv Blocks is more suitable for the classification of LIF spectroscopy data of mine water inrush.

Comparison of different numbers of Conv Blocks of 1D CNN model

Conv Blocks	Parameters	Time (epoch per ms)	Accuracy (%)	Loss
1 Conv Block	147 425	40.48 ± 0.27	100.00 ± 0.00	0.0013 ± 0.0007
2 Conv Blocks	74 977	47.48 ± 0.40	100.00 ± 0.00	0.0006 ± 0.0002
3 Conv Blocks	43 745	51.66 ± 0.37	99.85 ± 0.16	0.0033 ± 0.0025
4 Conv Blocks	48 865	54.47 ± 0.34	99.94 ± 0.10	0.0041 ± 0.0031

Performance of the proposed 1D CNN model

With the above results, we obtained the architecture of the 1D CNN model for LIF spectral classification for mine water inrush. As shown in Fig. 5, we can see more details about the 1D CNN model. The optimal architecture is a 2 Conv Block 1D CNN model: (1) Conv Block 1 comprises 16 filters with a kernel size of 3, and Conv Block 2 comprises 32 filters with a kernel size of 3. (2) Each convolutional layer is followed by an ReLU activation layer and a max pooling layer with the size of 3.

Fig. 5

The architecture of the optimized 1D CNN classification model.

Fig. 6 shows the changes in accuracy and loss under different epochs. The results show that as the number of epochs increases, the accuracy tends to increase, while the loss shows a decreasing trend. The accuracy and loss reached stable values after hundreds of epochs of learning when applied to the LIF spectroscopy of mine water inrush. The accuracy of both the training set and the test set was 100.00%, which indicated that the 1D CNN model can accurately identify 540 training set samples and 135 test set samples. In other words, the proposed 1D CNN model is feasible and effective for the classification of LIF spectroscopy data of mine water inrush.

Fig. 6

Accuracy and loss of the proposed 1D CNN model for LIF spectroscopy.

Comparison with the traditional classification model

The performances of the GA-SVM model, PCA-GA-SVM model, and 1D CNN model are mainly compared here. Fig. 7 shows the accuracy of the three models under 10 trials. When the LIF spectroscopy data of mine water inrush are classified directly by using GA-SVM, the average recognition rate of all samples is 82.10%. On this basis, PCA is introduced to reduce the dimension of the original spectral data, and the average recognition rate increases to 98.77%. However, using the 1D CNN model for the classification of LIF spectroscopy data, the accuracy under all 10 trials is 100.00%. Compared with SVM models, the 1D CNN model has higher performance for LIF spectroscopy. At the same time, it can also be seen that SVM models have a certain dependence on the preprocessing of raw spectral data.

Fig. 7

Accuracy of three models in 10 trials.

Comparison with 2D CNN model

The input to the 2D CNN model is an image; thus, the fluorescence spectral image is drawn using the data of wavelength and fluorescence intensity. Based on Fig. 2, a similar structure 2D CNN model was constructed with the convolution kernel size set to 3 × 3 and the max pooling size set to 2 × 2. We increased the number of Conv Blocks to increase the depth of the network, while 10 repetitive tests were performed at different depths. Table 2 shows the average results of 10 repetitive trials. When using a 2D CNN model for recognition, it takes at least 3 Conv Blocks to achieve a 100.00% recognition rate. In other words, for the analysis of LIF spectroscopy of mine water inrush, the 2D CNN model needs a deeper network depth to achieve the recognition performance of the 1D CNN model. Comparing Tables 1 and 2, we can also find that the 2D CNN model has a longer training time and a larger number of trainable parameters at the same network depth. Therefore, the 1D CNN model is more suitable than the 2D CNN model for LIF spectral analysis of mine water inrush.

Results of different numbers of Conv Blocks of 2D CNN model

Conv Blocks	Parameters	Time (epoch per ms)	Accuracy (%)	Loss
1 Conv Block	558 525	93.75 ± 2.06	77.68 ± 7.33	0.3985 ± 0.1580
2 Conv Blocks	253 917	114.88 ± 1.15	95.42 ± 9.30	0.0794 ± 0.1585
3 Conv Blocks	125 085	124.11 ± 2.07	100.00 ± 0.00	0.0019 ± 0.0025
4 Conv Blocks	125 469	137.77 ± 2.20	100.00 ± 0.00	0.0001 ± 0.0002

Conclusions

In this paper, we have proposed a 1D CNN model for the classification of mine water inrush from LIF spectroscopy, which not only exhibits outstanding performance, but also avoids the need for preprocessing raw spectral data. The 1D CNN model was designed with 1D convolution kernels, and various architectures were compared to identify the optimal architecture for the LIF spectral analysis of mine water inrush. The optimal architecture is a 1D CNN model with 2 Conv Blocks: Conv Block 1 contains 16 filters and Conv Block 2 contains 32 filters; the convolution kernel size of both Conv Blocks is 3. The 1D CNN model was trained and tested using the LIF spectroscopy of mine water inrush (goaf water, sandstone water, and their mixture), and all samples of training sets and test sets could be accurately identified. To validate the performance of the 1D CNN model, we compared it with the GA-SVM model and the 2D CNN model. Compared with SVM, we found that the 1D CNN model has higher accuracy, and using CNN to analyse LIF spectrum does not require preprocessing (such as PCA) of raw data. Compared to the 2D CNN model, the 1D CNN model has fewer Conv Blocks and less training time when achieving 100.00% recognition accuracy. In addition, the 1D CNN model can directly analyse the original spectral data, while 2D CNN needs to draw the spectral map based on spectral data before analysis. The present investigation considers the classification of mine water inrush from LIF spectroscopy using 1D CNN with 2 Conv Blocks. It will be interesting to extend the investigation to the identification by using a simpler architecture while considering designing a more versatile 1D CNN model that can be used for LIF spectroscopy of more substances.

Conflicts of interest

There are no conflicts to declare.

8 in total

1. Rapid identification of closely related muscle foods by vibrational spectroscopy and machine learning.

Authors: David I Ellis; David Broadhurst; Sarah J Clarke; Royston Goodacre
Journal: Analyst Date: 2005-10-26 Impact factor: 4.616

2. Mesoporous silicon photonic crystal microparticles: towards single-cell optical biosensors.

Authors: Bin Guan; Astrid Magenau; Krisopher A Kilian; Simone Ciampi; Katharina Gaus; Peter J Reece; J Justin Gooding
Journal: Faraday Discuss Date: 2011 Impact factor: 4.008

3. Convolutional neural networks for vibrational spectroscopic data analysis.

Authors: Jacopo Acquarelli; Twan van Laarhoven; Jan Gerretzen; Thanh N Tran; Lutgarde M C Buydens; Elena Marchiori
Journal: Anal Chim Acta Date: 2016-12-27 Impact factor: 6.558

4. A novel deep learning approach for classification of EEG motor imagery signals.

Authors: Yousef Rezaei Tabar; Ugur Halici
Journal: J Neural Eng Date: 2016-11-30 Impact factor: 5.379

Review 5. "Getting the best sensitivity from on-capillary fluorescence detection in capillary electrophoresis" - A tutorial.

Authors: Victor A Galievsky; Alexander S Stasheuski; Sergey N Krylov
Journal: Anal Chim Acta Date: 2016-06-20 Impact factor: 6.558

6. Deep convolutional neural networks for Raman spectrum recognition: a unified solution.

Authors: Jinchao Liu; Margarita Osadchy; Lorna Ashton; Michael Foster; Christopher J Solomon; Stuart J Gibson
Journal: Analyst Date: 2017-10-23 Impact factor: 4.616

7. Automated Detection of Obstructive Sleep Apnea Events from a Single-Lead Electrocardiogram Using a Convolutional Neural Network.

Authors: Erdenebayar Urtnasan; Jong-Uk Park; Eun-Yeon Joo; Kyoung-Joung Lee
Journal: J Med Syst Date: 2018-04-23 Impact factor: 4.460

8. [Research on the Source Identification of Mine Water Inrush Based on LIF Technology and SIMCA Algorithm].

Authors: Peng-cheng Yan; Meng-ran Zhou; Qi-meng Liu; Kai-yuan Zhang; Chen-yang He
Journal: Guang Pu Xue Yu Guang Pu Fen Xi Date: 2016-01 Impact factor: 0.589

8 in total