| Literature DB >> 31514321 |
Ye Zhang1, Mingchao Li2, Shuai Han3, Qiubing Ren4, Jonathan Shi5.
Abstract
It is significant to identify rock-mineral microscopic images in geological engineering. The task of microscopic mineral image identification, which is often conducted in the lab, is tedious and time-consuming. Deep learning and convolutional neural networks (CNNs) provide a method to analyze mineral microscopic images efficiently and smartly. In this research, the transfer learning model of mineral microscopic images is established based on Inception-v3 architecture. The four mineral image features, including K-feldspar (Kf), perthite (Pe), plagioclase (Pl), and quartz (Qz or Q), are extracted using Inception-v3. Based on the features, the machine learning methods, logistic regression (LR), support vector machine (SVM), random forest (RF), k-nearest neighbors (KNN), multilayer perceptron (MLP), and gaussian naive Bayes (GNB), are adopted to establish the identification models. The results are evaluated using 10-fold cross-validation. LR, SVM, and MLP have a significant performance among all the models, with accuracy of about 90.0%. The evaluation result shows LR, SVM, and MLP are the outstanding single models in high-dimensional feature analysis. The three models are also selected as the base models in model stacking. The LR model is also set as the meta classifier in the final prediction. The stacking model can achieve 90.9% accuracy, which is higher than all the single models. The result also shows that model stacking effectively improves model performance.Entities:
Keywords: CNN; deep learning; machine learning; model stacking; rock-mineral microscopic images; transfer learning
Year: 2019 PMID: 31514321 PMCID: PMC6767609 DOI: 10.3390/s19183914
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Overall framework for rock-mineral microscopic images identification.
Figure 2Transfer learning based on Inception-v3.
The outline of the Inception-v3 model.
| Type | Patch Size/Stride or Remarks | Input Size |
|---|---|---|
| Conv | 3 × 3/2 | 299 × 299 × 3 |
| Conv | 3 × 3/1 | 149 × 149 × 3 |
| Conv Padded | 3 × 3/1 | 147 × 147 × 32 |
| Pool | 3 × 3/2 | 147 × 147 × 64 |
| Conv | 3 × 3/1 | 73 × 73 × 64 |
| Conv | 3 × 3/2 | 71 × 71 × 80 |
| Conv | 3 × 3/1 | 35 × 35 × 192 |
| 3 × Inception | 1 × 1 and 3 × 3/1 | 35 × 35 × 288 |
| 5 × Inception | 17 × 17 × 768 | |
| 2 × Inception | 1 × 1, 1 × 3, 3 × 1, and 3 × 3/2 | 8 × 8 × 1280 |
| Pool | 8 × 8 | 8 × 8 × 2048 |
| Linear | Logits | 1 × 1 × 2048 |
| Softmax | Classifier | 1 × 1 × 1000 |
Figure 3MLP model structure.
Figure 4k-fold cross-validation.
Figure 5Model stacking.
Figure 6Microscope.
The numbers of sample data of rock-mineral microscopic images.
| Mineral | Number of Images in Each Class | Total Number of Images |
|---|---|---|
| Kf | 116 | - |
| Pe | 122 | 481 |
| Pl | 122 | - |
| Qz | 121 | - |
Figure 7The four cut mineral microscopic images: (a) Kf; (b) Pe; (c) Pl; (d) Qz.
Figure 8The feature map visualization of Kf.
The parameters set in the models.
| Method | Parameters | Value |
|---|---|---|
| LR | Solver | Newton-cg |
| Multi_class | Multinomial | |
| LinearSVC | C | 1.0 |
| RF | N_jobs | 4 |
| Criterion | Entropy | |
| N_estimators | 70 | |
| Min_samples_split | 5 | |
| KNN | N_neighbors | 1 |
| N_jobs | 4 | |
| MLP | Hidden_layer_sizes | 300 |
The data test with different deep learning models.
| Models | Accuracy (%) | Accuracy Standard Deviation (%) |
|---|---|---|
| LR | 90.0 | 4.5 |
| SVM | 90.6 | 5.3 |
| RF | 81.9 | 3.2 |
| KNN | 83.0 | 5.4 |
| MLP | 89.8 | 3.5 |
| GNB | 78.0 | 4.3 |
The data test with different deep learning models.
| Models | Accuracy (%) | Accuracy Standard Deviation (%) |
|---|---|---|
| LR | 90.0 | 4.5 |
| SVM | 90.6 | 5.3 |
| MLP | 89.8 | 3.5 |
| Stacking Model | 90.9 | 4.0 |