| Literature DB >> 30845726 |
Mingliang Mei1, Ji Chang2, Yuling Li3, Zerui Li4, Xiaochuan Li5, Wenjun Lv6.
Abstract
Autonomous robots that operate in the field can enhance their security and efficiency by accurate terrain classification, which can be realized by means of robot-terrain interaction-generated vibration signals. In this paper, we explore the vibration-based terrain classification (VTC), in particular for a wheeled robot with shock absorbers. Because the vibration sensors are usually mounted on the main body of the robot, the vibration signals are dampened significantly, which results in the vibration signals collected on different terrains being more difficult to discriminate. Hence, the existing VTC methods applied to a robot with shock absorbers may degrade. The contributions are two-fold: (1) Several experiments are conducted to exhibit the performance of the existing feature-engineering and feature-learning classification methods; and (2) According to the long short-term memory (LSTM) network, we propose a one-dimensional convolutional LSTM (1DCL)-based VTC method to learn both spatial and temporal characteristics of the dampened vibration signals. The experiment results demonstrate that: (1) The feature-engineering methods, which are efficient in VTC of the robot without shock absorbers, are not so accurate in our project; meanwhile, the feature-learning methods are better choices; and (2) The 1DCL-based VTC method outperforms the conventional methods with an accuracy of 80.18%, which exceeds the second method (LSTM) by 8.23%.Entities:
Keywords: LSTM; autonomous robots; terrain classification; vibration
Year: 2019 PMID: 30845726 PMCID: PMC6427223 DOI: 10.3390/s19051137
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Examples of eight terrain types along with corresponding samples of undampened vibration signals (left) and dampened vibration signals (right). (a) Asphalt. (b) Cobble. (c) Concrete. (d) Grass 1 (artificial grassland). (e) Grass 2 (natural grassland). (f) Gravel. (g) Plastic. (h) Tile.
Figure 2Illustration of dampened vibration-based terrain classification.
Figure 3The overall structure of 1DCL. The marker ⊛ denotes convolution operation.
Figure 4The experimental robot.
Figure 5Confusion matrix of binary classification. The confusion matrix of multi-class classification can be reduced to several confusion matrices of binary classification.
Performance of Seven Different Classifiers Using Time-Domain and Freq-Domain Feature Extraction Methods.
| Features | Metrics | Classifiers | ||||||
|---|---|---|---|---|---|---|---|---|
| SVM | ELM | kNN | NB | DT | RF | Adaboost | ||
| Time-domain | Accuracy | 65.30% | 65.95% | 61.53% | 57.96% | 58.39% | 65.89% |
|
| F1-score | 0.6438 |
| 0.6065 | 0.5617 | 0.5803 | 0.6475 | 0.6465 | |
| FFT-based | Accuracy | 67.99% | 62.96% | 57.75% | 57.67% | 59.62% | 68.07% |
|
| F1-score | 0.6674 | 0.6068 | 0.5695 | 0.5612 | 0.5898 | 0.6679 |
| |
| PSD-based | Accuracy | 67.52% | 69.84% | 65.69% | 58.37% | 60.26% |
| 67.91% |
| F1-score | 0.6622 | 0.6853 | 0.6454 | 0.5595 | 0.5999 |
| 0.6699 | |
Figure 6True positive rate for 8 terrain types with different classifiers using (a) Time-domain features, (b) FFT-based features, (c) PSD-based features.
Figure 7Normalized confusion matrices of (a) T-NB, (b) T-RF, (c) PSD-NB, (d) PSD-RF.
Performance of Feature-Learning Approaches.
| Metrics | Networks | ||||
|---|---|---|---|---|---|
| CNN | LSTM | 1DCL | 1DCL-FC | 1DCL-3Conv | |
| Accuracy | 70.17% | 71.95% |
| 67.94% | 72.93% |
| F1-score | 0.6893 | 0.7030 |
| 0.6622 | 0.7159 |
Running Time of Feature-Engineering and Feature-Learning Approaches.
| Classifiers | Training Time (s) | Testing Time (ms) | ||||
|---|---|---|---|---|---|---|
| T | FFT | PSD | T | FFT | PSD | |
| SVM | 15.43 | 188.9 | 90.20 | 18.78 | 235.2 | 115.4 |
| ELM | 0.5709 | 1.401 | 0.5674 | 19.93 | 27.20 | 21.72 |
| kNN | 0.01219 | 0.01956 | 0.01423 | 8.298 | 69.40 | 36.63 |
| NB |
| 0.03499 | 0.01702 | 1.225 | 4.144 | 1.779 |
| DT | 0.01089 | 0.1596 | 0.08480 | 0.1409 | 0.4181 | 0.3472 |
| RF | 1.5371 | 22.80 | 11.81 | 16.35 | 10.71 | 9.431 |
| Adaboost | 6.911 | 173.1 | 82.26 | 81.54 | 96.39 | 82.07 |
| CNN | 251.8 | 7.110 | ||||
| LSTM | 746.5 | 13.18 | ||||
| 1DCL | 1295 | 15.22 | ||||
Figure 8Accuracy and testing time of different approaches at varying vibration signal segment length. We take two representative approaches as examples: FFT-SVM and 1DCL.