| Literature DB >> 27347956 |
Cristina Tîrnăucă1, José L Montaña2, Santiago Ontañón3, Avelino J González4, Luis M Pardo5.
Abstract
Imagine an agent that performs tasks according to different strategies. The goal of Behavioral Recognition (BR) is to identify which of the available strategies is the one being used by the agent, by simply observing the agent's actions and the environmental conditions during a certain period of time. The goal of Behavioral Cloning (BC) is more ambitious. In this last case, the learner must be able to build a model of the behavior of the agent. In both settings, the only assumption is that the learner has access to a training set that contains instances of observed behavioral traces for each available strategy. This paper studies a machine learning approach based on Probabilistic Finite Automata (PFAs), capable of achieving both the recognition and cloning tasks. We evaluate the performance of PFAs in the context of a simulated learning environment (in this case, a virtual Roomba vacuum cleaner robot), and compare it with a collection of other machine learning approaches.Entities:
Keywords: ambient intelligence; behavioral cloning; behavioral recognition; learning from observation; probabilistic finite automaton; virtual agents
Year: 2016 PMID: 27347956 PMCID: PMC4970012 DOI: 10.3390/s16070958
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Training examples for LfO.
| Model 1 | Model 2 | Model 3 | ||||||
|---|---|---|---|---|---|---|---|---|
| 1st | Class | 1st | 2nd | Class | 1st | 2nd | 3rd | Class |
| − | − | − | ||||||
| … | … | … | … | … | … | … | … | … |
Figure 1Training Maps, where obstacles are shown in black. The starting position of the vacuum cleaner is shown in red.
Distance matrix.
| 9000/1500 obs | ||||||
|---|---|---|---|---|---|---|
| 8.4267 | 9.2504 | 7.3178 | 7.8728 | 8.9871 | 9.0486 | |
| 8.9414 | 4.3265 | 3.8846 | 4.6463 | 4.6070 | ||
| 8.4996 | 6.5079 | 4.7726 | 5.8376 | 4.3072 | 4.3743 | |
| 4.1331 | 4.3935 | 4.3924 | ||||
| 9.1459 | 7.1087 | 5.3849 | 5.8159 | 3.9715 | 3.9815 | |
| 9.1683 | 7.1035 | 5.4043 | 5.7853 | |||
| 9.8276 | 8.5274 | 8.7760 | 9.7278 | 9.7654 | ||
| 9.8016 | 6.4832 | 5.8042 | 6.0120 | 6.0783 | ||
| 9.8502 | 8.0886 | 6.8461 | 7.4848 | 6.4688 | 6.5510 | |
| 9.8487 | 5.6960 | 6.3857 | 6.4331 | |||
| 9.8663 | 8.1203 | 7.8902 | 8.4213 | 5.5873 | 5.6445 | |
| 9.8681 | 8.1002 | 7.8278 | 8.3599 | |||
| 11.8912 | 10.5335 | 10.7006 | 11.8687 | 11.8792 | ||
| 11.8415 | 8.8885 | 8.2754 | 8.2794 | 8.3277 | ||
| 11.8359 | 11.4252 | 8.4840 | 9.0659 | 8.1677 | 8.2454 | |
| 11.8261 | 10.7019 | 7.8168 | 7.8871 | |||
| 11.8808 | 11.2667 | 9.2245 | 9.6458 | 6.7169 | 6.8174 | |
| 11.8824 | 11.2563 | 9.1614 | 9.5850 | |||
Confusion matrix.
| 9000/1500 obs | ||||||
| 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||
| 0.12 | 0.10 | 0.00 | 0.00 | 0.01 | ||
| 0.03 | 0.00 | 0.20 | 0.00 | 0.03 | 0.04 | |
| 0.01 | 0.01 | 0.10 | 0.09 | |||
| 0.13 | 0.00 | 0.03 | 0.01 | 0.39 | 0.24 | |
| 0.00 | 0.00 | 0.04 | 0.00 | |||
| 9000/1500 obs | ||||||
| 0.28 | 0.00 | 0.04 | 0.01 | 0.01 | 0.01 | |
| 0.05 | 0.24 | 0.01 | 0.00 | 0.00 | ||
| 0.24 | 0.00 | 0.21 | 0.01 | 0.03 | 0.03 | |
| 0.00 | 0.03 | 0.01 | ||||
| 0.01 | 0.00 | 0.01 | 0.00 | 0.18 | 0.05 | |
| 0.00 | 0.00 | 0.07 | 0.00 | |||
| 9000/1500 obs | ||||||
| 0.27 | 0.00 | 0.03 | 0.01 | 0.01 | 0.01 | |
| 0.05 | 0.01 | 0.00 | 0.00 | 0.00 | ||
| 0.07 | 0.00 | 0.20 | 0.01 | 0.03 | 0.02 | |
| 0.00 | 0.01 | 0.01 | ||||
| 0.01 | 0.00 | 0.02 | 0.00 | 0.16 | 0.03 | |
| 0.00 | 0.00 | 0.10 | 0.00 | |||
Predictive accuracy for the generated maps (higher is better).
| Model 1 | PFA | DT | PNN | KNN | RProp | NB |
|---|---|---|---|---|---|---|
| 0.925 | 0.924 | 0.925 | 0.452 | 0.504 | ||
| 0.275 | 0.270 | 0.253 | ||||
| 0.571 | 0.626 | 0.561 | 0.336 | |||
| 0.388 | 0.392 | 0.394 | 0.381 | 0.381 | ||
| 0.478 | 0.478 | 0.475 | 0.454 | 0.456 | ||
| 0.467 | 0.468 | 0.464 | 0.442 | 0.447 | ||
| Average | 0.529 | 0.528 | 0.527 | 0.427 | 0.396 | |
| 0.288 | 0.079 | 0.124 | 0.448 | 0.078 | ||
| 0.270 | 0.269 | 0.272 | 0.267 | 0.267 | ||
| 0.509 | 0.570 | 0.578 | 0.547 | 0.328 | ||
| 0.391 | 0.419 | 0.411 | 0.392 | 0.370 | ||
| 0.522 | 0.554 | 0.543 | 0.504 | 0.460 | ||
| 0.528 | 0.558 | 0.574 | 0.513 | 0.451 | ||
| Average | 0.418 | 0.407 | 0.419 | 0.445 | 0.326 | |
| 0.079 | 0.080 | 0.119 | 0.080 | 0.071 | ||
| 0.267 | 0.276 | 0.267 | 0.271 | 0.271 | ||
| 0.702 | 0.705 | 0.725 | 0.758 | 0.563 | ||
| 0.923 | 0.909 | 0.929 | 0.930 | 0.929 | ||
| 0.881 | 0.885 | 0.910 | 0.871 | 0.836 | ||
| 0.880 | 0.882 | 0.930 | 0.876 | 0.815 | ||
| Average | 0.657 | 0.621 | 0.647 | 0.632 | 0.581 |
Monte Carlo distance between original and cloned behavior (lower is better).
| Model 1 | PFA | DT | PNN | KNN | RProp | NB |
|---|---|---|---|---|---|---|
| 0.866 | 0.962 | 0.866 | 4.836 | 4.797 | ||
| 5.130 | 5.062 | 5.062 | - | - | ||
| 3.012 | 2.599 | 2.599 | 3.119 | 5.572 | ||
| 3.853 | 3.762 | 3.823 | 3.343 | 3.767 | ||
| 3.497 | 3.608 | 3.497 | 3.316 | 4.324 | ||
| 3.506 | 3.591 | 3.506 | 4.354 | 4.354 | ||
| Average | 3.242 | 3.264 | 3.221 | 3.794 | 4.563 | |
| 6.165 | 7.072 | 7.274 | 6.718 | 6.159 | ||
| 6.886 | 6.958 | 7.189 | 7.538 | 7.575 | ||
| 5.538 | 6.460 | 5.408 | 5.760 | - | ||
| 5.157 | 5.602 | 5.757 | 4.912 | - | ||
| 4.818 | 4.501 | 4.336 | - | 7.663 | ||
| 4.502 | 4.727 | 4.544 | 4.524 | 7.683 | ||
| Average | 5.711 | 5.745 | 5.718 | 5.590 | 7.239 | |
| 6.961 | 8.082 | 7.953 | 7.524 | 6.955 | ||
| 7.894 | 7.941 | 7.983 | 7.978 | 8.348 | ||
| 6.000 | 6.924 | 6.685 | 5.613 | 5.613 | ||
| 5.470 | 6.324 | 5.325 | 5.708 | - | ||
| 5.914 | 5.324 | 4.903 | 5.871 | 8.460 | ||
| 5.845 | 5.416 | 4.934 | 5.234 | 8.459 | ||
| Average | 6.188 | 6.647 | 6.226 | 6.227 | 7.542 |
Empirical distribution of the winning tool (smallest Monte Carlo distance).
| Model 1 | PFA | DT | PNN | KNN | RProp | NB |
|---|---|---|---|---|---|---|
| 22.50 | 21.50 | 22.50 | 1.00 | 1.00 | ||
| 1.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||
| 10.30 | 10.30 | 10.30 | 24.30 | 1.50 | ||
| 3.00 | 5.00 | 4.33 | 21.00 | 3.33 | ||
| 0.92 | 0.92 | 0.92 | 0.92 | 1.17 | ||
| 1.50 | 1.50 | 1.50 | 1.17 | 1.17 | ||
| Average | 6.54 | 6.54 | 6.59 | 8.06 | 1.36 | |
| 29.95 | 21.25 | 4.70 | 5.12 | 7.95 | ||
| 16.50 | 16.00 | 4.50 | 2.17 | 2.17 | ||
| 9.50 | 22.00 | 22.33 | 8.83 | 0.17 | ||
| 24.00 | 7.50 | 4.50 | 33.00 | 0.00 | ||
| 26.50 | 17.33 | 22.33 | 1.67 | 1.67 | ||
| 15.92 | 8.92 | 17.08 | 22.08 | 1.67 | ||
| Average | 21.69 | 12.80 | 10.39 | 17.34 | 6.12 | |
| 16.83 | 8.28 | 5.45 | 8.37 | 29.95 | ||
| 15.50 | 10.00 | 5.50 | 15.00 | 5.00 | ||
| 13.00 | 12.33 | 10.33 | 7.17 | 7.17 | ||
| 11.67 | 10.67 | 20.67 | 19.83 | 0.00 | ||
| 7.42 | 11.42 | 28.42 | 11.42 | 1.67 | ||
| 8.67 | 9.17 | 22.17 | 10.67 | 1.67 | ||
| Average | 20.15 | 10.31 | 15.42 | 12.08 | 7.58 |