| Literature DB >> 34041274 |
Franz A Van-Horenbeke1, Angelika Peer1.
Abstract
Recognizing the actions, plans, and goals of a person in an unconstrained environment is a key feature that future robotic systems will need in order to achieve a natural human-machine interaction. Indeed, we humans are constantly understanding and predicting the actions and goals of others, which allows us to interact in intuitive and safe ways. While action and plan recognition are tasks that humans perform naturally and with little effort, they are still an unresolved problem from the point of view of artificial intelligence. The immense variety of possible actions and plans that may be encountered in an unconstrained environment makes current approaches be far from human-like performance. In addition, while very different types of algorithms have been proposed to tackle the problem of activity, plan, and goal (intention) recognition, these tend to focus in only one part of the problem (e.g., action recognition), and techniques that address the problem as a whole have been not so thoroughly explored. This review is meant to provide a general view of the problem of activity, plan, and goal recognition as a whole. It presents a description of the problem, both from the human perspective and from the computational perspective, and proposes a classification of the main types of approaches that have been proposed to address it (logic-based, classical machine learning, deep learning, and brain-inspired), together with a description and comparison of the classes. This general view of the problem can help on the identification of research gaps, and may also provide inspiration for the development of new approaches that address the problem in a unified way.Entities:
Keywords: action recognition; activity recognition; behavior recognition; goal recognition; human-computer interaction; human-machine interaction; human-robot interaction; plan recognition
Year: 2021 PMID: 34041274 PMCID: PMC8141730 DOI: 10.3389/frobt.2021.643010
Source DB: PubMed Journal: Front Robot AI ISSN: 2296-9144
Figure 1Illustration of the concepts of action, plan, and goal. A same goal may be achieved through different plans. These plans are often hierarchical structures whose elements can be seen as actions from the point of view of the higher-level elements, and as goals from the point of view of the lower-level elements.
Figure 2Examples of scenarios of different characteristics, (A) is a grid navigation scenario (agnostic, no intervention, fully observable, deterministic, discrete, single agent), (B) is a poker game (adversarial, direct communication, partially observable, stochastic, discrete, multiagent), (C) is a platform video game (agnostic, no intervention, partially observable, deterministic, continuous, single agent), (D) is a human-robot collaboration scenario (intended, online intervention, partially observable, stochastic, continuous, single agent).
Classification of the original studies presented along the review according to their approach (logic-based, classical machine learning, deep learning, or brain-inspired), objective (activity/action, plan, or goal recognition), observed agent involvement (agnostic, adversarial, or intended), observer intervention (no intervention, offline intervention, online intervention, or direct communication), environment (fully or partially observable, deterministic or stochastic, and discrete or continuous), recognition time (offline or online), knowledge of possible classes, i.e., activities/plans/goals (known or unknown), number of agents (single or multiple), and application.
| Shrager and Finin ( | Goal | Agnostic | No | Fully obs. | Det. | Disc. | Online | Known | Single | Software help |
| Avrahami-Zilberbrand and Kaminka ( | Plan | Adversarial | No | Part. obs. | Stoch. | Cont. | Online | Unknown | Single | Surveillance |
| Perrault and Allen ( | Goal | Intended | No | Fully obs. | Det. | Disc. | Offline | Known | Single | Language und. |
| Keren et al. ( | Goal | Agnostic | Offline | Fully obs. | Det. | Disc. | Online | Known | Single | Destin. guess |
| Shvo and McIlraith ( | Goal | Agnostic | Online | Part. obs. | Stoch. | Cont. | Online | Known | Single | General |
| Mirsky et al. ( | Plan | Intended | Direct | Fully obs. | Det. | Disc. | Online | Known | Single | Software help |
| Keren et al. ( | Goal | Agnostic | Offline | Part. obs. | Det. | Disc. | Online | Known | Single | Destin. guess |
| Wayllace et al. ( | Goal | Agnostic | Offline | Fully obs. | Stoch. | Disc. | Online | Known | Single | Destin. guess |
| Kaminka et al. ( | Plan | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | General |
| Vered et al. ( | Goal | Agnostic | No | Fully obs. | Det. | Cont. | Online | Known | Single | Destin. guess |
| Zhuo ( | Plan | Agnostic | No | Part. obs. | Det. | Disc. | Offline | Known | Multiple | Theoretic |
| Genter et al. ( | Goal | Agnostic | No | Fully obs. | Det. | Disc. | Online | Known | Multiple | Video games |
| Skocir et al. ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Smart homes |
| Ha et al. ( | Goal | Agnostic | No | Fully obs. | Det. | Cont. | Online | Known | Single | Video games |
| Bouchard et al. ( | Plan | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Unknown | Single | Elderly care |
| (Sengupta et al., | Plan | Intended | Direct | Fully obs. | Det. | Disc. | Online | Unknown | Single | Decision support |
| Kautz and Allen ( | Plan | Agnostic | No | Fully obs. | Det. | Disc. | Offline | Known | Single | Theoretic |
| Zhuo ( | Plan | Agnostic | No | Fully obs. | Det. | Cont. | Offline | Known | Single | Experimental |
| Schank and Abelson ( | Plan | Intended | No | Fully obs. | Det. | Disc. | Offline | Known | Single | Language und. |
| Sohrabi et al. ( | Plan | Agnostic | No | Part. obs. | Det. | Disc. | Offline | Known | Single | Theoretic |
| Jarvis et al. ( | Plan | Adversarial | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Surveillance |
| Pereira and Han ( | Goal | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Elderly care |
| Raghavan et al. ( | Plan | Agnostic | No | Fully obs. | Det. | Disc. | Offline | Known | Single | Various |
| Quaresma and Lopes ( | Plan | Agnostic | No | Fully obs. | Det. | Disc. | Offline | Known | Double | Language und. |
| Vered and Kaminka ( | Goal | Agnostic | No | Part. obs. | Det. | Cont. | Online | Known | Single | Destin. guess |
| Myers ( | Plan | Intended | No | Part. obs. | Det. | Disc. | Offline | Known | Single | Planning help |
| Geib and Goldman ( | Plan | Agnostic | No | Part. obs. | Det. | Disc. | Online | Known | Single | Theoretic |
| Ramirez and Geffner ( | Plan | Agnostic | No | Fully obs. | Det. | Disc. | Online | Known | Single | Destin. guess |
| Pereira et al. ( | Goal | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | General |
| Granada et al. ( | Action/Plan | Agnostic | No | Part. obs. | Stoch. | Cont. | Offline | Known | Single | Cooking |
| Amado et al. ( | Action/Goal | Agnostic | No | Part. obs. | Det. | Disc. | Online | Known | Single | Puzzle solving |
| Akkaladevi and Heindl ( | Action | Intended | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Human-robot int. |
| Han and Pereira ( | Goal | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Elderly care |
| Kelley et al. ( | Action/Goal | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Multiple | Human-robot int. |
| Saria and Mahadevan ( | Action/Plan | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Multiple | Experimental |
| Laviers et al. ( | Goal | Adversarial | No | Fully obs. | Stoch. | Cont. | Online | Known | Multiple | Video games |
| Oh et al. ( | Plan | Intended | No | Fully obs. | Det. | Disc. | Online | Known | Single | Assist. agents |
| Horvitz et al. ( | Goal | Agnostic | No | Fully obs. | Det. | Disc. | Online | Known | Single | Software help |
| Rebelo et al. ( | Activity | Intended | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Orthotics |
| Liao et al. ( | Action/Goal | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Unknown | Single | Destin. guess |
| Oliver et al. ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Office awar. |
| Bui et al. ( | Action/Plan | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Surveillance |
| Zhao et al. ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Offline | Known | Single | Experimental |
| Liao et al. ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Offline | Known | Single | Daily life |
| Baker and Tenenbaum ( | Plan | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Experimental |
| Samanta and Chanda ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Video tagging |
| Fan et al. ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Offline | Known | Single | Daily life |
| Vahdatpour et al. ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Unknown | Single | Elderly care |
| Polyvyanyy et al. ( | Goal | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Unknown | Single | General |
| Meng and Huang ( | Goal | Intended | No | Fully obs. | Det. | Disc. | Offline | Known | Single | Language und. |
| Rahmat et al. ( | Goal | Adversarial | No | Fully obs. | Det. | Disc. | Online | Unknown | Single | Net. security |
| Hammerla et al. ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Daily life |
| Bevilacqua et al. ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Experimental |
| Ronao and Cho ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Daily life |
| Amado et al. ( | Goal | Agnostic | No | Part. obs. | Det. | Disc. | Offline | Known | Single | Experimental |
| Ordóñez and Roggen ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Daily life |
| Zhang et al. ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Daily life |
| Min et al. ( | Goal | Agnostic | No | Fully obs. | Det. | Cont. | Online | Known | Single | Video games |
| Sheng and Huber ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Daily life |
| Dwivedi et al. ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Video tagging |
| Xian et al. ( | Activity | Agnostic | No | Part. obs. | Stoch. | Cont. | Offline | Known | Multiple | Video tagging |
| Kuniyoshi et al. ( | Action | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Unknown | Single | Human-robot int. |
| Bonaiuto et al. ( | Action | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Experimental |
| Oztop et al. ( | Action/Goal | Adversarial | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Experimental |
| Haruno et al. ( | Action | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Unknown | Single | Human-robot int. |
| Demiris and Khadhouri ( | Action/Goal | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Human-robot int. |
| Yamashita and Tani ( | Action | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Human-robot int. |
| Alkurdi et al. ( | Action | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Daily life |
| Oltramari and Lebiere ( | Action | Agnostic | No | Part. obs. | Stoch. | Cont. | Online | Known | Single | Surveillance |
Summary comparison among the types of approaches in terms of their ability to represent structured data, their expressivity, their capacity to deal with uncertainty, their flexibility to adapt to different data, their robustness, their competency at handling sensor data, the human effort required to develop the system, their scalability, their aptness to deal with open environments, and the maturity of the technology.
| Structure | + + | − | + | + |
| Expressivity | + + | o | − | o |
| Uncertainty | o | + + | + + | + + |
| Flexibility | −− | + | + + | + |
| Robustness | − | + | + + | + |
| Sensory input | −− | o | + + | + |
| Human effort | −− | − | − | o |
| Scalability | − | − | + | o |
| Open environment | − | − | o | + |
| Maturity | + + | + + | + | − |
A score is presented (− −, −, o, +, + +) for each type of approach and criterion, with the scores ranging from − − (very bad at it) to + + (very good at it).