Literature DB >> 33500912

Exploiting Three-Dimensional Gaze Tracking for Action Recognition During Bimanual Manipulation to Enhance Human-Robot Collaboration.

Alireza Haji Fathaliyan1, Xiaoyu Wang1, Veronica J Santos1.   

Abstract

Human-robot collaboration could be advanced by facilitating the intuitive, gaze-based control of robots, and enabling robots to recognize human actions, infer human intent, and plan actions that support human goals. Traditionally, gaze tracking approaches to action recognition have relied upon computer vision-based analyses of two-dimensional egocentric camera videos. The objective of this study was to identify useful features that can be extracted from three-dimensional (3D) gaze behavior and used as inputs to machine learning algorithms for human action recognition. We investigated human gaze behavior and gaze-object interactions in 3D during the performance of a bimanual, instrumental activity of daily living: the preparation of a powdered drink. A marker-based motion capture system and binocular eye tracker were used to reconstruct 3D gaze vectors and their intersection with 3D point clouds of objects being manipulated. Statistical analyses of gaze fixation duration and saccade size suggested that some actions (pouring and stirring) may require more visual attention than other actions (reach, pick up, set down, and move). 3D gaze saliency maps, generated with high spatial resolution for six subtasks, appeared to encode action-relevant information. The "gaze object sequence" was used to capture information about the identity of objects in concert with the temporal sequence in which the objects were visually regarded. Dynamic time warping barycentric averaging was used to create a population-based set of characteristic gaze object sequences that accounted for intra- and inter-subject variability. The gaze object sequence was used to demonstrate the feasibility of a simple action recognition algorithm that utilized a dynamic time warping Euclidean distance metric. Averaged over the six subtasks, the action recognition algorithm yielded an accuracy of 96.4%, precision of 89.5%, and recall of 89.2%. This level of performance suggests that the gaze object sequence is a promising feature for action recognition whose impact could be enhanced through the use of sophisticated machine learning classifiers and algorithmic improvements for real-time implementation. Robots capable of robust, real-time recognition of human actions during manipulation tasks could be used to improve quality of life in the home and quality of work in industrial environments.
Copyright © 2018 Haji Fathaliyan, Wang and Santos.

Entities:  

Keywords:  action recognition; bimanual manipulation; eye tracking; gaze fixation; gaze object sequence; gaze saliency map; human–robot collaboration; instrumental activity of daily living

Year:  2018        PMID: 33500912      PMCID: PMC7805858          DOI: 10.3389/frobt.2018.00025

Source DB:  PubMed          Journal:  Front Robot AI        ISSN: 2296-9144


  14 in total

Review 1.  In what ways do eye movements contribute to everyday activities?

Authors:  M F Land; M Hayhoe
Journal:  Vision Res       Date:  2001       Impact factor: 1.886

2.  Slowing down to stay out of trouble in the operating room: remaining attentive in automaticity.

Authors:  Carol-anne Moulton; Glenn Regehr; Lorelei Lingard; Catherine Merritt; Helen MacRae
Journal:  Acad Med       Date:  2010-10       Impact factor: 6.893

Review 3.  Eye movements in natural behavior.

Authors:  Mary Hayhoe; Dana Ballard
Journal:  Trends Cogn Sci       Date:  2005-04       Impact factor: 20.229

4.  An adaptive algorithm for fixation, saccade, and glissade detection in eyetracking data.

Authors:  Marcus Nyström; Kenneth Holmqvist
Journal:  Behav Res Methods       Date:  2010-02

5.  3-D-Gaze-Based Robotic Grasping Through Mimicking Human Visuomotor Function for People With Motion Impairments.

Authors:  Songpo Li; Xiaoli Zhang; Jeremy D Webb
Journal:  IEEE Trans Biomed Eng       Date:  2017-03-03       Impact factor: 4.538

6.  Expert surgeon's quiet eye and slowing down: expertise differences in performance and quiet eye duration during identification and dissection of the recurrent laryngeal nerve.

Authors:  Adrian Harvey; Joan N Vickers; Ryan Snelgrove; Matthew F Scott; Sheila Morrison
Journal:  Am J Surg       Date:  2013-10-02       Impact factor: 2.565

7.  Hand movements: a window into haptic object recognition.

Authors:  S J Lederman; R L Klatzky
Journal:  Cogn Psychol       Date:  1987-07       Impact factor: 3.468

8.  Saccade size in reading depends upon character spaces and not visual angle.

Authors:  R E Morrison; K Rayner
Journal:  Percept Psychophys       Date:  1981-10

9.  RECOGNIZING BEHAVIOR IN HAND-EYE COORDINATION PATTERNS.

Authors:  Weilie Yi; Dana Ballard
Journal:  Int J HR       Date:  2009       Impact factor: 1.616

Review 10.  Recognition of Activities of Daily Living with Egocentric Vision: A Review.

Authors:  Thi-Hoa-Cuc Nguyen; Jean-Christophe Nebel; Francisco Florez-Revuelta
Journal:  Sensors (Basel)       Date:  2016-01-07       Impact factor: 3.576

View more
  3 in total

1.  Human-Inspired Robotic Eye-Hand Coordination Enables New Communication Channels Between Humans and Robots.

Authors:  Stephanie Olson; Moaed Abd; Erik D Engeberg
Journal:  Int J Soc Robot       Date:  2020-09-18       Impact factor: 3.802

2.  Gaze-Based Intention Estimation for Shared Autonomy in Pick-and-Place Tasks.

Authors:  Stefan Fuchs; Anna Belardinelli
Journal:  Front Neurorobot       Date:  2021-04-16       Impact factor: 2.650

3.  Target position and avoidance margin effects on path planning in obstacle avoidance.

Authors:  Mohammad R Saeedpour-Parizi; Shirin E Hassan; Ariful Azad; Kelly J Baute; Tayebeh Baniasadi; John B Shea
Journal:  Sci Rep       Date:  2021-07-27       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.