| Literature DB >> 35719206 |
Peter Q Lee1, Vidyasagar Rajendran1, Katja Mombaur1.
Abstract
Buzzwire tasks are often used as benchmarks and as training environments for fine motor skills and high precision path following. These tasks require moving a wire loop along an arbitrarily shaped wire obstacle in a collision-free manner. While there have been some demonstrations of buzzwire tasks with robotic manipulators using reinforcement learning and admittance control, there does not seem to be any examples with humanoid robots. In this work, we consider the scenario where we control one arm of the REEM-C humanoid robot, with other joints fixed, as groundwork for eventual full-body control. In pursuit of this, we contribute by designing an optimal control problem that generates trajectories to solve the buzzwire in a time optimized manner. This is composed of task-space constraints to prevent collisions with the buzzwire obstacle, the physical limits of the robot, and an objective function to trade-off reducing time and increasing margins from collision. The formulation can be applied to a very general set of wire shapes and the objective and task constraints can be adapted to other hardware configurations. We evaluate this formulation using the arm of a REEM-C humanoid robot and provide an analysis of how the generated trajectories perform both in simulation and on hardware.Entities:
Keywords: buzzwire; hardware; humanoid; optimal control; robot; trajectory optimization
Year: 2022 PMID: 35719206 PMCID: PMC9203844 DOI: 10.3389/frobt.2022.898890
Source DB: PubMed Journal: Front Robot AI ISSN: 2296-9144
FIGURE 1REEM-C grasping the loop end-effector and moving it along the buzzwire obstacle with an illustration of the end-effector intersecting the obstacle. The loop plane circle is in gray, with the centre, κ(q), and normal, κ′(q), in green. The reference point, at position ϵ(β), and the tangent, ϵ′(β) (in red) move along the wire. In the top right corner, the left arm of the REEM-C is pictured with joint axes labelled J 1 through J 7.
FIGURE 2(A) Frames from the optimized trajectory on Obstacle-A executed by the REEM-C arm in the Gazebo simulator (top row) and the same trajectory executed on the full REEM-C humanoid hardware from a front view (middle row) and top-down view (bottom row). (B) Frames from the optimized trajectory on Obstacle-B executed by the REEM-C arm in the Gazebo simulator (top row) and the same trajectory executed on the full REEM-C humanoid hardware from a front view (middle row) and top-down view (bottom row). (C) Frames from the optimized trajectory on Obstacle-C executed by the REEM-C arm in the Gazebo simulator (top row) and the same trajectory executed on the full REEM-C humanoid in the Gazebo simulator from a top view showing the sine shape on the top portion of the wireframe (bottom row).
Summary of optimized trajectories over 50 different initializations. The median time (t ) and 95% translation threshold (γ*) ± their interquartile range are reported among the instances that converged. The p-values come from testing the change of t and γ* between (α = 0, ν = 0) to (α = 30, ν = 1) and (α = 30, ν = 1) to (α = 150, ν = 5), using Mann-Whitney U tests, assuming a threshold of significance of 0.05. The p-values were adjusted with Bonferonni corrections for the multiple tests done on γ* and t , respectively.
| Obstacle |
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| A | 0 | 0 | 0.036 ± 0.001 | — | 2.958 ± 0.079 | — |
| A | 30 | 1 | 0.040 ± 0.002 | 4 × 10−9 | 3.045 ± 0.169 | 4 × 10−2 |
| A | 150 | 5 | 0.045 ± 0.001 | 7 × 10−8 | 3.551 ± 0.013 | 7 × 10−8 |
| B | 0 | 0 | 0.034 ± 0.001 | — | 3.920 ± 0.021 | — |
| B | 30 | 1 | 0.038 ± 0.001 | 3 × 10−8 | 3.979 ± 0.047 | 1 × 10−5 |
| B | 150 | 5 | 0.039 ± 0.003 | 1 × 10−2 | 4.586 ± 0.048 | 7 × 10−9 |
| C | 0 | 0 | 0.027 ± 0.002 | — | 5.964 ± 0.063 | — |
| C | 30 | 1 | 0.031 ± 0.002 | 3 × 10−4 | 6.293 ± 0.303 | 1 × 10−5 |
| C | 150 | 5 | 0.034 ± 0.001 | 1 × 10−6 | 6.966 ± 0.487 | 9, ×, 10−5 |
FIGURE 3(A) Euclidean distance between the end-effector centre and the target obstacle position throughout the simulated (red) and hardware trials (blue) for obstacles A, B and C. (B) Difference in orientation between the obstacle tangent and the end-effector normal for obstacles A, B and C (simulations in green and experiments in magenta). The dashed grey lines indicate the time when the end-effector is at the sharp bend of the obstacle.