Literature DB >> 28805689

Visual Servoing for an Autonomous Hexarotor Using a Neural Network Based PID Controller.

Carlos Lopez-Franco^1,2, Javier Gomez-Avila³, Alma Y Alanis⁴, Nancy Arana-Daniel⁵, Carlos Villaseñor⁶.

Abstract

In recent years, unmanned aerial vehicles (UAVs) have gained significant attention. However, we face two major drawbacks when working with UAVs: high nonlinearities and unknown position in 3D space since it is not provided with on-board sensors that can measure its position with respect to a global coordinate system. In this paper, we present a real-time implementation of a servo control, integrating vision sensors, with a neural proportional integral derivative (PID), in order to develop an hexarotor image based visual servo control (IBVS) that knows the position of the robot by using a velocity vector as a reference to control the hexarotor position. This integration requires a tight coordination between control algorithms, models of the system to be controlled, sensors, hardware and software platforms and well-defined interfaces, to allow the real-time implementation, as well as the design of different processing stages with their respective communication architecture. All of these issues and others provoke the idea that real-time implementations can be considered as a difficult task. For the purpose of showing the effectiveness of the sensor integration and control algorithm to address these issues on a high nonlinear system with noisy sensors as cameras, experiments were performed on the Asctec Firefly on-board computer, including both simulation and experimenta results.

Entities: Chemical Disease Gene

Keywords: hexarotor; unmanned aerial vehicle; visual servoing

Year: 2017 PMID： 28805689 PMCID： PMC5579741 DOI： 10.3390/s17081865

Source DB: PubMed Journal: Sensors (Basel) ISSN： 1424-8220 Impact factor: 3.576

1. Introduction

The use of Unmanned Aerial Vehicles (UAVs) has been increased over the last few decades. UAVs have shown satisfactory flight and navigation capabilities, which are very important in applications like surveillance, mapping, search and rescue, etc. The ability to move freely in a 3D space represents a great advantage over ground vehicles, especially when the robot is supposed to travel long distances or move in dangerous environments, like in search and rescue tasks. Commonly, UAVs have four rotors; however, having more than four gives them a higher lifting capacity. The hexarotor has some advantages over the highly popular quadrotor, such as their increased load capacity, higher speed and safety, because the two extra rotors allow the UAV landing even if it loses one of the motors. However, the hexarotor is a highly nonlinear and under actuated system because it has fewer control inputs than degrees of freedom and its Lagrangian dynamics contain feedforward nonlinearities; in other words, there are some acceleration directions that can only be produced by a combination of the actuators. In contrast with ground vehicles, it is not possible to use sensors like encoders to estimate its position. A good alternative is to use visual information as a reference, due to the high amount of information that a camera can provide in contrast with their low power consumption and low weight. Since it is not possible to know the position of a hexarotor with common on-board sensors such as Inertial Measurement Units (IMU), some works use off board sensor systems [1,2,3,4,5]; however, this kind of control limits the application to indoor navigation and adds noise and delays because of the communication between the robot and the ground station. For such reason, visual control of UAVs has been widely performed. Although stereo vision is extensively used in mapping applications [6,7], when used in UAV navigation like in [8,9], it requires 3D reconstruction or optical flow, which are computationally expensive algorithms. In this approach, monocular vision is used, and the feature error position in the image coordinate plane is related with the robot velocity vector that reduces this error [10,11,12,13,14,15]. Consequently, we can set the position of the robot based on the camera information to control its navigation not only in indoor environments [16,17]. Classical Image Based Visual Servo (IBVS) control stabilizes attitude and position separately [18], which is not possible for underactuated systems. In [18], an Image Based approach is used for an underactuated system but approximating the depth distance to the features. In [19], a PID controller is implemented on an hexarotor and comparisons between quaternions and Euler angles are made. In [20], the authors propose a visual servoing algorithm combined with a proportional derivative (PD) controller. However, PID approaches are not effective on highly nonlinear systems with model uncertainties such as the hexarotor [21,22]. According to this, another approach is required. In this paper, we propose the use of a Neural Network based PID. The advantages of using neural networks to control nonlinear systems are that the controller will have the adaptability and learning capabilities of the neural network [23], making the system able to adapt to actuator faults such as loss of effectiveness, which is described in [24] and solves disadvantages of the traditional PID [25] such as uncertainties of the system, communication time-delay, parametric uncertainties, external disturbances, actuator saturations and unmodeled system dynamics, among others. If to all of these issues we add the complexity to integrate servo control algorithms with vision sensors and a neural PID in a real-time implementation, it is required to have a well-designed coordination between all of the elements of this implementation, requiring different processing stages with their respective communication architecture (software and hardware). The rest of the paper is structured as follows: Section 2 describes the robot and its dynamics. In Section 3, the visual servo control approach is introduced. In Section 5, the design of the PID controller and weights adjustment are shown. Section 4 presents the relationship between the error signals from the visual algorithm and the control signals of the hexarotor. Section 6 and Section 7 present the simulation and experimental results of the proposed approach and its comparison with the conventional PID controller. Finally, the conclusions are given in Section 8.

2. Hexarotor Dynamic Modeling

The hexarotor consists of six arms connected symmetrically to the central hub. At the end of each arm, a propeller driven by a brushless Direct Current (DC) motor is attached. Each propeller produces an upward thrust and, since they are located outside the center of gravity, differential thrust is used to rotate the hexarotor. In addition, the rotation of the propellers also produces a torque in the opposite direction of the rotation of the motors; therefore, there must be two groups of rotors spinning in the opposite direction for the purpose of making this reaction torque equal to zero. The pose of an hexarotor is given by its position and its orientation in the three Euler angles roll, pitch and yaw, respectively. For the sake of simplicity, and will be abbreviated and . The transformation from world frame O to body frame (Figure 1) is given by

Figure 1

Structure of hexarotor and coordinate frames.

The dynamic model of the robot expressed in the body frame in Newton–Euler formalism is obtained as in [26]. where I is the inertia matrix; V the lineal speed vector and the body angular speed. The equations of motion for the helicopter of Figure 1 can be written as in [27] where is the position vector, the rotation matrix from the body frame to the world frame, represents the rotation dynamics, represents the skew symmetric matrix, is the rotor speed, the body inertia, the rotor inertia, b is the thrust factor and is the torque applied to the body frame due to the rotors. Since we are dealing with an hexarotor, this torque vector differs from the well-known quadrotor torque vector and, if we are working with a structure like the one in Figure 2, it can be written as where l is the distance from the center of gravity of the robot to the rotor and b is the thrust factor. The full dynamic model is where , , , and represent the system inputs and in the case of the hexarotor are obtained as follows: where d is the drag factor.

Figure 2

Geometry of hexarotor.

3. Visual Servo Control

In this paper, we use an Image Based Visual Servo control approach and the eye-in-hand case. The camera is mounted on the robot and the movement of the hexarotor induces camera motion [28]. The purpose of the vision based control is to minimize the error where is the vector of captured features and in the function of a vector of 2D points coordinates in the image plane, and are the set of known parameters of the camera (e.g., camera intrinsic parameters). Vector contains the desired values. Since the error is defined on the image space and the robot moves in the 3D space, it is necessary to relate changes in the image features with the hexarotor displacement. The image Jacobian [29] (also known as interaction matrix) captures the relation between features and robot velocities as shown where is the variation of the features position, is the interaction matrix and denotes the camera translational () and rotational () velocities. Considering as the control input, we can try to ensure an exponential decrease of the error with where is a positive constant, is the pseudo-inverse of , k is the number of features and the feature error. To calculate the interaction matrix, consider a 3D point with coordinates in the camera frame, the projected point in the image plane x with coordinates is defined as where are the coordinates of the point in the image space expressed in pixel units, are the coordinates of the principal point, is the ratio of pixel dimensions and f the focal length. If we derive (10), we have The relation between a fixed 3D point and the camera spatial velocity is stated as follows: Then, we can write the derivatives of the 3D coordinates as Substituting (13) in (11), we can state the pixel coordinates variation as follows: which can be written with where Z is the actual distance from the vision sensor to the feature, for this reason, most of the IBVS algorithms need to approximate this depth. In our case, we use an RGB-D sensor and this distance is known. To control the six degrees of freedom (DoF), at least three points are necessary [28]. In that particular case, we would have three interaction matrices , , , one for each feature, and the complete interaction matrix is now When using three points, there are some configurations for which is singular and four global minima [30]. More precisely, there are four poses for the camera such that , and these four poses are impossible to differentiate [31]. With this in mind, it is usual to consider more points [28]. On the other hand, only one pose achieves when using four points. Moreover, we can use the pseudo-inverse or the transpose of the interaction matrix indistinctly to solve for in (15) [32,33]. In this paper, four points are used. In addition, our pattern does not move and because of the nature of the hexarotor, we suppose that the pattern will never be rotated since any rotation in roll or pitch will produce a translation. In other words, the hexarotor is an underactuated system, and it means there are some acceleration directions that can be only produced by a combination of the actuators. Most of the time, this is due to a lower number of actuators than degrees of freedom of the system; however, in the hexarotor, there is no actuator that can produce by itself a translational acceleration in the x and y directions. In consequence, it is not possible to have this kind of robot static and tilted at the same time, and, consequently, rotational velocities related to roll and pitch in are 0.

4. Control of Hexarotor

The hexarotor has four control inputs , , which represents the translation in the z-axis, , which represents the roll torque, the pitch, and represents the yaw torque. The visual algorithm act as a proportional controller, where in (9) works as a proportional gain. When combined with the Artificial Neural Network (ANN) based PID, we can adapt not only this proportional gain but also the derivative and the integral gains. Since the system is underactuated, we can use the translational velocities computed by IBVS as input roll and pitch torques, and the error will be reduced. This is shown in Figure 3.

Figure 3

Block diagram of our Image Based Visual Servo (IBVS) control algorithm combined with Artificial Neural Network based PID (PID-ANN). The Artificial Neural Network based PID module consists of four modules (one for each Degree of Freedom). The output of the IBVS block is the velocity vector with angular velocity equal to 0 for roll and pitch since we assume the pattern will never be rotated in those angles. The output of IBVS control block consists of translational velocities , , , which must be mapped to , and , respectively, in order to be added to the PID-ANN output. These control actions will be traduced to translational displacement of the robot.

The velocity mapping block in Figure 3 traduces the velocities vector from IBVS to hexarotor inputs, i.e., roll, thrust, pitch and yaw. In our case, and , since we assume the pattern will never be rotated because it is an underactuated system.

5. Neural Network Based PID

Considering the control loop with unitary feedback as shown in Figure 4.

Figure 4

Control loop for conventional PID in discrete time.

Conventional digital Proportional-Integral-Derivative (PID) with unitary feedback is described in [34] and the control law is given by where is the error calculated as the difference between the reference signal and the system output . The terms , , and are the proportional, integral and derivative gains, respectively. These gains are related as follows: where K is the gain, T is the sample time, is the integration time and the derivative time. Applying the inverse Z-transform to (18), the PID sequence is given by Despite conventional PID being widely used to control these vehicles due to its simplicity and performance, it is not intended to control highly nonlinear systems, such as hexarotors. In order to handle these nonlinearities, an ANN based PID controller is used. The purpose of the ANN is not only to deal with the nonlinear system, but also adjusting the PID controller gains to the end that it can also handle with uncertainties in the model. The topology of the PID-ANN used is shown in Figure 5.

Figure 5

PID-ANN topology. There is one module PID-ANN for every DoF. represents the error in that DoF, is a vector that represents the error, derivative of the error and the integral of the error and is the learning rate, which must be heuristically selected to be small enough to avoid saturation of the neuron and not necessarily the same for every gain (proportional, derivative and integral). The Euclidean norm block normalizes the neuron weights in order to avoid divergence since the neuron weights are the PID controller gains. The output of the neuron is the control input that will be traduced as thrust (), roll (), pitch () or yaw (), depending on the state error at the input of the ANN.

From Figure 5, the vector represents the proportional error, the derivative of the error and the integral of the error. They are defined as follows: Accordingly, the control law of the conventional PID can be rewritten as The Neuron input is defined as where vector represents the weights of the network, which are incremented by with a learning factor . The new value of will be The Euclidean norm will be used to limit the values of as The activation function of the neuron is the hyperbolic tangent; therefore, the output will be where A is a gain factor to escalate the maximum value of the activation function, which is between and b is a scalar to avoid saturation of the neuron. The control law of the ANN based PID is expressed as follows: and there is one PID-ANN module for every to control in (6). has information of the rotor speed combination necessary to achieve a specific rotation, i.e., if the robot needs to move in more than one direction at the same time, there will be more than one with values different to zero.

6. Simulation Results

Simulations are implemented in Matlab (Matlab R2016a, The MathWorks Inc., Natick, MA, USA) using the Robotics Toolbox [35]. For the visual servo algorithm, four points are used. In the first experiment, the robot starts on the ground and has to reach a certain position given by these 2D points. With the intention of proving the algorithms, we simulate uncertainty of the system changing two parameters separately in two simulations. In the first simulation, at the second 10, the mass of the robot has been increased 50%. It can be seen that conventional PID controller is unable to keep the position. The results are shown in Figure 6.

Figure 6

Simulation using conventional PID. At 10 s, the mass of the system is incremented 50% and . (a) Cartesian velocities; (b) control input; (c) features error.

In the second simulation, the mass of the system remains constant, but the moment of inertia is increased from to . Figure 7 shows the results of using conventional PID when the moment of inertia is increased. The results show that the control input is excessively high to the robot (Figure 7b), making the system unable to follow the reference (Figure 7c).

Figure 7

Simulation using conventional PID. Moment of inertia is increased. Mass remains constant. It can be seen that the robot is not stable when changes. (a) Cartesian velocities; (b) control input; (c) features error.

In the following simulations, the PID-ANN is now controlling the system under the same conditions. The mass has been incremented at second 10. It can be seen in Figure 8 that the controller can be adapted to this mass increment and keep the reference.

Figure 8

Simulation using conventional PID-ANN. At 10 s, the mass of the system is incremented 50% and . The system remains at desired position. (a) Cartesian velocities; (b) control input; (c) features error.

Finally, we show, in Figure 9 the results of changing the moment of inertia while mass remains constant. In contrast with conventional PID, the PID-ANN is able to keep its position.

Figure 9

Simulation using ANN based PID. Moment of inertia is increased. Mass remains constant. It can be seen that the robot remains stable at desired position when changes. (a) Cartesian velocities; (b) control input; (c) features error.

7. Experimental Results

The hexarotor used in the experiments is the Asctec Firefly (Ascending Technologies, Krailling, Germany). The actual configuration of the experiment is shown in Figure 10. The vision sensor has been changed, we use the Intel RealSense R200 camera (Intel, Santa Clara, CA, USA) with RGB and infrared depth sensing features and an indoor range from 0.4 m to 2.8 m. This change in the vision sensor will modify robot mass and moment of inertia. This uncertainty can be absorbed by the neural network.

Figure 10

Actual experiment configuration. The corners of the Quick Response (QR) code represent the 3D features.

Vision information is highly noisy and presents a high computational cost even when working with low resolution images (in this case, ). The more time the algorithm uses in image capture and processing phase, the more error will exist between what the robot sees and the actual position. Computer vision algorithms, such as optical flow approaches, requires tracking of a set of n features using some kind of descriptor. Other approaches use stereo vision but that requires a 3D reconstruction. We propose to use only four points to reduce this time. It is important to note that coordination between vision sensors, neural network and model system working at different processing stages and its communication at their respective architectures is crucial to achieve real-time implementation. A QR-code is used, and we track it with a Zbar bar code reader library. This pattern is chosen because of its robustness to rotations and illumination changes. The algorithms are implemented on the onboard computer of the hexarotor. For a first experiment, the moment of inertia and mass of the system changed and a previously tuned conventional PID controller will be compared with the proposed algorithm. Figure 11 shows results when the pattern is fixed at a certain position and the hexarotor is at hover position. As can be seen in Figure 11b, the conventional controller cannot achieve system stabilization at a fixed position when the model changes. Table 1 shows the Root Mean Square Error (RMSE) and the Average Absolute Deviation (AAD) in pixel units. The pair is the location of feature in image coordinates.

Figure 11

Experimental results when mass and moment of inertia changed. The pattern remains at the same position during the test. (a) control input conventional PID; (b) error conventional PID; (c) control input ANN-PID; (d) error ANN-PID.

Table 1

Controllers’ comparison.

	Root Mean Square Error (Pixel Units)
	x1	y1	x2	y2	x3	y3	x4	y4
PID	1741.9	393.1	1712.0	339.1	1677.5	329.2	1723.9	380.1
ANNPID	212.416	66.549	191.1447	57.4298	824.085	59.7654	861.8207	62.6225
	Average Absolute Deviation (Pixel Units)
	x1	y1	x2	y2	x3	y3	x4	y4
PID	55.2177	9.6437	54.0636	10.5457	52.4827	10.704	53.069	9.6577
ANNPID	43.2281	7.4259	42.5686	11.3874	46.6832	11.484	47.5576	7.5503

Comparison Table between conventional Proportional Integral Derivative (PID) controller and the Artificial Neural Network based PID controller (ANNPID).

In Figure 11b, the solid increasing lines represent the x position of the four features in image coordinates (pixel units). As can be seen, if a conventional PID is not correctly tuned for this specific system, its position diverges. On the other hand, when the system is controlled by the PID-ANN, its position does not diverge (Figure 11d) even when the controller has not been previously tuned. Once ANN-PID has demonstrated its effectiveness over the PID controller, the experiment is repeated, but now the QR pattern has movement. As shown in Figure 12, the hexarotor does not lose sight of the objective.

Figure 12

Experimental results at hover position when mass and moment of inertia changed. The pattern is moving during the test. (a) control input ANN-PID; (b) error ANN-PID.

8. Conclusions

In this paper, we propose a Neural Network based PID controller with visual feedback to control an hexarotor. The hexarotor is equipped with an RGB-D sensor that allows for estimating the feature error, this error has been used to compute the camera velocities. The proposed approach is able to deal with delays due to image processing, system uncertainties, noises and changes in the model since the ANN is continuously adapting the PID gains. In contrast with conventional PID controllers, where it is mandatory to tune it according to a specific system, the ANN can deal with nonlinearities and changes in the system.

1 in total

1. Adaptive neural network control for a class of MIMO nonlinear systems with disturbances in discrete-time.

Authors: Shuzhi Sam Ge; Jin Zhang; Tong Heng Lee
Journal: IEEE Trans Syst Man Cybern B Cybern Date: 2004-08

1 in total

3 in total

1. Uncalibrated Visual Servoing for Underwater Vehicle Manipulator Systems with an Eye in Hand Configuration Camera.

Authors: Jiyong Li; Hai Huang; Yang Xu; Han Wu; Lei Wan
Journal: Sensors (Basel) Date: 2019-12-11 Impact factor: 3.576

2. Adaptive neural PD controllers for mobile manipulator trajectory tracking.

Authors: Jesus Hernandez-Barragan; Jorge D Rios; Javier Gomez-Avila; Nancy Arana-Daniel; Carlos Lopez-Franco; Alma Y Alanis
Journal: PeerJ Comput Sci Date: 2021-02-19

3. Modeling and Identification for Vector Propulsion of an Unmanned Surface Vehicle: Three Degrees of Freedom Model and Response Model.

Authors: Dongdong Mu; Guofeng Wang; Yunsheng Fan; Xiaojie Sun; Bingbing Qiu
Journal: Sensors (Basel) Date: 2018-06-08 Impact factor: 3.576

3 in total