Aws Abdulsalam Najm1, Ibraheem Kasim Ibraheem1, Ahmad Taher Azar2,3, Amjad J Humaidi4. 1. Department of electrical engineering, College of engineering, University of Baghdad, Baghdad 10001, Iraq. 2. Robotics and Internet-of-Things Lab (RIOTU), Prince Sultan University, Riyadh 11586, Saudi Arabia. 3. Faculty of Computers and Artificial Intelligence, Benha University, Benha 13518, Egypt. 4. Department of Control and Systems Engineering, University of Technology, Baghdad 10001, Iraq.
Abstract
A consensus control law is proposed for a multi-agent system of quadrotors with leader-follower communication topology for three quadrotor agents. The genetic algorithm (GA) is the proposed optimization technique to tune the consensus control parameters. The complete nonlinear model is used without any further simplifications in the simulations, while simplification in the model is used to theoretically design the controller. Different case studies and tests are done (i.e., trajectory tracking formation and switching topology) to show the effectiveness of the proposed controller. The results show good performance in all tests while achieving the consensus of the desired formations.
A consensus control law is proposed for a multi-agent system of quadrotors with leader-follower communication topology for three quadrotor agents. The genetic algorithm (GA) is the proposed optimization technique to tune the consensus control parameters. The complete nonlinear model is used without any further simplifications in the simulations, while simplification in the model is used to theoretically design the controller. Different case studies and tests are done (i.e., trajectory tracking formation and switching topology) to show the effectiveness of the proposed controller. The results show good performance in all tests while achieving the consensus of the desired formations.
Entities:
Keywords:
active disturbance rejection control; hybrid control system; nonlinear PID; quadrotor system; unmanned aerial vehicle
In the last decade, unmanned aircrafts (UAs) have attracted researchers in different technical and scientific communities. UAs have a wide range of applications in various fields, such as agriculture, healthcare, and entertainment. Due to the great variety of UA types, UAs then can be classified by their application, range, altitude, endurance, vehicle type, and size. Many UA controlling missions are difficult, and some are impossible to accomplish using only a single agent UA, but instead a cooperative control must be designed for multiple agents to accomplish these missions. Interest in cooperative control of multi-agent systems (MASs) has recently increased, because of its broad applications, such as fire detection [1]. A literature survey [2] summarized some of the potential applications of the multi-agent system, such as large object transportation, which was done in the GRASP laboratory at the University of Pennsylvania, surveillance and searching of objects, quadrotor localization in outdoor environment, which was done at the Czech Technical University, and object transportation missions [3]. There are many cooperation control problems, such as consensus control, formation control, flocking control, and so on. Another important research field is optimization, which has attracted many researchers due to the advantages provided by optimization, especially with complex and nonlinear systems. There has been much research in this field with different optimization techniques, such as genetic algorithm (GA) optimization [4,5], particle swarm optimization (PSO) [6], ant colony optimization (ACO) [7], as well as hybridization of two or more techniques [8,9].
1.1. Multi-Agent Systems
Generally, MASs are systems composed of multiple cooperating agents. Two significant abilities exist in agents that are controlled by computers (computerized agents). Firstly, they have the ability to do an independent task (i.e., deciding what to do to fulfill their task goals). Secondly, they have the ability to communicate with each other (i.e., they not only swap data, but they also participate with their peers by negotiation, cooperation, coordination, and so on) [10]. As mentioned earlier, there are many cooperative control problems. In this paper, light will be shed on the most common cooperating control problems, i.e., consensus control and formation control.
1.1.1. Consensus Control
Consensus problem is closely related to formation control (forming a specific geometrical shape, which is discussed next), flocking (a behavior presented when a group of birds are foraging or in-flight), and distributed estimation [11]; thus, it is an important and essential problem in the field of cooperative control. Consensus means reaching an agreement between multiple agents to achieve a common final state by communicating with each other through a communication network [12]. Figure 1 shows the consensus problem for four agents in their elevations only, where the four agents on the left side have different elevations above the ground (i.e., , , , and ) before applying the consensus control algorithm, while the agents on the right hand side present the agents after applying the consensus controller and reaching consensus in the elevation (i.e., ).
Figure 1
Consensus problem.
1.1.2. Formation Control
The formation control problem requires that all the agents reach a specific geometrical formation; thus, formation control differs from consensus by adding a distance to achieve the geometrical shape by all the agents; therefore, the formation control can be varied into consensus control by making the additional distance equal to zero. There are two cases for the formation control problem according to information known to agents. The first case is where the desired formation and its location are known for the team agents (i.e., centralized case). The second case is where the desired formation is known for the agents, while its location must be achieved by the team agents after negotiation [13]. Figure 2 shows five agents before and after applying formation control. In the initial positions, the agents are randomly distributed in two dimensions, and the heights of the agents are not taken into consideration. A formation controller is applied to achieve a triangle geometry shape, which is presented in the final formation.
Figure 2
Before and after applying formation control on 5 agents.
Consensus and formation control algorithms can be categorized into leaderless and leader–follower depending on how the agents are connected in the communication network represented using graph theory [14].
1.1.3. Optimization Techniques
Finding the optimum performance index by tuning the controller parameters without any constraint is the purpose of the optimization techniques in this paper; therefore, the optimization technique used must be unconstrained. There are several methods to solve an unconstrained minimization problem, which can be classified as nongradient methods (e.g., random search method, grid search method, simplex method, etc.) and gradient methods (e.g., Fletcher–Reeves method, Newton’s method, Marquardt method, etc.).For simple problems that involve a small number of variables, the nongradient methods are efficient, while the gradient methods require the first and sometimes the second derivative of the objective function (performance index). Both methods are not efficient in the case of this work due to the number of variables available in the tuning process and the complexity gained by getting the first or second derivative of the performance index used. Therefore, the modern optimization techniques are the best choice for the tuning and optimization problem of the proposed controller.There is a wide range of modern methods available, such as genetic algorithms, simulated annealing, PSO, ACO, etc., which can be used in the tuning process. Due to the ability of the GA to avoid local optimal solutions, it was chosen as the optimization method to tune the parameters of the proposed controller.
1.1.4. Genetic Algorithm
The genetic algorithm (GA) is categorized as an evolutionary algorithm that is suitable in searching for a global solution. Inspiration in GA development comes from natural genetics working principles. Three main operators are used at each step to produce the next generation from the current one. These three operators are selection, crossover, and mutation, which are briefly explained in the following [4]:Selection: The selection operation forms a mating pool for the next generation by selecting genes (parents) from the existing generation.Crossover: The crossover process combines two of the parents to form the next generation (children).Mutation: The mutation forms new children by applying random changes to the parents.A concise diagram for the GA steps is shown in Figure 3. In this paper, the MATLAB function ga was used in the optimization and tuning processes.
Figure 3
Steps diagram of the genetic algorithm (GA).
1.2. Literature Survey
There is a large variation of research on cooperative control techniques for MASs. Getting an agreement between the agents (i.e., consensus) was the aim of some of the researchers, while others intended to obtain a specific formation for the MAS under some constraints (i.e., communication topology, leader/leaderless, …, etc.). This depends on the application where the control technique is used. Some of the recent research, such as authors in [15], proposed a distributed adaptive controller for a class of an under-actuated Lagrangian system to track a dynamic leader and control the actuated variables, while being subjected to parameter uncertainties and external disturbance with or without actuator faults. In [16], the authors studied the distributed consensus problem for a team of quadrotors with fixed and switching topologies after transforming the quadrotor model to a simplified one using feedback linearization technique. Furthermore, others in [17] presented a two-layer hierarchical model predictive control (MPC) to stabilize a quadrotors team subjected to some constraints. Authors in [18] designed an adaptive consensus controller based on multiple surface control (MSC) to achieve a specific leader–follower formation for a team of quadrotors, taking into consideration the uncertainties generated by the ground effects. The results presented showed good performance with good adaptation of the slack variables and the disturbance of landing or taking off. Five different controllers were developed in [19]. Two of them (i.e., the nonlinear suboptimal control technique and the integral backstepping (IBS) controller) were based on Lyapunov theory and were proposed for trajectory tracking and leader–follower formation control. The others were proportional derivative square (), which was compared with control by applying them on the attitude system and the linear quadratic regulator (LQR) with the optimal iterative LQR, which were applied and compared for attitude, trajectory tracking, and formation control. Researchers in [20] used a simplified quadrotor model to develop a novel distributed consensus algorithm, while one of the uniform constant delays and asynchronous time-varying delays existed as a communication delay. A backstepping based on proportional integral derivative (PID) control was designed in [21] to solve the formation problem for a team of quadrotors in a decentralized manner. The system stability and retention of the desired geometric formation with or without the existence of the obstacles were presented in the simulations. Authors in [22] proposed a novel consensus leader–follower formation control based on a non-smooth backstepping design. The different numerical simulations verified the effectiveness of the proposed controller, and theoretical analysis showed that all the quadrotors converged to the desired formation.
1.3. Motivation and Contributions
In the literature survey mentioned previously, a simplified model for the UA system was used in most of the research. This issue and others motivated us to simulate the complete UA quadrotor system model by using a proposed consensus controller based on optimization techniques. Moreover, the leader–follower topology was used in some case studies, where the leader agent was designed as a complete quadrotor system controlled by the Nonlinear PID (NLPID) and Improved Active Disturbance Rejection Control (IADRC) scheme proposed in our previous work, with the followers controlled by the controller scheme proposed in this work.The main contributions of this paper can be summarized as follows:Proposing a new consensus controlling scheme based on optimization techniques for a MAS of 6-degree of freedom (DOF) quadrotors.Simulating a leader–follower topology in some cases by adding additional terms to control the geometry of the agents, and if these terms are equalized to zero, the agents goes to the consensus case.
1.4. Notations
In this paper, vectors and matrices are denoted by bold symbols (e.g., or ). Dot notation is used for time derivative. In this notation a dot is placed above the function name to denote the time derivative of a function (e.g., the first and second derivatives of with respect to time are denoted by , ). The integration in this paper is denoted by the symbol (e.g., the integration of the function with respect to time between a lower limit and upper limit is denoted by ). The sign is used in this paper as a Kronecker product operation.
1.5. Paper Organization
The rest of the paper is organized as follows: Some preliminaries on the quadrotor mathematical model, graph theory, mathematical representation of consensus control, and finally the problem statement of this paper are given in Section 2; thereafter, the proposed consensus control scheme is demonstrated in Section 3. Next, some simulations and tests are performed and discussed in Section 4. Finally, Section 5 includes the paper conclusions and some of the future aspects.
2. Preliminaries and Problem Statement
2.1. Quadrotor Mathematical Model
As mentioned in our previous works, the mathematical model of the quadrotor system is derived in Appendix A based on Newton–Euler equation and can be represented in Equation (1), while Newton’s law is used to obtain Equation (2), which is helpful to design a controller for the under-actuated quadrotor system using the acceleration in 3 dimensions [23,24]; all the parameters used are described in Table 1.
Table 1
Parameter descriptions.
Parameters
Description
Units
[xyz]
Linear position vector
m
[ϕθψ]
Angular position vector
Rad
[uvw]
Linear velocity vector
m/s
[pqr]
Angular velocity vector
Rad/s
[IxxIyyIzz]
Moment of inertia vector
kg·m2
ft
Total thrust generated by rotors
N
[τxτyτz]
Control torques
N m
[fwxfwyfwz]
Wind force vector
N
[τwxτwyτwz]
Wind torque vector
N m
g
Gravitational force
m/s2
m
Total mass
kg
c()≡cos(),s()≡sin(), and t()≡tan()
2.2. Graph Theory
The mathematical preliminaries of the consensus control are important to be studied and understood to pave the way for dealing with the consensus control. The consensus control design procedure consists of a number of consecutive steps, which are listed below:Determining the number of agents.Deriving a graph based on agent communication.Deriving an adjacency matrix from the graph.Converting the adjacency matrix into a Laplacian matrix.Using the Laplacian matrix to design the consensus controller.Thus, graph theory plays a key role in these steps, since it can reflect the communication topology of MASs [25].Graph theory is commonly used in consensus control to describe the topology of the connection between the agents. Connection topology can be categorized as directed and undirected based on the information transmitted between the agents.A graph is built upon a finite set, which has a limited number of elements. This finite set is called vertex set , which includes the vertices of the graph. Assuming a graph has vertices, then the vertex set can be represented as . A set of two vertices from in the form , where and is called edge, while a set of edges is denoted by . Sometimes another notation is used for the vertices and edges of the graph as and , respectively, and for simplicity edge is denoted as .The directed graph (digraph) is defined as a graph that consists of a vertex set and directed edge set . In a directed graph, the direction of the edge is shown by an arrow. The graph is undefined if the vertex set is undefined.
2.2.1. Adjacency and Degree Matrices
For an undirected graph , each vertex has a degree . This degree is equal to the number of vertices adjacent to in . All the vertices’ degrees of the graph are represented in matrix form as a diagonal matrix, which is called a degree matrix and is given as
where is the degree matrix of graph , and is the number of vertices or agents.The adjacency matrix is a symmetrical matrix that reflects adjacency links between the vertices in graph , and can be derived from the following relationship:In all previously mentioned cases, the edge weights are considered to be one; but sometimes other weights are used instead of one. Generally, for weighted digraphs, the adjacency matrix is given as
where is the weight for the edge between vertices . The diagonal degree matrix is given as
where is the weighted in-degree of vertex , which represents the sum of the weights of the edge that is coming from vertex into vertex and is given in the following relation:
2.2.2. Laplacian Matrix
There are many ways to represent the Laplacian of a graph. The most straight forward method is given as follows:
where is the degree matrix of graph , and is the adjacency matrix for graph .
2.3. Mathematical Representation of Consensus Control
Consensus control stability and convergence analysis is a complex method due to the fact that consensus control is dealing with MAS, not a single agent. This is the reason why researchers in their publications simplify the system modeling to simplify the analysis approach, unlike the system modeling in a single agent. Some of the commonly used approaches for consensus control are the single-integrator system, double-integrator, and higher-order MASs, which are described next [25].
2.3.1. Single-Integrator MAS
MASs of single-integrator dynamics can be mathematically represented as follows:
where are the states vector for agent , is the input vector for agent , and refers to the number of agents in the MAS. The consensus control is represented by
where is the element of the Laplacian matrix, and is the control matrix, which needs to be determined.To write the consensus in a general matrix form, substitute Equation (10) in Equation (9).
where , and is defined in Equation (8).
2.3.2. Double-Integrator MAS
MASs of double-integrator dynamics can be mathematically represented as
where are the position and velocity vectors, respectively, for agent , and is the input vector for agent . The consensus control is represented byThe general matrix representation for the double-integrator MAS can be achieved by using Equation (12) with the controller in Equation (13) as follows:
where and .
2.3.3. Higher-Order MAS
An extension for the single and double-integrator MASs is given as follows:
where is the state vector for agent , is the input vector for agent , and is the output vector for agent . The consensus control can be achieved using the control defined in Equation (10) applied on the MAS of Equation (15). Then, the matrix form representation is as follows:
2.4. Problem Statement
A network of homogenous connected MAS is considered as a leader–follower connection, where each agent mathematical model can be represented in the compact form. The control signal of the leader is treated as a single agent control problem, while the followers’ controllers have to be designed and tuned based on the GA optimization technique such that they stabilize the unstable followers, reach consensus between all the MAS agents, and achieve desired formations while tracking different trajectories.
3. Proposed Controlling Scheme Design
The consensus control is proposed in this paper to achieve tracking formation for the MAS of the quadrotors while using the leader–follower connection topology between the agents (directed graph). The aim of the tracking formation is to keep the agents in a formation on the 3D space () and the angle of rotation around -axis () while tracking the leader. Therefore, the leader control is different from the followers, which can be represented by normal quadrotor control using a combination of NLPID for the outer loop control (OLC) and IADRC for the inner loop control (ILC) of the quadrotor system, which was designed in our previous works [26,27,28] and is shown in Figure 4. The followers’ control is a mix of two controllers; the first is the local one and the second is the consensus controller, as shown in Figure 5.
Figure 4
IADRC control for ILC and NLPID control for OLC of quadrotor system.
Figure 5
Local and consensus controllers.
The follower agent control system for () states are the IADRC designed for the leader agent, before proceeding in the proposed controller for the states () a brief explanation and equations for the NLPID and the IADRC are given.
3.1. Nonlinear PID (NLPID) Controller
The NLPID controller is developed based on the PID controller to be better while dealing with highly complicated nonlinear systems such as the 6-DOF quadrotor system. The NLPID controller is given in Equation (17).
where could be or , and the function is a positive function with coefficients . The term is introduced here to make the NLPID controller more sensitive for small errors. Small errors which is approximate to zero, the value of the term approaches to the upper bound , while for large errors, the term approaches the lower bound , this means that the limits of the term is bounded in the range .
3.2. Improved Active Disturbance Rejection Control (IADRC)
ADRC, is in general, a grouping of three dynamic systems: Extended State Observer (ESO), Tracking Differentiator (TD), and State Error Feedback (SEF) controller. In our previous paper, IADRC is proposed, which is a grouping of the abovementioned three dynamic systems taking into consideration that linear ESO (LESO) is used, TD is replaced by improved TD (ITD) and that SEF is changed by NLPID control.
3.2.1. Design of Linear Extended State Observer (LESO)
The key idea of this element is to convert the ESO with the error equation of the system to a disturbed asymptotical stable system, in which the high-gain eliminates the effect of the total disturbance error. The mathematical representation for the LESO used is given as,
where is control input, , and are gain constants of the observer needed to be tuned, . = , are relevant constants, and is the bandwidth of the observer and need to be chosen to get better estimation for the state and the disturbance with minimum OPI, and represent the states of the plant estimated by the ESO.
3.2.2. Design of Improved Tracking Differentiator (ITD)
The aim of this element is to generate a smooth approximation of the reference signal and its derivatives for the control system, which can be mathematically represented as:
where are the tracking signals (i.e., the reference signal and the derivatives of ) tracked by the differentiator signals and having initial values . The parameters and are the ITD parameters which have to be tuned to get the best tracking with small errors.
3.2.3. Design of Nonlinear PD Controller (NLPD)
The NLPID will be used as the SEF in the IADRC, which is the same in Equation (17), but for general chain of integrators system of order . The general chain of integrators system can be mathematically represented as:
where could be and are the controller parameters and are tuned to get minimum OPI. Due to the derivation in [24] of the IADRC, which changes the system into a chain of integrators, the integrator term in the NLPID will be neglected. The NLPD controller for general chain of integrators system of order is represented as:
3.3. Consensus Control for Position Subsystems
The position dynamics of the follower agents are given by
where is the number of agents, while the leader agent is denoted by . The proposed controller for position subsystems can be represented as
where and are the local and the consensus control signals for each agent, respectively. The consensus controller is designed as a single integrator system consensus controller, which is given by Equation (10) adding the formation position terms, while the local controller is designed to ensure that
and is given by
where and are the controllers’ parameters, which need to be tuned and optimized to give better tracking performance. Then Equation (24) can be written as
where and are the reference state in the and , respectively, and and are the consensus control parameters, which need to be tuned and optimized to give better performance.
3.4. Consensus Control for and Subsystems
The dynamics of subsystem is given asIt is very clear that it is coupled and nonlinear. Therefore, to overcome this problem and design the consensus controller, the quadrotor system is assumed to be near the hovering point where . Then the dynamics of subsystem is represented as follows:
which is a double integrator system and can be controlled using the consensus controller expressed in Equation (13), assuming that and adding the formation position terms. The controller of the subsystem is given as
while for the dynamics of subsystem is given asThe assumption of the hovering point is still true and the inertias of the quadrotor are nearly equal (i.e., ). Then, the dynamics of the subsystem is represented as a double integrator in the following form:
which can be controlled using the consensus controller given in Equation (13); for simplicity, is assumed to equal one (), because its effect will be reflected on the value of the consensus controller gain () and adding the formation position terms. The controller of the subsystem is given as
where and are the reference state in the and , respectively, and and are the consensus control parameters that need to be tuned and optimized to give better performance. The overall quadrotor system with the consensus controller is shown in Figure 6.
Figure 6
The overall ith quadrotor system with the consensus controller.
As shown in Figure 6, the local control units (i.e., IADRC) of the and subsystems were removed because the IADRC units keep tracking reference signals, which in this case must be zero signal. Therefore, the agents’ states will be forced to reach zero. This causes canceling of the consensus control effect.In contrast, the and subsystems do not have consensus controllers, while the local control units are still in place. This is due to the fact that the and subsystems have their own desired signals generated internally from the and position subsystems, as explained earlier, and can be treated as an internal part of the system. Therefore, adding consensus controllers will affect the and position subsystems and will never achieve the desired formation.The linearized subsystems in Equation (23), Equation (28) and Equation (31) of the 6-DOF quadrotor are used only for theoretical design of the controller to choose the appropriate one, while the complete nonlinear model represented by Equation (1) is used in all the simulations.
4. Simulation Results and Discussion
In this section, a MAS of three quadrotors using the leader–follower topology for communication network was studied, as shown in Figure 7. All quadrotor agents’ parameters values are defined in Table 2. The leader agent control system (NLPID–IADRC combination) parameters are given in Table 3, Table 4, Table 5 and Table 6.
Figure 7
Three quadrotors leader–follower connection topology.
Table 2
Quadrotor model parameters.
Parameter
Value
Description
Unit
Ixx
8.5532×10−3
Moment of inertia around x-axis
kg·m2
Iyy
8.5532×10−3
Moment of inertia around y-axis
kg·m2
Izz
1.476×10−2
Moment of inertia around z-axis
kg·m2
g
9.81×100
Gravitational force
m/s2
m
9.64×10−1
Total mass
kg
b
7.66×10−5
Thrust coefficient
N·s2
d
5.63×10−6
Drag coefficient
N·m·s2
l
2.2×10−1
Motor to center length
m
Table 3
LESOs Parameters of the leader quadrotor.
z
ϕ
θ
ψ
ω0
300
861.36
671.76
749.05
b0
0.5
0.0041
0.005
0.0047
Table 4
ITD parameters of the leader quadrotor.
z,ϕ,θ,ψ
α
0.9789
β
2.7936
γ
16.7728
R
26.5005
Table 5
NLPD parameters of the leader quadrotor.
z
ϕ
θ
ψ
k11
32.4800
5.6390
0.51080
0.6993
k12
11.4360
0.0764
0.0390
0.2109
k21
9.0756
0.7495
0.0660
0.2415
k22
0.1518
0.0471
0.0666
0.1024
μ1
0.2814
0.0763
0.5191
0.1272
μ2
0.4234
0.5997
0.7747
0.3760
α1
0.9684
0.9591
0.9574
0.9741
α2
0.9580
0.9547
1.0034
0.9417
Table 6
NLPID parameters for the position of the leader quadrotor.
x
y
x
y
k11
3.0735
2.3046
μ1
0.1214
0.1057
k12
0.5674
0.1108
μ2
0.9859
0.9744
k21
6.0204
4.5269
μ3
0.7656
0.6008
k22
2.0076
2.0040
α1
0.9106
0.9101
k31
2.2409×10−4
2.3828×10−4
α2
0.9999
0.9994
k32
5.5135×10−7
6.8046×10−7
α3
0.9140
0.9137
The Laplacian matrix for the MAS based on the connection topology shown in Figure 7 is given as
where the first row represents the leader agent, which does not receive any information from the followers; it just transmits its information to them. The weights used for the connection were ones. This represents that there was information transfer between the agents.The single agent system has only one error for each state, while for the MAS the number of errors depends on the number of agents in the system. In the next simulations, for each state of the quadrotor, there were three errors (i.e., for state the errors were , , and where ). For the reason mentioned previously, the multi-objective performance index (OPI) used here depended on the integral time absolute error (ITAE) performance index, which is given as
where represents the for each of the three consensus errors. was chosen to be for all and , while was chosen to be for all .The tuning process to obtain the optimum OPI was done using random initial values for states, such as , , and for the leader, the first follower, and the second follower, respectively. The reference signals used were zero for , , and , while a step input was used for the altitude (), and taking into considerations , , , and mentioned in Equation (26), Equation (29) and Equation (32), respectively, all were equal to zero during the tuning process.All the values of the consensus control parameters are shown in Table 7. The performance indices and the OPI values after GA tuning are given in Table 8 based on the initial values and the reference signals mentioned.
Table 7
Consensus controller parameters.
x
y
kx
Kx
ky
Ky
4.0259
0.9798
3.9732
1.0631
z
ψ
Kz1
Kz1
Kψ1
Kψ2
14,169.9808
427.2904
13.8067
0.5709
Table 8
Performance indices and OPI values for consensus control.
Consensus Control
ITAE1
ITAE2
ITAE3
x
1.076761
1.009015
1.879853
y
1.203705
1.177282
2.106002
z
0.838854
0.838702
0.001418
ψ
0.005238
0.005962
0.001104
OPI
0.845325
Some tests were done on the MAS with the proposed consensus control scheme. These are represented as follows.
4.1. Achieving Consensus
The purpose of this test was to check the errors between the agents’ states while following the leader agent and the time it took to reach consensus between the agents. The initial states for each agent are given in Table 9, while the reference signals applied for the altitude (), position (, ), and the Yaw angle () were zero signal with simulation time .
Table 9
Achieving consensus test initial states.
Initial States
x
y
z
ψ
Leader
0.001
0.001
0.001
0
Follower 1
0.5
0.5
0.8
π/4
Follower 2
−0.5
−0.5
0.4
−π/4
Figure 8, Figure 9, Figure 10 and Figure 11 a show the time response for the altitude (), position (, ), and the Yaw angle () states, while Figure 8, Figure 9, Figure 10 and Figure 11 b show the errors between agents. The subscript refers to the leader, while refers to the followers. The smoothness in the time response for all cases was very clear with fast consensus-reaching (i.e., less than for the and position, while for the altitude and the yaw angle the time taken to reach consensus was less than ). The errors occurring in all the states always converged to zero after a very short time even if the initial states were different.
Figure 8
Consensus in -position (a) -position time response. (b) Error of the -position.
Figure 9
Consensus in -position (a) -position time response. (b) Error of the -position.
Figure 10
Consensus in -position (a) -position time response. (b) Error of the -position.
Figure 11
Consensus in -angle (a) -angle time response. (b) Error of the angle.
4.2. Fixed Triangle Formation
A fixed formation for the agents was reached, while the leader was supposed to keep increasing in its altitude. The initial states for this test and the tests coming after were the same, as represented in Table 9, taking into consideration that the followers' initial altitude was the same altitude as that of the leader (i.e., initial for both followers). The reference signals of the leader for the , , and were zero signal, while for a ramp input was used with a slope with simulation time . The values of the reference states to form the triangle shape (, , , and ) mentioned in Equation (26), Equation (29) and Equation (32), respectively, are given asThe time response for the position and the altitude of the leader and the two followers are shown in Figure 12, while the motion of the agents in 3D is represented in Figure 13. From the results represented in Figure 12, the followers tracked the leader trajectory perfectly with fast response, keeping the distances given previously to form the triangle shape, while the altitude of the followers matched the leader path without a variation in the response. The motion in Figure 13 of the agents shows the agents, while changing their positions to form the triangle shape, which was formed by the three agents after a few seconds from starting the test, and the shape was kept fixed while the leader altitude was increasing.
Figure 12
Time response of fixed triangle formation for MAS.
Figure 13
Three dimensional view of fixed triangle formation for MAS.
4.3. Fixed Formation Tracking Ramp Trajectory
This test differed from the previous one by the reference signals used with the leader to track them, where the ( and ) position was a changeable not zero signal. Forming a fixed triangle shape while the agents were moving was required in this test. The reference states values for the triangle shape are given as follows:The reference trajectory signals for the leader agent were a ramp input signal for the and position states with starting time and slope , a step input for the altitude , and a zero signal for the yaw angle , with a full simulation time .Figure 14 shows the motion of the leader while tracking the reference trajectory and the followers keeping the required shape while tracking the leader trajectory. The shape was well performed by the three agents using the proposed controllers. The time response for this test is represented in Figure 15, which shows the fast-tracking of the followers to make the required shape with the leader agent without peaks and with a very smooth response.
Figure 14
Three-dimensional view of fixed formation tracking ramp trajectory.
Figure 15
Time response of fixed formation tracking ramp trajectory.
4.4. Leader Orientation-Based Formation Tracking a Helical Trajectory
The previous test showed good trajectory tracking performance while keeping the formation shape fixed. This test differed from the previous one by following the rotation of the leader and keeping the formation shape, but rotation occurred when the leader changed its position. This test is very important for the search and rescue applications of the MAS. In this test, the helical path was used as a reference trajectory for the leader agent. These reference signals are given in Table 10.
Table 10
Leader orientation-based formation tracking a helical trajectory input signals.
State
Reference Trajectory
Time (s)
x
cos(0.01πt)
[25,tf *]
y
sin(0.01πt)
[25,tf *]
z
0.2t
[0,tf *]
ψ
0
[0,tf *]
* .
Another difference in this test as compared to all the cases mentioned previously is that the reference states (, , , and ) were time-varying here to guarantee that the same triangle shape was kept depending on the leader orientation. The signals of the reference states of the and position are given in Table 11, while the altitude and the yaw angle reference states (, and ) were zero for all agents.
Table 11
The and reference states.
Leader
Follower 1
Follower 2
ϱx
0
−cos(θL)+sin(θL)
−cos(θL)+sin(θL)
ϱy
0
cos(θL)−sin(θL)
−cos(θL)−sin(θL)
θL=tan−1yLxL
The 3D motion of the agents while forming the triangle shape is shown in Figure 16. The triangle shape was well performed by the agents depending on the orientation of the leader and the path it was following. In Figure 16, each subgraph represents the triangle shape formed by the agents at a different time to show that the orientation of the triangle was changing depending on the leader direction.
Figure 16
Three-dimensional views of leader orientation-based formation tracking a helical trajectory, (a) Agents position at s. (b) Agents position at s. (c) Agents position at s. (d) Agents position at s.
The time responses are presented in Figure 17 for the , , and states. The results show a good tracking for the desired signals, while the leader direction was changing the value of the reference states ( and ) for the two follower agents (i.e., in second of the simulation , , and , which represented a triangle shape directed into the () direction, while in second 125 of the simulation , and , , which represented a triangle shape with the same dimensions but directed into the () direction). The mentioned two examples from the results showed that the shape was kept, while its direction was changed depending on the leader agent.
Figure 17
Time response of leader orientation-based formation tracking a helical trajectory.
4.5. Switching Formation Topology
This test required changing the formation shape during hovering mode for the MAS. The reference trajectory signals used for the leader agent are given in Table 12, while the state references for and used are given in Table 13.
Table 12
Switching formation topology input signals.
State
Reference Trajectory
Time (s)
x
−u(t−50)
[110,tf *]
z
u(t)
[0,tf *]
y,ψ
0
[0,tf *]
* .
Table 13
The and reference states for changing formation.
Leader
Follower 1
Follower 2
ϱx
0
u(t−25)+u(t−50)
u(t−25)
ϱy
0
1−u(t−50)
−1+u(t−50)+u(t−75)
The results of this test are represented in the plots of Figure 18, which show that the agents formed a line shape on the -direction in a fast response time through the interval of the time. In the second time interval , a triangle shape headed by the leader agent and directed to the -direction was performed by the agents in a smooth and fast response. In the third time interval , the leader started to move to the -direction, while the agents formed a line shape on the -direction, and the shape was well performed. In the last time interval , another triangle shape was performed but this time was headed by the second follower and was directed to the -direction.
Figure 18
Time response of switching formation topology.
In this test, the 3D motion in Figure 19 was represented in multiple subgraphs to show the exact formation achieved by the MAS. These subgraphs present the motion sequence of the three agents, and it is clear that all the desired formations were performed perfectly without any error.
Figure 19
Three-dimensional views of switching formation topology, (a) Agents formation at s. (b) Agents formation at s. (c) Agents formation at s. (d) Agents formation at s.
4.6. Fixed Formation Tracking with Exogenous Disturbances
In this test, the same trajectory and parameters of the fixed formation tracking ramp trajectory test (third test) were used in addition to the existence of different disturbances applied on the leader agent to show the ability of the proposed controller to cope with such practical case. The disturbances applied on the different states of the leader agent were a step of at , pulses for the roll, and pitch and yaw angles ( at , at , at ) with pulse widths of .Figure 20 and Figure 21 show the 3D motion of the agents and the time response of each agent while tracking the trajectory and the disturbances applied. The agents performed the trajectory tracking while different disturbances existed in a perfect way, where small peaks and fast tracking are shown in Figure 21. This proves the high immunity of the proposed controlling scheme against external disturbances.
Figure 20
Three-dimensional view of fixed formation tracking with disturbances.
Figure 21
Time response of fixed formation tracking with disturbances.
5. Conclusions and Future Aspects
The consensus control proposed for three agents of quadrotors in this work showed good performance in time. From the simulation cases and the results presented, it can be concluded that the complex nonlinear model for the quadrotor system is treated as a multiple double integrator subsystem. As compared to the other simplified quadrotor models used by some researchers mentioned in the literature survey, the proposed controller outperformed the others by dealing with the complete nonlinear model of the quadrotor system with good performance. This work can be further developed by taking into consideration the following points: proving the closed-loop stability of the proposed control scheme with the nonlinear quadrotor model, developing the proposed controller to achieve containment and flocking control, and implementing the consensus control on a hardware platform using the proposed controlling scheme. Another direction is to extend other control design methods, as in [29,30], to control the single agent UA based on its nonlinear model while using the proposed consensus control scheme.