Literature DB >> 35132111

Neuro-adaptive augmented distributed nonlinear dynamic inversion for consensus of nonlinear agents with unknown external disturbance.

Abstract

This paper presents a novel neuro-adaptive augmented distributed nonlinear dynamic inversion (N-DNDI) controller for consensus of nonlinear multi-agent systems in the presence of unknown external disturbance. N-DNDI is a blending of neural network and distributed nonlinear dynamic inversion (DNDI), a new consensus control technique that inherits the features of Nonlinear Dynamic Inversion (NDI) and is capable of handling the unknown external disturbance. The implementation of NDI based consensus control along with neural networks is unique in the context of multi-agent consensus. The mathematical details provided in this paper show the solid theoretical base, and simulation results prove the effectiveness of the proposed scheme.

Entities: Chemical

Year: 2022 PMID： 35132111 PMCID： PMC8821713 DOI： 10.1038/s41598-022-05663-4

Source DB: PubMed Journal: Sci Rep ISSN： 2045-2322 Impact factor: 4.379

Introduction

Cooperation among agents, i.e., the consensus, is a fundamental and essential requirement to execute a complex task cooperatively. In a real-world scenario, the agents face a variety of issues while making the consensus. These issues are associated with communication among the agents, plant’s uncertainty and unknown external disturbances. The former does not affect the agent dynamics, but the latter does a lot resulting in a mission failure. Considering the importance of a mission, the researchers focused on designing adaptive controllers capable of handling unknown disturbances. These controllers implement adaptive control laws, including the neural network (NN) based approximation scheme and the conventional linear or nonlinear control theory depending on the plant dynamics. The primary reason for selecting the NN is that it is an efficient technique to approximate unknown nonlinear functions[1], especially the radial basis function (RBF) neural network, which is widely used due to its simple structure. Such neuro-adaptive controllers are proposed to solve a variety of consensus problems. A few examples are mentioned here. A leader-follower synchronization problem for uncertain dynamical nonlinear agents was solved using neuro-adaptive scheme[2]. A cooperative tracking problem of agents with unknown dynamics[3] was proposed using a neural network-based controller. A bipartite consensus[4] was achieved using a neural network to learn the uncertainties of agents. Another leader-follower output consensus problem was solved[5] using a neuro-adaptive controller for a class of uncertain heterogeneous non-affine pure-feedback multi-agent systems in the presence of time-delay and input saturation. An adaptive leader-following consensus control for a class of strict-feedback agents[6] was solved using neuro-adaptive control. An exciting example of distributed finite-time formation tracking control problem for multiple unmanned helicopters was presented by Wang et al.[7]. The authors used the radial basis function neural network (RBFNN) technique to design a novel finite-time multivariable neural network disturbance observer (FMNNDO) to approximate the unknown external disturbance and model uncertainty law. In addition to nonlinear systems, a neural-network-based leaderless consensus control problem of fractional-order multi-agent systems (FOMASs) with unknown nonlinearities and unknown external disturbances was reported[8]. The effect of actuator fault on consensus asymptotic convergence of nonlinear agents with unknown dynamics was discussed by Li et al.[9]. Other examples include event-triggered consensus control problem for nonstrict-feedback nonlinear systems with a dynamic leader[10], fixed-time leader-follower consensus problem for multi-agent systems (MASs) with output constraints, unknown control direction, unknown system dynamics, an unknown external disturbance[11], stochastic nonlinear multi-agent systems with input saturation[12] etc. These papers implemented a variety of nonlinear controllers (e.g. feedback linearization, Lyapunov function, sliding mode, backstepping etc.) and a neural network approximation for uncertainty and unknown disturbances. In this paper, we have presented a neuro-adaptive augmented distributed controller, which is designed based on Distributed Nonlinear Dynamic Inversion (DNDI)[13]. We named it N-Distributed NDI (N-DNDI). It can be mentioned that the adaptive control expression in the papers mentioned earlier contains a linear or nonlinear error feedback term, and an adaptive term is added to it. However, N-DNDI is a new neuro-adaptive structure augmented in the DNDI frame. The primary reasons for selecting NDI are given as follows.Many researchers have used NDI to solve their research problems. Enns et al.[14] implemented NDI to design a flight controller. Singh et al.[15] developed a controller for autonomous landing of a UAV. Padhi et al.[16] described reactive obstacle avoidance schemes for UAVs in a Partial Integrated Guidance and Control (PIGC) framework using neuro-adaptive augmented dynamic inversion. Mondal et al.[17] applied NDI to propose a formation flying scheme. They presented how the NDI is implemented for tracking the leader’s commands in terms of coordinate, velocity, and orientation. Caverly et al.[18] used NDI to control the attitude of a flexible aircraft. Horn et al.[19] designed a controller of rotorcraft using Dynamic Inversion. Lombaerts et al.[20] proposed NDI-based attitude control of a hovering quad tilt-rotor eVTOL Vehicle. The NDI is an effective way to design a controller for plants with nonlinear dynamics. The nonlinearities in the plant are eliminated by using feedback linearization theory. Moreover, the response of the closed-loop plant is similar to a stable linear system. The NDI controller has many advantages. Examples of these advantages include 1. simple and closed-form control expression, 2. easily implementable, global exponential stability of the tracking error, 3. use of nonlinear kinematics in the plant inversion, 4. minimize the need for individual gain tuning, etc. The contribution is given as follows.The rest of the paper is organized as follows. In section “Preliminaries”, preliminaries are given. Section “Problem formulation” presents the problem definition. The mathematical details of the DNDI are provided in section “Nominal distributed nonlinear dynamic inversion (DNDI) controller”. The mathematical details of N-DNDI are given in section “Neuro-adaptive augmented DNDI for consensus”. The simulation study is presented in section “Simulation results”. The conclusion is given in section “Conclusion”. In this paper, a novel neuro-adaptive Distributed NDI (N-DNDI) is proposed to achieve the consensus among a class of nonlinear agents in the presence of unknown external disturbance. It can be mentioned that DNDI is a new consensus protocol[13] and augmentation of the neural network with DNDI is a new formulation. Hence, this is new in the context of MASs and not reported in the literature. The main advantage of N-DNDI is it inherits the features of NDI. Moreover, the augmentation of the neural network provides a very good approximation of the unknown external disturbances. Therefore, N-DNDI is a perfect combination for designing consensus controllers for nonlinear agents. The realistic simulation study justifies the effectiveness of blending DNDI and neural networks. The formulation to accommodate the neuro-adaptive structure in the DNDI framework is a significant contribution. Moreover, the mathematical details for convergence are provided to show the solid theoretical base of this new controller.

Preliminaries

The topics which are relevant to the problem considered in this paper are given in this section.

Consensus of multiple agents

The consensus of MASs on communication network is discussed in this section. The definition of the consensus is given as follows.

Definition 1

Let us consider a MASs with N agents, where denotes the states of the ith agent. The MASs will achieve the consensus if as . The consensus protocol aims to minimize the error in similar states of the individual agent with their neighbour by sharing information over the communication network, which is generally described using graph theory.

Graph theory

The communication among the agents can be represented by a weighted graph written by . The vertices of the graph denote the agents, and the set of edges, denoted by , represents the communication among the agents. The weighted adjacency matrix of G is denoted by if , otherwise . There is no self loop in the graph. This fact is expressed by selecting the diagonal elements of the adjacency matrix A as zero, i.e., , . The degree matrix is denoted by , where . The Laplacian matrix is written as . A graph with the property that is said to be undirected graph. If any two nodes , there exists a path from to , then the graph is called a connected graph. In this paper, we suppose that the topology G of the network is undirected and connected.

Radial basis function neural networks (RBFNNs)

Due to the ‘linear in the weight’ property, the Neural networks are widely implemented to approximate unknown functions and the radial basis function neural network (RBFNN) is a good candidate[21]. A continuous unknown nonlinear function can be approximated bywhere is input vector, is the weights of RBFs, denotes the basis function vector. ‘q’ denotes the number of neurons. is the approximation error. The ith basis function is given bywhere is the center of the receptors and is width of the ith gaussian function.

Useful lemma

The useful lemmas used in this paper are given as follows.

Lemma 1

[22] The Laplacian matrix L in an undirected graph is semi-positive definite, it has a simple zero eigenvalue and all the other eigenvalues are positive if and only if the graph is connected. Therefore, L is symmetric and it has N non-negative, real-valued eigenvalues .

Lemma 2

[23] Let be continuous positive vector functions, by Cauchy inequality and Young’s inequality, there exists the following inequality:where

Lemma 3

[24] Let be a continuous positive function with bounded initial R(0). If the inequality holds where, , then the following inequality holds.

Problem formulation

In this section, the problem definition is given. The objective is to design a neuro-adaptive consensus protocol that enables a class of nonlinear agents to achieve the consensus in the presence of external disturbance. Let us consider a group of N nonlinear agents. They are connected by the undirected and connected network topology. All the agents are homogeneous, i.e., they have similar dynamics. The dynamics of ith agent is given by Eqs. (5)–(6) as follows.where, , are states and control respectively. f is a continuously differentiable vector-valued function representing the nonlinear dynamics. is the unknown bounded and smooth external disturbance term with .

Assumption 1

The matrix is invertible for all time.

Nominal distributed nonlinear dynamic inversion (DNDI) controller

It is relevant to get an overview of the DNDI controller[13] and its convergence behaviour before augmenting neuro-adaptive structure is explained.

Brief overview of DNDI

A brief overview of DNDI controller is presented here. The block diagram of the consensus control scheme with nominal DNDI is shown in the Fig. 1.

Figure 1

Block diagram of distributed NDI or DNDI.

Block diagram of distributed NDI or DNDI. The nominal dynamics of ith agent is given as follows.where, , . denotes the consensus error of ith agent given bywhere , , , and . is identity matrix. ‘’ denotes the Kroneker product. Enforcing the first order error dynamics we getDifferentiation of Eq. (9) yieldsSubstitution of the expressions for and in Eq. (10) givesSimplification of Eq. (12) gives the expression of control for ith agent as follows.

Convergence of DNDI

Convergence study of DNDI is presented here. Let us consider a smooth scalar function given by can be written aswhere, is the left eigenvalue matrix of , is eigenvalue matrix, .where , , and .

Remark 1

It can be observed from Eqs. (14) and (16) that

Remark 2

According to Lemma 1, . Hence, is invertible.

Remark 3

is positive definite matrix. Hence, V is positive definite subject to consensus error and qualify for a Lyapunov function. Differentiating Eq. (14), we getwhere, . Substituting the control expression in Eq. (19) yieldsAccording to Lemma 2, we can writeSubstituting the inequality relation in Eq. (20)Let us design the gain as follows.Eq. (22) is written aswhere, . Applying Lemma 3 we getHence, we conclude that V is bounded as . In addition, we show the Uniformly Ultimate Boundedness (UUB) here. Using Eq. (17), Eq. (25), and presented by Ge et al.[24] we can writeEq. (26) can be written as follows.It can be observed that, if then and . If then for any given there exist a time such that , .Therefore, we can conclude

Neuro-adaptive augmented DNDI for consensus

Before going to the main derivation of Neuro-adaptive DNDI, we present the philosophy of neuro-adaptive control design[25].

Philosophy of neuro-adaptive control

The sole objective of the design is to drive the actual state X to desired state . The scheme adopted is to make actual state X to track the desired or nominal state through the virtual state as shown in Fig. 2.

Figure 2

Philosophy of neuro-adaptive control.

Philosophy of neuro-adaptive control. The tracking of X to and to is achieved by enforcing error dynamics to obtain the control considering nonlinear plant dynamics. We use the same philosophy to design the Neuro-adaptive distributed NDI controller in the next section.

Mathematical details of neuro-adaptive augmented DNDI (N-DNDI)

Neuro-adaptive augmented DNDI is a blending of neuro-adaptive control and DNDI. The block diagram of the control scheme is shown in Fig. 3. The portion of the diagram inside the blue border is the proposed design of neuro-adaptive controller.

Figure 3

Block diagram of Neuro-adaptive DNDI or N-DNDI.

Block diagram of Neuro-adaptive DNDI or N-DNDI. In case of neuro-adaptive augmented DNDI, the consensus error of ith agent is defined such that, the virtual state of ith agent, i.e., reach consensus with the neighbours. Therefore, the consensus error of ith agent is given bywhere . denotes the actual states of all the agents. The actual dynamics of ith agent is given bywhere is the external disturbance added to ith agent. The virtual dynamics for ith agent is given bywhere is the approximation of . It is important to note that, the consensus error in Eq. (31) is designed to measure the error in virtual state of ith agent and actual states of its neighbours. To drive this error to zero (i.e., ), we define a Lyapunov function as follows.Differentiating Eq. (34) yieldsAccording to the Lyapunov stability theory, let the time derivative of the Lyapunov function should bewhere is a positive definite diagonal matrix. The expression of in Eqs. (35) and (36) are equated to obtainEq. (37) is simplified as followsSubstituting the expression of in Eq. (38) we obtainPutting the expression of in Eq. (39) yieldsThe expression of control can be obtained by simplifying Eq. (40) as follows.It can be observed that the control expression in Eq. (41) is different from Eq. (13). Next, the error dynamics is enforced for driving the actual state of ith agent to its virtual state, i.e., .where . To approximate the unknown disturbance a single layer neural network is designed as shown in Eq. (43).where is a basis function vector. It is important to note that the ideal value of is and thus the disturbance can be approximated bywhere is the error tolerance. Eq. (42) is rewritten aswhere . The weight update rule is given bywhere is learning rate and is stabilizing factor of ith agent. It is important to note that because is constant and .

Convergence study of

The convergence study of the error is important. We have selected a Lyapunov function as follows.where and . Differentiation of Eq. (48) yieldsUsing Lemma 2 and , Eq. (49) is written aswhere . Let us definewhere, . Hence, we can write the Eq. (50) as follows.Using Eq. (17) we can writeTherefore, Eq. (51) is written as follows.Applying Lemma 3 we can write

Lemma 4

[24] Consider the positive function given bywhere and . If the following inequality holds:then, given any initial compact set defined bywe can conclude that the states and weights in the closed-loop system will remain in the compact set defined by the states and weights will eventually converge to the compact sets defined bywhere constants We will present the Uniformly Ultimate Boundedness (UUB) here using conclusion 2. Using Eqs. (52), (53), and (56) we can writeIf then , .If then for a given there exist a such that , we get Therefore, we concludeIn a similar fashion, we can concludeTherefore, according to conclusion 2, the proposed controller is able to make the approximation error to converge in the compact set defined by .

Simulation results

Simulation results are presented here. The simulation study is performed on PC with AMD Ryzen 5 processor and 8 Gb RAM.

Agent dynamics

The agent dynamics are given as follows.where . Equations (71) and (72) giveandandThe values of the parameters used in this simulation study are given as follows.The learning rate . We have selected RBF NN basis functions given by , where, . The centers of the basis functions are spaced evenly in the range of . The width of each basis function is selected as . The value of is chosen as 0.12. The disturbance added is given bywhich is unknown to the controller. The state trajectories of all the agents are shown as and , where, and . Similarly, the controls for the agents are shown by , and . Also, the virtual states are given by , and . The initial values of the states of all the agents are given in Table 1.

Table 1

Initial conditions of the states of the agents.

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_1$$\end{document}X1	2	− 2	− 2	− 1	9	3	− 1	6	5	− 5
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_2$$\end{document}X2	0	− 1	4	1	0	− 5	7	8	− 3	4

Initial conditions of the states of the agents. The adjacency matrix is given byThe unknown external disturbance is approximated by a neuro-adaptive controller. The approximated and real disturbance is shown in Fig. 4a and the approximation error is shown in Fig. 4b.

Figure 4

Performance of N-DNDI in approximating unknown external disturbance.

Performance of N-DNDI in approximating unknown external disturbance. It can be observed that the approximation is very good, which can be confirmed using the approximation error plot. Consequently, the states of the agents achieved the consensus in a few seconds. The state trajectories of all the agents, i.e., and , are shown in Fig. 5a and 5b respectively. The states of the agents reach the consensus in finite time.

Figure 5

Actual state trajectories.

Actual state trajectories. The consensus is achieved by neuro-adaptive consensus controls and which are shown in Fig. 6a and 6b respectively.

Figure 6

Neuro-adaptive control.

Neuro-adaptive control. The convergence of the states is shown by the consensus errors in state and . They are shown in Fig. 7a and 7b respectively. The errors converged in a few seconds. This means the virtual states and successfully reach the consensus.

Figure 7

Consensus error .

Consensus error . The virtual states and are shown in Fig. 8a and 8b respectively. It can be observed that the consensus value of the virtual state and the actual states are the same. Therefore, the actual states tracked the virtual states accurately. The proof of the tracking can be given by virtual errors.

Figure 8

Virtual state trajectory.

Virtual state trajectory. The virtual errors in states and are shown in Fig. 9a and 9b respectively. They have converged in finite time.

Figure 9

Virtual error .

Conclusion

The augmentation of neuro-adaptive structure to distributed nonlinear dynamic inversion (DNDI) frame produces a unique adaptive controller (N-DNDI) that efficiently handles the external disturbance. The N-DNDI inherits the features of the NDI technique and handles the unknown external disturbance. The convergence study provided in this paper explains the correctness of the design. The simulation results show that the neural network embedded in the controller approximates the unknown external function and the DNDI controller computes the consensus control signal accordingly. Consequently, the consensus is achieved in finite time. Hence, the proposed N-DNDI is a deserving candidate for consensus control in the presence of unknown external disturbances. We consider the heterogeneous agents along with communication issues as part of our future research plan. Also, we will present a comparison study of the proposed controller with the existing controllers.

6 in total

1. Adaptive neural control of uncertain MIMO nonlinear systems.

Authors: Shuzhi Sam Ge; Cong Wang
Journal: IEEE Trans Neural Netw Date: 2004-05

2. Decentralized robust adaptive control for the multiagent system consensus problem using neural networks.

Authors: Zeng-Guang Hou; Long Cheng; Min Tan
Journal: IEEE Trans Syst Man Cybern B Cybern Date: 2009-01-23

3. Distributed neural network control for adaptive synchronization of uncertain dynamical multiagent systems.

Authors: Zhouhua Peng; Dan Wang; Hongwei Zhang; Gang Sun
Journal: IEEE Trans Neural Netw Learn Syst Date: 2014-08 Impact factor: 10.451

4. Neural network disturbance observer-based distributed finite-time formation tracking control for multiple unmanned helicopters.

Authors: Dandan Wang; Qun Zong; Bailing Tian; Shikai Shao; Xiuyun Zhang; Xinyi Zhao
Journal: ISA Trans Date: 2018-01-05 Impact factor: 5.468

5. Neural-Network-Based Adaptive Event-Triggered Consensus Control of Nonstrict-Feedback Nonlinear Systems.

Authors: Wei Wang; Yongming Li; Shaocheng Tong
Journal: IEEE Trans Neural Netw Learn Syst Date: 2021-04-02 Impact factor: 10.451

6. Adaptive Neural Network Fixed-Time Leader-Follower Consensus for Multiagent Systems With Constraints and Disturbances.

Authors: Junkang Ni; Peng Shi
Journal: IEEE Trans Cybern Date: 2021-03-17 Impact factor: 11.448

6 in total