Literature DB >> 24707206

Nonlinear recurrent neural network predictive control for energy distribution of a fuel cell powered robot.

Qihong Chen¹, Rong Long², Shuhai Quan¹, Liyan Zhang¹.

Abstract

This paper presents a neural network predictive control strategy to optimize power distribution for a fuel cell/ultracapacitor hybrid power system of a robot. We model the nonlinear power system by employing time variant auto-regressive moving average with exogenous (ARMAX), and using recurrent neural network to represent the complicated coefficients of the ARMAX model. Because the dynamic of the system is viewed as operating- state- dependent time varying local linear behavior in this frame, a linear constrained model predictive control algorithm is developed to optimize the power splitting between the fuel cell and ultracapacitor. The proposed algorithm significantly simplifies implementation of the controller and can handle multiple constraints, such as limiting substantial fluctuation of fuel cell current. Experiment and simulation results demonstrate that the control strategy can optimally split power between the fuel cell and ultracapacitor, limit the change rate of the fuel cell current, and so as to extend the lifetime of the fuel cell.

Entities: Chemical Gene

Mesh：

Year: 2014 PMID： 24707206 PMCID： PMC3951001 DOI： 10.1155/2014/509729

Source DB: PubMed Journal: ScientificWorldJournal ISSN： 1537-744X

1. Introduction

As the rapid development of modern industrial technology, Ocean technology, and space technology, more and more mobile robots are demanded in these areas. Because of the advantages in operating time, weight, and dimensions, proton exchange membrane (PEM) fuel cells have been considered as alternative power sources for mobile robots. A mobile robot usually has multiple freedoms, which cause the electric load drastically to fluctuate. Sudden changes in power may significantly reduce the operating life of fuel cells in a long term [1, 2]. Furthermore, fuel cells have the characteristics of unidirectional power flow and they cannot absorb the energy from regenerative braking of a robot. As a result, fuel cells are usually arranged with auxiliary power sources to form hybrid power systems and drive mobile robots. Ultracapacitors are highly suitable for the bulk of the transient power demands since the charge/discharge current of an ultracapacitor can vary in a wide range. In this paper we choose a bank of ultracapacitors as auxiliary power source. A smart power split strategy is indispensable to enhance performance and lifetime of the hybrid power system. Jiang et al. [3] presented an adaptive control algorithm that adjusted the output current set point of the fuel cell. Ferreira et al. [4], Li et al. [5], and Kim et al. [6] developed a fuzzy controller to optimally distribute the power between the fuel cell and the battery. Rodatz et al. [7] designed an optimal control strategy to minimize the hydrogen consumption in a hybrid fuel cell system. Paladini et al. [8] proposed an optimal control strategy to power a vehicle with both fuel cell and battery to reduce fuel consumption. Lin et al. [9] studied a dynamic programming (DP) algorithm based on the fuel consumption and exhaust gas emission for a parallel electric vehicle. These strategies are effective in dealing with system efficiency but address little the lifetime of the fuel cell stack due to rapid load demand variations. Zhang et al. [10] presented a wavelet-transform algorithm to identify and allocate power demands with different frequency contents to corresponding sources to achieve an optimal power management control algorithm. This algorithm can protect fuel cell effectively but is complex and difficult to apply online. Xu et al. [11, 12] and Simmons et al. [13] proposed optimal real-time energy management strategies for a proton electrolyte membrane (PEM) fuel cell bus based on the Pontryagin's Minimal Principle and the determined dynamic programming (DDP). Ziogou et al. [14] deployed a dynamic optimization approach based on nonlinear model of fuel cell. Li et al. [15] developed a constrained model predictive control of a solid oxide fuel cell based on genetic optimization. Undoubtedly, the fuel cell power systems are nonlinear. Therefore, the global optimization based energy management strategies depend on nonlinear models of the fuel cell power systems and are time costly. Model predictive control (MPC) has been recognized as a powerful methodology for controlling a wide class of nonlinear dynamic system [16]. In this paper we use MPC appropriately, distribute power between the fuel cell and ultracapacitor, avoid frequent fluctuation of fuel cell current, and so enhance the transient performance and extend the operating life of the hybrid system. There have been three main methods for nonlinear system modeling and predictive control [17]. The first one uses a piecewise linearization to describe the nonlinear behavior of a system. Each model is effective only in a small region, which results in that a mass of models is required [18]. The second one directly employs nonlinear models, but these involve a nonlinear online optimization problem with constraints, which is usually time-consuming and may even be unable to guarantee a feasible solution for real time control [19]. The third method is to use a local linearization approach representing a nonlinear plant, which is valid and simplifies the implement [20-24]. This paper proposes an ARMAX (Autoregressive Moving Average with Exogenous input) modeling approach for fuel cell power systems. Time-variant coefficients of the ARMAX model are estimated by a recurrent neural network. The RNN-ARMAX model is an equal linear model of the fuel cell power system. Therefore, we design linear constrained model predictive control based on the RNN-ARMAX model for the nonlinear fuel cell power system. The design and implementation of the controller are significantly simplified and the method can protect fuel cell from substantial fluctuation of current by trading off transient current demand from the fuel cell to the ultracapacitor, according to constraints and weighting matrices of the output errors. The remainder of this paper is organized as follows. Section 2 describes RNN-ARMAX modeling of the fuel cell power system. MPC is designed in Section 3. In Section 4, we implement and discuss simulation results. Conclusions are given in Section 5.

2. RNN-ARMAX Modeling

We aim at the optimization of electric power distribution between the fuel cell and ultracapacitor of a fuel cell robot.

2.1. System Structure and Description

The fuel cell power system studied in this paper, as shown in Figure 1, is designed for a mobile robot. The electrical output of the PEM fuel cell is connected to the load through a unidirectional DC/DC converter, and an ultracapacitor bank is also connected to the load through a bidirectional DC/DC converter to form a hybrid fuel cell system. The ultracapacitor bank should supply peak power and be recharged by the fuel cell.

Figure 1

Fuel cell power system of a robot.

The distribution of power between the fuel cell and the ultracapacitor depends on the duty ratio of the DC/DC converters. Duty ratio of a DC/DC converter is defined as the ratio of switch on time interval, T ON, to switching period T; that is, There is one duty ratio, d fc, in the unidirectional DC/DC converter for controlling output power of the fuel cell. In the bidirectional DC/DC converter, one duty ratio, d , is for charging the ultracapacitor, and the other, d , is for discharging the ultracapacitor. Power distribution is optimized by controlling the three duty ratios.

2.2. Identification

The hybrid system is a multiple input and multiple output nonlinear system. The control input variables are three duty ratios of the power converters. Input variables are expressed as The output variables contain output voltage of the fuel cell and the state of charge of the ultracapacitor and so forth. Output variables are chosen as where V fc is voltage of the fuel cell, I fc is current of the fuel cell, I is current of the ultracapacitor, SOC is state of charge of the ultracapacitor, V is the bus voltage: and I is the bus current, respectively. Power demanded by the load, P , is viewed as a disturbance to the system. We can describe the model as the following nonlinear function: where φ(t) is the regression vector with known order n and m, n and n are dimensions of output and input, ξ(t) is the system disturbance, and f(·) is an unknown nonlinear function, respectively. If we design MPC based on direct use of the nonlinear model, it involves the online solution of a higher order nonlinear optimization problem with constraints, which is usually computationally expensive and may even be unable to guarantee a feasible solution for real time control. Here we use RNN-ARMAX to model the system. Performing Taylor expansion on the nonlinear function f(φ(t)) around the region φ(t) = 0 as We introduce the notation where Θ ∈ R and the coefficients a = a (φ(t)), b = b (φ(t)) are nonlinear function of φ(t). We have a regression form of the system described by (4) as follows: Here the parameter vector Θ(φ(t)) is time variant. The recurrent neural network (RNN) that consists of feed-forward and feedback connections is well known to be capable of modeling and control nonlinear system. We use RNN to estimate Θ(φ(t)). The recurrent neural network modeling principle is shown in Figure 2.

Figure 2

RNN modeling principle.

The RNN is expressed as where O(t) ∈ R is output of the RNN andare weights for the RNN among the output layer, the input layer, and the hidden layer. Define n , n , and n as the node amounts of the output layer, the input layer, and the hidden layer, respectively. K, W and are expressed as Then the output of the system is predicted by where Ψ(t) ∈ R and The performance criterion ψ(t) of the neural network is then defined by where y(t) is sampled output of the system. Therefore, the weights are adjusted to reduce the cost function ψ(t) to a minimum value by the gradient descent method. The weight vectors are updated along with where η is a positive learning rate. Let q, r be the quotient and remainder of i/[n∗n + (m + 1)n ], respectively. If h = 0, then set r = [n∗n + (m + 1)n ]. Else set q = q + 1. ∂ψ(t)/∂K, ∂ψ(t)/∂W, and are then calculated as follows: where The update rules of (15) call for a proper choice of the learning rate η. For a small value of η the convergence is guaranteed but the speed is slow; if η is too big, the algorithm becomes unstable. Here we develop a guideline in selecting the learning rate properly. A discrete Lyapunov function is given by where Thus the change of Lyapunov function due to the training process is obtained by The error difference due to the learning is represented by where ΔW represents a change in an arbitrary weight vector. From the update rule (15), Then we have the following general convergence theorem.

Theorem 1

η is the learning rate for the weights of RNN and ||·|| is the usual Euclidean norm in R . Then the convergence is guaranteed if η is chosen as

Proof

From equations (20)–(22), ΔV(t) can be calculated as To guarantee ΔV(t) < 0, η should satisfy the following inequality From inequalities (25) and (26), we obtain Namely, η satisfies This proves the theorem. We can establish a state space model from the matrix polynomials (7), (8), and (9) by defining a state vector given by A state space model can then be given by where Model (30) is a state space representation of MIMO RNN-ARX model (4). The parameters in A and B are estimated by the RNN, and the state x(t) at time t can be easily obtained by (29) according to the present output y(t), the past input/output data, and output of the RNN.

3. Controller Design

A predictive controller will be designed to predict the output trajectory of the fuel cell power system and compute a series of control actions, subject to constraints, that will minimize the difference between the predicted trajectory and desired trajectory. A prominent advantage of this controller over other control schemes is its ability to deal with constraints in a systematic and straightforward manner. To design predictive controller for the system, an objective function is defined as [18] where N is predictive horizon, is the estimated output of the system at instant t + k through models based on information available at instantt. y (t + k) is the desired output at instant t + k, and Q, R are weighting matrices on output errors and control, respectively. We choose the control horizon to be equal to the prediction horizon and define Q = diag⁡(Q Q Q Q SOC Q ) and R = diag⁡(R R R ), where, Q , Q , Q , Q SOC, and Q are penalties on errors in V fc, I fc, I , SOC and V , respectively. R , R , and R are penalties on d fc, d and d , respectively. Substituting state equations (30) into (32), the equation is abbreviated as where y(t) is system output at instant t, and Ω, L, G are constant matrices calculated through the system model and matrices Q, R. Consider the following: In the hybrid system, there are several limits to deal with. Rapid variation on current will reduce lifetime of fuel cell, so it is required to constrain the fluctuation of fuel cell current; that is, where ΔI max⁡ is the acceptable maximum value. Moreover, the state of charge of the ultracapacitor, the current of the ultracapacitor, and the voltage of the fuel cell should be limited to some expected range: where SOCmin⁡ and V fc,min⁡ are the lower limitations, SOCmax⁡, I , and V fc,max⁡ are the upper limits, respectively. These limitations are determined by the characteristics of the ultracapacitor and fuel cell. A prominent advantage of MPC is its ability to deal with constraints. Deduced from equations (30), (32) and inequalities (35)–(38), the control optimization is transformed to the following constrained quadratic programming problem: where U min⁡, U max⁡ ∈ R , and E ∈ R are constant matrices obtained from (30) and inequalities (35)–(38). We can solve this optimal problem using the neural network method investigated in [25].

4. Experiment and Simulation

The hybrid fuel cell system, as shown in Figure 1, is designed to power a robot. The rated power is 500 W. The DC bus voltage is controlled around 24 V. The PEM fuel cells have 40 cells and an active area of 22 cm2. The ultracapacitor is 200 F and the rated voltage is 24 V. The value of capacitance can be realized by a bank of 8 ultracapacitors, each with capacitance of 1600 F and a rated voltage of 3 V, connected in series. The upper and lower limits of SOC are 1 and 0.45, respectively. The maximum stored energy is 16 W h, although only 12.76 W h is available between the maximum and minimum of SOC. This 12.76 W h corresponds to an average power at 500 W for 92 seconds and that is sufficient to buffer the fuel cell from acceleration transients.

4.1. Modeling Experiment and Simulation

When real input and output data of the PEM fuel cell was sampled, the operating parameters are shown in Table 1.

Table 1

Parameters used in the experiment and simulation.

Sym.	Meaning	Value
T _st	Temperature of fuel cell	343 K
T _atm	Atmospheric temperature	295 K
P _H₂	Partial pressure of hydrogen	1.5 atm
n	Number of cells in each stack	40
A	Active area of fuel cell	22 cm²
C	Capacitance of ultracapacitor	200 F
V _c,max⁡	Rated voltage of ultracapacitor	24 V

The collected data are equally divided into two groups. The first group is used for modeling and the second group is used for validating. The simulated and measured V-I characteristics curves of the fuel cell are shown in Figure 3. Current of the ultracapacitor changes as Figure 4, and the simulated and measured voltage curves are shown in Figure 5. It is shown that the RNN-ARMAX model closely matches the practical fuel cell power system.

Figure 3

The simulated and measured V-I characteristics curves of the fuel cell.

Figure 4

Current of the ultracapacitor.

Figure 5

The simulated and measured voltage of the ultracapacitor.

4.2. Control Simulation

Control performances of constrained and unconstrained MPCs are studied and compared to validate the proposed constrained MPC. The constraints of the constrained MPC are listed in Table 2.

Table 2

Constraints for the constrained MPC.

Sym.	Meaning	Lower limit	Upper limit
ΔI _max⁡	Rate of change of fuel cell current	−0.4 A/s	0.4 A/s
SOC	State of charge of the ultracapacitor	0.45	1
I _c	Current of the ultracapacitor	−30 A	30 A
V _st	Voltage of the fuel cell	27.5 V	40 V

A typical load cycle that is used in simulation and the power profile, as shown in Figure 6, is considered as the power demand.

Figure 6

Power profile.

The simulation results for both the unconstrained and the constrained MPC are shown in Figure 7. It is shown that, there exist significant perturbations in current of fuel cell for unconstrained MPC. This phenomenon may cause oxygen starvation because the dynamic response of oxygen supply is slower, while in the case of the constrained MPC, current and voltage are much smoother.

Figure 7

Simulation results of constrained and unconstrained MPC: (a) current of fuel cell; (b) voltage of fuel cell; (c) SOC of ultracapacitor.

In the case of constrained MPC, the oscillation of SOC of the ultracapacitor is much larger than that of the unconstrained MPC. The reason is that constrained MPC draws much more energy from the ultracapacitor to supply the peak load and so limits perturbations of the current of the fuel cell. Constraint results are shown in Figure 8. It's exciting that the maximum rate of change of the fuel cell is 0.4 A/s, the minimum voltage of the fuel cell is 27.5 V, the charge and discharge current of the ultracapacitor are no more than 30 A, and the SOC of the ultracapacitor is between 0.45 and 1. It is shown that these variables change in the desired and constrained ranges. These phenomena demonstrate that the constraints on the fuel cell power system are valid.

Figure 8

Curves for validating of constraints: (a) change rate of fuel cell current; (b) voltage of fuel cell; (c) current of ultracapacitor; (d) SOC of ultracapacitor.

The power split under the constrained MPC is shown in Figure 9. We set the minimum voltage of the fuel cell as 27.5 V and the corresponding maximum power of fuel cell as 500 W. It is noticed that the fuel cell power changes in low speed and is no more than 500 W. The high frequency power demands are squeezed from the ultracapacitor. Furthermore, SOC, I and other constrained variables satisfy their constraints. Consequently, the output power of the fuel cell is well controlled and it is helpful to extend the operating life of the fuel cell.

Figure 9

Power distribution of the hybrid system.

5. Conclusions

RNN-ARMAX model was established and linear constrained MPC was developed and verified for a fuel cell power system. The proposed approach, different from other approaches, models the nonlinear fuel cell power system as linear time varying system. Accordingly, linear constrained MPC can be used to globally optimize power distribution and deal with limitations. The design and implementation of the controller are significantly simplified and the method can protect fuel cell from substantial fluctuation of current by trading off transient current demand from the fuel cell to the ultracapacitor.

2 in total

1. A recurrent neural network with exponential convergence for solving convex quadratic program and related linear piecewise equations.

Authors: Youshen Xia; Gang Feng; Jun Wang
Journal: Neural Netw Date: 2004-09

2. Diagonal recurrent neural networks for dynamic systems control.

Authors: C C Ku; K Y Lee
Journal: IEEE Trans Neural Netw Date: 1995

2 in total