Skip to main content

BRIEF RESEARCH REPORT article

Front. Appl. Math. Stat., 28 October 2022
Sec. Dynamical Systems
This article is part of the Research Topic New Trends in Control and Applications of Complex System View all 6 articles

Composite fuzzy learning finite-time prescribed performance control of uncertain nonlinear systems with dead-zone inputs

\nFang Zhu,
Fang Zhu1,2*Pengtong LiPengtong Li1
  • 1College of Mathematics, Nanjing University of Aeronautics and Astronautics, Nanjing, China
  • 2School of Finance and Mathematics, Huainan Normal University, Huainan, China

This paper presents a composite fuzzy learning finite-time prescribed performance control (PPC) approach for uncertain nonlinear systems with dead-zone inputs. First, a finite-time performance function is constructed by a quartic polynomial. Subsequently, with the help of an error transformation function, the restriction problem of the tracking error performance is transformed into a stability problem of an equivalent transformation system. In order to ensure that all signals of the closed-loop system are bounded, a finite-time PPC method combined with fuzzy logic systems (FLSs) is proposed. Although the tracking error can be guaranteed to be limited within a predefined range, the proposed finite-time PPC method only uses instantaneous data, which cannot guarantee the accurate estimation of unknown functions under the influence of dead-zone inputs. Therefore, based on the persistent excitation (PE) condition, a predictive error is defined by using online recorded data and instantaneous data, and a corresponding composite learning finite-time PPC method with parameter updating the law, which can not only achieve the control aim of the former method but also improve the control effect, is designed. The simulation results verified that the composite learning finite-time PPC method is more effective than the finite-time PPC method without learning.

1. Introduction

The traditional control methods, such as adaptive control [13], feedback control [46], active control [79], impulsive control [10, 11], can ensure that the tracking error converges to a residual set whose size is usually unknown. Although these controllers can achieve satisfactory steady-state error performance, the transient error performance (including settling time and maximum overshoot) cannot be guaranteed. Thus, many researchers focused on the transient performance of control systems, and a lot of methods were proposed, for example, in Bechlioulis and Rovithakis [12], Niu and Zhao [13], Li and Tong [14], Yao and Tomizuka [15], and Zhi and Xu [16]. In order to ensure that the tracking error satisfies certain transient performance, a prescribed performance control (PPC) method was proposed by Bechlioulis and Rovithakis [12]. It is concluded that the characteristic of the PPC method is that the tracking error is limited within a small pre-defined range, and its convergence rate is not less than a predefined constant. Thus, the PPC method can ensure that the tracking error satisfies both steady-state performance and transient performance. Some recent research works on PPC can be referred to Kostarigka et al. [17], Zhang et al. [18], Zhang and Cheng [19], Bu et al. [20], Xiang and Liu [21], and Liu et al. [22]. Kostarigka et al. [17] designed a PPC method for the flexible joint robot with unknown or possibly variable elasticity in order to make the link position error meet the pre-set performance index. In Zhang et al. [18] proposed a PPC control strategy for a generic flexible air-breathing hypersonic vehicle, which ensures that the velocity and altitude tracking errors of the vehicle have ideal transient performance. For non-strict feedback systems with unmeasurable states, an observer-based neural adaptive prescribed performance control approach [19] was proposed to achieve expected output tracking performance. The PPC methods in Kostarigka et al. [17], Zhang et al. [18], Xiang et al. [23], and Zhang and Cheng [19] can not achieve the convergence of tracking error with sufficiently small overshoot. To solve this problem, an improved PPC control strategy combined with backstepping technology is proposed by Bu et al. [20] to guarantee the convergence of tracking error with small overshoot for uncertain nonlinear dynamic systems. Using a similar PPC strategy, the transient performance of tracking errors for uncertain systems with unknown control gain signs was discussed by Xiang and Liu [21]. In order to realize the finite-time control of tracking error, a finite-time performance function and a corresponding finite-time PPC method were proposed for strict-feedback nonlinear systems by Liu et al. [22]. However, it should be pointed out that the above PPC methods mainly focus on driving the tracking error to meet certain transient performances, in addition, although fuzzy logic systems (FLSs) or neural networks (NNs) are used to approximate the nonlinear functions of the controlled system, the approximation errors are not further discussed.

In practical control systems, using FLS or NN to deal with system uncertainty has brought us great help. However, if the parameter updating law of FLS or NN is designed based on instantaneous data, it may not guarantee the accurate estimation of the unknown function. To obtain an accurate estimation of system uncertainty, a valid strategy is to define a prediction error by using online recorded data together with instantaneous data, and then design a composite learning parameter updating law [2432]. Based on a partial persistent excitation (PE) condition, an NN composite learning control scheme was proposed by Wang and Hill [24], which can accurately approximate the unknown function. In Pan and Yu [32], a composite learning robot control approach was developed to achieve accurate parameter estimation under an interval excitation (IE) condition. To the best of our knowledge, most of the composite learning control methods only consider linear input, and the existing results are not valid for nonlinear input, such as dead-zone and saturation. In addition, the composite learning control method can achieve the steady-state performance of the tracking error; however, whether it can be combined with PPC technology to achieve transient performance will become an interesting topic.

Inspired by the above discussion, a composite learning finite-time PPC was proposed for an uncertain multi-input multi-output (MIMO) nonlinear second-order system. Compared with Xiang et al. [23], the highlights of this paper are presented as follows: 1) The proposed method in this paper ensures that the tracking error is limited within the predefined region at the settling time. 2) By using online recorded data and instantaneous data, composite learning parameter updating laws are designed. 3) The proposed control method overcomes the influence of dead-zone inputs and realizes accurate estimation of unknown functions.

The paper is arranged as follows. The control problem and some preliminaries are presented in Section 2. The composite learning finite-time PPC method is designed in Section 3. Some simulation results are shown in Section 4. The conclusion is provided in Section 5.

2. Preliminaries

2.1. System description

Consider a MIMO nonlinear second-order system whose dynamics can be described as

ξ¨=F(ξ,ξ˙,t)+Γ(u(t)),    (1)

where ξ=[ξ1,ξ2, ,ξn]Tn is a state vector, ξ˙=[ξ˙1,ξ˙2, ,ξ˙n]T and ξ¨=[ξ¨1,ξ¨2, ,ξ¨n]T are the first derivative and the second derivative of ξ, respectively. F(ξ,ξ˙,t)=[f1(ξ,ξ˙,t),f2(ξ,ξ˙,t), ,fn(ξ,ξ˙,t)]Tn is the nonlinear function vector; u(t)=[u1(t),u2(t), ,un(t)]Tn is the control input vector and Γ(u(t))=[Γ1(u1(t)),Γ2(u2(t)), ,Γn(un(t))]Tn is the output of dead-zone function vector in the actuator, which is described as

Γi(ui(t))={mi1(ui(t)-b-i),if ui(t)b-i,0,if -b-i<ui(t)<b-i,mi2(ui(t)+b-i),if ui(t)-b-i,    (2)

where mi1 and mi2 represent the right and the left slopes of the dead-zone and are assumed to satisfy that mi1 = mi2 = mi. b-i and bi stand for the breakpoints of the dead-zone.

From Equation (2), the equivalent form of Γ(u(t)) can be expressed as

Γ(u(t))=Mu(t)+u(t),    (3)

where M = diag(m1, m2, ⋯ , mn), u(t)=[u1(t),u2(t), ,un(t)]T and △ui(t) is described as

ui(t)={-mib-i,if ui(t)b-i,-miui(t),if -b-i<ui(t)<b-i,mib-i,if ui(t)-b-i.    (4)

where ξ- is defined as ξ-=[ξ1T,ξ2T]T=[ξT,ξ˙T]T. Substituting (Equations 3–1), the system (Equation 1) can be rewritten as

{ξ˙1=ξ2,ξ˙2=F-(ξ-,t)+Mu(t),    (5)

where F-(ξ-,t)=[f-1(ξ-,t),f-2(ξ-,t), ,f-n(ξ-,t)]T=[f1(ξ,ξ˙,t)+u1(t),f2(ξ,ξ˙,t)+u2(t), ,fn(ξ,ξ˙,t)+un(t)]T.

The following assumptions and lemmas are provided to facilitate control analysis.

Assumption 1. State vectors ξ and ξ˙ are measurable.

Assumption 2. f-i(ξ-,t),i=1,2, ,n is an unknown continuous function.

Remark 1. Assumptions 1 and 2 are common assumptions for the second-order nonlinear system [8, 23].

Lemma 1. Assume that f(x) is a continuous function, which is defined on a compact set Ωx, then there exists an optimal FLS φT(x* satisfies

supxΩx|f(x)-φT(x)θ*|εf*,    (6)

where θ*=[θ1*,θ2*, ,θm*]Tm is the ideal weight vector, m ∈ ℕ is the number of the fuzzy rules, εf* is the desired level, and φ(x) is a basis function vector, which can be expressed as

φ(x)=[φ1(x),φ2(x), ,φm(x)]Tj=1mφj(x)    (7)

where φj(x)=exp(-||x-cj||2bjTbj) is a Gaussion function, cj=[cj1,cj2, ,cjm]T, and bj=[bj1,bj2, ,bjm]T are the center vector and the width of φj(x), respectively.

According to Lemma 1, the unknown function f-i(ξ-,t) can be written as

f-i(ξ-,t)=φf-iT(ξ-)θf-i*+εf-i(ξ-),    (8)

where φf-i(ξ-) is the basis function vector and εf-i(ξ-) is the estimation error satisfying |εf-i(ξ-)|εf-i*, εf-i* is a positive constant. θf-i* can be defined as

θf-i*=argminθ^f-im{supξ-Dξ-|f-i(ξ-,t)-φf-iT(ξ-)θ^f-i|},    (9)

where θ^f-i is an adjustable parameter vector. In this paper, φf-iT(ξ-)θ^f-i is applied to approximate f-i(ξ-,t).

Let φF-T(ξ-)θF-*=[φf-1T(ξ-)θf-1*,φf-2T(ξ-)θf-2*, ,φf-nT(ξ-)θf-n*]T, φF-T(ξ-)θ^F-=[φf-1T(ξ-)θ^f-1,φf-2T(ξ-)θ^f-2, ,φf-nT(ξ-)θ^f-n]T, φF-T(ξ-)θ~F-=[φf-1T(ξ-)θ~f-1,φf-2T(ξ-)θ~f-2, ,φf-nT(ξ-)θ~f-n]T, εF-(ξ-)=[εf-1(ξ-),εf-2(ξ-), ,εf-n(ξ-)]T and θ~F-=θF-*-θ^F- is the estimated parameter error, where θF-*=[θf1-*T,θf2-*T, ,θfn-*T]T and θ^F-=[θ^f1-T,θ^f2-T, ,θ^fn-T]T. Thus, the system (Equation 5) can be modified as

{ξ˙1=ξ2,ξ˙2=φF-T(ξ-)θ~F-+φF-T(ξ-)θ^F-+εF-(ξ-)+Mu(t).    (10)

The tracking error e is defined as e=[e1,e2, ,en]T=ξ1-ξd and ξd=[ξd1,ξd2, ,ξdn]T is a reference signal vector, where ξdi, ξ˙di, and ξ¨di are continuous, recurrent, and available.

Definition 1. Pan and Yu [27] proposed that if the inequality t-τdtϕ(τ)ϕT(τ)dτμI holds for a bounded signal ϕ(t), then it is said that ϕ(t) satisfies the PE condition, where μ, τd are positive constants and I is an identity matrix.

Lemma 2. Pan and Yu [27] proposed that if x is recurrent, then there exists a regression vector φ(x) in Equation (7) such that the PE condition is satisfied.

Remark 2. In Pan et al. [26, 31], Liu et al. [29], and Pan and Yu [32], a PE condition criterion for φf-i(ξ-) in Equation (8) is given, that is, by calculating the minimum singular value of t-τdtφf-i(ξ-)φf-iT(ξ-)dτ. If the minimum singular value of t-τdtφf-i(ξ-)φf-iT(ξ-)dτ is greater than zero, φf-i(ξ-) can be regarded as satisfying the PE condition, i.e., there exists a positive constant μi such that t-τdtφf-i(ξ-)φf-iT(ξ-)dτ>μiI, and μi is only used for the theoretical proof of the proposed control method in this paper.

Remark 3. In Wang and Yang [33], a recurrent trajectory is described as periodic or period-like trajectory, that is, given a ø-neighborhood and a constant time T(ø), the return time of the recurrent trajectory to any point in the ø-neighborhood will not exceed T(ø). If a recurrent trajectory ξ is given in advance and |ζ − ξ| < ø holds after a certain time, ζ can also be regarded as a recurrent trajectory.

Lemma 3. (Boundedness of basis function). There exists a positive constant ψ, which is independent of x, such that φ(x) in Equation (7) satisfies max{φ(x),φ˙(x)}ψ.

2.2. Prescribed performance

First, a finite-time performance function p(t, α0, α, Te) is introduced as

p(t,α0,α,Te)={α4t4+α3t3+α2t2+α1t+α0,   0t<Te,α, tTe,    (11)

where α0 and α are the initial value and boundary value of the performance function p(t, α0, α, Te), respectively. Te is the predefined experienced time of p(t, α0, α, Te) than that from α0 to α. To guarantee that p(t, α0, α, Te), (t, α0, α, Te), and p¨(t,α0,α,Te) are continuous at t = Te, parameters α1, α2, α3, and α4 are designed as

{α1=4(α-α0)Te,α2=6(α0-α)Te2,α3=4(α-α0)Te3,α4=α0-αTe4.    (12)

For the tracking error e=[e1,e2, ,en]T, assume that each ei satisfies the following prescribed performance boundary (PPB):

pi(t,α-i0,α-i,Te)<ei<pi(t,α-i0,α-i,Te),    (13)

where pi(t, αi0, αi∞, Te) and pi(t,α-i0,α-i,Te) are the lower boundary and the upper boundary of ei, respectively. The designed parameters α-i0,α-i,α-i0,α- satisfy α-i<α-i0<α-i0 and α-<α-<α-0.

In this paper, a transformation variable zi is defined as

zi=ln(ϱi(t)1-ϱi(t)),    (14)

where ϱi(t)=ei-pi(t,α-i0,α-i,Te)pi(t,α-i0,α-i,Te)-pi(t,α-i0,α-i,Te).

Obviously, the following result holds:

Lemma 4. The boundedness of zi can guarantee that ei satisfies the PPB (13).

Proof. From Equation (14), one obtains that ϱi(t)=ezi1+ezi. Since zi is bounded, which means 0<ϱi(t)=ei-pi(t,α-i0,α-i,Te)pi(t,α-i0,α-i,Te)-pi(t,α-i0,α-i,Te)<1. Obviously, pi(t,α-i0,α-i,Te)<ei<pi(t,α-i0,α-i,Te). The proof is completed.

z is defined as z=[z1,z2,···,zn]T, the derivative of z becomes

z˙=r(ξ2-ξ˙d+Π),    (15)

where r = diag(r1, r2, ···, rn), Π=[Π1,Π2,···,Πn]T, and

    ri=1ϱi(t)(1-ϱi(t))(pi(t,α-i0,α-i,Te)-pi(t,α-i0,α-i,Te)),Πi=pi(t,α-i0,α-i,Te)i(t,α-i0,α-i,Te)-i(t,α-i0,α-i,Te)pi(t,α-i0,α-i,Te)-ei(t)(i(t,α-i0,α-i,Te)-i(t,α-i0,α-i,Te))pi(t,α-i0,α-i,Te)-pi(t,α-i0,α-i,Te).    (16)

Remark 4. If ei is limited within PPB (Equation 13), then 0 < ϱi(t) < 1 and 0 < 1 − ϱi(t) < 1, which implies that ri is negative and bounded. For tTe, one has pi(t, αi0, αi∞, Te) = αi∞, i(t, αi0, αi∞, Te) = 0, pi(t,α-i0,α-i,Te)=α-i, and i(t,α-i0,α-i,Te)=0, then ri=α-i-α-i(α-i-ei)(ei-α-i) and Πi = 0. Obviously, there exists a minimum value r*=4α-i-α-i such that rir* during t ∈ [Te, ∞).

z1 and z2 are defined as z1 = z, z2=z˙, the transformation system is given as follows:

{z˙1=z2,z˙2=rφF-T(ξ-)θ~F-+rφF-T(ξ-)θ^F-+rεF-(ξ-)+rMu(t)-rξ¨d+rΠ˙+Λ,    (17)

where Π˙=[Π˙1,Π˙2, ,Π˙n]T, Λ=r˙(ξ2-ξ˙d+Π), and r˙=diag(1,2, ,n).

3. Composite learning control design

First, introduce a variable σ=[σ1,σ2, ,σn]T as

σ=z2+cz1,    (18)

where c = diag(c1, c2, ⋯ , cn), ci is designed as positive constant. From Equations (15) and (18), the time derivative of σ becomes

σ˙=rφF-T(ξ-)θ~F-+rφF-T(ξ-)θ^F-+rεF-(ξ-)+rMu(t)   -rξ¨d+rΠ˙+Λ+cz2,    (19)

In order to make variable σ and the estimated error θ~F- bounded, the following theorem shows the first main result of this paper.

Theorem 1. For the MIMO nonlinear second-order system (Equation 1) with Assumptions 1 and 2, if the controller u is designed as

u=(rM)-1[-rφF-T(ξ-)θ^F-+rξ¨d-rΠ˙-Λ-cz2-Kσ],    (20)

where K = diag(k1, k2, ⋯ , kn), ki is the positive parameter. And the updating law θ^f-i is chosen as

θ^˙f-i=ηF-(riσiφf-i(ξ-)-γF-θ^f-i),    (21)

where ηF- and γF- are designed positive constants. All signals in Equation (22) are bounded, and then ei satisfies PPB (Equation 13). Meanwhile, ξ1 and ξ2 are recurrent after Te.

Proof. Consider the following Lyapunov function

V1=12(σTσ+1ηF-θ~F-Tθ~F-),    (22)

Substituting Equations (19)–(21) into V˙1, one gets

V˙1=-KσTσ+σTrεF-(ξ-)+σTrφF-T(ξ-)θ~F--i=1n1ηF-θ~f-iT(ξ-)θ^˙f-i=-KσTσ+σTrεF-(ξ-)+i=1nσiriφf-iT(ξ-)θ~f-i-i=1n1ηF-θ~f-iT[ηF-(riσiφf-i(ξ-)-γF-θ^f-i)]=-KσTσ+σTrεF-(ξ-)+i=1nγF-θ~f-iTθ^f-i=-KσTσ+σTrεF-(ξ-)+γF-θ~F-Tθ^F-.    (23)

By using Young's inequality, one obtains

σTrεF-(ξ-)σTσ4+rεF-(ξ-)2,γF-θ~F-Tθ^F--γF-2θ~F-Tθ~F-+γF-2θF-*2,    (24)

Because r and εF-(ξ-) are bounded, there exists an unknown upper bound bεr* such that rεF-(ξ-)bεr*. Therefore,

V˙1-k-σTσ-γF-2θ~F-Tθ~F-+R1*    (25)

where k-=mini{ki-14}>0, R1*=bεr*2+γF-2θF-*2. The following compact sets are defined as follows:

Ωσ={σ|σR1*k-},Ωθ={θ~F-|θ~F-2R1*γF-}.    (26)

Obviously, if σ ∉ Ωσ or θ~F-Ωθ~F-, one has V˙1<0. Thus, σ and θ~f-i are semiglobally uniformly bounded. Notice that z2=z˙1=σ-cz1, selecting Lyapunov function. Let V2=12z1Tz1, one has

V˙2=z1Tz˙1=z1Tσ-z1Tcz1  -cminz1Tz1+z1Tz14+σ2  -c-z1Tz1+R1*k-,    (27)

where c-=cmin-14>0, cmin=mini{ci}. The compact set is defined as Ωz1={z1|z1R1*c-k-}, if z1 ∉ Ωz1, V˙2<0. So, z1 is bounded, and then

z2σ+cz1    R1*k-+cmaxz1    =R1*k-(1+cmaxc-),    (28)

where cmax=maxi{ci}. According to Lemma 4, the boundedness of z1 ensures that all tracking errors ei meet PPB (Equation 13), which means that α-<ξi-ξdi<α- after Te. Because ξdi is recurrent, according to the description of recurrent trajectory in Remark 3, we know that ξi is also recurrent.

Form Equation (15), one gets ξ˙i-ξ˙di=żiri-Πi, and according to Remark 4, one has rir*, Πi = 0 after Te. Thus

|ξ˙i-ξ˙di|=|żi|riz2r*=R1*k-(1+cmaxc-)1r*,tTe.    (29)

It can be found that ξ˙i is limited within the R1*k-(1+cmaxc-)1r*- neighborhood of ξ˙di after Te. Since ξ˙di is recurrent, therefore, ξ˙i is also recurrent. Therefore, we can conclude that ξ1 and ξ2 are recurrent vector after time Te. □

Remark 5. Notice that the parameter updating law (Equation 21) in Theorem 1 only uses instantaneous data, which may not estimate the unknown function F-(ξ-,t) accurately. Therefore, we need to use online recorded data and instantaneous data to design composite learning parameter updating law.

Define Gi(t)=t-τdtφf-i(ξ-)φf-iT(ξ-)dτ, t ≥ τd + Te. Notice that ξ-=[ξ1T,ξ2T]T is recurrent vector after Te, according to Lemma 2 and Remark 2, if the selected fuzzy basis function φf-i(ξ-) in Equation (8) satisfies that the minimum singular value of Gi(t) is greater than zero, it means that Gi(t) satisfies the PE condition, i.e. there exists a positive constant μi such that Gi(t) ≥ μiI.

From Equation (5), one gets

f-i(ξ-,t)=ξ¨i-miui(t).    (30)

Multiplying φf-i(ξ-) to both sides of Equation (30) and integrating over [t − τd, t], one has

t-τdtφf-i(ξ-)f-i(ξ-,τ)dτ=t-τdtφf-i(ξ-)(φf-iT(ξ-)θf-i*+εf-i(ξ-))dτ           =Gi(t)θf-i*+t-τdtφf-i(ξ-)εf-i(ξ-)dτ           =t-τdtφf-i(ξ-)(ξ¨i-miui(τ))dτ.    (31)

Therefore, one obtains

Gi(t)θ~f-i=t-τdtφf-i(ξ-)(ξ¨i-miui(τ)-εf-i(ξ-))dτ.    (32)

Because ξ¨i is not measurable, a second-order filter is used to estimate ξ¨i:

{ω˙i1=ωi2,ω˙i2=-α(2βωi2+α(ωi1-ξ˙i))    (33)

with ωi1(0)=ξ˙i(0) and ωi2(0) = 0, where α denotes the natural frequency and β stands for the damping factor. According to Lemma 2 in Hu and Zhang [34], ξ˙i and ξ¨i can be estimated by ωi1 and ωi2, respectively. Moreover, for any given positive constant πi, there exist corresponding α and β so that |ωi2-ξ¨i|<πi holds.

Define a prediction error ϵi(t) as

ϵi(t)=Gi(t)θ~f-i+εei(t),    (34)

where εei(t) is a lumped approximation error given by

εei(t)=t-τdtφf-i(ξ-)(ωi2-ξ¨i+εf-i(ξ-))dτ.    (35)

From Equations (32) and (35), ϵi(t) can be computed by

ϵi(t)=t-τdtφf-i(ξ-)(ωi2-miui(τ))dτ.    (36)

Obviously, εei(t)t-τdtφf-i(ξ-)·(|ωi2-ξ¨i|+|εf-i(ξ-)|)dτψ(πi+εf-i*)τd by using Lemma 3 and Lemma 2 in Hu and Zhang [34]. Therefore, we modify the updating law (Equation 21) as follows

θ^˙f-i={ηF-(riσiφf-i(ξ-)-γF-θ^f-i), t<τd+Te,ηF-(riσiφf-i(ξ-)+γF-ϵi(t)),tτd+Te,    (37)

where γF- is a designed positive constant.

The following theorem shows the second main result of this paper.

Theorem 2. For the MIMO nonlinear second-order system (Equation 1) with assumptions 1 and 2, if the selected based function φf-i(ξ-) in Equation (8) satisfies Gi(t) ≥ μiI after Te + τd, then, the controller (Equation 20) and the composite learning law (Equation 37) guarantee that each tacking error ei satisfies PPB (Equation 13) at t ∈ [0, ∞) and θ~F- converges to a small neighborhood of zero during t ∈ [Te + τd, ∞).

Proof. Theorem 1 has proved that ei satisfies PPB (Equation 13) at t ∈ [0, Te + τd] and ξ-i=[ξiT,ξ˙iT]T is recurrent after Te. If the selected based function φf-i(ξ-) in Equation (8) satisfies Gi(t) ≥ μiI after Te + τd, then we just need to prove the stability of σ and θ~F- at t ∈ [Te + τd, ∞).

Consider the following Lyapuonv function

V3=12(σTσ+1ηF-θ~F-Tθ~F-),tTe+τd.    (38)

Substituting Equations (19), (20), and (37) into V˙3, one gets

V˙3=-KσTσ+σTrεF-(ξ-)-i=1nγF-θ~f-iTϵi(t)=-KσTσ+σTrεF-(ξ-)-i=1nγF-θ~f-iTGi(t)θ~f-i-i=1nγF-θ~f-iTεei(t).    (39)

Substituting Gi(t) ≥ μiI, t ∈ [Te + τd, ∞) to V˙3, one has

V˙3-KσTσ+σTrεF-(ξ-)  -i=1nμiγF-θ~f-iTθ~f-i-i=1nγF-θ~f-iTεei(t)  =-KσTσ+σTrεF-(ξ-)-μ-γF-2θ~F-Tθ~F-  -i=1nμiγF-2[θ~f-iTθ~f-i+2μiθ~f-iTεei(t)],    (40)

where μ-=mini{μi}. Applying Young's inequality, one gets

-i=1nμiγF-2[θ~f-iTθ~f-i+2μiθ~f-iTεei(t)]i=1nμiγF-2·εei2μi2i=1nγF-[ψ(πi+εfi*)τd]22μi.    (41)

Substituting the first inequality of Equations (24) and (41) into Equation (40) shows that

V˙3-k-σTσ-μ-γF-2θ~F-Tθ~F-+R2*,    (42)

where k-=mini{ki-14}>0, R2*=bεr*2+i=1nγF-[ψi(πi+εfi*)τd]22μi. Let κ=min{2k-,μ-γF-ηF-}. From Equation (42), it yields

V˙3-κV3+R2*.    (43)

Solving the inequality (Equation 43) leads to

V3(t)V3(Te+τd)e-κt+R2*κ,  t[Te+τd,),    (44)

which means σ and θ~F- tend to R2*κ- neighborhood of zero that can be arbitrarily diminished by properly designed parameters ki, ηF-, and γF- are chosen. Since σ is bounded, similar to the derivation of Theorem 1, we can conclude that ei is limited within [α-,α¯].

Remark 6. Theorem 1 shows a finite-time PPC method without learning. The parameter adaptive law uses instantaneous data, which does not guarantee the accurate estimation of the unknown function (see Figures 2AD in Section 4). Theorem 2 improves the parameter adaptive law by using online recorded data and instantaneous data after the preset time Te. Figures 4AC in Section 4 show the control effect. Obviously, the unknown functions are not affected by the dead-zone inputs and are accurately estimated.

4. Example

In this section, an inverted double pendulum system [35] (shown in Figure 1) was studied to illustrate the different control effectiveness of the proposed two methods. The dynamic model of the inverted double pendulum system (IBPS) was expressed as

{θ¨1=f1(θ1,θ˙1,θ2,θ˙2,t)+Γ1(ςu1(t)),θ¨2=f2(θ1,θ˙1,θ2,θ˙2,t)+Γ2(ςu2(t)),    (45)

where θi and θ˙i were the angles and angular velocities of the inverted double pendulum. f1(θ1,θ˙1, θ2,θ˙2,t)=χθ1+h(t)(-α(t)θ1+α(t)θ2-δ1+δ2)+d1(θ1,θ˙1), f2(θ1,θ˙1,θ2,θ˙2,t)=χθ2+h(t)(-α(t)θ2+α(t)θ1-δ1+δ2)+d2(θ2,θ˙2), χ=grcml, cm=mbma+mb, h(t)=k-(α(t)-cml)mbcml2, d1(θ1,θ˙1)=-mbmaθ˙12sin(θ1), and d2(θ2,θ˙2)=-mbmaθ˙22sin(θ2) were external disturbances for the IBPS (Equation 45), and control coefficient ς=1mbcml2. The parameter values of the IBPS were selected as ma = mb = 50, L = 2gr = 2k = 2l = 2, δ1 = sin(2t), δ2 = sin(3t) + L. Let ξ1=[θ1,θ2]T,ξ2=[θ˙1,θ˙2]T, the reference signal ξd=[ξd1,ξd2]T, ξd1=-ξd2=π6.28[sin(t)+0.3sin(3t)]. The initial angles and angular velocities were set as ξ1(0)=[0.8, -0.8]T and ξ2(0)=[0, 0]T. The control parameters were chosen as m1=m2=5,b-1=b-2=3, k1 = k2 = 10, and c1 = c2 = 5. Let f1(ξ-,t)=f1(θ1,θ˙1,θ2,θ˙2,t)+Δ1(ςu1(t)) and f2(ξ-,t)=f2(θ1,θ˙1,θ2,θ˙2,t)+Δ2(ςu2(t)) and ξ-=[θ1,θ˙1,θ2,θ˙2]T. The fuzzy membership functions were defined as

φf1j=φf2j=exp (-(ξ--ξj)T(ξ--ξj)0.5),    (46)

where ξj=[-2+j,-2+j,-2+j,-2+j]T,j=1,2, and 3. The initial values of θ^f1(t) and θ^f2(t) were designed as zero, parameters ηF-, γF-, and γF- in Equations (21) and (37) were set as ηF-=10, γF-=0.01, and γF-=6. The parameters α and β in Equation (33) were designed as α = 100 and β = 0.7. The tracking errors were defined as e1 = θ1 − ξd1 and e2 = θ2 − ξd2, and the following prescribed performance boundary conditions were chosen:

{p1(t,0.4,-0.05,Te)<e1<p1(t,1.2,0.05,Te),p2(t,-1.2,-0.05,Te)<e2<p2(t,-0.4,0.05,Te),    (47)

where Te = 5. The method in Theorem 1 was denoted as the TPPC method (Equation 20) without learning. Based on the above designed parameters and initial values, the simulation results of the IBPS by using the TPPC method (Equation 20) without learning are shown in Figures 2AD.

FIGURE 1
www.frontiersin.org

Figure 1. The inverted double pendulum system.

FIGURE 2
www.frontiersin.org

Figure 2. (A) Tracking errors e1 and e2, (B) estimation of f1(ξ-,t) and f2(ξ-,t), (C) control inputs u1 and u2, and (D) states θ1,θ˙1,θ2, and θ˙2 by using the time prescribed performance control (TPPC) method (Equation 20) without learning.

Figure 2A shows that tracking errors e1 and e2 are limited within PPBs (Equation 47). However, it was found that φf-1T(ξ)θ^f-1 and φf-2T(ξ)θ^f-2 appeared as some large jumps after 10 s in Figure 2B, indicating that the parameter updating law (Equation 21) cannot ensure that the unknown functions f1(ξ-,t) and f2(ξ-,t) are accurately estimated. From Figure 2C, one finds that each jump of controllers u1 and u2 is consistent with each fluctuation of φf-1T(ξ)θ^f-1 and φf-2T(ξ)θ^f-2, which indicates that the TPPC method (Equation 20) without learning cannot overcome the influence of dead-zone inputs Γ1u1) and Γ2u2). Meanwhile, Figure 2D shows that states θ1,θ˙1,θ2, and θ˙2 are recurrent, which also confirms the conclusion of Theorem 1.

Before using the method in Theorem 2, we need to verify whether G(t)=t-τdtφF-T(ξ-)φF-(ξ-)dτμI is always true after time Te + τd. According to Remark 2, we only need to verify that the minimum singular value of G(t) is greater than zero. The method in Theorem 2 was denoted as the TPPC (Equation 20) method with learning and let τd = 3 and

λmin={0,t<Td+τd,λmin{G(t)},tTd+τd.    (48)

Figure 3 shows that the selected fuzzy basic vector φF-(ξ-) satisfies λmin{G(t)} > 0 for tTd + τd, which means that there is a constant μ such that G(t) ≥ μI holds. The simulation results of the IBPS by using the TPPC method (Equation 20) with learning are displayed in Figures 4AC. By comparing Figure 2A with Figure 4A, the tracking error control effect of the TPPC method (Equation 20) with learning is obviously better than that of the TPPC method (Equation 20) without learning after Te + τd. By comparing the estimation effect of unknown functions in Figures 2B, 4B, φf-1T(ξ)θ^f-1 and φf-2T(ξ)θ^f-2 do not show big fluctuation in Figure 4 by using the TPPC method (Equation 20) with learning and unknown functions f1(ξ-,t) and f2(ξ-,t) are estimated accurately. Figure 4C shows that controllers u1 and u2 were stable without large fluctuation by the TPPC method (Equation 20) with learning. Through the comparison of simulation results, the control effect of the TPPC method (Equation 20) with learning was obviously better than that of the TPPC method (Equation 20) without learning, which also confirmed the theoretical analysis of this paper.

FIGURE 3
www.frontiersin.org

Figure 3. The minimum singular value λmin of G(t).

FIGURE 4
www.frontiersin.org

Figure 4. (A) Tracking errors e1 and e2, (B) estimation of f1(ξ-,t) and f2(ξ-,t), and (C) control inputs u1 and u2 by using the TPPC method (Equation 20) with learning.

5. Conclusion

In this paper, a composite learning finite-time PPC method was proposed for uncertain nonlinear systems with dead-zone inputs. A finite-time performance function and a transformation function were introduced, which can ensure that the racking error can be limited within a predefined region at a settling time. Then, in order to improve the accurate estimation effect of the unknown function, a prediction error was defined and a corresponding composite learning parameter law was designed. The simulation comparison of IBPS showed the superiority of the control effect of the proposed composite learning finite-time PPC method. Meanwhile, one can notice the limitation of the proposed method in this paper is that all states must be measurable. If some states are unmeasurable, the results of this paper cannot be obtained. Therefore, it is necessary to further study the problem of effective estimation of uncertainties based on partially measurable states.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

All authors listed have made a substantial, direct, and intellectual contribution to the work and approved it for publication.

Funding

This work was supported by the Natural Science Foundation for the Higher Education Institutions of Anhui Province of China (KJ2021A0965).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Lymperopoulos G, Ioannou P. Building temperature regulation in a multi-zone HVAC system using distributed adaptive control. Energy Build. (2020) 215:109825. doi: 10.1016/j.enbuild.2020.109825

CrossRef Full Text | Google Scholar

2. Wang P, Wen G, Huang T, Yu W, Lv Y. Asymptotical neuro-adaptive consensus of multi-agent systems with a high dimensional leader and directed switching topology. IEEE Trans Neural Netw Learn Syst. (2022) doi: 10.1109/TNNLS.2022.3156279 [Epub ahead of print].

PubMed Abstract | CrossRef Full Text | Google Scholar

3. El Hamidi K, Mjahed M, El Kari A, Ayad H. Adaptive control using neural networks and approximate models for nonlinear dynamic systems. Model Simul Eng. (2020) 2020:8642915. doi: 10.1155/2020/8642915

CrossRef Full Text | Google Scholar

4. Gammons MV, Renko M, Flack JE, Mieszczanek J, Bienz M. Feedback control of Wnt signaling based on ultrastable histidine cluster co-aggregation between Naked/NKD and Axin. Elife. (2020) 9:e59879. doi: 10.7554/eLife.59879

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Nguyen AT, Coutinho P, Guerra TM, Palhares R, Pan J. Constrained output-feedback control for discrete-time fuzzy systems with local nonlinear models subject to state and input constraints. IEEE Trans Cybern. (2020) 51:4673–84. doi: 10.1109/TCYB.2020.3009128

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Wang P, Yu W, Wen G, Yu X, Huang T. A chattering free consensus controller for multiple Lur systems with a non-autonomous leader and directed switching topology. Sci China Technol Sci. (2022). doi: 10.1007/s11431-022-2175-5

CrossRef Full Text

7. Wang P, Wen G, Huang T, Yu W, Ren Y. Observer-based consensus protocol for directed switching networks with a leader of nonzero inputs. IEEE Trans Cybern. (2020) 52: 630–40. doi: 10.1109/TCYB.2020.2981518

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Wang S, Bekiros S, Yousefpour A, He S, Castillo O, Jahanshahi H. Synchronization of fractional time-delayed financial system using a novel type-2 fuzzy active control method. Chaos Solitons Fractals. (2020) 136:109768. doi: 10.1016/j.chaos.2020.109768

CrossRef Full Text | Google Scholar

9. Singh J, Jafari S, Khalaf A, Pham V, Roy B. A modified chaotic oscillator with megastability and variable boosting and its synchronisation using contraction theory-based control which is better than backstepping and nonlinear active control. Pramana. (2020) 94:1–14. doi: 10.1007/s12043-020-01993-y

CrossRef Full Text | Google Scholar

10. Li X, Peng D, Cao J. Lyapunov stability for impulsive systems via event-triggered impulsive control. IEEE Trans Automat Contr. (2020) 65:4908–3. doi: 10.1109/TAC.2020.2964558

CrossRef Full Text | Google Scholar

11. Stamov T, Stamova I. Design of impulsive controllers and impulsive control strategy for the Mittag-Leffler stability behavior of fractional gene regulatory networks. Neurocomputing. (2021) 424:54–62. doi: 10.1016/j.neucom.2020.10.112

CrossRef Full Text | Google Scholar

12. Bechlioulis CP, Rovithakis GA. Robust adaptive control of feedback linearizable MIMO nonlinear systems with prescribed performance. IEEE Trans Automat Contr. (2008) 53:2090–9. doi: 10.1109/TAC.2008.929402

CrossRef Full Text | Google Scholar

13. Niu B, Zhao J. Barrier Lyapunov functions for the output tracking control of constrained nonlinear switched systems. Syst Control Lett. (2013) 62:963–71. doi: 10.1016/j.sysconle.2013.07.003

CrossRef Full Text | Google Scholar

14. Li Y, Tong S. Prescribed performance adaptive fuzzy output-feedback dynamic surface control for nonlinear large-scale systems with time delays. Inf Sci. (2015) 292:125–42. doi: 10.1016/j.ins.2014.08.060

CrossRef Full Text | Google Scholar

15. Yao B, Tomizuka M. Smooth robust adaptive sliding mode control of manipulators with guaranteed transient performance. In: Proceedings of 1994 American Control Conference-ACC '94. Baltimore, MD: IEEE (1996).

Google Scholar

16. Zhi D, Xu L. Direct power control of DFIG with constant switching frequency and improved transient performance. IEEE Trans Energy Convers. (2007) 22:110–8. doi: 10.1109/TEC.2006.889549

CrossRef Full Text | Google Scholar

17. Kostarigka AK, Doulgeri Z, Rovithakis GA. Prescribed performance tracking for flexible joint robots with unknown dynamics and variable elasticity. Automatica. (2013) 49:1137–47. doi: 10.1016/j.automatica.2013.01.042

CrossRef Full Text | Google Scholar

18. Zhang L, Tong S, Li Y. Prescribed performance adaptive fuzzy output-feedback control of uncertain nonlinear systems with unmodeled dynamics. Nonlinear Dyn. (2014) 77:1653–65. doi: 10.1007/s11071-014-1407-0

CrossRef Full Text | Google Scholar

19. Zhang G, Cheng D. Observer-based neuro-adaptive prescribed performance control of nonstrict feedback systems and its application. Optik. (2019) 181:264–77. doi: 10.1016/j.ijleo.2018.12.018

CrossRef Full Text | Google Scholar

20. Bu X, Wu X, Huang J, Wei D. A guaranteed transient performance-based adaptive neural control scheme with low-complexity computation for flexible air-breathing hypersonic vehicles. Nonlinear Dyn. (2016) 84:2175–94. doi: 10.1007/s11071-016-2637-0

CrossRef Full Text | Google Scholar

21. Xiang W, Liu H. Fuzzy adaptive prescribed performance tracking control for uncertain nonlinear systems with unknown control gain signs. IEEE Access. (2019) 7:149867–77. doi: 10.1109/ACCESS.2019.2946601

CrossRef Full Text | Google Scholar

22. Liu Y, Liu X, Jing Y. Adaptive fuzzy finite-time stability of uncertain nonlinear systems based on prescribed performance. Fuzzy Sets Syst. (2019) 374:23–39. doi: 10.1016/j.fss.2018.12.015

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Xiang W, Sun Y, Li N, Yang C. Uncertain chaotic gyros synchronization using adaptive fuzzy prescribed performance control with unknown dead-zone input. Int J Innovat Comput Inf Control. (2017) 13:429–440. doi: 10.1155/2017/4386515

CrossRef Full Text

24. Wang C, Hill DJ. Learning from neural control. IEEE Trans Neural Netw. (2006) 17:130–46. doi: 10.1109/TNN.2005.860843

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Jamil ARM, Ganguly KK, Nower N. Adaptive traffic signal control system using composite reward architecture based deep reinforcement learning. IET Intell Transport Syst. (2021) 14:2030–41. doi: 10.1049/iet-its.2020.0443

CrossRef Full Text | Google Scholar

26. Pan Y, Zhou Y, Sun T, Er MJ. Composite adaptive fuzzy H tracking control of uncertain nonlinear systems. Neurocomputing. (2013) 99:15–24. doi: 10.1016/j.neucom.2012.05.011

CrossRef Full Text | Google Scholar

27. Pan Y, Yu H. Composite learning from adaptive dynamic surface control. IEEE Trans Automat Contr. (2015) 61:2603–9. doi: 10.1109/TAC.2015.2495232

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Soukkou Y, Labiod S, Tadjine M. Composite adaptive dynamic surface control of nonlinear systems in parametric strict-feedback form. Trans Inst Meas Control. (2018) 40:1127–35. doi: 10.1177/0142331216675672

CrossRef Full Text | Google Scholar

29. Liu H, Pan Y, Cao J. Composite learning adaptive dynamic surface control of fractional-order nonlinear systems. IEEE Trans Cybern. (2019) 50:2557–67. doi: 10.1109/TCYB.2019.2938754

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Liu H, Pan Y, Li S, Chen Y. Adaptive fuzzy backstepping control of fractional-order nonlinear systems. IEEE Trans Syst Man Cybern Syst. (2017) 47:2209–17. doi: 10.1109/TSMC.2016.2640950

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Pan Y, Sun T, Yu H. Composite adaptive dynamic surface control using online recorded data. Int J Robust Nonlinear Control. (2016) 26:3921–36. doi: 10.1002/rnc.3541

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Pan Y, Yu H. Composite learning robot control with guaranteed parameter convergence. Automatica. (2018) 89:398–406. doi: 10.1016/j.automatica.2017.11.032

CrossRef Full Text | Google Scholar

33. Wang M, Yang A. Dynamic learning from adaptive neural control of robot manipulators with prescribed performance. IEEE Trans Syst Man Cybern Syst. (2017) 47:2244–55. doi: 10.1109/TSMC.2016.2645942

CrossRef Full Text | Google Scholar

34. Hu J, Zhang H. Immersion and invariance based command-filtered adaptive backstepping control of VTOL vehicles. Automatica. (2013) 49:2160–7. doi: 10.1016/j.automatica.2013.03.019

CrossRef Full Text | Google Scholar

35. Lin CM, Chung CC. Fuzzy brain emotional learning control system design for nonlinear systems. Int J Fuzzy Syst. (2015) 17:117–28. doi: 10.1007/s40815-015-0020-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: nonlinear system, performance function, partial persistent excitation, dead-zone input, finite-time

Citation: Zhu F and Li P (2022) Composite fuzzy learning finite-time prescribed performance control of uncertain nonlinear systems with dead-zone inputs. Front. Appl. Math. Stat. 8:1041588. doi: 10.3389/fams.2022.1041588

Received: 11 September 2022; Accepted: 04 October 2022;
Published: 28 October 2022.

Edited by:

Peijun Wang, Anhui Normal University, China

Reviewed by:

Yude Ji, Hebei University of Science and Technology, China
Taiyan Jing, Henan Polytechnic University, China
Yufeng Yang, Xi'an University of Technology, China

Copyright © 2022 Zhu and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Fang Zhu, zhufang198305@126.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.