State estimation for dynamic systems with higher-order autoregressive moving average non-Gaussian noise

Zhang, Guanghua; Zeng, Linghao; Lian, Feng; Liu, Xinqiang; Fu, Na; Dai, Shasha

doi:10.3389/fenrg.2022.990267

ORIGINAL RESEARCH article

Front. Energy Res., 05 September 2022

Sec. Smart Grids

Volume 10 - 2022 | https://doi.org/10.3389/fenrg.2022.990267

This article is part of the Research Topic: Future Electricity System Based on Energy Internet: Energy storage system design, Optimal Scheduling, Security, Attack Model and Countermeasures

Guanghua Zhang¹

Linghao Zeng²

Feng Lian¹*

Xinqiang Liu³

Na Fu⁴

Shasha Dai⁵

¹School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, China
²School of Economics and Management, Chang’an University, Xi’an, China
³Beijing Institute of Electronic System Engineering, Beijing, China
⁴State Key Laboratory of Astronautic Dynamics, Xi’an Satellite Control Center, Xi’an, China
⁵Xi’an Satellite Control Center, Xi’an, China

The classical Kalman filter is a very important state estimation approach, which has been widely used in many engineering applications. The Kalman filter is optimal for linear dynamic systems with independent Gaussian noises. However, the independence and Gaussian assumptions may not be satisfied in practice. On the one hand, modeling physical systems usually results in discrete-time state-space models with correlated process and measurement noises. On the other hand, the noise is non-Gaussian when the system is disturbed by heavy-tailed noise. In this case, the performance of the Kalman filter will deteriorate, or even diverge. This paper is devoted to addressing the state estimation problem of linear dynamic systems with high-order autoregressive moving average (ARMA) non-Gaussian noise. First, a triplet Markov model is introduced to model the system with high-order ARMA noise, since this model relaxes the independence assumption of the hidden Markov model. Then, a new filter is derived based on correntropy, instead of the commonly used minimum mean square error (MMSE), to deal with non-Gaussian noise. Unlike the MMSE, which uses only second-order statistics of error, correntropy can capture second-order and higher-order statistics. Finally, simulation results verify the effectiveness of the proposed algorithm.

1 Introduction

State estimation is an important problem in many engineering applications, such as the energy internet, system control, and tracking [Zandavi and Chung (2019); Zhang et al. (2022)]. Each of these applications is essentially a dynamic system, which is usually described by a state-space model. The hidden Markov model (HMM) is one of the most commonly used state-space models (Zhang et al. (2018)). For the linear case, the state estimation problem is generally solved by the well-known Kalman filter (KF) (Kalman (1960)), which is an optimal filter in the minimum mean square error (MMSE) sense. The MMSE is one of the most commonly used cost functions in the case of Gaussian noise, and the MMSE approach yields the estimator that minimizes the mean square error. In addition, a large number of nonlinear filters have been proposed to solve nonlinear estimation problems, such as the extended Kalman filter, unscented Kalman filter, cubature Kalman filter, and particle filter, to name but a few (Anderson and Moore (2012)).

Although the KF generally performs well, it has strict requirements: the process and measurement noises of the dynamic system must be independent and Gaussian. However, the independence and Gaussian assumptions do not always hold in practice (Zhang G. et al. (2021)). On the one hand, the noise of most dynamic systems is in fact correlated. Research has shown that modeling physical systems usually results in discrete-time state-space models with correlated process and measurement noises, and some practical applications are explained in Saha and Gustafsson (2012). In addition, the dynamic and measurement noises may even be high-order (i.e., multi-step) correlated in some severe environments (Zhang D. et al. (2021)). On the other hand, the main reason for using the Gaussian assumption is that it is mathematically simple, but in fact, dynamic systems are usually disturbed by heavy-tailed impulsive noise (Roth et al. (2013)). When the independence and Gaussian assumptions are not satisfied, the KF may fail to output reliable estimation results.

To deal with correlated noise, the traditional method is to reconstruct an HMM by prewhitening, after which the classical KF can be used to estimate the state (Bar-Shalom et al. (2001)). Another solution is to characterize dynamic systems with correlated noise through more flexible state-space models, such as the pairwise Markov model (PMM) (Pieczynski and Desbouvries (2003)) and the triplet Markov model (TMM) (Ait-El-Fquih and Desbouvries (2006)). In the PMM, the state and measurement as a whole are regarded as a Markov process, which improves the ability to model complex dynamic systems. In the TMM, an auxiliary variable is introduced to completely describe the dynamic system. This auxiliary variable can play a significant role in some engineering applications. For example, it can characterize the uncertainty of parameters, non-stationarity, and error sources. It has been proved that the TMM is more general than the HMM and the PMM, and its structural advantages make it preferable in addressing some real-world applications, such as image segmentation (Derrode and Pieczynski (2004)), speech processing (Ait El Fquih and Desbouvries (2005)), and target tracking (Zhang et al. (2017); Lehmann and Pieczynski (2020, 2021)).

For non-Gaussian noise, several approaches have been proposed, which fall mainly into three categories (Izanloo et al. (2016)). The first replaces the Gaussian distribution with a heavy-tailed distribution (Huang et al. (2017)); the Student’s t distribution is one of the most commonly used. The main disadvantage of heavy-tailed distributions is that they are usually analytically intractable, so the related estimation approaches have no closed-form solution. The second is the multiple model technique, in which non-Gaussian noise is represented as a finite sum of Gaussian distributions (Shan et al. (2021)). The main difficulty of this approach is designing the model set reasonably, and its disadvantage is that the computational cost increases sharply with the number of models. The third is the Monte Carlo approach, in which a set of weighted random particles is employed to characterize the state (Liu et al. (2018)). Generally, sampling-based algorithms can be categorized into deterministic sampling methods and random sampling methods. In particular, in random sampling methods, a sufficient number of particles can approximate the true state with arbitrary precision, at the cost of expensive computation.

In the past few years, correntropy-based filtering has become an important direction for solving the state estimation problem of dynamic systems with non-Gaussian noise (Kulikova (2017); Chen et al. (2017)). In information theory, correntropy is a mathematical tool for measuring the similarity of two random variables. Unlike the commonly used MMSE cost function, which uses only second-order statistics of the error, correntropy captures second-order and higher-order information, and is more suitable for non-Gaussian noise, such as heavy-tailed impulsive noise. Several filtering algorithms based on correntropy have been designed in the framework of HMMs, and they are more robust to non-Gaussian noise than the KF and its variants.

In this paper, we are devoted to addressing the state estimation problem of linear dynamic systems with high-order autoregressive moving average (ARMA) non-Gaussian noise. A new Kalman-like filter is developed in the framework of the TMM based on correntropy. First, we resort to a linear TMM to describe dynamic systems with high-order ARMA noise, since the TMM is more general than the HMM. Second, based on the model, a new Kalman-like filter is derived by using correntropy cost function, instead of the commonly used MMSE cost function. Because correntropy can capture not only second-order but also higher-order statistics of error, the proposed algorithm is more robust to non-Gaussian noise than the traditional filter. Finally, simulation results show the effectiveness of the proposed algorithm.

The rest of the paper is organized as follows. Section 2 presents the modeling of linear dynamic systems with high-order ARMA noise. Section 3 derives a new Kalman-like filter by using the correntropy cost function in the framework of the TMM. In Section 4, we validate the proposed algorithm via simulations. Finally, the conclusion is provided in Section 5.

2 Modeling of linear dynamic systems with high-order ARMA noise

2.1 Linear hidden Markov model

Consider the following linear dynamic system

\begin{cases} x_{k + 1} = F_{k} x_{k} + G_{k} w_{k} \\ z_{k} = H_{k} x_{k} + v_{k} \end{cases} (1)

where k is the time index, $x_{k} \in R^{n_{x}}$ is the state vector of dimension n_x, F_k is the transition matrix, G_k is the process noise matrix, $w_{k} \sim N (0, Q_{k})$ is the process noise, $z_{k} \in R^{n_{z}}$ is the measurement vector of dimension n_z, H_k is the measurement matrix, and $v_{k} \sim N (0, R_{k})$ is the measurement noise. $N (m, P)$ denotes a Gaussian distribution with mean vector m and covariance matrix P.

In general, the noise sequences $w = {\{w_{k}\}}_{k \in \mathbb{N}}$ and $v = {\{v_{k}\}}_{k \in \mathbb{N}}$ are each assumed to be independent over time, mutually independent, and independent of the initial state $x_{0} \sim N ({\hat{x}}_{0}, P_{0})$. Then the state estimate can be obtained by the classical KF, which is an optimal filter in the MMSE sense. However, the independence and Gaussian assumptions of the HMM do not always hold in practice, for example in dynamic systems with high-order ARMA non-Gaussian noise. In this case, the KF may not output reliable estimation results.

2.2 Linear triplet Markov model for dynamic systems with high-order ARMA noise

2.2.1 Linear triplet Markov chain model

We resort to a linear TMM to describe a linear HMM with correlated noise. Let $x_{k} \in R^{n_{x}}$ be the state vector, $z_{k} \in R^{n_{z}}$ be the measurement vector, $r_{k} \in R^{n_{r}}$ be an auxiliary variable, and $ζ_{k} = {[x_{k}^{T}, r_{k}^{T}, z_{k - 1}^{T}]}^{T}$. If $ζ = {\{ζ_{k}\}}_{k \in \mathbb{N}}$ is a Markov process, the following system is called a linear TMM (Ait-El-Fquih and Desbouvries (2006)):

\underbrace{\begin{bmatrix} x_{k + 1} \\ r_{k + 1} \\ z_{k} \end{bmatrix}}_{ζ_{k + 1}} = \underbrace{\begin{bmatrix} F_{k}^{x x} & F_{k}^{x r} & F_{k}^{x z} \\ F_{k}^{r x} & F_{k}^{r r} & F_{k}^{r z} \\ F_{k}^{z x} & F_{k}^{z r} & F_{k}^{z z} \end{bmatrix}}_{F_{k}} \begin{bmatrix} x_{k} \\ r_{k} \\ z_{k - 1} \end{bmatrix} + \underbrace{\begin{bmatrix} ξ_{k}^{x} \\ ξ_{k}^{r} \\ ξ_{k}^{z} \end{bmatrix}}_{ξ_{k}} (2)

where $ξ = {\{ξ_{k}\}}_{k \in \mathbb{N}}$ is zero-mean white noise and independent of the initial state ζ₀.

2.2.2 Modeling high-order ARMA noise using TMM

In this section, we utilize a linear TMM to model dynamic systems with high-order ARMA noise (Zhang D. et al. (2021)). The TMM provides a general framework for these typical stochastic systems.

(1) High-Order ARMA Process Noise.

High-order ARMA process noise can usually be written as the following difference equation:

w_{k} = - \sum_{i = 1}^{p^{w}} α_{i}^{w} w_{k - i} + \sum_{i = 0}^{q^{w}} β_{i}^{w} ξ_{k - i}^{w} (3)

where $ξ_{k}^{w}$ is white noise. Model (3) is a typical high-order ARMA model, in which p^w is the autoregressive order, q^w is the moving average order, and coefficient parameters $α_{i}^{w}$ and $β_{i}^{w}$ are determined by the spectral factor $H_{k}^{w} (z)$ of the power spectral density $Φ_{k}^{w} (z)$ of the process noise w_k.

Suppose $Φ_{k}^{w} (z)$ is a rational spectrum. According to the spectral decomposition theorem, there is a spectral factor satisfying

Φ_{k}^{w} (z) = H_{k}^{w} (z) H_{k}^{w} {(z)}^{*} (4)

where ${(\cdot)}^{*}$ denotes the complex conjugate transpose, and $H_{k}^{w} (z)$ can be written as

H_{k}^{w} (z) = C_{k}^{w} {(z I - A_{k}^{w})}^{- 1} B_{k}^{w} + D_{k}^{w} (5)

Then, high-order ARMA process noise can be formulated by

\begin{cases} x_{k + 1}^{w} = A_{k}^{w} x_{k}^{w} + B_{k}^{w} ξ_{k}^{w} \\ w_{k} = C_{k}^{w} x_{k}^{w} + D_{k}^{w} ξ_{k}^{w} \end{cases} (6)
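The step from the difference equation (3) to the state-space form (6) can be illustrated numerically. The following is a minimal sketch for a scalar ARMA process, assuming q^w ≤ p^w and a controllable canonical realization (one of many equivalent choices; the paper does not prescribe a particular one). The function names and the example coefficients are hypothetical and chosen only for illustration.

```python
import numpy as np

def arma_to_state_space(alpha, beta):
    """Controllable canonical realization (A, B, C, D) of the scalar ARMA model
    w_k = -sum_{i=1}^p alpha_i w_{k-i} + sum_{i=0}^q beta_i xi_{k-i}, assuming q <= p."""
    alpha = np.asarray(alpha, dtype=float)  # [alpha_1, ..., alpha_p]
    beta = np.asarray(beta, dtype=float)    # [beta_0, ..., beta_q]
    p = alpha.size
    if beta.size > p + 1:
        raise ValueError("this sketch assumes q <= p")
    b = np.zeros(p + 1)
    b[:beta.size] = beta                    # pad the MA coefficients with zeros
    A = np.zeros((p, p))
    A[0, :] = -alpha                        # first row carries the AR coefficients
    A[1:, :-1] = np.eye(p - 1)              # shift register for the remaining states
    B = np.zeros((p, 1))
    B[0, 0] = 1.0
    C = (b[1:] - b[0] * alpha).reshape(1, p)
    D = np.array([[b[0]]])
    return A, B, C, D

def simulate_direct(alpha, beta, xi):
    """Run the difference equation (3) with zero initial conditions."""
    w = np.zeros_like(xi)
    for k in range(len(xi)):
        ar = sum(alpha[i - 1] * w[k - i] for i in range(1, len(alpha) + 1) if k - i >= 0)
        ma = sum(beta[i] * xi[k - i] for i in range(len(beta)) if k - i >= 0)
        w[k] = -ar + ma
    return w

def simulate_state_space(A, B, C, D, xi):
    """Run the state-space model (6) with zero initial state."""
    x = np.zeros((A.shape[0], 1))
    w = np.zeros_like(xi)
    for k, e in enumerate(xi):
        w[k] = (C @ x + D * e).item()
        x = A @ x + B * e
    return w

# Illustrative coefficients (not taken from the paper).
alpha = [-0.5, 0.06]
beta = [1.0, 0.4]
rng = np.random.default_rng(0)
xi = rng.standard_normal(200)
A, B, C, D = arma_to_state_space(alpha, beta)
w_direct = simulate_direct(alpha, beta, xi)
w_ss = simulate_state_space(A, B, C, D, xi)
print("max deviation:", np.max(np.abs(w_direct - w_ss)))
```

Both simulations are driven by the same white-noise sequence, so the printed deviation indicates how closely the realization reproduces the difference equation.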

If the process noise in model (1) is high-order ARMA noise, it can be described by model (6). In this case, $\{x_{k}\}$ is no longer a Markov process, but $\{(x_{k}, x_{k}^{w})\}$ is a Markov process. Let $ζ_{k} = (x_{k}, r_{k} = x_{k}^{w}, z_{k - 1})$. Model (1) with (6) can be written in the form of the linear TMM (2), i.e.,

[\begin{matrix} x_{k + 1} \\ x_{k + 1}^{w} \\ z_{k} \end{matrix}] = [\begin{matrix} F_{k} & G_{k} C_{k}^{w} & 0 \\ 0 & A_{k}^{w} & 0 \\ H_{k} & 0 & 0 \end{matrix}] [\begin{matrix} x_{k} \\ x_{k}^{w} \\ z_{k - 1} \end{matrix}] + [\begin{matrix} G_{k} D_{k}^{w} ξ_{k}^{w} \\ B_{k}^{w} ξ_{k}^{w} \\ v_{k} \end{matrix}] (7)

Assuming white noise $ξ_{k}^{w} \sim N (0, Q_{k})$, the noise covariance matrix of model (7) is

Q_{k} = [\begin{matrix} G_{k} D_{k}^{w} Q_{k} {(D_{k}^{w})}^{T} G_{k}^{T} & G_{k} D_{k}^{w} Q_{k} {(B_{k}^{w})}^{T} & 0 \\ B_{k}^{w} Q_{k} {(D_{k}^{w})}^{T} G_{k}^{T} & B_{k}^{w} Q_{k} {(B_{k}^{w})}^{T} & 0 \\ 0 & 0 & R_{k} \end{matrix}] (8)

(2) High-order ARMA Measurement Noise.

High-order ARMA measurement noise can likewise be written as the following difference equation:

v_{k} = - \sum_{i = 1}^{p^{v}} a_{i}^{v} v_{k - i} + \sum_{i = 0}^{q^{v}} b_{i}^{v} ξ_{k - i}^{v} (9)

where $ξ_{k}^{v}$ is white noise, p^v and q^v are autoregressive order and moving average order, respectively. If the spectral density of high-order ARMA measurement noise is $Φ_{k}^{v} (z)$ and the corresponding spectral factor is $H_{k}^{v} (z)$ , similar to model (6), v_k can be modeled as follows

\begin{cases} x_{k + 1}^{v} = A_{k}^{v} x_{k}^{v} + B_{k}^{v} ξ_{k}^{v} \\ v_{k} = C_{k}^{v} x_{k}^{v} + D_{k}^{v} ξ_{k}^{v} \end{cases} (10)

If the measurement noise in model (1) is high-order ARMA noise, it can be described by model (10). Let $ζ_{k} = (x_{k}, r_{k} = x_{k}^{v}, z_{k - 1})$. Model (1) with (10) can be written in the form of the linear TMM (2), i.e.,

[\begin{matrix} x_{k + 1} \\ x_{k + 1}^{v} \\ z_{k} \end{matrix}] = [\begin{matrix} F_{k} & 0 & 0 \\ 0 & A_{k}^{v} & 0 \\ H_{k} & C_{k}^{v} & 0 \end{matrix}] [\begin{matrix} x_{k} \\ x_{k}^{v} \\ z_{k - 1} \end{matrix}] + [\begin{matrix} G_{k} w_{k} \\ B_{k}^{v} ξ_{k}^{v} \\ D_{k}^{v} ξ_{k}^{v} \end{matrix}] (11)

Assuming white noise $ξ_{k}^{v} \sim N (0, R_{k})$, the noise covariance matrix of model (11) is

Q_{k} = [\begin{matrix} G_{k} Q_{k} G_{k}^{T} & 0 & 0 \\ 0 & B_{k}^{v} R_{k} {(B_{k}^{v})}^{T} & B_{k}^{v} R_{k} {(D_{k}^{v})}^{T} \\ 0 & D_{k}^{v} R_{k} {(B_{k}^{v})}^{T} & D_{k}^{v} R_{k} {(D_{k}^{v})}^{T} \end{matrix}] (12)

(3) High-Order ARMA Process and Measurement Noises.

If the process noise and measurement noise are both high-order ARMA noises, they can be described by model (6) and model (10), respectively. Let $ζ_{k} = (x_{k}, r_{k} = (x_{k}^{w}, x_{k}^{v}), z_{k - 1})$. Model (1) with (6) and (10) can be written in the form of the linear TMM (2), i.e.,

\begin{bmatrix} x_{k + 1} \\ x_{k + 1}^{w} \\ x_{k + 1}^{v} \\ z_{k} \end{bmatrix} = \begin{bmatrix} F_{k} & G_{k} C_{k}^{w} & 0 & 0 \\ 0 & A_{k}^{w} & 0 & 0 \\ 0 & 0 & A_{k}^{v} & 0 \\ H_{k} & 0 & C_{k}^{v} & 0 \end{bmatrix} \begin{bmatrix} x_{k} \\ x_{k}^{w} \\ x_{k}^{v} \\ z_{k - 1} \end{bmatrix} + \begin{bmatrix} G_{k} D_{k}^{w} ξ_{k}^{w} \\ B_{k}^{w} ξ_{k}^{w} \\ B_{k}^{v} ξ_{k}^{v} \\ D_{k}^{v} ξ_{k}^{v} \end{bmatrix} (13)

Assuming white noises $ξ_{k}^{w} \sim N (0, Q_{k})$ and $ξ_{k}^{v} \sim N (0, R_{k})$, the noise covariance matrix of model (13) is

Q_{k} = [\begin{matrix} G_{k} D_{k}^{w} Q_{k} {(D_{k}^{w})}^{T} G_{k}^{T} & G_{k} D_{k}^{w} Q_{k} {(B_{k}^{w})}^{T} & 0 & 0 \\ B_{k}^{w} Q_{k} {(D_{k}^{w})}^{T} G_{k}^{T} & B_{k}^{w} Q_{k} {(B_{k}^{w})}^{T} & 0 & 0 \\ 0 & 0 & B_{k}^{v} R_{k} {(B_{k}^{v})}^{T} & B_{k}^{v} R_{k} {(D_{k}^{v})}^{T} \\ 0 & 0 & D_{k}^{v} R_{k} {(B_{k}^{v})}^{T} & D_{k}^{v} R_{k} {(D_{k}^{v})}^{T} \end{matrix}] (14)
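The block structure of (13) and (14) can be assembled mechanically. The sketch below is one possible way to do so in NumPy, assuming time-invariant matrices and assuming that $ξ_{k}^{w}$ and $ξ_{k}^{v}$ are mutually independent (which is what the zero off-diagonal blocks in (14) imply). The function name `build_tmm` and all numerical values are hypothetical.

```python
import numpy as np

def build_tmm(F, G, H, Aw, Bw, Cw, Dw, Av, Bv, Cv, Dv, Q, R):
    """Assemble the transition matrix of (13) and the noise covariance of (14)."""
    nx, nz = F.shape[0], H.shape[0]
    nw, nv = Aw.shape[0], Av.shape[0]
    mw, mv = Q.shape[0], R.shape[0]          # dimensions of xi^w and xi^v
    Z = np.zeros
    F_tmm = np.block([
        [F,           G @ Cw,      Z((nx, nv)), Z((nx, nz))],
        [Z((nw, nx)), Aw,          Z((nw, nv)), Z((nw, nz))],
        [Z((nv, nx)), Z((nv, nw)), Av,          Z((nv, nz))],
        [H,           Z((nz, nw)), Cv,          Z((nz, nz))],
    ])
    # The stacked noise in (13) equals Gamma @ [xi^w; xi^v].
    Gamma = np.block([
        [G @ Dw,      Z((nx, mv))],
        [Bw,          Z((nw, mv))],
        [Z((nv, mw)), Bv],
        [Z((nz, mw)), Dv],
    ])
    cov_xi = np.block([[Q, Z((mw, mv))], [Z((mv, mw)), R]])
    Q_tmm = Gamma @ cov_xi @ Gamma.T         # reproduces the blocks of (14)
    return F_tmm, Q_tmm

# Small illustrative system (values are not from the paper).
F = np.array([[1.0, 1.0], [0.0, 1.0]])
G = np.array([[0.5], [1.0]])
H = np.array([[1.0, 0.0]])
Aw, Bw, Cw, Dw = np.array([[0.5]]), np.array([[1.0]]), np.array([[0.2]]), np.array([[1.0]])
Av, Bv, Cv, Dv = np.array([[0.3]]), np.array([[1.0]]), np.array([[0.1]]), np.array([[1.0]])
Q = np.array([[0.01]])
R = np.array([[0.04]])

F_tmm, Q_tmm = build_tmm(F, G, H, Aw, Bw, Cw, Dw, Av, Bv, Cv, Dv, Q, R)
print(F_tmm.shape, Q_tmm.shape)
```

Forming the covariance as a product with a single input matrix avoids typing the sixteen blocks of (14) by hand and keeps the result symmetric by construction.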

2.3 Restoration algorithm

Let $x_{k}^{*} = (x_{k}, r_{k})$. Then model (2) can be written as

[\begin{matrix} x_{k + 1}^{*} \\ z_{k} \end{matrix}] = [\begin{matrix} F_{k}^{x^{*} x^{*}} & F_{k}^{x^{*} z} \\ F_{k}^{z x^{*}} & F_{k}^{z z} \end{matrix}] [\begin{matrix} x_{k}^{*} \\ z_{k - 1} \end{matrix}] + [\begin{matrix} ξ_{k}^{x^{*}} \\ ξ_{k}^{z} \end{matrix}] (15)

where the initial state $x_{0}^{*}$ and noise ξ_k are

x_{0}^{*} \sim N ({\hat{x}}_{0}^{*}, P_{0}^{*}), \quad ξ_{k} \sim N \left(0, \underbrace{\begin{bmatrix} Q_{k}^{x^{*} x^{*}} & Q_{k}^{x^{*} z} \\ Q_{k}^{z x^{*}} & Q_{k}^{z z} \end{bmatrix}}_{Q_{k}}\right) (16)

For model (15) with (16), a Kalman-like filter, called the triplet Kalman filter (TKF), has been derived to estimate the state $x_{k}^{*}$. For convenience, the recursive equations are summarized as follows (Ait-El-Fquih and Desbouvries (2006)).

Initialization:

{\hat{x}}_{0 | 0}^{*} = {\hat{x}}_{0}^{*}, P_{0 | 0}^{*} = P_{0}^{*} (17)

{\hat{F}}_{k}^{x^{*} x^{*}} = F_{k}^{x^{*} x^{*}} - Q_{k}^{x^{*} z} {(Q_{k}^{z z})}^{- 1} F_{k}^{z x^{*}} (18)

{\hat{F}}_{k}^{x^{*} z} = F_{k}^{x^{*} z} - Q_{k}^{x^{*} z} {(Q_{k}^{z z})}^{- 1} F_{k}^{z z} (19)

{\hat{Q}}_{k}^{x^{*} x^{*}} = Q_{k}^{x^{*} x^{*}} - Q_{k}^{x^{*} z} {(Q_{k}^{z z})}^{- 1} Q_{k}^{z x^{*}} (20)

Prediction:

{\hat{x}}_{k | k - 1}^{*} = {\hat{F}}_{k - 1}^{x^{*} x^{*}} {\hat{x}}_{k - 1 | k - 1}^{*} + Q_{k - 1}^{x^{*} z} {(Q_{k - 1}^{z z})}^{- 1} z_{k - 1} + {\hat{F}}_{k - 1}^{x^{*} z} z_{k - 2} (21)

P_{k | k - 1}^{*} = {\hat{F}}_{k - 1}^{x^{*} x^{*}} P_{k - 1 | k - 1}^{*} {({\hat{F}}_{k - 1}^{x^{*} x^{*}})}^{T} + {\hat{Q}}_{k - 1}^{x^{*} x^{*}} (22)

Update:

e_{k} = z_{k} - F_{k - 1}^{z x^{*}} {\hat{x}}_{k | k - 1}^{*} - F_{k - 1}^{z z} z_{k - 1} (23)

R_{e, k} = F_{k - 1}^{z x^{*}} P_{k | k - 1}^{*} {(F_{k - 1}^{z x^{*}})}^{T} + Q_{k - 1}^{z z} (24)

K_{k} = P_{k | k - 1}^{*} {(F_{k - 1}^{z x^{*}})}^{T} R_{e, k}^{- 1} (25)

{\hat{x}}_{k | k}^{*} = {\hat{x}}_{k | k - 1}^{*} + K_{k} e_{k} (26)

P_{k | k}^{*} = (I - K_{k} F_{k}^{z x^{*}}) P_{k | k - 1}^{*} (27)
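The recursion (18) to (27) can be sketched in NumPy as follows. This is a minimal illustration, assuming time-invariant model blocks (so the k and k−1 indices on the matrices coincide) and an invertible $Q^{z z}$. The function name `tkf_step` and its argument names are hypothetical and not part of the original algorithm description.

```python
import numpy as np

def tkf_step(x_est, P_est, z_k, z_km1, z_km2, Fxx, Fxz, Fzx, Fzz, Qxx, Qxz, Qzz):
    """One prediction/update cycle of the triplet Kalman filter, following (18)-(27).

    x_est, P_est : previous estimate of x* and its covariance
    z_k, z_km1, z_km2 : measurements at times k, k-1 and k-2
    """
    S = Qxz @ np.linalg.inv(Qzz)
    Fxx_hat = Fxx - S @ Fzx                                   # (18)
    Fxz_hat = Fxz - S @ Fzz                                   # (19)
    Qxx_hat = Qxx - S @ Qxz.T                                 # (20), using Q^{zx*} = (Q^{x*z})^T

    x_pred = Fxx_hat @ x_est + S @ z_km1 + Fxz_hat @ z_km2    # (21)
    P_pred = Fxx_hat @ P_est @ Fxx_hat.T + Qxx_hat            # (22)

    e = z_k - Fzx @ x_pred - Fzz @ z_km1                      # (23)
    Re = Fzx @ P_pred @ Fzx.T + Qzz                           # (24)
    K = P_pred @ Fzx.T @ np.linalg.inv(Re)                    # (25)
    x_new = x_pred + K @ e                                    # (26)
    P_new = (np.eye(len(x_est)) - K @ Fzx) @ P_pred           # (27)
    return x_new, P_new

# Sanity check on a scalar model with uncorrelated noises and no dependence on
# past measurements, where the recursion reduces to the standard Kalman filter.
one, zero = np.array([[1.0]]), np.array([[0.0]])
x_new, P_new = tkf_step(
    x_est=np.array([0.0]), P_est=one,
    z_k=np.array([2.0]), z_km1=np.array([0.0]), z_km2=np.array([0.0]),
    Fxx=one, Fxz=zero, Fzx=one, Fzz=zero, Qxx=one, Qxz=zero, Qzz=one,
)
print(x_new, P_new)
```

In the scalar check, the predicted covariance is 2, the innovation covariance is 3, and the gain is 2/3, which is the same result a standard Kalman filter gives for this model.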

The TKF is also an optimal filter in the MMSE sense, and it generally performs well in Gaussian noise. However, its performance will deteriorate, or the filter may even diverge, when applied to non-Gaussian systems, since the TKF is derived under the MMSE criterion, which uses only second-order statistics of the error. To solve this problem, in the next section, a new filter is developed by using the correntropy cost function, which utilizes not only second-order but also higher-order statistical information.

3 Correntropy-based triplet Kalman filter

3.1 Correntropy

Correntropy is a very useful metric tool to measure the similarity of two random variables in information theory (Chen et al. (2017)). For variables X and Y, the correntropy is defined by

C (X, Y) = E [κ (X, Y)] = \iint κ (x, y) f_{X Y} (x, y) \, d x \, d y (28)

where $E [\cdot]$ is the expectation operator, $κ (\cdot, \cdot)$ is a kernel function, and $f_{X Y} (x, y)$ is the joint probability density function of X and Y. Generally, $f_{X Y} (x, y)$ is unknown, and only a finite number of samples ${\{(x_{i}, y_{i})\}}_{i = 1}^{N}$ is available. Thus, the correntropy can be estimated by

\hat{C} (X, Y) = \frac{1}{N} \sum_{i = 1}^{N} κ (x_{i}, y_{i}) . (29)

There are many options for the kernel function. In this paper, we choose the Gaussian kernel function

κ (x_{i}, y_{i}) = G_{σ} (x_{i} - y_{i}), (30)

where σ is the kernel size, and $G_{σ} (x_{i} - y_{i}) = \exp (- \frac{{‖x_{i} - y_{i}‖}^{2}}{2 σ^{2}})$ . The Gaussian kernel function is positive definite and bounded. When X = Y, it takes the maximum value.

Using the Taylor series expansion of the Gaussian kernel function, the correntropy can be written as

C (X, Y) = \sum_{n = 0}^{\infty} \frac{{(- 1)}^{n}}{2^{n} σ^{2 n} n!} E [{(X - Y)}^{2 n}] . (31)

It can be seen that the correntropy is in essence the weighted sum of all even-order moments of error. Compared with the MMSE, which uses only the second-order statistics of error, correntropy captures the second-order and higher-order statistics.
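The sample estimator (29) with the Gaussian kernel (30) takes only a few lines of code. The following sketch, for scalar samples, compares it with the mean square error on data containing a single large outlier; the function names and the data are hypothetical and serve only to illustrate the boundedness of the kernel.

```python
import numpy as np

def gaussian_kernel(e, sigma):
    """Gaussian kernel G_sigma(e) = exp(-e^2 / (2 sigma^2)) for scalar errors."""
    return np.exp(-np.square(e) / (2.0 * sigma**2))

def sample_correntropy(x, y, sigma):
    """Sample estimate (29) of the correntropy between paired scalar samples."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return float(np.mean(gaussian_kernel(x - y, sigma)))

# Illustrative data: 100 identical pairs, then one pair corrupted by an outlier.
x = np.zeros(100)
y = np.zeros(100)
y[0] = 1000.0

c_clean = sample_correntropy(x, x, sigma=1.0)
c_outlier = sample_correntropy(x, y, sigma=1.0)
mse_outlier = float(np.mean((x - y) ** 2))
print(c_clean, c_outlier, mse_outlier)
```

Because each kernel value lies between 0 and 1, a single outlier can change the sample correntropy by at most 1/N, whereas its contribution to the mean square error grows with the square of its magnitude.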

3.2 Main results

In this section, a new filter, called correntropy-based TKF (CTKF), is derived by using correntropy under TMM. For clarity, we first provide the main results, and then give the mathematical derivation.

The initialization and prediction steps of the CTKF are the same as those of the TKF, and its update step is summarized as follows:

e_{k} = z_{k} - F_{k - 1}^{z x^{*}} {\hat{x}}_{k | k - 1}^{*} - F_{k - 1}^{z z} z_{k - 1} (32)

λ_{k} = G_{σ} ({‖e_{k}‖}_{{(Q_{k}^{z z})}^{- 1}}) (33)

R_{e, k} = F_{k - 1}^{z x^{*}} P_{k | k - 1}^{*} {(F_{k - 1}^{z x^{*}})}^{T} + Q_{k - 1}^{z z} (34)

K_{k}^{λ} = λ_{k} P_{k | k - 1}^{*} {(F_{k - 1}^{z x^{*}})}^{T} R_{e, k}^{- 1} (35)

{\hat{x}}_{k | k}^{*} = {\hat{x}}_{k | k - 1}^{*} + K_{k}^{λ} e_{k} (36)

P_{k | k}^{*} = (I - K_{k}^{λ} F_{k}^{z x^{*}}) P_{k | k - 1}^{*} (37)

Proof. For the linear TMM (2), we have

[\begin{matrix} {\hat{x}}_{k | k - 1}^{*} \\ z_{k} \end{matrix}] = [\begin{matrix} I \\ F_{k}^{z x^{*}} \end{matrix}] x_{k}^{*} + [\begin{matrix} 0 \\ F_{k}^{z z} \end{matrix}] z_{k - 1} + η_{k} (38)

where I and 0 are the identity and zero matrices, and

η_{k} = \begin{bmatrix} - (x_{k}^{*} - {\hat{x}}_{k | k - 1}^{*}) \\ ξ_{k}^{z} \end{bmatrix}, \quad E [η_{k} η_{k}^{T}] = \begin{bmatrix} P_{k | k - 1}^{*} & 0 \\ 0 & Q_{k}^{z z} \end{bmatrix} (39)

To address non-Gaussian noise, we use correntropy instead of the MMSE to derive the update equations. The correntropy-based cost function is

\begin{aligned} J (x_{k}^{*}) = {} & G_{σ} ({‖z_{k} - F_{k}^{z x^{*}} x_{k}^{*} - F_{k}^{z z} z_{k - 1}‖}_{{(Q_{k}^{z z})}^{- 1}}) \\ & + G_{σ} ({‖x_{k}^{*} - {\hat{x}}_{k | k - 1}^{*}‖}_{{(P_{k | k - 1}^{*})}^{- 1}}) \end{aligned} (40)

Then the optimal estimate of $x_{k}^{*}$ is ${\hat{x}}_{k}^{*} = \arg \max_{x_{k}^{*}} J (x_{k}^{*})$, which can be obtained by setting the gradient to zero:

\begin{aligned} \frac{\partial J (x_{k}^{*})}{\partial x_{k}^{*}} = {} & \frac{1}{σ^{2}} G_{σ} ({‖z_{k} - F_{k}^{z x^{*}} x_{k}^{*} - F_{k}^{z z} z_{k - 1}‖}_{{(Q_{k}^{z z})}^{- 1}}) {(F_{k}^{z x^{*}})}^{T} {(Q_{k}^{z z})}^{- 1} (z_{k} - F_{k}^{z x^{*}} x_{k}^{*} - F_{k}^{z z} z_{k - 1}) \\ & - \frac{1}{σ^{2}} G_{σ} ({‖x_{k}^{*} - {\hat{x}}_{k | k - 1}^{*}‖}_{{(P_{k | k - 1}^{*})}^{- 1}}) {(P_{k | k - 1}^{*})}^{- 1} (x_{k}^{*} - {\hat{x}}_{k | k - 1}^{*}) \\ = {} & 0 . \end{aligned} (41)

Equation (41) can be written as

Ψ_{k} x_{k}^{*} = {(P_{k | k - 1}^{*})}^{- 1} {\hat{x}}_{k | k - 1}^{*} + λ_{k} {(F_{k}^{z x^{*}})}^{T} {(Q_{k}^{z z})}^{- 1} (z_{k} - F_{k}^{z z} z_{k - 1}) (42)

where

Ψ_{k} = {(P_{k | k - 1}^{*})}^{- 1} + λ_{k} {(F_{k}^{z x^{*}})}^{T} {(Q_{k}^{z z})}^{- 1} F_{k}^{z x^{*}}, (43)

λ_{k} = \frac{G_{σ} ({‖z_{k} - F_{k}^{z x^{*}} x_{k}^{*} - F_{k}^{z z} z_{k - 1}‖}_{{(Q_{k}^{z z})}^{- 1}})}{G_{σ} ({‖x_{k}^{*} - {\hat{x}}_{k | k - 1}^{*}‖}_{{(P_{k | k - 1}^{*})}^{- 1}})} . (44)

Adding and subtracting a term $λ_{k} {(F_{k}^{z x^{*}})}^{T} {(Q_{k}^{z z})}^{- 1} F_{k}^{z x^{*}} {\hat{x}}_{k | k - 1}^{*}$ on the right-hand side of (42), we have

Ψ_{k} x_{k}^{*} = Ψ_{k} {\hat{x}}_{k | k - 1}^{*} + λ_{k} {(F_{k}^{z x^{*}})}^{T} {(Q_{k}^{z z})}^{- 1} (z_{k} - F_{k}^{z x^{*}} {\hat{x}}_{k | k - 1}^{*} - F_{k}^{z z} z_{k - 1}) . (45)

Thus, the estimation of $x_{k}^{*}$ can be computed by

{\hat{x}}_{k | k}^{*} = {\hat{x}}_{k | k - 1}^{*} + K_{k}^{λ} (z_{k} - F_{k}^{z x^{*}} {\hat{x}}_{k | k - 1}^{*} - F_{k}^{z z} z_{k - 1}) (46)

where

K_{k}^{λ} = Ψ_{k}^{- 1} λ_{k} {(F_{k}^{z x^{*}})}^{T} {(Q_{k}^{z z})}^{- 1} (47)

Note that the parameter λ_k is a function of $x_{k}^{*}$. For simplicity, let $x_{k}^{*} \approx {\hat{x}}_{k | k - 1}^{*}$ in (44), so that the denominator equals one and λ_k can be obtained by

λ_{k} = G_{σ} ({‖z_{k} - F_{k}^{z x^{*}} {\hat{x}}_{k | k - 1}^{*} - F_{k}^{z z} z_{k - 1}‖}_{{(Q_{k}^{z z})}^{- 1}}) (48)

In addition, the parameter σ plays an important role in correntropy-based filters. Inspired by Kulikova (2017), this paper adopts an adaptive method to choose σ, i.e.,

σ = {‖z_{k} - F_{k}^{z x^{*}} {\hat{x}}_{k | k - 1}^{*} - F_{k}^{z z} z_{k - 1}‖}_{{(Q_{k}^{z z})}^{- 1}} (49)

In this section, a CTKF is developed to address the estimation problem of dynamic systems with high-order ARMA non-Gaussian noise. Instead of the commonly used MMSE criterion, which uses only second-order statistics of the error, correntropy is employed to derive the filter, since it can capture second-order and higher-order statistics of the error. The structure of the CTKF is similar to that of the TKF, except that an extra scale parameter λ_k is involved. The scale parameter is computed according to the correntropy criterion and scales the gain matrix $K_{k}^{λ}$, which is why the CTKF generally performs well for non-Gaussian noise. In addition, the CTKF has a simple form, which facilitates its practical application.
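The update step (32) to (37) can be sketched as follows. This is a minimal illustration under stated assumptions: the model blocks are time-invariant, the weighted norm is taken in the usual sense ${‖e‖}_{A}^{2} = e^{T} A e$, and the kernel size σ is passed in by the caller. The function name `ctkf_update` and the numerical values are hypothetical.

```python
import numpy as np

def ctkf_update(x_pred, P_pred, z_k, z_km1, Fzx, Fzz, Qzz, sigma):
    """Correntropy-weighted update following (32)-(37).

    Assumes ||e||_A^2 = e^T A e and a caller-supplied kernel size sigma.
    """
    e = z_k - Fzx @ x_pred - Fzz @ z_km1                 # (32) innovation
    d2 = float(e @ np.linalg.solve(Qzz, e))              # squared norm weighted by (Q^{zz})^{-1}
    lam = float(np.exp(-d2 / (2.0 * sigma**2)))          # (33): Gaussian kernel of the weighted norm
    Re = Fzx @ P_pred @ Fzx.T + Qzz                      # (34)
    K = lam * P_pred @ Fzx.T @ np.linalg.inv(Re)         # (35): TKF gain scaled by lambda_k
    x_new = x_pred + K @ e                               # (36)
    P_new = (np.eye(len(x_pred)) - K @ Fzx) @ P_pred     # (37)
    return x_new, P_new, lam

one, zero = np.array([[1.0]]), np.array([[0.0]])
x_pred, P_pred = np.array([0.0]), np.array([[2.0]])

# A very large kernel size gives lambda close to 1, i.e. the TKF update.
x_wide, P_wide, lam_wide = ctkf_update(x_pred, P_pred, np.array([2.0]), np.array([0.0]),
                                       one, zero, one, sigma=1e6)
# An outlying measurement with a moderate kernel size is almost ignored.
x_out, P_out, lam_out = ctkf_update(x_pred, P_pred, np.array([100.0]), np.array([0.0]),
                                    one, zero, one, sigma=2.0)
print(lam_wide, x_wide, lam_out, x_out)
```

The sketch leaves σ as an argument rather than fixing a selection rule. If σ is set from the current innovation as in (49), the expression in (48) evaluates to exp(−1/2) whenever the innovation is nonzero, so the choice of σ directly determines how strongly large innovations are down-weighted.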

4 Numerical simulations

In this section, two scenarios, i.e., a dynamic system with high-order ARMA Gaussian noise and one with high-order ARMA non-Gaussian noise, are considered to verify the effectiveness of the TMM and the CTKF. In model (1), the state is $x_{k} = {[p_{x, k}, v_{x, k}, p_{y, k}, v_{y, k}]}^{T}$, and the relevant matrices are

F_{k} = \begin{bmatrix} 1 & T & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & T \\ 0 & 0 & 0 & 1 \end{bmatrix}, \quad G_{k} = \begin{bmatrix} \frac{T^{2}}{2} & 0 \\ T & 0 \\ 0 & \frac{T^{2}}{2} \\ 0 & T \end{bmatrix}, \quad H_{k} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix} (50)

where T = 1 is the sampling period. The spectral factors of process and measurement noises are

H^{w} (z) = \frac{z^{4} - 0.4 z^{3} + 0.9 z^{2} - 0.1 z - 0.3}{z^{4} - z^{3} + 0.5 z^{2} + 0.2 z - 0.4} (51)

H^{v} (z) = \frac{z^{6} + 0.6 z^{5} + 0.4 z^{4} + 0.3 z^{3} - 0.08 z^{2} + 0.05 z + 0.01}{z^{6} + 0.8 z^{5} + 0.6 z^{4} + 0.2 z^{3} - 0.09 z^{2} - 0.08 z + 0.01} (52)
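As a worked reading of (51) and (52), assuming each spectral factor is interpreted as a scalar transfer function from the white noise to the ARMA noise, dividing the numerator and denominator by the highest power of z gives the difference equations of the form (3) and (9) directly:

```latex
% Process noise, from (51): p^w = q^w = 4,
% α^w = (-1, 0.5, 0.2, -0.4), β^w = (1, -0.4, 0.9, -0.1, -0.3)
\begin{aligned}
w_{k} = {} & w_{k - 1} - 0.5 w_{k - 2} - 0.2 w_{k - 3} + 0.4 w_{k - 4} \\
& + ξ_{k}^{w} - 0.4 ξ_{k - 1}^{w} + 0.9 ξ_{k - 2}^{w} - 0.1 ξ_{k - 3}^{w} - 0.3 ξ_{k - 4}^{w}
\end{aligned}

% Measurement noise, from (52): p^v = q^v = 6,
% a^v = (0.8, 0.6, 0.2, -0.09, -0.08, 0.01), b^v = (1, 0.6, 0.4, 0.3, -0.08, 0.05, 0.01)
\begin{aligned}
v_{k} = {} & - 0.8 v_{k - 1} - 0.6 v_{k - 2} - 0.2 v_{k - 3} + 0.09 v_{k - 4} + 0.08 v_{k - 5} - 0.01 v_{k - 6} \\
& + ξ_{k}^{v} + 0.6 ξ_{k - 1}^{v} + 0.4 ξ_{k - 2}^{v} + 0.3 ξ_{k - 3}^{v} - 0.08 ξ_{k - 4}^{v} + 0.05 ξ_{k - 5}^{v} + 0.01 ξ_{k - 6}^{v}
\end{aligned}
```

The autoregressive coefficients carry the signs of the denominator polynomials because (3) and (9) place the autoregressive sum on the right-hand side with a minus sign.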

Case 1: $ξ_{k}^{w} \sim N (0, Q_{k})$ and $ξ_{k}^{v} \sim N (0, R_{k})$ are Gaussian noises, where $Q_{k} = diag ({0.01}^{2}, {0.01}^{2})$ and $R_{k} = diag ({0.1}^{2}, {0.1}^{2})$ . For comparison, the standard Kalman filter (KF), the traditional state augmented Kalman filter (SAKF) (Bar-Shalom et al. (2001)), and the triplet Kalman filter (TKF) are tested. Besides, the root mean square error (RMSE) is used to evaluate estimation performance, which is computed by

RMSE = \sqrt{\frac{1}{M} \sum_{i = 1}^{M} ({(x_{k}^{i} - {\hat{x}}_{k}^{i})}^{2} + {(y_{k}^{i} - {\hat{y}}_{k}^{i})}^{2})} (53)

where $x_{k}^{i}$ and $y_{k}^{i}$ are the true values at time k in the ith Monte Carlo trial, and ${\hat{x}}_{k}^{i}$ and ${\hat{y}}_{k}^{i}$ are the corresponding estimates. The number of Monte Carlo trials is M = 200.
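The RMSE in (53) is computed per time step across the Monte Carlo trials. A minimal sketch, assuming the true and estimated coordinate pairs are stored in arrays of shape (M, K, 2) for M trials and K time steps (an array layout chosen here for illustration, with a hypothetical function name):

```python
import numpy as np

def rmse_over_trials(true_xy, est_xy):
    """RMSE of (53) at each time step.

    true_xy, est_xy : arrays of shape (M, K, 2) holding the two coordinates
    for M Monte Carlo trials and K time steps. Returns an array of shape (K,).
    """
    diff = np.asarray(true_xy, dtype=float) - np.asarray(est_xy, dtype=float)
    sq_err = np.sum(diff**2, axis=-1)          # squared error per trial and time step
    return np.sqrt(np.mean(sq_err, axis=0))    # average over trials, then square root

# Illustrative check: a constant error of (3, 4) in every trial gives an RMSE of 5.
truth = np.zeros((4, 3, 2))
estimate = np.zeros((4, 3, 2))
estimate[..., 0] = 3.0
estimate[..., 1] = 4.0
rmse = rmse_over_trials(truth, estimate)
print(rmse)
```

The same function applies to the velocity components if the velocity pairs are passed instead of the position pairs.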

Position and velocity RMSE results are provided in Figure 1. It can be seen that the TKF and SAKF have similar estimation performance, and both are better than the standard KF. High-order ARMA process and measurement noises do not meet the independence assumption, resulting in poor estimation performance of the KF. The TKF and SAKF are essentially equivalent, and they are optimal in the MMSE sense. The former models the dynamic system with high-order ARMA noise through the TMM, and the latter deals with high-order ARMA noise through the prewhitening technique. Simulation results show that the TMM can accurately model dynamic systems with high-order ARMA noise.

FIGURE 1. RMSE results for different filters. (A) Position RMSE. (B) Velocity RMSE.

Case 2: $ξ_{k}^{w} \sim N (0, Q_{k})$ and $ξ_{k}^{v} \sim N (0, R_{k})$ are Gaussian noises disturbed by shot noise with a probability of 0.2, where Q_k and R_k are the same as those in Case 1, and the shot noise is generated by 0.1 × randi([5, 10]). Here randi([a, b]) returns an integer drawn from the discrete uniform distribution on [a, b].
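One way to generate such contaminated noise is sketched below. The paper does not state how the shot noise is combined with the Gaussian noise, so this sketch assumes that it is added to the Gaussian sample and that each component is hit independently with probability 0.2; both are assumptions made for illustration, as are the function and parameter names.

```python
import numpy as np

def shot_contaminated_noise(rng, cov, n_steps, p_shot=0.2, scale=0.1, low=5, high=10):
    """Gaussian noise with additive shot noise (assumed additive and per component).

    Each component is hit with probability p_shot by scale * randi([low, high]).
    Returns an array of shape (n_steps, dim).
    """
    dim = cov.shape[0]
    gauss = rng.multivariate_normal(np.zeros(dim), cov, size=n_steps)
    hit = rng.random((n_steps, dim)) < p_shot
    # Generator.integers excludes the upper bound by default, hence high + 1.
    shots = scale * rng.integers(low, high + 1, size=(n_steps, dim))
    return gauss + hit * shots

rng = np.random.default_rng(0)
Q = np.diag([0.01**2, 0.01**2])
noise = shot_contaminated_noise(rng, Q, n_steps=2000)
hit_fraction = float(np.mean(noise > 0.4))
print(noise.shape, hit_fraction)
```

With the Case 1 covariance, the Gaussian part has a standard deviation of 0.01 while the shots lie between 0.5 and 1.0, so the contaminated samples are clearly separated from the nominal ones.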

For comparison, the TKF and the proposed CTKF are tested. Position and velocity RMSE results are provided in Figure 2. It can be seen that the CTKF performs better than the TKF. Non-Gaussian noise results in poor estimation performance of the TKF, since it adopts the MMSE criterion, which uses only second-order statistics of the error. The CTKF shows stronger robustness to non-Gaussian noise, because the adopted correntropy cost function can capture second-order and higher-order statistics of the error. Simulation results show that the CTKF is an effective state estimation method for dynamic systems with high-order ARMA non-Gaussian noise.

FIGURE 2. RMSE results for different filters. (A) Position RMSE. (B) Velocity RMSE.

5 Conclusion

In this paper, a new filter is designed to solve the state estimation problem of dynamic systems with high-order ARMA non-Gaussian noise. In this filter, high-order ARMA process and measurement noises are modeled in the TMM framework, and then the recursive algorithm is derived by using the correntropy cost function. On the one hand, the TMM is more general than the HMM, and it can directly model dynamic systems with high-order ARMA noise. On the other hand, correntropy can capture second-order and higher-order statistics of the error, and is more suitable for non-Gaussian noise than the MMSE cost function, which uses only second-order statistics of the error. In addition, the CTKF has a simple form, which facilitates its practical application.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.

Author contributions

GZ provided the idea of the work and organized the manuscript, LZ performed the experiment, FL provided revisions to this paper, XL designed experimental scenarios, and NF and SD conducted data analysis.

Funding

This work is supported by National Natural Science Foundation of China under Grants 62103318 and 62173266.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fenrg.2022.990267/full#supplementary-material

References

Ait El Fquih, B., and Desbouvries, F. (2005). “Kalman filtering for triplet Markov chains: Applications and extensions,” in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Philadelphia, PA, USA, vol. IV, 685–688. doi:10.1109/ICASSP.2005.1416101

Ait-El-Fquih, B., and Desbouvries, F. (2006). Kalman filtering in triplet Markov chains. IEEE Trans. Signal Process. 54, 2957–2963. doi:10.1109/TSP.2006.877651

Anderson, B. D., and Moore, J. B. (2012). Optimal filtering. New York: Courier Corporation.

Bar-Shalom, Y., Li, X. R., and Kirubarajan, T. (2001). Estimation with applications to tracking and navigation: Theory, algorithms and software. New York: Wiley.

Google Scholar

Chen, B., Liu, X., Zhao, H., and Principe, J. C. (2017). Maximum correntropy kalman filter. Automatica 76, 70–77. doi:10.1016/j.automatica.2016.10.004

CrossRef Full Text | Google Scholar

Derrode, S., and Pieczynski, W. (2004). Signal and image segmentation using pairwise Markov chains. IEEE Trans. Signal Process. 52, 2477–2489. doi:10.1109/TSP.2004.832015

CrossRef Full Text | Google Scholar

Huang, Y., Zhang, Y., Li, N., Wu, Z., and Chambers, J. A. (2017). A novel robust Student’s t-based kalman filter. IEEE Trans. Aerosp. Electron. Syst. 53, 1545–1554. doi:10.1109/TAES.2017.2651684

CrossRef Full Text | Google Scholar

Izanloo, R., Fakoorian, S. A., Yazdi, H. S., and Simon, D. (2016). “Kalman filtering based on the maximum correntropy criterion in the presence of non-Gaussian noise,” in 2016 Annual Conference on Information Science and Systems, CISS, Princeton, NJ, USA, 500–505. doi:10.1109/CISS.2016.7460553

CrossRef Full Text | Google Scholar

Kalman, R. E. (1960). A new approach to linear filtering and prediction problems. J. Basic Eng. 82, 35–45. doi:10.1115/1.3662552

CrossRef Full Text | Google Scholar

Kulikova, M. V. (2017). Square-root algorithms for maximum correntropy estimation of linear discrete-time systems in presence of non-Gaussian noise. Syst. Control Lett. 108, 8–15. doi:10.1016/j.sysconle.2017.07.016

CrossRef Full Text | Google Scholar

Lehmann, F., and Pieczynski, W. (2021). Reduced-dimension filtering in triplet markov models. IEEE Trans. Autom. Contr. 67, 605–617. doi:10.1109/TAC.2021.3050721

CrossRef Full Text | Google Scholar

Lehmann, F., and Pieczynski, W. (2020). Suboptimal kalman filtering in triplet markov models using model order reduction. IEEE Signal Process. Lett. 27, 1100–1104. doi:10.1109/LSP.2020.3002420

CrossRef Full Text | Google Scholar

Liu, X., Qu, H., Zhao, J., and Yue, P. (2018). Maximum correntropy square-root cubature kalman filter with application to SINS/GPS integrated systems. ISA Trans. 80, 195–202. doi:10.1016/j.isatra.2018.05.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Pieczynski, W., and Desbouvries, F. (2003). “Kalman filtering using pairwise Gaussian models,” in IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2003, Hong Kong, China, 57–60. doi:10.1109/ICASSP.2003.1201617

CrossRef Full Text | Google Scholar

Roth, M., özkan, E., and Gustafsson, F. (2013). “A Student’s t filter for heavy tailed process and measurement noise,” in IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada, 5770–5774. doi:10.1109/ICASSP.2013.6638770

CrossRef Full Text | Google Scholar

Saha, S., and Gustafsson, F. (2012). Particle filtering with dependent noise processes. IEEE Trans. Signal Process. 60, 4497–4508. doi:10.1109/TSP.2012.2202653

CrossRef Full Text | Google Scholar

Shan, C., Zhou, W., Jiang, Z., and Shan, H. (2021). A new Gaussian approximate filter with colored non-stationary heavy-tailed measurement noise. Digit. Signal Process. 122, 103358. doi:10.1016/j.dsp.2021.103358

CrossRef Full Text | Google Scholar

Zandavi, S. M., and Chung, V. (2019). State estimation of nonlinear dynamic system using novel heuristic filter based on genetic algorithm. Soft Comput. 23, 5559–5570. doi:10.1007/s00500-018-3213-y

CrossRef Full Text | Google Scholar

Zhang, D., Duan, Z., Wang, P., and Zhang, Y. (2021a). “Spacecraft state estimation with multichannel higher-order ARMA colored noises,” in 2021 International Conference on Control, Automation and Information Sciences (ICCAIS) (Xi'an, China: IEEE), 602–607. doi:10.1109/ICCAIS52680.2021.9624490

CrossRef Full Text | Google Scholar

Zhang, G. H., Han, C. Z., Lian, F., and Zeng, L. H. (2017). Cardinality balanced multi-target multi-Bernoulli filter for pairwise markov model. Acta Autom. Sin. 43, 2100–2108. doi:10.16383/j.aas.2017.c160430

CrossRef Full Text | Google Scholar

Zhang, G., Lan, J., Zhang, L., He, F., and Li, S. (2021b). Filtering in pairwise markov model with Student’s t non-stationary noise with application to target tracking. IEEE Trans. Signal Process. 69, 1627–1641. doi:10.1109/TSP.2021.3062170

CrossRef Full Text | Google Scholar

Zhang, G., Lian, F., Han, C., Chen, H., and Fu, N. (2018). Two novel sensor control schemes for multi-target tracking via delta generalised labelled multi-Bernoulli filtering. IET signal Process. 12, 1131–1139. doi:10.1049/iet-spr.2018.5124

CrossRef Full Text | Google Scholar

Zhang, X., Liang, H., Feng, J., and Tan, H. (2022). Kalman filter based high precision temperature data processing method. Front. Energy Res. 10, 832346. doi:10.3389/fenrg.2022.832346

CrossRef Full Text | Google Scholar

Keywords: Kalman filter, higher-order autoregressive moving average, non-Gaussian, triplet Markov model, correntropy

Citation: Zhang G, Zeng L, Lian F, Liu X, Fu N and Dai S (2022) State estimation for dynamic systems with higher-order autoregressive moving average non-Gaussian noise. Front. Energy Res. 10:990267. doi: 10.3389/fenrg.2022.990267

Received: 09 July 2022; Accepted: 25 July 2022;
Published: 05 September 2022.

Edited by:

Hanlin Zhang, Qingdao University, China

Reviewed by:

Wenting Shen, Qingdao University, Qingdao, China
Junjun Guo, KMUST, China

Copyright © 2022 Zhang, Zeng, Lian, Liu, Fu and Dai. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Feng Lian, lianfeng1981@mail.xjtu.edu.cn