Abstract
The classical Kalman filter is a very important state estimation approach, which has been widely used in many engineering applications. The Kalman filter is optimal for linear dynamic systems with independent Gaussian noises. However, the independence and Gaussian assumptions may not be satisfied in practice. On the one hand, modeling physical systems usually results in discrete-time state-space models with correlated process and measurement noises. On the other hand, the noise is non-Gaussian when the system is disturbed by heavy-tailed noise. In this case, the performance of the Kalman filter will deteriorate, or even diverge. This paper is devoted to addressing the state estimation problem of linear dynamic systems with high-order autoregressive moving average (ARMA) non-Gaussian noise. First, a triplet Markov model is introduced to model the system with high-order ARMA noise, since this model relaxes the independence assumption of the hidden Markov model. Then, a new filter is derived based on correntropy, instead of the commonly used minimum mean square error (MMSE), to deal with non-Gaussian noise. Unlike the MMSE, which uses only second-order statistics of error, correntropy can capture second-order and higher-order statistics. Finally, simulation results verify the effectiveness of the proposed algorithm.
1 Introduction
State estimation is a very important problem in many engineering applications, such as energy internet, system control, tracking, and so on [Zandavi and Chung (2019); Zhang et al. (2022)]. These engineering applications are essentially a dynamic system, which is usually described as a state-space model. The hidden Markov model (HMM) is the one of the most commonly used state-space models (Zhang et al. (2018)). For linear case, the state estimation problem is generally solved by the well-known Kalman filter(KF) (Kalman (1960)), which is an optimal filter in the minimum mean square error (MMSE) sense. MMSE is one of the most commonly used cost function in the case of Gaussian noise, and MMSE approach is an estimator which minimizes the mean square error. In addition, a large number of nonlinear filters have been proposed to solve nonlinear estimation problems, such as extended Kalman filter, unscented Kalman filter, cubature Kalman filter, particle filter, to name but a few (Anderson and Moore (2012)).
Although the KF in general performs well, it has rigorous requirements, that is, the process and measurement noises of dynamic systems are independent and Gaussian. However, the independence and Gaussian assumptions do not always hold in practice (Zhang G. et al. (2021)). On one hand, in fact, the noise of most dynamic systems is correlated. Research has shown that modeling physical systems usually results in discrete-time state-space models with correlated process and measurement noises, and some practical applications are explained in (Saha and Gustafsson (2012)). In addition, the dynamic and measurement noise may even high-order (i.e., multi-step) correlated in some severe environments (Zhang D. et al. (2021)). On the other hand, the main reason for using the Gaussian assumption is that it is mathematically simple, but in fact, dynamic systems are usually disturbed by some heavy-tailed impulse noise (Roth et al. (2013)). When the independence and Gaussian assumptions are not satisfied, the KF may fail to output reliable estimation results.
To deal with correlated noise, the traditional method is to reconstruct an HMM by prewhitening processing, and then the classical KF can be used to estimate the state (Bar-Shalom et al. (2001)). Another solution is to characterize dynamic systems with correlated noise through more flexible state-space models, such as the pairwise Markov model (PMM) (Pieczynski and Desbouvries (2003)) and the triplet Markov model (TMM) (Ait-El-Fquih and Desbouvries (2006)). In the PMM, the state and measurement as a whole are regarded as a Markov process, which improves the modeling ability of complex dynamic systems. In the TMM, an auxiliary variable is introduced to completely describe the dynamic systems. This auxiliary variable can play a very significant role in some engineering applications. For example, it can characterize the uncertainty of parameters, non-stationarity and error sources. It has been proved that the TMM is more general than the HMM and the PMM, and it structural advantages make it more preferable in addressing some real-world applications, such as image segmentation (Derrode and Pieczynski (2004)), speech processing (Ait El Fquih and Desbouvries (2005)), target tracking (Zhang et al. (2017); Lehmann and Pieczynski (2020, 2021)), and so on.
For non-Gaussian noise, several approaches have been proposed, which are mainly divided into three categories (Izanloo et al. (2016)). The first is to replace the Gaussian distribution with a more extensive heavy-tailed distribution (Huang et al. (2017)). For example, the Student’s t distribution is one of the most commonly used heavy-tailed distribution. The main disadvantage of heavy-tailed distributions is that they are usually analytically difficult, which brings about that related estimation approaches have no closed form solution. The second is the multiple model technique. In this approach, non-Gaussian noise is represented as a finite sum of Gaussian distribution (Shan et al. (2021)). The main difficulty of this approach is how to design the model set reasonably, and the disadvantage is that the amount of calculation will increase sharply with the increase of the number of models. The third is the Monte Carlo approach, in which a set of weighted random particles are employed to characterize the state (Liu et al. (2018)). Generally, sampling-based algorithms can be categorized into deterministic sampling method and random sampling method. Particularly, in the random sampling method, enough particles can approximate the real state with arbitrary precision, at the cost of expensive computation.
In the past few years, the correntropy-based filtering technology has become an important orientation to solve the state estimation of dynamic systems with non-Gaussian noise (Kulikova (2017); Chen et al. (2017)). In information theory, correntropy is a significant mathematical tool to measure the similarity of two random variables. Unlike the commonly utilized MMSE cost function, which uses only second-order statistic of error, the correntropy captures second-order and higher-order information, and is more suitable for non-Gaussian noise, such as heavy-tailed impulsive noise. Several filtering algorithms based on correntropy have been designed in the framework of HMMs, and they are more robust to non-Gaussian noise than the KF and its variants.
In this paper, we are devoted to addressing the state estimation problem of linear dynamic systems with high-order autoregressive moving average (ARMA) non-Gaussian noise. A new Kalman-like filter is developed in the framework of the TMM based on correntropy. First, we resort to a linear TMM to describe dynamic systems with high-order ARMA noise, since the TMM is more general than the HMM. Second, based on the model, a new Kalman-like filter is derived by using correntropy cost function, instead of the commonly used MMSE cost function. Because correntropy can capture not only second-order but also higher-order statistics of error, the proposed algorithm is more robust to non-Gaussian noise than the traditional filter. Finally, simulation results show the effectiveness of the proposed algorithm.
The rest of the paper is organized as follows. Section 2 is the modeling of linear dynamic systems with high-order ARMA noise. Section 3 derives a new Kalman filter by using correntropy cost function in the framework of the TMM. In Section 4, we validate the proposed algorithm via simulations. Finally, conclusion is provided in Section 5.
2 Modeling of linear dynamic systems with high-order ARMA noise
2.1 Linear hidden markov model
Consider the following linear dynamic systemwhere k is the time index, is the state vector of dimension nx, Fk the transition matrix, Gk is the process noise matrix, is the process noise, is the measurement vector of dimension nz, Hk is the measurement matrix, and is the measurement noise. denotes a Gaussian distribution with mean vector m and covariance matrix P.
In general, noise sequences and are assumed independent, jointly independent and independent of the initial state . Then the state estimate can be obtained by the classical KF, which is an optimal filter in the MMSE sense. However, the independence and Gaussian assumptions that are typically assumed in the HMM do not always hold in practice, such as dynamic systems with high-order ARMA non-Gaussian noise. In this case, the KF may not output reliable estimation results.
2.2 Linear triplet markov model for dynamic systems with high-order ARMA noise
2.2.1 Linear triplet markov chain model
We resort to a linear TMM to describe a linear HMM with correlated noise. Let is the state vector, is the measurement vector, is an auxiliary variable, and . If is a Markov process, the following system is called a linear TMM (Ait-El-Fquih and Desbouvries (2006)):where is zero mean white noise and independent of the initial state ζ0.
2.2.2 Modeling high-order ARMA noise using TMM
In this section, we utilize a linear TMM to model dynamic systems with high-order ARMA noise (Zhang D. et al. (2021)). The TMM provides a general framework for these typical stochastic systems.
(1) High-Order ARMA Process Noise.
For high-order ARMA process noise, it can usually be written in the form of the following difference equation:where is white noise. Model (3) is a typical high-order ARMA model, in which pw is the autoregressive order, qw is the moving average order, and coefficient parameters and are determined by the spectral factor of the power spectral density of the process noise wk.
Suppose is a rational spectrum. According to the spectral decomposition theorem, there is a spectral factor satisfyingwhere represents complex conjugate transpose operation, and can be written by
Then, high-order ARMA process noise can be formulated by
If the process noise in model 1) is high-order ARMA noise, it can be described by model (6). In this case, is no longer a Markov process, but is a Markov process. Let . Model 1) with (6) can be written in the form of linear TMM (2), i.e.,
Assuming white noise , the noise covariance matrix of model 7) is
(2) High-order ARMA Measurement Noise.
For high-order ARMA measurement noise, it can also be written in the following form of difference equationwhere is white noise, pv and qv are autoregressive order and moving average order, respectively. If the spectral density of high-order ARMA measurement noise is and the corresponding spectral factor is , similar to model (6), vk can be modeled as follows
If the measurement noise in model 1) is high-order ARMA noise, it can be described by model (10). Let . Model 1) with (10) can be written in the form of linear TMM (2), i.e.,
Assuming white noise , the noise covariance matrix of model 11) is
(3) High-Order ARMA Process and Measurement Noises.
If the process noise and measurement noise are high-order ARMA noises, they can be described by model 6) and model (10), respectively. Let . Model 1) with (6) and (10) can be written in the form of linear TMM (2), i.e.,
Assuming white noises and , the noise covariance matrix of model 13) is
2.3 Restoration algorithm
Let . Then model 2) can be written aswhere the initial state and noise ξk are
For model 15) with (16), a Kalman-like filter, called triplet Kalman filter (TKF), has been derived to estimate the state . For convenience, the recursive equations are summarized as follows (Ait-El-Fquih and Desbouvries (2006)).
Initialization:
Prediction:
Update:
The TKF is also an optimal filter in the MMSE sense. It in general performs well in Gaussian noise. However, it performance will deteriorate or even diverge when applied to non-Gaussian systems, since the TKF is derived under MMSE criterion, which only uses second-order statistics of error. To solve this problem, in the next section, a new filter is developed by using correntropy cost function, which utilizes not only second-order but also higher-order statistics information.
3 Correntropy-based triplet kalman filter
3.1 Correntropy
Correntropy is a very useful metric tool to measure the similarity of two random variables in information theory (Chen et al. (2017)). For variables X and Y, the correntropy is defined bywhere is an expectation operator, is a kernel function, and is the joint probability density function of X and Y. Generally, is unknown, and only a limited amount of data is provided. Thus, the correntropy can be computed by
There are many options for kernel function. In this paper, we choose the Gaussian kernel functionwhere σ is the kernel size, and . The Gaussian kernel function is positive definite and bounded. When X = Y, it takes the maximum value.
For the Gaussian kernel function, its Taylor series expansion can be written as
It can be seen that the correntropy is in essence the weighted sum of all even-order moments of error. Compared with the MMSE, which uses only the second-order statistics of error, correntropy captures the second-order and higher-order statistics.
3.2 Main results
In this section, a new filter, called correntropy-based TKF (CTKF), is derived by using correntropy under TMM. For clarity, we first provide the main results, and then give the mathematical derivation.
The initialization and prediction steps of the CTKF are the same as those of the TKF, and its update step is summarized as follows:
Proof. For the linear TMM (2), we havewhere I and 0 are identity and zeros matrices, and
To address non-Gaussian noise, we use correntropy instead of MMSE to derive update equations. The cost function based on correntropy is established by
Then the optimal estimation of is , which can be obtained by
Equation 41 can be written bywhere
Adding and subtracting a term on the right-hand side of (42), we have
Thus, the estimation of can be computed bywhere
Note that the parameter λk is function of . For simplicity, let in (44), and λk can be obtained by
In addition, parameter σ plays an important role in correntropy-based filters. Inspired by (Kulikova (2017)), this paper adopts an adaptive method to choose σ, i.e.,
In this section, a CTKF is developed to address the estimation problem of dynamic systems with high-order ARMA non-Gaussian noise. Instead of the commonly used MMSE criterion, which uses only second-order statistics of error, correntropy is employed to derive the filter, since it can captures second-order and higher-order statistics of error. It can be seen that the structure of CTKF is similar to that of TKF, except that an extra scale parameter λk is involved. The scale parameter is computed according to correntropy criterion to control the gain matrix , which results in that the CTKF in general performs well for non-Gaussian noise. In addition, the CTKF has a simple form, which facilitates its practical application.
4 Numerical simulations
In this section, two scenarios, i.e., dynamic system with high-order ARMA Gaussian and non-Gaussian noise, are taken into account to verify the effectiveness of the TMM and CTKF. In model (1), the state is , and relevant matrices arewhere T = 1 is the sampling period. The spectral factors of process and measurement noises are
Case 1: and are Gaussian noises, where and . For comparison, the standard Kalman filter (KF), the traditional state augmented Kalman filter (SAKF) (Bar-Shalom et al. (2001)), and the triplet Kalman filter (TKF) are tested. Besides, the root mean square error (RMSE) is used to evaluate estimation performance, which is computed bywhere and are the true values at time k in the ith Monte Carlo trail, and and are the corresponding estimation values. The number of Monte Carlo trails is M = 200.
Position and velocity RMSE results are provided in Figure 1. It can be seen that the TKF and SAKF have similar estimation performance, and are better than the standard KF. High-order ARMA process and measurement noises do not meet the independence assumption, resulting in poor estimation performance of the KF. The TKF and SAKF are essentially equivalent, and they are optimal in the MMSE sense. The former models dynamic system with high-order ARMA noise through TMM, and the latter deals with high-order ARMA noise through prewhitening technique. Simulation results show that TMM can accurately model dynamic systems with high-order ARMA noise.
FIGURE 1
Case 2: and are Gaussian noise disturbed by shot noise with probability of 0.2, where Qk and Rk are the same as those in case 1, and the shot noise is generated by 0.1 ×randi([5, 10]). Symbol randi([a, b]) denotes that an integer is returned from the uniform distribution of [a, b].
For comparison, the TKF and the proposed CTKF are tested. Position and velocity RMSE results are provided in Figure 2. It can be seen that the CTKF performs better than the TKF. Non-Gaussian noise results in the poor estimation performance of the TKF, since it adopts the MMSE criterion, which uses only second-order statistic of error. The CTKF shows stronger robustness to non-Gaussian noise, because the adopted correntropy cost function can capture second-order and higher-order statistics of error. Simulation results show that the CTKF is an effective state estimation method for dynamic systems with high-order ARMA non-Gaussian noise.
FIGURE 2
5 Conclusion
In this paper, a new filter is designed to solve the state estimation problem of dynamic systems with high-order ARMA non-Gaussian noise. In this filter, high-order ARMA process and measurement noises are modeled in the TMM framework, and then the recursive algorithm is derived by using corretropy cost function. On the one hand, the TMM is more general than the HMM, and it can directly model dynamic systems with high-order ARMA noise. On the other hand, correntropy can capture second-order and higher-order statistics of error, and is more suitable for non-Gaussian noise than the MMSE cost function, which uses only second-order statistics of error. In addition, the CTKF has a simple form, which facilitates its practical application.
Statements
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.
Author contributions
GZ provided the idea of the work and organized the manuscript, LZ performed the experiment, FL provided revisions to this paper, XL designed experimental scenarios, and NF and SD conducted data analysis.
Funding
This work is supported by National Natural Science Foundation of China under Grants 62103318 and 62173266.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fenrg.2022.990267/full#supplementary-material
References
1
Ait El FquihB.DesbouvriesF. (2005). “Kalman filtering for triplet Markov chains : Applications and extensions,” in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Philadelphia, PA, USA, 685–688. vol. IV. 10.1109/ICASSP.2005.1416101CrossRef Full Text | Google Scholar
2
Ait-El-FquihB.DesbouvriesF. (2006). Kalman filtering in triplet Markov chains. IEEE Trans. Signal Process.54, 2957–2963. 10.1109/TSP.2006.877651CrossRef Full Text | Google Scholar
3
AndersonB. D.MooreJ. B. (2012). Optimal filtering. New York: Courier Corporation. Google Scholar
4
Bar-ShalomY.LiX. R.KirubarajanT. (2001). Estimation with applications to tracking and navigation: Theory, algorthims and software. New York: Wiley. Google Scholar
5
ChenB.LiuX.ZhaoH.PrincipeJ. C. (2017). Maximum correntropy kalman filter. Automatica76, 70–77. 10.1016/j.automatica.2016.10.004CrossRef Full Text | Google Scholar
6
DerrodeS.PieczynskiW. (2004). Signal and image segmentation using pairwise Markov chains. IEEE Trans. Signal Process.52, 2477–2489. 10.1109/TSP.2004.832015CrossRef Full Text | Google Scholar
7
HuangY.ZhangY.LiN.WuZ.ChambersJ. A. (2017). A novel robust Student’s t-based kalman filter. IEEE Trans. Aerosp. Electron. Syst.53, 1545–1554. 10.1109/TAES.2017.2651684CrossRef Full Text | Google Scholar
8
IzanlooR.FakoorianS. A.YazdiH. S.SimonD. (2016). “Kalman filtering based on the maximum correntropy criterion in the presence of non-Gaussian noise,” in 2016 Annual Conference on Information Science and Systems, CISS, Princeton, NJ, USA, 500–505. 10.1109/CISS.2016.7460553CrossRef Full Text | Google Scholar
9
KalmanR. E. (1960). A new approach to linear filtering and prediction problems. J. Basic Eng.82, 35–45. 10.1115/1.3662552CrossRef Full Text | Google Scholar
10
KulikovaM. V. (2017). Square-root algorithms for maximum correntropy estimation of linear discrete-time systems in presence of non-Gaussian noise. Syst. Control Lett.108, 8–15. 10.1016/j.sysconle.2017.07.016CrossRef Full Text | Google Scholar
11
LehmannF.PieczynskiW. (2021). Reduced-dimension filtering in triplet markov models. IEEE Trans. Autom. Contr.67, 605–617. 10.1109/TAC.2021.3050721CrossRef Full Text | Google Scholar
12
LehmannF.PieczynskiW. (2020). Suboptimal kalman filtering in triplet markov models using model order reduction. IEEE Signal Process. Lett.27, 1100–1104. 10.1109/LSP.2020.3002420CrossRef Full Text | Google Scholar
13
LiuX.QuH.ZhaoJ.YueP. (2018). Maximum correntropy square-root cubature kalman filter with application to SINS/GPS integrated systems. ISA Trans.80, 195–202. 10.1016/j.isatra.2018.05.001PubMed Abstract | CrossRef Full Text | Google Scholar
14
PieczynskiW.DesbouvriesF. (2003). “Kalman filtering using pairwise Gaussian models,” in IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2003, Hong Kong, China, 57–60. 10.1109/ICASSP.2003.1201617CrossRef Full Text | Google Scholar
15
RothM.özkanE.GustafssonF. (2013). “A Student’s t filter for heavy tailed process and measurement noise,” in IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada, 5770–5774. 10.1109/ICASSP.2013.6638770CrossRef Full Text | Google Scholar
16
SahaS.GustafssonF. (2012). Particle filtering with dependent noise processes. IEEE Trans. Signal Process.60, 4497–4508. 10.1109/TSP.2012.2202653CrossRef Full Text | Google Scholar
17
ShanC.ZhouW.JiangZ.ShanH. (2021). A new Gaussian approximate filter with colored non-stationary heavy-tailed measurement noise. Digit. Signal Process.122, 103358. 10.1016/j.dsp.2021.103358CrossRef Full Text | Google Scholar
18
ZandaviS. M.ChungV. (2019). State estimation of nonlinear dynamic system using novel heuristic filter based on genetic algorithm. Soft Comput.23, 5559–5570. 10.1007/s00500-018-3213-yCrossRef Full Text | Google Scholar
19
ZhangD.DuanZ.WangP.ZhangY. (2021a). “Spacecraft state estimation with multichannel higher-order ARMA colored noises,” in 2021 International Conference on Control, Automation and Information Sciences (ICCAIS) (Xi'an, China: IEEE), 602–607. 10.1109/ICCAIS52680.2021.9624490CrossRef Full Text | Google Scholar
20
ZhangG. H.HanC. Z.LianF.ZengL. H. (2017). Cardinality balanced multi-target multi-Bernoulli filter for pairwise markov model. Acta Autom. Sin.43, 2100–2108. 10.16383/j.aas.2017.c160430CrossRef Full Text | Google Scholar
21
ZhangG.LanJ.ZhangL.HeF.LiS. (2021b). Filtering in pairwise markov model with Student’s t non-stationary noise with application to target tracking. IEEE Trans. Signal Process.69, 1627–1641. 10.1109/TSP.2021.3062170CrossRef Full Text | Google Scholar
22
ZhangG.LianF.HanC.ChenH.FuN. (2018). Two novel sensor control schemes for multi-target tracking via delta generalised labelled multi-Bernoulli filtering. IET signal Process.12, 1131–1139. 10.1049/iet-spr.2018.5124CrossRef Full Text | Google Scholar
23
ZhangX.LiangH.FengJ.TanH. (2022). Kalman filter based high precision temperature data processing method. Front. Energy Res.10, 832346. 10.3389/fenrg.2022.832346CrossRef Full Text | Google Scholar
Summary
Keywords
kalman filter, higher-order autoregressive moving average, non-Gaussian, triplet markov model, correntropy
Citation
Zhang G, Zeng L, Lian F, Liu X, Fu N and Dai S (2022) State estimation for dynamic systems with higher-order autoregressive moving average non-Gaussian noise. Front. Energy Res. 10:990267. doi: 10.3389/fenrg.2022.990267
Received
09 July 2022
Accepted
25 July 2022
Published
05 September 2022
Volume
10 - 2022
Edited by
Hanlin Zhang, Qingdao University, China
Reviewed by
Wenting Shen, Qingdao University Qingdao, China
Junjun Guo, KMUST, China
Updates
Copyright
© 2022 Zhang, Zeng, Lian, Liu, Fu and Dai.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Feng Lian, lianfeng1981@mail.xjtu.edu.cn
This article was submitted to Smart Grids, a section of the journal Frontiers in Energy Research
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.