Practical Steady-State Discrete-Time Kalman Filter Design for Uncertain LTI Systems

Michele Martino

doi:10.20944/preprints202606.0255.v1

Submitted:

02 June 2026

Posted:

03 June 2026

You are already at the latest version

Abstract

A practical methodology for designing a steady-state discrete-time Kalman filter for linear time-invariant (LTI) systems subject to parametric uncertainties is presented. Inspired by the Mean Value First-Order Second-Moment (MVFOSM) method, the approach models parameter deviations as zero-mean random variables with known second-order statistics, and extends De Koning’s foundational framework to explicitly account for control input contributions to equivalent noise statistics. Closed-form approximations for the state- and input-dependent noise covariance matrices Q, R and M are derived as functions of parameter uncertainties and a representative nominal input magnitude. The steady-state filter gain is obtained by solving a generalized discrete algebraic Riccati equation (DARE), yielding a fixed-gain filter that remains robust to modelling errors without requiring on-line tuning. Filter robustness is parameterized by a single scalar choice — the nominal input level (e.g., worst-case or RMS value) — which provides a transparent physical trade-off between nominal performance and robustness to component tolerances. The methodology is particularly suited to embedded power converter control, where component parameter variations significantly affect system dynamics and real-time computational resources are constrained. An unbiased formulation via an augmented state that tracks second-order bias corrections is also presented alongside the simpler biased formulation that reduces to the canonical Kalman filter structure.

Keywords:

Kalman filter

;

robust estimation

;

parametric uncertainty

;

power converter control

;

Mean Value First-Order Second-Moment (MVFOSM)

Subject:

Engineering - Control and Systems Engineering

1. Introduction

The Kalman filter, introduced by Rudolf E. Kálmán in 1960 [1], represents a fundamental tool in modern control theory and signal processing for optimal state estimation of linear dynamic systems in the presence of noise. The filter operates recursively on streams of noisy input data to produce statistically optimal estimates of the underlying system state, making it indispensable in applications ranging from navigation and guidance systems to power converter control [2].

1.1. State Estimation and the Filtering Problem

Consider a discrete-time linear time-invariant (LTI) system described by the state-space representation:

\{\begin{matrix} x_{k + 1} & = A x_{k} + B u_{k} + w_{k} \\ y_{k} & = H x_{k} + v_{k} \end{matrix}

(1)

where

x_{k} \in R^{n}

is the state vector,

u_{k} \in R^{q}

is the control input,

y_{k} \in R^{m}

is the measurement (output) vector, and

w_{k} \in R^{n}

and

v_{k} \in R^{m}

represent process and measurement noise, respectively.

The fundamental problem of state estimation is to reconstruct the internal state

x_{k}

from the available noisy measurements

y_{k}

and known inputs

u_{k}

. This problem is particularly challenging when:

☐: the system state is not directly measurable,
☐: measurements are corrupted by noise with unknown characteristics,
☐: the system model itself contains uncertainties,
☐: real-time estimation is required for feedback control.

The Kalman filter addresses this problem by computing the minimum variance unbiased estimate of the state, i.e. the estimate that minimizes the mean squared error

E [| | x_{k} - {\hat{x}}_{k} {| |}^{2}]

(

{\hat{x}}_{k}

being the state estimate) under the assumption of Gaussian noise statistics [3].

1.2. The Dual Role: Estimation and Filtering

The Kalman filter serves two complementary purposes in control and signal processing applications.

State Estimation

The filter provides optimal estimates of the system’s internal state variables that are either unmeasurable or measured with excessive noise. By fusing information from the dynamic model (predictions) and measurements (corrections), the Kalman filter produces estimates that are more accurate than either source alone. The recursive structure, consisting of prediction and update steps, enables real-time implementation with minimal computational overhead:

\{\begin{matrix} \begin{matrix} {\hat{x}}_{k + 1}^{-} & = A {\hat{x}}_{k}^{+} + B u_{k} & (a priori estimate), \\ {\hat{x}}_{k}^{+} & = {\hat{x}}_{k}^{-} + K_{k} (y_{k} - H {\hat{x}}_{k}^{-}) & (a posteriori estimate) . \end{matrix} \end{matrix}

(2)

The Kalman gain

K_{k}

is computed to optimally weight the relative contributions of the prediction and measurement based on their respective noise characteristics.

Variance Minimization (Filtering)

Beyond point estimation, the Kalman filter explicitly minimizes the covariance of the estimation error. By propagating the error covariance matrix

P_{k}

through the prediction and update equations, the filter quantifies the uncertainty in its estimates and automatically adjusts the Kalman gain to minimize this uncertainty. This dual estimation of both the state mean and covariance distinguishes the Kalman filter from simpler estimation techniques and enables:

☐: optimal sensor fusion when multiple measurements with different noise levels are available,
☐: detection of degraded system performance through monitoring of innovation statistics,
☐: robust control design using the filtered state estimates with quantified uncertainty,
☐: adaptive adjustment of controller parameters based on estimation confidence.

The variance reduction property is particularly valuable in power converter applications, where measurement noise from analog-to-digital converters, electromagnetic interference, and quantization effects can significantly degrade control performance if not properly filtered.

1.3. Steady-State Kalman Filter

For time-invariant systems, the Kalman gain converges to a constant matrix, yielding the steady-state Kalman filter. This asymptotic solution, obtained by solving the discrete algebraic Riccati equation (DARE), offers several practical advantages:

☐: reduced computational burden (no recursive covariance propagation),
☐: simplified implementation and parameter tuning,
☐: guaranteed stability under standard detectability and stabilizability assumptions (subject to numerical regularization) [4],
☐: design-time computation of filter parameters,

The steady-state formulation is particularly attractive for embedded control systems where computational resources are limited and deterministic real-time performance is critical.

1.4. Model Uncertainty and Robustness

Classical Kalman filter theory assumes perfect knowledge of the system matrices A, B, and H, as well as Gaussian white noise processes w and v with known covariances. In practice, these assumptions are rarely satisfied:

☐: component tolerances introduce uncertainty in physical parameters (resistance, inductance, capacitance),
☐: operating conditions vary (temperature, ageing, non-linearities),
☐: modelling approximations neglect high-frequency dynamics,
☐: external disturbances exhibit non-Gaussian characteristics.

Non-linearities are expressly addressed by Extended Kalman Filters (EKFs) and not investigated any further in this work; it is however worth mentioning that the standard EKF employs a similar linearization strategy — first-order Taylor expansion combined with second-order moment propagation — which is precisely the approach adopted here. When model uncertainties are significant, the nominal Kalman filter may provide far from optimal or even unstable estimates. This motivates the development of robust Kalman filtering techniques that explicitly account for parametric uncertainty. By incorporating the covariance of model parameters into the filter design, one can synthesize filters that maintain near-optimal performance (such as being the best linear estimator) despite modelling errors.

The literature on robust Kalman filtering has evolved along several distinct directions. The seminal work of De Koning [5] established the theoretical foundation by deriving equivalent noise covariances for systems with stochastic parameters variations modelled as sequences of i.i.d. (independent and identically distributed) zero-mean random variables, demonstrating that model uncertainties can be systematically incorporated into the classical Kalman framework and deriving augmented process and measurement noise statistics.

Building on these foundations, several researchers have developed alternative robust filtering methodologies. Xie et al. [6] proposed a min-max approach based on game theory, designing filters that minimize the worst-case estimation error over all admissible parameter uncertainties. Theodor and Shaked [7] extended this framework to derive robust minimum-variance filters with guaranteed performance bounds. Zhu et al. [8] provided a comprehensive design methodology combining linear matrix inequality (LMI) techniques with parameter-dependent Lyapunov functions. Luo and Bosch [9] analysed the performance robustness of Kalman filters, establishing relationships between parameter uncertainty levels and estimation error degradation.

More recent advances have addressed structured uncertainties as polytopic models: Yu et al. [10] developed robust Kalman filters for systems with convex polytopic uncertainties, where the uncertain system matrices lie within a convex hull of known vertices. Xie et al. [11] proposed improved

H_{2}

and

H_{\infty}

filtering techniques that, although not strictly Kalman filters, offer relevant insights into handling model uncertainties.

While all the approaches mentioned above provide valuable theoretical insights and worst-case performance guarantees, they do not explicitly consider the control input

u_{k}

in the uncertainty propagation.

Rocha and Terra [12] presented the most comprehensive treatment to date (to the best of the author’s knowledge), deriving robust filters that handle multiplicative noise, time-correlated uncertainties, and cross-coupled parameter variations through a unified recursive framework; it furthermore includes control input

u_{k}

. Such an approach is, however, considered difficult to apply operationally for practitioners that are not necessarily experts in robust control as it is based on regularization techniques that need non-trivial tuning.

The present work attempts to extend De Koning’s approach [5] to explicitly account for control inputs; it however departs from it by modelling the parametric uncertainties more rigorously although only approximately. This extension is particularly important for feedback control applications where the system operates under varying inputs, and where the interaction between parametric uncertainty and control effort significantly affects the equivalent noise characteristics. By deriving closed-form expressions for the input-dependent covariance terms, it enables systematic tuning of the steady-state filter robustness through appropriate selection of the nominal operating point. The focus of the present work is mostly on the practicality of the proposed robust Kalman filter.

1.5. Scope and Contribution

This paper presents a steady-state discrete-time Kalman filter design methodology for LTI systems with parametric uncertainties similar to the MVFOSM approach [13,14]. The key contributions are:

☐: derivation of equivalent process and measurement noise covariances that incorporate both stochastic noise and deterministic model uncertainty, with explicit treatment of control input effects,
☐: closed-form expressions for the (approximated) state-dependant covariance matrices Q, R, and M as functions of parameter uncertainty and control input,
☐: extension to time-varying implementations for scenarios where reduced conservatism justifies increased computational cost.

The methodology enables systematic design of robust Kalman filters that explicitly trade off nominal performance and robustness to parameters’ variations. By selecting an appropriate nominal input value (e.g., maximum expected input or RMS value for periodic signals), designers can tune the conservatism of the steady-state solution to match application requirements. This is particularly valuable for power converter control, where component tolerances can significantly affect dynamic behaviour, and where computational constraints favour steady-state over time-varying implementations.

The remainder of this paper is organized as follows: Section 2 develops the system model with parametric uncertainty and establishes the discretization procedure. Section 3 derives the steady-state Kalman filter equations together with detailed, although approximated, calculations of the state-dependent covariance matrices. Section 4 briefly outlines the extension to time-varying implementations. Conclusions and directions for future work are presented in Section 5.

2. Modeling and Discretization

The following LTI continuous-time model is considered:

\{\begin{matrix} \dot{x} = (A_{c} + δ A_{c}) x + (B_{c} + δ B_{c}) (u + δ u) + w^{*}, \\ y = (H + δ H) x + v^{*}, \end{matrix}

(3)

where:

\{\begin{matrix} A_{c} & = A_{c} (\bar{θ}), \\ B_{c} & = B_{c} (\bar{θ}), \\ H & = H (\bar{θ}), \end{matrix}

are deterministic matrices and:

\{\begin{matrix} δ A_{c} & = δ A_{c} (δ θ), \\ δ B_{c} & = δ B_{c} (δ θ), \\ δ H & = δ H (δ θ), \end{matrix}

are random matrices due to the uncertainty of the parameters grouped in the vector

θ

, which can be expressed as:

θ = \bar{θ} + δ θ .

(4)

In eq. (4),

\bar{θ}

groups the nominal values of the parameters, and

δ θ

represents their uncertainty such that

E [δ θ] = 0

. Furthermore, it is assumed that the input u is affected by some noise

δ u

(which could be quantization noise or coming from un-modelled dynamics), and the state is affected by noise

w^{*}

, while the output is corrupted by measurement noise

v^{*}

. It is also assumed that

δ θ

is statistically independent from

δ u

,

w^{*}

, and

v^{*}

and

x_{0}

, which is a natural assumption. In terms of dimensions: the state

x \in R^{n}

, the output

y \in R^{m}

, u and

δ u \in R^{q}

,

A_{c}

and

δ A_{c} \in R^{n \times n}

,

B_{c}

and

δ B_{c} \in R^{n \times q}

, H and

δ H \in R^{m \times n}

,

w^{*} \in R^{n}

and

v^{*} \in R^{m}

.

For practical use, this model needs to be discretized (with sampling period T) as follows:

\{\begin{matrix} x_{k + 1} & = (A + δ A) x_{k} + (B + δ B) (u_{k} + δ u_{k}) + w_{k}^{*}, \\ y_{k} & = (H + δ H) x_{k} + v_{k}^{*}, \end{matrix}

(5)

where:

\{\begin{matrix} A = e^{A_{c} T}, \\ B = \int_{0}^{T} e^{A_{c} τ} d τ B_{c} = A_{c}^{- 1} (e^{A_{c} T} - I_{n}) B_{c} = A_{c}^{- 1} (A - I_{n}) B_{c}, \end{matrix}

(6)

and where

I_{n}

is the

n \times n

identity matrix and

A_{c}

is assumed invertible. It is convenient to express the last relationship in eq. (6) as follows:

B = A_{c}^{- 1} (A - I_{n}) B_{c} = G B_{c}

(7)

It can be assumed (standard Kalman filters) that

w_{k}^{*}

and

v_{k}^{*}

are zero-mean uncorrelated white Gaussian noises. The LTI state equations can be therefore rewritten as:

\{\begin{matrix} x_{k + 1} & = A x_{k} + B u_{k} + (δ A x_{k} + δ B u_{k} + B δ u_{k} + δ B δ u_{k}) + w_{k}^{*}, \\ y_{k} & = H x_{k} + (δ H x_{k} + v_{k}^{*}), \end{matrix}

(8)

or equivalently as:

\{\begin{matrix} x_{k + 1} & = A x_{k} + B u_{k} + w_{k}, \\ y_{k} & = H x_{k} + v_{k}, \end{matrix}

(9)

where:

\{\begin{matrix} w_{k} & = δ A x_{k} + δ B u_{k} + B δ u_{k} + δ B δ u_{k} + w_{k}^{*}, \\ v_{k} & = δ H x_{k} + v_{k}^{*} . \end{matrix}

(10)

Continuous-time process noise

w^{*} (t)

and measurement noise

v^{*} (t)

are assumed zero-mean, white and Gaussian. In particular

Cov (w^{*} (t)) = E [w^{*} (t) w^{* T} (t - τ)] = Q_{w^{*}}^{c} δ (τ)

; the discretized process noise

w_{k}^{*}

has a covariance [2]:

Q_{w^{*}} = Cov (w_{k}^{*}) = \int_{0}^{T} e^{A_{c} (T - τ)} Q_{w^{*}}^{c} e^{A_{c}^{T} (T - τ)} d τ .

(11)

Equation (11) can be used if a knowledge of the covariance of the continuous time process noise is available to numerically calculate the covariance of the discrete time process noise. In particular one might assume

Q_{w^{*}}^{c}

to be diagonal; as shown by eq. (11) this is not the case in discrete time unless T is extremely small, in such a case it can be seen that

Q_{w^{*}} \approx Q_{w^{*}}^{c} T

so diagonality is preserved.

Furthermore [2]:

R_{v^{*}} = \frac{R_{v^{*}}^{c}}{T},

(12)

in this case, diagonality of the measurement noise covariance is preserved irrespective of the value of T.

Equation (10) defines equivalent noises similar to those presented in [5] (which, however, does not discuss the control input

u_{k}

) where the equivalent noises are state dependent.

However, for the case analysed here, the equivalent noises are not zero-mean:

\{\begin{matrix} E [w_{k}] & = z_{k}, \\ E [v_{k}] & = t_{k} . \end{matrix}

(13)

For the second moments:

\{\begin{matrix} E [w_{k} w_{k}^{T}] & = P_{w w_{k}}, \\ E [v_{k} v_{k}^{T}] & = P_{v v_{k}}, \\ E [w_{k} v_{k}^{T}] & = P_{w v_{k}}, \end{matrix}

(14)

and covariances:

\{\begin{matrix} Q_{k} & = P_{w w_{k}} - z_{k} z_{k}^{T}, \\ R_{k} & = P_{v v_{k}} - t_{k} t_{k}^{T}, \\ M_{k} & = P_{w v_{k}} - z_{k} t_{k}^{T} . \end{matrix}

(15)

The expected values of the equivalent noises (

z_{k}

and

t_{k}

respectively) will be considered as additional states. In terms of dimensions:

Q_{k} \in R^{n \times n}

,

R_{k} \in R^{m \times m}

,

M_{k} \in R^{n \times m}

.

2.1. First-Order Modelling of the Uncertainty

At the first order, the continuous-time state-space uncertainties can be written as follows:

\{\begin{matrix} \begin{matrix} δ A_{c} & \approx J_{A_{c}} δ θ, \\ δ B_{c} & \approx J_{B_{c}} δ θ, \\ δ H & \approx J_{H} δ θ, \end{matrix} \end{matrix}

(16)

where the

J_{(.)}

are the Jacobians (used here as a shorthand for Jacobian matrices) of the state-space matrices with respect to the parameters

θ

. Actually, the Jacobian is defined for (column) vectors and needs to be generalized for matrices by stacking on top of each other the Jacobians of each column; this is also known as vectorization. In other words, the following should have been written as:

\{\begin{matrix} \begin{matrix} δ A_{c} & \approx unvec [\frac{\partial}{\partial θ} vec (A_{c}) δ θ] & = unvec (J_{A_{c}} δ θ), \\ δ H & \approx unvec [\frac{\partial}{\partial θ} vec (H) δ θ] & = unvec (J_{H} δ θ), \end{matrix} \end{matrix}

(17)

however the simplified, but abused, notation in eq. (16) is preferred throughout this work. The (partial) derivatives

\partial θ

should be considered as evaluated at

θ = \bar{θ}

, but here a simplified notation has also been preferred. In terms of dimensions:

θ \in R^{p}

,

J_{A_{c}} \in R^{n^{2} \times p}

,

J_{B_{c}} \in R^{n q \times p}

,

J_{H} \in R^{n m \times p}

.

These uncertainties need to be converted into discrete time, except for

δ H

, which is identical to the continuous time counterpart.

This is done in eq. (18) where it is considered that

B = G (A_{c}) B_{c}

is a function of both

B_{c}

and

A_{c}

, i.e.

B = g (B_{c}, A_{c})

:

\{\begin{matrix} \begin{matrix} δ A & \approx \frac{\partial vec (A)}{\partial vec {(A_{c})}^{T}} δ A_{c} = J_{A} δ A_{c} \approx J_{A} J_{A_{c}} δ θ, \\ δ B & \approx \frac{\partial vec (g)}{\partial vec {(B_{c})}^{T}} δ B_{c} + \frac{\partial vec (g)}{\partial vec {(A_{c})}^{T}} δ A_{c} = (I_{q} \otimes G) δ B_{c} + J_{B} δ A_{c} \approx (I_{q} \otimes G) J_{B_{c}} δ θ + J_{B} J_{A_{c}} δ θ . \end{matrix} \end{matrix}

(18)

In eq. (18), the symbol ⊗ refers to the Kronecker (or tensor) product used in the vectorization that allowed the different contributions to be expressed as functions of the covariances of the state-space matrices. Furthermore in eqs. (17) and (18) the notation for the expression of the Jacobians strictly follows [15] although it is abused for the expression of the differentials which would require an

unvec

operation. This will be clarified only when a disambiguation is strictly needed.

In the following, the definition in eq. (19) is used:

V = (I_{q} \otimes G) J_{B_{c}} + J_{B} J_{A_{c}},

(19)

hence it can finally be written, with abused notation (i.e. implicitly assuming vectorization and de-vectorization):

δ B \approx V δ θ .

(20)

The other newly introduced Jacobians are the following:

\{\begin{matrix} \begin{matrix} J_{A} & = \frac{\partial A}{\partial A_{c}} = \frac{\partial vec (A)}{\partial vec {(A_{c})}^{T}} = \int_{0}^{T} e^{A_{c}^{T} τ} \otimes e^{A_{c} (T - τ)} d τ, \\ J_{B} & = \frac{\partial B}{\partial A_{c}} = \frac{\partial vec (B)}{\partial vec {(A_{c})}^{T}} = (B_{c}^{T} \otimes A_{c}^{- 1}) J_{A} - (B^{T} \otimes A_{c}^{- 1}) . \end{matrix} \end{matrix}

(21)

The derivation of the expression of

J_{B}

is detailed in the appendix. In terms of dimensions:

V \in R^{n q \times p}

,

J_{A} \in R^{n^{2} \times n^{2}}

,

J_{B} \in R^{n q \times n^{2}}

.

2.1.1. Type B Uncertainty Characterization of Component Tolerances

The robust filter design methodology presented in this work requires knowledge of the parameter uncertainty covariance matrix

Σ_{δ θ}

. In practice, the filter designers have rarely access to statistical population data for individual components. Instead, parameter uncertainties are typically available only as manufacturer-specified tolerances. This section discusses how to construct

Σ_{δ θ}

from tolerance specifications following the Guide to the Expression of Uncertainty in Measurement (GUM) [16] framework.

The GUM distinguishes between two fundamental approaches to uncertainty quantification:

☐: Type A evaluation: Based on statistical analysis of series of observations. Requires measuring a representative sample of components and computing sample statistics (mean, variance, correlations).
☐: Type B evaluation: Based on means other than statistical analysis, including manufacturer specifications, calibration certificates, handbooks, experience, or scientific judgment. This is the approach typically available to filter designers and is therefore adopted in this work.

Electrical component tolerances are usually specified by manufacturers as symmetric bounds around nominal values. Common examples include:

☐: Resistors: $R = 10 k Ω \pm 1 %$ (meaning $R \in [9.9, 10.1] k Ω$ )
☐: Capacitors: $C = 100 μ F \pm 10 %$ (meaning $C \in [90, 110] μ F$ )
☐: Inductors: $L = 1 mH \pm 5 %$ (meaning $L \in [0.95, 1.05] mH$ )

These specifications define bounded intervals

[{\bar{θ}}_{i} - Δ θ_{i}, {\bar{θ}}_{i} + Δ θ_{i}]

but provide no information about the probability distribution within these bounds.

When only tolerance bounds are available, the GUM methodology recommends using a uniform distribution:

δ θ_{i} \sim U (- Δ θ_{i}, Δ θ_{i}) .

(22)

The corresponding variance is:

σ_{δ θ_{i}}^{2} = \frac{Δ θ_{i}^{2}}{3} .

(23)

For a system with p uncertain parameters

θ = {[θ_{1}, θ_{2}, \dots, θ_{p}]}^{T}

, the covariance matrix is constructed as:

Σ_{δ θ} = [\begin{matrix} σ_{δ θ_{1}}^{2} & σ_{δ θ_{1}, δ θ_{2}} & \dots & σ_{δ θ_{1}, δ θ_{p}} \\ σ_{δ θ_{2}, δ θ_{1}} & σ_{δ θ_{2}}^{2} & \dots & σ_{δ θ_{2}, δ θ_{p}} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ σ_{δ θ_{p}, δ θ_{1}} & σ_{δ θ_{p}, δ θ_{2}} & \dots & σ_{δ θ_{p}}^{2} \end{matrix}] .

(24)

Diagonal elements are computed from tolerance specifications using, as an example eq. (23); off-diagonal elements (covariances between different parameters) are typically set to zero in the absence of specific information; assuming that component parameters are statistically independent is a sound hypothesis most of the time (there could be exceptions for special arrangement such as current or voltage dividers where two or more components are known to track each other etc...). When assuming: (i) uniform distribution, (ii) diagonal covariance matrix and (iii) symmetry around the mean value, it is straightforward to calculate all higher order moments :

E [δ θ_{i}^{n}] = \{\begin{matrix} \frac{Δ θ_{i}^{n}}{n + 1} & n even \\ 0 & n odd \end{matrix}

(25)

What is presented in the rest of the paper does not specifically assume the uniform distribution or the diagonality of the covariance matrix, however the derivation of the proposed robust Kalman filter is driven exactly by the practical case where only tolerances of the components are known to the filter designer.

2.2. First-Order Statistical Modelling

The state can be expressed explicitly as:

\{\begin{matrix} x_{k} & = {(A + δ A)}^{k} x_{0} + \sum_{j = 0}^{k - 1} {(A + δ A)}^{k - 1 - j} [(B + δ B) (u_{j} + δ u_{j}) + w_{j}^{*}], \\ y_{k} & = (H + δ H) x_{k} + v_{k}^{*} . \end{matrix}

(26)

Even assuming statistical independence of

x_{0}

from

δ A

(which is a completely reasonable assumption), the problem as expressed in eq. (26) is very complex in general terms. The evolution of the state is governed by

{(A + δ A)}^{k}

which can be expressed as follows:

\begin{matrix} {(A + δ A)}^{k} & = A^{k} + \sum_{i = 0}^{k - 1} A^{k - 1 - i} δ A A^{i} + α_{k} (δ A), \end{matrix}

(27)

where

α_{k} (δ A)

contains all terms of order

O (δ A^{2})

(i.e. terms like

\sum_{0 \leq i < j, i + j \leq k - 2} A^{k - 2 - i - j} δ A A^{i} δ A A^{j}

) and higher-order ones in the expansion of

{(A + δ A)}^{k}

(note that

α_{k} (δ A) = 0

for

k < 0

).

From eqs. (26) and (27) the state can be expressed as:

\begin{matrix} x_{k} & = (A^{k} + \sum_{i = 0}^{k - 1} A^{k - 1 - i} δ A A^{i} + α_{k} (δ A)) x_{0} \\ + \sum_{j = 0}^{k - 1} [A^{k - 1 - j} (B + δ B) + (\sum_{i = 0}^{k - 2 - j} A^{k - 2 - j - i} δ A A^{i}) (B + δ B) + α_{k - 1 - j} (δ A) (B + δ B)] (u_{j} + δ u_{j}) \\ + \sum_{j = 0}^{k - 1} (A^{k - 1 - j} + \sum_{i = 0}^{k - 2 - j} A^{k - 2 - j - i} δ A A^{i} + α_{k - 1 - j} (δ A)) w_{j}^{*}, \end{matrix}

(28)

From eq. (28) one can decompose

x_{k}

as follows:

\begin{matrix} x_{k} & = x_{k}^{n o m} + δ x_{k} + δ x_{k}^{h o} = A^{k} x_{0} + \sum_{j = 0}^{k - 1} A^{k - 1 - j} B u_{j} + {\tilde{δ A}}_{k - 1} x_{0} + \sum_{j = 0}^{k - 1} A^{k - 1 - j} δ B u_{j} \\ + \sum_{j = 0}^{k - 1} {\tilde{δ A}}_{k - 2 - j} B u_{j} + δ x_{k}^{h o}, \end{matrix}

(29)

where:

\{\begin{matrix} x_{k}^{n o m} & = A^{k} x_{0} + \sum_{j = 0}^{k - 1} A^{k - 1 - j} B u_{j}, \\ δ x_{k} & = {\tilde{δ A}}_{k - 1} x_{0} + \sum_{j = 0}^{k - 1} A^{k - 1 - j} δ B u_{j} + \sum_{j = 0}^{k - 1} {\tilde{δ A}}_{k - 2 - j} B u_{j}, \end{matrix}

(30)

represent, respectively, the nominal state evolution (i.e. the state evolution assuming perfect knowledge of the state-space model) and the first order perturbation in

δ A

and

δ B

, or equivalently in

δ θ

. It is worth noting that

E [δ x_{k}] = 0

because of the independence of

x_{0}

from

δ θ

and

E [δ A] = 0

and

E [δ B] = 0

. The fundamental idea, in this work, is to design a robust Kalman filter neglecting the higher order contributions in

δ x_{k}^{h o}

. This is similar to the MVFOSM approach (where only the zeroth order is considered for the mean and only contributions up to the second order are considered for the second moments such as the covariances) but slightly more accurate as it also allows to take into account the state and output biases induced by the model uncertainty, by augmenting the state as detailed in the next section.

2.3. Augmented Model

To build the augmented model it can be observed (thanks to the statistical independence assumptions) that:

\{\begin{matrix} z_{k} = E [w_{k}] = E [δ A x_{k}], \\ t_{k} = E [v_{k}] = E [δ H x_{k}] . \end{matrix}

(31)

For the additional states, the dynamics is as follows:

\{\begin{matrix} z_{k + 1} & = E [δ A x_{k + 1}] = E [δ A A x_{k}] + E [δ A δ A x_{k}] + E [δ A δ B u_{k}] \\ \approx E [δ A A x_{k}] + E [δ A δ A] {\bar{x}}_{k}^{n o m} + E [δ A δ B] u_{k}, \\ t_{k + 1} & = E [δ H x_{k + 1}] = E [δ H A x_{k}] + E [δ H δ A x_{k}] + E [δ H δ B u_{k}] \\ \approx E [δ H A x_{k}] + E [δ H δ A] {\bar{x}}_{k}^{n o m} + E [δ H δ B] u_{k}, \end{matrix}

(32)

where in (32) all contributions that are higher than second order have been neglected.

In eq. (32) the difficulty comes from the fact that A does not commute with

δ A

and

δ H

so the terms

E [δ A A x_{k}]

and

E [δ H A x_{k}]

are not immediately expressible as functions of

z_{k}

and

t_{k}

respectively. However, it can be observed that:

\{\begin{matrix} E [δ A A x_{k}] & = vec (E [δ A A x_{k}]) = E [(x_{k}^{T} \otimes δ A) vec (A)] = E [x_{k}^{T} \otimes δ A] vec (A), \\ E [δ H A x_{k}] & = vec (E [δ H A x_{k}]) = E [(x_{k}^{T} \otimes δ H) vec (A)] = E [x_{k}^{T} \otimes δ H] vec (A), \end{matrix}

(33)

and that:

\{\begin{matrix} z_{k + 1} = & E [δ A x_{k + 1}] = vec (E [δ A I_{n} x_{k + 1}]) = E [x_{k + 1}^{T} \otimes δ A] vec (I_{n}), \\ t_{k + 1} = & E [δ H x_{k + 1}] = vec (E [δ H I_{n} x_{k + 1}]) = E [x_{k + 1}^{T} \otimes δ H] vec (I_{n}) . \end{matrix}

(34)

Therefore, the recursion can be written as:

\{\begin{matrix} z_{k + 1} = & E [x_{k + 1}^{T} \otimes δ A] vec (I_{n}) \approx E [x_{k}^{T} \otimes δ A] vec (A) + E [δ A δ A] {\bar{x}}_{k}^{n o m} + E [δ A δ B] u_{k}, \\ t_{k + 1} = & E [x_{k + 1}^{T} \otimes δ H] vec (I_{n}) \approx E [x_{k}^{T} \otimes δ H] vec (A) + E [δ H δ A] {\bar{x}}_{k}^{n o m} + E [δ H δ B] u_{k} . \end{matrix}

(35)

The recursion, as written in eq. (35), is still implicit; to be able to actually run it,

z_{k}

and

t_{k}

need to be expressed explicitly. This can be done as follows; first note, by means of the definition in eq. (34), that:

\{\begin{matrix} z_{k} & = E [x_{k}^{T} \otimes δ A] vec (I_{n}), \\ t_{k} & = E [x_{k}^{T} \otimes δ H] vec (I_{n}), \end{matrix}

(36)

then define:

\{\begin{matrix} Z_{k} & = E [x_{k}^{T} \otimes δ A] \in R^{n \times n^{2}}, \\ T_{k} & = E [x_{k}^{T} \otimes δ H] \in R^{m \times n^{2}}, \end{matrix}

(37)

the recursion for these newly introduced matrices is therefore the following:

\{\begin{matrix} Z_{k + 1} & \approx Z_{k} (A^{T} \otimes I_{n}) + ({\bar{x}}_{k}^{n o m} \otimes I_{n}) E [δ A^{T} \otimes δ A] + (u_{k}^{T} \otimes I_{n}) E [δ B^{T} \otimes δ A] \\ T_{k + 1} & \approx T_{k} (A^{T} \otimes I_{n}) + ({\bar{x}}_{k}^{n o m} \otimes I_{m}) E [δ A^{T} \otimes δ H] + (u_{k}^{T} \otimes I_{m}) E [δ B^{T} \otimes δ H] . \end{matrix}

(38)

The derivation of eq. (38) is detailed in the Appendix.

The following holds true for the expected values of state and output:

\{\begin{matrix} {\bar{x}}_{k + 1} & \approx A {\bar{x}}_{k} + B u_{k} + z_{k} \\ Z_{k + 1} & \approx Z_{k} (A^{T} \otimes I_{n}) + ({\bar{x}}_{k}^{n o m} \otimes I_{n}) E [δ A^{T} \otimes δ A] + (u_{k}^{T} \otimes I_{n}) E [δ B^{T} \otimes δ A] \\ z_{k} & = Z_{k} vec (I_{n}) \\ T_{k + 1} & \approx T_{k} (A^{T} \otimes I_{n}) + ({\bar{x}}_{k}^{n o m} \otimes I_{m}) E [δ A^{T} \otimes δ H] + (u_{k}^{T} \otimes I_{m}) E [δ B^{T} \otimes δ H] \\ t_{k} & = T_{k} vec (I_{n}) \\ {\bar{y}}_{k} & \approx H {\bar{x}}_{k} + t_{k} \end{matrix}

(39)

Equation (39) can be further simplified as follows by recognizing that

{\bar{x}}_{k}^{n o m} \approx {\bar{x}}_{k}

up to

O (δ θ^{2})

:

\{\begin{matrix} {\bar{x}}_{k + 1} & \approx A {\bar{x}}_{k} + B u_{k} + z_{k} \\ Z_{k + 1} & \approx Z_{k} (A^{T} \otimes I_{n}) + ({\bar{x}}_{k} \otimes I_{n}) E [δ A^{T} \otimes δ A] + (u_{k}^{T} \otimes I_{n}) E [δ B^{T} \otimes δ A] \\ z_{k} & = Z_{k} vec (I_{n}) \\ T_{k + 1} & \approx T_{k} (A^{T} \otimes I_{n}) + ({\bar{x}}_{k} \otimes I_{m}) E [δ A^{T} \otimes δ H] + (u_{k}^{T} \otimes I_{m}) E [δ B^{T} \otimes δ H] \\ t_{k} & = T_{k} vec (I_{n}) \\ {\bar{y}}_{k} & \approx H {\bar{x}}_{k} + t_{k} \end{matrix}

(40)

this simplification is very important as it allows reducing the computational burden by avoiding the introduction of yet another state

d_{k} = E [δ x_{k}]

to keep track of the difference between

{\bar{x}}_{k}

and

{\bar{x}}_{k}^{n o m}

as detailed in the appendix.

3. Steady-State Kalman Filter

Equations (15) and (40) are sufficient to calculate the steady-state Kalman filter when assuming

Q, R

and M to be time invariant (which will be discussed in Section 3.3) and that the system in (40) is stable (this is so because the homogeneous dynamics of

Z_{k}

and

T_{k}

are governed by the matrix

(A^{T} \otimes I_{n}) \in R^{n^{2} \times n^{2}}

whose eigenvalues are those of A each with multiplicity n; and the

{\bar{x}}_{k}

equation is driven by

z_{k}

which decays to a constant for constant input. Since all eigenvalues of A are strictly within the unit circle, all modes are stable.). Let the covariance matrix of the state estimation error, in steady-state, be denoted as P. It can be computed by solving the following generalized DARE (discrete algebraic Riccati equation) as reported in [2]:

P = A P A^{T} - A (P H^{T} + M) {(H P H^{T} + H M + M^{T} H^{T} + R)}^{- 1} (H P + M^{T}) A^{T} + Q

(41)

Once P is calculated, the steady-state Kalman gain K can be calculated as follows:

K = (P H^{T} + M) {(H P H^{T} + H M + M^{T} H^{T} + R)}^{- 1}

(42)

It is useful to introduce the following Choleski decomposition:

L L^{T} = Q - M R^{- 1} M^{T}

. A unique positive semi-definite solution P of eq. (41) exists under the following conditions [2]:

\{\begin{matrix} Q ⪰ 0, \\ R ≻ 0, \\ (A, H) detectable, \\ (A - M R^{- 1} H, L) stabilizable . \end{matrix}

(43)

In such a case the solution leads to a stable steady-state Kalman filter i.e. all eigenvalues of

(I_{n} - K H) A

are strictly inside the unit circle. The detectability condition is satisfied when all undetectable modes of

(A, H)

are stable. For a stable open-loop system (all eigenvalues of A strictly inside the unit circle), detectability is automatically satisfied. The stabilizability of

(A - M R^{- 1} H, L)

can be verified once Q, R, M are computed, typically by checking the controllability Gramian or by attempting to solve the DARE numerically. Their calculation is discussed in the remaining of this section.

3.1. Steady-State of the Augmented Model

It is not straightforward to calculate the asymptotic response (i.e. assuming

u_{k}

has been kept constant for long enough) from eq. (40) so a different way is presented here. Considering the first order perturbation

δ x_{k}

introduced in eq. (28) the additional states

z_{k}

and

t_{k}

can be written explicitly as:

\{\begin{matrix} z_{k} & = E [δ A x_{k}] \approx E [δ A δ x_{k}] = E [δ A {\tilde{δ A}}_{k - 1}] E [x_{0}] + E [δ A \sum_{j = 0}^{k - 1} A^{k - 1 - j} δ B] u_{j} \\ + E [δ A \sum_{j = 0}^{k - 1} {\tilde{δ A}}_{k - 2 - j}] B u_{j}, \\ t_{k} & = E [δ H x_{k}] \approx E [δ H δ x_{k}] = E [δ H {\tilde{δ A}}_{k - 1}] E [x_{0}] + E [δ H \sum_{j = 0}^{k - 1} A^{k - 1 - j} δ B] u_{j} \\ + E [δ H \sum_{j = 0}^{k - 1} {\tilde{δ A}}_{k - 2 - j}] B u_{j} . \end{matrix}

(44)

It is worth introducing the following:

W = {(I_{n} - A)}^{- 1}

(45)

The first term of both expressions in eq. (44), proportional to

E [x_{0}]

, vanishes as

k \to \infty

(the proof is reported in the appendix).

The steady-state values of the additional states read (with

u_{j} = u_{k} = u

):

\{\begin{matrix} z_{\infty} & \approx E [δ A W δ B] u_{k} + E [δ A W δ A] W B u_{k}, \\ t_{\infty} & \approx E [δ H W δ B] u_{k} + E [δ H W δ A] W B u_{k} . \end{matrix}

(46)

The proof of eq. (46) is also given in the appendix.

So the steady-state behaviour of the expected value is described by the following:

\{\begin{matrix} {\bar{x}}_{\infty} & \approx A {\bar{x}}_{\infty} + B u_{k} + z_{\infty}, \\ z_{\infty} & \approx E [δ A W δ B] u_{k} + E [δ A W δ A] W B u_{k}, \\ t_{\infty} & \approx E [δ H W δ B] u_{k} + E [δ H W δ A] W B u_{k} . \end{matrix}

(47)

The solution of the algebraic system in eq. (47) can be written as follows:

\{\begin{matrix} {\bar{x}}_{\infty} & \approx F_{x} u_{k} \\ z_{\infty} & \approx F_{z} u_{k} \\ t_{\infty} & \approx F_{t} u_{k} \end{matrix}

(48)

letting

F_{n o m} = {(I_{n} - A)}^{- 1} B = W B

be the nominal DC gain, the augmented model gains are:

\{\begin{matrix} F_{x} & = {(I_{n} - A)}^{- 1} (B + F_{z}) = F_{n o m} + W F_{z}, \\ F_{z} & = E [δ A W δ B] + E [δ A W δ A] F_{n o m}, \\ F_{t} & = E [δ H W δ B] + E [δ H W δ A] F_{n o m} . \end{matrix}

(49)

For the actual computation of the above gains, the following relationships are useful:

\{\begin{matrix} E [δ A W δ B] & = unvec (E [δ B^{T} \otimes δ A] vec (W)) \approx unvec [Σ_{(B A)}^{*} vec (W)], \\ E [δ A W δ A] & = unvec (E [δ A^{T} \otimes δ A] vec (W)) \approx unvec [Σ_{(A A)}^{*} vec (W)], \\ E [δ H W δ B] & = unvec (E [δ B^{T} \otimes δ H] vec (W)) \approx unvec [Σ_{(B H)}^{*} vec (W)], \\ E [δ H W δ A] & = unvec (E [δ A^{T} \otimes δ H] vec (W)) \approx unvec [Σ_{(A H)}^{*} vec (W)] . \end{matrix}

(50)

Finally the DC gains of the augmented model are expressed by the following:

\{\begin{matrix} F_{x} & = W (B + F_{z}) = F_{n o m} + W F_{z}, \\ F_{z} & \approx unvec [Σ_{(B A)}^{*} vec (W)] + unvec [Σ_{(A A)}^{*} vec (W)] F_{n o m}, \\ F_{t} & \approx unvec [Σ_{(B H)}^{*} vec (W)] + unvec [Σ_{(A H)}^{*} vec (W)] F_{n o m} . \end{matrix}

(51)

3.2. Calculation of Q, R and M

In accordance with the MVFOSM approach in the calculation of the Q, R and M matrices only contributions up to the second order in

δ A

,

δ B

and

δ H

, or, in general,

O (δ θ^{2})

will be considered. However, the terms

z_{k} z_{k}^{T}

,

t_{k} t_{k}^{T}

and

z_{k} t_{k}^{T}

although being

O (δ θ^{4})

will be retained to ensure that

Q_{k}

,

R_{k}

and

M_{k}

exactly represent the covariances of the equivalent noises:

E [(w_{k} - {\bar{w}}_{k}) {(w_{k} - {\bar{w}}_{k})}^{T}]

,

E [(v_{k} - {\bar{v}}_{k}) {(v_{k} - {\bar{v}}_{k})}^{T}]

and

E [(w_{k} - {\bar{w}}_{k}) {(v_{k} - {\bar{v}}_{k})}^{T}]

as required by standard Kalman filter.

The following matrices are going to be useful in the upcoming calculations:

J_{A_{c}}^{A} = J_{A} J_{A_{c}} \in R^{n^{2} \times p},

(52)

so

Σ_{δ A} = E [δ A δ A^{T}] = E [J_{A} J_{A_{c}} δ θ δ θ^{T} J_{A_{c}}^{T} J_{A}^{T}] \approx J_{A_{c}}^{A} E [δ θ δ θ^{T}] J_{A_{c}}^{A^{T}} = J_{A_{c}}^{A} Σ_{δ θ} J_{A_{c}}^{A^{T}} .

(53)

Furthermore:

Σ_{δ u} = E [δ u_{k} δ u_{k}^{T}],

(54)

where

Σ_{δ u} \in R^{q \times q}

.

It is also useful to define the following matrices:

\{\begin{matrix} \begin{matrix} Σ_{(A A)} & = Σ_{δ A} = E [δ A δ A^{T}] \approx J_{A_{c}}^{A} Σ_{δ θ} J_{A_{c}}^{A^{T}} \in R^{n^{2} \times n^{2}}, \\ Σ_{(B B)} & = Σ_{δ B} = E [δ B δ B^{T}] \approx V Σ_{δ θ} V^{T} \in R^{n q \times n q}, \\ Σ_{(H H)} & = Σ_{δ H} = E [δ H δ H^{T}] \approx J_{H} Σ_{δ θ} J_{H}^{T} \in R^{n m \times n m}, \\ Σ_{(A B)} & = E [δ A δ B^{T}] \approx J_{A_{c}}^{A} Σ_{δ θ} V^{T} \in R^{n^{2} \times n q}, \\ Σ_{(A H)} & = E [δ A δ H^{T}] \approx J_{A_{c}}^{A} Σ_{δ θ} J_{H}^{T} \in R^{n^{2} \times n m}, \\ Σ_{(B H)} & = E [δ B δ H^{T}] \approx V Σ_{δ θ} J_{H}^{T} \in R^{n q \times n m}, \\ Σ_{(A A)}^{*} & = E [δ A^{T} \otimes δ A] \approx r (Σ_{(A A)}), \\ Σ_{(B A)}^{*} & = E [δ B^{T} \otimes δ A] \approx r (Σ_{(A B)}), \\ Σ_{(A H)}^{*} & = E [δ A^{T} \otimes δ H] \approx r (Σ_{(A H)}), \\ Σ_{(B H)}^{*} & = E [δ B^{T} \otimes δ H] \approx r (Σ_{(B H)}), \end{matrix} \end{matrix}

(55)

where

r (.)

is a reshaping function that translates the entries of the un-starred matrices in eq. (55) into the entries of the starred matrices that appear in the augmented state. Detailed expression of

r (.)

is given in the appendix.

It is important to highlight that the equalities in eq. (55) have to be interpreted taking into account the abused notation that implicitly assumes vectorization and de-vectorization e.g. for

Σ_{(A B)} = E [δ A δ B^{T}]

the product

δ A δ B^{T}

does not ordinarily exist as

δ A \in R^{n \times n}

and

δ B^{T} \in R^{q \times n}

so they cannot be multiplied; for the calculation of

Σ_{(A B)}

one would more correctly need to write

Σ_{(A B)} = E [vec (δ A) vec (δ B^{T})] = E [vec [unvec (J_{A_{c}}^{A} δ θ)] vec [unvec (δ θ^{T} V^{T})]] = E [J_{A_{c}}^{A} δ θ δ θ^{T} V^{T}] = J_{A_{c}}^{A} Σ_{δ θ} V^{T}

and so on.

3.2.1. Calculation of Q

Neglecting the terms higher than quadratic it yields:

\begin{matrix} P_{w w_{k}} & \approx E [w_{k} w_{k}^{T}] \approx E [δ A x_{k} x_{k}^{T} δ A^{T}] + E [δ A x_{k} u_{k}^{T} δ B^{T}] + E [δ B u_{k} x_{k}^{T} δ A^{T}] + \\ + E [δ B u_{k} u_{k}^{T} δ B^{T}] + E [B δ u_{k} δ u_{k}^{T} B^{T}] + E [w_{k}^{*} w_{k}^{* T}], \end{matrix}

(56)

where the other terms are exactly zero because of statistical independence or approximately zero due to MVFOSM truncation.

The terms in eq. (56) can be expressed as follows:

\{\begin{matrix} \begin{matrix} E [δ A x_{k} x_{k}^{T} δ A^{T}] & \approx E [δ A {\bar{x}}_{k}^{n o m} {\bar{x}}_{k}^{n o m^{T}} δ A^{T}] \\ \approx (u_{k}^{T} \otimes I_{n}) (F_{n o m}^{T} \otimes I_{n}) Σ_{(A A)} (F_{n o m} \otimes I_{n}) (u_{k} \otimes I_{n}), \\ E [δ A x_{k} u_{k}^{T} δ B^{T}] & \approx E [δ A {\bar{x}}_{k}^{n o m} u_{k}^{T} δ B^{T}] \approx E [δ A F_{n o m} u_{k} u_{k}^{T} δ B^{T}] \\ \approx (u_{k}^{T} \otimes I_{n}) (F_{n o m}^{T} \otimes I_{n}) Σ_{(A B)} (u_{k} \otimes I_{n}), \\ E [δ B u_{k} x_{k}^{T} δ A^{T}] & \approx E [δ B u_{k} {\bar{x}}_{k}^{n o m^{T}} δ A^{T}] \approx E [δ B u_{k} u_{k}^{T} F_{n o m}^{T} δ A^{T}] \\ \approx (u_{k}^{T} \otimes I_{n}) Σ_{(A B)}^{T} (F_{n o m} \otimes I_{n}) (u_{k} \otimes I_{n}), \\ E [δ B u_{k} u_{k}^{T} δ B^{T}] & = (u_{k}^{T} \otimes I_{n}) E [δ B δ B^{T}] (u_{k} \otimes I_{n}) \approx (u_{k}^{T} \otimes I_{n}) Σ_{(B B)} (u_{k} \otimes I_{n}), \\ E [B δ u_{k} δ u_{k}^{T} B^{T}] & = B E [δ u_{k} δ u_{k}^{T}] B^{T} = B Σ_{δ u} B^{T}, \\ E [w_{k}^{*} w_{k}^{* T}] & = Σ_{w_{k}^{*}} = Q_{w^{*}} \end{matrix} \end{matrix}

(57)

where it has been considered that

x_{k} = x_{k}^{n o m} + δ x_{k}^{f u l l}

and

δ x_{k}^{f u l l}

contains all the terms depending on

δ A

and

δ B

; when calculating the second moments only the term

x_{k}^{n o m}

contributes up to the second order as

δ x_{k}^{f u l l}

would only contribute to third order or higher. As an example when computing

E [δ A x_{k} x_{k}^{T} δ A^{T}

], expanding the products yields:

☐: $E [δ A x_{k}^{n o m} x_{k}^{n o m^{T}} δ A^{T}]$ : $O (δ θ^{2})$ - retained,
☐: $E [δ A x_{k}^{n o m} δ x_{k}^{f u l l^{T}} δ A^{T}]$ and transpose: $O (δ θ^{3})$ - neglected
☐: $E [δ A δ x_{k}^{f u l l} δ x_{k}^{f u l l^{T}} δ A^{T}]$ : $O (δ θ^{4})$ -neglected.

It is important to note that

x_{k}^{n o m}

depends only on

x_{0}

(and the history of

u_{k}

that is however deterministic) which is independent of

δ θ

and therefore of

δ A

,

δ B

and

δ H

so

E [δ A x_{k}^{n o m} x_{k}^{n o m^{T}} δ A^{T}] = E [δ A {\bar{x}}_{k}^{n o m} {\bar{x}}_{k}^{n o m^{T}} δ A^{T}]

. Thus, retaining only second-order terms,

{\bar{x}}_{k}^{n o m} = E [x_{k}^{n o m}]

is used in place of

x_{k}

, which is consistent with MVFOSM-like truncation. Furthermore it has also been considered that the terms containing

A^{k} E [x_{0}]

vanishes in steady-state (as A has eigenvalues strictly inside the unit circle).

As already done for the augmented state,

{\bar{x}}_{k}^{n o m}

can be approximated by

{\bar{x}}_{k}

leading to:

\{\begin{matrix} \begin{matrix} E [δ A x_{k} x_{k}^{T} δ A^{T}] & \approx E [δ A {\bar{x}}_{k} {\bar{x}}_{k}^{T} δ A^{T}] \approx (u_{k}^{T} \otimes I_{n}) (F_{x}^{T} \otimes I_{n}) Σ_{(A A)} (F_{x} \otimes I_{n}) (u_{k} \otimes I_{n}), \\ E [δ A x_{k} u_{k}^{T} δ B^{T}] & \approx E [δ A {\bar{x}}_{k} u_{k}^{T} δ B^{T}] \approx E [δ A F_{x} u_{k} u_{k}^{T} δ B^{T}] \\ \approx (u_{k}^{T} \otimes I_{n}) (F_{x}^{T} \otimes I_{n}) Σ_{(A B)} (u_{k} \otimes I_{n}), \\ E [δ B u_{k} x_{k}^{T} δ A^{T}] & \approx E [δ B u_{k} {\bar{x}}_{k}^{T} δ A^{T}] \approx E [δ B u_{k} u_{k}^{T} F_{x}^{T} δ A^{T}] \\ \approx (u_{k}^{T} \otimes I_{n}) Σ_{(A B)}^{T} (F_{x} \otimes I_{n}) (u_{k} \otimes I_{n}), \\ E [δ B u_{k} u_{k}^{T} δ B^{T}] & = (u_{k}^{T} \otimes I_{n}) E [δ B δ B^{T}] (u_{k} \otimes I_{n}) \approx (u_{k}^{T} \otimes I_{n}) Σ_{(B B)} (u_{k} \otimes I_{n}), \\ E [B δ u_{k} δ u_{k}^{T} B^{T}] & = B E [δ u_{k} δ u_{k}^{T}] B^{T} = B Σ_{δ u} B^{T}, \\ E [w_{k}^{*} w_{k}^{* T}] & = Σ_{w_{k}^{*}} = Q_{w^{*}} . \end{matrix} \end{matrix}

(58)

In the remainder of the paper this approximation is going to be adopted for the covariances computation by using the DC gain

F_{x}

instead of

F_{n o m}

.

The following time invariant covariance matrix is useful for the upcoming computations:

Q_{0} = Q_{w^{*}} + B Σ_{δ u} B^{T} .

(59)

Thus, the full expression of

P_{w w_{k}}

can be written as:

\begin{matrix} P_{w w_{k}} & \approx Q_{0} + (u_{k}^{T} \otimes I_{n}) P_{u} (u_{k} \otimes I_{n}) \\ P_{u} & = (F_{x}^{T} \otimes I_{n}) Σ_{(A A)} (F_{x} \otimes I_{n}) + (F_{x}^{T} \otimes I_{n}) Σ_{(A B)} + Σ_{(A B)}^{T} (F_{x} \otimes I_{n}) + Σ_{(B B)} . \end{matrix}

(60)

Finally:

Q_{k}^{t h} \approx P_{w w_{k}} - z_{k} z_{k}^{T} = P_{w w_{k}} - F_{z} u_{k} u_{k}^{T} F_{z}^{T} .

(61)

3.2.2. Calculation of R

P_{v v_{k}} = E [v_{k} v_{k}^{T}] = E [δ H x_{k} x_{k}^{T} δ H^{T}] + E [v_{k}^{*} v_{k}^{* T}] \approx E [δ H {\bar{x}}_{k}^{n o m} {\bar{x}}_{k}^{n o m^{T}} δ H^{T}] + E [v_{k}^{*} v_{k}^{* T}],

(62)

as the expectations

E [δ H x_{k} v_{k}^{* T}]

and

E [v_{k}^{*} x_{k}^{T} δ H^{T}]

are both (strictly) zero.

The individual contributions are (again using

{\bar{x}}_{k}

instead of

{\bar{x}}_{k}^{n o m}

):

\{\begin{matrix} \begin{matrix} E [δ H {\bar{x}}_{k} {\bar{x}}_{k}^{T} δ H^{T}] & \approx E [δ H F_{x} u_{k} u_{k}^{T} F_{x}^{T} δ H^{T}] \\ \approx (u_{k}^{T} \otimes I_{m}) (F_{x}^{T} \otimes I_{m}) Σ_{(H H)} (F_{x} \otimes I_{m}) (u_{k} \otimes I_{m}), \\ E [v_{k}^{*} v_{k}^{* T}] & = Σ_{v^{*}} = R_{v^{*}} . \end{matrix} \end{matrix}

(63)

Finally:

P_{v v_{k}} \approx R_{v^{*}} + (u_{k}^{T} \otimes I_{m}) (F_{x}^{T} \otimes I_{m}) Σ_{(H H)} (F_{x} \otimes I_{m}) (u_{k} \otimes I_{m}),

(64)

and:

R_{k}^{t h} \approx P_{v v_{k}} - t_{k} t_{k}^{T} = P_{v v_{k}} - F_{t} u_{k} u_{k}^{T} F_{t}^{T} .

(65)

3.2.3. Calculation of M

\begin{matrix} P_{w v_{k}} & = E [w_{k} v_{k}^{T}] = E [δ A x_{k} x_{k}^{T} δ H^{T}] + E [δ B u_{k} x_{k}^{T} δ H^{T}] \\ \approx E [δ A {\bar{x}}_{k}^{n o m} {\bar{x}}_{k}^{n o m^{T}} δ H^{T}] + E [δ B u_{k} {\bar{x}}_{k}^{T} δ H^{T}], \end{matrix}

(66)

as the expectations

E [δ A x_{k} v_{k}^{* T}]

,

E [δ B u_{k} v_{k}^{* T} δ H^{T}]

,

E [B δ u_{k} x_{k}^{T} δ H^{T}]

and

E [B δ u_{k} v_{k}^{* T}]

are all (strictly) zero.

The non zero contributions are:

\{\begin{matrix} \begin{matrix} E [δ A {\bar{x}}_{k} {\bar{x}}_{k}^{T} δ H^{T}] & \approx E [δ A F_{x} u_{k} u_{k}^{T} F_{x}^{T} δ H^{T}] \approx (u_{k}^{T} \otimes I_{n}) (F_{x}^{T} \otimes I_{n}) Σ_{(A H)} (F_{x} \otimes I_{m}) (u_{k} \otimes I_{m}), \\ E [δ B u_{k} {\bar{x}}_{k}^{T} δ H^{T}] & \approx E [δ B u_{k} u_{k}^{T} F_{x}^{T} δ H^{T}] \approx (u_{k}^{T} \otimes I_{n}) Σ_{(B H)} (F_{x} \otimes I_{m}) (u_{k} \otimes I_{m}) . \end{matrix} \end{matrix}

(67)

Finally:

P_{w v_{k}} \approx (u_{k}^{T} \otimes I_{n}) [(F_{x}^{T} \otimes I_{n}) Σ_{(A H)} (F_{x} \otimes I_{m}) + Σ_{(B H)} (F_{x} \otimes I_{m})] (u_{k} \otimes I_{m}),

(68)

and:

M_{k} \approx P_{w v_{k}} - z_{k} t_{k}^{T} = P_{w v_{k}} - F_{z} u_{k} u_{k}^{T} F_{t}^{T} .

(69)

3.3. Time Invariant Covariances

As derived in the previous sections Q, R and M have components proportional to

u_{k}

(or conversely proportional to

U_{k - 1}^{*}

) as such they are time-varying matrices. A time-varying Kalman filter could be straightforwardly implemented if desired and computationally compatible with real time implementation as briefly discussed in Section 4. However, the purpose of this work is to design a steady-state Kalman filter; in order to do that, several choices are possible with different level of robustness.

The degree of robustness can, as an example, be tuned by properly choosing the value of the nominal input u.

For steady-state filter design, time-invariant covariance matrices Q, R and M are sought. Since these depend on

u_{k}

(eqs. (60), (64), (68)), a representative constant value must be selected. Two common approaches:

☐: Worst-case robustness: Set $u = u_{m a x}$ (componentwise) to bound uncertainties for all operating conditions,
☐: Average-case design: For periodic inputs, use $u = u_{r m s}$ to match typical operating conditions.

The choice represents a trade-off: larger u increases Q, R, M, making the filter more conservative (higher estimation uncertainty) but more robust to parameter variations. The designer should select u based on the application’s robustness requirements and typical operating regime.

It can happen that, due to the different approximations introduced,

Q_{k}^{t h}

in eq. (61) is not positive semi-definite and/or

R_{k}^{t h}

in eq. (65) is not positive definite. If that is the case, one needs to regularize them to numerically solve the DARE equation as follows:

\begin{matrix} Q_{k} & = Q_{k}^{t h} + ϵ_{Q} I_{n}, where ϵ_{Q} = max (0, | λ_{-}^{m a x} (Q_{k}^{t h}) |), \\ R_{k} & = R_{k}^{t h} + ϵ_{R} I_{m}, where ϵ_{R} = max (ϵ, | λ_{-}^{m a x} (R_{k}^{t h}) |) . \end{matrix}

(70)

and

ϵ

is a sufficiently small positive number.

3.4. Filter Implementation

It is useful to recall the following property of the a priori and a posteriori estimations:

\{\begin{matrix} {\hat{x}}_{k}^{+} = E [x_{k} | y_{1} \dots y_{k}], \\ {\hat{x}}_{k}^{-} = E [x_{k} | y_{1} \dots y_{k - 1}] . \end{matrix}

(71)

It is important to note that for the "Predict Phase"

x_{k}

can be approximated by

{\hat{x}}_{k}^{+}

since in this phase the measurement

y_{k}

is available; whereas for the "Update/Correction Phase"

x_{k}

can be approximated by

{\hat{x}}_{k}^{-}

as only measurements up to

y_{k - 1}

are available. Furthermore,

{\hat{z}}_{k}

and

{\hat{t}}_{k}

are the estimations of

z_{k}

and

t_{k}

respectively.

3.4.1. Predict Phase

\{\begin{matrix} {\hat{x}}_{k + 1}^{-} & = A {\hat{x}}_{k}^{+} + B u_{k} + {\hat{z}}_{k}, \\ {\hat{Z}}_{k + 1} & = {\hat{Z}}_{k} (A^{T} \otimes I_{n}) + ({\hat{x}}_{k}^{+^{T}} \otimes I_{n}) Σ_{(A A)}^{*} + (u_{k}^{T} \otimes I_{n}) Σ_{(B A)}^{*}, \\ {\hat{T}}_{k + 1} & = {\hat{T}}_{k} (A^{T} \otimes I_{n}) + ({\hat{x}}_{k}^{+^{T}} \otimes I_{m}) Σ_{(A H)}^{*} + (u_{k}^{T} \otimes I_{m}) Σ_{(B H)}^{*} . \end{matrix}

(72)

The initial values

{\hat{Z}}_{0}

and

{\hat{T}}_{0}

are both zero.

3.4.2. Update/Correction Phase

\{\begin{matrix} {\hat{z}}_{k} & = Z_{k} vec (I_{n}), \\ {\hat{t}}_{k} & = T_{k} vec (I_{n}), \\ {\hat{x}}_{k}^{+} & = {\hat{x}}_{k}^{-} + K [y_{k} - (H {\hat{x}}_{k}^{-} + {\hat{t}}_{k})] . \end{matrix}

(73)

It can be noticed that the correction does not depend only on the (steady-state) Kalman gain K but also on the estimation of the output bias

{\hat{t}}_{k}

.

3.5. Computational Considerations and Basic MVFOSM Implementation

As it can be seen in eqs. (72) and (73), even if there is no need to update the state estimation covariance and therefore the Kalman gain, to fully profit of the proposed robust filter one needs to update in real time the estimations

{\hat{z}}_{k}

and

{\hat{t}}_{k}

and therefore, during the "Predict Phase", one needs to compute an additional

n \times n^{2}

matrix to keep track of

{\hat{Z}}_{k}

and an additional

m \times n^{2}

matrix to keep track of

{\hat{T}}_{k}

with respect to the canonical steady-state Kalman filter (the additional computational burden during the "Update/Correction Phase" is negligible 1 ).

A possible simplification can be realized by strictly following the MVFOSM framework i.e. performing:

☐: Zeroth-order approximation for means: $E [x_{k}] = {\bar{x}}_{k} \approx E [x_{k}^{n o m}]$ , therefore neglecting $O (δ θ^{2})$ corrections
☐: Second-order approximation for covariances: retaining all terms up to $O (δ θ^{2})$ in Q, R and M

This would mean accepting a biased state estimation by considering

{\hat{z}}_{k} = 0

and

{\hat{t}}_{k} = 0

, in this case the resulting robust steady-state Kalman filter would reduce to the canonical Kalman filter described in eq. (2) with

K_{k} = K

, where K is still the solution of DARE in eq. (42) where one used

Q = P_{w w_{k}}

,

R = P_{v v_{k}}

and

M = P_{w v_{k}}

; indeed the terms

{\hat{z}}_{k} {\hat{z}}_{k}^{T}

,

{\hat{t}}_{k} {\hat{t}}_{k}^{T}

and

{\hat{z}}_{k} {\hat{t}}_{k}^{T}

would be neglected too as they are

O (δ θ^{4})

.

\{\begin{matrix} \begin{matrix} {\hat{x}}_{k + 1}^{-} & = A {\hat{x}}_{k}^{+} + B u_{k} & predict phase \\ {\hat{x}}_{k}^{+} & = {\hat{x}}_{k}^{-} + K (y_{k} - H {\hat{x}}_{k}^{-}) & correction / update phase \end{matrix} \end{matrix}

(74)

4. Time-Varying Kalman Filter

If computationally feasible a robust time-varying filter can be implemented quite straightforwardly, as briefly sketched in the following. Furthermore, if the original uncertain system is also time-varying but having uncertainties with zero first order moment and constant second order moment, a time-varying filter is the only available option (where one could substitute

A_{k}

for A,

B_{k}

for B and

H_{k}

for H).

4.1. Predict Phase

1.: Calculate $Q_{k}$ as follows:

$\begin{matrix} Q_{k} & \approx Q_{0} + ({\hat{x}}_{k}^{+^{T}} \otimes I_{n}) Σ_{(A B)} (u_{k} \otimes I_{n}) + (u_{k}^{T} \otimes I_{n}) Σ_{(A B)}^{T} ({\hat{x}}_{k}^{+} \otimes I_{n}) \\ + ({\hat{x}}_{k}^{+^{T}} \otimes I_{n}) Σ_{(A A)} ({\hat{x}}_{k}^{+} \otimes I_{n}) + (u_{k}^{T} \otimes I_{n}) Σ_{(B B)} (u_{k} \otimes I_{n}) - {\hat{z}}_{k} {\hat{z}}_{k}^{T} . \end{matrix}$

(75)
2.: Perform the prediction step:

$\{\begin{matrix} {\hat{x}}_{k + 1}^{-} & = A {\hat{x}}_{k}^{+} + B u_{k} + {\hat{z}}_{k}, \\ {\hat{Z}}_{k + 1} & = {\hat{Z}}_{k} (A^{T} \otimes I_{n}) + ({\hat{x}}_{k}^{+^{T}} \otimes I_{n}) Σ_{(A A)}^{*} + (u_{k}^{T} \otimes I_{n}) Σ_{(B A)}^{*}, \\ {\hat{T}}_{k + 1} & = {\hat{T}}_{k} (A^{T} \otimes I_{n}) + ({\hat{x}}_{k}^{+^{T}} \otimes I_{m}) Σ_{(A H)}^{*} + (u_{k}^{T} \otimes I_{m}) Σ_{(B H)}^{*}, \\ P_{k + 1}^{-} & = A P_{k}^{+} A^{T} + Q_{k} . \end{matrix}$

(76)

4.2. Update/Correction Phase

1.: Calculate $R_{k}$ and $M_{k}$ as follows:

$R_{k} \approx R_{v^{*}} + ({\hat{x}}_{k}^{-^{T}} \otimes I_{m}) Σ_{(H H)} ({\hat{x}}_{k}^{-} \otimes I_{m}) - {\hat{t}}_{k} {\hat{t}}_{k}^{T},$

(77)

and

$M_{k} \approx ({\hat{x}}_{k}^{-^{T}} \otimes I_{n}) Σ_{(A H)} ({\hat{x}}_{k}^{-} \otimes I_{m}) + (u_{k}^{T} \otimes I_{n}) Σ_{(B H)} ({\hat{x}}_{k}^{-} \otimes I_{m}) - {\hat{z}}_{k} {\hat{t}}_{k}^{T} .$

(78)
2.: Perform the update/correction step (based on the most general formulation of Kalman filter in [2]):

$\{\begin{matrix} S_{k} & = H P_{k}^{-} H^{T} + H M_{k} + M_{k}^{T} H^{T} + R_{k}, \\ K_{k} & = (P_{k}^{-} H^{T} + M_{k}) S_{k}^{- 1}, \\ {\hat{z}}_{k} & = Z_{k} vec (I_{n}), \\ {\hat{t}}_{k} & = T_{k} vec (I_{n}), \\ {\hat{x}}_{k}^{+} & = {\hat{x}}_{k}^{-} + K_{k} [y_{k} - (H {\hat{x}}_{k}^{-} + {\hat{t}}_{k})], \\ P_{k}^{+} & = (I_{n} - K_{k} H) P_{k}^{-} {(I_{n} - K_{k} H)}^{T} + K_{k} (H M_{k} + M_{k}^{T} H^{T} + R_{k}) K_{k}^{T} - K_{k} M_{k}^{T} - M_{k} K_{k}^{T} . \end{matrix}$

(79)

5. Conclusions

A practical and systematic methodology for robust steady-state Kalman filter design for uncertain LTI systems has been presented. Building on De Koning’s classical framework, the key contribution is the explicit derivation, within an MVFOSM-like approximation, of input-dependent (and state-dependent) equivalent noise covariance matrices

Q, R

and M that systematically incorporate second-order statistics of parametric uncertainty alongside the usual process and measurement noise statistics. The resulting steady-state filter is obtained by solving a standard generalized DARE, requiring no iterative robust optimization.

A notable feature of the methodology is that filter robustness is governed by a single physically interpretable design parameter: the nominal input magnitude

∥ u ∥

used to evaluate the input-dependent covariance contributions. Selecting

u = u_{max}

yields a worst-case robust design; selecting

u = u_{rms}

targets average operating conditions. This transparent parametrization distinguishes the proposed approach from LMI- or regularization-based methods that require less intuitive tuning.

Two levels of approximation have been detailed. The simpler formulation (standard MVFOSM truncation,

z_{k} = 0

and

t_{k} = 0

) yields a biased estimator but retains the canonical Kalman filter prediction-correction structure, making it directly deployable with minimal implementation overhead. The refined formulation tracks second-order bias corrections via an state, improving mean estimate accuracy at the cost of increased memory and computation. An extension to the time-varying case — which replaces the fixed nominal input with the actual time-varying

u_{k}

and propagates the error covariance recursively — is also outlined, offering reduced conservatism where computational resources permit.

The methodology is especially relevant to embedded control in power electronics, where component tolerances are often significant, steady-state implementations are strongly preferred, and the physical interpretation of uncertainty parameters (component tolerance bounds) maps naturally onto the required second-order statistics via the GUM framework. Future work will focus on experimental validation in a power converter testbench.

Appendix A: Summary of Assumptions

All the assumptions mentioned in the paper are summarized here for the readers’ convenience.

Appendix System Structure and Modelling

label=(0): Invertibility of $A_{c}$ : The continuous-time system matrix $A_{c}$ is assumed invertible, allowing the expression:

$B = A_{c}^{- 1} (A - I_{n}) B_{c} .$

(This assumption simplifies the derivation of $J_{B}$ ; if $A_{c}$ is singular, the integral form must be used and derivatives computed accordingly.)
lbbel=(0): Parametric Uncertainty Structure: The matrices $A_{c}$ , $B_{c}$ , and H depend on a physical parameter vector $θ \in R^{p}$ :

$θ = \bar{θ} + δ θ,$

where $\bar{θ}$ is the nominal (mean) value and $δ θ$ is a zero-mean random vector with known covariance $Σ_{δ θ} = E [δ θ δ θ^{T}]$ .
lcbel=(0): Small Parameter Variations: The parameter uncertainty is small in the sense that $∥ δ θ ∥ ≪ ∥ \bar{θ} ∥$ , justifying first-order Taylor expansions (MVFOSM-like framework).

Appendix Noise Characteristics

Zero-Mean Gaussian Process Noise: The continuous-time process noise

w^{*} (t)

is zero-mean, white, and Gaussian with covariance:

E [w^{*} (t) w^{*} {(τ)}^{T}] = Q_{w^{*}}^{c} δ (t - τ), Q_{w^{*}}^{c} ⪰ 0 .

Its discrete-time counterpart

w_{k}^{*}

is zero-mean, white, Gaussian with covariance

Q_{w^{*}}

given by:

Q_{w^{*}} = \int_{0}^{T} e^{A_{c} (T - τ)} Q_{w^{*}}^{c} e^{A_{c}^{T} (T - τ)} d τ .

Zero-Mean Gaussian Measurement Noise: The continuous-time measurement noise

v^{*} (t)

is zero-mean, white, and Gaussian with covariance:

E [v^{*} (t) v^{*} {(τ)}^{T}] = R_{v^{*}}^{c} δ (t - τ), R_{v^{*}}^{c} ≻ 0 .

Its discrete-time counterpart

v_{k}^{*}

is zero-mean, white, Gaussian with covariance

R_{v^{*}} = \frac{R_{v^{*}}^{c}}{T}

.

Input Noise: The control input is corrupted by additive noise

δ u_{k}

, zero-mean with covariance

Σ_{δ u} = E [δ u_{k} δ u_{k}^{T}]

, independent of

δ θ

,

w_{k}^{*}

, and

v_{k}^{*}

.

Statistical Independence: The following independence conditions hold:

☐: $δ θ$ is independent of $x_{0}$ , $δ u_{k}$ , $w_{k}^{*}$ , and $v_{k}^{*}$ .
☐: $w_{k}^{*}$ and $v_{k}^{*}$ are mutually independent and independent of $x_{0}$ , $δ θ$ , and $δ u_{k}$ .
☐: $δ u_{k}$ is independent of $x_{0}$ , $δ θ$ , $w_{k}^{*}$ , and $v_{k}^{*}$ .

☐

Time-Invariant Noise Statistics: The covariance matrices

Q_{w^{*}}

,

R_{v^{*}}

, and

Σ_{δ u}

are constant (stationary noise processes).

Appendix Mathematical Approximations and Methodological Assumptions

First-Order Taylor Expansion of Matrices: Variations in $A_{c}$ , $B_{c}$ , and H are approximated to first order:

$δ A_{c} \approx J_{A_{c}} δ θ, δ B_{c} \approx J_{B_{c}} δ θ, δ H \approx J_{H} δ θ,$

where Jacobians $J_{A_{c}} = \frac{\partial vec (A_{c})}{\partial θ^{T}}$ , etc., are evaluated at $\bar{θ}$ .
Discretization of Uncertainties: The discrete-time variations are obtained via chain rule:

$δ A \approx J_{A} J_{A_{c}} δ θ, δ B \approx [(I_{q} \otimes G) J_{B_{c}} + J_{B} J_{A_{c}}] δ θ,$

with $G = A_{c}^{- 1} (A - I_{n})$ and Jacobians $J_{A} = \frac{\partial vec (A)}{\partial vec {(A_{c})}^{T}}$ , $J_{B} = \frac{\partial vec (B)}{\partial vec {(A_{c})}^{T}}$ .
MVFOSM-like Truncation and Extensions: The approximation framework employed in this work extends the standard Mean Value First-Order Second-Moment method:

Standard MVFOSM approximation:

☐

Zeroth-order means: $E [x_{k}] \approx E [x_{k}^{nom}]$ (neglecting $O (δ θ^{2})$ bias corrections).

☐

Second-order covariances: Retain all terms up to $O (δ θ^{2})$ in $E [x_{k} x_{k}^{T}]$ and so on.

☐

Neglect of higher-order terms: Terms $O (δ θ^{3})$ and above are discarded.

This work’s enhancement:

☐

First-order means: $E [x_{k}] \approx E [x_{k}^{nom}] + E [δ x_{k}]$ where $E [δ x_{k}]$ is $O (δ θ^{2})$ .

☐

Explicitly track bias terms: $z_{k} = E [δ A x_{k}]$ and $t_{k} = E [δ H x_{k}]$ .

☐

Second-order covariances: same as MVFOSM, i.e. retain $O (δ θ^{2})$ , but corrected for $z_{k} z_{k}^{T}$ , $t_{k} t_{k}^{T}$ and $z_{k} t_{k}^{T}$ .

☐

Neglect of higher-order terms: Same as MVFOSM, discard $O (δ θ^{3})$ and above

The key distinction is that second-order bias corrections $z_{k}$ and $t_{k}$ are explicitly computed and tracked via the augmented state representation, rather than being neglected as in standard MVFOSM. This improves mean estimate accuracy at the computational cost of maintaining $Z_{k} \in R^{n \times n^{2}}$ and $T_{k} \in R^{m \times n^{2}}$ . When computational constraints are severe, the standard MVFOSM approach can be recovered by setting $z_{k} = 0$ and $t_{k} = 0$ (Section 3.5).
Steady-State Approximation for Mean State (Covariance Computation):

Two distinct but related approximations are employed for the mean state in the computation of the covariance matrices Q, R, and M.

(a) Approximation ${\bar{x}}_{k} \approx {\bar{x}}_{k}^{nom}$ (MVFOSM consistency):

The true conditional mean ${\bar{x}}_{k} = E [x_{k}]$ differs from the nominal trajectory ${\bar{x}}_{k}^{n o m}$ by an $O (δ θ^{2})$ correction:

${\bar{x}}_{k} = {\bar{x}}_{k}^{n o m} + \underset{O (δ θ^{2})}{\underset{︸}{E [δ x_{k}]}} .$

Within the MVFOSM framework, only second-order terms in $δ θ$ are retained in the covariances. Substituting ${\bar{x}}_{k}^{nom}$ for ${\bar{x}}_{k}$ in expressions such as $E [δ A {\bar{x}}_{k} {\bar{x}}_{k}^{T} δ A^{T}]$ introduces an error of order $O (δ θ^{4})$ , which is consistently neglected. This approximation is therefore exact at the order of truncation adopted throughout this work.

(b) Steady-state DC-gain substitution ${\bar{x}}_{k}^{nom} \approx F_{x} u_{k}$ (time-invariance of Q, R, M):

For stable A (spectral radius $ρ (A) < 1$ ) and a constant input $u_{k} \equiv u$ , the nominal trajectory ${\bar{x}}_{k}^{nom} = A^{k} E [x_{0}] + \sum_{j = 0}^{k - 1} A^{k - 1 - j} B u$ converges to the DC steady state $F_{x} u = {(I_{n} - A)}^{- 1} (B + F_{z}) u$ as $k \to \infty$ , since the transient $A^{k} E [x_{0}] \to 0$ exponentially. Substituting ${\bar{x}}_{k}^{nom} \approx F_{x} u$ into eqs. (60)–(68) renders Q, R, and M time-invariant, which is a prerequisite for solving the DARE and obtaining a fixed Kalman gain. The nominal input u serves as the sole robustness tuning parameter (Section 3.3).
Neglect of State-Dependent Higher-Order Terms: In covariance calculations (e.g., $E [δ A x_{k} x_{k}^{T} δ A^{T}]$ ), only the nominal state $x_{k}^{n o m}$ is retained; contributions from $δ x_{k}^{f u l l}$ are $O (δ θ^{3})$ or higher and are neglected.

Appendix Stability and Existence Conditions

Nominal Stability: The nominal discrete-time matrix A has all eigenvalues strictly inside the unit circle ( $ρ (A) < 1$ where $ρ$ is the spectral radius).
Detectability: The pair $(A, H)$ is detectable. Given A is stable, this condition is automatically satisfied.
Stabilizability: The pair $(A - M R^{- 1} H, L)$ is stabilizable, where L satisfies $L L^{T} = Q - M R^{- 1} M^{T}$ .
Positive-Definiteness of R: The augmented measurement noise covariance R is positive definite.
Positive Semi-Definiteness of Q: The matrix Q is positive semi-definite, ensuring a valid solution to the DARE.
Regularization of Q and R: The theoretical covariances $Q_{k}^{t h} = P_{w w_{k}} - z_{k} z_{k}^{T}$ and $R_{k}^{t h} = P_{v v_{k}} - t_{k} t_{k}^{T}$ may not satisfy the required definiteness conditions ( $Q ⪰ 0$ , $R ≻ 0$ ) due to numerical errors from approximation truncation and finite precision arithmetic. When this occurs, regularization is applied as specified in eq. (70) :

$\begin{matrix} Q_{k} & = Q_{k}^{th} + ε_{Q} I_{n}, where ε_{Q} = max (0, | λ_{-}^{m a x} (Q_{k}^{th}) |), \\ R_{k} & = R_{k}^{th} + ε_{R} I_{m}, where ε_{R} = max (ε, | λ_{-}^{m a x} (R_{k}^{th}) |) . \end{matrix}$

Here $λ_{-}^{m a x} (\cdot)$ denotes the most negative eigenvalue, and $ε > 0$ is a small positive constant (typically $10^{- 10}$ ) ensuring strict positive definiteness of R, which is required for the innovation covariance to be invertible in the Kalman gain computation (eq. (42) ). The regularization preserves the approximate nature of Q and R while ensuring numerical stability of the DARE solution.

Appendix Practical Implementation Assumptions

Time-Invariant Covariances for Steady-State Filter: For steady-state filter design, Q, R, and M are approximated as constant by selecting a representative constant input u (e.g., worst-case $u_{max}$ or RMS value $u_{rms}$ ).
Jacobian Computability: The Jacobian matrices $J_{A_{c}}, J_{B_{c}}, J_{H}, J_{A}, J_{B}$ can be computed analytically or numerically.
Known Statistical Moments: The covariance $Σ_{δ θ}$ is known (second moment). No specific distribution is assumed beyond zero mean and finite second moment although in practice, for parameters, tolerances can be assumed as drawn from uniform distributions.
Discretization Accuracy: The discretization period T is sufficiently small to accurately capture continuous-time dynamics and noise properties.

Appendix B: Some Mathematical Derivations

Appendix B.1 Derivation of J B Expression

The relationship given in eq. (21) is:

J_{B} = \frac{\partial vec (B)}{\partial vec {(A_{c})}^{T}} = (B_{c}^{T} \otimes A_{c}^{- 1}) J_{A} - (B^{T} \otimes A_{c}^{- 1}),

where

J_{A} = \frac{\partial vec (A)}{\partial vec {(A_{c})}^{T}}

and B is defined as

B = G B_{c}, G = A_{c}^{- 1} (A - I_{n}) .

Starting from the definition

B = A_{c}^{- 1} (A - I_{n}) B_{c}

, its differential is :

d B = d (A_{c}^{- 1}) (A - I_{n}) B_{c} + A_{c}^{- 1} d A B_{c} .

Using the identity

d (A_{c}^{- 1}) = - A_{c}^{- 1} (d A_{c}) A_{c}^{- 1}

(valid for an invertible matrix), the first term becomes

- A_{c}^{- 1} (d A_{c}) A_{c}^{- 1} (A - I_{n}) B_{c} .

Now

A_{c}^{- 1} (A - I_{n}) = G

can be substituted to simplify:

- A_{c}^{- 1} (d A_{c}) G B_{c} = - A_{c}^{- 1} (d A_{c}) B .

Thus the differential is

d B = - A_{c}^{- 1} (d A_{c}) B + A_{c}^{- 1} (d A) B_{c} .

To convert this to a vectorized form, the

vec

operator is applied together with the identity in eq. (80):

d vec (B) = - (B^{T} \otimes A_{c}^{- 1}) d vec (A_{c}) + (B_{c}^{T} \otimes A_{c}^{- 1}) d vec (A) .

Now observe that

d vec (A) = J_{A} d vec (A_{c})

by the definition of

J_{A}

. Substituting this gives

d vec (B) = - (B^{T} \otimes A_{c}^{- 1}) d vec (A_{c}) + (B_{c}^{T} \otimes A_{c}^{- 1}) J_{A} d vec (A_{c}) .

Finally, factoring out

d vec (A_{c})

on the right yields the Jacobian matrix:

\frac{\partial vec (B)}{\partial vec {(A_{c})}^{T}} = (B_{c}^{T} \otimes A_{c}^{- 1}) J_{A} - (B^{T} \otimes A_{c}^{- 1}) .

Appendix B.2 Second Moments

In the computation of the second moments such as the ones presented in eqs. (58), (63) and (67) the following identities have been used:

vec (X Y Z) = (Z^{T} \otimes X) vec (Y),

(80)

(X \otimes Y) (W \otimes Z) = (X W) \otimes (Y Z),

(81)

from which, assuming

Y = Z = I

, so

Y Z = I I = I

, it yields:

X W \otimes I = (X \otimes I) (W \otimes I) .

(82)

As an example, the computation of

E [δ A {\bar{x}}_{k} {\bar{x}}_{k}^{T} δ A^{T}]

in eq. (58), with the identities

{\bar{x}}_{k} = F_{x} u_{k}

and

Σ_{(A A)} = Σ_{δ A} = E [vec (δ A) vec {(δ A)}^{T}]

, goes as follows:

1.: $δ A {\bar{x}}_{k} = (x_{k}^{T} \otimes I_{n}) vec (δ A)$
2.: $\begin{matrix} δ A {\bar{x}}_{k} {\bar{x}}_{k}^{T} δ A^{T} & = (δ A {\bar{x}}_{k}) {(δ A {\bar{x}}_{k})}^{T} = [(x_{k}^{T} \otimes I_{n}) vec (δ A)] {[(x_{k}^{T} \otimes I_{n}) vec (δ A)]}^{T} \\ = ({\bar{x}}_{k}^{T} \otimes I_{n}) vec (δ A) vec {(δ A)}^{T} ({\bar{x}}_{k} \otimes I_{n}) \end{matrix}$
3.: $E [δ A {\bar{x}}_{k} {\bar{x}}_{k}^{T} δ A^{T}] = ({\bar{x}}_{k}^{T} \otimes I_{n}) E [vec (δ A) vec {(δ A)}^{T}] ({\bar{x}}_{k} \otimes I_{n}) = ({\bar{x}}_{k}^{T} \otimes I_{n}) Σ_{(A A)} ({\bar{x}}_{k} \otimes I_{n})$
4.: $\begin{matrix} {\bar{x}}_{k} \otimes I_{n} & = (F_{x} u_{k}) \otimes I_{n} = (F_{x} \otimes I_{n}) (u_{k} \otimes I_{n}) \\ {\bar{x}}_{k}^{T} \otimes I_{n} & = (u_{k}^{T} F_{x}^{T}) \otimes I_{n} = (u_{k}^{T} \otimes I_{n}) (F_{x} \otimes I_{n}) \end{matrix}$
5.: $({\bar{x}}_{k}^{T} \otimes I_{n}) Σ_{(A A)} ({\bar{x}}_{k} \otimes I_{n}) = (u_{k}^{T} \otimes I_{n}) (F_{x} \otimes I_{n}) Σ_{(A A)} (F_{x} \otimes I_{n}) (u_{k} \otimes I_{n})$

The other terms in the second moments can be calculated analogously.

Appendix B.3 Derivation of Eq. (uid73)

\begin{matrix} E [x_{k + 1}^{T} \otimes δ A] & = E [(x_{k}^{T} {(A + δ A)}^{T} + {(u_{k} + δ u_{k})}^{T} {(B + δ B)}^{T} + w_{k}^{*^{T}}) \otimes δ A] \\ = E [x_{k}^{T} A^{T} \otimes δ A] + E [x_{k}^{T} δ A^{T} \otimes δ A] + E [u_{k}^{T} δ B^{T} \otimes δ A] \\ \approx E [x_{k}^{T} A^{T} \otimes δ A] + E [{\bar{x}}_{k}^{T} δ A^{T} \otimes δ A] + E [u_{k}^{T} δ B^{T} \otimes δ A] \\ = E [x_{k}^{T} \otimes δ A] (A^{T} \otimes I_{n}) + ({\bar{x}}_{k}^{T} \otimes I_{n}) E [δ A^{T} \otimes δ A] + (u_{k}^{T} \otimes I_{n}) E [δ B^{T} \otimes δ A] \\ E [x_{k + 1}^{T} \otimes δ H] & = E [(x_{k}^{T} {(A + δ A)}^{T} + {(u_{k} + δ u_{k})}^{T} {(B + δ B)}^{T} + w_{k}^{*^{T}}) \otimes δ H] \\ = E [x_{k}^{T} A^{T} \otimes δ H] + E [x_{k}^{T} δ A^{T} \otimes δ H] + E [u_{k}^{T} δ B^{T} \otimes δ H] \\ \approx E [x_{k}^{T} A^{T} \otimes δ H] + E [{\bar{x}}_{k}^{T} δ A^{T} \otimes δ H] + E [u_{k}^{T} δ B^{T} \otimes δ H] \\ = E [x_{k}^{T} \otimes δ H] (A^{T} \otimes I_{n}) + ({\bar{x}}_{k}^{T} \otimes I_{n}) E [δ A^{T} \otimes δ H] + (u_{k}^{T} \otimes I_{n}) E [δ B^{T} \otimes δ H] \end{matrix}

(83)

The derivation of eq. (83) is a simple application of the property in eq. (81) once the terms higher than

O (δ θ^{2})

have been neglected; eq. (38) then comes from the definition of

Z_{k}

and

T_{k}

in eq. (37).

Appendix B.4 Asymptotic Behavior of error[δAδA ˜ k-1 ] and error[δHδA ˜ k-1 ]

lim_{k \to \infty} {\tilde{δ A}}_{k - 1} = lim_{k \to \infty} \sum_{j = 1}^{k - 1} A^{k - 1 - j} δ A A^{j}

(84)

Let A be Schur stable, i.e., its spectral radius satisfies

ρ (A) < 1

. Then there exist constants

C > 0

and

λ \in (0, 1)

such that for all integers

p \geq 0

,

∥ A^{p} ∥ \leq C λ^{p},

where

∥ \cdot ∥

denotes any submultiplicative matrix norm. For any

j = 1, \dots, k - 1

:

∥ A^{k - 1 - j} δ A A^{j} ∥ \leq ∥ A^{k - 1 - j} ∥ ∥ δ A ∥ ∥ A^{j} ∥ \leq C^{2} ∥ δ A ∥ λ^{k - 1 - j} λ^{j} = C^{2} ∥ δ A ∥ λ^{k - 1} .

Summing this bound over

j = 1

to

k - 1

gives:

∥ {\tilde{δ A}}_{k - 1} ∥ = ∥\sum_{j = 1}^{k - 1} A^{k - 1 - j} δ A A^{j}∥ \leq \sum_{j = 1}^{k - 1} ∥ A^{k - 1 - j} δ A A^{j} ∥ \leq (k - 1) C^{2} ∥ δ A ∥ λ^{k - 1} .

Since

λ^{k - 1}

decays exponentially, the factor

(k - 1) λ^{k - 1}

tends to zero as

k \to \infty

. Consequently,

lim_{k \to \infty} {\tilde{δ A}}_{k - 1} = 0 .

Appendix B.5 Proof of Eq. (uid82)

Only the result for

z_{\infty}

is going to be proved; the derivation of

t_{\infty}

is entirely analogous (replace

δ A

with

δ H

everywhere).

Starting from eq. (44) and dropping the transient term (which vanishes as

k \to \infty

, see the proof of

{lim}_{k \to \infty} {\tilde{δ A}}_{k - 1} = 0

in this appendix), for a constant input

u_{k} \equiv u

one has:

z_{\infty} \approx lim_{k \to \infty} E [δ A \sum_{j = 0}^{k - 1} A^{k - 1 - j} δ B] u + lim_{k \to \infty} E [δ A \sum_{j = 0}^{k - 1} {\tilde{δ A}}_{k - 2 - j}] B u .

(85)

Step 1 - linearity of expectation over a finite sum:

For every finite k both sums contain finitely many terms; the expectation of a finite sum of random matrices equals the sum of expectations. Hence, for any finite k:

E [δ A \sum_{j = 0}^{k - 1} A^{k - 1 - j} δ B] u = \sum_{j = 0}^{k - 1} E [δ A A^{k - 1 - j} δ B] u = \sum_{j = 0}^{k - 1} E [δ A A^{k - 1 - j} δ B] u,

(86)

where

A^{k - 1 - j}

is a deterministic matrix and has been factored out of the expectation (bilinearity). Substituting

ℓ = k - 1 - j

(so ℓ runs from 0 to

k - 1

as j runs from

k - 1

to 0) and using

ρ (A) < 1

so that

\sum_{ℓ = 0}^{\infty} A^{ℓ} = {(I_{n} - A)}^{- 1}

in the operator–norm sense:

lim_{k \to \infty} \sum_{j = 0}^{k - 1} E [δ A A^{k - 1 - j} δ B] u = E [δ A (\sum_{ℓ = 0}^{\infty} A^{ℓ}) δ B] u .

(87)

Justification of the limit–sum exchange : since

δ A

and

δ B

have finite second moments (assumption (3)) and

ρ (A) < 1

(assumption (14)), there exist

C > 0

,

λ \in (0, 1)

such that

∥ A^{ℓ} ∥ \leq C λ^{ℓ}

. Therefore:

∥E [δ A A^{ℓ} δ B]∥ \leq E [∥ δ A ∥ ∥ A^{ℓ} ∥ ∥ δ B ∥] \leq C λ^{ℓ} E [∥ δ A ∥ ∥ δ B ∥] .

Because

\sum_{ℓ = 0}^{\infty} C λ^{ℓ} < \infty

, by the dominated convergence argument for matrix series the limit and expectation commute, and the partial sum converges in norm to

E [δ A {(I_{n} - A)}^{- 1} δ B]

, yielding the first term of eq. (46):

lim_{k \to \infty} E [δ A \sum_{j = 0}^{k - 1} A^{k - 1 - j} δ B] u = E [δ A {(I_{n} - A)}^{- 1} δ B] u = E [δ A W δ B] u .

(88)

Step 2 - expansion of the double-sum term.

Expanding

{\tilde{δ A}}_{k - 2 - j}

by its definition (eq. (27)):

{\tilde{δ A}}_{k - 2 - j} = \sum_{i = 0}^{k - 2 - j} A^{k - 2 - j - i} δ A A^{i},

(89)

so the second term in eq. (85) becomes, again using linearity of expectation over a finite double sum:

E [δ A \sum_{j = 0}^{k - 1} {\tilde{δ A}}_{k - 2 - j}] B u = \sum_{j = 0}^{k - 1} \sum_{i = 0}^{k - 2 - j} E [δ A A^{k - 2 - j - i} δ A] A^{i} B u .

(90)

Justification of the double limit–sum exchange : setting

p = k - 2 - j - i

and

q = i

, the general term has norm bounded by

∥E [δ A A^{p} δ A]∥ ∥ A^{q} ∥ \leq C^{2} λ^{p + q} {E [∥ δ A ∥}^{2}] .

The sum

\sum_{p = 0}^{\infty} \sum_{q = 0}^{\infty} C^{2} λ^{p + q} {E [∥ δ A ∥}^{2}] = C^{2} {E [∥ δ A ∥}^{2} {] / (1 - λ)}^{2} < \infty

, so the double series converges absolutely in norm. Therefore the limit

k \to \infty

and both summation signs may be freely exchanged, giving:

lim_{k \to \infty} \sum_{j = 0}^{k - 1} \sum_{i = 0}^{k - 2 - j} E [δ A A^{p} δ A] A^{i} B u = (\sum_{p = 0}^{\infty} E [δ A A^{p} δ A]) (\sum_{i = 0}^{\infty} A^{i}) B u .

(91)

Note on the change of order : the two infinite sums decouple because the change of variables

(j, i) \mapsto (p, q)

with

p = k - 2 - j - i

,

q = i

maps the triangular region

{0 \leq i \leq k - 2 - j, 0 \leq j \leq k - 1}

(for fixed k) to the triangular region

{p + q \leq k - 2, p, q \geq 0}

, which exhausts all pairs

(p, q) \in N_{0}^{2}

as

k \to \infty

. Absolute convergence (established above) guarantees that the limiting double sum equals the product of the two individual geometric sums:

\sum_{p = 0}^{\infty} E [δ A A^{p} δ A] = E [δ A {(I_{n} - A)}^{- 1} δ A] = E [δ A W δ A], \sum_{i = 0}^{\infty} A^{i} = W,

(92)

where the same dominated-convergence argument as in Step 1 justifies moving

E

outside the sum

\sum_{p}

. Combining eqs. (91)–(92):

lim_{k \to \infty} E [δ A \sum_{j = 0}^{k - 1} {\tilde{δ A}}_{k - 2 - j}] B u = E [δ A W δ A] W B u .

(93)

Substituting eqs. (88) and (93) into eq. (85) gives:

z_{\infty} = E [δ A W δ B] u + E [δ A W δ A] W B u = E [δ A W δ B] u + E [δ A W δ A] F_{n o m} u .

(94)

The identical calculation with

δ A \to δ H

yields:

t_{\infty} = E [δ H W δ B] u + E [δ H W δ A] W B u = E [δ H W δ B] u + E [δ H W δ A] F_{n o m} u,

(95)

which completes the proof of eq. (46).

Appendix B.6 Definition of the r() Function

As discussed in Section 3.2 the starred

Σ

matrices introduced in eq. (55) can be calculated by simply rearranging the elements of the unstarred

Σ

ones as detailed in the following:

\begin{matrix} {[r (Σ_{(A A)})]}_{(p - 1) n + q, (r - 1) n + s} = {(Σ_{(A A)}^{*})}_{(p - 1) n + q, (r - 1) n + s} = {(Σ_{(A A)})}_{(p - 1) n + r, (s - 1) n + q} \\ p, q, r, s = 1, \dots, n \end{matrix}

(96)

\begin{matrix} {[r (Σ_{(A B)})]}_{(c - 1) n + i, (k - 1) n + j} = {(Σ_{(B A)}^{*})}_{(c - 1) n + i, (k - 1) n + j} = {(Σ_{(A B)})}_{(j - 1) n + i, (c - 1) n + k} \\ i, j, k = 1, \dots, n \\ c = 1, \dots, q \end{matrix}

(97)

\begin{matrix} {[r (Σ_{(A H)})]}_{(i - 1) m + r, (j - 1) n + c} = {(Σ_{(A H)}^{*})}_{(i - 1) m + r, (j - 1) n + c} = {(Σ_{(A H)})}_{(i - 1) n + j, (c - 1) m + r} \\ i, j, c = 1, \dots, n \\ r = 1, \dots, m \end{matrix}

(98)

\begin{matrix} {[r (Σ_{(B H)})]}_{(j - 1) m + r, (c - 1) n + i} = {(Σ_{(B H)}^{*})}_{(j - 1) m + r, (c - 1) n + i} = {(Σ_{(B H)})}_{(j - 1) n + c, (i - 1) m + r} \\ i, c = 1, \dots, n \\ r = 1, \dots, m \\ j = 1, \dots, q \end{matrix}

(99)

The above equations enable the computation of the starred matrices straightforwardly from the unstarred ones that have an explicit expression in terms of the Jacobians and the second moment

Σ_{δ θ}

.

Appendix B.7 Further Augmented State

Considering the augmented state described in eq. (39) and introducing an additional state

d_{k} = \bar{x} - {\bar{x}}^{n o m}

it is straightforward to write down the following system evolution:

\{\begin{matrix} {\bar{x}}_{k + 1} & \approx A {\bar{x}}_{k} + B u_{k} + z_{k} \\ Z_{k + 1} & \approx Z_{k} (A^{T} \otimes I_{n}) + (({\bar{x}}_{k} - d_{k}) \otimes I_{n}) E [δ A^{T} \otimes δ A] + (u_{k}^{T} \otimes I_{n}) E [δ B^{T} \otimes δ A] \\ z_{k} & = Z_{k} vec (I_{n}) \\ T_{k + 1} & \approx T_{k} (A^{T} \otimes I_{n}) + (({\bar{x}}_{k} - d_{k}) \otimes I_{m}) E [δ A^{T} \otimes δ H] + (u_{k}^{T} \otimes I_{m}) E [δ B^{T} \otimes δ H] \\ t_{k} & = T_{k} vec (I_{n}) \\ d_{k + 1} & = A d_{k} + z_{k} \\ {\bar{y}}_{k} & \approx H {\bar{x}}_{k} + t_{k} \end{matrix}

(100)

indeed

d_{k + 1} = {\bar{x}}_{k + 1} - {\bar{x}}_{k + 1}^{n o m} = A {\bar{x}}_{k} + B u_{k} + z_{k} - (A {\bar{x}}_{k}^{n o m} + B u_{k}) = A ({\bar{x}}_{k} - {\bar{x}}_{k}^{n o m}) + z_{k}

. As already mentioned the difference between

{\bar{x}}_{k}

and

{\bar{x}}_{k}^{n o m}

is at least

O (δ θ^{2})

and can be neglected when the additional computational burden is not worth it. The evolution of the fully augmented system is reported in the following:

\{\begin{matrix} {\bar{x}}_{k + 1} & \approx A {\bar{x}}_{k} + B u_{k} + z_{k} \\ Z_{k + 1} & \approx Z_{k} (A^{T} \otimes I_{n}) + ({\bar{x}}_{k} \otimes I_{n}) Σ_{(A A)}^{*} - (d_{k} \otimes I_{n}) Σ_{(A A)}^{*} + (u_{k}^{T} \otimes I_{n}) Σ_{(B A)}^{*} \\ z_{k} & = Z_{k} vec (I_{n}) \\ T_{k + 1} & \approx T_{k} (A^{T} \otimes I_{n}) + ({\bar{x}}_{k} \otimes I_{m}) Σ_{(A H)}^{*} - (d_{k} \otimes I_{m}) Σ_{(A H)}^{*} + (u_{k}^{T} \otimes I_{m}) Σ_{(B H)}^{*} \\ t_{k} & = T_{k} vec (I_{n}) \\ d_{k + 1} & = A d_{k} + z_{k} \\ {\bar{y}}_{k} & \approx H {\bar{x}}_{k} + t_{k} \end{matrix}

(101)

References

Kalman, R.E. A new approach to linear filtering and prediction problems. J. Basic Eng. 1960, 82, 35–45. [Google Scholar] [CrossRef]
Simon, D. Optimal State Estimation: Kalman, H Infinity, and Nonlinear Approaches; Wiley, 2006. [Google Scholar] [CrossRef]
Anderson, B.D.; Moore, J.B. Optimal filtering; Reprint of 1979 edition; Dover Publications: Mineola, NY; Prentice-Hall, 2012. [Google Scholar]
Franklin, G.F.; Powell, J.D.; Emami-Naeini, A. Feedback control of dynamic systems, 7th ed.; Pearson: Upper Saddle River, NJ, 2015. [Google Scholar]
De Koning, W. Optimal estimation of linear discrete-time systems with stochastic parameters. Automatica 1984, 20, 113–115. [Google Scholar] [CrossRef]
Xie, L.; Soh, Y.C.; de Souza, C. Robust Kalman filtering for uncertain discrete-time systems. IEEE Trans. Autom. Control 1994, 39, 1310–1314. [Google Scholar] [CrossRef]
Theodor, Y.; Shaked, U. Robust discrete-time minimum-variance filtering. IEEE Trans. Signal Process. 1996, 44, 181–189. [Google Scholar] [CrossRef]
Zhu, X.; Soh, Y.C.; Xie, L. Design and analysis of discrete-time robust Kalman filters. Automatica 2002, 38, 1069–1077. [Google Scholar] [CrossRef]
Luo, J.; Bosch, P. Performance Robustness of Kalman Filters for Uncertain Linear Discrete-Time Systems. 13th World Congress of IFAC IFAC Proceedings Volumes 1996, San Francisco USA, 30 June - 5 July; 1996; 29, pp. 3870–3875. [Google Scholar] [CrossRef]
Yu, X.; Xin, D.; Li, J. Robust Kalman Filter for Linear System With Convex Polytopic Uncertainties. IEEE Trans. Circuits Syst. II Express Briefs 2023, 70, 821–825. [Google Scholar] [CrossRef]
Xie, L.; Lu, L.; Zhang, D.; Zhang, H. Improved robust H₂ and H_∞ filtering for uncertain discrete-time systems. Automatica 2004, 40, 873–880. [Google Scholar] [CrossRef]
Rocha, K.D.; Terra, M.H. Robust Kalman filter for systems subject to parametric uncertainties. Syst. Control Lett. 2021, 157, 105034. [Google Scholar] [CrossRef]
Wong, F.S. First-order, second-moment methods. Comput. Struct. 1985, 20, 779–791. [Google Scholar] [CrossRef]
Robert E. Melchers, A.T.B. Structural Reliability Analysis and Prediction, 3rd ed.; Wiley, 2017. [CrossRef]
Magnus, J.; Neudecker, H. Matrix Differential Calculus with Applications in Statistics and Econometrics. In Wiley Series in Probability and Statistics; Wiley, 2019. [Google Scholar]
BIPM.; IEC.; IFCC.; ILAC.; ISO.; IUPAC.; IUPAP.; OIML. Evaluation of measurement data — Guide to the expression of uncertainty in measurement. Jt. Comm. Guid. Metrol. 2008, JCGM 100, 2008. [CrossRef]

1	As an example, the j-th component of ${\hat{z}}_{k}$ is ${\hat{z}}_{k}^{(j)} = \sum_{t = 1}^{n} {\hat{Z}}_{k}^{(j, (t - 1) n + t)}$ , i.e. the trace of the j-th $n \times n$ block of ${\hat{Z}}_{k}$ , which is computationally inexpensive.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Practical Steady-State Discrete-Time Kalman Filter Design for Uncertain LTI Systems

Abstract

Keywords:

Subject:

1. Introduction

1.1. State Estimation and the Filtering Problem

1.2. The Dual Role: Estimation and Filtering

State Estimation

Variance Minimization (Filtering)

1.3. Steady-State Kalman Filter

1.4. Model Uncertainty and Robustness

1.5. Scope and Contribution

2. Modeling and Discretization

2.1. First-Order Modelling of the Uncertainty

2.1.1. Type B Uncertainty Characterization of Component Tolerances

2.2. First-Order Statistical Modelling

2.3. Augmented Model

3. Steady-State Kalman Filter

3.1. Steady-State of the Augmented Model

3.2. Calculation of Q, R and M

3.2.1. Calculation of Q

3.2.2. Calculation of R

3.2.3. Calculation of M

3.3. Time Invariant Covariances

3.4. Filter Implementation

3.4.1. Predict Phase

3.4.2. Update/Correction Phase

3.5. Computational Considerations and Basic MVFOSM Implementation

4. Time-Varying Kalman Filter

4.1. Predict Phase

4.2. Update/Correction Phase

5. Conclusions

Appendix A: Summary of Assumptions

Appendix System Structure and Modelling

Appendix Noise Characteristics

Appendix Mathematical Approximations and Methodological Assumptions

Appendix Stability and Existence Conditions

Appendix Practical Implementation Assumptions

Appendix B: Some Mathematical Derivations

Appendix B.1 Derivation of J B Expression

Appendix B.2 Second Moments

Appendix B.3 Derivation of Eq. (uid73)

Appendix B.4 Asymptotic Behavior of error[δAδA ˜ k-1 ] and error[δHδA ˜ k-1 ]

Appendix B.5 Proof of Eq. (uid82)

Appendix B.6 Definition of the r() Function

Appendix B.7 Further Augmented State

References

MDPI Initiatives

Important Links

Subscribe