Linear Quadratic Pursuit–Evasion Games on Time Scales

Submitted: 22 September 2025. Posted: 23 September 2025.

Abstract
In this paper, we unify and extend linear quadratic pursuit-evasion games to dynamic equations on time scales. Here we seek a mixed strategy for a pair of linear players. We show that when the final states are fixed, these (open-loop) strategies can be written in terms of a zero-input state difference. On the other hand, when the final states are free, we find closed-loop strategies in terms of an extended state.

1. Introduction

The theory of deterministic pursuit-evasion games can single-handedly be attributed to Isaacs in the 1950s [1,2]. Here, Isaacs first considered differential games as two-player zero-sum games. One early application was the formulation of missile guidance systems during his time with the RAND Corporation. Shortly thereafter, Kalman, among others, initiated the linear quadratic regulator (LQR) and tracker (LQT) in the continuous and discrete cases (see [3,4,5,6]). Since then, the concepts of pursuit-evasion games and optimal control have been closely related, each playing a fundamental role in control engineering and economics. One breakout paper to combine these concepts was written by Ho, Bryson, and Baron. Together, they studied linear quadratic pursuit-evasion games (LQPEG) as regulator problems [7,8]. In particular, this work included a three-dimensional target interception problem. Since then, a number of papers have extended these results in the continuous and discrete cases. One of the issues that researchers have faced in the past is the discrete nature of these mixed strategies.
In 1988, Stefan Hilger initiated the theory of dynamic equations on time scales, which seeks to unify and extend discrete and continuous analysis [9]. As a result, we can generalize a process to account for both cases, or any combination of the two, provided we restrict ourselves to closed, nonempty subsets of the reals (a time scale). From a numerical viewpoint, this theory can be thought of as a generalized sampling technique that allows a researcher to evaluate processes with continuous, discrete, or uneven measurements. Since its inception, this area of mathematics has gained a great deal of international attention. Researchers have since found applications of time scales in heat transfer, population dynamics, as well as economics. For a more in-depth study of time scales, see Bohner and Peterson’s books [10,11].
There have been a number of researchers who have sought to combine this field with the theory of control. A number of authors have contributed to generalizing the basic notions of controllability and observability (see [12,13,14,15,16]). Bohner first provided the conditions for optimality for dynamic control processes in [17]. DaCunha unified Lyapunov stability and Floquet theory in his dissertation [18]. Hilscher, along with Zeidan, has studied optimal control for symplectic systems [19]. Additional contributions can be found in [20,21,22,23,24,25], among several others.
In this paper, we study a natural extension of the LQR and LQT previously generalized to dynamic equations on time scales (see [26,27]). Here, we consider the following separable dynamic systems
$$\begin{aligned} x_P^{\Delta}(t) &= A_P x_P(t) + B_P u(t), \quad & x_P(t_0) &= x_{0P},\\ x_E^{\Delta}(t) &= A_E x_E(t) + B_E v(t), \quad & x_E(t_0) &= x_{0E}, \end{aligned} \tag{1.1}$$
where $x_P, x_E \in \mathbb{R}^n$ represent our states and $u, v \in \mathbb{R}^m$ represent our controls. Note that the subscripts $P$ and $E$ stand for the pursuer and the evader, respectively. The pursuing state seeks to intercept the evading state at time $t_f$ while the latter state seeks to do the opposite. For simplicity, we make the following assumptions. First, we assume the given systems are linear time-invariant (although the strategies for the time-varying case can be determined in a similar fashion). Second, we assume that both states are controllable and are being evaluated on the same time scale. Finally, we assume our state equations are associated with the cost functional
$$\begin{aligned} J(u,v) &= \frac{1}{2}\,\|x_P - x_E\|_M^2(t_f) + \frac{1}{2}\int_{t_0}^{t_f} \left( \|x_P - x_E\|_Q^2 + \|u\|_{R_P}^2 - \|v\|_{R_E}^2 \right)(\tau)\,\Delta\tau \\ &= \frac{1}{2}(x_P - x_E)^T(t_f)\, M\, (x_P - x_E)(t_f) + \frac{1}{2}\int_{t_0}^{t_f} \left[ (x_P - x_E)^T Q (x_P - x_E) + u^T R_P u - v^T R_E v \right](\tau)\,\Delta\tau, \end{aligned} \tag{1.2}$$
where $M \ge 0$ is diagonal, $Q \ge 0$, and $R_P, R_E > 0$. Note that the goal of the pursuing state is to minimize (1.2) while the evading state seeks to maximize it. Since these states represent opposing players, evaluating this cost can be thought of as a minimax problem.
The pursuit-evasion framework remains an active area across multiple disciplines, as found in [28,29,30,31,32,33,34]. It should be noted that there have been other excursions in combining dynamic games with the time scales calculus. Libich and Stehlík introduced macroeconomic policy games on time scales with inefficient equilibria in [35]. Martins and Torres considered $n$-player games where each player sought to minimize a shared cost functional. Mozhegova and Petrov introduced a simple pursuit problem in [36] and a dynamic analogue of the “Cossacks-robbers” game in [37]. Minh and Phuong have previously studied linear pursuit-evasion games on time scales in [38]. However, these results do not include a regulator/saddle-point framework, nor are they as complete as those in this manuscript.
The organization of this paper is as follows. Section 2 presents core definitions and concepts of the time scales calculus. We offer the variational properties needed for an optimal strategy to exist in Section 3. In Section 4, we seek a mixed strategy when the final states are both fixed. In this setting, we can rewrite our cost functional (1.2) in terms of the difference in Gramians of each system. For Section 5, we find a pair of controls in terms of an extended state. In Section 6, we offer some examples including a numerical result. Finally, we provide some concluding remarks and future plans in Section 7.

2. Time Scales Preliminaries

Here we offer a brief introduction to the theory of dynamic equations on time scales. For a more in-depth study of time scales, see Bohner and Peterson’s books [10,11].
Definition 1.
A time scale $\mathbb{T}$ is an arbitrary nonempty closed subset of the real numbers. We let $\mathbb{T}^{\kappa} = \mathbb{T} \setminus \{\max \mathbb{T}\}$ if $\max \mathbb{T}$ exists; otherwise $\mathbb{T}^{\kappa} = \mathbb{T}$.
Example 2.
The most common examples of time scales are $\mathbb{T} = \mathbb{R}$, $\mathbb{T} = \mathbb{Z}$, $\mathbb{T} = h\mathbb{Z}$ for $h > 0$, and $\mathbb{T} = q^{\mathbb{N}_0}$ for $q > 1$.
Definition 3.
We define the forward jump operator $\sigma : \mathbb{T} \to \mathbb{T}$ and the graininess function $\mu : \mathbb{T} \to [0, \infty)$ by
$$\sigma(t) := \inf\{s \in \mathbb{T} : s > t\} \quad \text{and} \quad \mu(t) := \sigma(t) - t.$$
Definition 4.
For any function $f : \mathbb{T} \to \mathbb{R}$, we define the function $f^{\sigma} : \mathbb{T} \to \mathbb{R}$ by $f^{\sigma} = f \circ \sigma$.
Next, we define the delta (or Hilger) derivative as follows.
Definition 5.
Assume $f : \mathbb{T} \to \mathbb{R}$ and let $t \in \mathbb{T}^{\kappa}$. The delta derivative $f^{\Delta}(t)$ is the number (when it exists) such that given any $\varepsilon > 0$, there is a neighborhood $U$ of $t$ such that
$$\left| \left[ f(\sigma(t)) - f(s) \right] - f^{\Delta}(t)\left[ \sigma(t) - s \right] \right| \le \varepsilon \left| \sigma(t) - s \right| \quad \text{for all } s \in U.$$
In the next two theorems, we consider some properties of the delta derivative.
Theorem 6
(See Theorem 1.16 [10]). Suppose $f : \mathbb{T} \to \mathbb{R}$ is a function and let $t \in \mathbb{T}^{\kappa}$. Then we have the following:
a.
If f is differentiable at t, then f is continuous at t.
b.
If f is continuous at t, where t is right-scattered, then f is differentiable at t and
$$f^{\Delta}(t) = \frac{f(\sigma(t)) - f(t)}{\mu(t)}.$$
c.
If f is differentiable at t, where t is right-dense, then
$$f^{\Delta}(t) = \lim_{s \to t} \frac{f(t) - f(s)}{t - s}.$$
d.
If f is differentiable at t, then
$$f(\sigma(t)) = f(t) + \mu(t) f^{\Delta}(t). \tag{2.1}$$
Note that (2.1) is sometimes called the “simple useful formula.”
Example 7.
Note the following examples.
a.
When $\mathbb{T} = \mathbb{R}$, then (if the limit exists)
$$f^{\Delta}(t) = \lim_{s \to t} \frac{f(t) - f(s)}{t - s} = f'(t).$$
b.
When $\mathbb{T} = \mathbb{Z}$, then
$$f^{\Delta}(t) = f(t+1) - f(t) =: \Delta f(t).$$
c.
When $\mathbb{T} = h\mathbb{Z}$ for $h > 0$, then
$$f^{\Delta}(t) = \frac{f(t+h) - f(t)}{h} =: \Delta_h f(t).$$
d.
When $\mathbb{T} = q^{\mathbb{Z}}$ for $q > 1$, then
$$f^{\Delta}(t) = \frac{f(qt) - f(t)}{(q-1)t} =: D_q f(t).$$
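The following short numerical sketch (ours, not part of the original text) illustrates Theorem 6(b) and Example 7 on an isolated time scale: at right-scattered points, the delta derivative is just the difference quotient weighted by the graininess. The sample points below are hypothetical.

```python
import numpy as np

# Sketch (not from the paper): sigma, mu, and the delta derivative on a
# finite isolated time scale, using Theorem 6(b).
T = np.array([0.0, 0.5, 1.5, 2.0, 3.0])  # hypothetical isolated time scale

def sigma(i):
    # Forward jump operator: index of the next point (sigma(max T) = max T).
    return min(i + 1, len(T) - 1)

def mu(i):
    # Graininess mu(t) = sigma(t) - t.
    return T[sigma(i)] - T[i]

def delta_derivative(f, i):
    # f^Delta(t) = (f(sigma(t)) - f(t)) / mu(t) at a right-scattered point.
    return (f(T[sigma(i)]) - f(T[i])) / mu(i)

f = lambda t: t ** 2
# For f(t) = t^2, one checks f^Delta(t) = t + sigma(t) on any time scale.
print([delta_derivative(f, i) for i in range(len(T) - 1)])
```

On $\mathbb{T} = \mathbb{Z}$ or $\mathbb{T} = h\mathbb{Z}$ the same routine reproduces the forward differences of Example 7 (b) and (c).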
Next we consider the linearity property as well as the product rules.
Theorem 8
(See Theorem 1.20 [10]). Let $f, g : \mathbb{T} \to \mathbb{R}$ be differentiable at $t \in \mathbb{T}^{\kappa}$. Then we have the following:
a.
For any constants $\alpha$ and $\beta$, the sum $\alpha f + \beta g : \mathbb{T} \to \mathbb{R}$ is differentiable at t with
$$(\alpha f + \beta g)^{\Delta}(t) = \alpha f^{\Delta}(t) + \beta g^{\Delta}(t).$$
b.
The product $fg : \mathbb{T} \to \mathbb{R}$ is differentiable at t with
$$(fg)^{\Delta}(t) = f^{\Delta}(t)\, g(t) + f^{\sigma}(t)\, g^{\Delta}(t) = f(t)\, g^{\Delta}(t) + f^{\Delta}(t)\, g^{\sigma}(t).$$
Definition 9.
A function $f : \mathbb{T} \to \mathbb{R}$ is said to be rd-continuous on $\mathbb{T}$ when $f$ is continuous at points $t \in \mathbb{T}$ with $\sigma(t) = t$ and has finite left-sided limits at points $t \in \mathbb{T}$ with $\sup\{s \in \mathbb{T} : s < t\} = t$. The class of rd-continuous functions $f : \mathbb{T} \to \mathbb{R}$ is denoted by $C_{\mathrm{rd}} = C_{\mathrm{rd}}(\mathbb{T}) = C_{\mathrm{rd}}(\mathbb{T}, \mathbb{R})$. The set of functions $f : \mathbb{T} \to \mathbb{R}$ that are differentiable and whose derivative is rd-continuous is denoted by $C_{\mathrm{rd}}^1$.
Theorem 10
(See Theorem 1.74 [10]). Any rd-continuous function $f : \mathbb{T} \to \mathbb{R}$ has an antiderivative $F$, i.e., $F^{\Delta} = f$ on $\mathbb{T}^{\kappa}$.
Definition 11.
Let $f \in C_{\mathrm{rd}}$ and let $F$ be any function such that $F^{\Delta}(t) = f(t)$ for all $t \in \mathbb{T}^{\kappa}$. Then the Cauchy integral of f is defined by
$$\int_a^b f(t)\,\Delta t = F(b) - F(a) \quad \text{for all } a, b \in \mathbb{T}.$$
Example 12.
Let $a, b \in \mathbb{T}$ with $a < b$ and assume that $f \in C_{\mathrm{rd}}$.
a.
When $\mathbb{T} = \mathbb{R}$, then
$$\int_a^b f(t)\,\Delta t = \int_a^b f(t)\,dt.$$
b.
When $\mathbb{T} = \mathbb{Z}$, then
$$\int_a^b f(t)\,\Delta t = \sum_{t=a}^{b-1} f(t).$$
c.
When $\mathbb{T} = h\mathbb{Z}$ for $h > 0$, then
$$\int_a^b f(t)\,\Delta t = h \sum_{t=a/h}^{b/h - 1} f(th).$$
d.
When $\mathbb{T} = q^{\mathbb{N}_0}$ for $q > 1$, then
$$\int_a^b f(t)\,\Delta t = \int_a^b f(t)\,d_q t := (q-1) \sum_{t \in [a,b) \cap \mathbb{T}} t\, f(t).$$
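As a quick illustration (our sketch, not the paper's), on an isolated time scale the Cauchy integral of Definition 11 reduces to the weighted sum $\int_a^b f(t)\,\Delta t = \sum_{t \in [a,b) \cap \mathbb{T}} \mu(t) f(t)$, which recovers parts (b)-(d) of Example 12 for the standard choices of $\mathbb{T}$:

```python
import numpy as np

# Sketch (not from the paper): the delta integral over [T[0], T[-1]) as a
# weighted sum of left endpoints, with the graininess as the weight.
def delta_integral(f, T):
    T = np.asarray(T, dtype=float)
    mu = np.diff(T)              # graininess at each left endpoint
    return float(np.sum(mu * f(T[:-1])))

# T = Z on [0, 5): the sum f(0) + ... + f(4) of Example 12(b).
print(delta_integral(lambda t: t ** 2, np.arange(6)))           # 30
# T = hZ with h = 0.5 on [0, 5): h * sum f(th) of Example 12(c).
print(delta_integral(lambda t: t ** 2, np.arange(0, 5.5, 0.5)))
```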
Next, we present the matrix exponential and some of its properties.
Definition 13.
An $m \times n$ matrix-valued function $A$ on $\mathbb{T}$ is rd-continuous if each of its entries is rd-continuous. Furthermore, if $m = n$, $A$ is said to be regressive (we write $A \in \mathcal{R}$) if
$$I + \mu(t) A(t) \ \text{is invertible for all}\ t \in \mathbb{T}^{\kappa}.$$
Theorem 14
(See Theorem 5.8 [10]). Suppose that $A$ is regressive and rd-continuous. Then the initial value problem
$$X^{\Delta}(t) = A(t) X(t), \quad X(t_0) = I,$$
where I is the identity matrix, has a unique n × n matrix-valued solution X.
Definition 15.
The solution X from Theorem 14 is called the matrix exponential function on T and is denoted by e A ( · , t 0 ) .
Theorem 16
(See Theorem 5.21 [10]). Let $A$ be regressive and rd-continuous. Then for $r, s, t \in \mathbb{T}$,
a.
$e_A(t,s)\, e_A(s,r) = e_A(t,r)$, hence $e_A(t,t) = I$,
b.
$e_A(\sigma(t), s) = (I + \mu(t) A(t))\, e_A(t,s)$,
c.
$e_A(t, \sigma(s)) = e_A(t,s)\, (I + \mu(s) A(s))^{-1}$,
d.
$(e_A(\cdot, s))^{\Delta} = A\, e_A(\cdot, s)$,
e.
$(e_A(t, \cdot))^{\Delta}(s) = -e_A(t, \sigma(s))\, A(s) = -e_A(t, s)\,(I + \mu(s) A(s))^{-1} A(s)$.
Next we give the solution (state response) to our linear system using variation of parameters.
Theorem 17
(See Theorem 5.24 [10]). Let $A \in \mathcal{R}$ be an $n \times n$ matrix-valued function on $\mathbb{T}$ and suppose that $f : \mathbb{T} \to \mathbb{R}^n$ is rd-continuous. Let $t_0 \in \mathbb{T}$ and $x_0 \in \mathbb{R}^n$. Then the solution of the initial value problem
$$x^{\Delta}(t) = A(t) x(t) + f(t), \quad x(t_0) = x_0$$
is given by
$$x(t) = e_A(t, t_0)\, x_0 + \int_{t_0}^{t} e_A(t, \sigma(\tau))\, f(\tau)\,\Delta\tau.$$
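Before moving on, we note a computational consequence of Theorem 16(b) (the sketch below is ours): on an isolated time scale the matrix exponential is simply a product of the factors $I + \mu A$, and regressivity (Definition 13) is exactly what keeps every factor invertible.

```python
import numpy as np

# Sketch (not from the paper): e_A(t, t0) on an isolated time scale via the
# recursion e_A(sigma(t), t0) = (I + mu(t) A) e_A(t, t0) from Theorem 16(b).
def matrix_exponential(A, T, i0, i):
    # Returns e_A(T[i], T[i0]) for a constant matrix A and indices i >= i0.
    E = np.eye(A.shape[0])
    for k in range(i0, i):
        E = (np.eye(A.shape[0]) + (T[k + 1] - T[k]) * A) @ E
    return E

A = np.array([[0.0, 1.0], [-1.0, 0.0]])
T = np.array([0.0, 0.3, 0.7, 1.2, 2.0])   # hypothetical isolated points
print(matrix_exponential(A, T, 0, len(T) - 1))
```

The state response of Theorem 17 then follows by accumulating the forcing terms $e_A(t, \sigma(\tau)) f(\tau) \mu(\tau)$ along the same product.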

3. Optimization of Linear Systems on Time Scales

In this section, we make use of variational methods on time scales as introduced by Bohner in [17]. First, note that the state equations in (1.1) are uncoupled. For convenience, we rewrite (1.1) as
$$z^{\Delta}(t) = \hat{A} z(t) + \hat{B} u(t) + \hat{C} v(t), \quad z(t_0) = z_0, \tag{3.1}$$
where $z = \begin{pmatrix} x_P \\ x_E \end{pmatrix}$ represents an extended state, $\hat{A} = \begin{pmatrix} A_P & 0 \\ 0 & A_E \end{pmatrix}$, $\hat{B} = \begin{pmatrix} B_P \\ 0 \end{pmatrix}$, and $\hat{C} = \begin{pmatrix} 0 \\ B_E \end{pmatrix}$. Associated with (3.1) is the quadratic cost functional
$$J(u,v) = \frac{1}{2}\, z^T(t_f)\, \hat{M}\, z(t_f) + \frac{1}{2} \int_{t_0}^{t_f} \left( z^T \hat{Q} z + u^T R_P u - v^T R_E v \right)(\tau)\,\Delta\tau, \tag{3.2}$$
where $\hat{M}, \hat{Q} \ge 0$ and $R_P, R_E > 0$; in terms of the weights in (1.2), one may take $\hat{M} = \begin{pmatrix} M & -M \\ -M & M \end{pmatrix}$ and $\hat{Q} = \begin{pmatrix} Q & -Q \\ -Q & Q \end{pmatrix}$ so that (3.2) agrees with (1.2). To minimize (3.2), we introduce the augmented cost functional
$$J^{+}(u,v) = \frac{1}{2}\, z^T(t_f)\, \hat{M}\, z(t_f) + \int_{t_0}^{t_f} \left[ H(z, u, v, \lambda^{\sigma}) - (\lambda^{\sigma})^T z^{\Delta} \right](\tau)\,\Delta\tau,$$
where the so-called Hamiltonian $H$ is given by
$$H(z, u, v, \lambda) = \frac{1}{2} \left( z^T \hat{Q} z + u^T R_P u - v^T R_E v \right) + \lambda^T \left( \hat{A} z + \hat{B} u + \hat{C} v \right) \tag{3.3}$$
and $\lambda = \begin{pmatrix} \lambda_P \\ \lambda_E \end{pmatrix}$ represents a multiplier to be determined later.
Remark 18.
Our treatment of (1.1) differs from the argument used by Ho, Bryson, and Baron in [7]. In their paper, they appealed to state estimates of the pursuer and evader to evaluate the cost functional. Their motivation is due to the notion that, when they studied pursuing and evading missiles, they considered the difference in altitude to be negligible. As a result of our rewriting of (1.1), we are not required to make such a restriction.
Next, we provide necessary conditions for an optimal control. We assume that
$$\frac{d}{d\varepsilon} \int_{t_0}^{t_f} f(\tau, \varepsilon)\,\Delta\tau = \int_{t_0}^{t_f} \frac{\partial}{\partial \varepsilon} f(\tau, \varepsilon)\,\Delta\tau \tag{3.4}$$
for all $f : \mathbb{T} \times \mathbb{R} \to \mathbb{R}$ such that $f(\cdot, \varepsilon),\, \partial f(\cdot, \varepsilon)/\partial \varepsilon \in C_{\mathrm{rd}}(\mathbb{T})$.
Lemma 19.
Let (3.2) be the cost functional associated with (3.1). Assume (3.4) holds. Then the first variation, $\dot{\Phi}(0)$, is zero provided that $z$, $\lambda$, $u$, and $v$ satisfy
$$z^{\Delta} = \hat{A} z + \hat{B} u + \hat{C} v, \tag{3.5a}$$
$$\lambda^{\Delta} = -\left( \hat{Q} z + \hat{A}^T \lambda^{\sigma} \right), \tag{3.5b}$$
$$0 = R_P u + \hat{B}^T \lambda^{\sigma}, \tag{3.5c}$$
$$0 = -R_E v + \hat{C}^T \lambda^{\sigma}. \tag{3.5d}$$
Proof. 
First note that
$$\begin{aligned} \Phi(\varepsilon) &= J^{+}\big( (z, u, v, \lambda) + \varepsilon (\eta_1, \eta_2, \eta_3, \eta_4) \big) \\ &= \frac{1}{2} (z + \varepsilon \eta_1)^T(t_f)\, \hat{M}\, (z + \varepsilon \eta_1)(t_f) + \frac{1}{2} \int_{t_0}^{t_f} \left[ (z + \varepsilon \eta_1)^T \hat{Q} (z + \varepsilon \eta_1) \right](\tau)\,\Delta\tau \\ &\quad + \frac{1}{2} \int_{t_0}^{t_f} \left[ (u + \varepsilon \eta_2)^T R_P (u + \varepsilon \eta_2) \right](\tau)\,\Delta\tau - \frac{1}{2} \int_{t_0}^{t_f} \left[ (v + \varepsilon \eta_3)^T R_E (v + \varepsilon \eta_3) \right](\tau)\,\Delta\tau \\ &\quad + \int_{t_0}^{t_f} (\lambda^{\sigma} + \varepsilon \eta_4^{\sigma})^T \left[ \hat{A}(z + \varepsilon \eta_1) + \hat{B}(u + \varepsilon \eta_2) + \hat{C}(v + \varepsilon \eta_3) - (z + \varepsilon \eta_1)^{\Delta} \right](\tau)\,\Delta\tau. \end{aligned}$$
Then
$$\begin{aligned} \dot{\Phi}(\varepsilon) &= \eta_1^T(t_f)\, \hat{M}\, (z + \varepsilon \eta_1)(t_f) + \int_{t_0}^{t_f} \left[ \eta_1^T \hat{Q} (z + \varepsilon \eta_1) \right](\tau)\,\Delta\tau \\ &\quad + \int_{t_0}^{t_f} \left[ \eta_2^T R_P (u + \varepsilon \eta_2) - \eta_3^T R_E (v + \varepsilon \eta_3) \right](\tau)\,\Delta\tau \\ &\quad + \int_{t_0}^{t_f} (\eta_4^{\sigma})^T \left[ \hat{A}(z + \varepsilon \eta_1) + \hat{B}(u + \varepsilon \eta_2) + \hat{C}(v + \varepsilon \eta_3) - (z + \varepsilon \eta_1)^{\Delta} \right](\tau)\,\Delta\tau \\ &\quad + \int_{t_0}^{t_f} (\lambda^{\sigma} + \varepsilon \eta_4^{\sigma})^T \left[ \hat{A} \eta_1 + \hat{B} \eta_2 + \hat{C} \eta_3 - \eta_1^{\Delta} \right](\tau)\,\Delta\tau. \end{aligned}$$
Then after rearranging terms (and integrating by parts), the first variation can be written as
$$\begin{aligned} \dot{\Phi}(0) &= \left[ \hat{M} z(t_f) - \lambda(t_f) \right]^T \eta_1(t_f) + \lambda^T(t_0)\, \eta_1(t_0) \\ &\quad + \int_{t_0}^{t_f} \left[ \left( \hat{A}^T \lambda^{\sigma} + \hat{Q} z + \lambda^{\Delta} \right)^T \eta_1 + \left( R_P u + \hat{B}^T \lambda^{\sigma} \right)^T \eta_2 \right](\tau)\,\Delta\tau \\ &\quad + \int_{t_0}^{t_f} \left[ \left( -R_E v + \hat{C}^T \lambda^{\sigma} \right)^T \eta_3 + \left( \hat{A} z + \hat{B} u + \hat{C} v - z^{\Delta} \right)^T \eta_4^{\sigma} \right](\tau)\,\Delta\tau. \end{aligned}$$
Now in order for $\dot{\Phi}(0) = 0$, we set each coefficient of the independent increments $\eta_1, \eta_2, \eta_3, \eta_4^{\sigma}$ equal to zero. This yields the necessary conditions for a minimum of (3.2). Using the Hamiltonian (3.3), we have the state and costate equations
$$z^{\Delta} = H_{\lambda}(z, u, v, \lambda^{\sigma}) = \hat{A} z + \hat{B} u + \hat{C} v$$
and
$$\lambda^{\Delta} = -H_{z}(z, u, v, \lambda^{\sigma}) = -\left( \hat{Q} z + \hat{A}^T \lambda^{\sigma} \right).$$
Similarly, we have the stationary conditions
$$0 = H_u(z, u, v, \lambda^{\sigma}) = R_P u + \hat{B}^T \lambda^{\sigma}$$
and
$$0 = H_v(z, u, v, \lambda^{\sigma}) = -R_E v + \hat{C}^T \lambda^{\sigma}.$$
This concludes the proof. □
Remark 20.
We note that $z$, $\lambda$, $u$, and $v$ solve (3.5) if and only if they solve
$$z^{\Delta} = \hat{A} z - \hat{D} \lambda^{\sigma}, \tag{3.6a}$$
$$\lambda^{\Delta} = -\left( \hat{Q} z + \hat{A}^T \lambda^{\sigma} \right), \tag{3.6b}$$
$$u = -R_P^{-1} \hat{B}^T \lambda^{\sigma}, \tag{3.6c}$$
$$v = R_E^{-1} \hat{C}^T \lambda^{\sigma}, \tag{3.6d}$$
where $\hat{D}$ is a “mixing term” given by
$$\hat{D} := \hat{B} R_P^{-1} \hat{B}^T - \hat{C} R_E^{-1} \hat{C}^T.$$
Throughout this paper, we assume that $\hat{D}$ is regressive. As a result, we can determine an optimal strategy if we know the value of the costate.
Finally, we give the sufficient conditions for a local optimal control.
Lemma 21.
Let (3.2) be the cost functional associated with (3.1). Assume (3.4) holds. Then the second variation, $\ddot{\Phi}(0)$, is positive provided that $\eta_1$, $\eta_2$, and $\eta_3$ satisfy the constraint $\eta_1^{\Delta} = \hat{A} \eta_1 + \hat{B} \eta_2 + \hat{C} \eta_3$, where $\eta_2 \neq 0$ and $\eta_3$ is fixed.
Proof. 
Taking the second derivative of Φ ( ε ) , we have
$$\ddot{\Phi}(\varepsilon) = \eta_1^T(t_f)\, \hat{M}\, \eta_1(t_f) + \int_{t_0}^{t_f} \left[ \eta_1^T \hat{Q} \eta_1 + \eta_2^T R_P \eta_2 - \eta_3^T R_E \eta_3 \right](\tau)\,\Delta\tau + 2 \int_{t_0}^{t_f} \left[ \left( \hat{A} \eta_1 + \hat{B} \eta_2 + \hat{C} \eta_3 - \eta_1^{\Delta} \right)^T \eta_4^{\sigma} \right](\tau)\,\Delta\tau.$$
If we assume that $\eta_1$, $\eta_2$, and $\eta_3$ satisfy the constraint
$$\eta_1^{\Delta} = \hat{A} \eta_1 + \hat{B} \eta_2 + \hat{C} \eta_3,$$
then the second variation is given by
$$\ddot{\Phi}(0) = \eta_1^T(t_f)\, \hat{M}\, \eta_1(t_f) + \int_{t_0}^{t_f} \left[ \eta_1^T \hat{Q} \eta_1 + \eta_2^T R_P \eta_2 - \eta_3^T R_E \eta_3 \right](\tau)\,\Delta\tau. \tag{3.7}$$
Note that $\hat{M}, \hat{Q} \ge 0$ while $R_P, R_E > 0$. Thus if $\eta_2 \neq 0$ and $\eta_3$ is fixed (so that $\eta_3 \equiv 0$), then (3.7) is guaranteed to be positive. □
Definition 22.
The pair $(u^*, v^*)$ is a saddle point for the system (3.1) associated with the cost (3.2) provided
$$J(u, v^*) \ge J(u^*, v^*) \ge J(u^*, v).$$
Here, the stationary conditions needed to ensure a saddle are $H_{uu} = R_P > 0$ and $H_{vv} = -R_E < 0$ (see [39]). For our purposes, this pair corresponds to the case when neither player wishes to deviate from this compromise without being penalized by the other player. It should be understood that this compromise occurs under the natural caveat that the pursuer and evader belong to the same time scale. In this paper, we do not claim that this saddle point must be unique.

4. Fixed Final States Case

In this section, we seek an optimal strategy when the final states are fixed. In this setting, we write the equations for the pursuer and evader separately. Here we consider the state and costate equations for the pursuer
$$x_P^{\Delta}(t) = A_P x_P(t) - B_P R_P^{-1} B_P^T \lambda_P^{\sigma}(t), \quad x_P(t_0) = x_{0P}, \qquad \lambda_P^{\Delta}(t) = -A_P^T \lambda_P^{\sigma}(t), \quad \lambda_P(t_f) = M (x_P - x_E)(t_f), \tag{4.1}$$
as well as those for the evader
$$x_E^{\Delta}(t) = A_E x_E(t) + B_E R_E^{-1} B_E^T \lambda_E^{\sigma}(t), \quad x_E(t_0) = x_{0E}, \qquad \lambda_E^{\Delta}(t) = -A_E^T \lambda_E^{\sigma}(t), \quad \lambda_E(t_f) = M (x_E - x_P)(t_f), \tag{4.2}$$
associated with the cost functional
$$J(u,v) = \frac{1}{2} (x_P - x_E)^T(t_f)\, M\, (x_P - x_E)(t_f) + \frac{1}{2} \int_{t_0}^{t_f} \left[ (x_P - x_E)^T Q (x_P - x_E) + u^T R_P u - v^T R_E v \right](\tau)\,\Delta\tau. \tag{4.3}$$
Definition 23.
The initial state difference, $d_0(\cdot)$, is the difference between the zero-input pursuing and evading states, i.e.,
$$d_0(t) := e_{A_P}(t, t_0)\, x_P(t_0) - e_{A_E}(t, t_0)\, x_E(t_0).$$
Next, we determine an open–loop strategy for both players. Note that the following theorem mirrors Kalman’s generalized controllability criterion as found in Theorem 3.2 [16].
Theorem 24.
Suppose that $x_P$ and $\lambda_P$ solve (4.1) while $x_E$ and $\lambda_E$ satisfy (4.2). Let the Gramians for the pursuer and evader,
$$G_P(t_0, t_f) := \int_{t_0}^{t_f} e_{A_P}(t_f, \sigma(\tau))\, B_P R_P^{-1} B_P^T\, e_{A_P^T}(t_f, \sigma(\tau))\,\Delta\tau \tag{4.4}$$
and
$$G_E(t_0, t_f) := \int_{t_0}^{t_f} e_{A_E}(t_f, \sigma(\tau))\, B_E R_E^{-1} B_E^T\, e_{A_E^T}(t_f, \sigma(\tau))\,\Delta\tau, \tag{4.5}$$
respectively, be such that $I + (G_P - G_E)(t_0, t_f)\, M$ is invertible. Then $u$ and $v$ can be rewritten as
$$u(t) = -R_P^{-1} B_P^T\, e_{A_P^T}(t_f, \sigma(t))\, M \left[ I + (G_P - G_E)(t_0, t_f)\, M \right]^{-1} d_0(t_f) \tag{4.6}$$
and
$$v(t) = -R_E^{-1} B_E^T\, e_{A_E^T}(t_f, \sigma(t))\, M \left[ I + (G_P - G_E)(t_0, t_f)\, M \right]^{-1} d_0(t_f). \tag{4.7}$$
Proof. 
Solving (4.1) for $\lambda_P$, we have
$$\lambda_P(t) = e_{A_P^T}(t_f, t)\, \lambda_P(t_f) = e_{A_P^T}(t_f, t)\, M (x_P - x_E)(t_f).$$
Using (2.1) and (3.5a), the state equation becomes
$$x_P^{\Delta}(t) = A_P x_P(t) - B_P R_P^{-1} B_P^T\, e_{A_P^T}(t_f, \sigma(t))\, \lambda_P(t_f). \tag{4.8}$$
Now solving (4.8) with Theorem 17 at time $t = t_f$, we have
$$x_P(t_f) = e_{A_P}(t_f, t_0)\, x_P(t_0) - \int_{t_0}^{t_f} e_{A_P}(t_f, \sigma(\tau))\, B_P R_P^{-1} B_P^T\, e_{A_P^T}(t_f, \sigma(\tau))\,\Delta\tau\; \lambda_P(t_f) = e_{A_P}(t_f, t_0)\, x_P(t_0) - G_P(t_0, t_f)\, M (x_P - x_E)(t_f).$$
Similarly, the final state for the evader can be written as
$$x_E(t_f) = e_{A_E}(t_f, t_0)\, x_E(t_0) - G_E(t_0, t_f)\, M (x_P - x_E)(t_f).$$
Taking the difference of the final states and rearranging, we have
$$(x_P - x_E)(t_f) = d_0(t_f) - (G_P - G_E)(t_0, t_f)\, M (x_P - x_E)(t_f), \quad \text{so that} \quad (x_P - x_E)(t_f) = \left[ I + (G_P - G_E)(t_0, t_f)\, M \right]^{-1} d_0(t_f). \tag{4.9}$$
Finally, plugging $\lambda_P$ into (3.6c) and using (4.9) yields
$$u(t) = -R_P^{-1} B_P^T\, e_{A_P^T}(t_f, \sigma(t))\, \lambda_P(t_f) = -R_P^{-1} B_P^T\, e_{A_P^T}(t_f, \sigma(t))\, M (x_P - x_E)(t_f) = -R_P^{-1} B_P^T\, e_{A_P^T}(t_f, \sigma(t))\, M \left[ I + (G_P - G_E)(t_0, t_f)\, M \right]^{-1} d_0(t_f).$$
The equation for $v$ can be shown similarly. This concludes the proof. □
Next, we determine the optimal cost.
Theorem 25.
If $u$ and $v$ are given by (4.6) and (4.7), respectively, then the cost functional (4.3) can be rewritten as
$$J(u,v) = \frac{1}{2}\, d_0^T(t_f)\, H^T(t_0, t_f)\, M^T \left[ I + (G_P - G_E)(t_0, t_f)\, M \right] H(t_0, t_f)\, d_0(t_f),$$
where $H(t_0, t_f) := \left[ I + (G_P - G_E)(t_0, t_f)\, M \right]^{-1}$.
Proof. 
First, plugging (4.6), (4.7), and (4.9) into (4.3), we have
$$\begin{aligned} J(u,v) &= \frac{1}{2}\, d_0^T(t_f)\, H^T(t_0, t_f)\, M\, H(t_0, t_f)\, d_0(t_f) \\ &\quad + \frac{1}{2}\, d_0^T(t_f)\, H^T(t_0, t_f)\, M^T \left[ \int_{t_0}^{t_f} e_{A_P}(t_f, \sigma(\tau))\, B_P R_P^{-1} B_P^T\, e_{A_P^T}(t_f, \sigma(\tau))\,\Delta\tau \right] M\, H(t_0, t_f)\, d_0(t_f) \\ &\quad - \frac{1}{2}\, d_0^T(t_f)\, H^T(t_0, t_f)\, M^T \left[ \int_{t_0}^{t_f} e_{A_E}(t_f, \sigma(\tau))\, B_E R_E^{-1} B_E^T\, e_{A_E^T}(t_f, \sigma(\tau))\,\Delta\tau \right] M\, H(t_0, t_f)\, d_0(t_f) \\ &= \frac{1}{2}\, d_0^T(t_f)\, H^T(t_0, t_f)\, M\, H(t_0, t_f)\, d_0(t_f) + \frac{1}{2}\, d_0^T(t_f)\, H^T(t_0, t_f)\, M^T (G_P - G_E)(t_0, t_f)\, M\, H(t_0, t_f)\, d_0(t_f), \end{aligned}$$
using the Gramians (4.4) and (4.5). Since $M \ge 0$ is symmetric, we can pull out common factors on the left and right to obtain our result. □
Remark 26.
Suppose that the pursuer wants to use a strategy $u$ that intercepts the evader (using strategy $v$) with minimal energy. Note that $\det\left[ I + (G_P - G_E)(t_0, t_f) \right] \neq 0$ if and only if $\det\left[ (G_P - G_E)(t_0, t_f) \right] \neq 0$. From the classical definition of controllability, this implies that the pursuer captures the evader when the pursuer is “more controllable” than the evader. A sufficient condition for the pursuing state to intercept the evader is given by $(G_P - G_E)(t_0, t_f) > 0$. As a result, this relationship is preserved in the unification of pursuit-evasion to dynamic equations on time scales.
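To make Theorem 24 concrete, the following sketch (ours, with hypothetical data) evaluates the Gramians (4.4) and (4.5) as finite sums on an isolated time scale and recovers the final state difference via (4.9); the open-loop strategies (4.6) and (4.7) then follow pointwise.

```python
import numpy as np

# Sketch (not from the paper): Gramians and the final state difference (4.9)
# on an isolated time scale, where the delta integrals become finite sums.
def expm_ts(A, T, j, i):
    # e_A(T[j], T[i]) for j >= i, as a product of (I + mu A) factors.
    E = np.eye(A.shape[0])
    for k in range(i, j):
        E = (np.eye(A.shape[0]) + (T[k + 1] - T[k]) * A) @ E
    return E

def gramian(A, B, R, T):
    # G(t0, tf) = sum_tau mu(tau) e_A(tf, sigma(tau)) B R^{-1} B^T e_A^T(tf, sigma(tau)).
    n, nf = A.shape[0], len(T) - 1
    G = np.zeros((n, n))
    for k in range(nf):
        E = expm_ts(A, T, nf, k + 1)          # e_A(tf, sigma(tau_k))
        G += (T[k + 1] - T[k]) * E @ B @ np.linalg.solve(R, B.T) @ E.T
    return G

# Hypothetical data, only to exercise the formulas.
T = np.array([0.0, 0.4, 1.0, 1.7, 2.5])
A_P, B_P, R_P = np.array([[0.0, 1.0], [0.0, 0.0]]), np.array([[0.0], [1.0]]), np.eye(1)
A_E, B_E, R_E = np.zeros((2, 2)), np.array([[1.0], [0.0]]), 1.3 * np.eye(1)
M = np.eye(2)
nf = len(T) - 1
G = gramian(A_P, B_P, R_P, T) - gramian(A_E, B_E, R_E, T)     # (G_P - G_E)(t0, tf)
d0 = expm_ts(A_P, T, nf, 0) @ np.array([2.0, 1.0]) - expm_ts(A_E, T, nf, 0) @ np.array([1.0, 2.0])
print(np.linalg.solve(np.eye(2) + G @ M, d0))                 # (x_P - x_E)(t_f) by (4.9)
```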

5. Free Final States Case

In this section, we develop an optimal control law in the form of state feedback. In considering the boundary conditions, note that $z(t_0)$ is known (meaning $\eta_1(t_0) = 0$) while $z(t_f)$ is free (meaning $\eta_1(t_f) \neq 0$). Thus the coefficient on $\eta_1(t_f)$ must be zero. This gives the terminal condition on the costate to be
$$\lambda(t_f) = \hat{M} z(t_f). \tag{5.1}$$
Remark 27.
Now in order to solve this two-point boundary value problem, we make the assumption that $z$ and $\lambda$ satisfy
$$\lambda(t) = S(t)\, z(t) \tag{5.2}$$
for all $t \in [t_0, t_f]_{\mathbb{T}}$. Condition (5.2) is called a “sweep condition,” a term used by Bryson and Ho in [8]. Since the terminal condition satisfies $\hat{M} \ge 0$, it is natural to assume that $S \ge 0$ as well.
Theorem 28.
Assume that $S$ solves
$$S^{\Delta} = -\left[ \hat{Q} + \hat{A}^T S^{\sigma} + (I + \mu \hat{A}^T)\, S^{\sigma} (I + \mu \hat{D} S^{\sigma})^{-1} (\hat{A} - \hat{D} S^{\sigma}) \right]. \tag{5.3}$$
If $z$ satisfies
$$z^{\Delta} = (I + \mu \hat{D} S^{\sigma})^{-1} (\hat{A} - \hat{D} S^{\sigma})\, z \tag{5.4}$$
and $\lambda$ is given by (5.2), then
$$\lambda^{\Delta} = -\left( \hat{Q} z + \hat{A}^T \lambda^{\sigma} \right). \tag{5.5}$$
Proof. 
Since $\lambda$ is as given in (5.2), we may use the product rule, (5.3), (5.4), and (2.1) to arrive at
$$\lambda^{\Delta} = S^{\Delta} z + S^{\sigma} z^{\Delta} = -\left[ \hat{Q} z + \hat{A}^T S^{\sigma} z + (I + \mu \hat{A}^T) S^{\sigma} z^{\Delta} \right] + S^{\sigma} z^{\Delta} = -\left[ \hat{Q} z + \hat{A}^T S^{\sigma} z + \mu \hat{A}^T S^{\sigma} z^{\Delta} \right] = -\left( \hat{Q} z + \hat{A}^T S^{\sigma} z^{\sigma} \right) = -\left( \hat{Q} z + \hat{A}^T \lambda^{\sigma} \right),$$
which gives (5.5) as desired. □
Next we offer an alternative form of our Riccati equation.
Lemma 29.
If $\hat{D} S^{\sigma}$ is regressive, then $S$ solves (5.3) if and only if it solves
$$S^{\Delta} = -\left[ \hat{Q} + \hat{A}^T S^{\sigma} + (I + \mu \hat{A}^T)\, S^{\sigma} \hat{A} - (I + \mu \hat{A}^T)\, S^{\sigma} \hat{D} S^{\sigma} (I + \mu \hat{D} S^{\sigma})^{-1} (I + \mu \hat{A}) \right]. \tag{5.6}$$
Proof. 
Note that
$$\hat{A} - \hat{D} S^{\sigma} = \hat{A} - \hat{D} S^{\sigma} (I + \mu \hat{A} - \mu \hat{A}) = (I + \mu \hat{D} S^{\sigma})\, \hat{A} - \hat{D} S^{\sigma} (I + \mu \hat{A}).$$
Plugging the above identity into (5.3) yields (5.6). □
Next we define our Kalman gains as follows.
Definition 30.
Let $\hat{D} S^{\sigma}$ be regressive. Then the matrix-valued functions
$$K_P(t) = R_P^{-1} \hat{B}^T S^{\sigma}(t) \left( I + \mu(t) \hat{D} S^{\sigma}(t) \right)^{-1} \left( I + \mu(t) \hat{A} \right) \tag{5.7}$$
and
$$K_E(t) = R_E^{-1} \hat{C}^T S^{\sigma}(t) \left( I + \mu(t) \hat{D} S^{\sigma}(t) \right)^{-1} \left( I + \mu(t) \hat{A} \right) \tag{5.8}$$
are called the pursuer feedback gain and evader feedback gain, respectively.
Theorem 31.
Let $\hat{D} S^{\sigma}$ be regressive and suppose that $z$ and $\lambda$ solve (3.6) such that (5.2) holds. Then
$$\hat{B} u + \hat{C} v = -\hat{B} K_P z + \hat{C} K_E z. \tag{5.9}$$
Proof. 
Using (3.6), (5.2), and (2.1), we have
$$\hat{B} u + \hat{C} v = -\hat{B} R_P^{-1} \hat{B}^T \lambda^{\sigma} + \hat{C} R_E^{-1} \hat{C}^T \lambda^{\sigma} = -\hat{B} R_P^{-1} \hat{B}^T S^{\sigma} \left( z + \mu z^{\Delta} \right) + \hat{C} R_E^{-1} \hat{C}^T S^{\sigma} \left( z + \mu z^{\Delta} \right) = -\hat{D} S^{\sigma} \left[ (I + \mu \hat{A})\, z + \mu (\hat{B} u + \hat{C} v) \right].$$
Now combining like terms yields
$$(I + \mu \hat{D} S^{\sigma})(\hat{B} u + \hat{C} v) = -\hat{D} S^{\sigma} (I + \mu \hat{A})\, z.$$
Multiplying both sides by the inverse of $I + \mu \hat{D} S^{\sigma}$ and rearranging terms, we have
$$\hat{B} u + \hat{C} v = -(I + \mu \hat{D} S^{\sigma})^{-1} \hat{D} S^{\sigma} (I + \mu \hat{A})\, z = -\hat{D} S^{\sigma} (I + \mu \hat{D} S^{\sigma})^{-1} (I + \mu \hat{A})\, z = -\hat{B} R_P^{-1} \hat{B}^T S^{\sigma} (I + \mu \hat{D} S^{\sigma})^{-1} (I + \mu \hat{A})\, z + \hat{C} R_E^{-1} \hat{C}^T S^{\sigma} (I + \mu \hat{D} S^{\sigma})^{-1} (I + \mu \hat{A})\, z.$$
Finally, (5.9) follows using (5.7) and (5.8). □
Next we rewrite our extended state equation under the influence of the pursuit-evasion control laws. This yields the closed-loop plant given by
$$z^{\Delta}(t) = \left( \hat{A} - \hat{B} K_P(t) + \hat{C} K_E(t) \right) z(t), \tag{5.10}$$
which can be used to find an optimal trajectory for any given z ( t 0 ) .
Lemma 32.
If $\hat{D} S^{\sigma}$ is regressive and $S$ is symmetric, then
$$\begin{aligned} &(I + \mu \hat{A}^T)\, S^{\sigma} \hat{A} - (I + \mu \hat{A}^T)\, S^{\sigma} (I + \mu \hat{D} S^{\sigma})^{-1} \hat{D} S^{\sigma} (I + \mu \hat{A}) \\ &\qquad = \left( I + \mu (\hat{A} - \hat{B} K_P + \hat{C} K_E)^T \right) S^{\sigma} (\hat{A} - \hat{B} K_P + \hat{C} K_E) - K_P^T \hat{B}^T S^{\sigma} + K_E^T \hat{C}^T S^{\sigma} + K_P^T R_P K_P - K_E^T R_E K_E. \end{aligned} \tag{5.11}$$
Moreover, both sides of (5.11) are equal to $(I + \mu \hat{A}^T)\, S^{\sigma} (\hat{A} - \hat{B} K_P + \hat{C} K_E)$.
Proof. 
We can use (5.7) and (5.8) to rewrite the left-hand side of (5.11) as
$$(I + \mu \hat{A}^T)\, S^{\sigma} \hat{A} - (I + \mu \hat{A}^T)\, S^{\sigma} \hat{D} S^{\sigma} (I + \mu \hat{D} S^{\sigma})^{-1} (I + \mu \hat{A}) = (I + \mu \hat{A}^T)\, S^{\sigma} \hat{A} - (I + \mu \hat{A}^T)\, S^{\sigma} (\hat{B} K_P - \hat{C} K_E) = (I + \mu \hat{A}^T)\, S^{\sigma} (\hat{A} - \hat{B} K_P + \hat{C} K_E).$$
Using (5.7) and (5.8), the right-hand side of (5.11) can be written as
$$\begin{aligned} &\left( I + \mu \hat{A}^T \right) S^{\sigma} (\hat{A} - \hat{B} K_P + \hat{C} K_E) - K_P^T \hat{B}^T S^{\sigma} (I + \mu \hat{A}) + K_E^T \hat{C}^T S^{\sigma} (I + \mu \hat{A}) \\ &\qquad + \mu K_P^T \hat{B}^T S^{\sigma} (\hat{B} K_P - \hat{C} K_E) - \mu K_E^T \hat{C}^T S^{\sigma} (\hat{B} K_P - \hat{C} K_E) + K_P^T R_P K_P - K_E^T R_E K_E \\ &= \left( I + \mu \hat{A}^T \right) S^{\sigma} (\hat{A} - \hat{B} K_P + \hat{C} K_E) - K_P^T \hat{B}^T S^{\sigma} (I + \mu \hat{D} S^{\sigma})^{-1} (I + \mu \hat{A}) + K_E^T \hat{C}^T S^{\sigma} (I + \mu \hat{D} S^{\sigma})^{-1} (I + \mu \hat{A}) + K_P^T R_P K_P - K_E^T R_E K_E \\ &= \left( I + \mu \hat{A}^T \right) S^{\sigma} (\hat{A} - \hat{B} K_P + \hat{C} K_E). \end{aligned}$$
Thus, (5.11) holds. □
Now we rewrite the Riccati equation (5.6) in so-called (generalized) Joseph stabilized form (see [39]).
Theorem 33.
If $\hat{D} S^{\sigma}$ is regressive and $S$ is symmetric, then $S$ solves the Riccati equation (5.6) if and only if it solves
$$S^{\Delta} = -\left[ \hat{Q} + (\hat{A} - \hat{B} K_P + \hat{C} K_E)^T S^{\sigma} + \left( I + \mu (\hat{A} - \hat{B} K_P + \hat{C} K_E)^T \right) S^{\sigma} (\hat{A} - \hat{B} K_P + \hat{C} K_E) + K_P^T R_P K_P - K_E^T R_E K_E \right]. \tag{5.12}$$
Proof. 
The statement follows directly from Lemma 32. □
Finally, we rewrite the cost.
Theorem 34.
Suppose that $S$ solves (5.12). If $z$ satisfies (5.10) and $u$ and $v$ satisfy (5.9), then the cost functional (3.2) can be rewritten as
$$J(u,v) = \frac{1}{2}\, z^T(t_0)\, S(t_0)\, z(t_0). \tag{5.13}$$
Proof. 
First note that we may use the product rule, (2.1), and (5.10) to find
$$\begin{aligned} (z^T S z)^{\Delta} &= (z^T S)^{\Delta} z + (z^T S)^{\sigma} z^{\Delta} = (z^{\Delta})^T S^{\sigma} z + z^T S^{\Delta} z + (z + \mu z^{\Delta})^T S^{\sigma} z^{\Delta} \\ &= z^T \left[ (\hat{A} - \hat{B} K_P + \hat{C} K_E)^T S^{\sigma} + S^{\Delta} \right] z + z^T \left[ I + \mu (\hat{A} - \hat{B} K_P + \hat{C} K_E) \right]^T S^{\sigma} (\hat{A} - \hat{B} K_P + \hat{C} K_E)\, z. \end{aligned} \tag{5.14}$$
Using this and (5.9) in (3.2), we have
$$\begin{aligned} J(u,v) &= \frac{1}{2}\, z^T(t_0)\, S(t_0)\, z(t_0) + \frac{1}{2} \int_{t_0}^{t_f} (z^T S z)^{\Delta}(\tau)\,\Delta\tau + \frac{1}{2} \int_{t_0}^{t_f} \left[ z^T \hat{Q} z + u^T R_P u - v^T R_E v \right](\tau)\,\Delta\tau \\ &= \frac{1}{2}\, z^T(t_0)\, S(t_0)\, z(t_0) + \frac{1}{2} \int_{t_0}^{t_f} (z^T S z)^{\Delta}(\tau)\,\Delta\tau + \frac{1}{2} \int_{t_0}^{t_f} z^T \left[ \hat{Q} + K_P^T R_P K_P - K_E^T R_E K_E \right] z\, (\tau)\,\Delta\tau. \end{aligned}$$
Using (5.14) and (5.12), the two integrals cancel and the cost functional can be rewritten as
$$J(u,v) = \frac{1}{2}\, z^T(t_0)\, S(t_0)\, z(t_0).$$
This concludes the proof. □
From Theorem 34, if the current state and S are known, we can determine the optimal cost before we apply the optimal control or even calculate it. The table below summarizes our results.
Table 5.1. The LQPEG on $\mathbb{T}$

System: $z^{\Delta} = \hat{A} z + \hat{B} u + \hat{C} v$
Cost: $J(u,v) = \frac{1}{2}\, z^T(t_f)\, \hat{M}\, z(t_f) + \frac{1}{2} \int_{t_0}^{t_f} \left( z^T \hat{Q} z + u^T R_P u - v^T R_E v \right)(\tau)\,\Delta\tau$
Mixing Term: $\hat{D} = \hat{B} R_P^{-1} \hat{B}^T - \hat{C} R_E^{-1} \hat{C}^T$
Pursuer Feedback: $K_P = R_P^{-1} \hat{B}^T S^{\sigma} (I + \mu \hat{D} S^{\sigma})^{-1} (I + \mu \hat{A})$
Evader Feedback: $K_E = R_E^{-1} \hat{C}^T S^{\sigma} (I + \mu \hat{D} S^{\sigma})^{-1} (I + \mu \hat{A})$
Riccati Equation: $S^{\Delta} = -\left[ \hat{Q} + \hat{A}^T S^{\sigma} + (I + \mu \hat{A}^T)\, S^{\sigma} \hat{A} - (I + \mu \hat{A}^T)\, S^{\sigma} \hat{D} S^{\sigma} (I + \mu \hat{D} S^{\sigma})^{-1} (I + \mu \hat{A}) \right]$
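The table's recursion is straightforward to implement. The sketch below (ours, not code from the paper) sweeps the Riccati equation (5.3) backward from $S(t_f) = \hat{M}$ on an isolated time scale, where $S^{\Delta} = (S^{\sigma} - S)/\mu$, and then forms the feedback gains (5.7) and (5.8).

```python
import numpy as np

# Sketch (not from the paper): backward Riccati sweep and feedback gains on an
# isolated time scale. On such points S^Delta = (S^sigma - S)/mu, so (5.3)
# becomes S(t) = S^sigma - mu * S^Delta with S(tf) = M_hat.
def solve_riccati(Ah, Dh, Qh, Mh, T):
    n = Ah.shape[0]
    S = [None] * len(T)
    S[-1] = Mh
    for k in range(len(T) - 2, -1, -1):
        mu, Ss = T[k + 1] - T[k], S[k + 1]
        W = np.linalg.solve(np.eye(n) + mu * Dh @ Ss, Ah - Dh @ Ss)      # (5.4) factor
        S_delta = -(Qh + Ah.T @ Ss + (np.eye(n) + mu * Ah.T) @ Ss @ W)   # (5.3)
        S[k] = Ss - mu * S_delta
    return S

def gains(Ah, Bh, Ch, RP, RE, Dh, S, T, k):
    # Pursuer and evader gains (5.7)-(5.8) at t_k.
    n, mu, Ss = Ah.shape[0], T[k + 1] - T[k], S[k + 1]
    core = Ss @ np.linalg.solve(np.eye(n) + mu * Dh @ Ss, np.eye(n) + mu * Ah)
    return np.linalg.solve(RP, Bh.T @ core), np.linalg.solve(RE, Ch.T @ core)
```

Given $S$, the closed-loop plant (5.10) propagates any initial $z(t_0)$ forward, and Theorem 34 returns the optimal cost $\frac{1}{2} z^T(t_0) S(t_0) z(t_0)$ before any control is applied.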

6. Examples

Example 35. (The Continuous LQPEG) Let $\mathbb{T} = \mathbb{R}$ and consider
$$z'(t) = \hat{A} z(t) + \hat{B} u(t) + \hat{C} v(t),$$
associated with the cost functional
$$J(u,v) = \frac{1}{2}\, z^T(t_f)\, \hat{M}\, z(t_f) + \frac{1}{2} \int_{t_0}^{t_f} \left[ z^T \hat{Q} z + u^T R_P u - v^T R_E v \right](\tau)\,d\tau$$
(observe part (a) of Examples 7 and 12). Then the state, costate, and stationary equations (3.6) are given by
$$z' = \hat{A} z - \hat{D} \lambda, \quad \lambda' = -\left( \hat{A}^T \lambda + \hat{Q} z \right), \quad u = -R_P^{-1} \hat{B}^T \lambda, \quad v = R_E^{-1} \hat{C}^T \lambda.$$
In this case, our pursuer and evader feedback gains (5.7) and (5.8) are given as
$$K_P(t) = R_P^{-1} \hat{B}^T S(t) \quad \text{and} \quad K_E(t) = R_E^{-1} \hat{C}^T S(t).$$
Now the pursuit-evasion law (5.9) and the closed-loop plant (5.10) can be written as
$$\hat{B} u(t) + \hat{C} v(t) = -\hat{B} K_P(t)\, z(t) + \hat{C} K_E(t)\, z(t)$$
and
$$z' = (\hat{A} - \hat{B} K_P + \hat{C} K_E)\, z.$$
Similarly, the closed-loop Riccati equation (5.12) can be written as
$$S' = -\left[ \hat{Q} + S (\hat{A} - \hat{B} K_P + \hat{C} K_E) + (\hat{A} - \hat{B} K_P + \hat{C} K_E)^T S + K_P^T R_P K_P - K_E^T R_E K_E \right]$$
while the optimal cost is given by (5.13).
Example 36. (The h-difference LQPEG) Let $\mathbb{T} = h\mathbb{Z}$ and consider
$$\Delta_h z(t) = \hat{A} z(t) + \hat{B} u(t) + \hat{C} v(t).$$
By observing Example 7 (c) and introducing
$$\tilde{A} = I + h \hat{A}, \quad \tilde{B} = h \hat{B}, \quad \tilde{C} = h \hat{C}, \quad \tilde{Q} = h \hat{Q}, \quad \tilde{R}_i = h R_i, \quad \tilde{D} = h \hat{D},$$
we can rewrite the system as
$$z(t+h) = \tilde{A} z(t) + \tilde{B} u(t) + \tilde{C} v(t),$$
and the associated cost functional takes the form (observe Example 12 (c))
$$J(u,v) = \frac{1}{2}\, z^T(t_f)\, \hat{M}\, z(t_f) + \frac{1}{2} \sum_{\tau = t_0/h}^{t_f/h - 1} \left[ z^T \tilde{Q} z + u^T \tilde{R}_P u - v^T \tilde{R}_E v \right](\tau h).$$
Then the state, costate, and stationary equations (3.6) are given by
$$z(t+h) = \tilde{A} z(t) - \tilde{D} \lambda(t+h), \quad \lambda(t) = \tilde{A}^T \lambda(t+h) + \tilde{Q} z(t), \quad u(t) = -\tilde{R}_P^{-1} \tilde{B}^T \lambda(t+h), \quad v(t) = \tilde{R}_E^{-1} \tilde{C}^T \lambda(t+h).$$
Now our pursuer and evader feedback gains (5.7) and (5.8) are
$$K_P(t) = \tilde{R}_P^{-1} \tilde{B}^T S(t+h) \left( I + \tilde{D} S(t+h) \right)^{-1} \tilde{A}$$
and
$$K_E(t) = \tilde{R}_E^{-1} \tilde{C}^T S(t+h) \left( I + \tilde{D} S(t+h) \right)^{-1} \tilde{A}.$$
Next, the pursuit-evasion law (5.9) and the closed-loop plant (5.10) can be written as
$$\tilde{B} u(t) + \tilde{C} v(t) = -\tilde{B} K_P(t)\, z(t) + \tilde{C} K_E(t)\, z(t)$$
and
$$z(t+h) = \left( \tilde{A} - \tilde{B} K_P(t) + \tilde{C} K_E(t) \right) z(t),$$
respectively. Similarly, the closed-loop Riccati equation (5.12) can be written as
$$S(t) = \tilde{Q} + \left( \tilde{A} - \tilde{B} K_P(t) + \tilde{C} K_E(t) \right)^T S(t+h) \left( \tilde{A} - \tilde{B} K_P(t) + \tilde{C} K_E(t) \right) + K_P^T(t)\, \tilde{R}_P K_P(t) - K_E^T(t)\, \tilde{R}_E K_E(t)$$
while the optimal cost is given by (5.13).
Example 37. (The q-difference LQPEG) Let $\mathbb{T} = q^{\mathbb{N}_0}$ with $q > 1$ and consider
$$D_q z(t) = \hat{A} z(t) + \hat{B} u(t) + \hat{C} v(t).$$
By observing Example 7 (d) and introducing
$$\tilde{A}(t) = I + (q-1)t \hat{A}, \quad \tilde{B}(t) = (q-1)t \hat{B}, \quad \tilde{C}(t) = (q-1)t \hat{C}, \quad \tilde{Q}(t) = (q-1)t \hat{Q}, \quad \tilde{R}_i(t) = (q-1)t R_i, \quad \tilde{D}(t) = (q-1)t \hat{D},$$
we can rewrite the system as
$$z(qt) = \tilde{A}(t)\, z(t) + \tilde{B}(t)\, u(t) + \tilde{C}(t)\, v(t),$$
while the associated cost functional becomes (observe Example 12 (d))
$$J(u,v) = \frac{1}{2}\, z^T(t_f)\, \hat{M}\, z(t_f) + \frac{1}{2} \sum_{\tau \in [t_0, t_f) \cap \mathbb{T}} \left[ z^T \tilde{Q} z + u^T \tilde{R}_P u - v^T \tilde{R}_E v \right](\tau).$$
Then the state, costate, and stationary equations (3.6) are given by
$$z(qt) = \tilde{A}(t)\, z(t) - \tilde{D}(t)\, \lambda(qt), \quad \lambda(t) = \tilde{A}^T(t)\, \lambda(qt) + \tilde{Q}(t)\, z(t), \quad u(t) = -\tilde{R}_P^{-1}(t)\, \tilde{B}^T(t)\, \lambda(qt), \quad v(t) = \tilde{R}_E^{-1}(t)\, \tilde{C}^T(t)\, \lambda(qt).$$
In this case, our pursuer and evader feedback gains (5.7) and (5.8) are
$$K_P(t) = \tilde{R}_P^{-1}(t)\, \tilde{B}^T(t)\, S(qt) \left( I + \tilde{D}(t)\, S(qt) \right)^{-1} \tilde{A}(t)$$
and
$$K_E(t) = \tilde{R}_E^{-1}(t)\, \tilde{C}^T(t)\, S(qt) \left( I + \tilde{D}(t)\, S(qt) \right)^{-1} \tilde{A}(t).$$
Now the pursuit-evasion law (5.9) and the closed-loop plant (5.10) can be written as
$$\tilde{B}(t)\, u(t) + \tilde{C}(t)\, v(t) = -\tilde{B}(t)\, K_P(t)\, z(t) + \tilde{C}(t)\, K_E(t)\, z(t)$$
and
$$z(qt) = \left( \tilde{A}(t) - \tilde{B}(t)\, K_P(t) + \tilde{C}(t)\, K_E(t) \right) z(t),$$
respectively. Finally, the closed-loop Riccati equation (5.12) can be written as
$$S(t) = \tilde{Q}(t) + K_P^T(t)\, \tilde{R}_P(t)\, K_P(t) - K_E^T(t)\, \tilde{R}_E(t)\, K_E(t) + \left( \tilde{A}(t) - \tilde{B}(t)\, K_P(t) + \tilde{C}(t)\, K_E(t) \right)^T S(qt) \left( \tilde{A}(t) - \tilde{B}(t)\, K_P(t) + \tilde{C}(t)\, K_E(t) \right)$$
while the optimal cost is given by (5.13).
Example 38.
In this last example, we provide a numerical illustration of the LQPEG. In this setting, we sample a two-dimensional pursuer and evader on the same discrete, but uneven, time scale
T = { 0 , 0.03 , 0.29 , 1.23 , 1.49 , 1.94 , 2.11 , 2.51 , 2.77 , 3.78 , 3.87 , 4.15 , 4.78 , 4.81 , 4.89 , 4.91 , 5.49 , 5.62 , 5.71 , 6.15 , 6.72 , 7.2 , 7.4 , 7.48 , 7.59 , 7.66 , 7.68 , 8.37 , 8.55 , 8.87 , 8.96 , 9.4 , 9.44 , 9.73 , 10 } .
Next, we consider the theoretical linear dynamic system
$$x_P^{\Delta}(t) = \begin{pmatrix} 2 & 0 \\ 0 & 1 \end{pmatrix} x_P(t) + \begin{pmatrix} 1 \\ 3 \end{pmatrix} u(t), \quad x_P(0) = \begin{pmatrix} 2 \\ 1 \end{pmatrix}, \qquad x_E^{\Delta}(t) = \begin{pmatrix} 3 & 1 \\ 1 & 1 \end{pmatrix} x_E(t) + \begin{pmatrix} 2 \\ 2 \end{pmatrix} v(t), \quad x_E(0) = \begin{pmatrix} 1 \\ 2 \end{pmatrix}.$$
Note that the first component of each player represents its position while the second corresponds to its velocity. For simplicity, only the position is observed. Here, we set the weights in (3.2) to be $R_P = 1$, $R_E = 1.3$, and $Q = S(t_f) = I_4$. The plots for the pursuer's and evader's positions are given in Figure 6.1 below, and a sketch of how such a simulation can be assembled follows.
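The sketch below is our reconstruction, not the authors' code (they used the time scales Python package of Baur and Cuchta; see the Acknowledgments), and it assumes the system matrices exactly as displayed above.

```python
import numpy as np

# Sketch (our reconstruction of Example 38): backward Riccati sweep on the
# isolated time scale T, then the closed-loop plant (5.10) run forward.
T = np.array([0, 0.03, 0.29, 1.23, 1.49, 1.94, 2.11, 2.51, 2.77, 3.78, 3.87,
              4.15, 4.78, 4.81, 4.89, 4.91, 5.49, 5.62, 5.71, 6.15, 6.72, 7.2,
              7.4, 7.48, 7.59, 7.66, 7.68, 8.37, 8.55, 8.87, 8.96, 9.4, 9.44,
              9.73, 10])
A_P, B_P = np.array([[2.0, 0.0], [0.0, 1.0]]), np.array([[1.0], [3.0]])
A_E, B_E = np.array([[3.0, 1.0], [1.0, 1.0]]), np.array([[2.0], [2.0]])
R_P, R_E = np.eye(1), 1.3 * np.eye(1)
Ah = np.block([[A_P, np.zeros((2, 2))], [np.zeros((2, 2)), A_E]])
Bh = np.vstack([B_P, np.zeros((2, 1))])
Ch = np.vstack([np.zeros((2, 1)), B_E])
Dh = Bh @ np.linalg.solve(R_P, Bh.T) - Ch @ np.linalg.solve(R_E, Ch.T)  # mixing term
Qh = np.eye(4)                            # Q = I_4 as in the example
I4 = np.eye(4)

S = [None] * len(T)
S[-1] = np.eye(4)                         # S(tf) = I_4
for k in range(len(T) - 2, -1, -1):       # backward Riccati sweep, cf. (5.3)
    mu, Ss = T[k + 1] - T[k], S[k + 1]
    W = np.linalg.solve(I4 + mu * Dh @ Ss, Ah - Dh @ Ss)
    S[k] = Ss + mu * (Qh + Ah.T @ Ss + (I4 + mu * Ah.T) @ Ss @ W)

z = np.array([2.0, 1.0, 1.0, 2.0])        # stacked state z = (x_P; x_E)
positions = [(z[0], z[2])]
for k in range(len(T) - 1):               # closed-loop plant (5.10) forward
    mu = T[k + 1] - T[k]
    z = z + mu * np.linalg.solve(I4 + mu * Dh @ S[k + 1], Ah - Dh @ S[k + 1]) @ z
    positions.append((z[0], z[2]))
# positions[k] holds the pursuer's and evader's positions at T[k] (Figure 6.1).
```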

7. Concluding Remarks and Future Work

In this project, we have established the LQPEG where the pursuer and evader belong to the same arbitrary time scale $\mathbb{T}$. One potential application of this work is when the evader is represented by a drone and the pursuer represents a missile guidance system, where their corresponding signals are unevenly sampled. Here, the cost in part represents the wear and tear on the drone. A saddle point in this setting would represent a “live and let live” arrangement, where the drone is allowed to spy briefly on the missile guidance system and return home, but is not given the opportunity to preserve enough of its battery to outstay its welcome. Similarly, in finance the pursuer and evader can represent competing companies, where a saddle point would correspond to an effort to coexist, in which a hostile takeover or unnecessarily expended resources can be avoided. We have sidestepped the setting where the pursuer and evader each belong to their own time scale $\mathbb{T}_P$ and $\mathbb{T}_E$, respectively. However, these time scales can be merged using a sample-and-hold method as found in [40,41].
One potential extension of this work is the introduction of additional pursuers. In this setting, the cost must be adjusted to account for the closest pursuer, which can vary over the time scale. A second potential extension is to consider the setting where one player is subject to a delay. Here, both players can still belong to the same time scale. However, this allows one player to act after the other, perhaps with some knowledge of the opposing player's strategy. Finally, a third possible approach is to study such games in a stochastic setting. Here, we can discretize each player's stochastic linear time-invariant system to a dynamic system on an isolated time scale, as found in [40,42]. However, the usual separability property is not preserved in this setting.

Author Contributions

D. Funk and R. Williams contributed to the analysis and writing/editing of the manuscript as well as the numerical example. N. Wintz contributed to the project conceptualization/analysis, writing/editing, and the funding of the project. All authors have read and agreed to the published version of the manuscript.

Funding

This project was partially supported by the National Science Foundation grant DMS-2150226, the NASA West Virginia Space Grant Consortium, training grant #80NSSC20M0055, and the NASA Missouri Space Grant Consortium grant #80NSSC20M0100.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank Matthias Baur and Tom Cuchta for the use of their time scales Python package in producing the last example.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
LQR linear quadratic regulator
LQT linear quadratic tracker
LQPEG linear quadratic pursuit-evasion games

References

  1. Isaacs, R. Differential Games I, II, III, IV; The RAND Corp. Memo. RM-1391, RM-1399, RM-1411, RM-1486, 1954-1956.
  2. Isaacs, R. Differential games. A mathematical theory with applications to warfare and pursuit, control and optimization; John Wiley & Sons Inc.: New York, 1965; pp. xvii+384. [CrossRef]
  3. Kalman, R.E. Contributions to the theory of optimal control. Bol. Soc. Mat. Mexicana (2) 1960, 5, 102–119.
  4. Kalman, R.E. When is a linear control system optimal? Trans. ASME Ser. D. J. Basic Engineering 1964, 86, 81–90.
  5. Kalman, R.E.; Koepcke, R.W. Optimal synthesis of linear sampling control systems using generalized performance indexes. Trans. ASME Ser. D. J. Basic Engineering 1958, 80, 1820–1826. [CrossRef]
  6. Kalman, R.E. The theory of optimal control and the calculus of variations. In Mathematical optimization techniques; Univ. California Press: Berkeley, Calif., 1963; pp. 309–331.
  7. Ho, Y.C.; Bryson, Jr., A.E.; Baron, S. Differential games and optimal pursuit-evasion strategies. IEEE Trans. Automatic Control 1965, AC-10, 385–389. [CrossRef]
  8. Bryson, Jr., A.E.; Ho, Y.C. Applied optimal control; Hemisphere Publishing Corp. Washington, D. C., 1975; pp. xiv+481. Optimization, estimation, and control, Revised printing.
  9. Hilger, S. Ein Maßkettenkalkül mit Anwendung auf Zentrumsmannigfaltigkeiten; PhD Thesis, Universität Würzburg, 1988.
  10. Bohner, M.; Peterson, A. Dynamic equations on time scales; Birkhäuser Boston Inc.: Boston, MA, 2001; pp. x+358.
  11. Bohner, M.; Peterson, A., Eds. Advances in dynamic equations on time scales; Birkhäuser Boston Inc.: Boston, MA, 2003; pp. xii+348.
  12. Bartosiewicz, Z.; Pawłuszewicz, E. Linear control systems on time scales: unification of continuous and discrete. Proceedings of the 10th IEEE International Conference on Methods and Models in Automation and Robotics MMAR’04 2004, pp. 263–266.
  13. Bartosiewicz, Z.; Pawłuszewicz, E. Realizations of linear control systems on time scales. Control Cybernet. 2006, 35, 769–786. [CrossRef]
  14. Davis, J.; Gravagne, I.; Jackson, B.; Marks II, R. Controllability, observability, realizability, and stability of dynamic linear systems. Electron. J. Diff. Eqns 2009, 2009, 1–32.
  15. Fausett, L.V.; Murty, K.N. Controllability, observability and realizability criteria on time scale dynamical systems. Nonlinear Stud. 2004, 11, 627–638.
  16. Bohner, M.; Wintz, N. Controllability and observability of time-invariant linear dynamic systems. Mathematica Bohemica 2012, 137, 149–163. [CrossRef]
  17. Bohner, M. Calculus of variations on time scales. Dynam. Systems Appl. 2004, 13, 339–349. [CrossRef]
  18. DaCunha, J.J. Lyapunov Stability and Floquet Theory for Nonautonomous Linear Dynamic Systems on Time Scales; PhD Thesis, Baylor University, 2004.
  19. Hilscher, R.; Zeidan, V. Weak maximum principle and accessory problem for control problems on time scales. Nonlinear Anal. 2009, 70, 3209–3226. [CrossRef]
  20. Bettiol, P.; Bourdin, L. Pontryagin maximum principle for state constrained optimal sampled-data control problems on time scales. ESAIM: Control, Optimisation and Calculus of Variations 2021, 27. [CrossRef]
  21. Zhu, Y.; Jia, G. Linear Feedback of Mean-Field Stochastic Linear Quadratic Optimal Control Problems on Time Scales. Mathematical Problems in Engineering 2020, 2020, 8051918. [CrossRef]
  22. Ren, Q.Y.; Sun, J.P.; et al. Optimality necessary conditions for an optimal control problem on time scales. AIMS Math. 2021, 6, 5639–5646. [CrossRef]
  23. Zhu, Y.; Jia, G. Stochastic Linear Quadratic Control Problem on Time Scales. Discrete Dynamics in Nature and Society 2021, 2021, 5743014. [CrossRef]
  24. Poulsen, D.; Defoort, M.; Djemai, M. Mean Square Consensus of Double-Integrator Multi-Agent Systems under Intermittent Control: A Stochastic Time Scale Approach. Journal of the Franklin Institute 2019, 356. [CrossRef]
  25. Duque, C.; Leiva, H. Controllability of a semilinear neutral dynamic equation on time scales with impulses and nonlocal conditions. TWMS Journal of Applied and Engineering Mathematics 2023, 13, 975–989.
  26. Bohner, M.; Wintz, N. The linear quadratic regulator on time scales. Int. J. Difference Equ 2010, 5, 149–174.
  27. Bohner, M.; Wintz, N. The linear quadratic tracker on time scales. International Journal of Dynamical Systems and Differential Equations 2011, 3, 423–447. [CrossRef]
  28. Mu, Z.; Jie, T.; Zhou, Z.; Yu, J.; Cao, L. A survey of the pursuit–evasion problem in swarm intelligence. Frontiers of Information Technology & Electronic Engineering 2023, 24, 1093–1116. [CrossRef]
  29. Sun, Z.; Sun, H.; Li, P.; Zou, J. Cooperative strategy for pursuit-evasion problem in the presence of static and dynamic obstacles. Ocean Engineering 2023, 279, 114476. [CrossRef]
  30. Chen, N.; Li, L.; Mao, W. Equilibrium Strategy of the Pursuit-Evasion Game in Three-Dimensional Space. IEEE/CAA Journal of Automatica Sinica 2024, 11, 446–458. [CrossRef]
  31. Venigalla, C.; Scheeres, D.J. Delta-V-Based Analysis of Spacecraft Pursuit–Evasion Games. Journal of Guidance, Control, and Dynamics 2021, 44, 1961–1971. [CrossRef]
  32. Feng, Y.; Dai, L.; Gao, J.; Cheng, G. Uncertain pursuit-evasion game. Soft Comput. 2020, 24, 2425–2429. [CrossRef]
  33. Bertram, J.; Wei, P. An Efficient Algorithm for Multiple-Pursuer-Multiple-Evader Pursuit/Evasion Game. AIAA Scitech 2021 Forum, 2021. [CrossRef]
  34. Ye, D.; Shi, M.; Sun, Z. Satellite proximate pursuit-evasion game with different thrust configurations. Aerospace Science and Technology 2020, 99, 105715. [CrossRef]
  35. Libich, J.; Stehlík, P. Macroeconomic games on time scales. Dynamic Systems and Applications 2008, 5, 274–278.
  36. Petrov, N.; Mozhegova, E. On a simple pursuit problem on time scales of two coordinated evaders. Chelyabinsk Journal of Physics and Mathematics 2022, 7, 277–286. [CrossRef]
  37. Mozhegova, E.; Petrov, N. The differential game “Cossacks–robbers” on time scales. Izvestiya Instituta Matematiki i Informatiki Udmurtskogo Gosudarstvennogo Universiteta 2023, 62, 56–70. [CrossRef]
  38. Minh, V.D.; Phuong, B.L. Linear pursuit games on time scales with delay in information and geometrical constraints. TNU Journal of Science and Technology 2019, 200, 11–17.
  39. Lewis, F.L.; Vrabie, D.L.; Syrmos, V.L. Optimal Control; John Wiley & Sons, Inc., 2012. [CrossRef]
  40. Poulsen, D.; Davis, J.; Gravagne, I. Optimal Control on Stochastic Time Scales. IFAC-PapersOnLine 2017, 50, 14861–14866. [CrossRef]
  41. Siegmund, S.; Stehlik, P. Time scale-induced asynchronous discrete dynamical systems. Discrete & Continuous Dynamical Systems - B 2017, 22. [CrossRef]
  42. Poulsen, D.; Wintz, N. The Kalman filter on stochastic time scales. Nonlinear Analysis: Hybrid Systems 2019, 33, 151–161. [CrossRef]
Figure 6.1. Two-dimensional LQPEG on an isolated, uneven $\mathbb{T}$