Exact Solution for the Production Planning Problem with Several Regimes Switching over an Infinite Horizon Time

Dragos-Patru Covei

doi:10.20944/preprints202309.0965.v1

Submitted: 13 September 2023

Posted: 14 September 2023

Abstract

We consider a stochastic production planning problem with regime switching. There are k≥1 regimes corresponding to different economic cycles. The objective is to minimize the production costs, and we analyze the problem by the value function approach. Our main contribution is to show that the optimal production is characterized by an exact solution of an elliptic system of partial differential equations. A verification result is given for the solution obtained.

Keywords: production planning; regime switching; PDE system

Subject: Computer Science and Mathematics - Computational Mathematics

1. Introduction and proposal of the paper

We consider a factory that produces N \geq 1 types of economic goods and stores them in a designated inventory location. The model is described mathematically below.

Let (Ω, F, F, P) be a complete filtered probability space, where P is the historical probability and the filtration

F = {F_{t}| t \in [0, \infty)}

is generated by an R^{N}-valued Brownian motion w = (w_{1}, . . ., w_{N}) with respect to the probability P.

In the production planning problem, regime switching is captured by a continuous-time homogeneous Markov chain ε (t), adapted to F, that can take k different values modelling k regimes, denoted by 1, 2, . . ., k. The rate matrix of the Markov chain, i.e., the strongly irreducible generator of ε, is denoted by

G = {[ϑ_{i j}]}_{k \times k}

where

ϑ_{i i} = - a_{i i} < 0 for all i, ϑ_{i j} = a_{i j} \geq 0 for all i \neq j,

and the diagonal elements ϑ_{i i} may be expressed as

ϑ_{i i} = - \sum_{j \neq i} ϑ_{i j} .

(1)
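As a small illustration of relation (1), the following sketch builds a rate matrix from its off-diagonal intensities and checks that every row sums to zero. The helper names `diagonal_from_rates` and `is_generator` are hypothetical and not part of the paper; the sample rates are those of the three-regime generator used in Section 7.

```python
def diagonal_from_rates(rates):
    """Fill the diagonal so that theta_ii = -sum_{j != i} theta_ij, as in (1)."""
    k = len(rates)
    G = [list(map(float, row)) for row in rates]
    for i in range(k):
        G[i][i] = -sum(G[i][j] for j in range(k) if j != i)
    return G


def is_generator(G, tol=1e-12):
    """Check nonnegative off-diagonal entries and zero row sums."""
    for i, row in enumerate(G):
        if any(row[j] < 0 for j in range(len(row)) if j != i):
            return False
        if abs(sum(row)) > tol:
            return False
    return True


# Off-diagonal intensities a_ij; the diagonal entries are placeholders.
G = diagonal_from_rates([[0, 3, 0], [4, 0, 3], [0, 4, 0]])
print(G, is_generator(G))
```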

In this case, if P_{t} (t) = E [ε (t)] \in R, then

\frac{d P_{t} (t)}{d t} = G ε (t) .

(2)

Moreover, ε (t) is explicitly described by the integral form

ε (t) = ε (0) + \int_{0}^{t} G ε (u) d u + M (t),

(3)

where M (t) is a martingale with respect to F. Here and hereafter, we use the notation from other papers to keep the applicative character of the problem:

p (t) = (p_{1} (t), . . ., p_{N} (t))

represents the production rate at time t (the control variable), adjusted for the demand rate.

The demand-adjusted inventory levels are modeled by the following system of stochastic differential equations

d y_{i} (t) = p_{i} d t + σ_{ε (t)} d w_{i}, y_{i} (0) = y_{i}^{0} for i = 1, . . ., N,

(4)

where y_{i} (t) is an Itô process in R (i.e., the inventory level of good i at time t, adjusted for demand), p_{i} is the deterministic part, σ_{ε (t)} is a random, regime-dependent, constant (non-zero) diffusion coefficient taking the values σ_{1}, σ_{2}, . . ., σ_{k}, and y_{i}^{0} is the initial condition (i.e., the initial inventory level of good i).

The stochasticity here is due to demand adjustment, which is random and dependent on the regime. This is the most commonly used process when the demand is more volatile in some periods (e.g., some states of the Markov chain) and less volatile in other periods.
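To make the dynamics (4) concrete, here is a minimal Euler-Maruyama sketch of one sample path. It is an illustration under stated assumptions, not the paper's method: regimes are indexed from 0, a switch from regime i to j during a step of length dt is approximated to first order with probability a_ij * dt, and the function name `simulate_inventory` and all parameter values are hypothetical.

```python
import math
import random


def simulate_inventory(rates, sigma, policy, y0, T=1.0, dt=1e-3, regime0=0, seed=0):
    """Simulate one path of (4) and return the final (inventory, regime).

    rates[i][j] (i != j) are the switching intensities a_ij, sigma[i] is the
    diffusion coefficient in regime i, and policy(y, regime) returns the
    production rate vector p.
    """
    rng = random.Random(seed)
    y, regime = list(y0), regime0
    for _ in range(int(round(T / dt))):
        p = policy(y, regime)
        # Euler-Maruyama step: drift p_i dt plus sigma * sqrt(dt) * N(0, 1).
        y = [yi + pi * dt + sigma[regime] * math.sqrt(dt) * rng.gauss(0.0, 1.0)
             for yi, pi in zip(y, p)]
        # First-order regime switch: jump i -> j with probability a_ij * dt.
        u, acc = rng.random(), 0.0
        for j, a in enumerate(rates[regime]):
            if j == regime:
                continue
            acc += a * dt
            if u < acc:
                regime = j
                break
    return y, regime


# Hypothetical two-regime example with zero production.
rates = [[0.0, 0.5], [0.5, 0.0]]
sigma = [2 ** -0.5, 2 ** -0.5]
y_T, regime_T = simulate_inventory(rates, sigma, lambda y, r: [0.0] * len(y), [1.0, -1.0])
print(y_T, regime_T)
```

A smaller dt reduces the discretization error of both the diffusion step and the switching approximation.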

The performance over time of a demand-adjusted production p (t) = (p_{1} (t), . . ., p_{N} (t)) is measured by means of its cost. At this point, we introduce the cost functional

J (p_{1}, . . ., p_{N}) : = E \int_{0}^{\infty} ({| p (t) |}^{2} + {| y (t) |}^{2}) e^{- α_{ε (t)} t} d t, y (t) = (y_{1} (t), . . ., y_{N} (t)),

(5)

which measures the quadratic loss.

The loss arises because we measure deviations from the demand. Here α_{ε (t)} is a regime-dependent constant psychological rate of time discount, taking the values α_{1} \geq 0, α_{2} \geq 0, . . ., α_{k} \geq 0, which gives rise to the exponential discounting.

We are now ready to state our objective, which is to minimize the cost functional, i.e.,

inf_{p_{1}, . . ., p_{N}} J (p_{1}, . . ., p_{N}),

(6)

subject to the Itô equation (4).

This model problem was proposed by Bensoussan, Sethi, Vickson and Derzko [1] in the context of no regime switching in the economy and for the case of a factory producing one type of economic goods. Later, many other authors considered regime switching.

In production management, Cadenillas, Lakner and Pinedo [2] adapted the model problem in [1] to study the optimal stochastic production planning problem of a company within an economy characterized by two-state regime switching with limited/unlimited information. Later, Dong, Malikopoulos, Djouadi and Kuruganti [9] applied the model described in [2] in civil engineering, to the study of the optimal stochastic control problem for home energy systems with solar and energy storage devices when the demand is subject to Brownian motion; the two switching regimes are the peak and off-peak energy demand.

Pirvu and Zhang [17] also devoted a good deal of attention to this subject, studying the effect of high versus low discount rates on a consumption-investment decision problem.

After that, there have been numerous applications of regime switching in many important problems in economics, operations research, actuarial science, finance, reinsurance, and other fields, see the works of: Capponi and Figueroa-López [3], Elliott and Hamada [11], Gharbi and Kenne [13], Yao, Zhang and Zhou [22] and Wang, Chang and Fang [23] for more details.

There are of course other research studies that may also serve to better explain the importance of regime switching in the real world.

In a precursor to this article, Covei and Pirvu [5] formulate and analyze the production planning problem in the continuous-time case, with no regime switching in the economy, over an infinite time horizon. In the paper [7], the author improved the results of [5], in the sense that the value function in the production model is given in closed form. Related works that deal with no regime switching in the economy are Sheng-Zhu-Wang [20] and Qin-Bai-Ralescu [18].

Recently, Canepa, Covei and Pirvu [4] considered the production planning problem with regime switching in the economy over a finite time horizon. There, the solution is obtained through numerical approaches. A closed-form expression for the corresponding case of regime switching on a particular state space consisting of two regimes over an infinite time horizon is, however, available in [6]. This leaves at least one natural question, suggested by [14]: can we obtain a closed-form solution when the state space consists of several states? The present paper fills this gap in the literature by providing a closed-form solution to the stochastic production planning problem with regime switching in the economy over an infinite horizon in a general state space.

The technique presented in this paper makes a methodological contribution that may be of independent interest for the considerable number of other works on regime switching.

To conclude this introduction, our paper is structured as follows. In Section 2 we give the relationship of our model with a system of partial differential equations (PDE system). Section 3 presents a closed-form solution and the uniqueness of the solution for our production planning problem. A numerical approximation of the solution is given in Section 4. In Section 5 we present a verification result. In Section 6 we introduce the equilibrium production rates as the subgame perfect production rates. They are the output of an interpersonal game between the present self and future selves. The equilibrium production rates are time consistent, meaning that there is no incentive to deviate from them. It turns out that in our setting the optimal production rates are among the equilibrium ones, so they are time consistent. In Section 7, we give some applications. Finally, Section 8 contains a final remark and the conclusion.

Having presented the model that we want to solve, now we provide our means to tackle it.

2. Reduction of the model to a PDE system

Our approach is based on the value function and dynamic programming, which leads to the Hamilton-Jacobi-Bellman (HJB) system of equations.

To characterize the value function, we apply the probabilistic approach. We search for functions V (x, 1), . . ., V (x, k) such that the stochastic process S^{p} (t) defined below

S^{p} (t) = e^{- α_{ε (t)} t} V (y (t), ε (t)) - \int_{0}^{t} [{| p (s) |}^{2} + {| y (s) |}^{2}] e^{- α_{ε (s)} s} d s,

(7)

is a supermartingale for all p (t) = (p_{1} (t), . . ., p_{N} (t)) and a martingale for the optimal control

p^{*} (t) = (p_{1}^{*} (t), . . ., p_{N}^{*} (t)) .

As shown in [5], if this is achieved, then the transversality condition

lim_{t \to \infty} E [e^{- α_{ε (t)} t} V (y (t), ε (t))] = 0,

(8)

together with some estimates on the value function, yields

- V (x, i) = inf J (p_{1}, . . ., p_{N}),

(9)

where x = (x_{1}, . . ., x_{N}) \in R^{N} assumes the values (y_{1} (0), . . ., y_{N} (0)).

Once such a function is found, it turns out that (u_{1}, . . ., u_{k}) with

u_{1} (x) = - V (x, 1), . . ., u_{k} (x) = - V (x, k),

is the value function. We search for functions u_{1}, . . ., u_{k} in C^{2} [0, \infty); the supermartingale/martingale requirement then yields, by Itô's Lemma for Markov-modulated diffusions, the HJB system of equations that characterizes the value function

- (\begin{matrix} \frac{σ_{1}^{2}}{2} Δ u_{1} \\ . . . \\ \frac{σ_{k}^{2}}{2} Δ u_{k} \end{matrix}) + G_{a, α} (\begin{matrix} u_{1} \\ . . . \\ u_{k} \end{matrix}) - (\begin{matrix} {|x|}^{2} \\ . . . \\ {|x|}^{2} \end{matrix}) = (\begin{matrix} inf_{p} {p \nabla u_{1} + {|p|}^{2}} \\ . . . \\ inf_{p} {p \nabla u_{k} + {|p|}^{2}} \end{matrix}),

(10)

where

G_{a, α} = (\begin{matrix} a_{11} + α_{1} & - a_{12} & . . . & - a_{1 k} \\ - a_{21} & a_{22} + α_{2} & . . . & - a_{2 k} \\ . . . & . . . & . . . & . . . \\ - a_{k 1} & - a_{k 2} & . . . & a_{k k} + α_{k} \end{matrix}) .

For the transformation of the HJB system, it is essential to observe that

inf_{p} {p \nabla u_{i} + {|p|}^{2}} = - \frac{1}{4} {|\nabla u_{i}|}^{2}, i = 1, 2, . . ., k .

(11)
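For the reader's convenience, identity (11) can be checked by completing the square. This short derivation is added here as an illustration; it uses only the symbols already introduced.

```latex
\[
p \cdot \nabla u_{i} + |p|^{2}
  = \Bigl| p + \tfrac{1}{2} \nabla u_{i} \Bigr|^{2} - \tfrac{1}{4} |\nabla u_{i}|^{2}
  \;\geq\; - \tfrac{1}{4} |\nabla u_{i}|^{2},
\qquad \text{with equality if and only if } p = - \tfrac{1}{2} \nabla u_{i} .
\]
```

The minimizer p = - \frac{1}{2} \nabla u_{i} is exactly the feedback form that appears in (14).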

Thus, the HJB system (10) can be written as a PDE system

\{\begin{matrix} - \frac{σ_{1}^{2}}{2} Δ u_{1} + (a_{11} + α_{1}) u_{1} - \sum_{i = 2}^{k} a_{1 i} u_{i} - {|x|}^{2} = - \frac{1}{4} {|\nabla u_{1}|}^{2}, \\ . . . \\ - \frac{σ_{k}^{2}}{2} Δ u_{k} + (a_{k k} + α_{k}) u_{k} - \sum_{i = 1}^{k - 1} a_{k i} u_{i} - {|x|}^{2} = - \frac{1}{4} {|\nabla u_{k}|}^{2} . \end{matrix}

(12)

To perform the verification, i.e., show that the HJB system gives the solution to the optimization problem, one should write (12) with the following boundary condition

u_{1} (x) \to \infty, . . ., u_{k} (x) \to \infty, as | x | \to \infty .

(13)

The value function in turn gives us the candidate optimal control. The first-order optimality conditions for the minimization on the left-hand side of (11) are sufficient for optimality, since we deal with a quadratic (convex) function, and they produce the candidate optimal control as follows:

p_{i}^{*} (t) = {\bar{p}}_{i} (y_{1} (t), \dots, y_{N} (t), ε (t)), i = 1, . . ., N,

and

{\bar{p}}_{i} (x_{1}, . . ., x_{N}, j) = - \frac{1}{2} \frac{\partial u_{j}}{\partial x_{i}} (x_{1}, . . ., x_{N}), for i \in {1, . . ., N}, j \in {1, . . ., k} .

(14)

The production rate {\bar{p}}_{i} is allowed to be negative. A negative production rate would correspond to a write-off or disposal of inventory (for example, due to obsolescence or perishability).

Our next goal is to determine the candidate optimal control in closed form.

3. Closed form solution for the PDE system

Despite its apparent simplicity, the PDE system (12) with boundary conditions (13) presents a host of mathematical difficulties arising from the presence of the nonlinear gradient terms {|\nabla u_{1}|}^{2}, . . ., {|\nabla u_{k}|}^{2}; see [8] for details.

The following result, proved below, is the main original contribution of the article.

Theorem 1.

Assume that G_{a, α} is a positive definite matrix with all elements of G_{a, α}^{- 1} positive. Then, the PDE system (12) with boundary condition (13) has a unique radially symmetric, convex, positive classical solution with quadratic growth.

Proof of Theorem 1

In the following, we construct the function

(u_{1}, . . ., u_{k}) \in C^{2} [0, \infty) \times . . . \times C^{2} [0, \infty),

which satisfies (12) with boundary condition (13). One way of solving this PDE system is to show that there exists

(u_{1} (x), . . ., u_{k} (x)) = (β_{1} {|x|}^{2} + η_{1}, . . ., β_{k} {|x|}^{2} + η_{k}), with β_{1}, . . ., β_{k}, η_{1}, . . ., η_{k} \in (0, \infty),

(15)

that solves (12).

The main task in proving the existence of (15) is to show that there exist β_{1}, . . ., β_{k}, η_{1}, . . ., η_{k} \in (0, \infty) such that, after substituting Δ u_{i} = 2 N β_{i} and |\nabla u_{i}| = 2 β_{i} |x| into (12),

\{\begin{matrix} - \frac{2 β_{1} N σ_{1}^{2}}{2} + (a_{11} + α_{1}) (β_{1} {|x|}^{2} + η_{1}) - \sum_{i = 2}^{k} a_{1 i} (β_{i} {|x|}^{2} + η_{i}) - {|x|}^{2} = - \frac{1}{4} {(2 β_{1} |x|)}^{2}, \\ . . . \\ - \frac{2 β_{k} N σ_{k}^{2}}{2} + (a_{k k} + α_{k}) (β_{k} {|x|}^{2} + η_{k}) - \sum_{i = 1}^{k - 1} a_{k i} (β_{i} {|x|}^{2} + η_{i}) - {|x|}^{2} = - \frac{1}{4} {(2 β_{k} |x|)}^{2}, \end{matrix}

or equivalently, after grouping the terms

\{\begin{matrix} {|x|}^{2} [- \sum_{i = 2}^{k} a_{1 i} β_{i} + (a_{11} + α_{1}) β_{1} + β_{1}^{2} - 1] - β_{1} N σ_{1}^{2} - \sum_{i = 2}^{k} a_{1 i} η_{i} + (a_{11} + α_{1}) η_{1} = 0, \\ . . . \\ {|x|}^{2} [- \sum_{i = 1}^{k - 1} a_{k i} β_{i} + (a_{k k} + α_{k}) β_{k} + β_{k}^{2} - 1] - β_{k} N σ_{k}^{2} - \sum_{i = 1}^{k - 1} a_{k i} η_{i} + (a_{k k} + α_{k}) η_{k} = 0 . \end{matrix}

Now, we consider the system of equations

\{\begin{matrix} - \sum_{i = 2}^{k} a_{1 i} β_{i} + (a_{11} + α_{1}) β_{1} + β_{1}^{2} - 1 = 0 \\ . . . \\ - \sum_{i = 1}^{k - 1} a_{k i} β_{i} + (a_{k k} + α_{k}) β_{k} + β_{k}^{2} - 1 = 0 \\ - β_{1} N σ_{1}^{2} - \sum_{i = 2}^{k} a_{1 i} η_{i} + (a_{11} + α_{1}) η_{1} = 0 \\ . . . \\ - β_{k} N σ_{k}^{2} - \sum_{i = 1}^{k - 1} a_{k i} η_{i} + (a_{k k} + α_{k}) η_{k} = 0 . \end{matrix}

(16)

To solve (16), we rearrange equations 1, . . ., k as

(\begin{matrix} a_{11} + α_{1} & . . . & - a_{1 k} \\ . . . & . . . & . . . \\ - a_{k 1} & . . . & a_{k k} + α_{k} \end{matrix}) (\begin{matrix} β_{1} \\ . . . \\ β_{k} \end{matrix}) = (\begin{matrix} 1 - β_{1}^{2} \\ . . . \\ 1 - β_{k}^{2} \end{matrix}) .

(17)

The arguments in [15,16] show that the system (17) has a unique positive solution. Next, letting (β_{1}, . . ., β_{k}) \in (0, \infty) \times . . . \times (0, \infty) be the unique solution of (17), we observe that equations k + 1, . . ., 2 k of (16) can be written equivalently as

(\begin{matrix} β_{1} N σ_{1}^{2} \\ . . . \\ β_{k} N σ_{k}^{2} \end{matrix}) = (\begin{matrix} a_{11} + α_{1} & . . . & - a_{1 k} \\ . . . & . . . & . . . \\ - a_{k 1} & . . . & a_{k k} + α_{k} \end{matrix}) (\begin{matrix} η_{1} \\ . . . \\ η_{k} \end{matrix}),

(18)

from which, using the fact that G_{a, α}^{- 1} has all elements positive, we see that there exist unique η_{1}, . . ., η_{k} \in (0, \infty) that solve (16). Then (u_{1} (x), . . ., u_{k} (x)) solves (12). This finishes the proof of Theorem 1.
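The construction in the proof can be sanity-checked numerically. The sketch below evaluates the residual of each equation of (12) for the quadratic ansatz (15), using Δ u_i = 2 N β_i and |∇u_i|^2 = 4 β_i^2 |x|^2. The function name `pde_residuals` is hypothetical, and the single-regime data (k = 1, no switching, so the coupling sum is empty) are illustrative values chosen only so that β and η have simple closed forms.

```python
import math


def pde_residuals(beta, eta, a, alpha, sigma, N, r2):
    """Residual of each equation of (12) for u_i(x) = beta_i |x|^2 + eta_i.

    a[i][j] holds the rates a_ij (a[i][i] is a_ii) and r2 stands for |x|^2.
    """
    k = len(beta)
    u = [beta[i] * r2 + eta[i] for i in range(k)]
    res = []
    for i in range(k):
        coupling = sum(a[i][j] * u[j] for j in range(k) if j != i)
        lhs = (-0.5 * sigma[i] ** 2 * 2 * N * beta[i]
               + (a[i][i] + alpha[i]) * u[i] - coupling - r2)
        rhs = -0.25 * 4 * beta[i] ** 2 * r2
        res.append(lhs - rhs)
    return res


# Illustrative single-regime data: a_11 = 0, alpha = 1, sigma = 1, N = 2.
alpha, sigma, N = 1.0, 1.0, 2
beta = (-alpha + math.sqrt(alpha ** 2 + 4.0)) / 2.0  # root of beta^2 + alpha*beta - 1 = 0
eta = beta * N * sigma ** 2 / alpha                  # from (18) with k = 1
for r2 in (0.0, 1.0, 7.5):
    print(r2, pde_residuals([beta], [eta], [[0.0]], [alpha], [sigma], N, r2))
```

Residuals at the level of floating-point rounding for every |x|^2 indicate that both the |x|^2 coefficient and the constant term of (16) vanish.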

Because our solution depends on solving a nonlinear algebraic system of equations, the exact solution of the PDE system cannot be determined directly by computer software. To implement the solution of the PDE system (12) in a software application, the next section gives a numerical approximation of the solution to (16), again using the arguments in [15,16].

4. Numerical solution of an algebraic nonlinear system in building the solution for the PDE system

We intend to approximate β_{1}, . . ., β_{k}, η_{1}, . . ., η_{k} \in (0, \infty) in (15) by the Newton-Raphson method. To do this, we denote

\begin{matrix} h_{1} (β_{1}, . ., β_{k}) = - \sum_{i = 2}^{k} a_{1 i} β_{i} + (a_{11} + α_{1}) β_{1} + β_{1}^{2} - 1, \\ . . . \\ h_{k} (β_{1}, . ., β_{k}) = - \sum_{i = 1}^{k - 1} a_{k i} β_{i} + (a_{k k} + α_{k}) β_{k} + β_{k}^{2} - 1, \end{matrix}

(19)

and

J_{(h_{1}, . . ., h_{k})} = (\begin{matrix} a_{11} + α_{1} + 2 β_{1} & . . . & - a_{1 k} \\ . . . & . . . & . . . \\ - a_{k 1} & . . . & a_{k k} + α_{k} + 2 β_{k} \end{matrix}),

the Jacobian matrix of (19). For n = 0, 1, 2, . . . we approximate the unique parameters (β_{1}, . . ., β_{k}) \in (0, \infty) \times . . . \times (0, \infty) in the following way

(\begin{matrix} β_{1}^{n + 1} \\ . . . \\ β_{k}^{n + 1} \end{matrix}) = (\begin{matrix} β_{1}^{n} \\ . . . \\ β_{k}^{n} \end{matrix}) - {(\begin{matrix} a_{11} + α_{1} + 2 β_{1}^{n} & . . . & - a_{1 k} \\ . . . & . . . & . . . \\ - a_{k 1} & . . . & a_{k k} + α_{k} + 2 β_{k}^{n} \end{matrix})}^{- 1} (\begin{matrix} h_{1} (β_{1}^{n}, . ., β_{k}^{n}) \\ . . . \\ h_{k} (β_{1}^{n}, . ., β_{k}^{n}) \end{matrix}),

with β_{1}^{0}, . ., β_{k}^{0} \in (0, \infty). Clearly, η_{1}, . . ., η_{k} \in (0, \infty) are then easily determined from (18).
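The iteration above can be sketched in a few lines of plain Python. This is a minimal illustration rather than a reference implementation: the function names `solve_linear`, `newton_beta` and `solve_eta`, the stopping tolerance and the iteration cap are assumptions of this sketch, and the linear solves use a simple Gaussian elimination with partial pivoting. The sample data are those of Application 2 in Section 7.

```python
def solve_linear(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [list(map(float, A[i])) + [float(b[i])] for i in range(n)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[piv] = M[piv], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for j in range(c, n + 1):
                M[r][j] -= f * M[c][j]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x


def newton_beta(G, beta0, tol=1e-12, max_iter=50):
    """Newton-Raphson for h(beta) = G beta + beta^2 - 1 = 0, see (17) and (19).

    G is the matrix G_{a,alpha}; the Jacobian of h is G + diag(2 beta).
    """
    k = len(beta0)
    beta = list(map(float, beta0))
    for _ in range(max_iter):
        h = [sum(G[i][j] * beta[j] for j in range(k)) + beta[i] ** 2 - 1.0
             for i in range(k)]
        if max(abs(v) for v in h) < tol:
            break
        J = [[G[i][j] + (2.0 * beta[i] if i == j else 0.0) for j in range(k)]
             for i in range(k)]
        step = solve_linear(J, h)
        beta = [b - s for b, s in zip(beta, step)]
    return beta


def solve_eta(G, beta, sigma, N):
    """Solve the linear system (18): G eta = (beta_i N sigma_i^2)_i."""
    return solve_linear(G, [b * N * s ** 2 for b, s in zip(beta, sigma)])


# Data of Application 2: G_{a,alpha} with alpha_i = 1, N = 3, sigma_i = 1/sqrt(3).
G = [[4.0, -3.0, 0.0], [-4.0, 8.0, -3.0], [0.0, -4.0, 5.0]]
beta = newton_beta(G, [1.0, 2.0, 3.0])
eta = solve_eta(G, beta, [3 ** -0.5] * 3, 3)
print(beta, eta)
```

Solving a linear system at each step, instead of forming the inverse Jacobian explicitly, is a common design choice; the two are equivalent in exact arithmetic.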

We now move on to the verification result, which is also inspired by [6].

5. Verification

Next, we show that the control of (14) obtained in our reduction strategy is indeed optimal. We apply the supermartingale and martingale approach.

Repeating the argument in [4], as a first step we show that the stochastic process S^{p} (t) defined below

S^{p} (t) = e^{- α_{ε (t)} t} V (y (t), ε (t)) - \int_{0}^{t} [{| p (s) |}^{2} + {| y (s) |}^{2}] e^{- α_{ε (s)} s} d s,

is a supermartingale for all p (t) = (p_{1} (t), . . ., p_{N} (t)) and a martingale for the optimal control

p^{*} (t) = (p_{1}^{*} (t), . . ., p_{N}^{*} (t)) .

Owing to the well-known Itô Lemma for Markov-modulated diffusions (see [22] for more on this) we have

\begin{matrix} d S^{p} (s) & = & e^{- α_{ε (s)} s} [\frac{σ_{ε (s)}^{2}}{2} Δ V (y (s), ε (s)) - {|y (s)|}^{2} + p (s) \nabla V (y (s), ε (s)) \\ - {|p (s)|}^{2} - (α_{ε (s)} + a_{ε (s) ε (s)}) V (y (s), ε (s)) \\ + \sum_{i = 1, i \neq ε (s)}^{k} a_{ε (s) i} V (y (s), i)] d s + d Z (s), \end{matrix}

for some martingale Z (s) with Z (0) = 0. Therefore

\begin{matrix} E S^{p} (t) & = & S^{p} (0) + E [\int_{0}^{t} e^{- α_{ε (s)} s} [\frac{σ_{ε (s)}^{2}}{2} Δ V (y (s), ε (s)) - {|y (s)|}^{2} + p (s) \nabla V (y (s), ε (s))] d s] \\ + E [\int_{0}^{t} e^{- α_{ε (s)} s} [- {|p (s)|}^{2} - (α_{ε (s)} + a_{ε (s) ε (s)}) V (y (s), ε (s))] d s] \\ + E [\int_{0}^{t} e^{- α_{ε (s)} s} [\sum_{i = 1, i \neq ε (s)}^{k} a_{ε (s) i} V (y (s), i)] d s] . \end{matrix}

Then, by the HJB system (10) and the PDE system (12), S^{p} (t) is a martingale for the optimal control and a supermartingale otherwise. This fact, combined with the transversality condition, yields the claim.

In the second step, we establish the optimality of (p_{1}^{*}, . . ., p_{N}^{*}). Consider the quadratic form of the value function

V (x, 1) = - β_{1} {|x|}^{2} - η_{1}, . . ., V (x, k) = - β_{k} {|x|}^{2} - η_{k},

(20)

where β_{i}, η_{i} \in (0, \infty) are the solution of (16).

Let us provide a lower bound on α_{1}, . . ., α_{k} so that the transversality condition (8) is met, i.e., in view of (20), so that

lim_{t \to \infty} E [e^{- α_{ε (t)} t} {| y (t) |}^{2}] = 0

holds true. Under the optimal control, the SDE system (4) becomes

d y_{i} (t) = - β_{ε (t)} y_{i} (t) d t + σ_{ε (t)} d w_{i} (t), i = 1, \dots, N .

Using Itô's Lemma, one gets

\begin{matrix} d {(y_{i} (t))}^{2} & = & 2 y_{i} (t) d y_{i} (t) + d y_{i} (t) d y_{i} (t) \\ & = & [- 2 β_{ε (t)} {(y_{i} (t))}^{2} + σ_{ε (t)}^{2}] d t + 2 y_{i} (t) σ_{ε (t)} d w_{i} (t) . \end{matrix}

We introduce

F_{i} (t) = E [{(y_{i} (t))}^{2}] .

By taking expectations in the above equation, we get

\begin{matrix} F_{i} (t) & = & E [\int_{0}^{t} [- 2 β_{ε (s)} {(y_{i} (s))}^{2} + σ_{ε (s)}^{2}] d s + {(y_{i} (0))}^{2}] \\ & = & E [\int_{0}^{t} [- 2 β_{ε (s)} {(y_{i} (s))}^{2} + σ_{ε (s)}^{2}] d s] + {(y_{i} (0))}^{2} . \end{matrix}

Let

D_{2} = max {σ_{1}^{2}, . . ., σ_{k}^{2}}, D_{3} = max ({(y_{1} (0))}^{2}, . . ., {(y_{N} (0))}^{2}) .

Then, since β_{1}, . . ., β_{k} > 0 makes the first term of the integrand nonpositive, the above equation gives

F_{i} (t) \leq \int_{0}^{t} D_{2} d s + D_{3} .

Hence, we have that

F_{i} (t) \leq D_{2} t + D_{3} .

Therefore, it suffices to choose α_{1}, . . ., α_{k} \in (0, \infty) for the transversality condition to hold true, and the proof is complete. Finally, only the simple system of nonlinear equations (16) remains to be solved.
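For completeness, the last step can be spelled out as follows. This is a sketch of the limit under the assumption that all discount rates are strictly positive; the symbol \underline{α} is introduced here only for this remark.

```latex
\[
0 \le E\bigl[ e^{-\alpha_{\varepsilon(t)} t}\, |y(t)|^{2} \bigr]
  \le e^{-\underline{\alpha} t} \sum_{i=1}^{N} F_{i}(t)
  \le N e^{-\underline{\alpha} t} \,(D_{2} t + D_{3})
  \longrightarrow 0 \quad \text{as } t \to \infty,
\qquad \underline{\alpha} := \min \{ \alpha_{1}, \dots, \alpha_{k} \} > 0 .
\]
```

Since V in (20) is quadratic in x with bounded coefficients β_{i} and η_{i}, the same exponential factor also controls the constant term, which gives (8).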

6. The Equilibrium Production

For a production rate {p_{i} (t)}_{t \geq 0} and its corresponding inventory level {y_{i} (t)}_{t \geq 0} given by (4), we introduce equilibrium production as the subgame perfect production in the definition below (for more on this economic concept see [10]).

Definition 1.

Let F = (F_{i}, i = 1, \dots, N) : R \times {1, 2, \dots, k} \to R^{N} be a vector map such that for any x > 0 and i \in {1, 2, \dots, k}

lim inf_{ϵ ↓ 0} \frac{J ({\bar{p}}_{i}) - J (p_{i}^{ϵ})}{ϵ} \leq 0,

(21)

where the subgame perfect production is

{\bar{p}}_{i} (s) : = F_{i} ({\bar{y}}_{i} (s), ε (s)) .

Here, the process {{\bar{y}}_{i} (s)}_{s \geq 0} is the inventory level process corresponding to {{\bar{p}}_{i} (s)}_{s \geq 0}. The production rate {p_{i}^{ϵ} (s)}_{s \geq 0} is defined by

p_{i}^{ϵ} (s) = \{\begin{matrix} {\bar{p}}_{i} (s), & s \in [0, \infty) ∖ E_{ϵ, 0}, \\ p_{i} (s), & s \in E_{ϵ, 0}, \end{matrix}

(22)

with E_{ϵ, 0} = [0, ϵ], where {p_{i} (s)}_{s \in E_{ϵ, 0}} is any production rate. If (21) holds true, then {\bar{p}}_{i} (s), i = 1, \dots, N, is a subgame perfect production rate.

Equilibrium production rates are by design time consistent, meaning that they will be implemented at a future date even if the optimization criterion is updated. In some situations the optimal production rates may be time inconsistent, meaning that they will fail to be implemented in the future because they are no longer optimal once the optimization criterion is updated; they are implementable only in the presence of a commitment mechanism, which is why they are sometimes referred to as pre-commitment production rates. Let us remark that in our setting the optimal production rate

{\bar{p}}_{i}, i \in {1, . . ., N},

(23)

is a subgame perfect production with

F_{i} (x, j) : = - \frac{1}{2} \frac{\partial u_{j}}{\partial x_{i}} (x),

since

({\bar{p}}_{i}, i = 1, \dots, N) = arg min_{p_{1}, . . ., p_{N}} J (p_{1}, . . ., p_{N})

and thus (21) is automatically satisfied. Therefore the optimal production rate is time consistent.

7. Applications

We offer some applications, which are also inspired by the paper of Ghosh, Arapostathis and Marcus [14].

Application 1. Suppose there is one machine producing two products, and let ε (t) be the machine state, which can take values in two regimes, 1 = good and 2 = bad, i.e., for every t \in [0, \infty) we have ε (t) \in {1, 2}. We consider ε (t) to be a continuous-time Markov chain with generator

(\begin{matrix} - \frac{1}{2} & \frac{1}{2} \\ \frac{1}{2} & - \frac{1}{2} \end{matrix}),

and the inventory y_{i} (t) to be governed by the Itô system of stochastic differential equations (4) with diffusion coefficients σ_{1} = σ_{2} = \frac{1}{\sqrt{2}}, and we let α_{1} = α_{2} = \frac{1}{2} be the discount rates. Under these assumptions, the system (17) becomes

(\begin{matrix} a_{11} + α_{1} & - a_{12} \\ - a_{21} & a_{22} + α_{2} \end{matrix}) (\begin{matrix} β_{1} \\ β_{2} \end{matrix}) = (\begin{matrix} 1 - β_{1}^{2} \\ 1 - β_{2}^{2} \end{matrix}),

or, with our data

\{\begin{matrix} β_{1}^{2} + β_{1} - \frac{1}{2} β_{2} - 1 = 0 \\ β_{2}^{2} - \frac{1}{2} β_{1} + β_{2} - 1 = 0 \end{matrix}

which has a unique positive solution

β_{1} = \frac{1}{4} (\sqrt{17} - 1), β_{2} = \frac{1}{4} (\sqrt{17} - 1) .

On the other hand the system (18) becomes

(\begin{matrix} β_{1} N σ_{1}^{2} \\ β_{2} N σ_{2}^{2} \end{matrix}) = (\begin{matrix} a_{11} + α_{1} & - a_{12} \\ - a_{21} & a_{22} + α_{2} \end{matrix}) (\begin{matrix} η_{1} \\ η_{2} \end{matrix}),

or, with our data

(\begin{matrix} β_{1} \\ β_{2} \end{matrix}) = (\begin{matrix} 1 & - \frac{1}{2} \\ - \frac{1}{2} & 1 \end{matrix}) (\begin{matrix} η_{1} \\ η_{2} \end{matrix}),

which has a unique positive solution

η_{1} = \frac{4}{3} β_{1} + \frac{2}{3} β_{2} = \frac{1}{2} (\sqrt{17} - 1), η_{2} = \frac{2}{3} β_{1} + \frac{4}{3} β_{2} = \frac{1}{2} (\sqrt{17} - 1) .

Then

V ((x_{1}, x_{2}), 1) = V ((x_{1}, x_{2}), 2) = - \frac{1}{4} (\sqrt{17} - 1) (x_{1}^{2} + x_{2}^{2}) - \frac{1}{2} (\sqrt{17} - 1)

and furthermore, by (14), the production rate is

{\bar{p}}_{i} (x_{1}, x_{2}, j) = - β_{j} x_{i} = - \frac{1}{4} (\sqrt{17} - 1) x_{i}, for i \in {1, 2}, j \in {1, 2} .
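As a quick numerical cross-check of the closed-form constants of Application 1, the following sketch plugs β_{1} = β_{2} and η_{1} = η_{2} into the algebraic systems (17) and (18) and prints the residuals. The variable names are illustrative only.

```python
import math

beta = (math.sqrt(17.0) - 1.0) / 4.0  # closed-form beta_1 = beta_2
eta = (math.sqrt(17.0) - 1.0) / 2.0   # closed-form eta_1 = eta_2

# Data of Application 1: a_11 + alpha_1 = a_22 + alpha_2 = 1, a_12 = a_21 = 1/2,
# N = 2 products and sigma_1^2 = sigma_2^2 = 1/2.
G = [[1.0, -0.5], [-0.5, 1.0]]
N, sigma2 = 2, 0.5

# Residuals of (17): G beta + beta^2 - 1, and of (18): G eta - beta N sigma^2.
res17 = [G[i][0] * beta + G[i][1] * beta + beta ** 2 - 1.0 for i in range(2)]
res18 = [G[i][0] * eta + G[i][1] * eta - beta * N * sigma2 for i in range(2)]
print(res17, res18)
```

Both residual vectors should vanish up to floating-point rounding.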

We also give approximations of β_{1}, β_{2}, η_{1}, η_{2} using the Newton-Raphson method. Denote

\begin{matrix} h_{1} (β_{1}, β_{2}) = - a_{12} β_{2} + (a_{11} + α_{1}) β_{1} + β_{1}^{2} - 1 \\ h_{2} (β_{1}, β_{2}) = - a_{21} β_{1} + (a_{22} + α_{2}) β_{2} + β_{2}^{2} - 1 \end{matrix}

and

J_{(h_{1}, h_{2})} = (\begin{matrix} 2 β_{1} + 1 & - \frac{1}{2} \\ - \frac{1}{2} & 2 β_{2} + 1 \end{matrix}) .

We construct

\{\begin{matrix} (\begin{matrix} β_{1}^{n + 1} \\ β_{2}^{n + 1} \end{matrix}) = (\begin{matrix} β_{1}^{n} \\ β_{2}^{n} \end{matrix}) - {(\begin{matrix} a_{11} + α_{1} + 2 β_{1}^{n} & - a_{12} \\ - a_{21} & a_{22} + α_{2} + 2 β_{2}^{n} \end{matrix})}^{- 1} (\begin{matrix} h_{1} (β_{1}^{n}, β_{2}^{n}) \\ h_{2} (β_{1}^{n}, β_{2}^{n}) \end{matrix}) \\ β_{1}^{0} = β_{2}^{0} = 0.1 . \end{matrix}

Using the standard computation, approximations to four digits are

\begin{matrix} n = 1 ⟹ & β_{1}^{1} = 1.4429 & and & β_{2}^{1} = 1.4429 \\ n = 2 ⟹ & β_{1}^{2} = 0.9102 & and & β_{2}^{2} = 0.9102 \\ n = 3 ⟹ & β_{1}^{3} = 0.7880 & and & β_{2}^{3} = 0.7880 \\ n = 4 ⟹ & β_{1}^{4} = 0.7808 & and & β_{2}^{4} = 0.7808 \end{matrix}

On the other hand,

β_{1} = β_{2} = \frac{1}{4} (\sqrt{17} - 1) ≃ 0.7807 .

Clearly, the approximations for η_{1} and η_{2} are

η_{1} = η_{2} ≃ 1.5616 .

Application 2. Suppose there is one machine producing three products, and let ε (t) be the machine state, which can take values in three regimes 1, 2, 3, i.e., for every t \in [0, \infty) we have ε (t) \in {1, 2, 3}. We consider ε (t) to be a continuous-time Markov chain with generator

(\begin{matrix} - 3 & 3 & 0 \\ 4 & - 7 & 3 \\ 0 & 4 & - 4 \end{matrix}),

and the inventory y_{i} (t) to be governed by (4) with σ_{1} = σ_{2} = σ_{3} = \frac{1}{\sqrt{3}}, and we let α_{1} = α_{2} = α_{3} = 1 be the discount rates. Under these assumptions, the system (17) becomes

(\begin{matrix} a_{11} + 1 & - a_{12} & 0 \\ - a_{21} & a_{22} + 1 & - a_{23} \\ 0 & - a_{32} & a_{33} + 1 \end{matrix}) (\begin{matrix} β_{1} \\ β_{2} \\ β_{3} \end{matrix}) = (\begin{matrix} 1 - β_{1}^{2} \\ 1 - β_{2}^{2} \\ 1 - β_{3}^{2} \end{matrix}),

or, with our data

\{\begin{matrix} β_{1}^{2} + 4 β_{1} - 3 β_{2} - 1 = 0 \\ β_{2}^{2} + 8 β_{2} - 4 β_{1} - 3 β_{3} - 1 = 0 \\ β_{3}^{2} + 5 β_{3} - 4 β_{2} - 1 = 0 \end{matrix}

which has a unique positive solution

β_{1} = β_{2} = β_{3} = \frac{1}{2} (\sqrt{5} - 1) .

On the other hand, the system (18) becomes

(\begin{matrix} β_{1} N σ_{1}^{2} \\ β_{2} N σ_{2}^{2} \\ β_{3} N σ_{3}^{2} \end{matrix}) = (\begin{matrix} a_{11} + 1 & - a_{12} & 0 \\ - a_{21} & a_{22} + 1 & - a_{23} \\ 0 & - a_{32} & a_{33} + 1 \end{matrix}) (\begin{matrix} η_{1} \\ η_{2} \\ η_{3} \end{matrix}),

or, with our data

(\begin{matrix} β_{1} \\ β_{2} \\ β_{3} \end{matrix}) = (\begin{matrix} 3 + 1 & - 3 & 0 \\ - 4 & 4 + 3 + 1 & - 3 \\ 0 & - 4 & 4 + 1 \end{matrix}) (\begin{matrix} η_{1} \\ η_{2} \\ η_{3} \end{matrix}),

from which, inverting the matrix,

(\begin{matrix} η_{1} \\ η_{2} \\ η_{3} \end{matrix}) = (\begin{matrix} \frac{7}{13} & \frac{15}{52} & \frac{9}{52} \\ \frac{5}{13} & \frac{5}{13} & \frac{3}{13} \\ \frac{4}{13} & \frac{4}{13} & \frac{5}{13} \end{matrix}) (\begin{matrix} β_{1} \\ β_{2} \\ β_{3} \end{matrix}),

we obtain the unique positive solution

η_{1} = η_{2} = η_{3} = \frac{1}{2} (\sqrt{5} - 1) .

Then

\begin{matrix} V ((x_{1}, x_{2}, x_{3}), 1) & = & V ((x_{1}, x_{2}, x_{3}), 2) = V ((x_{1}, x_{2}, x_{3}), 3) \\ = & - \frac{1}{2} (\sqrt{5} - 1) (x_{1}^{2} + x_{2}^{2} + x_{3}^{2} + 1) \end{matrix}

and furthermore, the production rate is

{\bar{p}}_{i} (x_{1}, x_{2}, x_{3}, j) = - \frac{1}{2} (\sqrt{5} - 1) x_{i}, for i \in {1, 2, 3}, j \in {1, 2, 3} .

We also point out that the numerical approximations for β_{1}, β_{2}, β_{3}, obtained with the Newton-Raphson method described above, are

\begin{matrix} n = 1 ⟹ & β_{1}^{1} = 0.8418 & β_{2}^{1} = 1.017 & β_{3}^{1} = 1.2789 \\ n = 2 ⟹ & β_{1}^{2} = 0.6575 & β_{2}^{2} = 0.6761 & β_{3}^{2} = 0.7066 \\ n = 3 ⟹ & β_{1}^{3} = 0.6192 & β_{2}^{3} = 0.6196 & β_{3}^{3} = 0.6202 \\ n = 4 ⟹ & β_{1}^{4} = 0.618 & β_{2}^{4} = 0.618 & β_{3}^{4} = 0.618 \end{matrix}

when β_{1}^{0} = 1, β_{2}^{0} = 2 and β_{3}^{0} = 3. Clearly, \frac{1}{2} (\sqrt{5} - 1) ≃ 0.618.
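The inverse matrix used for η in Application 2 can be verified in exact rational arithmetic. The sketch below multiplies G_{a, α} by the stated inverse and also computes the row sums of the inverse; the helper `matmul` is a hypothetical name introduced only for this check.

```python
from fractions import Fraction as F

# G_{a,alpha} for Application 2 and the inverse stated in the text.
G = [[F(4), F(-3), F(0)],
     [F(-4), F(8), F(-3)],
     [F(0), F(-4), F(5)]]
G_inv = [[F(7, 13), F(15, 52), F(9, 52)],
         [F(5, 13), F(5, 13), F(3, 13)],
         [F(4, 13), F(4, 13), F(5, 13)]]


def matmul(A, B):
    """Plain square-matrix product, exact for Fraction entries."""
    n = len(A)
    return [[sum(A[i][m] * B[m][j] for m in range(n)) for j in range(n)]
            for i in range(n)]


identity = [[F(int(i == j)) for j in range(3)] for i in range(3)]
product = matmul(G, G_inv)
row_sums = [sum(row) for row in G_inv]
print(product == identity, row_sums)
```

Because each row of the inverse sums to one, equal values β_{1} = β_{2} = β_{3} give η_{i} = β_{i}, in agreement with the closed form above.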

8. Final Remark and Conclusion

When the w_{i} are correlated with correlation ρ, the HJB system (10) becomes

- (\begin{matrix} \frac{σ_{1}^{2}}{2} Δ u_{1} \\ . . . \\ \frac{σ_{k}^{2}}{2} Δ u_{k} \end{matrix}) + G_{a, α} (\begin{matrix} u_{1} \\ . . . \\ u_{k} \end{matrix}) - \frac{ρ}{2} (\begin{matrix} σ_{1}^{2} \sum_{i \neq j} \frac{\partial^{2} u_{1}}{\partial x_{i} \partial x_{j}} \\ . . . \\ σ_{k}^{2} \sum_{i \neq j} \frac{\partial^{2} u_{k}}{\partial x_{i} \partial x_{j}} \end{matrix}) - (\begin{matrix} {|x|}^{2} \\ . . . \\ {|x|}^{2} \end{matrix}) = (\begin{matrix} inf_{p} {p \nabla u_{1} + {|p|}^{2}} \\ . . . \\ inf_{p} {p \nabla u_{k} + {|p|}^{2}} \end{matrix}),

which has the same solution as (10), because the mixed second derivatives of the quadratic solution (15) vanish (see [8] for details).

In summary, we have reduced the stochastic production planning problem with several switching regimes in the economy to a PDE system, and we have demonstrated that this system, which models the stochastic production problem, admits an exact solution.

References

[1] A. Bensoussan, S.P. Sethi, R. Vickson, N. Derzko, Stochastic production planning with production constraints, SIAM. J. Control. Optim., 22 (1984), 920-935.
[2] A. Cadenillas, P. Lakner, M. Pinedo, Optimal production management when demand depends on the business cycle, Operations Research, 61(4) (2013), 1046-1062.
[3] A. Capponi, J. E. Figueroa-López, Dynamic Portfolio Optimization with a Defaultable Security and Regime-Switching, Mathematical Finance, (2012), 207-249.
[4] E. C. Canepa, D.-P. Covei, T. A. Pirvu, Stochastic production planning with regime switching, Journal of Industrial & Management Optimization, 19 (2023), 1697-1713.
[5] D.-P. Covei, T.A. Pirvu, An elliptic partial differential equation and its application, Appl. Math. Lett., 101 (2020), 1-7.
[6] D.-P. Covei, T.A. Pirvu, An elliptic partial differential equations system and its applications, Carpathian J. Math., 37 (2021), 427-440.
[7] D.-P. Covei, An elliptic partial differential equation modeling the production planning problem, J. Appl. Anal. Comput., 11(2) (2021), 903-910.
[8] D.-P. Covei, On a parabolic partial differential equation and system modeling a production planning problem, Electronic Research Archive, 30(4) (2022), 1340-1353.
[9] J. Dong, A. Malikopoulos, S. M. Djouadi, T. Kuruganti, Application of Optimal Production Control theory for Home Energy Management in a Micro Grid, 2016 American Control Conference (ACC), (2016), 5014-5019.
[10] I. Ekeland, T.A. Pirvu, Investment and consumption without commitment, Mathematics and Financial Economics, 2 (2008), 57-86.
[11] R. Elliott, A.S. Hamada, Option Pricing Using A Regime Switching Stochastic Discount Factor, International Journal of Theoretical and Applied Finance, 17 (2014), 1-26.
[12] W.H. Fleming, S.P. Sethi, H.M. Soner, An Optimal Stochastic Production Planning Problem with Randomly Fluctuating Demand, SIAM. J. Control Optim., 25 (1987), 1494-1502.
[13] A. Gharbi, J.P. Kenne, Optimal production control problem in stochastic multiple-product multiple-machine manufacturing systems, IIE Transactions, 35 (2003), 941-952.
[14] M. K. Ghosh, A. Arapostathis, S. I. Marcus, Optimal Control of Switching Diffusions with Application to Flexible Manufacturing Systems, SIAM J. Control and Optimization, 31 (1992), 1183-1204.
[15] I. Gyori, F. Hartung and N. A. Mohamady, Existence and uniqueness of positive solutions of a system of nonlinear algebraic equations, Period. Math. Hung., 75 (2017), 114-127.
[16] I. Gyori, F. Hartung and N. A. Mohamady, Boundedness of positive solutions of a system of nonlinear delay differential equations, Discrete and Continuous Dynamical Systems - Series B, 23 (2018), 809-836.
[17] T. A. Pirvu, H. Zhang, Investment-consumption with regime-switching discount rates, Math. Social. Sci., 71 (2014), 142-150.
[18] Z. Qin, M. Bai, D. Ralescu, A fuzzy control system with application to production planning problems, Inform. Sci., 181 (2011), 1018-1027.
[19] S. P. Sethi, G.L. Thompson, Applied Optimal Control: Applications to Management Science, Nijhoff Boston, 1981.
[20] L. Sheng, Y. Zhu, K. Wang, Uncertain dynamical system-based decision making with application to production-inventory problems, Appl. Math. Model., 56 (2018), 275-288.
[21] G. L. Thompson, S.P. Sethi, Turnpike horizons for production planning, Management Sci., 26 (1980), 229-241.
[22] D.D. Yao, Q. Zhang, X.Y. Zhou, A regime-switching model for European options (In: Yan H, Yin G, Zhang Q (eds) Stochastic Processes, Optimization, and Control Theory: Applications in Financial Engineering, Queueing Networks, and Manufacturing Systems), International Series in Operations Research & Management Science. Springer, New York, 94 (Chapter 14) (2006), 281-300.
[23] C.F. Wang, H. Chang, Z.M. Fang, Optimal Portfolio and Consumption Rule with a CIR Model Under HARA Utility, J. Oper. Res. Soc. China, 6 (2018), 107-137.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
