1. Introduction and proposal of the paper
We consider a factory that produces $N$ types of economic goods and stores them in a designated inventory location. The model is described mathematically below.
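Let $(\Omega, \mathcal{F}, \{\mathcal{F}_t\}_{t \ge 0}, P)$ be a complete filtered probability space, where $P$ is the historical probability and the filtration $\{\mathcal{F}_t\}_{t \ge 0}$ is generated by an $\mathbb{R}^N$-valued Brownian motion with respect to the probability $P$.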
In the production planning problem, regime switching is captured by a continuous-time homogeneous Markov chain, adapted to the filtration, that can take k different values, modelling k regimes denoted by $1, 2, \dots, k$. The rate matrix of the Markov chain, which is a strongly irreducible generator, is denoted by $Q = [q_{ij}]_{k \times k}$, where $q_{ij} \ge 0$ for $i \ne j$ and the diagonal elements may be expressed as $q_{ii} = -\sum_{j \ne i} q_{ij}$.
In this case, if
, then
Moreover,
it is explicitly described by the integral form
where
is a martingale with respect to
. Here and hereafter, we follow the notation of earlier papers to preserve the applied character of the problem: the control variable represents the production rate at time t, adjusted for the demand rate.
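The regime process described above can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: the function names `make_generator` and `simulate_chain`, the two-regime rates, and the 0-based state labels are all assumptions made for the example. It builds a rate matrix whose diagonal entries equal minus the sum of the off-diagonal entries in the same row, then samples a path using exponential holding times.

```python
import numpy as np


def make_generator(off_diagonal_rates):
    """Return a rate matrix whose rows sum to zero.

    Off-diagonal entries are the given nonnegative jump rates; each diagonal
    entry is minus the sum of the other entries in its row.
    """
    Q = np.array(off_diagonal_rates, dtype=float)
    np.fill_diagonal(Q, 0.0)
    if (Q < 0).any():
        raise ValueError("jump rates must be nonnegative")
    np.fill_diagonal(Q, -Q.sum(axis=1))
    return Q


def simulate_chain(Q, start, horizon, rng):
    """Sample one path on [0, horizon]; return jump times and visited states."""
    times, states = [0.0], [start]
    t, state = 0.0, start
    while True:
        exit_rate = -Q[state, state]
        if exit_rate <= 0.0:  # absorbing state: no further jumps
            break
        t += rng.exponential(1.0 / exit_rate)  # exponential holding time
        if t >= horizon:
            break
        jump_probs = Q[state].copy()
        jump_probs[state] = 0.0
        jump_probs /= exit_rate  # distribution of the next state
        state = int(rng.choice(len(jump_probs), p=jump_probs))
        times.append(t)
        states.append(state)
    return np.array(times), np.array(states)


# Hypothetical two-regime rates (states are labelled 0 and 1 here).
Q = make_generator([[0.0, 2.0], [3.0, 0.0]])
times, states = simulate_chain(Q, start=0, horizon=10.0,
                               rng=np.random.default_rng(0))
```

With only two regimes every jump goes to the other state, so the sampled path alternates between 0 and 1; with more regimes the next state is drawn in proportion to the off-diagonal rates of the current row.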
These demand-adjusted inventory levels are modeled by the following system of stochastic differential equations
where
is an Itô process in
(i.e., the inventory level of good
i, at times
adjusted for demand),
is the deterministic part,
is a random regime-dependent constant (non-zero) diffusion coefficient taking on the values
,
, ... ,
and
is the initial condition (i.e., initial inventory level of goods
i).
The stochasticity here is due to demand adjustment, which is random and dependent on the regime. This is the most commonly used process when the demand is more volatile in some periods (e.g., some states of the Markov chain) and less volatile in other periods.
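To make the regime-dependent volatility concrete, the following sketch simulates inventory paths with an Euler-Maruyama scheme. It assumes, from the verbal description above, dynamics of the form "drift given by the demand-adjusted production rate plus a regime-dependent constant diffusion coefficient times a Brownian increment"; the function name `simulate_inventory`, the generator `Q`, the values in `sigma`, and the linear feedback rule are hypothetical choices for illustration only. Regimes are labelled from 0 in the code.

```python
import numpy as np


def simulate_inventory(y0, policy, sigma, Q, regime0, dt, n_steps, rng):
    """Euler-Maruyama path of dy = p dt + sigma[regime] dW with regime switching.

    Requires dt < 1 / max_i |Q[i, i]| so the one-step switching
    probabilities stay nonnegative.
    """
    Q = np.asarray(Q, dtype=float)
    y = np.array(y0, dtype=float)
    regime = regime0
    path = np.empty((n_steps + 1, y.size))
    regimes = np.empty(n_steps + 1, dtype=int)
    path[0], regimes[0] = y, regime
    for n in range(n_steps):
        dw = rng.normal(scale=np.sqrt(dt), size=y.size)  # Brownian increments
        y = y + policy(y, regime) * dt + sigma[regime] * dw
        # First-order approximation of the chain: P(i -> j) ~ Q[i, j] * dt.
        probs = Q[regime] * dt
        probs[regime] += 1.0
        regime = int(rng.choice(len(probs), p=probs))
        path[n + 1], regimes[n + 1] = y, regime
    return path, regimes


Q = np.array([[-2.0, 2.0], [3.0, -3.0]])  # hypothetical generator
sigma = [0.5, 1.5]                        # hypothetical calm / volatile regimes
feedback = lambda y, regime: -y           # hypothetical linear production rule
path, regimes = simulate_inventory([1.0, 2.0], feedback, sigma, Q, 0,
                                   1e-3, 5000, np.random.default_rng(1))
```

Plotting `path` against `regimes` shows wider fluctuations while the chain sits in the regime with the larger diffusion coefficient, which is the behaviour the model is meant to capture.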
The performance over time of a demand-adjusted production
is measured by means of its cost. At this point, we introduce the cost functional which yields the cost
which measures the quadratic loss.
We measure deviations from the demand, which gives rise to the loss. Here the psychological rate of time discount is a regime-dependent constant, taking one value in each regime, which gives rise to the exponential discounting.
We are now ready to state our objective, which is to minimize the cost functional, i.e.,
subject to the Itô equation (
4).
This model problem was proposed by Bensoussan, Sethi, Vickson and Derzko [
1] for an economy without regime switching and for a factory producing a single type of economic good. Many later authors have addressed regime switching.
In production management, Cadenillas, Lakner and Pinedo [
2] adapted the model problem in [
1] to study the optimal stochastic production planning problem of a company in an economy characterized by two-state regime switching with limited or unlimited information. Later, Dong, Malikopoulos, Djouadi and Kuruganti [
9] applied the model of [
2] in civil engineering, studying the optimal stochastic control problem for home energy systems with solar and energy storage devices when the demand is subject to Brownian motion; the two switching regimes are peak and off-peak energy demand.
Considerable attention has also been devoted to this subject by Pirvu and Zhang [
17], who studied the effect of high versus low discount rates on a consumption-investment decision problem.
After that, there have been numerous applications of regime switching in many important problems in economics, operations research, actuarial science, finance, reinsurance, and other fields, see the works of: Capponi and Figueroa-López [
3], Elliott and Hamada [
11], Gharbi and Kenne [
13], Yao, Zhang and Zhou [
22] and Wang, Chang and Fang [
23] for more details.
There are of course other research studies that may also serve to better explain the importance of regime switching in the real world.
In a precursor to this article, Covei and Pirvu [
5] formulated and analyzed the production planning problem in continuous time, with no regime switching in the economy, over an infinite time horizon. In [
7], the author improved the results of [
5] by giving the value function of the production model in closed form. Related works that deal with no regime switching in the economy are Sheng-Zhu-Wang [
20] and Qin-Bai-Ralescu [
18].
Recently, Canepa, Covei and Pirvu [
4] considered the production planning problem with regime switching in the economy over a finite time horizon, where the solution is obtained through numerical approaches. A closed-form expression for the corresponding problem over an infinite time horizon, on a particular state space consisting of two regimes, is available in [
6]. This leaves open a natural question, suggested by [
14]: can we obtain a closed-form solution when the state space consists of an arbitrary number of states? The present paper fills this gap in the literature by providing a closed-form solution to the stochastic production planning problem with regime switching in the economy over an infinite horizon in a general state space.
The technique presented in this paper is a methodological contribution of independent interest for a considerable number of other works on regime switching.
To conclude this introduction, our paper is structured as follows. In
Section 2 we give the relationship of our model with a system of partial differential equations (PDE system).
Section 3 presents a closed form solution and the uniqueness of solution for our production planning problem. A numerical approximation of the solution for the production planning problem is also given in
Section 4. In
Section 5 we present a verification result. In
Section 6 we introduce the equilibrium production rates as the subgame perfect production rates. They are the output of an interpersonal game between the present self and future selves. The equilibrium production rates are time consistent, meaning that there is no incentive to deviate from them. It turns out that in our setting the optimal production rates are among the equilibrium ones, so they are time consistent. In
Section 7, we give some applications. Finally, in
Section 8 we discuss our strategy.
Having presented the model, we now describe how we tackle it.
2. Reduction of the model to a PDE system
Our approach is based on the value function and dynamic programming, which leads to the Hamilton-Jacobi-Bellman (HJB) system of equations.
To characterize the value function, we apply the probabilistic approach. We search for functions
, ...,
such that the stochastic process
defined below
is a supermartingale for all
and a martingale for the optimal control
As shown in [
5], once this is achieved, the following transversality condition
together with some estimates on the value function yields
where
assumes values
Once such a function is found, it turns out that
with
is the value function. We search for
the functions in
, and the supermartingale/martingale requirement yields, by Itô's lemma for Markov-modulated diffusions, the HJB system of equations that characterizes the value function
where
For the transformation of the HJB system, it is essential to observe that
Thus, the HJB system (
10) can be written as a PDE system
To perform the verification, i.e., show that the HJB system gives the solution to the optimization problem, one should write (
12) with the following boundary condition
The value function will give us in turn the candidate optimal control. The first-order optimality conditions on the left-hand side of (
11) are sufficient for optimality since we deal with a quadratic (convex) function and they produce the candidate optimal control as follows:
and
The production rate is allowed to be negative. A negative production rate would correspond to a write-off or disposal of inventory (for example, due to obsolescence or perishability).
Our next goal is to determine the candidate optimal control in closed form.
3. Closed form solution for the PDE system
Despite its apparent simplicity, the PDE system (
12) with boundary conditions (
13) presents a host of mathematical difficulties arising from the presence of nonlinear gradient terms
, ...,
, see for details [
8].
The following result, which we prove below, is the main original contribution of the article.
Theorem 1. Assume that is a positive definite matrix with all elements of positive. Then, the PDE system (12) with boundary condition (13) has a unique radially symmetric convex positive classical solution with quadratic growth.
Proof of Theorem 1
In the following, we construct the function
which satisfies (
12) with boundary condition (
13). One way of solving this partial differential equation is to show that there exists
that solves (1).
The main task for the proof of existence of (
15) is performed by proving that there exists
such that
or equivalently, after grouping the terms
Now, we consider the system of equations
To solve (
16), we can rearrange those equations 1, ... ,
k such
The arguments in [
15,
16] say that the system (
17) has a unique positive solution. Next, letting
be the unique solution of (
17), we observe that the equations
, ... ,
of (
16) can be written equivalently as
from where using the fact that
has all elements positive, we see that there exist unique
,...,
that solve (
16) and then
solve (
12). This finishes the proof of
Theorem 1.
Because our solution depends on solving a nonlinear algebraic system of equations, the exact solution of the PDE system cannot, in general, be computed by software. To implement the solution of the PDE system (
12) in a software application, the next section gives a numerical approximation of the solution to (
16), again using the arguments in [
15,
16].
5. Verification
Next, we show that the control of (
14) obtained in our reduction strategy is indeed optimal. We apply the supermartingale and martingale approach.
Repeating the same argument in [
4], as the first step we can show that the stochastic process
defined below
is a supermartingale for all
and a martingale for the optimal control
Owing to the well-known Itô Lemma for Markov modulated diffusion (see [
22] for more on this) we have
for some martingale
, and
. Therefore
The supermartingale property then follows from the HJB equations (
10) and (
12), which imply that
is a martingale for the optimal control and a supermartingale otherwise. This last fact, combined with the transversality condition, yields the claim.
In the second step, let us establish the optimality of
. Considering the quadratic estimate on the value function
where
,
are the solution of (
16).
Let us provide a lower bound estimate for
so that the transversality condition (
8)
holds true. The SDE system (
4) in this case becomes
Using Itô’s Lemma, one gets
By taking expectations in the above equation, we get
Then, in the light of the above equation, we get
Therefore, one must choose
for the transversality condition to hold true and the proof is completed. Finally, a simple system of nonlinear equations (
16) remains to be solved.
7. Applications
We offer some applications, which are also inspired by the paper of Ghosh, Arapostathis and Marcus [
14].
Application 1. Suppose there is one machine producing two products and let
be the machine state, which can take values in two regimes, 1 = good and 2 = bad, i.e., for every
we have
. We consider
a continuous time Markov chain with generator
and the inventory
which is governed by the Itô system of stochastic differential equations (
4) with the diffusion
and let
be the discount factor. Under these assumptions, the system (
17) becomes
or, with our data
which has a unique positive solution
On the other hand the system (
18) becomes
or, with our data
which has a unique positive solution
Then
and furthermore, the production rate is
We also give approximations of
,
,
,
by the Newton-Raphson method. Denote
and
Using the standard computation, approximations to four digits are
Clearly, the approximations for
and
are
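The Newton-Raphson iteration used in this application can be sketched as follows. Because the systems (17) and (18) are not reproduced in this text, the example solves a hypothetical Riccati-type system, one quadratic equation per regime coupled through a generator, of the kind that arises when a quadratic ansatz is inserted into an HJB system. The arrays `alpha` and `Q`, the functions `residual` and `jacobian`, and the starting point are all assumptions for illustration; the numbers do not correspond to the results reported in the paper.

```python
import numpy as np


def newton_raphson(residual, jacobian, x0, tol=1e-12, max_iter=50):
    """Solve residual(x) = 0 by Newton-Raphson: x <- x - J(x)^{-1} F(x)."""
    x = np.array(x0, dtype=float)
    for _ in range(max_iter):
        step = np.linalg.solve(jacobian(x), residual(x))
        x = x - step
        if np.max(np.abs(step)) < tol:
            return x
    raise RuntimeError("Newton-Raphson did not converge")


# Hypothetical data: discount rates per regime and a two-regime generator.
alpha = np.array([0.1, 0.2])
Q = np.array([[-1.0, 1.0], [2.0, -2.0]])


def residual(beta):
    # Assumed system: beta_i^2 + alpha_i * beta_i - sum_j Q[i, j] * beta_j - 1 = 0
    return beta**2 + alpha * beta - Q @ beta - 1.0


def jacobian(beta):
    # Derivative of the assumed system with respect to beta.
    return np.diag(2.0 * beta + alpha) - Q


beta = newton_raphson(residual, jacobian, x0=np.ones(2))
```

Starting from the vector of ones is a design choice: for this assumed system the residual there is nonnegative, so the iterates approach the positive root from above. A different system would need its own residual, Jacobian, and starting point.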
Application 2. Suppose there is one machine producing three products and let
be the machine state, which can take values in three regimes 1, 2, 3, i.e., for every
we have
. We consider
a continuous time Markov chain with generator
and the inventory
which is governed by (
4) with
and let
be the discount factor. Under these assumptions, the system (
17) becomes
or, with our data
which has a unique positive solution
On the other hand, the system (
18) becomes
or, with our data
from where
has a unique positive solution
Then
and furthermore, the production rate is
We also point out that the numerical approximations for
,
,
, using the Newton-Raphson method described above, are
when
,
and
. Clearly
.