1. Introduction
An important central limit theorem (CLT) for non-homogeneous Markov chains was established by R.L. Dobrushin in 1956, a result that has since attracted considerable attention. This paper contributes a further extension of this celebrated theorem, which differs from the one proposed in our recent preprint (arXiv:2406.16156). The initial work in this direction by A.A. Markov was subsequently advanced by Bernstein, Sapogov, and Linnik, culminating in Dobrushin’s seminal publication. Dobrushin ultimately showed that, under an additional non-degeneracy condition on the variances, the sufficient condition
is also very close to necessary. Here,
denotes the Dobrushin (or Markov-Dobrushin) ergodic coefficient. The optimality of the rate
was demonstrated by the Bernstein-Dobrushin example. While extensions exist for Markov chains with memory and the results have been simplified using martingale methods, this paper employs an adapted martingale approach to establish a CLT under new, slightly modified assumptions. Our motivation stems from the central role of the CLT in mathematical statistics.
The paper consists of 5 sections: Introduction, The setting, Main results, Auxiliary results, Proof of the theorem.
2. The Setting
Consider a non-homogeneous Markov chain defined on a probability space
. Let
denote the Markov transition kernel governing the evolution from time
i to
on a measurable space
, which is assumed to satisfy the standard conditions for a Markov process (cf. [
5]). The corresponding transition probabilities are defined by the relation:
where
is the trajectory of the process on
. In particular, if the initial distribution is
then the distribution at time
is given by the measure
.
Due to the nature of conditional expectations, certain equalities and inequalities hold only almost surely (a.s.).
For
let us denote by
the
k-step transition kernel, which has a form due to the Chapman-Kolmogorov equation,
Definition 1.
The Markov-Dobrushin (MD) coefficient1 is defined by any of the three equivalent expressions,
where .
Here
. It follows from the definition that
, and that
if and only if the measure
does not depend on
x, and in the latter case the second random variable does not depend on the first one. The measure
will be called non-degenerate of
. To denote integrals, standard notations will be used,
This semi-norm will be applied to random variables (see lemmata 13, 14), interpreted in the sense of the essential supremum norm.That is, for any random variable
X on
, we define:
Further, for any transition kernels
and two-step transition kernel
(the first transition is according to the matrix
, while the second – according to
), the following inequality holds true,
In what follows, and stand for the expectation and for the variance of the random variable Y, respectively.
We work in an array scheme: for each
, let
be a Markov chain on
with transition kernels
and initial distribution
. For simplicity, all processes
are defined on a common probability space
, though this assumption can be relaxed. Define the minimal ergodicity coefficient as:
Further, let
be real-valued functions on
. For any
let
The main result of Dobrushin’s paper, along with a key corollary, is presented in [
13] as follows (we state it as a proposition to distinguish it from our main result, which will be termed a theorem). Here, “⇒” denotes weak convergence.
Proposition .
Let . In this case, if
then the CLT holds true in the array scheme:
Corollary .
Under the assumptions of the theorem, if , and , and if
then weak convergence (5) holds.
3. Main Results
Let fix n and define
to be a non-random sequence of ones and zeros, like
, and let for
,
Then (
3) may be relaxed to
Definition .
We say that the condition is met if there exist and such that for any n the sequence satisfies the following property: for any pair with , the inequality
is satisfied.
In what follows such a sequence is fixed for each n. The next theorem is the main result of the paper.
Theorem .
Let the assumption be met, and let also>
Then, if condition
is satisfied, then and the CLT holds true,
Corollary .
and the conditions (8) along with
hold, then convergence (11) is valid.
4. Auxiliary Results
The following auxiliary results are collected in this section. Unless stated otherwise, they are reproduced without proof from [
7] and [
13]. However, proofs are provided for Lemmata 8, 12, and 14. Because, despite their simplicity, they involve new calculations that are crucial for proving the main result. The notation used is consistent with that in [
13]. In particular, the essential supremum norm of a random variable
Z is defined as:
Proposition .([7,10]) Let for any the process be a martingale with respect to a filtration , , and let . If
Or in other words
then the weak convergence takes place,
Following [
13], we assume without loss of generality that the functions
are centered:
Define
so that
Then, for any
, the representation holds,
Furthermore, for any
, this expression can be equivalently rewritten as
Consequently, the sum
can be expressed in martingale form as follows:
As emphasized in [
13], this transformation originates from the seminal paper [
6] on a "simple" CLT for Markov chains. Given that all terms on the right-hand side of this representation are uncorrelated, it follows that
Let
Then the process
is a martingale with respect to the filtration
for
. The central limit theorem will be established by approximating the normalized sum
by the terminal martingale value
and then applying a martingale CLT, following the approach in [
13] (see Proposition 7). It therefore suffices to verify the two conditions in (
15).
Further, by
we denote the product of the functions
and
. Notice that for any
,
n, and function
,
Lemma .
Under the conditions of the theorem, for any ,
and also for ,
Proof. Since
, then
. It follows from the definitions of the coefficients
and
, and from (
3) that
Since
, the first inequality in (
21) follows from the bounds
The second inequality in (
21) is established similarly. Indeed, we have due to (
20),
To prove the third inequality of the lemma, we estimate using (
21):
as required.
□
The following lemma provides a lower bound for the variance of the sum, a result that is crucial for proving the main theorem. We were unable to improve this bound using the coefficient
; consequently, we state it exactly as in [
13] and without proof. Nevertheless, even with the weaker coefficient
, this bound remains highly useful in the subsequent analysis.
Proposition ([
13], proposition 3.2).
Under the assumptions of the theorem 5, for any ,
To prove this proposition, the following auxiliary statements are necessary.
Lemma (Lemma 4.1, [
13]).
Let λ be a probability measure on with marginals α and β respectively. Let and be the corresponding transition probabilities in the two directions so that and . Let and be square integrable with respect to α and β respectively. If
Lemma (Lemma 4.2, [
13]).
Let and be square integrable with respect to α and β respectively. Then,
We now estimate the ratio
in the following lemma. Note that by the definition of the
norm in (
14), the value
is nonrandom; see the definition of
in (
16).
Lemma 12.
Under the assumptions of the theorem 5, the equality holds,
Proof. We have,
Therefore, because of the equality
and by virtue of (
21), we obtain,
So, it follows,
The last inequality is due to the formula for the geometric sum with the common ratio
. Continue sequance of inequalities:
And here we use inequality
. Finally, applying the bound from the proposition 9, we get,
Due to the condition (
9) this implies the inequality (
23), as required. □
The proof of the following lemma is given in [
13] and is therefore omitted here. Recall that the semi-norm Osc, when applied to a random variable, is defined as in (
2).
Lemma .
Let and , for , are, respectively, the sequence of non-negative random variables and sigma-fields such that . Suppose that
where . Suppose in addition that
Then
Further, denote random variables
These random variables are measurable with respect to the sigma-fields for , respectively.
Lemma .
Under the assumptions of the theorem 5 the convergence
holds true.
Note that in this paper, all
terms are deterministic. This is a consequence of our convention for the oscillation semi-norm applied to random variables, as defined in (
2).
Proof. By the martingale property and definitions (
17) and (
18), it follows that
(a.s.) for all
. Therefore,
By Lemma 12, the second term in (
24) is bounded above by
, which is
. Consequently, its oscillation is also
as
, uniformly in
. Recall that
is defined as (cf. (
16)):
We now estimate the oscillation of the first term in (
24).
Let us highlight that our definition of “Osc” for r.v. in (
2) allows us not to worry about the difference between sure and almost sure inequalities, despite conditional expectations in the expressions. We now apply Lemma 8:
In the remaining cases the estimates are analogous, with at most one or two additional factors of
, each bounded above by 1. Summing over all terms then yields the bound
uniformly with respect to
l. Combining this estimate with Proposition9 and condition
9 completes the proof of the lemma.
□
5. Proof of the Theorem 5
Proof. It follows from the lemma 12 that it suffices to show convergence
This property follows from the proposition 7, because it was already verified that
- (a)
;
- (b)
.
Here, part (a) follows from the lemma 12 (negligibility). To verify condition (b), we apply 13, 14(LLN). This is valid since (a) holds and the variance decomposition (after 13) combined with Lemma 12 gives .
□
Institutional Review Board Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
The work was supported by the grant of the Theoretical Physics and Mathematics Advancement Foundation “BASIS”. The author is also thankful to scientific advisor Alexander Veretennikov for valuable remarks.
Conflicts of Interest
No conflict of interests.
References
- S. N. Bernstein, Sur l’extension du théorème du calcul de probabilités aux sommes de quantités dépendantes. Math. Ann. 1926, 97, 1–59. [Google Scholar]
- Russian transl.: S.N. Bernstein, Extension of a limit theorem of probability theory on sums of dependent random variables. Russian Math. Rev., 1944, 10, 65–114.
- O.A. Butkovsky, A.Yu. O.A. Butkovsky, A.Yu. Veretennikov, On asymptotics for Vaserstein coupling of Markov chains, Stochastic Processes and their Applications, 2013, 123(9), 3518–3541. [CrossRef]
- R. L. Dobrushin, Central limit theorems for non-stationary Markov chains I,II. Theory Probab. its Appl. 1956, 1, 65–80. [Google Scholar] [CrossRef]
- E.B. Dynkin, Markov processes, vol. 1, Springer-Verlag, Berlin, Heidelberg, 1965. [CrossRef]
- M. I. Gordin, The central limit theorem for stationary processes. Soviet Math. Dokl. 1969, 10, 1174–1176. [Google Scholar]
- P. Hall, C.C. P. Hall, C.C. Heyde, Martingale Limit Theory and Its Application. Academic Press, New York, 1980. [CrossRef]
- Yu. V. Linnik, On the theory of nonuniform Markov chains. Izv. Akad. Nauk SSSR Ser. Mat. 1949, 13, 65–94. (in Russian). [Google Scholar]
- Yu. V. Linnik, N.A. Sapogov, Multiple integral and local laws for inhomogeneous Markov chains. Izv. Akad. Nauk SSSR Ser. Mat. 1949, 13, 533–566. [Google Scholar]
- R.Sh. Liptser, A.N. R.Sh. Liptser, A.N. Shiryayev, Theory of Martingales, Kluwer Acad. Publ., Dordrecht, The Netherlands, 1989.
- A. A. Markov, Investigation of the general case of trials associated into a chain (in Russian), Zapiski. Akad. Nauk (St. Petersburg) Fiz.-Matem. Otd., (8th Ser.) 25(3)(1910), 33 pp.; also in: Selected works, 1951, Moscow, Acad. Sci. USSR, 465—509 (in Russian); Eng. transl. in: O.B.Sheynin, ed., Probability and Statistics. Russian Papers, Selected and Translated by Oscar Sheynin. NG Verlag, Berlin, 2004. [Google Scholar]
- A. N. Shiryaev, Central limit theorem for non-homogeneous Markov chains with memory, Summary of Papers Presented at the Sessions of the Scientific Research Seminar on Probability Theory, (Moscow, February–May, 1957). Theory of Probability and its Applications 1957, 2, 470–480. [Google Scholar] [CrossRef]
- S. Sethuraman, S.R.S. Varadhan, A Martingale Proof of Dobrushin’s Theorem for Non-Homogeneous Markov Chains. Electron. J. Probab. 2005, 10, 1221–1235. [Google Scholar] [CrossRef]
- A. Yu. Veretennikov, M.A. Veretennikova, On improved bounds and conditions for the convergence of Markov chains. Izvestiya Mathematics 2022, 86, 92–125. [Google Scholar] [CrossRef]
| 1 |
In [ 13] it is called just the contraction coefficient. |
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).