Informational Holonomy Curvature and Its Discrete–to–Continuous Convergence

David Gutierrez Ule

doi:10.20944/preprints202512.1887.v1

Submitted:

19 December 2025

Posted:

22 December 2025

You are already at the latest version

Abstract

We introduce an informational holonomy curvature associated with a state bundle over a Riemannian manifold and a family of channels acting on the fibres. In the continuous setting, we define an informational holonomy defect by transporting a reference state around small geodesic loops and measuring the deviation via an informational divergence, and we show that the resulting sectional-type curvature is determined by the curvature of the connection on the state bundle. On quasi-uniform sampling graphs endowed with discrete fibres, divergences and channels, we define a discrete informational holonomy curvature and prove a discrete-to-continuous convergence theorem under explicit sampling and consistency assumptions. In geometric Fisher-type models, the limit reduces, on spaces of constant curvature, to a constant multiple of the classical sectional curvature.

Keywords:

informational holonomy

;

holonomy curvature

;

sectional curvature

;

information geometry

;

Jensen–Shannon divergence

;

Ehresmann connection

;

state bundle

;

sampling graphs

;

discrete-tocontinuous convergence

Subject:

Physical Sciences - Particle and Field Physics

1. Introduction

Curvature is one of the central notions of Riemannian geometry and plays a distinguished rôle in both mathematics and physics. In its classical incarnation, sectional curvature measures the second-order deviation of geodesics in a two-dimensional direction, while scalar curvature is obtained by averaging over all such directions. From a more global perspective, curvature can also be understood in terms of holonomy: transporting a vector around a small closed loop and comparing the result with the original vector. In a flat space the transported vector returns to itself, whereas in a curved space a non-trivial discrepancy appears, and this discrepancy encodes the curvature of the underlying connection.

In parallel, information geometry has developed a rich differential-geometric framework on spaces of probability distributions and quantum states, where Riemannian metrics arise from statistical divergences such as the Kullback–Leibler or Jensen–Shannon divergences. The associated Fisher metrics and their variants have been used to define information-geometric analogues of geodesics, connections and curvature, with applications ranging from statistics and machine learning to quantum information theory; see, e.g., [1].

These two viewpoints—Riemannian curvature and information geometry—suggest a natural question:

Can curvature be reconstructed, or even defined, purely in terms of informational data and their transformations, without a priori access to a smooth Riemannian structure?

In the present paper we do not attempt to solve this reconstruction problem in full generality. Instead, we assume a smooth Riemannian manifold

(M, g)

and a state bundle with a compatible connection as part of the input, and show how their curvature can be encoded and approximated in purely informational terms.

This question has received substantial attention in the context of discrete and non-smooth geometries. In the setting of graphs and Markov chains, several notions of “discrete curvature” have been introduced, most notably the coarse Ricci curvature à la Ollivier [7], Forman’s combinatorial curvature [4], and Regge-type curvature concentrated on simplices in piecewise flat manifolds [8]. These constructions either derive curvature from a discrete metric structure or from combinatorial and measure-theoretic data. In all cases, an important theme is the consistency problem: under suitable sampling or refinement hypotheses, do these discrete curvatures converge to the classical Riemannian curvatures of a smooth limit space?

In previous work, the author considered the following situation: given a family of local informational states

{(ρ_{x})}_{x \in M}

attached to points of a Riemannian manifold

(M, g)

, one can endow a sampling graph with the Jensen–Shannon metric induced by the pairwise divergence of these states. A Regge-type scalar curvature estimator built from this metric was shown, under explicit metric and sampling assumptions, to converge in the weak-* sense to the scalar curvature

R_{g}

of g. This provides a scalar curvature estimator derived purely from a non-geodesic, informational metric.

The present paper aims at a refinement of a different nature. Instead of constructing curvature from distances alone, we introduce a new notion of curvature that is based on holonomy of informational channels. Concretely, we consider a discrete sampling of a manifold in which each vertex carries a space of informational states (classical or quantum), and each oriented edge is equipped with a channel transporting states between neighbouring vertices. By composing these channels along small loops we obtain a discrete notion of holonomy acting on the state space at a base point. We then measure how far this holonomy deviates from the identity, using an informational divergence such as the Jensen–Shannon divergence and the associated informational distance

d = \sqrt{2 D}

, and normalize by the geometric area of the loop. The resulting quantity is what we call the informational holonomy curvature.

Intuitively, one can think of this construction as follows. At each point we choose a reference state

μ_{x}

of the system. We then let this state travel along a small triangular loop

x \to y \to z \to x

by applying, in sequence, the channels assigned to each edge. If the underlying “informational geometry” were flat, we would expect the state to come back unchanged. Any systematic deviation between the initial state and the state obtained after completing the loop signals the presence of curvature. By comparing these two states with a suitable divergence (or equivalently, with the associated informational distance) and normalizing by the area of the loop, we obtain a discrete curvature associated with that loop. By averaging over families of loops that approximate a given two-dimensional direction, we obtain a quantity that plays the rôle of a sectional curvature in an informational setting.

From the Riemannian viewpoint, this is reminiscent of the classical description of sectional curvature via holonomy of the Levi–Civita connection in the tangent bundle. The novelty here is that the objects being transported are not tangent vectors but informational states, and the parallel transport is implemented by channels rather than by a linear connection on a vector bundle. Nevertheless, we shall show that, under appropriate assumptions, the resulting informational holonomy curvature can be expressed in terms of the curvature of a connection on a state bundle over

(M, g)

, and in geometric models induced by the Levi–Civita connection it reduces, in spaces of constant curvature, to a constant multiple of

| \sec_{g} |

.

Main Contributions

We now summarize the main contributions of this work.

We introduce a general framework in which to study informational holonomy on a discrete sampling of a Riemannian manifold. The basic data consist of:
- a sampling graph $G_{ε} = (V_{ε}, E_{ε})$ embedded in a manifold $(M, g)$ ,
- a state space $P_{x}$ attached to each vertex $x \in V_{ε}$ (for instance, a space of probability distributions or density matrices),
- and a family of channels $Φ_{x y} : P_{x} \to P_{y}$ associated with each oriented edge $x \to y \in E_{ε}$ .
Composition of these channels along loops in $G_{ε}$ gives rise to discrete holonomy operators on the fibres $P_{x}$ .
Given a divergence D on each state space $P_{x}$ (in particular, the Jensen–Shannon divergence), we define the informational holonomy defect of a loop $γ$ based at x as

$δ_{γ} (x) : = D (μ_{x}, H_{γ} (μ_{x})),$

where $H_{γ}$ is the holonomy operator obtained by composing channels along $γ$ , and $μ_{x}$ is a reference state at the base point x. We also consider the associated distance defect

$Δ_{γ} (x) : = d (μ_{x}, H_{γ} (μ_{x})), d (s, t) = \sqrt{2 D (s, t)} .$

For sufficiently small loops bounding an area $A_{γ}$ , we define the informational holonomy curvature of the loop by

$K_{hol} (γ) : = \frac{Δ_{γ} (x)}{A_{γ}} .$

By averaging over discrete loops that approximate a two-plane $Π \subset T_{x} M$ , we obtain an informational sectional curvature $K_{hol}^{(ε)} (x, Π)$ at scale $ε$ .
On the continuum side, we consider a smooth Riemannian manifold $(M, g)$ equipped with a smooth bundle of state spaces $P \to M$ and a connection-like structure which assigns, to nearby points $x, y \in M$ , channels $Φ_{x \to y} : P_{x} \to P_{y}$ compatible with g. Under natural regularity and compatibility conditions, we define a continuous informational holonomy curvature by transporting a reference state around infinitesimal loops tangent to a given two-plane $Π \subset T_{x} M$ and measuring, via a divergence, the infinitesimal deviation from the identity. Our first main result shows that this quantity can be written as

$K_{hol}^{cont} (x, Π) = ∥ W_{x} (Π; μ_{x}) ∥_{g^{P_{x}}},$

where $W_{x} (Π; μ_{x})$ depends linearly on the curvature of the connection on $P$ along $Π$ . In particular, when the connection is induced from the Levi–Civita connection via a linear isometric representation, $K_{hol}^{cont} (x, Π)$ is a scalar invariant of the restriction of $R_{x}^{g}$ to $Π$ , and in spaces of constant sectional curvature it is proportional to $| {sec}_{g} (x, Π) |$ .
Second, building on previous work on scalar curvature estimators derived from informational metrics, we show that the discrete informational sectional curvature $K_{hol}^{(ε)} (x, Π)$ converges, as $ε \to 0$ , to the continuous quantity $K_{hol}^{cont} (x, Π)$ above. More precisely, we obtain an estimate of the form

$| K_{hol}^{(ε)} (x, Π) - K_{hol}^{cont} (x, Π) | \leq C κ_{ε},$

where

$κ_{ε} : = r_{ε} + η_{ε} + q_{ε} + ρ_{ε},$

is the error scale appearing in Theorem 2 (see (10)). Here $r_{ε}$ is the sampling radius, $η_{ε}$ and $q_{ε}$ quantify directional anisotropy and area-approximation errors of the triangle families, and $ρ_{ε} = r_{ε}^{α - 1}$ encodes the channel-consistency error from Assumption 9(2). In particular, in spaces of constant curvature this yields convergence to a constant multiple of $| {sec}_{g} (x, Π) |$ .
Finally, we discuss several model constructions and examples. In particular, we consider classical Fisher-type models in which the state bundle is induced by a statistical model on M, and we construct channels by transporting distributions along short geodesic segments. On spaces of constant curvature, we show how the resulting informational holonomy curvature reproduces, up to a constant factor, the expected constant sectional curvature.

Together, these results provide what we view as a natural notion of curvature in a setting where the primary objects are informational states and channels rather than tangent vectors and linear connections. In contrast with purely metric-based discrete curvatures, the informational holonomy curvature fundamentally exploits the dynamical aspect of information transport.

Relation to Previous Work

This work lies at the interface between several active research directions.

First, in Riemannian geometry and general relativity, Regge calculus and its refinements provide discrete models of curvature in piecewise flat manifolds, where curvature is concentrated on codimension-2 simplices and can be described in terms of deficit angles and holonomy [8]. Second, in the study of graphs and Markov chains, discrete Ricci and scalar curvatures have been introduced using optimal transport, entropy convexity, and combinatorial Laplacians [4,7]. Third, information geometry provides natural Riemannian metrics and connections on spaces of probability measures and quantum states, together with associated notions of curvature [1].

Our construction is directly inspired by the Regge and holonomy viewpoints, but it is formulated in an intrinsically informational setting. Rather than starting from a discrete metric or Laplacian, we start from a network of informational channels and a divergence on each fibre. At the technical level, our convergence results build on discrete-to-continuum analysis of curvature estimators on sampling graphs, including previous work on scalar curvature estimators derived from informational metrics.

A simple model to keep in mind throughout the paper is provided by the classical Jensen–Shannon divergence on probability simplices (Section 7.1), which satisfies Assumption 1 and induces fibre metrics proportional to the Fisher information metric. This example shows that the abstract hypotheses on the fibre divergences are realised in a familiar information-geometric setting.

Organization of the Paper

The paper is organized as follows. In Section 2 we recall the necessary background on Riemannian geometry, information geometry, and holonomy, and we set up the continuous model of state bundles and channels. In Section 3 we introduce the discrete sampling framework, define discrete holonomy operators on state spaces, and state the main assumptions on sampling, metric approximation and channel regularity. Section 4 contains the precise definitions of the informational holonomy defect and informational holonomy curvature, both at the level of individual loops and in the averaged sectional form.

In Section 5 we formulate and prove our main continuous theorem relating informational holonomy curvature to the curvature of a connection on the state bundle. Section 6 is devoted to the discrete-to-continuum convergence theorem. Finally, Section 7 presents model constructions and examples, and Section 8 discusses possible extensions and applications.

2. Geometric and Informational Background

In this section we recall the basic geometric notions and fix the continuous framework in which the informational holonomy curvature will be defined. We start with standard Riemannian preliminaries, then introduce the state bundle and divergences, and finally describe a continuous notion of informational transport and holonomy.

2.1. Riemannian Preliminaries

Let

(M, g)

be a smooth, connected, oriented Riemannian manifold of dimension

n \geq 2

. We denote by

d_{g}

the associated geodesic distance, by ∇ the Levi–Civita connection of g, and by

R^{g}

the Riemann curvature tensor, taken with the convention

R^{g} (X, Y) Z : = \nabla_{X} \nabla_{Y} Z - \nabla_{Y} \nabla_{X} Z - \nabla_{[X, Y]} Z .

For

x \in M

and a 2-dimensional subspace

Π \subset T_{x} M

, the sectional curvature of g at

(x, Π)

is defined by

\sec_{g} (x, Π) : = \frac{g (R^{g} (u, v) v, u)}{{∥ u ∥}_{g}^{2} {∥ v ∥}_{g}^{2} - g {(u, v)}^{2}},

where

u, v \in T_{x} M

is any basis of

Π

with

u \land v \neq 0

. This definition is independent of the choice of basis.

We shall frequently use normal coordinate charts. For

x \in M

we denote by

{exp}_{x} : U_{x} \subset T_{x} M ⟶ M

the exponential map at x, defined on a maximal star-shaped neighbourhood

U_{x}

of 0 where

{exp}_{x}

is a diffeomorphism onto its image. There exists

r_{0} > 0

such that for every

x \in M

the open ball

B_{g} (x, r_{0})

is contained in the normal neighbourhood

{exp}_{x} (B (0, r_{0}))

and is geodesically convex: any two points

y, z \in B_{g} (x, r_{0})

are joined by a unique minimizing geodesic contained in

B_{g} (x, r_{0})

.

Given three points

x, y, z \in M

sufficiently close to each other and contained in such a convex normal neighbourhood, there is a unique geodesic triangle

Δ_{g} (x, y, z)

formed by the minimizing geodesic segments

[x y], [y z], [z x]

. We denote by

A_{g} (x, y, z)

the Riemannian area of

Δ_{g} (x, y, z)

and by

α_{x} (x, y, z)

,

α_{y} (x, y, z)

and

α_{z} (x, y, z)

the interior angles at

x, y, z

, respectively. The angle defect of the triangle is

{def}_{g} (x, y, z) : = α_{x} (x, y, z) + α_{y} (x, y, z) + α_{z} (x, y, z) - π .

The following classical fact relates angle defect and sectional curvature; see, for example, [2] [Chapter 6].

Lemma 1

(Angle defect and sectional curvature). Let

(M, g)

be a smooth Riemannian manifold. For each compact set

K \subset M

there exist constants

C_{K} > 0

and

r_{K} > 0

such that the following holds. For any

x \in K

and any geodesic triangle

Δ_{g} (x, y, z)

contained in

B_{g} (x, r_{K})

, let

Π \subset T_{x} M

be the plane spanned by the initial velocities of the geodesics from x to y and from x to z. Then

{def}_{g} (x, y, z) = \sec_{g} (x, Π) A_{g} (x, y, z) + O (ℓ^{3}),

(1)

where

ℓ : = max {d_{g} (x, y), d_{g} (y, z), d_{g} (z, x)}

and the implicit constant in the

O (\cdot)

term is bounded by

C_{K}

.

In particular, for sequences of triangles shrinking to x with diameters of order

ℓ \to 0

and areas of order

A_{g} \sim ℓ^{2}

, the ratio

{def}_{g} (x, y, z) / A_{g} (x, y, z)

converges to

\sec_{g} (x, Π)

.

Holonomy gives an alternative description of sectional curvature. Let

{Hol}_{x} (\nabla) \subset O (T_{x} M, g_{x})

denote the holonomy group of the Levi–Civita connection at x. For any piecewise smooth closed loop

γ : [0, 1] \to M

based at x there is an associated parallel transport operator

P_{γ}^{g} : T_{x} M \to T_{x} M,

obtained by solving the parallel transport equation along

γ

. If

γ

bounds a small geodesic triangle tangent to a plane

Π \subset T_{x} M

, then one has the expansion

P_{γ}^{g} = {Id}_{T_{x} M} + A_{γ} R_{x}^{g} (Π) + O (A_{γ}^{3 / 2}),

(2)

where

A_{γ}

is the area of the triangle, and

R_{x}^{g} (Π)

is a linear map depending linearly on the curvature tensor

R_{x}^{g}

restricted to

Π

; see, e.g., [5] [Chapter II]. We do not need the precise form of

R_{x}^{g} (Π)

; it will be enough to assume the existence of expansions of the form (2) in the informational setting below.

2.2. State Spaces and Informational Divergences

We now recall basic notions from information geometry. For the purposes of this work it is convenient to formulate the discussion at the level of a general smooth state space, although throughout we keep classical probability distributions as a canonical example.

Definition 1

(State space). A state space is a smooth manifold S whose points represent informational states of a system. Typical examples include:

(i): An open subset of the simplex of strictly positive probability vectors on a finite set $Ω = {1, \dots, m}$ :

$S \subset \{p \in R^{m} : p_{i} > 0, \sum_{i = 1}^{m} p_{i} = 1\} .$
(ii): An open submanifold of the space of faithful density matrices on a finite-dimensional Hilbert space (quantum states).

We endow S with two pieces of structure:

a smooth Riemannian metric $g^{S}$ on S;
an informational divergence

$D : S \times S \to [0, \infty),$

which is differentiable of sufficiently high order and vanishes on the diagonal.

The divergence D is not assumed to be a distance (it may fail to be symmetric and may not satisfy the triangle inequality), but we require that it induces the Riemannian metric

g^{S}

in the usual way.

Assumption 1

(Divergence and information metric). We assume that D is of class

C^{3}

in a neighbourhood of the diagonal

{(s, s) : s \in S}

and satisfies:

1.: $D (s, s) = 0$ for all $s \in S$ ;
2.: $\partial_{2} D (s, s) = 0$ for all $s \in S$ (vanishing first derivative in the second argument along the diagonal);
3.: the Hessian of D in the second argument at the diagonal recovers the Riemannian metric $g^{S}$ , i.e.

${Hess}_{2} D (s, s) [w, w] = g_{s}^{S} (w, w), \forall s \in S, \forall w \in T_{s} S .$

The last condition means that for t close to s one has the second-order expansion

D (s, t) = \frac{1}{2} g_{s}^{S} (v, v) + O ({∥ v ∥}_{g^{S}}^{3}),

(3)

where

v \in T_{s} S

is any tangent vector such that

{exp}_{s}^{S} (v) = t

in a normal coordinate chart on S.

Remark 1

(Jensen–Shannon divergence). A central example in this work is the Jensen–Shannon divergence on the simplex of probability vectors on a finite set; see [6]. Let

Ω = {1, \dots, m}

and let

Δ^{\circ} (Ω) : = \{p \in R^{m} : p_{i} > 0, \sum_{i = 1}^{m} p_{i} = 1\}

denote the open probability simplex. For

p, q \in Δ^{\circ} (Ω)

, the Jensen–Shannon divergence is defined by

D_{JS} (p, q) : = H (\frac{p + q}{2}) - \frac{1}{2} H (p) - \frac{1}{2} H (q),

where

H (p) = - \sum_{i = 1}^{m} p_{i} log p_{i}

is the Shannon entropy (with a fixed choice of logarithm). It is well known that

D_{JS}

is symmetric and non-negative and that

\sqrt{D_{JS} (p, q)}

defines a genuine metric on

Δ^{\circ} (Ω)

[3]. In particular, with the normalization used throughout this paper,

d_{JS} (p, q) : = \sqrt{2 D_{JS} (p, q)} = \sqrt{2} \sqrt{D_{JS} (p, q)}

is also a genuine metric on

Δ^{\circ} (Ω)

.

Moreover, the second-order expansion of

D_{JS}

around the diagonal induces a multiple of the Fisher information metric on

Δ^{\circ} (Ω)

. In other words, there exists a constant

c_{JS} > 0

such that, for p fixed and q close to p,

D_{JS} (p, q) = \frac{c_{JS}}{2} g_{p}^{Fisher} (v, v) + O ({∥ v ∥}_{g^{Fisher}}^{3}),

where

v \in T_{p} Δ^{\circ} (Ω)

is as in (3). Thus Assumption 1 holds with

g^{S}

proportional to the Fisher metric.

Assumption 1 ensures that the divergence D and the metric

g^{S}

are compatible in the sense of information geometry: D is a “potential” whose Hessian yields the local quadratic structure. The precise constant relating D to

g^{S}

will play no essential rôle; it will simply be absorbed into the constant c appearing in the main theorems.

2.3. State Bundles over a Riemannian Manifold

We now couple the Riemannian manifold

(M, g)

with the state space S.

Definition 2

(State bundle). A state bundle over

(M, g)

is a smooth fibre bundle

π : P \to M

with typical fibre S, together with:

for each $x \in M$ , a smooth identification of the fibre $P_{x} : = π^{- 1} (x)$ with S;
a smooth family of fibrewise Riemannian metrics ${(g^{P_{x}})}_{x \in M}$ , where each $g^{P_{x}}$ is a Riemannian metric on $P_{x}$ obtained from a copy of $g^{S}$ under the identification $P_{x} ≃ S$ ;
a smooth family of divergences ${(D_{x})}_{x \in M}$ , where each

$D_{x} : P_{x} \times P_{x} \to [0, \infty)$

satisfies Assumption 1 with respect to $g^{P_{x}}$ .

For notational simplicity, we often suppress the explicit identifications and denote the metric on

P_{x}

by

g_{x}^{S}

and the divergence by

D_{x}

, keeping in mind that they vary smoothly with x.

Remark 2.

In the simplest classical setting, one may take

P = M \times S

to be the trivial bundle with fibre

S = Δ^{\circ} (Ω)

, endowed with the product smooth structure. Then each fibre

P_{x}

is naturally identified with S, and one can set

g^{P_{x}} = g^{S}

and

D_{x} = D

independently of x. The non-trivial bundle case is more appropriate, for instance, in quantum settings or in statistical models where the parameterization of states varies with x.

The state bundle

π : P \to M

provides the configuration space for informational states over M. To speak about transport and holonomy of states, we need a notion of connection on

P

.

2.4. Connections and Continuous Informational Transport

The key geometric input in the continuous theory is a connection producing parallel transport on the state bundle. In order to place the holonomy expansion used later on a fully standard footing, we henceforth restrict the continuous framework to the associated-bundle setting of principal connections.

Assumption 2

(Associated-bundle model and regularity). There exist:

(i): A Lie group G acting smoothly on the typical fibre S by Riemannian isometries of $(S, g^{S})$ . We denote the action by $(g, s) \mapsto g \cdot s$ .
(ii): A $C^{3}$ principal G-bundle $π_{Q} : Q \to M$ endowed with a $C^{2}$ principal connection $ω \in Ω^{1} (Q; g)$ , with curvature $F_{ω} \in Ω^{2} (Q; g)$ .
(iii): The state bundle is the associated bundle

$π : P : = Q \times_{G} S ⟶ M,$

and the connection on $P$ is the Ehresmann connection induced by ω.

Moreover, all geometric structures are understood on compact subsets of M, so that (after choosing local trivializations) the coefficients of ω,

F_{ω}

and their first derivatives are uniformly bounded on compact sets.

Under Assumption 2, the principal connection

ω

induces an Ehresmann connection on

π : P \to M

by declaring a vector in

T_{[q, s]} P

to be horizontal if it is represented by a pair

(\dot{q}, \dot{s})

with

\dot{q}

horizontal in

Q

(i.e.

ω (\dot{q}) = 0

) and

\dot{s} = 0

in S. This yields a smooth horizontal distribution

T P = H \oplus V

and, therefore, parallel transport along curves in M.

Definition 3

(Parallel transport in the associated state bundle). Let

γ : [0, 1] \to M

be a piecewise

C^{1}

curve with

γ (0) = x

and

γ (1) = y

. Let

q_{0} \in Q_{x}

and denote by

q (t)

the ω-horizontal lift of γ with

q (0) = q_{0}

. There exists a unique element

g_{γ} (q_{0}) \in G

such that

q (1) = q_{0} \cdot g_{γ} (q_{0})

. Then the induced parallel transport on

P

is the map

{PT}_{γ} : P_{x} \to P_{y}, {PT}_{γ} ([q_{0}, s]) : = [q (1), s] = [q_{0}, g_{γ} (q_{0}) \cdot s] .

Parallel transport satisfies the functorial properties

{PT}_{γ_{2} ★ γ_{1}} = {PT}_{γ_{2}} \circ {PT}_{γ_{1}}, {PT}_{γ^{- 1}} = {PT}_{γ}^{- 1}, {PT}_{constant} = Id,

(4)

whenever the concatenations are defined. In particular, any piecewise smooth closed loop

γ

based at x yields a holonomy map

{Hol}_{γ} : = {PT}_{γ} : P_{x} \to P_{x}

.

Lemma 2

(Parallel transport is a fibrewise Riemannian isometry). Assume the associated-bundle setting of Assumption 2, and that the G-action on

(S, g^{S})

is by Riemannian isometries. Then for every piecewise

C^{1}

curve

γ : [0, 1] \to M

with

γ (0) = x

,

γ (1) = y

, the induced parallel transport map

{PT}_{γ} : P_{x} \to P_{y}

is a Riemannian isometry between the fibres:

d_{y}^{R} ({PT}_{γ} (s), {PT}_{γ} (t)) = d_{x}^{R} (s, t) \forall s, t \in P_{x} .

In particular,

{PT}_{γ}

is locally distance-preserving and globally distance-preserving on each fibre.

Proof.

Fix

q_{0} \in Q_{x}

and let

q (t)

be the

ω

-horizontal lift of

γ

with

q (0) = q_{0}

. By Definition 3, there is

g_{γ} \in G

such that

{PT}_{γ} ([q_{0}, s]) = [q_{0}, g_{γ} \cdot s]

. Since

g_{γ}

acts by a Riemannian isometry of

(S, g^{S})

, it preserves the Riemannian distance on S, and hence

{PT}_{γ}

preserves the Riemannian distance on the fibres

P_{x} ≃ S

and

P_{y} ≃ S

. □

Assumption 3

(Compatibility of divergences with transport). For each compact set

K \subset M

there exist constants

r_{K} > 0

and

C_{K} > 0

such that the following holds. For every piecewise

C^{1}

curve

γ : [0, 1] \to K

with

γ (0) = x

and

γ (1) = y

, and every

s, t \in P_{x}

satisfying

d_{x}^{R} (s, t) \leq r_{K}

, one has

| D_{y} ({PT}_{γ} (s), {PT}_{γ} (t)) - D_{x} (s, t) | \leq C_{K} d_{x}^{R} {(s, t)}^{3},

(5)

where

d_{x}^{R}

denotes the Riemannian distance in the fibre

(P_{x}, g^{P_{x}})

.

Remark 3.

If the fibre divergence is induced by a single divergence D on S which is G-invariant (i.e.

D (g \cdot s, g \cdot t) = D (s, t)

for all

g \in G

), then (5) holds with equality (hence

C_{K} = 0

). In general we treat (5) as a modelling axiom controlling how the chosen divergences vary under parallel transport, with constants uniform on compact sets.

2.5. Informational holonomy in the continuous setting

In the continuous setting, holonomy of the state bundle is defined exactly as in the Riemannian case: for a piecewise smooth closed loop

γ : [0, 1] \to M

based at x we have a holonomy map

{Hol}_{γ} : P_{x} \to P_{x} .

We interpret

{Hol}_{γ}

as an informational channel acting on the state space at x: starting from a state

s \in P_{x}

, one transports s along

γ

using the connection and compares the resulting state

{Hol}_{γ} (s)

with the original one.

To quantify the deviation from trivial holonomy, we introduce reference states and informational defects.

Definition 4

(Reference states). A reference state field is a smooth section

μ : M \to P

of the state bundle, i.e.

π \circ μ = {id}_{M}

. For each

x \in M

, the point

μ_{x} : = μ (x) \in P_{x}

will serve as the base state with respect to which informational changes are measured.

Definition 5

(Continuous informational holonomy defect). Let μ be a reference state field and let γ be a piecewise smooth closed loop in M based at x. The informational holonomy defect of γ at x is

δ_{γ} (x) : = D_{x} (μ_{x}, {Hol}_{γ} (μ_{x})),

where

D_{x}

is the divergence on the fibre

P_{x}

.

We also consider the associated (local) informational distance defect:

Δ_{γ} (x) : = d_{x} (μ_{x}, {Hol}_{γ} (μ_{x})), d_{x} (s, t) = \sqrt{2 D_{x} (s, t)} .

In this continuous framework we will be interested in loops which are the boundaries of small geodesic triangles. Let

x \in M

and let

Π \subset T_{x} M

be a two-dimensional subspace. For

u, v \in Π

sufficiently small, we consider the geodesic triangle with vertices

x, y = {exp}_{x} (u), z = {exp}_{x} (v),

and denote by

γ_{x, u, v}

the corresponding closed loop obtained by traversing the geodesic segments

x \to y \to z \to x

in order. Its Riemannian area is

A_{g} (x, y, z)

.

Definition 6

(Continuous informational holonomy curvature). Let

(M, g, P, μ)

be as above. For

x \in M

and a two-dimensional subspace

Π \subset T_{x} M

, consider geodesic triangles based at x and tangent to Π with area

A \to 0

. We say that thecontinuous informational holonomy curvatureat

(x, Π)

exists if the limit

K_{hol}^{cont} (x, Π) : = lim_{A \to 0} \frac{Δ_{γ_{x, u, v}} (x)}{A_{g} (x, y, z)}

(6)

exists and is independent of the particular way in which the triangle shrinks to x within Π.

In later sections we will show, under appropriate assumptions, that this limit exists and is proportional to

\sec_{g} (x, Π)

, with a constant factor depending on the chosen connection, divergence, and reference state field.

Remark 4.

The definition above is closely analogous to the classical definition of sectional curvature via holonomy of the Levi–Civita connection, with the crucial difference that we compare states in a non-linear state space using an informational divergence/distance, rather than tangent vectors using a linear norm. Assumptions 1 and 3 ensure that, to second order, the informational defect behaves quadratically in the infinitesimal holonomy, while the associated distance defect scales linearly in the holonomy displacement, hence is the natural object to normalize by area.

The continuous framework developed in this section provides the conceptual target for the discrete constructions that follow. In the next section we introduce sampling graphs, discrete channels and discrete holonomy operators, which will serve as the basis for our definition of discrete informational holonomy curvature.

3. Discrete Sampling, Channels and Holonomy

In this section we introduce the discrete framework that will serve as the basis for our definition of informational holonomy curvature. We consider sampling graphs embedded in a Riemannian manifold, discrete state spaces attached to vertices, channels attached to edges, and discrete holonomy operators obtained by composing channels along loops. The assumptions formulated here are discrete counterparts of the continuous structures described in Section 2.

3.1. Sampling Graphs on a Riemannian Manifold

Let

(M, g)

be a smooth Riemannian manifold of dimension

n \geq 2

. For each small parameter

ε > 0

we are given a finite subset

V_{ε} = {x_{i}}_{i \in I_{ε}} \subset M

of sampling points and a simple undirected graph

G_{ε} = (V_{ε}, E_{ε}),

where

E_{ε} \subset {{x_{i}, x_{j}} : i \neq j}

is the set of edges. We denote by

x_{i} \sim x_{j}

the fact that

{x_{i}, x_{j}} \in E_{ε}

.

The vertex sets

V_{ε}

are assumed to be asymptotically dense and quasi-uniform in M, in the following sense.

Assumption 4

(Quasi-uniform sampling). There exist positive constants

c_{1}, c_{2}

and a sequence

r_{ε} \to 0

as

ε \to 0

such that, for all sufficiently small ε:

1.: (Separation) For all distinct $x_{i}, x_{j} \in V_{ε}$ one has

$d_{g} (x_{i}, x_{j}) \geq c_{1} r_{ε} .$
2.: (Covering) For every $x \in M$ there exists $x_{i} \in V_{ε}$ such that

$d_{g} (x, x_{i}) \leq c_{2} r_{ε} .$

Thus the sampling points form a Delone set at scale

r_{ε}

. We shall refer to

r_{ε}

as the sampling radius.

The edge set

E_{ε}

is assumed to connect points that are at distance of order

r_{ε}

.

Assumption 5

(Local connectivity). There exists a constant

c_{3} > 0

such that, for all sufficiently small ε, one has:

1.: If $d_{g} (x_{i}, x_{j}) \leq c_{3} r_{ε}$ , then ${x_{i}, x_{j}} \in E_{ε}$ .
2.: The degrees of the graph $G_{ε}$ are uniformly bounded: there exists $D < \infty$ such that every vertex $x_{i} \in V_{ε}$ satisfies

$deg (x_{i}) \leq D .$

Under Assumptions 4 and 5, each connected component of

G_{ε}

has uniformly bounded local complexity and provides a reasonable discrete approximation of

(M, g)

at scale

r_{ε}

. In particular, for

r_{ε}

sufficiently small, any point

x \in M

and any direction

ξ \in T_{x} M

admit neighbours of x whose geodesic directions approximate

ξ

up to an error of order

r_{ε}

.

In order to average quantities defined at vertices, we will occasionally associate a volume weight

w_{i}^{(ε)}

to each vertex

x_{i} \in V_{ε}

.

Assumption 6

(Volume approximation). For each ε there exist positive weights

w_{i}^{(ε)} > 0

,

x_{i} \in V_{ε}

, such that:

1.: There exists $C > 0$ independent of ε with

$C^{- 1} r_{ε}^{n} \leq w_{i}^{(ε)} \leq C r_{ε}^{n} for all x_{i} \in V_{ε} .$
2.: For every $f \in C_{c} (M)$ one has the quadrature convergence

$| \sum_{x_{i} \in V_{ε}} f (x_{i}) w_{i}^{(ε)} - \int_{M} f (x) {dvol}_{g} (x) | ⟶ 0 as ε \to 0 .$

Assumption 6 is satisfied, for instance, if

{w_{i}^{(ε)}}

are defined as the Riemannian volumes of a Voronoi tessellation associated with

V_{ε}

.

3.2. Discrete State Spaces and Divergences

We now discretize the state bundle

π : P \to M

introduced in Section 2.3. For each sampling point

x_{i} \in V_{ε}

we consider a fibre

P_{i}^{(ε)}

representing the possible states at

x_{i}

. In the simplest setting one may take

P_{i}^{(ε)} = P_{x_{i}}

to be the fibre of the continuous state bundle at

x_{i}

, but we keep the notation

P_{i}^{(ε)}

to emphasize the discrete nature of the sampling.

Assumption 7

(Discrete fibres and divergences). For each

ε > 0

and each

x_{i} \in V_{ε}

we are given:

1.: a smooth manifold $P_{i}^{(ε)}$ and a smooth identification

$ι_{i}^{(ε)} : P_{i}^{(ε)} \to P_{x_{i}}$

with the continuous fibre at $x_{i}$ ;
2.: a Riemannian metric $g^{P_{i}^{(ε)}}$ on $P_{i}^{(ε)}$ obtained as the pullback of the metric $g^{P_{x_{i}}}$ under $ι_{i}^{(ε)}$ ;
3.: an informational divergence

$D_{i}^{(ε)} : P_{i}^{(ε)} \times P_{i}^{(ε)} \to [0, \infty)$

such that, after transporting it to $P_{x_{i}}$ via $ι_{i}^{(ε)}$ , Assumption 1 holds uniformly in $x_{i}$ and ε.

For notational simplicity, we will usually drop the superscript

(ε)

and write

P_{i}

,

g^{P_{i}}

and

D_{i}

, keeping in mind the dependence on

ε

through the underlying sampling set

V_{ε}

.

We also sample the continuous reference state field

μ : M \to P

.

Assumption 8

(Discrete reference states). For each

x_{i} \in V_{ε}

we are given a reference state

μ_{i}^{(ε)} \in P_{i}^{(ε)}

such that, under the identification

ι_{i}^{(ε)}

, one has

ι_{i}^{(ε)} (μ_{i}^{(ε)}) = μ_{x_{i}} .

Thus the discrete fibres, metrics, divergences and reference states are obtained by restriction of the continuous state bundle to the sampling set

V_{ε}

.

3.3. Discrete Channels and Local Consistency

We now assign discrete informational channels to the edges of the sampling graph

G_{ε}

. For each oriented edge

(x_{i}, x_{j})

with

{x_{i}, x_{j}} \in E_{ε}

we are given a channel

Φ_{i j}^{(ε)} : P_{i}^{(ε)} \to P_{j}^{(ε)} .

Again, we often drop the superscript

(ε)

when no confusion arises.

The channels

Φ_{i j}

are required to be local and to approximate the parallel transport maps in the continuous state bundle. Let

γ_{i j}

denote the unique minimizing geodesic segment in

(M, g)

joining

x_{i}

to

x_{j}

, which lies in a convex normal neighbourhood when

d_{g} (x_{i}, x_{j})

is sufficiently small. By Assumption 5 we may (and do) assume that each edge length

d_{g} (x_{i}, x_{j})

is bounded by a constant multiple of

r_{ε}

.

Assumption 9

(Locality and consistency of channels). There exist constants

C > 0

and

α > 1

such that, for all sufficiently small ε and all oriented edges

(x_{i}, x_{j})

:

1.: (Locality) The channel $Φ_{i j}$ depends only on the geometry of $(M, g, P)$ in a neighbourhood of the geodesic segment $γ_{i j}$ and satisfies

$D_{j} (Φ_{i j} (s), Φ_{i j} (t)) \leq C D_{i} (s, t)$

for all $s, t \in P_{i}$ , i.e. $Φ_{i j}$ is (locally) Lipschitz with respect to the divergences.
2.: (Consistency with continuous parallel transport) Let ${PT}_{γ_{i j}} : P_{x_{i}} \to P_{x_{j}}$ be the continuous parallel transport map along the geodesic segment $γ_{i j}$ joining $x_{i}$ to $x_{j}$ . There exist constants $C > 0$ and $α > 1$ such that, for all $s \in P_{i}$ ,

$d_{j}^{R} (Φ_{i j} (s), {PT}_{γ_{i j}} (s)) \leq C d_{g} {(x_{i}, x_{j})}^{1 + α} .$

The exponent

α > 1

ensures that the error in approximating continuous parallel transport along an edge is of order strictly higher than the edge length, which will imply that the error in approximating holonomy around small loops is of order strictly higher than the area of the loop.

Remark 5.

In many concrete models one can take

α = 1

or

α = 2

, depending on how the channels are constructed from the underlying connection on

P

. For the purposes of the convergence theorems, any

α > 1

suffices.

3.4. Discrete Holonomy Operators

Given the channels on edges, we can define discrete holonomy operators by composition along loops in the graph

G_{ε}

.

Definition 7

(Discrete paths and loops). A discrete path in

G_{ε}

of length

m \geq 1

is a sequence of vertices

γ = (x_{i_{0}}, x_{i_{1}}, \dots, x_{i_{m}})

such that

{x_{i_{k - 1}}, x_{i_{k}}} \in E_{ε}

for all

k = 1, \dots, m

. The path is closed or a loop if

x_{i_{m}} = x_{i_{0}}

.

We denote by

{Paths}_{ε}

the set of all discrete paths and by

{Loops}_{ε} (x_{i})

the set of all loops based at a given vertex

x_{i}

.

To each oriented edge

(x_{i_{k - 1}}, x_{i_{k}})

along a path we associate the channel

Φ_{i_{k - 1} i_{k}}

. The channel associated with a path is the composition of the edge channels in order.

Definition 8

(Discrete transport and holonomy). Let

γ = (x_{i_{0}}, \dots, x_{i_{m}})

be a discrete path in

G_{ε}

. The discrete transport along γ is the map

T_{γ}^{(ε)} : = Φ_{i_{m - 1} i_{m}} \circ \dots \circ Φ_{i_{1} i_{2}} \circ Φ_{i_{0} i_{1}} .

If γ is a loop based at

x_{i_{0}}

, i.e.

x_{i_{m}} = x_{i_{0}}

, we call

T_{γ}^{(ε)} : P_{i_{0}} \to P_{i_{0}}

the discrete holonomy operator of γ and denote it by

{Hol}_{γ}^{(ε)} : = T_{γ}^{(ε)} .

The discrete transport operators satisfy the obvious composition rules: if

γ_{1} = (x_{i_{0}}, \dots, x_{i_{m}})

and

γ_{2} = (x_{i_{m}}, \dots, x_{i_{m + ℓ}})

are two paths with matching endpoint and starting point, then

T_{γ_{2} ★ γ_{1}}^{(ε)} = T_{γ_{2}}^{(ε)} \circ T_{γ_{1}}^{(ε)} .

If

\bar{γ} = (x_{i_{m}}, \dots, x_{i_{0}})

denotes the reversed path, then

T_{\bar{γ}}^{(ε)} \approx {(T_{γ}^{(ε)})}^{- 1},

with the approximation becoming exact if the channels satisfy an exact involutive property. In our setting we will only need approximate inversion properties at the infinitesimal level, which follow from Assumption 9.

In the sequel we will focus on loops associated with small discrete triangles.

Definition 9

(Discrete triangles). A discrete triangle in

G_{ε}

is an ordered triple of distinct vertices

(x_{i}, x_{j}, x_{k})

such that all three edges

{x_{i}, x_{j}}

,

{x_{j}, x_{k}}

and

{x_{k}, x_{i}}

belong to

E_{ε}

. The associated oriented loop is

γ_{i j k} : = (x_{i}, x_{j}, x_{k}, x_{i}) .

We denote by

T_{ε}

the set of all discrete triangles in

G_{ε}

.

For each discrete triangle

(x_{i}, x_{j}, x_{k})

we define the corresponding holonomy operator

{Hol}_{i j k}^{(ε)} : = {Hol}_{γ_{i j k}}^{(ε)} = Φ_{k i} \circ Φ_{j k} \circ Φ_{i j} : P_{i} \to P_{i} .

3.5. Triangle Geometry and Area Approximation

In order to compare discrete holonomy around triangles with continuous holonomy around geodesic triangles, we need to relate the combinatorial triangles in

G_{ε}

to small geodesic triangles in

(M, g)

and to assign an appropriate area to each discrete triangle.

Given a discrete triangle

(x_{i}, x_{j}, x_{k}) \in T_{ε}

we consider the unique geodesic triangle

Δ_{g} (x_{i}, x_{j}, x_{k})

formed by the minimizing geodesics between

x_{i}, x_{j}, x_{k}

. For

ε

sufficiently small, Assumption 5 ensures that all edge lengths

d_{g} (x_{i}, x_{j})

,

d_{g} (x_{j}, x_{k})

and

d_{g} (x_{k}, x_{i})

are bounded by a constant multiple of

r_{ε}

, so that

Δ_{g} (x_{i}, x_{j}, x_{k})

is contained in a convex normal neighbourhood. We denote by

A_{g} (i, j, k) : = A_{g} (x_{i}, x_{j}, x_{k})

the Riemannian area of this geodesic triangle.

In purely informational settings one may not have direct access to

A_{g} (i, j, k)

, but for the analytical convergence results we only require that the area used in the discrete curvature is a good approximation of

A_{g} (i, j, k)

.

Assumption 10

(Area approximation). For each triangle

(x_{i}, x_{j}, x_{k}) \in T_{ε}

we are given a non-negative number

A_{ε} (i, j, k)

, called its discrete area, such that there exists a sequence

q_{ε} \to 0

with

| A_{ε} (i, j, k) - A_{g} (i, j, k) | \leq q_{ε} r_{ε}^{2}

for all

(x_{i}, x_{j}, x_{k}) \in T_{ε}

when ε is sufficiently small.

The factor

r_{ε}^{2}

reflects the typical area of a small triangle with edge lengths of order

r_{ε}

.

3.6. Families of Triangles Approximating Two-Planes

To define discrete sectional curvatures, we will need to average over families of discrete triangles that approximate a given point and two-plane in the manifold. This requires an additional isotropy assumption on the sampling.

Let

K \subset M

be a fixed compact subset. For each

x \in K

and each two-dimensional subspace

Π \subset T_{x} M

, we wish to consider families of discrete triangles based at vertices close to x whose geodesic realisations are small and whose edge directions approximate

Π

in an approximately isotropic fashion.

Assumption 11

(Directional sampling and triangle families). There exists a sequence

η_{ε} \to 0

such that the following holds. For each compact

K \subset M

there exists

ε_{K} > 0

such that, for all

0 < ε < ε_{K}

, for every

x \in K

and every two-dimensional subspace

Π \subset T_{x} M

, one can choose:

a vertex $x_{i} \in V_{ε}$ with $d_{g} (x, x_{i}) \leq c_{2} r_{ε}$ ;
a finite non-empty set of triangles

$T_{ε} (x_{i}, Π) \subset {(x_{i}, x_{j}, x_{k}) \in T_{ε}}$

such that:

1.

(Scale and non-degeneracy) For all $(x_{i}, x_{j}, x_{k}) \in T_{ε} (x_{i}, Π)$ , all edge lengths are bounded by $C r_{ε}$ and bounded below by $c r_{ε}$ for some constants $0 < c \leq C < \infty$ independent of $x, Π, ε$ . Moreover, the associated geodesic triangle is uniformly non-degenerate: there exists $c_{A} > 0$ (independent of $x, Π, ε$ ) such that

$A_{g} (i, j, k) \geq c_{A} r_{ε}^{2},$

and consequently (by Assumption 10) also $A_{ε} (i, j, k) \geq \frac{c_{A}}{2} r_{ε}^{2}$ for ε small.

2.

(Planarity) Let $u_{i j}, u_{i k} \in T_{x} M$ denote the initial velocity vectors of the geodesics from x to $x_{j}$ and x to $x_{k}$ (transported back to $T_{x} M$ via parallel transport if necessary). Then the angle between the plane spanned by ${u_{i j}, u_{i k}}$ and Π is at most $η_{ε}$ .

3.

(Directional isotropy) The distribution of directions of the edges incident to $x_{i}$ within $T_{ε} (x_{i}, Π)$ is approximately isotropic in Π, in the sense that for any continuous function φ on the unit circle of Π one has

$| \frac{1}{| T_{ε} (x_{i}, Π) |} \sum_{(x_{i}, x_{j}, x_{k}) \in T_{ε} (x_{i}, Π)} φ (θ_{i j}) - \frac{1}{2 π} \int_{0}^{2 π} φ (θ) d θ | ⟶ 0,$

as $ε \to 0$ , where $θ_{i j}$ denotes the direction of the geodesic from x to $x_{j}$ projected onto Π and normalized.

Assumption 11 is a discrete isotropy condition ensuring that the graph carries enough small, uniformly non-degenerate triangles to sample each two-plane uniformly in the limit

ε \to 0

. It is analogous to assumptions used in the analysis of discrete Laplace–Beltrami operators and curvature estimators on random or quasi-uniform point clouds.

The structures and assumptions introduced in this section—sampling graphs, discrete fibres and divergences, channels, holonomy operators, triangle areas and isotropic triangle families—provide the discrete environment in which we will define the informational holonomy curvature. In the next section we use these ingredients to formulate precise discrete and continuous curvature quantities and to state the main convergence results.

4. Informational Holonomy Curvature: Definitions

In this section we define the informational holonomy defect and the associated curvature, both in the discrete setting of sampling graphs and in the continuous state-bundle setting. A key point, already implicit in Assumption 1, is that the divergence on each fibre induces a Riemannian metric and therefore a natural local distance. The curvature will be defined using this distance, which is linear in the holonomy displacement to first order, rather than the divergence itself, which only captures a quadratic effect.

4.1. Informational Distances and Defects for Discrete Loops

We begin by extracting from each fibre divergence a distance-like function.

Recall that, for each

ε > 0

and each vertex

x_{i} \in V_{ε}

, we have a fibre

P_{i}

endowed with a Riemannian metric

g^{P_{i}}

and a divergence

D_{i} : P_{i} \times P_{i} \to [0, \infty)

satisfying Assumption 1 (after transport to the continuous fibre). We define:

Definition 10

(Discrete informational distance). For each vertex

x_{i} \in V_{ε}

and

s, t \in P_{i}

we define the informational distance

d_{i} (s, t) : = \sqrt{2 D_{i} (s, t)} .

For the purposes of the estimates below, one may replace

d_{i}

by the Riemannian distance on

(P_{i}, g^{P_{i}})

, which is locally equivalent to

d_{i}

by Lemma 4. We implicitly make this replacement whenever the triangle inequality is invoked.

In general

d_{i}

is only guaranteed to be a local distance function near the diagonal (we do not assume the triangle inequality). In the Jensen–Shannon setting,

d_{i}

coincides (up to the global constant factor

\sqrt{2}

) with the usual Jensen–Shannon distance on the probability simplex.

We now assign informational defects to discrete loops. Recall that, for each loop

γ \in {Loops}_{ε} (x_{i})

based at

x_{i}

, we have a holonomy operator

{Hol}_{γ}^{(ε)} : P_{i} \to P_{i}

defined by composition of edge channels along

γ

, and a reference state

μ_{i} \in P_{i}

.

Remark 6

(Riemannian vs. informational distance). For later estimates it will be convenient to work with the genuine Riemannian distance on each fibre

(P_{i}, g^{P_{i}})

, which we denote by

d_{i}^{R}

. By Lemma 4 and compactness,

d_{i}

and

d_{i}^{R}

are locally equivalent: there exist constants

0 < c_{1} \leq c_{2} < \infty

such that, whenever

d_{i}^{R} (s, t)

is sufficiently small,

c_{1} d_{i}^{R} (s, t) \leq d_{i} (s, t) \leq c_{2} d_{i}^{R} (s, t) .

In particular, all notions of “defect” and “curvature” defined using

d_{i}

are unchanged, up to uniform multiplicative constants, if one replaces

d_{i}

by

d_{i}^{R}

. From this point on, whenever the triangle inequality is invoked we implicitly work with

d_{i}^{R}

; the symbol

d_{i}

may be read as either distance, since they are locally equivalent.

Lemma 3

(From divergence contraction to distance contraction). Let D be a nonnegative divergence and define

d (s, t) : = \sqrt{2 D (s, t)}

. If a map Φ satisfies

D (Φ (s), Φ (t)) \leq C D (s, t) \forall s, t,

then

d (Φ (s), Φ (t)) \leq \sqrt{C} d (s, t) \forall s, t .

Proof.

Immediate from

d = \sqrt{2 D}

and monotonicity of the square root. □

Definition 11

(Discrete informational holonomy defects). Let

γ \in {Loops}_{ε} (x_{i})

be a discrete loop based at

x_{i} \in V_{ε}

. We define:

1.: the divergence defect

$δ_{γ}^{(ε)} (x_{i}) : = D_{i} (μ_{i}, {Hol}_{γ}^{(ε)} (μ_{i})) \geq 0;$
2.: the distance defect

$Δ_{γ}^{(ε)} (x_{i}) : = d_{i} (μ_{i}, {Hol}_{γ}^{(ε)} (μ_{i})) = \sqrt{2 δ_{γ}^{(ε)} (x_{i})} \geq 0 .$

For curvature purposes,

δ_{γ}^{(ε)}

and

Δ_{γ}^{(ε)}

contain equivalent information, but the distance defect

Δ_{γ}^{(ε)}

scales linearly with the holonomy displacement and is therefore the natural quantity to normalize by the area of small loops.

As in Section 3.4, we now specialise to loops associated with discrete triangles. For a triangle

(x_{i}, x_{j}, x_{k}) \in T_{ε}

, with associated loop

γ_{i j k} = (x_{i}, x_{j}, x_{k}, x_{i})

and holonomy operator

{Hol}_{i j k}^{(ε)} : = Φ_{k i} \circ Φ_{j k} \circ Φ_{i j} : P_{i} \to P_{i},

we use the shorthand

δ_{i j k}^{(ε)} : = δ_{γ_{i j k}}^{(ε)} (x_{i}), Δ_{i j k}^{(ε)} : = Δ_{γ_{i j k}}^{(ε)} (x_{i}) .

Definition 12

(Triangle defects). For a discrete triangle

(x_{i}, x_{j}, x_{k}) \in T_{ε}

, the divergence defect and distance defect of the triangle based at

x_{i}

are given by

δ_{i j k}^{(ε)} : = D_{i} (μ_{i}, {Hol}_{i j k}^{(ε)} (μ_{i})), Δ_{i j k}^{(ε)} : = d_{i} (μ_{i}, {Hol}_{i j k}^{(ε)} (μ_{i})) .

4.2. Discrete Informational Holonomy Curvature of Triangles

To obtain a curvature quantity from the defects, we normalize by the area associated with each triangle. Assumption 10 provides a discrete area

A_{ε} (i, j, k)

which approximates the Riemannian area

A_{g} (i, j, k)

of the geodesic triangle with vertices

x_{i}, x_{j}, x_{k}

.

Definition 13

(Triangle-wise informational holonomy curvature). Let

(x_{i}, x_{j}, x_{k}) \in T_{ε}

be a discrete triangle. The informational holonomy curvature of the triangle

(x_{i}, x_{j}, x_{k})

is

K_{hol}^{(ε)} (i, j, k) : = \frac{Δ_{i j k}^{(ε)}}{A_{ε} (i, j, k)},

with the convention that

K_{hol}^{(ε)} (i, j, k) = 0

if

A_{ε} (i, j, k) = 0

.

Thus

K_{hol}^{(ε)} (i, j, k)

measures the informational distance travelled by the reference state per unit area when it is transported around the discrete triangle

(x_{i}, x_{j}, x_{k})

.

In order to define a discrete sectional curvature at a point and a two-dimensional direction, we now average the triangle-wise curvature over suitable families of triangles.

Let

K \subset M

be a fixed compact set. For each

x \in K

and each 2-plane

Π \subset T_{x} M

, Assumption 11 provides, for

ε

small enough, a vertex

x_{i} \in V_{ε}

close to x and a finite family of discrete triangles

T_{ε} (x_{i}, Π) \subset T_{ε}

which are small, non-degenerate, have edge directions close to

Π

, and are approximately isotropically distributed in

Π

.

Definition 14

(Discrete informational sectional curvature). Let

K \subset M

be compact,

x \in K

, and

Π \subset T_{x} M

a two-dimensional subspace. For

ε > 0

small enough, let

x_{i} \in V_{ε}

and

T_{ε} (x_{i}, Π)

be as in Assumption 11. The discrete informational sectional curvature at scale ε associated with

(x, Π)

is

K_{hol}^{(ε)} (x, Π) : = \frac{\sum_{(x_{i}, x_{j}, x_{k}) \in T_{ε} (x_{i}, Π)} A_{ε} (i, j, k) K_{hol}^{(ε)} (i, j, k)}{\sum_{(x_{i}, x_{j}, x_{k}) \in T_{ε} (x_{i}, Π)} A_{ε} (i, j, k)} .

(7)

If the denominator vanishes, we set

K_{hol}^{(ε)} (x, Π) : = 0

by convention.

In words,

K_{hol}^{(ε)} (x, Π)

is the area-weighted average of the triangle-wise informational holonomy curvature over all triangles in

T_{ε} (x_{i}, Π)

.

Remark 7

(Choice of base vertex and triangle family). The definition of

K_{hol}^{(ε)} (x, Π)

in (7) involves several auxiliary choices: for each

x \in K

and two-plane

Π \subset T_{x} M

we pick a nearby vertex

x_{i}

with

d_{g} (x, x_{i}) \leq c_{2} r_{ε}

and a finite family of triangles

T_{ε} (x_{i}, Π)

as in Assumption 11. A priori, different admissible choices

(x_{i}, T_{ε} (x_{i}, Π))

could lead to different values of

K_{hol}^{(ε)} (x, Π)

.

Under Assumptions 4–11, however, Lemmas 10 and 11 imply that any two such choices produce values that differ by at most

C κ_{ε}

, with

κ_{ε}

as in (10). In particular, the limit

K_{hol}^{(ε)} (x, Π) \to K_{hol}^{cont} (x, Π)

in Theorem 2 is independent of these auxiliary choices.

4.3. Continuous Informational Holonomy Curvature Revisited

We now recall the continuous framework and align the definitions with the discrete case by introducing the corresponding informational distance on each fibre.

Let

(M, g)

be a Riemannian manifold,

π : P \to M

a state bundle with fibre metrics and divergences as in Definition 2 and Assumption 1, equipped with a connection satisfying Assumption 3. Let

μ : M \to P

be a smooth reference state field.

For each

x \in M

and

s, t \in P_{x}

, we define the continuous informational distance

d_{x} (s, t) : = \sqrt{2 D_{x} (s, t)} .

(8)

By Assumption 1,

d_{x}

is locally equivalent to the Riemannian distance induced by

g^{P_{x}}

on the fibre (see Lemma 4 in Section 5).

For

x \in M

and a two-dimensional subspace

Π \subset T_{x} M

, consider pairs of tangent vectors

u, v \in Π

of sufficiently small norm and the associated geodesic triangle with vertices

x, y = {exp}_{x} (u), z = {exp}_{x} (v) .

Let

γ_{x, u, v}

denote the closed loop

x \to y \to z \to x

obtained by traversing the geodesic segments in order, and let

A_{g} (x, y, z)

be its Riemannian area. The continuous holonomy map

{Hol}_{γ_{x, u, v}} : P_{x} \to P_{x}

is defined by parallel transport along

γ_{x, u, v}

.

Definition 15

(Continuous informational holonomy defects). The continuous divergence defect and continuous distance defect of the loop

γ_{x, u, v}

at x are

δ_{x, u, v} : = D_{x} (μ_{x}, {Hol}_{γ_{x, u, v}} (μ_{x})), Δ_{x, u, v} : = d_{x} (μ_{x}, {Hol}_{γ_{x, u, v}} (μ_{x})) = \sqrt{2 δ_{x, u, v}} .

We are interested in the behaviour of

Δ_{x, u, v}

as the triangle shrinks to x within the plane

Π

.

Definition 16

(Continuous informational sectional curvature). Let

(M, g, P, μ)

be as above, and fix

x \in M

and a two-dimensional subspace

Π \subset T_{x} M

. For

u, v \in Π

sufficiently small we denote by

γ_{x, u, v}

the associated piecewise-geodesic loop and by

Δ_{x, u, v}

its informational distance defect at x (cf. Definition 15).

We say that the continuous informational sectional curvature at

(x, Π)

exists if there are constants

r_{0} > 0

and

c_{0} > 0

and, for each

r \in (0, r_{0}]

, vectors

u_{r}, v_{r} \in Π

such that:

1.: the corresponding geodesic triangle

$Δ_{g} (x, {exp}_{x} (u_{r}), {exp}_{x} (v_{r}))$

is contained in a normal neighbourhood of x and its vertices converge to x as $r \to 0$ ;
2.: the triangle has uniformly non-degenerate shape in $Π$ , in the sense that

$c_{0} r \leq max {∥ u_{r} ∥_{g}, ∥ v_{r} ∥_{g}} \leq r, A_{g} (x, {exp}_{x} (u_{r}), {exp}_{x} (v_{r})) \geq c_{0} r^{2},$

so that the side lengths and the area are uniformly comparable to r and $r^{2}$ , respectively, independently of r.

Whenever this holds and, for every such admissible family

(u_{r}, v_{r})

, the limit

K_{hol}^{cont} (x, Π) : = lim_{r \to 0} \frac{Δ_{x, u_{r}, v_{r}}}{A_{g} (x, {exp}_{x} (u_{r}), {exp}_{x} (v_{r}))}

(9)

exists and has the same value, we call this common value the continuous informational sectional curvature at

(x, Π)

.

In Section 5 we show, under Assumptions 1 and 3, that this limit exists for all

x \in M

and all two-planes

Π \subset T_{x} M

, and that it can be expressed explicitly in terms of the curvature of the connection on

P

along

Π

.

4.4. Main Curvature Theorems

We can now formulate the main results of this work, which will be proved in Section 5 and Section 6. The first theorem relates the continuous informational sectional curvature to the curvature of the connection on the state bundle. The second theorem shows that the discrete informational sectional curvature converges to this continuous quantity as the sampling becomes dense.

Throughout this subsection we fix a compact subset

K \subset M

and tacitly restrict attention to points

x \in K

.

Theorem 1

(Continuous informational holonomy curvature). Let

(M, g)

be a smooth Riemannian manifold,

π : P \to M

a state bundle with fibre metrics and divergences satisfying Assumption 1, endowed with a connection satisfying Assumption 3, and let

μ : M \to P

be a smooth reference state field.

Then, for every

x \in M

and every two-dimensional subspace

Π \subset T_{x} M

, the continuous informational sectional curvature

K_{hol}^{cont} (x, Π)

exists in the sense of Definition 16 and can be written as

K_{hol}^{cont} (x, Π) = ∥ W_{x} (Π; μ_{x}) ∥_{g^{P_{x}}},

where

W_{x} (Π; μ_{x}) \in T_{μ_{x}} P_{x}

is a vector depending linearly on the curvature of the connection on

P

along Π. In particular, when the connection on

P

is induced by the Levi–Civita connection via a linear isometric representation,

K_{hol}^{cont} (x, Π)

is a scalar invariant built from the Riemann curvature tensor

R_{x}^{g}

restricted to Π and is proportional to

| \sec_{g} (x, Π) |

in spaces of constant sectional curvature.

Remark 8

(Dependence on the informational structure). The quantity

K_{hol}^{cont} (x, Π)

should not be thought of as a curvature of the Riemannian manifold

(M, g)

alone. It depends on the choice of state bundle

π : P \to M

, on the fibre divergences

{D_{x}}_{x \in M}

(equivalently, on the fibre metrics

g^{P_{x}}

), on the Ehresmann connection used to define parallel transport, and on the reference state field μ. Different choices over the same

(M, g)

may lead to different informational holonomy curvatures. In particular, two bundles with the same base

(M, g)

but different fibre geometries or connections need not produce the same values of

K_{hol}^{cont}

.

We now turn to the discrete setting. In addition to the continuous structures above, we assume that the manifold

(M, g)

is sampled by graphs

{(G_{ε})}_{ε > 0}

and that discrete fibres, divergences, reference states, channels, areas and triangle families are given as in Assumptions 4–11.

For convenience, we introduce a sequence of positive numbers

{(κ_{ε})}_{ε > 0}

that captures the various discretization errors. More precisely, we set

κ_{ε} : = r_{ε} + η_{ε} + q_{ε} + ρ_{ε}, ρ_{ε} : = r_{ε}^{α - 1} .

(10)

Here

r_{ε}

is the sampling radius from Assumption 4,

η_{ε}

and

q_{ε}

are the anisotropy and area errors from Assumptions 11 and 10, and

ρ_{ε} \to 0

controls the channel consistency scale induced by Assumption 9(2) with exponent

α > 1

.

Theorem 2

(Discrete-to-continuous convergence of informational holonomy curvature). Let

(M, g, P, μ)

be as in Theorem 1, and let

{(G_{ε})}_{ε > 0}

, together with discrete fibres, divergences, reference states, channels, areas and triangle families, satisfy Assumptions 4–11. Let

K \subset M

be compact.

Then there exist constants

C > 0

and

ε_{K} > 0

such that, for all

0 < ε < ε_{K}

, for every

x \in K

and every two-dimensional subspace

Π \subset T_{x} M

, the discrete informational sectional curvature

K_{hol}^{(ε)} (x, Π)

is well defined and satisfies

| K_{hol}^{(ε)} (x, Π) - K_{hol}^{cont} (x, Π) | \leq C κ_{ε},

(11)

where

κ_{ε}

is given by (10). In particular,

| K_{hol}^{(ε)} (x, Π) - ∥ W_{x} (Π; μ_{x}) ∥_{g^{P_{x}}} | \leq C κ_{ε} .

Consequently, as

ε \to 0

, the discrete informational sectional curvatures converge uniformly on compact subsets of M and on the Grassmannian of two-planes to the continuous informational sectional curvature

K_{hol}^{cont}

.

The proof of Theorem 2, given in Section 6, proceeds in two steps. First, we compare the discrete holonomy operators

{Hol}_{i j k}^{(ε)}

around small triangles with the continuous holonomy operators associated with the corresponding geodesic triangles, using Assumption 9 and the local consistency of the sampling. Second, we exploit the isotropy of the triangle families from Assumption 11 to show that the area-weighted average (7) converges to the continuous limit (9), with an error controlled by

κ_{ε}

.

Overview of assumptions.

For the reader’s convenience we briefly summarise the rôle of the hypotheses. Assumption 1 requires each fibre divergence

D_{x}

to admit a second-order expansion whose quadratic part induces the fibre Riemannian metric

g^{P_{x}}

; together with Assumption 3 this ensures that informational distances behave, locally and under parallel transport, like the Riemannian distances of the fibre metrics. Assumption 2 is a structural regularity assumption guaranteeing that the Ehresmann connection on

P

arises from a smooth principal connection.

On the discrete side, Assumptions 4 and 5 encode that the sampling graphs

G_{ε}

form quasi-uniform discretisations of

(M, g)

with bounded degree and uniformly controlled edge lengths, whereas Assumption 6 provides vertex weights approximating the Riemannian volume. Assumption 7 specifies discrete fibres and divergences approximating the continuous ones, and Assumption 8 does the same for the reference states. Assumption 9 postulates channels that are local and Lipschitz and whose first-order behaviour approximates continuous parallel transport, with an error measured by

ρ_{ε}

. Finally, Assumption 10 ensures that the discrete triangle areas

A_{ε} (i, j, k)

approximate the Riemannian areas, and Assumption 11 provides, for each

(x, Π)

, families of triangles that sample directions in

Π

in an almost isotropic way. The quantity

κ_{ε}

in (10) summarises the various errors contributed by these assumptions.

Definitions 13, 14 and 16, together with Theorems 1 and 2, provide the conceptual and analytical core of the informational holonomy curvature framework. In the next sections we make these statements precise by deriving the continuous holonomy-curvature relation and then establishing the discrete-to-continuous convergence.

5. Continuous Holonomy Curvature and Connection Curvature

In this section we prove Theorem 1. We work in the continuous framework of Section 2:

(M, g)

is a smooth Riemannian manifold,

π : P \to M

is a state bundle with fibre metrics and divergences satisfying Assumption 1, endowed with a connection satisfying Assumption 3, and

μ : M \to P

is a smooth reference state field.

The core of the argument consists of two ingredients:

the second-order expansion of the divergence on each fibre, which implies that the informational distance $d_{x} (s, t) = \sqrt{2 D_{x} (s, t)}$ is locally equivalent to the Riemannian distance induced by $g^{P_{x}}$ ;
the first-order (in area) expansion of the holonomy map of the connection on $P$ around small geodesic triangles, controlled by the curvature of the connection.

Combining these, we obtain a linear-in-area behaviour for the distance defect

Δ_{x, u, v}

and hence the existence and explicit form of the continuous informational sectional curvature.

Throughout this section we fix a compact set

K \subset M

, and all constants will be uniform over

x \in K

and over two-planes

Π \subset T_{x} M

.

5.1. Local Expansion of the Informational Distance

We recall that for each

x \in M

and

s, t \in P_{x}

the informational distance is defined by

d_{x} (s, t) : = \sqrt{2 D_{x} (s, t)},

where

D_{x}

is the divergence on the fibre

P_{x}

(see (8)). The following lemma makes precise the local behaviour of

d_{x}

in terms of the Riemannian metric

g^{P_{x}}

on the fibre.

Lemma 4

(Local expansion of the informational distance). Let

K \subset M

be compact. There exist constants

C > 0

and

r > 0

such that for every

x \in K

, for every

s \in P_{x}

, and for every

v \in T_{s} P_{x}

with

{∥ v ∥}_{g^{P_{x}}} \leq r

, the following holds. Let

t : = {exp}_{s}^{P_{x}} (v),

where

{exp}_{s}^{P_{x}}

is the Riemannian exponential map on

(P_{x}, g^{P_{x}})

. Then

| d_{x} (s, t) - {∥ v ∥}_{g^{P_{x}}} | \leq C {∥ v ∥}_{g^{P_{x}}}^{2} .

(12)

In particular, there exist constants

c_{1}, c_{2} > 0

such that, for all such v,

c_{1} {∥ v ∥}_{g^{P_{x}}} \leq d_{x} (s, t) \leq c_{2} {∥ v ∥}_{g^{P_{x}}} .

Proof.

Fix

x \in K

and

s \in P_{x}

. By Assumption 1, in a normal coordinate chart for

(P_{x}, g^{P_{x}})

centred at s, the divergence

D_{x}

satisfies

D_{x} (s, t) = \frac{1}{2} g_{s}^{P_{x}} (v, v) + R_{s} (v),

where

t = {exp}_{s}^{P_{x}} (v)

and

R_{s} (v)

is a remainder term with

| R_{s} (v) | \leq C_{0} {∥ v ∥}_{g^{P_{x}}}^{3}

for

{∥ v ∥}_{g^{P_{x}}} \leq r_{0}

, with

C_{0}, r_{0} > 0

independent of x and s in compact sets (smoothness and compactness of K).

By definition,

d_{x} (s, t) = \sqrt{2 D_{x} (s, t)} = \sqrt{g_{s}^{P_{x}} (v, v) + 2 R_{s} (v)} .

Let

L : = {∥ v ∥}_{g^{P_{x}}}

. Then

g_{s}^{P_{x}} (v, v) = L^{2}

and

d_{x} (s, t) = L \sqrt{1 + \frac{2 R_{s} (v)}{L^{2}}} .

For

0 < L \leq r_{0}

we have

| \frac{2 R_{s} (v)}{L^{2}} | \leq 2 C_{0} L .

Choose

r \leq r_{0}

such that

2 C_{0} r \leq 1 / 2

. Then for

L \leq r

we have

| \frac{2 R_{s} (v)}{L^{2}} | \leq \frac{1}{2} .

For

| z | \leq 1 / 2

the Taylor expansion of

\sqrt{1 + z}

yields

\sqrt{1 + z} = 1 + \frac{z}{2} + θ (z) z^{2},

where

| θ (z) | \leq C_{1}

for some universal constant

C_{1}

. Taking

z = 2 R_{s} (v) / L^{2}

, we obtain

d_{x} (s, t) = L (1 + \frac{R_{s} (v)}{L^{2}} + θ (\frac{2 R_{s} (v)}{L^{2}}) \frac{4 R_{s} {(v)}^{2}}{L^{4}}) .

Using

| R_{s} (v) | \leq C_{0} L^{3}

, we get

| \frac{R_{s} (v)}{L^{2}} | \leq C_{0} L, | \frac{4 R_{s} {(v)}^{2}}{L^{4}} | \leq 4 C_{0}^{2} L^{2} .

Therefore,

| d_{x} (s, t) - L | \leq L (C_{0} L + C_{1} \cdot 4 C_{0}^{2} L^{2}) \leq C L^{2},

for some constant

C > 0

depending only on

C_{0}

and

C_{1}

. This proves (12).

The two-sided inequality follows from (12) by taking L sufficiently small and absorbing the quadratic term into the linear one. □

5.2. Curvature of the Connection and Small-Loop Holonomy

We now recall the curvature of an Ehresmann connection on

π : P \to M

and its relation with holonomy around small loops. We adopt a local viewpoint sufficient for our application to small geodesic triangles.

Let

H_{p} \subset T_{p} P

be the horizontal subspace at

p \in P

and

V_{p} = ker (d π_{p})

the vertical subspace. For each vector field X on M there exists a unique horizontal lift

X^{H}

on

P

such that

d π (X_{p}^{H}) = X_{π (p)}

for all

p \in P

.

The curvature of the connection is the vertical-valued 2-form

Ω

on

P

defined by

Ω_{p} (X, Y) : = {[X^{H}, Y^{H}]}_{p}^{vert},

where

X, Y

are vector fields on M and the superscript

vert

denotes projection onto

V_{p}

in the decomposition

T_{p} P = H_{p} \oplus V_{p}

. This definition is independent of the choice of extensions of

X, Y

.

For each

x \in M

and

p \in P_{x} : = π^{- 1} (x)

, the restriction of

Ω

gives a bilinear alternating map

Ω_{x, p} : Λ^{2} T_{x} M \to T_{p} P_{x},

by setting

Ω_{x, p} (u, v) : = Ω_{p} (\tilde{u}, \tilde{v})

, where

\tilde{u}, \tilde{v}

are any vector fields extending

u, v

in a neighbourhood of x. This is well defined and smooth in

(x, p)

.

We now state the holonomy expansion around small geodesic triangles. The result is a specialization of standard holonomy-curvature relations (see, for example, [5] Chapter II, Sections 3–4]), adapted to our two-dimensional situation and with explicit attention to the dependence on the area of the triangle.

Let

x \in M

and

Π \subset T_{x} M

be a two-dimensional subspace. For

u, v \in Π

sufficiently small, we set

y = {exp}_{x} (u), z = {exp}_{x} (v),

and consider the geodesic triangle with vertices

x, y, z

. Let

γ_{x, u, v}

denote the closed loop obtained by traversing the geodesic segments

x \to y \to z \to x

in order, and let

A_{g} (x, y, z)

be the Riemannian area of the geodesic triangle.

Fix

p_{x} \in P_{x}

. Parallel transport along

γ_{x, u, v}

yields a holonomy map

{Hol}_{γ_{x, u, v}} : P_{x} \to P_{x}

sending

p_{x}

to a point

p_{x, u, v} : = {Hol}_{γ_{x, u, v}} (p_{x}) \in P_{x}

.

Lemma 5

(Small-loop holonomy expansion). Assume the associated-bundle framework of Assumption 2. Let

K \subset M

be compact. Then there exist constants

C > 0

and

r > 0

such that for every

x \in K

, every 2-plane

Π \subset T_{x} M

, every pair

u, v \in Π

with

{∥ u ∥}_{g}, {∥ v ∥}_{g} \leq r

, and every

p_{x} \in P_{x}

, the following holds.

Let

y = {exp}_{x} (u)

,

z = {exp}_{x} (v)

, let

A : = A_{g} (x, y, z)

be the Riemannian area of the geodesic triangle

Δ_{g} (x, y, z)

, and let

γ_{x, u, v}

be the piecewise-geodesic loop

x \to y \to z \to x

. Denote

p_{x, u, v} : = {Hol}_{γ_{x, u, v}} (p_{x}) \in P_{x}

and

ℓ : = max {∥ u ∥_{g}, ∥ v ∥_{g}}

. Then

{exp}_{p_{x}}^{P_{x}}^{- 1} (p_{x, u, v}) = A W_{x} (Π; p_{x}) + R_{x} (u, v; p_{x}),

(13)

where

W_{x} (Π; p_{x}) \in T_{p_{x}} P_{x}

depends smoothly on

(x, Π, p_{x})

and linearly on the curvature of the inducing principal connection, and the remainder satisfies

∥ R_{x} (u, v; p_{x}) ∥_{g^{P_{x}}} \leq C ℓ^{3} .

(14)

More explicitly: fix an orientation of Π and let

(e_{1}, e_{2})

be any oriented g-orthonormal basis of Π. Let

X_{x} (Π) \in g

be the curvature element determined by the principal curvature

F_{ω}

(in any local gauge) evaluated on

(e_{1}, e_{2})

at x. Let

ξ : g \to Γ (T S)

be the infinitesimal action (fundamental vector fields) of G on S. Then, identifying

P_{x} ≃ S

via any choice of

q \in Q_{x}

,

W_{x} (Π; p_{x}) = ξ (X_{x} (Π)) |_{p_{x}} \in T_{p_{x}} P_{x},

and this definition is independent of the chosen gauge because

F_{ω}

is

Ad

-equivariant and the action of G on S is by isometries.

Proof.

We work on a convex normal neighbourhood

U \subset M

of x contained in a compact set K and choose a

C^{3}

local section

σ : U \to Q

. In this gauge, the principal connection is represented by a

g

-valued 1-form

A : = σ^{*} ω

on U, with curvature

F : = σ^{*} F_{ω} = d A + \frac{1}{2} [A \land A]

, whose coefficients are

C^{1}

and uniformly bounded on K (by Assumption 2).

Let

Σ_{x, u, v} \subset U

be the geodesic triangle surface with boundary

\partial Σ_{x, u, v} = γ_{x, u, v}

. The holonomy of the principal connection around

γ_{x, u, v}

is an element

g_{x, u, v} \in G

obtained by the path-ordered exponential of A along

γ_{x, u, v}

. In this standard principal-connection setting, the non-abelian Stokes/holonomy–curvature expansion (see, e.g., [5], Chapter II, §3–§4) yields

g_{x, u, v} = exp (\int_{Σ_{x, u, v}} F) exp (E_{x, u, v}), {∥ E_{x, u, v} ∥}_{g} \leq C_{0} ℓ^{3},

(15)

for ℓ small, with constants uniform for

x \in K

and

Π

varying in the Grassmannian. Moreover, since F is

C^{1}

, Taylor expansion on

Σ_{x, u, v}

gives

\int_{Σ_{x, u, v}} F = A F_{x} (e_{1}, e_{2}) + O (ℓ^{3}) in g,

where

F_{x} (e_{1}, e_{2})

denotes the curvature evaluated at x on an oriented orthonormal basis

(e_{1}, e_{2})

of

Π

(and the

O (ℓ^{3})

term is uniform on K). Combining with (15) yields

g_{x, u, v} = exp (A X_{x} (Π) + {\tilde{E}}_{x, u, v}), {∥ {\tilde{E}}_{x, u, v} ∥}_{g} \leq C_{1} ℓ^{3} .

(16)

Now pass to the associated bundle. Choose

q \in Q_{x}

and identify

P_{x} ≃ S

by

[q, s] \leftrightarrow s

. Under this identification, the holonomy acts by the G-action:

p_{x, u, v} = {Hol}_{γ_{x, u, v}} (p_{x}) = g_{x, u, v} \cdot p_{x} .

Consider the smooth map

H_{p_{x}} : g \supset B (0, δ) \to T_{p_{x}} S, H_{p_{x}} (Z) : = {exp}_{p_{x}}^{S}^{- 1} (exp (Z) \cdot p_{x}),

defined for

δ > 0

small. Since the G-action is smooth,

H_{p_{x}}

is

C^{2}

and satisfies

D H_{p_{x}} {(0) [Z] = ξ (Z) |}_{p_{x}}

(the fundamental vector field). Therefore, Taylor expansion at 0 gives

H_{p_{x}} {(Z) = ξ (Z) |}_{p_{x}} + {O (∥ Z ∥}_{g}^{2}) in T_{p_{x}} S,

(17)

with a uniform constant for

x \in K

and

p_{x}

ranging in compact subsets of the fibres.

Apply (17) to

Z : = A X_{x} (Π) + {\tilde{E}}_{x, u, v}

from (16). Since

A = O (ℓ^{2})

and

∥ {\tilde{E}}_{x, u, v} ∥ = O (ℓ^{3})

, we obtain

{exp}_{p_{x}}^{P_{x}}^{- 1} (p_{x, u, v}) = H_{p_{x}} (Z) = A ξ (X_{x} (Π)) |_{p_{x}} + O (ℓ^{3}),

because the quadratic term

O (∥ Z ∥^{2})

is

O (ℓ^{4})

and hence dominated by

O (ℓ^{3})

. This yields (13) and (14) with

W_{x} (Π; p_{x}) = ξ (X_{x} (Π)) |_{p_{x}}

. □

Remark 9.

The vector

W_{x} (Π; p_{x})

depends linearly on the curvature

Ω_{x, p_{x}}

restricted to Π. In particular, when

P

is an associated bundle to a principal bundle with a connection induced from the Levi–Civita connection via an isometric representation,

W_{x} (Π; p_{x})

is obtained by applying the differential of the representation to the Riemann curvature tensor

R_{x}^{g}

restricted to Π.

5.3. Proof of Theorem 1

Fix

x \in M

and a two-dimensional subspace

Π \subset T_{x} M

. Let

(u_{r}, v_{r}) \in Π \times Π

be a family as in Definition 16, with

∥ u_{r} ∥_{g}, {∥ v_{r} ∥}_{g} \to 0

and with uniformly non-degenerate shape in

Π

. Set

y_{r} : = {exp}_{x} (u_{r}), z_{r} : = {exp}_{x} (v_{r}), A_{r} : = A_{g} (x, y_{r}, z_{r}),

and let

γ_{r} : = γ_{x, u_{r}, v_{r}}

be the boundary loop of the geodesic triangle

Δ_{g} (x, y_{r}, z_{r})

. Finally set

p_{x} : = μ_{x}, p_{r} : = {Hol}_{γ_{r}} (p_{x}) \in P_{x} .

By Lemma 5 (applied with

p_{x} = μ_{x}

) we have, for r small,

v_{r} : = {exp}_{p_{x}}^{P_{x}}^{- 1} (p_{r}) = A_{r} W_{x} (Π; μ_{x}) + R_{r},

(18)

where

∥ R_{r} ∥_{g^{P_{x}}} \leq C ℓ_{r}^{3}

and

ℓ_{r} : = max {∥ u_{r} ∥_{g}, ∥ v_{r} ∥_{g}}

. Since the triangles have uniformly non-degenerate shape in

Π

, there exists

c > 0

such that

A_{r} \geq c ℓ_{r}^{2}

for r small. Consequently,

∥ v_{r} ∥_{g^{P_{x}}} = A_{r} {∥ W_{x} (Π; μ_{x}) ∥}_{g^{P_{x}}} + O (A_{r}^{3 / 2}),

(19)

where the implicit constant is uniform for x in compact sets and for

Π

.

Now the continuous distance defect satisfies

Δ_{x, u_{r}, v_{r}} = d_{x} (p_{x}, p_{r}) = d_{x} (p_{x}, {exp}_{p_{x}}^{P_{x}} (v_{r})) .

Applying Lemma 4 with

s = p_{x}

and

v = v_{r}

yields

Δ_{x, u_{r}, v_{r}} = ∥ v_{r} ∥_{g^{P_{x}}} + O (∥ v_{r} ∥_{g^{P_{x}}}^{2}) = A_{r} {∥ W_{x} (Π; μ_{x}) ∥}_{g^{P_{x}}} + O (A_{r}^{3 / 2}) .

(20)

Dividing by

A_{r}

and letting

r \to 0

we obtain

lim_{r \to 0} \frac{Δ_{x, u_{r}, v_{r}}}{A_{r}} = {∥ W_{x} (Π; μ_{x}) ∥}_{g^{P_{x}}} .

This limit is independent of the chosen shrinking family

(u_{r}, v_{r})

(subject to the non-degeneracy condition), and therefore

K_{hol}^{cont} (x, Π)

exists and equals

∥ W_{x} (Π; μ_{x}) ∥_{g^{P_{x}}}

.

5.4. Geometric Models Induced from the Levi–Civita Connection

We briefly justify the last assertion of Theorem 1. Assume that

P

is an associated bundle to the orthonormal frame bundle

Fr (M) \to M

, with structure group

O (n)

acting on the model fibre S by isometries of

(S, g^{S})

. Let

ρ : O (n) \to Isom (S, g^{S})

denote this representation and let ∇ be the Levi–Civita connection on

Fr (M)

; it induces an Ehresmann connection on

P

.

Let

x \in M

and

Π \subset T_{x} M

be a two-plane. Denote by

R_{x}^{g} (Π) \in so (T_{x} M, g_{x})

the curvature endomorphism of ∇ restricted to

Π

(equivalently, the image of

R_{x}^{g}

under the identification

Λ^{2} T_{x} M ≃ so (T_{x} M, g_{x})

). The curvature of the induced connection on

P

is obtained by applying the differential

d ρ

fibrewise, and therefore there exists a smooth linear map

L_{x, p} : so (T_{x} M, g_{x}) ⟶ T_{p} P_{x}, p \in P_{x},

such that

W_{x} (Π; p) = L_{x, p} (R_{x}^{g} (Π)) .

(21)

In particular,

K_{hol}^{cont} (x, Π) = ∥ L_{x, μ_{x}} (R_{x}^{g} (Π)) ∥

is a scalar invariant determined by the restriction of

R_{x}^{g}

to

Π

.

If

(M, g)

has constant sectional curvature

κ

, then for every x and

Π

one has

R_{x}^{g} (Π) = κ J_{Π}

, where

J_{Π}

is the infinitesimal generator of the

g_{x}

-rotation in

Π

(normalized so that

∥ J_{Π} ∥_{HS} = 1

). Combining this with (21) yields

K_{hol}^{cont} (x, Π) = | κ | ∥ L_{x, μ_{x}} (J_{Π}) ∥_{g^{P_{x}}} .

Under the additional natural hypothesis that the model is

O (n)

-equivariant and the reference field

μ

is chosen compatibly with the symmetry, the factor

∥ L_{x, μ_{x}} (J_{Π}) ∥

is independent of x and

Π

, so that

K_{hol}^{cont}

reduces to a constant multiple of

| κ | = | \sec_{g} |

, as claimed.

6. Discrete-to-Continuous Convergence of Informational Holonomy Curvature

In this section we prove Theorem 2. We work under Assumptions 4–11 and the continuous hypotheses of Theorem 1. Throughout,

K \subset M

is a fixed compact set and all constants are uniform over

x \in K

and over two-planes

Π \subset T_{x} M

.

The proof proceeds in three steps:

we compare discrete and continuous holonomy on individual small triangles (SubSection 6.1);
we convert this comparison into a bound between discrete and continuous triangle-wise informational holonomy curvature (SubSection 6.2);
we pass to the averaged, sectional quantity by exploiting the isotropic triangle families from Assumption 11 and the area approximation from Assumption 10 (SubSection 6.3).

6.1. Comparison of Discrete and Continuous Holonomy on Small Triangles

Fix

ε > 0

small, a vertex

x_{i} \in V_{ε}

and a triangle

(x_{i}, x_{j}, x_{k}) \in T_{ε}

. Let

γ_{i j k} = (x_{i}, x_{j}, x_{k}, x_{i})

be the associated discrete loop and

{Hol}_{i j k}^{(ε)} : = Φ_{k i} \circ Φ_{j k} \circ Φ_{i j} : P_{i} \to P_{i}

the discrete holonomy operator based at

x_{i}

.

On the continuous side, consider the geodesic triangle

Δ_{g} (x_{i}, x_{j}, x_{k})

in

(M, g)

and the loop

{\tilde{γ}}_{i j k}

obtained by traversing the minimizing geodesic segments

x_{i} \to x_{j} \to x_{k} \to x_{i}

in order. Let

{Hol}_{{\tilde{γ}}_{i j k}} : P_{x_{i}} \to P_{x_{i}}

be the corresponding holonomy map induced by the connection on

P

. Via the identification

ι_{i}^{(ε)} : P_{i} \to P_{x_{i}}

from Assumption 7, we may view both

{Hol}_{i j k}^{(ε)}

and

{Hol}_{{\tilde{γ}}_{i j k}}

as acting on the same fibre.

For notational simplicity, we replace

ι_{i}^{(ε)}

by the identity and treat

P_{i}

as

P_{x_{i}}

, understanding that all maps and distances are transported accordingly. Thus, in what follows,

μ_{i}

denotes

μ_{x_{i}}

and

d_{i}

denotes the informational distance

d_{x_{i}}

.

We first estimate how well the discrete holonomy operator approximates the continuous one near the reference state.

Lemma 6

(Staying in the local regime along short compositions). Let

K \subset M

be compact. Assume:

(i): The fibre divergences satisfy Assumption 1 uniformly on K.
(ii): The continuous model is in the associated-bundle setting Assumption 2, so that parallel transport preserves fibre Riemannian distances (Lemma 2).
(iii): The discrete channels satisfy Assumption 9(2) in the fibre Riemannian distance $d^{R}$ with exponent $α > 1$ .
(iv): The channel locality Assumption 9(1) holds.

Then there exist constants

r_{0} > 0

,

C > 0

and

ε_{K} > 0

such that for all

0 < ε < ε_{K}

the following holds.

Let

(x_{i}, x_{j})

be any oriented edge with

x_{i}, x_{j} \in K

, and let

γ_{i j}

be the minimizing geodesic from

x_{i}

to

x_{j}

. For any

s, t \in P_{i}

with

d_{i}^{R} (s, t) \leq r_{0}

one has the local Lipschitz bound

d_{j}^{R} (Φ_{i j} (s), Φ_{i j} (t)) \leq C d_{i}^{R} (s, t) .

(22)

Moreover, for any (nondegenerate) discrete triangle

(x_{i}, x_{j}, x_{k})

in K, define the continuous and discrete intermediate states starting from

μ_{i}

by

{\tilde{s}}_{0} : = μ_{i}, {\tilde{s}}_{1} : = {PT}_{γ_{i j}} ({\tilde{s}}_{0}), {\tilde{s}}_{2} : = {PT}_{γ_{j k}} ({\tilde{s}}_{1}), {\tilde{s}}_{3} : = {PT}_{γ_{k i}} ({\tilde{s}}_{2}),

s_{0} : = μ_{i}, s_{1} : = Φ_{i j} (s_{0}), s_{2} : = Φ_{j k} (s_{1}), s_{3} : = Φ_{k i} (s_{2}) .

Then each pair

(s_{m}, {\tilde{s}}_{m})

remains in the local regime:

d^{R} (s_{m}, {\tilde{s}}_{m}) \leq r_{0} for m = 1, 2, 3,

and, quantitatively,

d^{R} (s_{m}, {\tilde{s}}_{m}) \leq C r_{ε}^{1 + α} (m = 1, 2, 3),

(23)

where

r_{ε}

is the sampling radius.

Proof.

Step 1: uniform local equivalence and choice of

r_{0}

. Since K is compact and

μ

is smooth,

μ (K)

is compact in

P

. By Assumption 1 and smooth dependence of the fibre metrics/divergences, there exists

r_{0} > 0

and constants

0 < c_{1} \leq c_{2} < \infty

such that for any

x \in K

and any

s, t \in P_{x}

with

d_{x}^{R} (s, t) \leq r_{0}

,

c_{1} d_{x}^{R} (s, t) \leq d_{x} (s, t) \leq c_{2} d_{x}^{R} (s, t) .

(24)

Step 2: local Lipschitz of

Φ_{i j}

in

d^{R}

. Assumption 9(1) gives

D_{j} (Φ_{i j} (s), Φ_{i j} (t)) \leq C_{D} D_{i} (s, t)

. By Lemma 3, this implies

d_{j} (Φ_{i j} (s), Φ_{i j} (t)) \leq \sqrt{C_{D}} d_{i} (s, t)

. If

d_{i}^{R} (s, t) \leq r_{0}

then (24) yields

d_{j}^{R} (Φ_{i j} (s), Φ_{i j} (t)) \leq c_{1}^{- 1} d_{j} (Φ_{i j} (s), Φ_{i j} (t)) \leq c_{1}^{- 1} \sqrt{C_{D}} d_{i} (s, t) \leq c_{1}^{- 1} \sqrt{C_{D}} c_{2} d_{i}^{R} (s, t),

which is (22) with

C = c_{1}^{- 1} \sqrt{C_{D}} c_{2}

.

Step 3: staying in the local regime and the bound (23). For

m = 1

, Assumption 9(2) (applied at

s = μ_{i}

) gives

d^{R} (s_{1}, {\tilde{s}}_{1}) = d^{R} (Φ_{i j} (μ_{i}), {PT}_{γ_{i j}} (μ_{i})) \leq C_{0} d_{g} {(x_{i}, x_{j})}^{1 + α} \leq C r_{ε}^{1 + α} .

Choose

ε_{K}

so that

C r_{ε}^{1 + α} \leq r_{0}

for all

ε < ε_{K}

. Then

(s_{1}, {\tilde{s}}_{1})

lies in the local regime.

Assume inductively that

d^{R} (s_{m}, {\tilde{s}}_{m}) \leq r_{0}

and

d^{R} (s_{m}, {\tilde{s}}_{m}) \leq C r_{ε}^{1 + α}

for

m = 1, 2

. Using the triangle inequality in

d^{R}

,

\begin{matrix} d^{R} (s_{m + 1}, {\tilde{s}}_{m + 1}) & = d^{R} (Φ (s_{m}), PT ({\tilde{s}}_{m})) \\ \leq d^{R} (Φ (s_{m}), PT (s_{m})) + d^{R} (PT (s_{m}), PT ({\tilde{s}}_{m})) . \end{matrix}

The first term is bounded by Assumption 9(2):

\leq C r_{ε}^{1 + α}

. The second term equals

d^{R} (s_{m}, {\tilde{s}}_{m})

by Lemma 2. Therefore,

d^{R} (s_{m + 1}, {\tilde{s}}_{m + 1}) \leq C r_{ε}^{1 + α} + d^{R} (s_{m}, {\tilde{s}}_{m}) \leq C^{'} r_{ε}^{1 + α} .

For

ε < ε_{K}

this is

\leq r_{0}

, closing the induction. □

Lemma 7

(Discrete vs continuous holonomy). Let

K \subset M

be compact. There exist constants

C > 0

and

ε_{K} > 0

such that, for all

0 < ε < ε_{K}

, for every vertex

x_{i} \in V_{ε} \cap K

and every discrete triangle

(x_{i}, x_{j}, x_{k}) \in T_{ε}

with all vertices in K and satisfying the non-degeneracy scale condition of Assumption 11(1), the following holds. Let

A_{g} (i, j, k) : = A_{g} (x_{i}, x_{j}, x_{k})

and let

ℓ_{i j k} : = max {d_{g} (x_{i}, x_{j}), d_{g} (x_{j}, x_{k}), d_{g} (x_{k}, x_{i})}

. Then:

1.: $ℓ_{i j k} \leq C r_{ε}$ .
2.: In the fibre $P_{i}$ ,

$d_{i} ({Hol}_{i j k}^{(ε)} (μ_{i}), {Hol}_{{\tilde{γ}}_{i j k}} (μ_{i})) \leq C ρ_{ε} A_{g} (i, j, k), ρ_{ε} = r_{ε}^{α - 1} .$

(25)

Proof.

The first assertion is immediate from Assumption 5: each edge

{x_{i}, x_{j}}

has length

d_{g} (x_{i}, x_{j}) \leq c_{3} r_{ε}

, so any triangle with vertices connected by edges has all side lengths bounded by a constant multiple of

r_{ε}

. Adjusting constants yields

ℓ_{i j k} \leq C r_{ε}

.

For the second assertion, write the continuous holonomy map as a composition of continuous parallel transport along the three geodesic edges:

{Hol}_{{\tilde{γ}}_{i j k}} = {PT}_{γ_{k i}} \circ {PT}_{γ_{j k}} \circ {PT}_{γ_{i j}},

where

γ_{i j}

denotes the minimizing geodesic segment from

x_{i}

to

x_{j}

.

We use a telescoping argument. Set

s_{0} : = μ_{i}, {\tilde{s}}_{0} : = μ_{i},

s_{1} : = Φ_{i j} (s_{0}), {\tilde{s}}_{1} : = {PT}_{γ_{i j}} ({\tilde{s}}_{0}),

s_{2} : = Φ_{j k} (s_{1}), {\tilde{s}}_{2} : = {PT}_{γ_{j k}} ({\tilde{s}}_{1}),

s_{3} : = Φ_{k i} (s_{2}), {\tilde{s}}_{3} : = {PT}_{γ_{k i}} ({\tilde{s}}_{2}) .

Then

{Hol}_{i j k}^{(ε)} (μ_{i}) = s_{3}

and

{Hol}_{{\tilde{γ}}_{i j k}} (μ_{i}) = {\tilde{s}}_{3}

.

By Assumption 9(2), the edge-wise channel error is controlled in the fibre Riemannian distance

d^{R}

: after adjusting constants, for every oriented edge

(x_{a}, x_{b})

and every

s \in P_{a}

,

d_{b}^{R} (Φ_{a b} (s), {PT}_{γ_{a b}} (s)) \leq C_{0} d_{g} {(x_{a}, x_{b})}^{1 + α} .

We therefore perform the telescoping estimate entirely in the distances

d^{R}

(so that the triangle inequality holds globally), and only at the end pass back to the informational distance

d = \sqrt{2 D}

using the local equivalence of Remark 6 (which applies since the final displacement is

O (r_{ε}^{1 + α}) \to 0

).

Moreover, by Assumption 3(1), parallel transport is a fibrewise Riemannian isometry; in particular, it preserves fibre distances:

d_{b}^{R} ({PT}_{γ_{a b}} (s), {PT}_{γ_{a b}} (t)) = d_{a}^{R} (s, t) .

We estimate recursively:

d_{j}^{R} (s_{1}, {\tilde{s}}_{1}) = d_{j}^{R} (Φ_{i j} (s_{0}), {PT}_{γ_{i j}} ({\tilde{s}}_{0})) \leq C_{0} d_{g} {(x_{i}, x_{j})}^{1 + α} .

Next,

\begin{matrix} d_{k}^{R} (s_{2}, {\tilde{s}}_{2}) & = d_{k}^{R} (Φ_{j k} (s_{1}), {PT}_{γ_{j k}} ({\tilde{s}}_{1})) \\ \leq d_{k}^{R} (Φ_{j k} (s_{1}), {PT}_{γ_{j k}} (s_{1})) + d_{k}^{R} ({PT}_{γ_{j k}} (s_{1}), {PT}_{γ_{j k}} ({\tilde{s}}_{1})) \\ \leq C_{0} d_{g} {(x_{j}, x_{k})}^{1 + α} + d_{j}^{R} (s_{1}, {\tilde{s}}_{1}), \end{matrix}

and similarly,

\begin{matrix} d_{i}^{R} (s_{3}, {\tilde{s}}_{3}) & = d_{i}^{R} (Φ_{k i} (s_{2}), {PT}_{γ_{k i}} ({\tilde{s}}_{2})) \\ \leq d_{i}^{R} (Φ_{k i} (s_{2}), {PT}_{γ_{k i}} (s_{2})) + d_{i}^{R} ({PT}_{γ_{k i}} (s_{2}), {PT}_{γ_{k i}} ({\tilde{s}}_{2})) \\ \leq C_{0} d_{g} {(x_{k}, x_{i})}^{1 + α} + d_{k}^{R} (s_{2}, {\tilde{s}}_{2}) . \end{matrix}

Combining the three bounds yields

d_{i}^{R} (s_{3}, {\tilde{s}}_{3}) \leq C_{1} (d_{g} {(x_{i}, x_{j})}^{1 + α} + d_{g} {(x_{j}, x_{k})}^{1 + α} + d_{g} {(x_{k}, x_{i})}^{1 + α}) \leq C_{2} ℓ_{i j k}^{1 + α} .

Finally, since

d_{i}^{R} (s_{3}, {\tilde{s}}_{3}) = O (ℓ_{i j k}^{1 + α}) \to 0

, Remark 6 applies for

ε

small, and after adjusting constants we obtain the same bound in the informational distance:

d_{i} (s_{3}, {\tilde{s}}_{3}) \leq C_{2} ℓ_{i j k}^{1 + α} .

we show that the discrete informational sectional

Now write

ℓ_{i j k}^{1 + α} = ℓ_{i j k}^{α - 1} ℓ_{i j k}^{2} .

By the uniform non-degeneracy in Assumption 11(1), the triangle area satisfies

A_{g} (i, j, k) \geq c_{A} r_{ε}^{2}

, and since

ℓ_{i j k} \leq C r_{ε}

we also have

ℓ_{i j k}^{2} \leq C^{'} A_{g} (i, j, k)

(after adjusting constants). Hence

d_{i} (s_{3}, {\tilde{s}}_{3}) \leq C_{3} ℓ_{i j k}^{α - 1} A_{g} (i, j, k) \leq C_{4} r_{ε}^{α - 1} A_{g} (i, j, k) = C_{4} ρ_{ε} A_{g} (i, j, k),

which is exactly (25). □

6.2. A Triangle-Wise Discrete-to-Continuum Bound

Fix

ε > 0

small,

x_{i} \in V_{ε} \cap K

, and a discrete triangle

(x_{i}, x_{j}, x_{k}) \in T_{ε}

with vertices in K satisfying the non-degeneracy condition of Assumption 11(1). Recall the triangle-wise curvature

K_{hol}^{(ε)} (i, j, k) = \frac{Δ_{i j k}^{(ε)}}{A_{ε} (i, j, k)}, Δ_{i j k}^{(ε)} = d_{i} (μ_{i}, {Hol}_{i j k}^{(ε)} (μ_{i})),

and the corresponding continuous quantity

K_{hol}^{cont} (x_{i}, Π_{i j k}) = lim_{A \to 0} \frac{d_{i} (μ_{i}, {Hol}_{γ} (μ_{i}))}{A},

where

Π_{i j k} \subset T_{x_{i}} M

denotes the plane spanned by the geodesic initial directions from

x_{i}

to

x_{j}

and

x_{i}

to

x_{k}

(and

γ

is any shrinking family of geodesic triangles tangent to

Π_{i j k}

). In practice we will compare the discrete loop

γ_{i j k}

with the continuous holonomy around the geodesic triangle

Δ_{g} (x_{i}, x_{j}, x_{k})

.

Let

Δ_{i j k}^{cont} : = d_{i} (μ_{i}, {Hol}_{{\tilde{γ}}_{i j k}} (μ_{i})),

where

{\tilde{γ}}_{i j k}

is the loop traversing the geodesic edges

i \to j \to k \to i

.

Lemma 8

(Triangle-wise defect comparison). There exist constants

C > 0

and

ε_{K} > 0

such that, for all

0 < ε < ε_{K}

and all triangles

(x_{i}, x_{j}, x_{k}) \in T_{ε}

with vertices in K satisfying Assumption 11(1), one has

| Δ_{i j k}^{(ε)} - Δ_{i j k}^{cont} | \leq C ρ_{ε} A_{g} (i, j, k),

(26)

where

ρ_{ε} = r_{ε}^{α - 1}

.

Proof.

By the triangle inequality for the fibre Riemannian distance (Remark 6) and the local equivalence with

d_{i}

, we have

| Δ_{i j k}^{(ε)} - Δ_{i j k}^{cont} | \leq d_{i} ({Hol}_{i j k}^{(ε)} (μ_{i}), {Hol}_{{\tilde{γ}}_{i j k}} (μ_{i})) .

The right-hand side is bounded by Lemma 7(25), which gives (26). □

We now incorporate the area approximation. Recall from Assumption 10 that

A_{ε} (i, j, k)

satisfies

| A_{ε} (i, j, k) - A_{g} (i, j, k) | \leq q_{ε} r_{ε}^{2} .

Lemma 9

(Triangle-wise curvature comparison). There exist constants

C > 0

and

ε_{K} > 0

such that, for all

0 < ε < ε_{K}

and all triangles

(x_{i}, x_{j}, x_{k}) \in T_{ε}

with vertices in K satisfying Assumption 11(1), one has

| K_{hol}^{(ε)} (i, j, k) - \frac{Δ_{i j k}^{cont}}{A_{g} (i, j, k)} | \leq C (ρ_{ε} + q_{ε}) .

(27)

Proof.

Write

K_{hol}^{(ε)} (i, j, k) - \frac{Δ_{i j k}^{cont}}{A_{g} (i, j, k)} = \frac{Δ_{i j k}^{(ε)}}{A_{ε} (i, j, k)} - \frac{Δ_{i j k}^{cont}}{A_{g} (i, j, k)} .

Add and subtract

Δ_{i j k}^{cont} / A_{ε} (i, j, k)

:

\frac{Δ_{i j k}^{(ε)} - Δ_{i j k}^{cont}}{A_{ε} (i, j, k)} + Δ_{i j k}^{cont} (\frac{1}{A_{ε} (i, j, k)} - \frac{1}{A_{g} (i, j, k)}) .

For the first term, Lemma 8 gives

\frac{| Δ_{i j k}^{(ε)} - Δ_{i j k}^{cont} |}{A_{ε} (i, j, k)} \leq C ρ_{ε} \frac{A_{g} (i, j, k)}{A_{ε} (i, j, k)} .

By Assumption 11(1) and Assumption 10,

A_{ε} (i, j, k)

is uniformly comparable to

A_{g} (i, j, k)

from below and above (for

ε

small), hence the ratio is bounded and the first term is

O (ρ_{ε})

.

For the second term, note that by the holonomy expansion (Lemma 5) and the distance expansion (Lemma 4), there is a uniform constant C such that

Δ_{i j k}^{cont} = d_{i} (μ_{i}, {Hol}_{{\tilde{γ}}_{i j k}} (μ_{i})) \leq C A_{g} (i, j, k),

for triangles in K sufficiently small. Therefore,

| Δ_{i j k}^{cont} (\frac{1}{A_{ε}} - \frac{1}{A_{g}}) | \leq C A_{g} (i, j, k) \frac{| A_{ε} - A_{g} |}{A_{ε} A_{g}} .

Using Assumption 10 and the uniform lower bound

A_{g} (i, j, k) \geq c_{A} r_{ε}^{2}

(Assumption 11(1)), together with the comparability

A_{ε} \geq \frac{c_{A}}{2} r_{ε}^{2}

, we obtain that this term is

O (q_{ε})

. Combining the bounds yields (27). □

6.3. From Triangle-Wise to Sectional Curvature by Averaging

We now pass from the triangle-wise comparison to the averaged sectional quantity

K_{hol}^{(ε)} (x, Π)

defined in (7).

Fix

x \in K

and a two-plane

Π \subset T_{x} M

. By Assumption 11, for

ε

sufficiently small we can choose a vertex

x_{i} \in V_{ε}

with

d_{g} (x, x_{i}) \leq c_{2} r_{ε}

and a finite non-empty family

T_{ε} (x_{i}, Π)

of triangles based at

x_{i}

satisfying:

uniform scale and non-degeneracy at scale $r_{ε}$ ;
planarity up to $η_{ε}$ with respect to $Π$ ;
approximate directional isotropy in $Π$ .

We write the discrete sectional curvature as an area-weighted average:

K_{hol}^{(ε)} (x, Π) = \frac{\sum_{(x_{i}, x_{j}, x_{k}) \in T_{ε} (x_{i}, Π)} Δ_{i j k}^{(ε)}}{\sum_{(x_{i}, x_{j}, x_{k}) \in T_{ε} (x_{i}, Π)} A_{ε} (i, j, k)} .

Similarly, define the corresponding continuous average over the same geodesic triangles:

{\tilde{K}}_{hol}^{cont} (x_{i}, Π) : = \frac{\sum_{(x_{i}, x_{j}, x_{k}) \in T_{ε} (x_{i}, Π)} Δ_{i j k}^{cont}}{\sum_{(x_{i}, x_{j}, x_{k}) \in T_{ε} (x_{i}, Π)} A_{g} (i, j, k)} .

Lemma 10

(Averaging stability). There exist constants

C > 0

and

ε_{K} > 0

such that, for all

0 < ε < ε_{K}

, for every

x \in K

and every two-plane

Π \subset T_{x} M

,

| K_{hol}^{(ε)} (x, Π) - {\tilde{K}}_{hol}^{cont} (x_{i}, Π) | \leq C (ρ_{ε} + q_{ε}) .

(28)

Proof.

By Lemma 9, for each triangle in the family we have

| \frac{Δ_{i j k}^{(ε)}}{A_{ε} (i, j, k)} - \frac{Δ_{i j k}^{cont}}{A_{g} (i, j, k)} | \leq C (ρ_{ε} + q_{ε}) .

Multiplying by

A_{ε} (i, j, k)

and summing over the family gives

| \sum Δ_{i j k}^{(ε)} - \sum Δ_{i j k}^{cont} \frac{A_{ε} (i, j, k)}{A_{g} (i, j, k)} | \leq C (ρ_{ε} + q_{ε}) \sum A_{ε} (i, j, k) .

Using uniform comparability

A_{ε} ≃ A_{g}

on the family (Assumption 11(1) and Assumption 10), we may replace

A_{ε} / A_{g}

by 1 at the cost of an additional

O (q_{ε})

relative error. Dividing by the denominators and using again that

\sum A_{ε} ≃ \sum A_{g}

yields (28). □

We now compare

{\tilde{K}}_{hol}^{cont} (x_{i}, Π)

with the intrinsic continuous quantity

K_{hol}^{cont} (x, Π)

.

Lemma 11

(Planarity and base-point stability). There exist constants

C > 0

and

ε_{K} > 0

such that, for all

0 < ε < ε_{K}

, for every

x \in K

and every two-plane

Π \subset T_{x} M

,

| {\tilde{K}}_{hol}^{cont} (x_{i}, Π) - K_{hol}^{cont} (x, Π) | \leq C (r_{ε} + η_{ε}) .

(29)

Proof.

First, by smoothness of

K_{hol}^{cont}

in

(x, Π)

(Theorem 1 and smooth dependence of

W_{x} (Π; μ_{x})

), moving the base point from x to

x_{i}

induces an error bounded by

C d_{g} (x, x_{i}) \leq C r_{ε}

.

Second, by Assumption 11(2), the planes

Π_{i j k}

spanned by the geodesic directions of each triangle in the family are within angle

η_{ε}

of

Π

. Again by smoothness in

Π

(uniform on K), replacing

Π_{i j k}

by

Π

induces an additional error

O (η_{ε})

.

Finally, the quantity

{\tilde{K}}_{hol}^{cont} (x_{i}, Π)

is an average over finitely many triangles of uniformly comparable shape and size

O (r_{ε})

, hence the above pointwise stability bounds propagate to the average with the same order. □

6.4. Conclusion of the Proof of Theorem 2

Combining Lemmas 10 and 11, we obtain

| K_{hol}^{(ε)} (x, Π) - K_{hol}^{cont} (x, Π) | \leq C (ρ_{ε} + q_{ε} + r_{ε} + η_{ε}),

uniformly for

x \in K

and

Π \subset T_{x} M

. Recalling the definition

κ_{ε} = r_{ε} + η_{ε} + q_{ε} + ρ_{ε}

from (10), this is exactly (11). This completes the proof of Theorem 2.

7. Examples and Model Constructions

In this section we discuss several model constructions that illustrate the notion of informational holonomy curvature and the hypotheses of our convergence theorem. We first describe a basic classical choice of state space and divergence (the Jensen–Shannon model), then introduce a natural geometric state bundle built from tangent distributions and parallel transport, and finally discuss spaces of constant curvature and discrete sampling schemes.

7.1. The Classical Jensen–Shannon Model

We begin with a simple and concrete choice of state space and divergence, which fits into the general framework of Section 2 and Section 4.

Fix a finite set

Ω = {1, \dots, m}, m \geq 2,

and let

S : = Δ^{\circ} (Ω) = \{p \in R^{m} : p_{i} > 0, \sum_{i = 1}^{m} p_{i} = 1\}

denote the open probability simplex. We endow S with:

the Fisher information metric $g^{Fisher}$ (restricted to S),
the Jensen–Shannon divergence

$D_{JS} (p, q) : = H (\frac{p + q}{2}) - \frac{1}{2} H (p) - \frac{1}{2} H (q), H (p) = - \sum_{i = 1}^{m} p_{i} log p_{i} .$

It is well known that

D_{JS}

is symmetric and non-negative and that

\sqrt{D_{JS}}

defines a genuine metric on S [3]. With our normalization

d = \sqrt{2 D}

, we set

d_{JS} (p, q) : = \sqrt{2 D_{JS} (p, q)}

which is also a genuine metric on S. Moreover, the second-order expansion of

D_{JS}

at the diagonal yields a constant multiple of the Fisher metric:

D_{JS} (p, q) = \frac{c_{JS}}{2} g_{p}^{Fisher} (v, v) + O ({∥ v ∥}_{g^{Fisher}}^{3}),

where

v \in T_{p} S

is the tangent vector such that

q = {exp}_{p}^{S} (v)

and

c_{JS} > 0

is a constant.

Thus Assumption 1 is satisfied with

g^{S}

proportional to the Fisher metric and

D = D_{JS}

. In particular, the informational distance

d (s, t) = \sqrt{2 D (s, t)}

is locally equivalent to the Riemannian distance induced by

g^{Fisher}

on S.

Given a Riemannian manifold

(M, g)

, the simplest associated state bundle is the trivial bundle

π : P = M \times S \to M,

with fibre

(S, g^{S}, D_{JS})

independent of

x \in M

. To obtain a non-trivial informational holonomy curvature, however, one needs a connection on

P

whose curvature reflects the geometry of

(M, g)

. The trivial product connection on

M \times S

has zero curvature and yields vanishing holonomy and hence vanishing informational holonomy curvature. Thus, in interesting examples, the state bundle and its connection must be constructed from the Levi–Civita connection in a non-trivial manner. In this trivial product situation the connection on

P

is taken to be the product of the Levi–Civita connection on

(M, g)

with the trivial connection on S, so that parallel transport acts as the identity on the fibre. Consequently, the Jensen–Shannon divergence is exactly preserved under parallel transport, i.e.

D_{JS} ({PT}_{γ} (s), {PT}_{γ} (t)) = D_{JS} (s, t)

for every curve

γ

in M and every

s, t \in S

. Thus Assumption 3 holds with

C_{K} = 0

, and Assumption 9(2) may be realised with vanishing channel-consistency error: in this example the term

ρ_{ε}

in (10) does not contribute to the bound of Theorem 2 if the discrete channels

Φ_{i j}

are chosen to coincide with the exact parallel transport on

P

.

7.2. A Geometric State Bundle from Tangent Distributions

We now describe a natural geometric construction in which the state bundle is built from probability distributions on tangent spaces and the connection is induced by parallel transport in

(M, g)

.

Let

(M, g)

be a smooth Riemannian manifold of dimension n. For each

x \in M

, consider the tangent space

T_{x} M

with its Euclidean inner product

g_{x}

. Let

S_{x} : = P (T_{x} M)

be a chosen smooth manifold of probability measures on

T_{x} M

, for instance:

the manifold of non-degenerate Gaussian measures on $T_{x} M$ ,
or a finite-dimensional exponential family of probability measures with smooth densities with respect to Lebesgue measure on $T_{x} M$ .

To fix ideas, one may take

S_{x}

to be the set of Gaussian measures

N (m, Σ)

on

T_{x} M

, with mean

m \in T_{x} M

and covariance matrix

Σ

in some fixed compact subset of the positive definite cone.

We define the state bundle

π : P \to M, P_{x} = S_{x},

by gluing the fibres

S_{x}

smoothly via the tangent bundle structure. The Riemannian metric

g^{P_{x}}

on each fibre is taken to be the Fisher information metric associated with the chosen statistical model on

T_{x} M

, and the divergence

D_{x}

is taken to be the Jensen–Shannon divergence between distributions in

S_{x}

.

To define the connection on

P

, we use parallel transport in

(M, g)

. Let

γ : [0, 1] \to M

be a smooth curve with

γ (0) = x

and

γ (1) = y

, and let

P_{γ}^{g} : T_{x} M \to T_{y} M

denote the parallel transport map associated with the Levi–Civita connection of g. We define the parallel transport on

P

along

γ

by pushing forward measures under

P_{γ}^{g}

:

{PT}_{γ} : P_{x} \to P_{y}, {PT}_{γ} (ν) : = {(P_{γ}^{g})}_{*} ν .

In particular, if

ν = N (m, Σ) \in S_{x}

is Gaussian, then

{PT}_{γ} (ν) = N (P_{γ}^{g} m, P_{γ}^{g} \circ Σ \circ {(P_{γ}^{g})}^{- 1}) \in S_{y},

so that the family

{(S_{x})}_{x \in M}

is preserved by parallel transport. The resulting parallel transport maps satisfy the functoriality conditions (4), and thus define an Ehresmann connection on

π : P \to M

.

The curvature of this connection is induced by the curvature of

(M, g)

: the curvature of the Levi–Civita connection acts on

T_{x} M

via the Riemann curvature tensor

R_{x}^{g}

, and this, in turn, induces a curvature 2-form

Ω

on

P

by differentiation of the pushforward action on

S_{x}

. In particular, if the base manifold

(M, g)

has zero curvature, then the induced connection on

P

is flat and the informational holonomy curvature vanishes.

The reference state field

μ : M \to P

can be chosen, for instance, as the isotropic Gaussian with mean

0 \in T_{x} M

and covariance

σ^{2} Id

at each

x \in M

, for some fixed

σ > 0

. This is invariant under orthogonal transformations of

T_{x} M

, which simplifies the structure of

W_{x} (Π; μ_{x})

.

Under this construction, Assumptions 1 and 3 are satisfied: the Jensen–Shannon divergence on each

S_{x}

induces the Fisher metric, the pushforward by isometries preserves the Fisher metric and the second-order expansion of

D_{x}

, and the connection on

P

is metric along the fibres. Thus the continuous informational holonomy curvature

K_{hol}^{cont} (x, Π)

is well defined and determined by the curvature tensor

R_{x}^{g}

.

7.3. Spaces of Constant Curvature

We now consider the case where

(M, g)

has constant sectional curvature and specialise the construction of the previous subsection. The aim is to illustrate how the informational holonomy curvature reflects the constant curvature of the base manifold.

Let

(M, g)

be complete, simply connected, and of constant sectional curvature

κ \in R

. Thus

(M, g)

is isometric to the Euclidean space

R^{n}

(if

κ = 0

), the round sphere

S^{n}

(if

κ > 0

), or the hyperbolic space

H^{n}

(if

κ < 0

). The Riemann curvature tensor satisfies

R_{x}^{g} (u, v) w = κ (〈 v, w 〉 u - 〈 u, w 〉 v),

for all

x \in M

and

u, v, w \in T_{x} M

, where

〈 \cdot, \cdot 〉

is the Riemannian inner product.

We equip M with the geometric state bundle

P

of tangent distributions described in Section 7.2, with fibres consisting of Gaussian measures on

T_{x} M

and divergence given by the Jensen–Shannon divergence. We choose the reference state field

μ_{x}

to be the isotropic Gaussian

N (0, σ^{2} Id)

on

T_{x} M

, with fixed

σ > 0

independent of x.

The isotropy of

(M, g)

implies that, for any

x \in M

and any two 2-planes

Π, Π^{'} \subset T_{x} M

, there exists an isometry of

(M, g)

mapping

(x, Π)

to

(x, Π^{'})

. The induced action on the state bundle

P

preserves the connection, the fibre metric and the divergence, and sends

μ_{x}

to itself. Thus, for each fixed x, the map

Π ⟼ W_{x} (Π; μ_{x})

must have constant norm on the Grassmannian of 2-planes at x, and this norm can depend only on

κ

and on the parameters of the state bundle (e.g.

σ

and the choice of divergence). In particular, there exists a constant

c_{hol} (κ, σ) > 0

such that

∥ W_{x} (Π; μ_{x}) ∥_{g^{P_{x}}} = c_{hol} (κ, σ)

for all

x \in M

and all 2-planes

Π \subset T_{x} M

.

When

κ = 0

, the Levi–Civita connection is flat and the parallel transport maps

P_{γ}^{g}

along closed loops are the identity. Hence the induced connection on

P

is flat, the holonomy maps on

P

are trivial, and

W_{x} (Π; μ_{x}) = 0

, so

c_{hol} (0, σ) = 0 .

Alternatively, one can considerFor

κ \neq 0

, the curvature of the Levi–Civita connection is non-zero and so is the curvature of the induced connection on

P

; therefore,

W_{x} (Π; μ_{x})

is non-zero and

c_{hol} (κ, σ) > 0

.

By Theorem 1, the continuous informational sectional curvature is

K_{hol}^{cont} (x, Π) = ∥ W_{x} (Π; μ_{x}) ∥_{g^{P_{x}}} = c_{hol} (κ, σ),

which is constant in

(x, Π)

. In particular, the informational holonomy curvature detects the constant curvature of

(M, g)

modulo the scale factor

c_{hol} (κ, σ)

coming from the choice of state bundle and divergence. In spaces of constant curvature,

K_{hol}^{cont}

is thus a constant multiple of

| κ |

.

7.4. Discrete Sampling Schemes

Finally, we discuss concrete choices of sampling graphs, areas and triangle families that satisfy the assumptions of Section 3 and enable the application of Theorem 2.

Quasi-uniform point clouds

Let

(M, g)

be a compact Riemannian manifold. For each small parameter

ε > 0

, consider a finite set of points

V_{ε} \subset M

obtained, for example, by:

a deterministic quasi-uniform mesh (e.g. a geodesic triangulation or a regular grid in local charts), or
an i.i.d. sample of points with respect to the Riemannian volume measure, followed by a thinning procedure to enforce minimal separation.

Under natural mesh regularity conditions or with high probability under suitable random sampling, one can ensure that

V_{ε}

satisfies the separation and covering properties of Assumption 4 with

r_{ε} \sim {| V_{ε} |}^{- 1 / n}

. For example, in the random setting, results from geometric probability show that the typical spacing between neighbouring points is of order

| V_{ε} |^{- 1 / n}

and that an appropriate choice of thresholds yields a Delone set.

Neighbour graphs and edges

Given

V_{ε}

, one natural choice of graph is the

ρ_{ε}

-neighbourhood graph: for a suitable radius

R_{ε}

satisfying

c_{-} r_{ε} \leq R_{ε} \leq c_{+} r_{ε},

one sets

E_{ε} : = \{{x_{i}, x_{j}} : d_{g} (x_{i}, x_{j}) \leq R_{ε}\} .

Alternatively, one can consider k-nearest neighbour graphs with k fixed or slowly increasing as

ε \to 0

. Under standard conditions, these constructions satisfy Assumption 5: edges connect points at distance of order

r_{ε}

, and the vertex degrees are uniformly bounded.

Discrete areas and triangle families

Given a sampling graph

G_{ε} = (V_{ε}, E_{ε})

embedded in M, one can define the set of discrete triangles

T_{ε}

as in Definition 9. For each triangle

(x_{i}, x_{j}, x_{k}) \in T_{ε}

, the discrete area

A_{ε} (i, j, k)

can be chosen, for example, as:

the Euclidean area of the triangle formed by the images of $x_{i}, x_{j}, x_{k}$ in a normal coordinate chart centred at $x_{i}$ ;
or the area of a piecewise flat triangle obtained by approximating the metric g locally by its value at $x_{i}$ .

In both cases, standard Taylor expansions in normal coordinates show that

| A_{ε} (i, j, k) - A_{g} (i, j, k) | \leq C ℓ_{i j k}^{3} \leq C^{'} r_{ε}^{3},

so Assumption 10 is satisfied with

q_{ε} \sim r_{ε}

.

Families of triangles

T_{ε} (x_{i}, Π)

satisfying Assumption 11 can be constructed by selecting, for each vertex

x_{i}

and each approximate direction in

T_{x_{i}} M

, a finite number of neighbouring vertices whose geodesic directions approximate a given 2-plane

Π \subset T_{x_{i}} M

in an approximately isotropic fashion. In random sampling models, the law of large numbers ensures that the empirical distribution of edge directions becomes asymptotically isotropic, with deviations captured by a parameter

η_{ε} \to 0

.

Discrete channels from continuous transport

Finally, the discrete channels

Φ_{i j} : P_{i} \to P_{j}

on edges

(x_{i}, x_{j})

can be defined by approximating the continuous paralleland local equivalence of transport maps

{PT}_{γ_{i j}}

along the minimizing geodesics

γ_{i j} : [0, 1] \to M

. In the geometric state bundle of Section 7.2, this amounts to approximating the pushforward of tangent distributions by the parallel transport

P_{γ_{i j}}^{g} : T_{x_{i}} M \to T_{x_{j}} M

.

For instance, one can set

Φ_{i j} : = {PT}_{γ_{i j}},

whenever

γ_{i j}

is uniquely defined and computable, in which case Assumption 9(2) holds with

ρ_{ε} = 0

. In numerical settings where

γ_{i j}

and

P_{γ_{i j}}^{g}

are approximated by finite-difference schemes or local polynomial approximations of g, consistency estimates of the form

d_{j} (Φ_{i j} (s), {PT}_{γ_{i j}} (s)) \leq C d_{g} {(x_{i}, x_{j})}^{1 + α}

can be obtained for suitable

α > 1

, leading to a non-zero but convergent

ρ_{ε}

.

Under these constructions, all the assumptions of Section 3 and Section 4 are satisfied, and Theorem 2 applies. Thus the discrete informational sectional curvature

K_{hol}^{(ε)} (x, Π)

computed from sampling graphs, approximate geodesic triangles, and discrete channels converges, as

ε \to 0

, to the continuous informational sectional curvature

K_{hol}^{cont} (x, Π)

determined by the geometric data

(M, g, P, μ)

.

8. Discussion and Outlook

The constructions developed in this work provide a framework for defining and estimating curvature from informational holonomy. Starting from a Riemannian manifold

(M, g)

and a state bundle

π : P \to M

endowed with fibrewise divergences and a compatible connection, we defined a continuous informational holonomy curvature

K_{hol}^{cont} (x, Π)

associated with a point

x \in M

and a two-plane

Π \subset T_{x} M

by measuring, via the informational distance induced by the divergence, the leading (area–linear) effect of transporting a reference state around small geodesic triangles. We then showed that, under explicit sampling, area-approximation, and channel-consistency assumptions, a purely discrete estimator

K_{hol}^{(ε)} (x, Π)

constructed on graphs embedded in M converges to

K_{hol}^{cont} (x, Π)

as the sampling scale

ε \to 0

, with a quantitative error bound controlled by the discretization scale.

8.1. Summary of the Framework

The continuous construction hinges on three ingredients:

a state bundle $π : P \to M$ whose fibres represent informational states (classical or quantum), equipped with a fibre Riemannian metric and a divergence whose second-order expansion induces this metric;
an Ehresmann connection on $P$ compatible with the fibre metrics and divergences, so that parallel transport acts as a fibrewise isometry to first order and preserves the informational structure infinitesimally;
a reference state field $μ : M \to P$ serving as a basepoint for measuring informational defects.

Holonomy of the connection on

P

along small geodesic triangles based at x and tangent to

Π

produces a displacement of

μ_{x}

in

P_{x}

which, by holonomy–curvature expansions, is proportional to the triangle area to first order. The informational distance associated with the fibre divergence then yields a scalar quantity per unit area, identified as the continuous informational sectional curvature

K_{hol}^{cont} (x, Π)

(Theorem 1).

On the discrete side, we considered quasi-uniform sampling graphs

{(G_{ε})}_{ε > 0}

on M, endowed with:

discrete fibres $P_{i}$ and divergences $D_{i}$ at vertices $x_{i} \in V_{ε}$ approximating the continuous fibres and divergences;
edge channels $Φ_{i j} : P_{i} \to P_{j}$ approximating continuous parallel transport along short geodesic segments;
discrete areas $A_{ε} (i, j, k)$ for triangles and triangle families $T_{ε} (x_{i}, Π)$ that are asymptotically isotropic in prescribed directions.

The resulting discrete holonomy operators

{Hol}_{i j k}^{(ε)}

yield distance defects

Δ_{i j k}^{(ε)}

, and the triangle-wise curvatures

K_{hol}^{(ε)} (i, j, k)

are defined by normalising by

A_{ε} (i, j, k)

. Averaging over

T_{ε} (x_{i}, Π)

produces a discrete sectional curvature

K_{hol}^{(ε)} (x, Π)

which converges, at a rate governed by

κ_{ε}

, to the continuous quantity

K_{hol}^{cont} (x, Π)

.

8.2. Relation to Classical and Discrete Curvature Notions

The informational holonomy curvature sits at the intersection of several strands of work on curvature:

Riemannian sectional curvature. In classical Riemannian geometry, sectional curvature can be characterised in terms of angle defects, Jacobi fields, or holonomy of the Levi–Civita connection. Our framework replaces the linear tangent bundle by a (generally non-linear) state bundle and linear norms by informational distances induced by divergences. When the connection on $P$ is induced by the Levi–Civita connection via a linear isometric representation, the vector $W_{x} (Π; μ_{x})$ in Theorem 1 is obtained as a linear image of the restriction of $R_{x}^{g}$ to $Π$ , and $K_{hol}^{cont} (x, Π) = ∥ W_{x} (Π; μ_{x}) ∥$ becomes an invariant of this restriction. In spaces of constant sectional curvature, this reduces to a constant multiple of $| \sec_{g} (x, Π) |$ (equivalently, of $| κ |$ when $\sec_{g} \equiv κ$ ).
Discrete and combinatorial curvature. Various notions of curvature for graphs and discrete spaces have been proposed, including Ollivier–Ricci curvature, Forman curvature, and Regge-type discretisations. The discrete informational holonomy curvature differs from these in two key aspects: it is based on holonomy of a bundle connection (rather than on pairwise comparisons of neighbourhood measures or purely combinatorial angle/defect data), and it uses divergences on state spaces attached to vertices (rather than solely distances in the ambient manifold or graph). In particular, it blends geometric information about $(M, g)$ with an informational structure in the fibres.
Curvature in information geometry. In information geometry, Fisher metrics and $α$ -connections yield Riemannian and affine structures on statistical manifolds, and their curvature encodes statistical properties of models. The present construction can be viewed as a “mixed” curvature: it is controlled by the curvature of a connection on a state bundle over a geometric base, while the informational structure enters via the choice of fibre divergence and reference state. The Jensen–Shannon model of Section 7.1 provides a particularly transparent example where the divergence has a direct information-theoretic meaning.

8.3. Limitations and Choices of State Bundle

The informational holonomy curvature is not a curvature of

(M, g)

alone: it depends on the choice of state bundle, divergence, connection, and reference state field. Different choices can therefore produce different curvature functionals over the same base manifold. This flexibility is both a strength and a limitation.

On the one hand, it allows the notion of curvature to be adapted to an application: classical probability distributions on finite sets with Jensen–Shannon divergence, Gaussian distributions on tangent spaces, or quantum density matrices with quantum Jensen–Shannon or other quantum divergences all fit naturally into the framework. On the other hand, it raises the question of which choices are canonical, or geometrically natural, for a given problem.

A natural option in a purely geometric setting is the geometric state bundle built from tangent distributions (Section 7.2), whose connection is induced canonically by parallel transport in

(M, g)

and whose fibres behave naturally under base isometries. In data-driven or statistical settings, other choices may be more appropriate, for instance, state spaces encoding empirical distributions of local observations, feature vectors, or structured data attached to points in M.

From a foundational perspective, the present work treats the state bundle and its connection as given. Understanding how to construct such bundles in a canonical or data-driven way, and how the resulting

K_{hol}^{cont}

varies across different constructions, remain interesting open questions.

8.4. Potential Applications and Further Directions

We conclude by mentioning several directions in which the informational holonomy curvature framework may be developed further.

Data analysis and manifold learning.

In applications where only a point cloud in M is observed, possibly together with empirical distributions or feature states at each point, the discrete curvature

K_{hol}^{(ε)} (x, Π)

provides a way to estimate curvature-like quantities that combine geometric and informational structure. Compared to purely metric estimators based on distances or angles, informational holonomy curvature incorporates how local states are transported along the graph via channels, which may reflect dynamics, diffusion, or parallel transport in latent spaces. Analysing statistical properties and robustness of such estimators in the presence of noise and finite-sample effects is a natural next step.

Other divergences and connections.

While we focused on divergences whose second-order expansion induces a Riemannian metric (e.g. Jensen–Shannon), one could consider more general f-divergences or Bregman divergences, potentially leading to non-Riemannian local geometry on fibres. Extending the holonomy curvature construction to such settings would require an appropriate notion of distance defect and a careful analysis of higher-order terms. Similarly, one may study families of connections on

P

(for example,

α

-connections in information geometry) and compare the corresponding informational holonomy curvatures.

Ricci-type and scalar informational curvatures.

The present work is focused on sectional-type curvature attached to two-planes. In analogy with Riemannian geometry, one may seek informational versions of Ricci and scalar curvature. One possibility is to average

K_{hol}^{cont} (x, Π)

over the Grassmannian of 2-planes at x with respect to a suitable measure, obtaining a scalar quantity

K_{hol}^{scal} (x)

, and relating it to classical scalar curvature or to information-theoretic quantities such as entropy production or functional inequalities. On the discrete side, different averaging schemes over triangle families may yield Ricci-type informational curvatures along edges or preferred directions in the graph.

Quantum and non-commutative models.

The state-bundle viewpoint naturally accommodates quantum state spaces, where fibres consist of density matrices on finite-dimensional Hilbert spaces and divergences are given by quantum generalisations of Jensen–Shannon or relative entropy. In such settings, the connection on

P

may encode both geometric parallel transport and quantum channels acting along paths in M. Extending the convergence analysis to non-commutative state bundles, and understanding how informational holonomy curvature reflects underlying quantum geometric structure, are promising directions.

Algorithmic and numerical aspects.

From a practical perspective, computing

K_{hol}^{(ε)} (x, Π)

requires:

constructing a sampling graph and identifying triangle families $T_{ε} (x_{i}, Π)$ ;
specifying discrete channels $Φ_{i j}$ and evaluating their composition along loops;
computing divergences and distances in the fibre state spaces.

Each of these steps has algorithmic consequences, and different applications may favour different trade-offs between accuracy and complexity. Designing efficient algorithms for informational holonomy curvature in high-dimensional state spaces, and testing them on simulated and real data, would help assess the practical relevance of the notion.

In summary, the informational holonomy curvature introduced here provides a bridge between classical Riemannian curvature, graph-based approximations, and information-theoretic structures on state spaces. It offers a geometrically grounded way of measuring how “information” twists under transport around small loops. The results of this paper establish its mathematical foundation and discrete-to-continuous consistency; its full potential will likely emerge in concrete applications and in further theoretical developments linking geometry, probability and information.

Author Contributions

The author contributed solely to all aspects of this work: conceptualization, methodology, formal analysis, writing, and revision.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The author declares no conflict of interest.

References

Amari, S.; Nagaoka, H. Methods of Information Geometry; Translations of Mathematical Monographs, Vol. 191; American Mathematical Society: Providence, RI, USA, 2000.
do Carmo, M.P. Riemannian Geometry; Birkhäuser: Boston, MA, USA, 1992. [Google Scholar]
Endres, D.M.; Schindelin, J.E. A new metric for probability distributions. IEEE Trans. Inf. Theory 2003, 49, 1858–1860. [Google Scholar] [CrossRef]
Forman, R. Bochner’s method for cell complexes and combinatorial Ricci curvature. Discrete Comput. Geom. 2003, 29, 323–374. [Google Scholar] [CrossRef]
Kobayashi, S.; Nomizu, K. Foundations of Differential Geometry; Interscience Publishers: New York, NY, USA, 1963; Vol. I. [Google Scholar]
Lin, J. Divergence measures based on the Shannon entropy. IEEE Trans. Inf. Theory 1991, 37, 145–151. [Google Scholar] [CrossRef]
Ollivier, Y. Ricci curvature of Markov chains on metric spaces. J. Funct. Anal. 2009, 256, 810–864. [Google Scholar] [CrossRef]
Regge, T. General relativity without coordinates. Nuovo Cimento 1961, 19, 558–571. [Google Scholar] [CrossRef]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Informational Holonomy Curvature and Its Discrete–to–Continuous Convergence

Abstract

Keywords:

Subject:

1. Introduction

Main Contributions

Relation to Previous Work

Organization of the Paper

2. Geometric and Informational Background

2.1. Riemannian Preliminaries

2.2. State Spaces and Informational Divergences

2.3. State Bundles over a Riemannian Manifold

2.4. Connections and Continuous Informational Transport

2.5. Informational holonomy in the continuous setting

3. Discrete Sampling, Channels and Holonomy

3.1. Sampling Graphs on a Riemannian Manifold

3.2. Discrete State Spaces and Divergences

3.3. Discrete Channels and Local Consistency

3.4. Discrete Holonomy Operators

3.5. Triangle Geometry and Area Approximation

3.6. Families of Triangles Approximating Two-Planes

4. Informational Holonomy Curvature: Definitions

4.1. Informational Distances and Defects for Discrete Loops

4.2. Discrete Informational Holonomy Curvature of Triangles

4.3. Continuous Informational Holonomy Curvature Revisited

4.4. Main Curvature Theorems

Overview of assumptions.

5. Continuous Holonomy Curvature and Connection Curvature

5.1. Local Expansion of the Informational Distance

5.2. Curvature of the Connection and Small-Loop Holonomy

5.3. Proof of Theorem 1

5.4. Geometric Models Induced from the Levi–Civita Connection

6. Discrete-to-Continuous Convergence of Informational Holonomy Curvature

6.1. Comparison of Discrete and Continuous Holonomy on Small Triangles

6.2. A Triangle-Wise Discrete-to-Continuum Bound

6.3. From Triangle-Wise to Sectional Curvature by Averaging

6.4. Conclusion of the Proof of Theorem 2

7. Examples and Model Constructions

7.1. The Classical Jensen–Shannon Model

7.2. A Geometric State Bundle from Tangent Distributions

7.3. Spaces of Constant Curvature

7.4. Discrete Sampling Schemes

Quasi-uniform point clouds

Neighbour graphs and edges

Discrete areas and triangle families

Discrete channels from continuous transport

8. Discussion and Outlook

8.1. Summary of the Framework

8.2. Relation to Classical and Discrete Curvature Notions

8.3. Limitations and Choices of State Bundle

8.4. Potential Applications and Further Directions

Data analysis and manifold learning.

Other divergences and connections.

Ricci-type and scalar informational curvatures.

Quantum and non-commutative models.

Algorithmic and numerical aspects.

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

MDPI Initiatives

Important Links

Subscribe