2. Dually Flat Manifolds
We briefly recall the notions of differential geometry useful for introducing the definition of a dually flat manifold, a central object in information geometry. We keep the exposition to a minimum, without proofs, essentially following [
3]. Other important introductory references are [
5,
7,
9].
Let
be a
n-dimensional smooth manifold with
g a Riemannian metric. In a local chart
the coordinates of a point
are
and the canonical basis is
. We equip
M with a linear connection ∇ whose expression in the local chart is given by the Christoffel symbols
as
Throughout the paper we adopt the convention that repeated indices are summed. The torsion
of the connection ∇ is defined as
where
are vector fields on
M and
is the Lie bracket. The connection ∇ is said to be compatible with the metric
g or simply metric if
If the connection is metric, the parallel transport defined by ∇ is an isometry, preserving the length and the angle between vectors. A celebrated theorem in differential geometry states that in a Riemannian manifold there exists a unique metric connection which is torsion-free
, the so-called Levi-Civita connection
. For later use we introduce the curvature defined by the connection ∇
The connection ∇ is flat if the associated curvature is identically zero. This is a coordinate-free condition and the Christoffel symbols are zero in every coordinate chart. Moreover, there exists a local affine coordinate system with the property that every canonical vector of the basis can be parallel transported along the coordinate line so each coordinate line is a geodesic.
The above notion (
1) of compatibility between the metric
g and a connection ∇ is generalized in Information Geometry using the following
Definition 1.
The connections ∇
and are called conjugate or dual with respect to the metric in if
One can show [
3] that the connection
dual to ∇ is unique. Moreover, in [
3] it is proved that if
are dual and both torsion-free, then
if and only if
. This motivate the following
Definition 2. A Riemannian manifold equipped with two connections which are dual and torsion-free is called dually flat when , i.e. both connections are flat.
As a consequence, there exists on
M two affine local coordinate systems
and
with the property that the vectors
of the basis are ∇-parallel transported along the coordinate lines
and the vectors
are
-parallel transported along the coordinate lines
. The coordinates
and
are called dual or bi-orthogonal because it holds that
where
is the Kronecker symbol. Note that the two basis are orthogonal each other but the vectors of each base need not to be orthogonal. More in detail, if we consider the representation of the metric
g in the two basis
it holds that the two matrices are in the relation
. Finally, let
and
be the transition maps connecting the representation of a point
in the two coordinate charts. Note that they are one the inverse of the other. Using the orthogonality condition
one can prove that their tangent maps coincide with the representation of the metric
g in the two coordinate basis, that is
We now show that
Proposition 1.
The change of coordinate and can be represented using the so-called potential scalar functions and in the sense that
Proof. We prove the first relation of (
3), the proof for the second follows the same pattern. We need to prove that the one form
is exact, that is there exists a scalar function
such that
. A necessary (and sufficient condition if the manifold is simply connected) is that
. Then we need to check that
that is
Now, using (
2), the above relation becomes
which is true. □
Note that the relations (
2) and (
3) together imply that the potentials
and
are convex function. Indeed we have
i.e. that the Hessian matrix of
is positive definite. Since the metric
g is the Hessian of a function, the dually flat manifold is an example of an Hessian manifold. We do not pursue this line of investigation, see [
9] for further details.
Note that the existence of a dual coordinate system induces stringent conditions on the metric
g: indeed by deriving (
2) and using Schwartz condition on mixed second order derivative we have that
Another remarkable consequence is the following, which is a classical result of Legendre transform theory: since the change of coordinate maps are gradient maps, their inverse are also gradient maps and they are related by the Legendre condition
Indeed, using the one-form notation, we have
2.1. Mutually Dual Foliations of a Dually Flat Space
A remarkable fact of dually flat manifolds is that they can admit a couple of orthogonal foliations. Let us consider an
n-dimensional dually flat manifold
and let
,
,
, be a couple of bi-orthogonal affine local coordinate systems. Given a
k with
we partition the coordinates in two subsets
and we consider the coordinate projections
and
. For every
and
we define the submanifolds of
M
In [
3] is is proved that since
M is dually flat and the subsets
and
are defined as affine subspaces of an affine coordinate system , they are totally geodesic submanifolds with respect to the ∇ and
connection respectively, that is it holds that a ∇-geodesic (respectively
-geodesic) which is tangent to the submanifold at a point remains tangent to the submanifold. Moreover since their tangent spaces can be described as
the two submanifolds are mutually orthogonal by the condition
and complementary in the sense that their direct sum of subspaces gives the tangent space
. See
Figure 1 below. If we introduce the so called mixed coordinate system
the metric
g admits the block decomposition
So the complementary projections
and
define a couple of mutually dual foliation of
M whose leaves are orthogonal submanifolds of constant dimension (respectively
, and
) and totally geodesic with respect to the ∇ and
connections. This partition of the space is called
k-cut of a dual coordinate system in [
5]. The notion of mutually dual foliations has important applications on Information Geometry due to the fact that a version of the Pithagorean Theorem applies (see again [
3]).