3.1. Descriptive Statistics
We use daily logarithmic returns of the Kenneth French 49 Industry Portfolios [
13]
1 from 27 February 1976 to 27 February 2026 (12,605 trading days). The portfolios are value-weighted, reconstructed annually from CRSP/Compustat with four-digit SIC codes, and include delistings and corporate actions, eliminating survivorship bias by construction. A rolling window of
trading days yields
observation windows.
We benchmark Mantegna distance against average pairwise correlation and the variance share of the first principal component .
Table 1.
Descriptive statistics of correlation space (, , 1976–2026).
Table 1.
Descriptive statistics of correlation space (, , 1976–2026).
| Metric |
Full sample |
Calm periods |
Stress periods |
p-value |
|
0.248 |
0.239 |
0.315 |
|
|
0.558 |
0.540 |
0.681 |
|
| Mean Mantegna distance |
0.466 |
0.477 |
0.387 |
|
Average pairwise correlation is
in the full sample, consistent with a multi-factor market with pronounced sectoral structure. During stress periods
rises to
and
to
(
), the synchronisation effect documented in prior literature [
3]. The mean Mantegna distance falls from
to
(
), reflecting compression of the correlation metric space under elevated systemic risk.
3.2. Descriptive Topology: Two-Scale Structure
Persistent homology detects
-cycles across the study period; at least one cycle appears in every one of the
rolling windows (mean
, maximum 50;
Table 2).
Figure 1.
Topological changes during financial stress (, ). (a) Distribution of -cycle counts by regime. (b) Marginal density of birth filtration values. (c) Mean Betti number profile ; points significant at (Mann–Whitney) are marked. (d) Corresponding p-value profile (log scale). Shaded regions indicate the HIGH-correlation (red) and LOW-correlation (blue) scales identified on the calibration sample, with median boundaries and 95% bootstrap confidence intervals.
Figure 1.
Topological changes during financial stress (, ). (a) Distribution of -cycle counts by regime. (b) Marginal density of birth filtration values. (c) Mean Betti number profile ; points significant at (Mann–Whitney) are marked. (d) Corresponding p-value profile (log scale). Shaded regions indicate the HIGH-correlation (red) and LOW-correlation (blue) scales identified on the calibration sample, with median boundaries and 95% bootstrap confidence intervals.
We identify filtration scales where market topology differs systematically between stress and calm regimes using a two-stage procedure. First, a dense grid (step ) is pre-filtered by requiring that be significant (, two-sided Mann–Whitney U test) in both temporal halves of the original sample. Second, boundary uncertainty is quantified via a moving-block bootstrap ( replications, block length trading days): significant -ranges with consistent sign are aggregated, and final scales are reported as median boundaries with 95% empirical confidence intervals. This yields two intervals:
- 1.
HIGH-correlation scale (, equivalent ): is higher during stress.
- 2.
LOW-correlation scale (, equivalent ): is higher during calm periods.
Thus, stress induces opposite topological responses at different correlation strengths. In calm periods, sectoral differentiation generates intransitive configurations across a broad range of moderate and weak correlations. As systemic risk rises, a dominant market factor synchronises most pairs, dissolving low-scale cycles; however, incomplete synchronisation among the strongest correlates () creates new persistent high-scale cycles. The bootstrap-robust intervals suggest that the two scales reflect stable structural properties, not sample-specific artefacts.
3.4. Sectoral Decomposition of Intransitive Configurations
A one-dimensional homological cycle () at the high-correlation scale implies an intransitive triple of sectors where two pairwise correlations exceed a filtration threshold while the third remains below it. Extracting these triples provides a sector-level interpretation of the topological signal: the framework identifies configurations where synchronization fails to close transitive triangles.
The pipeline executes in minutes on standard hardware [
15], allowing for daily recomputation.
Figure 3 and
Figure 4 illustrate the sectoral topology at selected thresholds during a calm trading day (2026-02-27) and a stressed trading day (2020-03-10).
During calm regimes, intransitive configurations in the high-correlation band are sparse and topologically disconnected. At (), the network contains 22 triples distributed across isolated clusters, reflecting localized structural frictions. Tightening the threshold to () reduces the triple count to 2. The Containers sector appears in intransitive links across both thresholds, indicating a persistent structural friction (e.g., supply-chain or inventory cycle constraints). This alignment is retrospective and illustrative: the configuration identified in late February 2026 coincided with conditions preceding the March 2026 global logistics disruption. While not predictive, this example shows how localized intransitive configurations can serve as diagnostic markers for structural frictions.
During stress periods, intransitive structures persist even at extreme correlation thresholds. At (), where calm markets show no triples, the stress network contains 73 triples that form a connected component. The sectoral aggregation graph shows contributions from Wholesale, Banks, Business Services, Chemicals, and Communication. These sectors link otherwise disconnected parts of the economy.
Figure 5 shows the temporal evolution of intransitive triple counts across the LOW- and HIGH-correlation bands over 2006–2026. For this illustration, stress is defined exogenously rather than via volatility. The upper panel tracks the LOW-correlation range; the lower panel tracks the HIGH-correlation range. During systemic stress episodes (marked by vertical dashed lines), LOW-correlation triple counts decline while HIGH-correlation counts increase. This pattern is consistent with the two-scale structure identified in the persistence analysis.
The distinction between structural and cyclical intransitivity has implications for risk management. Persistent, threshold-robust triples in isolated sectors (e.g., Containers) may reflect supply-chain or operational bottlenecks. Monitoring HIGH-scale triple accumulation provides a sector-level diagnostic that complements aggregate volatility or correlation metrics.
3.5. Exact Analytical Nulls and Topological Geometry
We derive analytical benchmarks for random metric spaces and compare the empirical market against these baselines.
3.5.1. Intransitive Triangles and Exact Null Distributions
In a Vietoris–Rips filtration, a 1-cycle requires at least four vertices. The intermediate 3-point motif—intransitive triangle—is structurally relevant. For a triangle with edge lengths ordered as , the open 2-path exists at scales , with persistence .
Let
be the order statistics of three i.i.d.
variables conditioned on the triangle inequality
. The conditional joint density on the constrained metric simplex is
. Marginalising over
and changing variables to
yields the exact persistence density
The expected persistence is . In contrast, a perfect ultrametric space (hierarchical tree) has identically.
The probability
that a single intransitive triangle is alive at scale
follows from integrating the same density over the region
:
For Mantegna distances in , scale before applying the formulas.
If all triples share this marginal distribution, the expected number of intransitive triangles alive at scale is .
3.5.2. Deterministic Link and Geometric Amplification
Topological observables are deterministic functions of the correlation matrix via the Mantegna metric . For example, the intransitive triangle count . Consequently, topological metrics cannot serve as independent predictors—they move with the correlations. However, the filtration acts as a geometric amplifier: small correlation changes near critical thresholds can shift thousands of triangles at once, making cycle decompositions more sensitive to an approaching structural shift than pairwise metrics alone.
3.5.3. Empirical Findings: Excess Complexity and Hypo-Frustration
We compare the Fama–French 49 industry portfolio data against two nulls: (1) the exact theoretical null derived above, and (2) a parameter-free Gaussian factor null generated from
, where
is the Ledoit–Wolf shrinkage covariance matrix [
19].
Figure 6 shows three structural observations. (1) At intermediate correlation scales, the empirical triangle count exceeds the theoretical null by about 50%, indicating more topological frustration than a random metric space. (2) Across the same intermediate range, the empirical count is substantially lower than the Gaussian factor null; the market’s geometry is sparser and more hierarchical than a linear factor model. (3) During stress, the count rises sharply at tight scales (small
), surpassing both null benchmarks and the calm-period curve. This localised excess, where sector pairs are most strongly coupled, reflects intransitive triples among tightly correlated industries that neither null reproduces. Away from these tight scales, the stress curve falls below the calm curve, consistent with the dissolution of low-correlation frustration.
The curves diverge in the negative-correlation tail. An intransitive triangle is alive only if its longest edge . Because financial correlations are overwhelmingly positive, Mantegna distances rarely exceed , and the empirical count drops to zero once passes that threshold. The theoretical null admits distances up to (perfect negative correlation) and thus predicts a long tail; this stems from the null’s assumption that negative correlations are as likely as positive ones—a premise that does not hold for equity markets. The theoretical null is therefore a loose outer bound, not a realistic baseline.
Figure 7 shows the conditional mean persistence
. Both calm and stress curves lie strictly below the theoretical random-metric benchmark at all scales—a pattern we term
hypo-frustration: open 2-paths resolve faster than in a random metric space. During stress, persistence nearly vanishes at the tightest scales, consistent with an approach to an ultrametric, tree-like geometry. Taken together, the tight-scale surge in intransitive triangles and the consistently shortened persistence relative to both nulls constitute a geometric signature of market stress.
3.6. Predictive Performance
We evaluate whether topological indicators improve forecasts of near-term stress beyond average correlation . Models are logistic regressions trained on the combined Training (1976–2006) and Validation (2006–2016) periods and applied without re-estimation to the hold-out Test set (2016–2026). The high and low correlation scales are defined on the Training period (1976–2006). The evaluation is strictly chronological: no Test-period data inform indicator construction, hyper-parameter selection, or model fitting.
Indicator Suite and Hyper-Parameters
From each daily persistence diagram we extract five level indicators (
Section 2.3):
Momentum signals are built as
where
and
is the rolling standard deviation of differences, computed over
days and shifted by one day to prevent look-ahead bias. We also include the momentum of average pairwise correlation,
, as an additional competitor. The lag
and window
are selected by grid search on the Validation set, maximizing AUC for the onset target at
days, and then held fixed for all horizons (
Table 3).
Bootstrap Confidence Intervals and Pairwise Tests
Moving-block bootstrap on the Test set (block length 120 days, 1000 replications) confirms the pattern.
Table 5 reports 95% confidence intervals for
; the lower bounds remain above 0.5 at all horizons. Pairwise superiority tests (
Table 6) further show that
’s advantage over the control and over
at long horizons is robust.
A clear horizon-dependent pattern emerges. At short horizons, the momentum of average correlation captures the bulk of the predictive signal, and topological momentum adds no significant incremental information. At 60–100 day horizons, the momentum of the persistence-weighted mean cycle birth, , is the strongest predictor, outperforming both the control model and . This is consistent with a slow, structural build-up of topological frustration that materialises into volatility only over several months.