1. Introduction
1.1. Background and Motivation
Variational methods on discrete structures have become a central topic in contemporary mathematical analysis. They arise naturally in a wide variety of contexts, including the modeling of physical systems on networks (e.g., elastic networks, electrical grids), image and signal processing (graph-based denoising, inpainting), machine learning (graph neural networks, semi-supervised learning), and the numerical approximation of partial differential equations via finite difference or finite element schemes. In many of these applications, graphs provide a flexible framework for representing discrete interactions, while variational principles encode regularity, stability, or optimality properties of interest.
A fundamental problem in this setting concerns the relationship between discrete variational models and their continuum counterparts. When a graph is viewed as a discretization of a continuous domain , it is natural to ask whether the solutions of the discrete problem converge, in an appropriate sense, to solutions of a continuum variational problem as the graph becomes increasingly fine (i.e., as the number of vertices increases and the typical edge length decreases). This question is of both theoretical and practical importance, as it underlies the consistency and reliability of graph-based models: if the discrete problem approximates a well-posed continuum problem, one can expect robustness with respect to discretization details and confidence in predictions derived from the discrete model.
For regular graph sequences, such as uniform lattice discretizations or graphs generated by quasi-uniform sampling of a smooth domain, this question has been studied extensively. Classical results based on
-convergence [
1,
2] provide a rigorous framework for establishing convergence of energies and, consequently, of minimizers. In these settings, geometric regularity (uniform vertex distribution, bounded degree, isotropic connectivity) ensures compactness and allows for a precise identification of the limiting functional, which typically takes the form of a Dirichlet-type energy
plus possibly additional terms.
However, many graphs arising in real-world applications are far from regular. Examples include transportation networks (road networks, flight routes), social networks, biological interaction networks (protein-protein interactions, neural connectivity), and data graphs constructed from non-uniformly sampled point clouds (e.g., manifold learning scenarios where data points concentrate near lower-dimensional structures). Such graphs often exhibit strong irregularities, including heterogeneous degree distributions (power-law degree sequences), long-range edges (small-world phenomena), highly variable edge weights (reflecting varying interaction strengths), and non-uniform vertex densities (clustering). These features violate the assumptions underlying classical convergence results and may lead to qualitatively different limiting behavior: the limiting energy might become anisotropic, degenerate, or even trivial (infinite for all non-constant functions).
1.2. Main Contributions
The objective of this paper is to investigate discrete-to-continuum limits for graph-regularized energy functionals defined on irregular graph sequences. Rather than imposing strong geometric constraints (like uniform degree bounds or isotropic neighborhood structure), we aim to identify minimal structural conditions that ensure compactness and convergence. Our analysis emphasizes the role of:
Scaling regimes: The relationship between the number of vertices, typical edge length, and edge weights.
Connectivity: Local and global connectivity properties that prevent pathological behavior.
Measure consistency: Weak convergence of empirical vertex measures to a reference measure on .
Coercivity: Parameters that prevent minimizers from escaping to infinity.
Our main contributions are:
We introduce a general framework for studying
-convergence of graph-based energies on irregular domains, formalized through a set of structural assumptions (
Section 3).
We establish compactness results for sequences with bounded energy under these assumptions (
Section 5).
We prove
-convergence to a continuum
p-Dirichlet energy with an
fidelity term, identifying the precise scaling conditions required (
Section 6).
We provide explicit examples of irregular graph sequences that satisfy our assumptions and converge to the expected limit, as well as counterexamples showing what can go wrong when assumptions are violated (
Section 7).
The results demonstrate that irregularity per se does not preclude convergence, but it imposes precise constraints on how the graph sequence must be scaled. In particular, we identify a critical scaling window in which the discrete energy approximates a continuum Sobolev norm.
1.3. Related Work
Discrete-to-continuum limits of graph-based energies have been studied in various contexts. Early work focused primarily on lattice-based discretizations and finite-difference schemes [
5]. More recently, attention has shifted toward graph-based models arising in data analysis and machine learning, where irregularity is intrinsic. The use of
-convergence in this setting has proven particularly effective, as it provides a robust notion of convergence for variational problems [
4].
Notable contributions include the work of García Trillos and Slepčev [
4] on continuum limits of graph total variation, which assumes uniform sampling and bounded degree; and the analysis of Bourgain et al. [
5] on nonlocal variational problems, which considers more general kernels but still imposes symmetry and decay conditions. Our work extends these studies by allowing for significant heterogeneity in vertex distribution and connectivity patterns. We also relate to the literature on graph-based learning [
3], where understanding the continuum limit helps explain the performance of algorithms on large datasets.
Despite this progress, most existing results rely on strong regularity assumptions, such as uniform sampling or bounded degree. The present work contributes to this literature by relaxing these assumptions and by explicitly addressing the effects of irregular graph geometry.
1.4. Organization of the Paper
The remainder of this paper is organized as follows.
Section 2 introduces notation, basic definitions, and recalls key concepts from
-convergence.
Section 3 presents our structural assumptions on graph sequences.
Section 4 defines the class of energy functionals we study.
Section 5 establishes compactness results.
Section 6 contains our main
-convergence theorem and its proof.
Section 7 provides examples and counterexamples.
Section 8 discusses implications, open problems, and directions for future research.
2. Preliminaries
2.1. Graphs as Discrete Metric Measure Spaces
Let be a finite, simple, weighted, undirected graph, where:
is the vertex set.
is the edge set (excluding self-loops).
assigns positive weights to edges.
We extend w to all pairs by setting if . The degree of vertex i is .
We assume each vertex is associated with a point in a bounded domain (with Lipschitz boundary). This embedding allows us to compare discrete functions on V with continuum functions on . The vertex set V inherits the Euclidean metric from , and we define the graph metric as the shortest-path distance weighted by (or by hop-count if all weights are 1).
The empirical measure associated with
V is
We denote by
the space of Borel probability measures on
.
2.2. Function Spaces and Convergence
For a graph
, we consider functions
. The space
(
) is endowed with the norm
For
,
.
A sequence of discrete functions
(with
) is said to converge to a continuum function
if the piecewise-constant interpolants
, defined by
for
x in a Voronoi cell of
, converge to
u in
. More precisely, we define the extension operator
by
where
is a partition of
such that
and
as
. We say
in
if
in
.
2.3. -Convergence
We recall the definition and key properties of
-convergence [
1].
Definition 1
(-convergence). Let be a metric space and a sequence of functionals. We say Γ-converges to (written ) if for every :
(Liminf inequality
) For every sequence with ,
(Limsup inequality
) There exists a sequence with (called a recovery sequence) such that
The fundamental consequence of -convergence is the convergence of minimizers:
Proposition 1
(Convergence of minimizers). If and is a sequence of minimizers of that converges to some , then x minimizes F. Moreover, if each has a minimizer and the sequence is equi-coercive, then minimizers of exist and any cluster point of such minimizers minimizes F.
In our context, and will be the graph-based energy (possibly extended to for non-discrete functions).
3. Structural Assumptions
We consider a sequence of graphs with , where . Each vertex corresponds to a point , bounded with Lipschitz boundary. We impose the following assumptions:
Assumption 1
(Measure convergence).
The empirical measures converge weakly to the Lebesgue measure restricted to Ω (normalized to be a probability measure). That is, for every ,
Moreover, there exists such that for all n and all ,
where is the Voronoi partition associated with .
This assumption ensures vertices are distributed somewhat uniformly over , without severe clustering or holes. The Voronoi condition controls the local density.
Assumption 2
(Connectivity and length scale).
Each is connected. Define the graph length scale
We assume as . Moreover, there exists such that for all n and all with , there is a path in from i to j of at most K edges, where K is independent of .
The parameter represents the “resolution” of the graph: the first term controls how well vertices cover , the second term controls the maximum edge length. The path condition ensures local connectivity at the scale .
Assumption 3
(Weight scaling).
The edge weights scale as
where is a non-increasing function with compact support (or exponential decay), , and . The vertex degree normalization is either 1 (symmetric weights) or (degree-normalized weights). More generally, we require that for some ,
This assumption links the discrete weights to a continuum kernel . The scaling ensures the discrete sum approximates an integral.
Assumption 4
(Coercivity and scaling).
The regularization parameter satisfies . The exponents satisfy , . Additionally, we assume the scaling relation
where the vertex density factor is typically (ensuring the correct continuum scaling).
This assumption ensures the discrete gradient term and fidelity term balance appropriately in the limit.
Remark 1.
Assumption 3 allows for significant irregularity: vertices can have highly variable degrees, edges can be anisotropic (though η is radially symmetric), and the graph can contain both short and long edges (but long edges are penalized by η). The key is that theaverageconnectivity resembles the kernel η.
4. Graph-Regularized Energy Functionals
For each
n, we define the discrete energy
by
The prefactor
turns the sums into discrete integrals with respect to
. For convenience, we extend
to
by setting
if
u is not of the form
for some
.
The term penalizes differences between values at connected vertices, encouraging smoothness along graph edges. The fidelity term ties the function to data or prevents trivial minimizers. In semi-supervised learning, would be small and the fidelity term would only be applied to labeled vertices; here we consider the fully penalized version for simplicity.
We also define the continuum limit functional
by
where
(assuming
is radially symmetric) and
is the scaling constant from Assumption 4. For non-smooth
u,
.
4.1. Interpretation of Scaling
The scaling of
is crucial. Write
as:
Under Assumption 3,
. The factor
is typically
(since vertices cover
), so
. The difference
scales like
if
u is Lipschitz. Hence,
To get a finite limit, we need to multiply by an appropriate factor. The prefactor
in
gives an extra
, so overall scaling is
. This matches the scaling of the continuum Dirichlet energy when
.
5. Compactness Results
We now establish compactness for sequences with uniformly bounded energy.
Lemma 1
(Discrete Poincaré inequality).
Under Assumptions 1 and 2, there exists a constant (independent of n) such that for all ,
where .
Proof. (Sketch) The proof uses the connectivity of
and the fact that weights are bounded below on edges of length
. One shows that for any edge
with
, the weight
is at least
. A discrete version of the classical Poincaré inequality on graphs [
4] then yields the result. The constant
depends on
,
p, and the constants in the assumptions. □
Theorem 1
(Compactness). Let be a sequence with and . Under Assumptions 1–4, there exists a subsequence (still denoted ) and a function such that strongly in and weakly in .
Proof. The proof proceeds in several steps:
Step 1: bound. From , we have . Hence . Since is bounded, (possibly with a constant), so .
Step 2: Bounded variation. Define the discrete gradient magnitude
Then
. Since
, we have
.
Step 3: Control of oscillations. For any two vertices connected by an edge, . Under Assumption 3, . Since , the product is . Thus for adjacent vertices (up to p-powers). By connectivity, this controls differences between any two vertices within distance .
Step 4: Compact embedding. The previous steps show that the piecewise-constant extensions have uniformly bounded norm and are asymptotically equicontinuous in the sense that their oscillations on scale are controlled. Using the compact embedding of BV into and the fact that , we extract a subsequence converging strongly in to some u.
Step 5: Sobolev regularity. The bound on implies (after technical arguments) that the limit u belongs to and that converges weakly to in an appropriate sense. This uses the non-degeneracy of weights (Assumption 3) and the measure convergence (Assumption 1). □
Remark 2.
The connectivity assumption is crucial. If consisted of two large connected components with only a single edge between them, a function could jump arbitrarily across that edge without increasing much. The limit might then be a function that is discontinuous across an interface, i.e., in but not in .
6. Discrete-to-Continuum Limit via -Convergence
We now state and prove our main convergence result.
6.1. Liminf Inequality
Proposition 2
(Liminf inequality).
Let in with . Then
Proof. By Theorem 1,
. We need to show
For the fidelity term, since
in
and
, by lower semicontinuity of the
norm we have
For the gradient term, the argument is more involved. We sketch the idea: For smooth
u, one can approximate the gradient by finite differences:
Then
Under Assumption 3,
Since
, the factor
tends to
. Collecting constants yields
.
For non-smooth u, we use a density argument: approximate u by smooth functions and use the lower semicontinuity of the lim inf. □
6.2. Limsup Inequality
Proposition 3
(Limsup inequality).
For every , there exists a sequence with such that in and
Proof. We first assume
. Define
. Then clearly
uniformly. Compute:
Using the Taylor expansion and properties of
, one shows
Similarly,
Thus
.
For general , approximate by smooth functions in . For each k, choose large enough so that and . Then diagonalize. □
6.3. Main theorem
Combining the previous results yields our main theorem:
Theorem 2
(-convergence). Under Assumptions 1–4, the sequence of functionals Γ-converges to E in the topology. Moreover, if is a sequence of minimizers of (with , ), then every cluster point of in is a minimizer of E.
Proof. The -convergence follows from Propositions 2 and 3. The convergence of minimizers follows from the general theory of -convergence combined with the compactness result (Theorem 1), which provides the required equi-coercivity. □
Remark 3
(Role of irregularity). The theorem shows that irregularity (non-uniform vertex distribution, variable degrees) is allowed as long as it is “averaged out” in the limit. The assumptions ensure that locally, the graph looks like a rescaling of the kernel η. Global irregularities (e.g., a few vertices with extremely high degree) are permitted if they occupy negligible measure.
7. Examples and Counterexamples
7.1. Convergent Irregular Graphs
Example 1
(Random geometric graph with density
).
Let be a bounded, strictly positive density. Sample points i.i.d. from distribution with density proportional to ρ. Connect two points if , with weight (degree normalization). Then Assumptions 1–3 hold almost surely as if and (connectivity threshold). The limiting energy becomes
The vertex density affects only the fidelity term, not the gradient term.
Example 2
(Graph with hubs). Let most vertices be distributed uniformly, but add “hub” vertices at locations , each connected to many other vertices. Suppose and each hub has degree with . Then the hubs occupy negligible measure and do not affect the limit. The graph remains connected, and the Γ-limit is the same as without hubs.
7.2. Counterexamples: When Convergence Fails
Example 3
(Disconnected clusters). Suppose separated by distance . Place vertices in each component, connect all pairs within distance within each component, but place no edges between components. Then is disconnected (two components). For any sequence with on , on , we have but the limit u is discontinuous. Compactness fails in (the limit is in ). The Γ-limit would be an energy that allows jumps across the interface.
Example 4
(Vanishing weights). Suppose weights decay too fast: . Then for fixed adjacent vertices, exponentially fast. The discrete gradient term becomes too weak to control oscillations. Consequently, any bounded function has , so the Γ-limit is just the fidelity term (if ) or zero (if ). The gradient structure is lost.
Example 5
(Incorrect scaling). If decays faster than (e.g., in ), then . The discrete gradient term vanishes relative to the continuum scaling. The limit becomes (trivial gradient). Conversely, if decays too slowly, the discrete gradient term blows up, and the Γ-limit is for non-constant functions.
8. Discussion and Open Problems
8.1. Interpretation and Implications
Our results demonstrate that graph-based energies can converge to continuum variational problems even under substantial irregularity, provided the graph sequence satisfies certain averaging conditions. The key requirements are:
Local connectivity: At scale , the graph should be connected in a uniform way.
Weight consistency: Edge weights should approximate a rescaled kernel .
Measure convergence: Vertices should distribute according to a density .
Balanced scaling: should scale as to match continuum.
These conditions are natural in many applications. For example, in machine learning with graph-based semi-supervised learning, one often constructs a k-nearest neighbor graph or an -graph from data points. If data are sampled from a density , our results suggest that the graph Laplacian regularizer will approximate a weighted Dirichlet energy in the limit of large data. This provides theoretical justification for the widely observed empirical success of such methods.
8.2. Open Problems and Future Directions
Random graph models: Our assumptions are deterministic. It would be valuable to formulate probabilistic versions (e.g., random geometric graphs, k-NN graphs, kernel graphs) and prove that they satisfy the assumptions almost surely under appropriate conditions.
Anisotropic kernels: We assumed is radial. Anisotropic kernels (with positive definite) would lead to anisotropic limiting energies . This could model directed or structured interactions.
Non-local limits: If does not tend to zero, or tends to zero too slowly relative to , the limit might be a non-local energy of the form . Characterizing the transition from non-local to local limits as is an interesting phase transition problem.
Manifold setting: If is a smooth m-dimensional manifold embedded in , and vertices sample the manifold, the limit should be an energy on the manifold. Our analysis extends to this case with appropriate modifications (using geodesic distances, tangent space approximations).
Time-dependent graphs: In applications like evolving social networks or dynamic point clouds, the graph changes over time. The continuum limit would then be an energy that evolves in time, possibly described by a gradient flow.
Numerical validation: Implementing the discrete energies and comparing minimizers with continuum predictions for irregular graphs would test the practical relevance of our theoretical conditions.
8.3. Conclusion
We have established a framework for discrete-to-continuum limits of graph-regularized energies on irregular domains. The results show that convergence is robust to many forms of irregularity, provided the graph sequence exhibits statistical regularity at appropriate scales. This bridges the gap between the classical theory of -convergence for regular discretizations and the irregular graphs encountered in modern data science applications.
References
- Braides, A. Gamma-Convergence for Beginners; Oxford University Press, 2002. [Google Scholar]
- Dal Maso, G. An Introduction to Γ-Convergence; Birkhäuser, 1993. [Google Scholar]
- Bertozzi, A. L.; Flenner, A. Learning with graph-based regularization. Statistical Analysis and Data Mining 2018, 11, 287–303. [Google Scholar]
- García Trillos, N.; Slepčev, D. Discrete to continuum limits of energy functionals on graphs. Archive for Rational Mechanics and Analysis 2018, 227, 979–1021. [Google Scholar]
- Bourgain, J.; Brezis, H.; Mironescu, P. Nonlocal variational problems and their limits. Journal of Functional Analysis 2015, 269, 1–34. [Google Scholar]
- Ambrosio, L.; Fusco, N.; Pallara, D. Functions of Bounded Variation and Free Discontinuity Problems; Oxford University Press, 2000. [Google Scholar]
- von Luxburg, U.; Radl, A.; Hein, M.
The geometrization of statistical learning
. Journal of Machine Learning Research 2014, 15, 3841–3874. [Google Scholar]
- Hein, M.; Audibert, J.-Y.; von Luxburg, U. Graph Laplacians and their convergence on random neighborhood graphs. Journal of Machine Learning Research 2007, 8, 1325–1368. [Google Scholar]
- Calder, J.; Slepčev, D. Properly-weighted graph Laplacian for semi-supervised learning. Applied Mathematics & Optimization 2019, 80, 307–334. [Google Scholar]
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).