A Variational Screened Poisson Reconstruction for Whole-Slide Stain Normalization

Junlong Xing; Hengli Ni; Qiru Wang; Yijun Jing

doi:10.20944/preprints202602.1947.v1

Submitted:

15 February 2026

Posted:

28 February 2026

You are already at the latest version

Abstract

To address staining variability in digital pathology, we formulate normalization as a variational inverse problem balancing structural fidelity and chromatic alignment. We propose Screened Poisson Normalization (SPN), a convex model in CIE L^∗a^∗b^∗ space governed by the modified Helmholtz equation. By coupling a first-order gradient consistency term with a zeroth-order chromatic anchor, SPN yields a screened reconstruction controlled by a single screening parameter λ. Beyond establishing well-posedness in H1(Ω) and Lipschitz stability, we derive a property crucial for gigapixel wholeslide analysis: the screening term induces an intrinsic localization length ℓ ∼ λ^−1/2, which leads to exponential decay of boundary-induced errors. This theoretical insight provides a principled justification for a scalable, non-overlapping subdomain implementation; when combined with an efficient DCT-based spectral solver, it enables artifact-free processing without elaborate tiling heuristics or boundary post-processing. Extensive experiments on multi-center datasets further demonstrate that SPN achieves robust chromatic consistency while faithfully preserving diagnostically relevant microstructural detail.

Keywords:

stain normalization

;

screened Poisson equation

;

Helmholtz operator

;

domain decomposition

Subject:

Computer Science and Mathematics - Applied Mathematics

1. Introduction

1.1. Background

Histopathological analysis is the current gold standard for disease diagnosis, particularly in oncology. With the rapid evolution of digital pathology [1], the slides containing stained tissue sections are routinely digitized into ultra–high-resolution whole-slide images (WSIs) using advanced microscopy scanners. As a result, computational analysis of digital pathology images is increasingly complementing, and in some cases replacing, manual microscopic evaluation by pathologists. Within this paradigm, the quality, consistency, and standardization of digitized pathological images constitute pivotal determinants of diagnostic reproducibility and the performance of subsequent computational analyses.

The WSIs imaging pipeline begins with conventional tissue preparation, followed by whole-slide scanning into standard RGB image formats. Figure 1 provides an overview of the complete acquisition workflow [28].

Clinically, tissue specimens(obtained via surgical resection or biopsy)are fixed in formalin to preserve structural integrity, dehydrated through graded ethanol solutions, and embedded in paraffin wax to form a block. Thin tissue sections (3–6 µm) are cut from this block, mounted onto slides, and stained, most commonly with hematoxylin and eosin (H&E). Finally, a whole-slide scanner digitizes the slide at subcellular resolution to generate a WSI that is stored in a gigapixel images.However, deviations inevitably accumulate due to heterogeneous tissue preparation (e.g., sectioning, reagents, pH) and diverse digitization conditions (e.g., scanner optics, sensor noise).These artifacts introduce substantial domain shifts that impair diagnostic consistency and algorithmic performance. To address this, robust stain normalization is required to harmonize WSIs from heterogeneous sources, thereby standardizing the input for reliable downstream computational analysis.

1.2. Related Works

Research on stain normalization has advanced significantly since the digitalization of pathology. The seminal work of Reinhard et al. [6] introduced statistical color alignment via mean–variance matching in the decorrelated Lab space. However, this global statistics-based normalization inevitably results in the over-smoothing of fine image details.

To improve illumination robustness and chromatic fidelity, subsequent variations modified the transformation domain. Zarella et al. [14] and Magee et al. [15] adopted color projection in HSV/Lab spaces; Roy et al. [17] and Vijh et al. [18] introduced fuzzy-logic constraints; and Nadeem et al. [20] reformulated stain normalization as an optimal-transport problem via multimarginal Wasserstein barycenters. While these methods enhance perceptual smoothness, they remain fundamentally pixel-domain and global in nature, limiting their ability to preserve tissue microstructures.

Macenko et al. [7] shifted attention to structure preservation through stain vector estimation in optical-density (OD) space via singular value decomposition (SVD). Vahadane et al. [8] further incorporated sparse non-negative matrix factorization (SNMF), which jointly optimizes stain basis and concentration matrices. Adaptive deconvolution [19] and IDA decomposition [16] have since been proposed to address spatial heterogeneity while maintaining textural and morphological integrity. Gupta et al. [9] proposed a geometry-inspired, chemical- and tissue-invariant stain-normalization framework (GCTI). GCTI uses SVD/NMF and explicit geometric alignment to correct color variations caused by illumination, stain color vectors, and stain quantity.

Meanwhile, with the advancement of digital pathology, data-driven deep learning models have demonstrated superior performance to conventional stain normalization techniques in recent years. Existing approaches are mainly characterized by two classes: (1) supervised learning models that require paired or carefully curated training data [3,5,13,22,23,24,25], and (2) generative frameworks based on Generative Adversarial Networks (GANs) [4,11,12,21,26,33]. Despite achieving strong quantitative results, such improvements do not necessarily translate into better clinical diagnostic performance. When applied across institutions, sample-driven deep learning models are vulnerable to distribution shifts, which undermine reliability in real-world deployment. Moreover, their lack of explicit physical modeling of the staining process limits interpretability and hinders clinical acceptance.

1.3. Main Contributions

We propose the Screened Poisson Normalization (SPN) model, a variational framework that reformulates stain normalization as a strictly convex inverse problem governed by reaction-diffusion physics. This approach integrates physical interpretability with rigorous mathematical analysis, establishing the well-posedness and stability of the solution to guarantee the preservation of diagnostic morphology against input perturbations. To address the gigapixel scale of clinical pathology, we further derive a domain decomposition algorithm underpinned by the exponential decay of the screened Poisson Green’s function; this theoretical property ensures that interface errors remain exponentially localized, enabling the seamless and parallel reconstruction of Whole-Slide Images without the boundary artifacts typical of patch-based processing.

The paper is organized as follows. Section 2 investigates the mathematical nature of the stain normalization problem and establishes its theoretical foundations. Section 3 then derives Screened Poisson Normalization (SPN) from a reaction–diffusion interpretation of histological staining and formulates stain standardization as a variational inverse problem in the perceptual CIE

L a b

space. Section 4 presents the rigorous analysis of the model, establishing well-posedness, Lipschitz stability, and the exponential localization property that motivates the non-overlapping subdomain implementation. Section 5 details the numerical realization, including the finite-difference discretization, the DCT-based spectral solver, and practical implementation considerations. Section 6 reports comprehensive experimental validation on multi-center datasets. Finally, Section 7 concludes the paper and discusses limitations and directions for future work.

2. Mathematical Foundations

We define stain normalization as an inverse problem on a bounded Lipschitz domain

Ω \subset R^{2}

. Let

I_{S}, I_{T} : Ω \to R^{3}

denote the source and reference images, respectively. The task is to recover a normalized field

u : Ω \to R

(processed channel-wise in the decorrelated CIE

L a b

space [6]) that simultaneously satisfies structural fidelity to

I_{S}

and statistical alignment with

I_{T}

. This section analyzes the ill-posed nature of this reconstruction and establishes its theoretical connection to reaction–diffusion physics.

2.1. Variational Formulation of the Inverse Problem

Mathematically, recovering u is fundamentally under-determined. The statistical constraint operator

C (u) \approx C (I_{T})

is non-injective, as infinite spatial configurations can share identical global moments. Conversely, relying solely on gradient fidelity

\nabla u \approx \nabla I_{S}

determines the solution only up to an additive constant (the null space of the Neumann operator). Consequently, the unconstrained optimization is ill-posed in the sense of Hadamard [38].

To restore well-posedness, the problem must be cast within a Tikhonov regularization framework [39]. We seek a solution

u \in H^{1} (Ω)

that minimizes a strictly convex energy functional of the form:

J (u) : = \frac{1}{2} {∥ \nabla u - g ∥}_{L^{2} (Ω)}^{2} + \frac{λ}{2} {∥ u - T ∥}_{L^{2} (Ω)}^{2}, λ > 0 .

(1)

Here, the first term enforces structural consistency with a guidance field

g

derived from the source, while the second term anchors the solution to a chromatic prior T. The regularization parameter

λ

balances these competing objectives and ensures the coercivity of the bilinear form, guaranteeing the existence of a unique, stable weak solution.

2.2. Physical Isomorphism: Reaction–Diffusion Kinetics

The variational structure of (1) is not merely a mathematical regularization; it is isomorphic to the physical transport laws [30] governing histological staining. The accumulation of dye within porous tissue follows reaction–diffusion kinetics [27,29,31]. For a dye species with concentration

C (x, t)

, the governing equation is:

\partial_{t} C = \nabla \cdot (D \nabla C) + R (C, x),

(2)

where D is the diffusion coefficient and R describes the binding reaction. Near chemical equilibrium, the binding process approximates linear kinetics

R (C) \approx - k (C - C_{eq})

, where k is the reaction rate and

C_{eq}

is the local equilibrium concentration determined by tissue affinity.

At the thermodynamic steady state (

\partial_{t} C = 0

), assuming isotropic diffusion, (2) reduces to the screened Poisson equation:

- Δ C + κ C = κ C_{eq}, with κ : = k / D .

(3)

This physical equilibrium (3) shares the exact Euler-Lagrange form of the variational problem (1). The regularization parameter

λ

thus corresponds to the physical ratio of reaction to diffusion rates (

κ

), while the anchor T corresponds to the chemical equilibrium potential. This theoretical equivalence provides the physical basis for modeling stain normalization as a screened Poisson process.

3. The Screened Poisson Normalization (SPN) Model

This section details the Screened Poisson Normalization (SPN) model. Operating in the CIE

L a b

color space [6] and using optical density (OD) to represent dye concentration [33,34], we cast stain normalization as a variational reconstruction problem.

For each channel

c \in {L^{*}, a^{*}, b^{*}}

, we define the normalized field

u_{c} \in H^{1} (Ω)

as the unique minimizer of the strictly convex energy functional:

J_{c} (u_{c}) : = \int_{Ω} (\frac{1}{2} {∥ \nabla u_{c} - g_{c} ∥}^{2} + \frac{λ_{c}}{2} {(u_{c} - T_{c})}^{2}) d x .

(4)

The first term ensures structural fidelity by penalizing gradient deviations from a guidance field

g_{c}

, while the second acts as a Tikhonov regularizer, anchoring the solution to a chromatic prior

T_{c}

. The parameter

λ_{c} > 0

governs the trade-off between these objectives and ensures the problem is well-posed.

Minimizing (4) yields the associated Euler–Lagrange equation, which takes the form of a steady-state screened Poisson equation:

- Δ u_{c} + λ_{c} u_{c} = λ_{c} T_{c} - \nabla \cdot g_{c} in Ω,

(5)

subject to the adiabatic boundary condition

(\nabla u_{c} - g_{c}) \cdot n = 0

on

\partial Ω

. Physically, this equation describes a diffusion process driven by a source term

f_{c} : = λ_{c} T_{c} - \nabla \cdot g_{c}

, which balances the injection of high-frequency morphological details with low-frequency chromatic information.

To close the system, we construct the chromatic anchor

T_{c}

and guidance field

g_{c}

from the source

I_{S}

and reference

I_{T}

. The anchor

T_{c}

is defined by matching the global mean

μ

and standard deviation

σ

of the reference:

T_{c} (x) = \frac{σ_{T}^{(c)}}{σ_{S}^{(c)}} ({(I_{S})}_{c} (x) - μ_{S}^{(c)}) + μ_{T}^{(c)} .

(6)

This linear mapping provides a stable baseline for chromaticity that is robust to local variations. To preserve morphology, we parameterize the guidance field

g_{c}

as a spatially varying blend of source and anchor gradients:

g_{c} (x) = (1 - w (x)) α_{c} \nabla {(I_{S})}_{c} (x) + w (x) \nabla T_{c} (x),

(7)

where

α_{c}

is a robust contrast factor defined by the ratio of median gradient magnitudes, i.e.,

α_{c} = median ∥ \nabla {(I_{T})}_{c} ∥ / median ∥ \nabla {(I_{S})}_{c} ∥

. The structural weight

w (x) \in (0, 1]

is derived from the source luminance channel

I_{S, L^{*}}

via:

w (x) = exp (- \frac{∥ \nabla I_{S, L^{*}} {(x) ∥}^{2}}{τ^{2}}), τ > 0 .

(8)

In structurally salient regions (

w \to 0

), the guidance field strictly adheres to the source morphology

\nabla {(I_{S})}_{c}

, whereas in homogeneous regions, the model relaxes toward the smooth chromatic anchor. The resulting solution corresponds to the steady state of a system with screening length

ℓ_{c} \sim λ_{c}^{- 1 / 2}

, preventing local artifacts from propagating globally.

4. Theoretical Analysis

This section analyzes the mathematical properties of the Screened Poisson Normalization (SPN) model. We treat the variational formulation as a linear inverse problem in the Sobolev space

H^{1} (Ω)

, establishing the well-posedness of the solution, its stability with respect to input data, and the convergence of the domain decomposition scheme used for gigapixel processing.

The variational problem derived in Section 3 is formally equivalent to finding a weak solution

u \in H^{1} (Ω)

such that

B (u, v) = F (v), \forall v \in H^{1} (Ω),

(9)

where the bilinear form

B : H^{1} (Ω) \times H^{1} (Ω) \to R

and the linear functional

F : H^{1} (Ω) \to R

are defined by:

B (u, v) : = \int_{Ω} (\nabla u \cdot \nabla v + λ u v) d x, F (v) : = \int_{Ω} (g \cdot \nabla v + λ T v) d x .

We assume

Ω \subset R^{2}

is a bounded Lipschitz domain,

λ > 0

,

g \in {[L^{2} (Ω)]}^{2}

, and

T \in L^{2} (Ω)

.

Theorem 1

(Existence and Uniqueness). For any screening parameter

λ > 0

, guidance field

g \in {[L^{2} (Ω)]}^{2}

, and chromatic anchor

T \in L^{2} (Ω)

, the variational problem (9) admits a unique solution

u^{*} \in H^{1} (Ω)

.

Proof.

The result follows from the Lax–Milgram theorem. First, the bilinear form B is continuous on

H^{1} (Ω) \times H^{1} (Ω)

by the Cauchy–Schwarz inequality:

| B (u, v) | \leq {∥ \nabla u ∥}_{L^{2}} {∥ \nabla v ∥}_{L^{2}} + {λ ∥ u ∥}_{L^{2}} {∥ v ∥}_{L^{2}} \leq {max (1, λ) ∥ u ∥}_{H^{1}} {∥ v ∥}_{H^{1}} .

Second, B is strictly coercive due to the strictly positive screening parameter

λ

. Testing with

v = u

:

B (u, u) = {∥ \nabla u ∥}_{L^{2}}^{2} + {λ ∥ u ∥}_{L^{2}}^{2} \geq min (1, λ) {∥ u ∥}_{H^{1}}^{2} .

Since F defines a bounded linear functional on

H^{1} (Ω)

, the existence of a unique minimizer is guaranteed. □

With the solution uniquely defined, we quantify its stability with respect to perturbations in the input data

(g, T)

.

Theorem 2

(Lipschitz Stability). Let

u_{1}, u_{2} \in H^{1} (Ω)

be the unique weak solutions corresponding to data

(g_{1}, T_{1})

and

(g_{2}, T_{2})

, respectively. The solution depends continuously on the data:

∥ u_{1} - u_{2} ∥_{H^{1} (Ω)} \leq \frac{1}{min (1, λ)} (∥ g_{1} - g_{2} ∥_{L^{2} (Ω)} + λ {∥ T_{1} - T_{2} ∥}_{L^{2} (Ω)}) .

(10)

Proof.

Let

w = u_{1} - u_{2}

. Subtracting the respective weak formulations yields

B (w, v) = \int_{Ω} ((g_{1} - g_{2}) \cdot \nabla v + λ (T_{1} - T_{2}) v) d x, \forall v \in H^{1} (Ω) .

Choosing the test function

v = w

and applying the coercivity of B alongside the Cauchy–Schwarz inequality gives

{min (1, λ) ∥ w ∥}_{H^{1}}^{2} \leq B (w, w) \leq (∥ g_{1} - g_{2} ∥_{L^{2}} + λ {∥ T_{1} - T_{2} ∥}_{L^{2}}) {∥ w ∥}_{H^{1}} .

Dividing by

{∥ w ∥}_{H^{1}}

completes the proof. □

We next quantify the trade-off between morphological preservation and color normalization. The following estimate ensures that structural modifications are strictly bounded by the magnitude of the chromatic transfer.

Lemma 1

(Energy Comparison Principle). Let

u^{*}

be the global minimizer of

J (u)

. For any comparison function

v \in H^{1} (Ω)

,

∥ \nabla u^{*} {- g ∥}_{L^{2} (Ω)}^{2} + λ ∥ u^{*} {- T ∥}_{L^{2} (Ω)}^{2} \leq {∥ \nabla v - g ∥}_{L^{2} (Ω)}^{2} + λ {∥ v - T ∥}_{L^{2} (Ω)}^{2} .

(11)

Proof.

This is an immediate consequence of the optimality condition

J (u^{*}) \leq J (v)

. □

Theorem 3

(Structural Fidelity Bound). Assume

g \approx \nabla I_{S}

. The

L^{2}

-deviation of the normalized gradients

\nabla u^{*}

from the guidance field satisfies:

∥ \nabla u^{*} {- g ∥}_{L^{2} (Ω)} \leq \sqrt{λ} ∥ I_{S} {- T ∥}_{L^{2} (Ω)} + {∥ \nabla I_{S} - g ∥}_{L^{2} (Ω)} .

(12)

Proof.

Set

v = I_{S}

in Lemma 1. Neglecting the non-negative term

λ ∥ u^{*} {- T ∥}_{L^{2}}^{2}

on the left-hand side yields

∥ \nabla u^{*} {- g ∥}_{L^{2}}^{2} \leq ∥ \nabla I_{S} {- g ∥}_{L^{2}}^{2} + λ {∥ I_{S} - T ∥}_{L^{2}}^{2} .

The result follows from the subadditivity of the square root function,

\sqrt{a^{2} + b^{2}} \leq a + b

. □

Finally, we address the convergence of the tiled reconstruction required for gigapixel Whole Slide Images. Let

Ω

be partitioned into non-overlapping subdomains

{Ω_{k}}

. The local solution

u_{tile}

on

Ω_{k}

satisfies the natural boundary condition

(\nabla u_{tile} - g) \cdot n = 0

on

\partial Ω_{k}

. The discrepancy

e : = u_{global} - u_{tile}

arises solely from the flux mismatch

ψ : = (\nabla u_{global} - g) \cdot n

at the interface

Γ : = \partial Ω_{k} ∖ \partial Ω

.

Lemma 2

(Green’s Function Asymptotics). The fundamental solution

G_{λ} (x)

of the operator

- Δ + λ I

in

R^{2}

satisfies the decay estimate for

| x | > 0

:

| G_{λ} (x) | \leq C \frac{e^{- \sqrt{λ} | x |}}{\sqrt{| x |}} .

(13)

Theorem 4

(Exponential Localization of Interface Error). For any

x \in Ω_{k}

, the error

e (x)

decays exponentially with the distance to the interface Γ:

| e (x) | \leq K exp (- \frac{d (x, Γ)}{ℓ_{c}}), ℓ_{c} = λ^{- 1 / 2} .

(14)

Proof.

The error function satisfies

- Δ e + λ e = 0

in

Ω_{k}

with Neumann data

\partial_{n} e = ψ

on

Γ

. Using the boundary layer potential representation

e (x) = \int_{Γ} G_{λ} (x - y) ψ (y) d σ (y),

and noting that

| x - y | \geq d (x, Γ)

for all

y \in Γ

, we invoke the monotonicity of the Green’s kernel:

| e (x) | \leq (sup_{r \geq d (x, Γ)} | G_{λ} (r) |) {∥ ψ ∥}_{L^{1} (Γ)} .

Combining this uniform bound with the exponential decay of

G_{λ}

yields the stated estimate. □

Remark 1.

Theorem 4 implies that boundary artifacts are confined to a layer of characteristic width

O (ℓ_{c})

. For sufficiently large tiles, the interior solution converges exponentially to the global reconstruction, mathematically justifying the use of non-overlapping domain decomposition.

5. Optimization and Implementation

For each channel

c \in {L^{*}, a^{*}, b^{*}}

, the SPN model seeks a scalar field

U_{c} : Ω \to R

that solves the screened Poisson equation (5),where

U_{c}

denotes the discrete (grid-based) version of

u_{c}

.

5.1. Discrete and Non-Overlapping Domain Decomposition

The gigapixel resolution of WSI images renders global algorithms computationally prohibitive. Leveraging the localization properties of the SPN operator established in Section 4, we adopt a patch-wise implementation strategy.

We discretize the continuous domain

Ω

on a regular

H \times W

lattice:

Ω_{h} = {(x, y) \in Z^{2} ∣ 0 \leq x < H, 0 \leq y < W},

with unit grid spacing. The scalar field

U_{c}

is represented as a matrix

U_{c} \in R^{H \times W}

, and the vector field

g_{c}

as a tensor in

R^{H \times W \times 2}

. To avoid directional bias, we employ centered finite differences for the interior gradients:

{(\nabla_{h} U_{c})}_{x, y} = {[\begin{matrix} D_{x} U_{c} \\ D_{y} U_{c} \end{matrix}]}_{x, y} = [\begin{matrix} (U_{c, x + 1, y} - U_{c, x - 1, y}) / 2 \\ (U_{c, x, y + 1} - U_{c, x, y - 1}) / 2 \end{matrix}], 1 \leq x \leq H - 2, 1 \leq y \leq W - 2 .

(15)

At the image boundaries, we utilize forward/backward differences consistent with the homogeneous Neumann condition

\partial_{n} U_{c} = 0

.

The discrete divergence

\nabla_{h} \cdot g_{c}

is defined as the negative adjoint of

\nabla_{h}

, ensuring that the discrete operator retains the symmetric positive-definite structure of the continuous formulation. Combining

\nabla_{h}

and

\nabla_{h} \cdot

yields the standard five-point Laplacian stencil:

{(Δ_{h} U_{c})}_{x, y} = U_{c, x + 1, y} + U_{c, x - 1, y} + U_{c, x, y + 1} + U_{c, x, y - 1} - 4 U_{c, x, y},

(16)

with appropriate modifications at the boundaries. On the vectorized domain, this defines a sparse block-Toeplitz with Toeplitz-blocks (BTTB) matrix

L \in R^{N \times N}

(where

N = H W

). Solving the linear system

(L - λ_{c} I) u_{c} = r_{c}

directly via sparse factorization is infeasible at the WSI scale. Instead, we exploit the spectral properties of

L

for efficient inversion.

5.2. Spectral Diagonalization and DCT Solver

Under homogeneous Neumann boundary conditions, the eigenfunctions of the discrete Laplacian (16) coincide with the basis functions of the 2D Type-II Discrete Cosine Transform (DCT-II). Let

F

denote the orthonormal forward 2D DCT-II operator and

F^{- 1}

its inverse (DCT-III) [42]. For an input image

F \in R^{H \times W}

, the spectral coefficients

\hat{F} = F {F}

are given by:

{\hat{F}}_{u, v} = α_{u} α_{v} \sum_{x = 0}^{H - 1} \sum_{y = 0}^{W - 1} F_{x, y} cos [\frac{π u (2 x + 1)}{2 H}] cos [\frac{π v (2 y + 1)}{2 W}],

(17)

where

0 \leq u < H

,

0 \leq v < W

, and

α_{u}, α_{v}

are normalization constants. In this basis, the Laplacian is diagonal:

F {Δ_{h} U_{c}}_{u, v} = Λ_{u, v} {\hat{U}}_{c, u, v},

with eigenvalues given analytically by:

Λ_{u, v} = 2 (cos \frac{π u}{H} + cos \frac{π v}{W}) - 4, Λ_{u, v} \in [- 8, 0] .

(18)

The mode

(u, v) = (0, 0)

corresponds to the DC component with

Λ_{0, 0} = 0

.

Let

R_{c} = \nabla_{h} \cdot g_{c} - λ_{c} T_{c}

denote the discrete right-hand side. Applying

F

to the discrete screened Poisson equation decouples the system:

(Λ_{u, v} - λ_{c}) {\hat{U}}_{c, u, v} = {\hat{R}}_{c, u, v}, \forall (u, v) .

(19)

The solution is obtained via a simple spectral filter:

{\hat{U}}_{c, u, v} = \frac{{\hat{R}}_{c, u, v}}{Λ_{u, v} - λ_{c}}, U_{c} = F^{- 1} {{\hat{U}}_{c}} .

(20)

Stability and Regularization.

Since

λ_{c} > 0

and

Λ_{u, v} \leq 0

, the denominator

D_{u, v} : = Λ_{u, v} - λ_{c}

satisfies

{sup}_{u, v} D_{u, v} = - λ_{c} < 0

. In particular, the DC mode

(0, 0)

is well-defined, eliminating the global offset ambiguity typical of pure Poisson reconstruction. From an optimization perspective, the screening term

λ_{c} {∥ U_{c} - T_{c} ∥}_{2}^{2}

acts as a Tikhonov regularizer, stabilizing the inversion of low-frequency modes and anchoring the solution to

T_{c}

.

5.3. Tiled SPN Implementation for Gigapixel WSIs

Direct application of the spectral solver to a full-resolution WSI is impractical due to memory constraints. We therefore adopt a non-overlapping domain decomposition [43] where the slide domain

Ω

is partitioned into tiles

{Ω_{k}}_{k = 1}^{K}

. The key requirement is that each local problem must approximate the restriction of the single global SPN functional.

Figure 2. Tiled SPN pipeline. In the first phase, we compute slide-level global prior statistics. In the second phase, we restrict these priors to each tile and solve the screened Poisson equation using a DCT-based spectral solver, assembling the normalized tiles into a coherent WSI.

Our implementation proceeds in two phases:

1. Global Prior Estimation. We first compute the global statistics of the source (

I_{S}

) and reference (

I_{T}

) slides. For each channel c, we calculate the global moments

(μ_{S}^{glob}, σ_{S}^{glob})

and

(μ_{T}, σ_{T})

, define the chromatic anchor

T_{c}

via the affine map (Eq. 6), and estimate the global contrast factor

α_{c}

(Eq. 7). These quantities are computed once at the slide level and provide coherent priors for all local reconstructions.

2. Local Spectral Reconstruction. We solve the screened Poisson equation independently on each tile. For a tile

Ω_{k}

, we restrict the global priors (

T_{c, k} = T_{c} |_{Ω_{k}}

and

g_{c, k} = g_{c}^{glob} |_{Ω_{k}}

), assemble the local right-hand side

ρ_{c, k} = \nabla_{h} \cdot g_{c, k} - λ_{c} T_{c, k}

, and solve:

(Δ_{h} - λ_{c}) U_{c, k} = ρ_{c, k} on Ω_{k} .

(21)

The homogeneous Neumann boundary conditions are implicitly enforced by the DCT-II/III basis. The normalized WSI is obtained by concatenating the tile solutions,

{({\tilde{I}}_{S})}_{c} |_{Ω_{k}} = U_{c, k}

. By Theorem 4, interface discrepancies are exponentially localized near the boundaries with characteristic width

O (λ_{c}^{- 1 / 2})

.

Algorithm 1 Tiled Screened Poisson Normalization (SPN)

Require:: Source $I_{S}$ , Reference $I_{T}$ , parameters $τ, λ_{c}$ , tiling ${Ω_{k}}_{k = 1}^{K}$
Ensure:: Normalized WSI ${\tilde{I}}_{S}$
1:: Phase 1: Global Statistics
2:: for each channel $c \in {L^{*}, a^{*}, b^{*}}$ do
3:: Compute global source moments on $I_{S}$ and target moments on $I_{T}$
4:: Define global chromatic anchor $T_{c}$ via Eq. (6)
5:: Compute global contrast factor $α_{c}$ via Eq. (7)
6:: end for
7:: Phase 2: Tile-wise Spectral Reconstruction
8:: for each tile $Ω_{k}$ in parallel do
9:: Extract source patch $P_{k} \leftarrow I_{S} |_{Ω_{k}}$
10:: Compute structure weight $w_{k} (x) \leftarrow exp (- ∥ \nabla {(P_{k})}_{L^{*}} (x) ∥^{2} / τ^{2})$
11:: for each channel $c \in {L^{*}, a^{*}, b^{*}}$ do
12:: Restrict anchor: $T_{c, k} \leftarrow T_{c} |_{Ω_{k}}$
13:: Form guidance field:
14:: $g_{c, k} \leftarrow (1 - w_{k}) α_{c} \nabla {(P_{k})}_{c} + w_{k} \nabla T_{c, k}$
15:: Assemble RHS: $ρ_{c, k} \leftarrow \nabla_{h} \cdot g_{c, k} - λ_{c} T_{c, k}$
16:: Solve $(Δ_{h} - λ_{c}) U_{c, k} = ρ_{c, k}$ via DCT spectral filter (20)
17:: Update ${\tilde{I}}_{S} |_{Ω_{k}} \leftarrow U_{c, k}$
18:: end for
19:: end for
20:: return ${\tilde{I}}_{S}$

The boundary behavior of this tiled scheme is governed by the screened Poisson operator analyzed in Section 4. The screening term

λ_{c} > 0

introduces a characteristic length scale

ℓ_{c} \approx λ_{c}^{- 1 / 2}

, ensuring exponential decay of flux mismatches away from tile interfaces. Consequently, the assembled reconstruction is structurally and chromatically consistent with the global solution, with seams reduced to negligible boundary effects.

5.4. Computational Complexity

The computational cost is dominated by the 2D DCT spectral solves. For a single tile with

N_{k} = H_{k} W_{k}

pixels:

Assembly: Computing gradients, weights, and the right-hand side $ρ_{c, k}$ scales linearly, i.e., $O (N_{k})$ .
Spectral Solve: The 2D DCT is implemented via separable 1D fast cosine transforms, with a cost of $O (N_{k} log N_{k})$ .

The total cost for a WSI with

N = \sum_{k} N_{k}

pixels is

O (N log N)

, with memory complexity

O (N_{k})

per tile. This approach allows for trivial parallelization over tiles. In comparison, sparse direct solvers typically scale as

O (N^{1.5})

for 2D problems, while multigrid methods, though

O (N)

, require complex data structures. The tiled DCT implementation thus provides an effective balance between asymptotic efficiency and implementation simplicity for gigapixel images.

6. Experiments and Results

6.1. Experimental Setup

Datasets

To evaluate the effectiveness and robustness of the proposed algorithm, we conducted comparative experiments on multiple publicly available and clinical datasets exhibiting diverse staining conditions and imaging variations. In total, the evaluation encompassed 88 patients and more than 300 whole-slide histopathological images (WSIs).

Dataset 1: MITOS-ATYPIA-14 [35], released as part of the MITOS-ATYPIA Grand Challenge, was annotated by the Pathology Department of Pitié-Salpêtrière Hospital in Paris. It consists of 14 pairs of breast carcinoma slides, each scanned using two distinct digital pathology scanners—Aperio ScanScope XT and Hamamatsu NanoZoomer—allowing controlled analysis of inter-scanner color variation.
Dataset 2: Multi-Scanner Squamous Cell Carcinoma (SCC) dataset [36] contains 44 samples of canine cutaneous squamous cell carcinoma digitized using 5 different scanning systems, producing a total of 220 WSIs.
Dataset 3: HE-Staining Variation (HEV) dataset [37] provided by Heidelberg University. The HEV dataset comprises follicular thyroid carcinoma slides prepared under nine distinct staining protocols: standard H&E, over-stained H&E (longHE), under-stained H&E (shortHE), hematoxylin-only (onlyH), eosin-only (onlyE), over-stained hematoxylin (longH), over-stained eosin (longE), under-stained hematoxylin (shortH), and under-stained eosin (shortE). These variations facilitate the evaluation of color normalization under chemically induced staining inconsistencies.
Dataset 4: SUSY-BF-10 clinical dataset (local collection). To address the temporal color degradation that public datasets do not capture, we constructed a clinical dataset from Sun Yat-sen Memorial Hospital, Sun Yat-sen University. The SUSY-BF-10 dataset includes 25 long-term archived breast carcinoma slides, each preserved for over ten years and rescanned to analyze fading and chromatic shifts over time.

A summary of dataset characteristics and representative visualizations is provided in Table 1 and Figure 3.

Implementation Details

In surveying the literature on stain normalization, we observe that most existing methods are evaluated primarily on pre-extracted WSI tiles and are often positioned as a preprocessing step for downstream tasks such as classification or segmentation. In contrast, the ultimate goal of digital pathology in the clinical setting is to provide pathologists with visually consistent slides, especially in cross-site, multi-center collaborations and remote diagnostic workflows. Under these scenarios, a robust and interpretable color normalization scheme at the whole-slide level is of central importance.

Motivated by this observation, we design our experiments to assess performance from two complementary perspectives:

1.: Full-resolution WSI normalization. We evaluate the behavior of each method when applied directly to entire, full-size histopathology slides.
2.: Tile-based reconstruction. We further consider the practically common pipeline in which a large WSI is first partitioned into tiles, normalized tile-wise, and then reassembled into a full-resolution slide. This setting is particularly relevant for memory-constrained or streaming-based deployments.

In addition to the four real-world datasets used in our study, we also construct a synthetic test image (synthetic_image) with controlled color and structural patterns. This synthetic benchmark provides an intuitive and stress-test-style assessment of robustness and generalization across a wide range of stain variations.

All experiments are conducted on a server equipped with an Intel(R) Xeon(R) Platinum 8352V CPU @ 2.10 GHz, an NVIDIA RTX 4090 GPU with 24 GB of VRAM, and 512 GB of system memory. The software stack is based on Python 3.8 and PyTorch 2.5.

Baselines

We compare the proposed method against three representative baselines:

Reinhard. The classical global Reinhard color transfer algorithm [6] serves as our primary reference for global-statistics–based normalization. It matches low-order statistics between source and target images in a decorrelated color space and remains widely used in digital pathology.
GCTI. We include a geometry-aware method with local alignment in the stain vector space (GCTI) [9], which explicitly exploits the underlying geometric structure of color distributions to compensate for scanner- and protocol-induced variations in microscopy images within a unified framework.
CycleGAN. As a deep learning–based baseline, we employ a CycleGAN-style stain transfer model [10]. This method learns a non-linear mapping between source and target stain domains from unpaired data and represents a strong data-driven alternative to hand-crafted normalization schemes.

These baselines jointly cover global-statistics approaches, geometry-aware model-based methods, and modern generative adversarial networks, providing a comprehensive context for evaluating the proposed gradient-domain diffusion normalization.

6.2. Evaluation Metrics

In practice, there is still no widely accepted, unified standard for quantitative evaluation in the stain normalization task. Existing work typically adopts a mixture of structural similarity measures, color-space or gamut-based criteria, and performance gains on downstream tasks (e.g., classification or segmentation). From a clinical perspective, however, the ideal outcome is clear: the normalized image should (i) preserve the structural and morphological content of the source image and (ii) match the color space and distribution of the target image as closely as possible. Achieving this would allow pathology images acquired at different hospitals, regions, and from heterogeneous devices to be mapped to a common color standard, thereby reducing color-induced variability, improving diagnostic efficiency for pathologists, and ultimately promoting more standardized diagnostic criteria.

To better assess the performance of different algorithms under this perspective, we evaluate: (1) the structural similarity between the source image and the normalized result, and (2) the chromatic alignment between the normalized result and the target image. Concretely, we use the Structural Similarity Index (SSIM) [44] as a morphology-oriented metric between the source

I_{S}

and the normalized image

{\tilde{I}}_{S}

, and the 1st-Wasserstein distance [45] between the color distributions of

{\tilde{I}}_{S}

and the target

I_{T}

. Considering the entanglement of structure and color in the RGB space, all metrics are computed in the CIELab color space. The specific definitions are as follows.

Structural Similarity Index (SSIM). To quantify the preservation of tissue morphology and local texture, we employ the SSIM metric. Let

x

and

y

denote local window patches extracted from the luminance channels of the source image

I_{S}

and the normalized image

{\tilde{I}}_{S}

, respectively. The SSIM is defined as:

SSIM (x, y) = \frac{(2 μ_{x} μ_{y} + C_{1}) (2 σ_{x y} + C_{2})}{(μ_{x}^{2} + μ_{y}^{2} + C_{1}) (σ_{x}^{2} + σ_{y}^{2} + C_{2})},

(22)

where

μ_{x}, μ_{y}

represent the local means,

σ_{x}^{2}, σ_{y}^{2}

denote the local variances, and

σ_{x y}

is the cross-covariance between the two patches. The constants

C_{1} = {(k_{1} L)}^{2}

and

C_{2} = {(k_{2} L)}^{2}

are included to ensure numerical stability, where L is the dynamic range of pixel values (e.g.,

L = 255

for 8-bit images). The final SSIM score is computed as the mean over all sliding windows in the image domain.

Wasserstein Distance (W-D). To evaluate chromatic alignment, we calculate the 1st-Wasserstein distance (also known as Earth Mover’s Distance) between the color distributions of the target image

I_{T}

and the result

{\tilde{I}}_{S}

. For a specific color channel

c \in {a^{*}, b^{*}}

, let

P_{c}

and

Q_{c}

represent the probability distributions of pixel intensities in

I_{T}

and

{\tilde{I}}_{S}

, respectively. The Wasserstein distance

W_{1} (P_{c}, Q_{c})

is defined as the minimal cost to transform one distribution into the other:

W_{1} (P_{c}, Q_{c}) = inf_{γ \in Π (P_{c}, Q_{c})} E_{(u, v) \sim γ} [| u - v |],

(23)

where

Π (P_{c}, Q_{c})

denotes the set of all joint distributions

γ (u, v)

whose marginals are

P_{c}

and

Q_{c}

. Since we operate on 1D marginal distributions of color channels, Eq. (23) has a closed-form solution expressed via the cumulative distribution functions (CDFs),

F_{T} (z)

and

F_{R} (z)

:

W_{1} (P_{c}, Q_{c}) = \int_{- \infty}^{+ \infty} | F_{T} (z) - F_{R} (z) | d z .

(24)

A lower

W_{1}

score indicates superior alignment of the statistical color profile, without assuming Gaussianity of the underlying distributions.

6.3. Results and Visual Analysis

We complement the quantitative evaluation with a detailed visual comparison across four representative datasets and a synthetic benchmark. This analysis focuses on both full-size whole-slide views and tiled reconstructions.

Full size baseline test

As illustrated in the first two rows of Figure 4, both Reinhard and SPN achieve faithful chromatic adaptation. This performance is consistent with their explicit statistical matching in the CIELab space, where the

a^{*}

and

b^{*}

channels isolate chromaticity from luminance. In contrast, the third and fourth rows reveal the limited generalization of CycleGAN on out-of-distribution data: when applied to non-MITOS samples, the model introduces a characteristic “deep red” bias inherent to its training set rather than correctly adapting to the target domain. Furthermore, compared to the global Reinhard method, SPN and GCTI preserve high-frequency morphological details, exhibiting significantly sharper cellular boundaries at high magnification.

SCC dataset: full-size vs. tiled normalization.

The SCC dataset contains slides acquired from five different scanners with codes cs2, gt450, nz20, nz210, and p100. We use cs2 as the source device and treat the remaining four devices as distinct targets. Figure 5 compares color normalization on both full-size images and tiled images. The full-size resolution is

2000 \times 1600

, and the tiled setting uniformly partitions each image into a

5 \times 4

grid of non-overlapping patches.

As shown in Figure 5, tile-wise normalization via Reinhard and GCTI introduces distinct boundary discontinuities arising from inconsistent local statistics. While SPN effectively eliminates these seams through gradient preservation, CycleGAN displays a diminished checkerboard effect relative to the statistical methods. This partial suppression is attributed to the shared generator, which enforces a consistent color transform across tiles. Yet, this same mechanism limits adaptability, causing CycleGAN to hallucinate the MITOS style onto the SCC samples rather than respecting the target domain’s specific appearance.

HEV dataset: Multi-stained pathology images.

The HEV dataset encompasses nine distinct staining configurations, offering the richest color diversity among the datasets analyzed. We utilize standard HE as the source domain, treating the remaining eight protocols as targets. For brevity, and because full-slide comparisons show limited variation, Figure 6 focuses exclusively on the tiled results.

The CycleGAN results in Figure 6 mirror those in Figure 5: blocking artifacts are diminished relative to Reinhard and GCTI due to the global constraints of the generator. More significantly, this experiment highlights the robustness limitations of the GCTI framework. GCTI employs a local wedge-finding algorithm that projects pixel intensities onto a subspace defined by SVD, inferring stain vectors via angular percentiles (e.g.,

0.1

th and

99.9

th). This mechanism is susceptible to failure in extreme scenarios: (1) Illumination correction singularities, where near-zero background estimates lead to division-by-zero errors and intensity saturation; and (2) Thresholding instability in uniform regions, where the lack of color variance renders percentile-based thresholds arbitrary. Figure 6 clearly demonstrates the former, where instability in the background estimation step produces extensive artifacts characterized by signal saturation and loss of texture.

Evaluation on Synthetic Image

To rigorously assess algorithmic stability under limiting conditions, we introduce a synthetic image characterized by a smooth background gradient and a central geometric object (Figure 7).

The first row of Figure 7 presents the full-resolution normalization results.SPN and Reinhard achieve the most faithful reconstruction of the background gradient, demonstrating robust structural consistency across both low-frequency trends and high-frequency edges. In contrast, GCTI exhibits a catastrophic failure mode driven by algorithmic instability: the central large circle is misclassified as background, causing the denominator in the illumination correction step to approach zero. This singularity leads to numerical explosion, resulting in a saturated, near-uniform output in the background region.

Additionally, the magnified regions in the second row reveal that CycleGAN produces spurious ringing artifacts around the circular structures, a phenomenon inconsistent with the behavior of the variational and statistical methods. This artifact stems from the generative network’s reliance on learned upsampling filters (transposed convolutions) which, unlike rigorous interpolation schemes, can hallucinate high-frequency noise to satisfy the discriminator. Such artifacts underscore the critical deficit of deep generative models regarding interpretability: unlike the proposed framework, which is constrained by a physical diffusion process, the GAN introduces opaque, architectural artifacts that compromise diagnostic confidence. The tiled results (Row 3) further expose the stability limits of the baseline methods, particularly regarding well-posedness on disjoint subdomains. First, statistical methods break down on homogeneous patches. Reinhard normalization relies on matching first- and second-order moments; when a target tile consists of a pure background (zero variance), the scaling factors become undefined, rendering the transfer function ill-posed. Similarly, GCTI fails in these homogeneous regions because the lack of spectral diversity renders the SVD-based stain estimation rank-deficient, leading to numerical breakdown.

Conversely, SPN maintains both global gradient continuity and local geometric fidelity across all tiles. This robustness confirms that the proposed variational framework effectively regularizes the normalization problem, preventing the degenerate behavior observed in purely statistical approaches when local data is sparse.

6.4. Quantitative Results and Discussion

Table 2 presents a comprehensive quantitative comparison of our proposed SPN against three state-of-the-art methods: CycleGAN, Reinhard, and GCTI. We utilize the Structural Similarity Index Measure (SSIM) to evaluate structure preservation (↑) and the Wasserstein Distance (W-D) to assess stain distribution matching (↓).

It is worth noting that the classic Reinhard and GCTI algorithms remain strong baselines, particularly on relatively simple and highly structured datasets such as SYSU-BF-10 and MITOS-14. In these cases, Reinhard achieves competitive or even state-of-the-art results (e.g., SSIM of 0.981 on SYSU-BF-10). This is likely because these datasets exhibit unimodal color distributions where simple global statistical alignment (mean and standard deviation matching) is sufficient. However, its performance degrades significantly on complex datasets like HEV (W-D often

> 4.0

), where stain variations are non-linear and multi-modal.

As previously discussed, CycleGAN utilizes fixed weights trained on MITOS. While it yields consistent structural outputs (omitted entries in Table 2), its inability to adapt to the specific target reference leads to large domain discrepancies, evidenced by high W-D scores (e.g., 25.624 on SCC).

Our method demonstrates robust performance across diverse scenarios, with its advantages becoming most pronounced on the synthetic_image. This dataset serves as a rigorous benchmark for “general stain normalization” as it provides paired ground truth. On this set, SPN achieves the best results in both metrics (SSIM: 0.950, W-D: 3.706). This confirms that our gradient-guided diffusion mechanism effectively learns the intrinsic mapping between domains without overfitting to specific dataset artifacts, demonstrating superior generalization compared to baselines.

7. Conclusions

We presented Screened Poisson Normalization (SPN), a physics-informed framework that reformulates stain normalization as a screened Poisson reconstruction problem. By modeling the staining process as a reaction–diffusion system, SPN explicitly bridges statistical moment matching with physically interpretable gradient-domain regularization.

Our theoretical and experimental analysis yields three key conclusions:

1.: Structural Fidelity and Scalability: Operating in the gradient domain allows SPN to preserve diagnostic morphological details (e.g., nuclear boundaries) often degraded by pure statistical alignment. Furthermore, the exponential decay of the screened Green’s function enables rigorous non-overlapping domain decomposition, permitting seamless gigapixel processing without tiling artifacts.
2.: Determinism vs. Data-Dependence: In contrast to deep learning approaches (e.g., CycleGAN) which are prone to domain bias and generative hallucinations on out-of-distribution data, SPN is training-free and deterministic. This ensures that all output structures are causally linked to the input, providing the explainability and reproducibility essential for clinical diagnostics.
3.: Robustness to Heterogeneity: SPN demonstrates consistent performance across diverse scanners, staining protocols, and archived slides. The global anchoring (screening) term effectively stabilizes the solution in tissue-sparse or background-dominated regions, overcoming the numerical brittleness observed in local geometry-based methods.

In summary, SPN offers a transparent, mathematically well-posed alternative to "black-box" style transfer, establishing a stable foundation for computational pathology in heterogeneous clinical settings. Future work will focus on parallel optimization for high-throughput deployment and extending the gradient-domain formulation to multi-stain separation and volumetric reconstruction.

Author Contributions

Conceptualization, J.X.; Methodology, J.X.; Software, J.X.; Formal analysis, J.X.; Validation, J.X.; Visualization, J.X.; Writing—original draft, J.X.; Writing—review and editing, J.X., H.N. and Q.W.; Investigation, H.N. and Q.W.; Resources, H.N.; Data curation, H.N. and Y.J.; Supervision, J.X. and Q.W.; Project administration, J.X.; Funding acquisition, J.X. ,Q.W., H.N. All authors have read and agreed to the published version of the manuscript.

Funding

Junlong Xing was partially supported by the National Key R&D Program of China (No.2020YFA0712500, No.2021YFA1001300),the Research and Development Project of Pazhou Lab (Huangpu) (Grant Number 2023K0601).Qiru Wang was partially supported by the National Natural Science Foundation of China (No. 12471176) and the Guangdong Basic and Applied Basic Research Foundation (No.2025A1515012221). Hengli Niwas partially supported by the Young Scientists Fund of the National Natural Science Foundation of China (82303584) and the Guangdong Provincial Key Laboratory of Mathematical and Neural Dynamical Systems (2024B1212010004).

Institutional Review Board Statement

The publicly available datasets used in this study are cited in the References. For the curated clinical WSI collection from Sun Yat-sen University mentioned in the Data Availability Statement, institutional approval and an appropriate data-use agreement were obtained; details can be provided by the corresponding author upon reasonable request.

Informed Consent Statement

For the curated clinical WSI collection, informed-consent requirements were handled according to the approving institutional policies and local regulations (i.e., consent was obtained or waived by the Institutional Review Board as appropriate).

Data Availability Statement

The data supporting the findings of this study comprise three publicly available whole-slide histopathology datasets and one locally collected clinical dataset. The public datasets used in our experiments—MITOS-ATYPIA-14 [35], the Multi-Scanner Squamous Cell Carcinoma (SCC) dataset [36], and the HE-Staining Variation (HEV) dataset [37]—are available from their respective providers under the original terms of use and licensing described in the corresponding references. In addition, we curated a clinical WSI collection from Sun Yat-sen Memorial Hospital, Sun Yat-sen University (SYSU-BF-10; referred to as SYSU-BF-10 in the main text), consisting of 25 long-term archived breast carcinoma slides (archived for over ten years and rescanned to study temporal color degradation). Due to ethics, privacy, and institutional governance requirements for clinical data, SYSU-BF-10 is not publicly available at the time of publication. We will release the SYSU-BF-10 dataset after the subsequent ethics and privacy procedures have been reviewed. Until completion of these procedures, access may be considered on reasonable request to the corresponding author, subject to institutional approval and an appropriate data-use agreement.

Acknowledgments

The authors thank the dataset providers and the open-source community for making resources available.

Conflicts of Interest

The authors declare no conflict of interest.

References

Xu, C.; Sun, Y.; Zhang, Y.; Liu, T.; Wang, X.; Hu, D.; Huang, S.; Li, J.; Zhang, F.; Li, G. Stain Normalization of Histopathological Images Based on Deep Learning: A Review. Diagnostics 2025, 15(8), 1032. [Google Scholar] [CrossRef]
Hetz, M.J.; Bucher, T.C.; Brinker, T.J. Multi-domain stain normalization for digital pathology: A cycle-consistent adversarial network for whole slide images. Medical Image Analysis 2024, 94, 103149. [Google Scholar] [CrossRef] [PubMed]
Wang, C.; Li, S.; Ke, J.; Zhang, C.; Shen, Y. RandStainNA++: Enhance random stain augmentation and normalization through foreground and background differentiation. IEEE Journal of Biomedical and Health Informatics 2024, 28, 3660–3671. [Google Scholar] [CrossRef] [PubMed]
Baykal Kablan, E. Regional realness-aware generative adversarial networks for stain normalization. Neural Computation and Applications 2023, 35, 17915–17927. [Google Scholar] [CrossRef]
Li, Y.; Zhou, H.; Liu, N.; Shen, Y. (2023). Stain normalization and augmentation in frequency space for histology analysis. In 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2031–2035.
Reinhard, E.; Ashikhmin, M.; Gooch, B.; Shirley, P. Color transfer between images. IEEE Computer Graphics and Applications 2001, 21(5), 34–41. [Google Scholar] [CrossRef]
Macenko, M.; Niethammer, M.; Marron, J.S.; Borland, D.; Woosley, J.T.; Guan, X.; Schmitt, C.; Thomas, N.E. (2009). A method for normalizing histology slides for quantitative analysis. In 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 1107–1110.
Vahadane, A.; et al. Structure-preserving color normalization and sparse stain separation. IEEE Transactions on Medical Imaging 2016, 35(8), 1962–1971. [Google Scholar] [CrossRef]
Gupta, A.; Duggal, R.; Gehlot, S.; Gupta, R.; Mangal, A.; Kumar, L.; Thakkar, N.; Satpathy, D. GCTI-SN: Geometry-inspired chemical and tissue invariant stain normalization of microscopic medical images. Medical Image Analysis 2020, 65, 101788. [Google Scholar] [CrossRef]
Runz, M.; Rusche, D.; Schmidt, S.; et al. Normalization of H&E-stained histological images using cycle consistent generative adversarial networks. Diagnostic Pathology 2021, 16, 71. [Google Scholar]
Kang, H.; Luo, D.; Feng, W.; et al. StainNet: A fast and robust stain normalization network. Frontiers in Medicine 2021, 8, 746307. [Google Scholar] [CrossRef]
Shaban, M.T.; Baur, C.; Navab, N.; Albarqouni, S. (2019). Staingan: Stain style transfer for digital histological images. In 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), 953–956.
BenTaieb, A.; Hamarneh, G. Adversarial stain transfer for histopathology image analysis. IEEE Transactions on Medical Imaging 2017, 37(3), 792–802. [Google Scholar] [CrossRef]
Zarella, M.D.; Yeoh, C.; Breen, D.E.; Garcia, F.U. An alternative reference space for H&E color normalization. PLoS One 2017, 12(3), e0174489. [Google Scholar]
Magee, D.; Treanor, D.; Crellin, D.; Shires, M.; Smith, K.; Mohee, K.; Quirke, P. (2009). Colour normalisation in digital histopathology images. Proceedings of Optical Tissue Image Analysis in Microscopy, Histopathology and Endoscopy (MICCAI Workshop), 100–111.
Alsubaie, N.; Trahearn, N.; Raza, S.E.A.; Snead, D.; Rajpoot, N.M. Stain deconvolution using statistical analysis of multi-resolution stain colour representation. PLoS One 2017, 12(1), e0169875. [Google Scholar] [CrossRef] [PubMed]
Roy, S.; Lal, S.; Kini, J.R. Novel color normalization method for Hematoxylin & Eosin stained histopathology images. IEEE Access 2019, 7, 28982–28998. [Google Scholar] [CrossRef]
Vijh, S.; Saraswat, M.; Kumar, S. A new complete color normalization method for H&E stained histopathological images. Applied Intelligence 2021, 51, 1–14. [Google Scholar]
Zheng, Y.; Jiang, Z.; Zhang, H.; Xie, F.; Shi, J.; Xue, C. Adaptive color deconvolution for histological WSI normalization. Computer Methods and Programs in Biomedicine 2019, 170, 107–120. [Google Scholar] [CrossRef]
Nadeem, S.; Hollmann, T.; Tannenbaum, A. (2020). Multimarginal Wasserstein barycenter for stain normalization and augmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 362–371.
Shrivastava, A.; Adorno, W.; Sharma, Y.; et al. (2021). Self-attentive adversarial stain normalization. In Pattern Recognition. ICPR Workshops and Challenges, 120–140. Springer.
Nishar, H.; Chavanke, N.; Singhal, N. (2020). Histopathological stain transfer using style transfer network with adversarial loss. In Medical Image Computing and Computer Assisted Intervention (MICCAI), 330–340.
Liang, H.; Plataniotis, K.N.; Li, X. (2020). Stain style transfer of histopathology images via structure-preserved generative learning. In Machine Learning for Medical Image Reconstruction, 153–162. Springer.
Kausar, T.; Kausar, A.; Ashraf, M.A.; et al. SA-GAN: Stain acclimation generative adversarial network for histopathology image analysis. Applied Sciences 2021, 12, 288. [Google Scholar] [CrossRef]
Cho, H.; Lim, S.; Choi, G.; Min, H. Neural stain-style transfer learning using GAN for histopathological images. arXiv 2017, arXiv:1710.08543. [Google Scholar] [CrossRef]
Lee, C.C.; Kuo, P.T.P.; Peng, C.H. (2022). H&E stain normalization using U-net. In 2022 IEEE 22nd International Conference on Bioinformatics and Bioengineering (BIBE), 29–32.
Menning, J.D.M.; Wallmersperger, T.; Meinhardt, M.; Ehrenhofer, A. Modeling and simulation of diffusion and reaction processes during the staining of tissue sections on slides. Histochemistry and Cell Biology 2022, 158, 137–148. [Google Scholar] [CrossRef]
Gurina, T.S.; Simms, L. Histology, Staining. In StatPearls [Internet]; StatPearls Publishing, 2023. [Google Scholar]
Suvarna, K.S.; Layton, C.; Bancroft, J.D. (Eds.) Bancroft’s Theory and Practice of Histological Techniques, 8th ed.; Elsevier, 2019. [Google Scholar]
Mayerhöfer, T.G.; Pahlow, S.; Popp, J. The Bouguer-Beer-Lambert law: Shining light on the obscure. ChemPhysChem 2020, 21(18), 2029–2046. [Google Scholar] [CrossRef]
Murray, J.D. Mathematical Biology I: An Introduction, 3rd ed.; Springer: New York, NY, USA, 2002. [Google Scholar]
Sheehan, D.C.; Hrapchak, B.B. Theory and Practice of Histotechnology, 3rd ed.; Mosby, 2012. [Google Scholar]
Hetz, M.J.; Bucher, T.C.; Brinker, T.J. Multi-domain stain normalization for digital pathology: A cycle-consistent adversarial network for whole slide images. Medical Image Analysis 2024, 94, 103149. [Google Scholar] [CrossRef]
Landini, G. Colour deconvolution with ImageJ and Fiji: a bibliography. Bioinformatics 2021, 37, 4181–4182. [Google Scholar]
Roux, L.; Racoceanu, D.; Loménie, N.; et al. Mitosis detection in breast cancer histological images: An ICPR 2012 contest. Journal of Pathology Informatics 2013, 4, 8. [Google Scholar] [CrossRef]
Wilm, F.; Fragoso, M.; Bertram, C.A.; Stathonikos, N.; Öttl, M.; Qiu, J.; Klopfleisch, R.; Maier, A.; Breininger, K.; Aubreville, M. Multi-scanner canine cutaneous squamous cell carcinoma histopathology dataset. arXiv 2023, arXiv:2301.04423. [Google Scholar] [CrossRef]
Runz, M.; Weis, C.-A. (2021). Normalization of HE-Stained Histological Images using Cycle Consistent Generative Adversarial Networks [Dataset]. heiDATA, V1. [CrossRef]
Pérez, P.; Gangnet, M.; Blake, A. (2023). Poisson image editing. In Seminal Graphics Papers: Pushing the Boundaries, Volume 2 (pp. 577–582).
Habring, A.; Holler, M. Neural-network-based regularization methods for inverse problems in imaging. GAMM-Mitteilungen 2024, 47(4), e202470004. [Google Scholar] [CrossRef]
Hansen, P.C. (2010). Discrete Inverse Problems: Insight and Algorithms. SIAM.
Benning, M.; Burger, M. Modern regularization methods for inverse problems. Acta Numerica 2018, 27, 1–111. [Google Scholar] [CrossRef]
Fortunato, D.; Townsend, A.; Trefethen, L.N. A Spectral Method for Elliptic Problems in a Rectangle. IMA Journal of Numerical Analysis 2020, 40, 2250–2280. [Google Scholar]
Klawonn, A.; Lanser, M.; Weber, J. Machine learning and domain decomposition methods - a survey. Computational Science and Engineering 2024, 1(1), 2. [Google Scholar] [CrossRef]
Bakurov, I.; Buzzelli, M.; Schettini, R.; Bianco, S.; Raimondo, F. Structural similarity index (SSIM) revisited: A data-driven approach. Expert Systems with Applications 2022, 189, 116087. [Google Scholar] [CrossRef]
Nguyen, K.; Ho, N. Revisiting sliced Wasserstein on images: From vectorization to convolution. Advances in Neural Information Processing Systems 2022, 35, 17788–17801. [Google Scholar]

Figure 1. Typical workflow of WSIs imaging , illustrating color variations arising from staining and scanner differences. Staining protocol: standard H&E (HE), long time stained Eosin (longE), and long time stained H&E (longHE). Scanners: Aperio ScanScope CS2 (CS2), NanoZoomer 2.0-HT (NZ20), NanoZoomer S210 (NZ210), and Aperio GT 450 (GT450).

Figure 3. Visualizations for the four datasets. The HEV dataset (row a) demonstrates the five different staining protocols on serial sections: (a1) longE, (a2) onlyE, (a3) H&E, (a4) shortE, and (a5) shortH. The SYSU-BF-10 dataset (row b), from (b1) to (b5), displays stained sections from five different patients, illustrating staining and fading conditions over a 10-year period. The SCC dataset (row c) presents imaging results acquired from five different scanning devices. Finally, in the MITOS-ATYPIA-14 dataset (row d), images (d1), (d2), and (d3) represent one set at three different magnification levels (10×, 20×, and 40×), while images (d4), (d5), and (d6) form a color control group scanned with a different device at the same three magnification levels.

Figure 4. Full-size results of different methods on the MITOS-ATYPIA-14 and SYSU-BF-10 datasets. Normalization setup for the MITOS-ATYPIA-14 (cross-scanner: Aperio to Hamamatsu,

10 \times

/

40 \times

) and SYSU-BF-10 (temporal drift: 10-year rescan) datasets.Note that the CycleGAN results are obtained using a model trained only on mitos dataset.

Figure 4. Full-size results of different methods on the MITOS-ATYPIA-14 and SYSU-BF-10 datasets. Normalization setup for the MITOS-ATYPIA-14 (cross-scanner: Aperio to Hamamatsu,

10 \times

/

40 \times

) and SYSU-BF-10 (temporal drift: 10-year rescan) datasets.Note that the CycleGAN results are obtained using a model trained only on mitos dataset.

Figure 5. Full-size and tiled normalization results on the multi-scanner SCC dataset (cs2, gt450, nz20, nz210, p100). Due to limited generalization outside the MITOS-14 training set, CycleGAN produces identical outputs across different target domains. We therefore display the base CycleGAN reconstruction in Column 2, with corresponding high-magnification details shown in col 4, 6, and 8.

Figure 6. Tiled-image results of different methods on the HEV dataset. The CycleGAN model is still trained on MITOS-ATYPIA-14, leading to nearly identical outputs across the eight HEV targets; we therefore show a single CycleGAN row with selected regions magnified.

Figure 7. Robustness evaluation on the synthetic imgae. Row 1: Full-resolution normalization. SPN preserves the background gradient faithfully. GCTI exhibits a division-by-zero singularity due to background misclassification, causing saturation. Rows 2–3: Tiled reconstruction using a non-overlapping grid. CycleGAN traind by mitos data Reinhard fails on homogeneous tiles where statistical moments are undefined. GCTI suffers from rank deficiency in zero-gradient regions (pure background); tiles where the algorithm failed to converge are masked in gray.

Table 1. Summary of datasets used for experimental evaluation.

Dataset	#Samples	#WSIs	Magnification	Color variation source	Body part
MITOS-ATYPIA-14	14	28	40×/0.25 $μ$ m	Two imaging scanners	Breast
SCC dataset	44	220	40×/0.25 $μ$ m	Five imaging scanners	Skin
HEV dataset	1	9	40×/0.25 $μ$ m	Nine staining protocols	Thyroid
SUSY-BF-10	25	50	40×/0.25 $μ$ m	Color fading in 10 years	Breast

Table 2. Quantitative comparison of different stain normalization methods. The best results (max SSIM and min W-D) for each row are highlighted in bold.

Dataset	Pair	CycleGAN^†		Reinhard		GCTI		SPN (ours)
Dataset	Pair	SSIM ↑	W-D ↓	SSIM ↑	W-D ↓	SSIM ↑	W-D ↓	SSIM ↑	W-D ↓
SYSU-BF	18-n → o	0.742	18.572	0.981	0.751	0.970	1.104	0.875	0.689
mitos-14	A06_01 → H	0.790	7.274	0.911	2.558	0.813	7.042	0.822	3.027
HEV data	HE → longE	0.957	15.392	0.920	4.280	0.736	6.338	0.983	3.524
	HE → onlyE	–	12.134	0.729	2.464	0.832	6.485	0.718	1.940
	HE → shortE	–	11.902	0.932	5.577	0.832	13.999	0.992	3.705
	HE → longHE	–	5.892	0.765	3.370	0.656	10.915	0.814	4.255
	HE → onlyH	–	18.802	0.920	2.540	0.634	4.427	0.979	3.672
	HE → shortHE	–	15.113	0.873	3.488	0.667	4.096	0.896	3.118
	HE → longH	–	4.792	0.875	4.745	0.755	22.539	0.921	2.146
	HE → shortH	–	13.199	0.825	2.918	0.694	13.107	0.857	3.218
SCC data	cs2 → gt450	0.921	25.624	0.679	0.495	0.757	1.873	0.641	0.360
	cs2 → nz20	–	20.885	0.838	0.801	0.898	1.950	0.882	1.715
	cs2 → nz210	–	16.042	0.880	0.524	0.917	2.074	0.827	0.581
	cs2 → p1000	–	10.979	0.886	1.022	0.798	4.310	0.912	1.579
synthetic_image		0.816	17.433	0.906	4.352	0.831	17.117	0.950	3.706

^†CycleGAN model trained on the MITOS-ATYPIA-14 dataset. Since SSIM is calculated between result and source image, for the same source test, we omit meaningless comparisons using “–”.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

A Variational Screened Poisson Reconstruction for Whole-Slide Stain Normalization

Abstract

Keywords:

Subject:

1. Introduction

1.1. Background

1.2. Related Works

1.3. Main Contributions

2. Mathematical Foundations

2.1. Variational Formulation of the Inverse Problem

2.2. Physical Isomorphism: Reaction–Diffusion Kinetics

3. The Screened Poisson Normalization (SPN) Model

4. Theoretical Analysis

5. Optimization and Implementation

5.1. Discrete and Non-Overlapping Domain Decomposition

5.2. Spectral Diagonalization and DCT Solver

Stability and Regularization.

5.3. Tiled SPN Implementation for Gigapixel WSIs

5.4. Computational Complexity

6. Experiments and Results

6.1. Experimental Setup

Datasets

Implementation Details

Baselines

6.2. Evaluation Metrics

6.3. Results and Visual Analysis

Full size baseline test

SCC dataset: full-size vs. tiled normalization.

HEV dataset: Multi-stained pathology images.

Evaluation on Synthetic Image

6.4. Quantitative Results and Discussion

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

MDPI Initiatives

Important Links

Subscribe