Preprint Article — This version is not peer-reviewed.

Modified Algorithm for 2D Maximum Sum Subarray Problem

Submitted: 25 April 2026. Posted: 28 April 2026.
Abstract
This paper presents a two-phase adaptive algorithm for the 2D Maximum Sum Subarray Problem. By reframing the search order to establish a single-column baseline first, the algorithm generates mathematical pruning bounds in O(NM) time. These bounds are then used in a second phase to skip unpromising multi-column scans in O(1) time each. This two-phase approach achieves a quadratic best-case floor of O(M² + NM), significantly improves expected performance across typical data distributions, and retains the cubic worst case. The adaptive strategy thus bridges the gap between theoretical sub-cubic complexity and practical implementation.

1. Background

The classical approach to the 2D Maximum Sum Subarray Problem is an extension of the 1D problem famously solved by Kadane's algorithm [1,2]. Traditionally, the 2D variant is reduced to O(M²) 1D problems applied to all possible aggregated columns (strips). This exhaustive cubic approach remains the benchmark in algorithmic theory due to its simplicity and low constant-factor overhead.
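For reference, this exhaustive reduction can be sketched in Java as follows (a minimal illustration, not the benchmarked implementation; identifiers are illustrative):

```java
// Baseline O(M^2 * N) reduction: for every left column, grow the strip to
// the right and run Kadane's 1D algorithm on the aggregated row sums.
class Kadane2DBaseline {
    static long kadane2D(int[][] a) {
        int n = a.length, m = a[0].length;
        long best = Long.MIN_VALUE;
        for (int left = 0; left < m; left++) {
            long[] strip = new long[n];          // row sums over columns [left, right]
            for (int right = left; right < m; right++) {
                for (int i = 0; i < n; i++) strip[i] += a[i][right];
                long curr = 0;                   // standard 1D Kadane on the strip
                for (int i = 0; i < n; i++) {
                    curr += strip[i];
                    if (curr > best) best = curr;
                    if (curr < 0) curr = 0;
                }
            }
        }
        return best;
    }
}
```

The inner Kadane pass costs O(N) for each of the O(M²) column pairs, giving the cubic benchmark the adaptive algorithm is measured against.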
While research, notably [3], has focused on lowering theoretical asymptotic bounds using complex transformations, these methods often encounter substantial practical limitations. This paper proposes a middle ground that prioritizes algorithmic engineering and practical effectiveness via branch-and-bound principles [4].

2. Formal Pruning Logic

The efficiency of the proposed algorithm relies on establishing a mathematical bound that allows vertical scans to be skipped. We define M_c(j) as the maximum 1D subarray sum of column j in the original matrix A.
Theorem 1. 
For any aggregated column bounded horizontally by columns [c1, c2], the maximum subarray sum S is bounded above by the sum of individual column maximums:

S ≤ Σ_{j=c1}^{c2} M_c(j)
Proof. 
Let A_{i,j} denote the elements of the original matrix A. Consider any aggregated column defined by column indices [c1, c2]. A subarray of this aggregated column corresponds to a row range [r1, r2].
The sum S is given by:

S = Σ_{i=r1}^{r2} Σ_{j=c1}^{c2} A_{i,j}

By reordering the summations, we express the total sum as the sum of column slices:

S = Σ_{j=c1}^{c2} Σ_{i=r1}^{r2} A_{i,j}

For any specific column j, the term Σ_{i=r1}^{r2} A_{i,j} ≤ M_c(j) by definition of the 1D column maximum. Summing this inequality across all columns in the range [c1, c2] yields:

Σ_{j=c1}^{c2} Σ_{i=r1}^{r2} A_{i,j} ≤ Σ_{j=c1}^{c2} M_c(j)

Thus, S ≤ Σ_{j=c1}^{c2} M_c(j), proving that the sum of pre-calculated column maximums provides a rigorous upper bound for the maximum subarray sum of the aggregated column. □
Corollary 1. 
If the upper bound established in Theorem 1 is less than or equal to the current global maximum sum found by the algorithm, the O(N) vertical scan for the corresponding aggregated column [c1, c2] can be safely bypassed.
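To make the corollary concrete, the column maxima M_c(j) can be precomputed along with their prefix sums, so that the Theorem 1 bound for any range [c1, c2] is available in O(1) (a Java sketch; `columnMax` and `columnMaxPrefix` are illustrative helper names):

```java
// M_c(j): maximum 1D subarray sum of column j (Kadane on a single column).
class ColumnBounds {
    static long columnMax(int[][] a, int j) {
        long curr = 0, best = Long.MIN_VALUE;
        for (int[] row : a) {
            curr += row[j];
            if (curr > best) best = curr;
            if (curr < 0) curr = 0;
        }
        return best;
    }

    // Prefix sums of M_c: the Theorem 1 bound for [c1, c2] is then
    // prefix[c2 + 1] - prefix[c1], computable in O(1) per range.
    static long[] columnMaxPrefix(int[][] a) {
        int m = a[0].length;
        long[] prefix = new long[m + 1];
        for (int j = 0; j < m; j++) prefix[j + 1] = prefix[j] + columnMax(a, j);
        return prefix;
    }
}
```

Building the prefix array costs O(NM) once; afterwards every candidate strip pays only a constant-time comparison against the global maximum.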

4. Implementation

Algorithm 1 PrunedKadane1D Vertical Worker

function PrunedKadane1D(slice, rowMaxPrefixSum, globalMaxSum)
    currSum ← 0, localMaxSum ← −∞, N ← slice.length
    for i = 0 to N − 1 do
        currSum ← currSum + slice[i]
        if currSum > localMaxSum then
            localMaxSum ← currSum
        end if
        if currSum < 0 then
            currSum ← 0, startRow ← i + 1
            if startRow < N then
                upperBound ← rowMaxPrefixSum[N] − rowMaxPrefixSum[startRow]
                if upperBound ≤ globalMaxSum then
                    break
                end if
            end if
        end if
    end for
    return localMaxSum
end function
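A direct Java rendering of the pseudocode above (a sketch; the initial value of localMaxSum is taken as −∞):

```java
// Algorithm 1: Kadane over a vertical slice with early termination. After the
// running sum resets at startRow, no remaining window can exceed the sum of
// the row maxima over rows [startRow, N-1]; if that bound cannot beat
// globalMaxSum, the scan stops early.
class PrunedWorker {
    static long prunedKadane1D(long[] slice, long[] rowMaxPrefixSum, long globalMaxSum) {
        long currSum = 0, localMaxSum = Long.MIN_VALUE;
        int n = slice.length;
        for (int i = 0; i < n; i++) {
            currSum += slice[i];
            if (currSum > localMaxSum) localMaxSum = currSum;
            if (currSum < 0) {
                currSum = 0;
                int startRow = i + 1;
                if (startRow < n) {
                    long upperBound = rowMaxPrefixSum[n] - rowMaxPrefixSum[startRow];
                    if (upperBound <= globalMaxSum) break;  // remaining rows cannot win
                }
            }
        }
        return localMaxSum;
    }
}
```

The break is only reached at a reset point, so any window already in progress has been fully accounted for before the bound is consulted.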
Algorithm 2 Adaptive 2D Search Manager

globalMaxSum ← −∞, N ← RowCount, M ← ColCount
Precompute SAT                                  ▹ O(NM); can be performed inside the row scan loop
rowMaxPrefixSum ← new array of size N + 1, rowMaxPrefixSum[0] ← 0
for i = 0 to N − 1 do                           ▹ Phase 1: Row Scan
    rowMax ← Kadane1D on row i
    rowMaxPrefixSum[i + 1] ← rowMaxPrefixSum[i] + rowMax
    if rowMax > globalMaxSum then
        globalMaxSum ← rowMax
    end if
end for
columnMaxPrefixSum ← new array of size M + 1, columnMaxPrefixSum[0] ← 0
for j = 0 to M − 1 do                           ▹ Phase 1: Column Scan
    colMax ← PrunedKadane1D(column j, rowMaxPrefixSum, globalMaxSum)
    columnMaxPrefixSum[j + 1] ← columnMaxPrefixSum[j] + colMax
    if colMax > globalMaxSum then
        globalMaxSum ← colMax
    end if
end for
for L = 0 to M − 1 do                           ▹ Phase 2: Multi-Column Search
    for R = L + 1 to M − 1 do
        upperBound ← columnMaxPrefixSum[R + 1] − columnMaxPrefixSum[L]
        if upperBound ≤ globalMaxSum then
            continue
        end if
        for i = 0 to N − 1 do
            currSlice[i] ← SAT[i + 1][R + 1] − SAT[i][R + 1] − SAT[i + 1][L] + SAT[i][L]
        end for
        res ← PrunedKadane1D(currSlice, rowMaxPrefixSum, globalMaxSum)
        if res > globalMaxSum then
            globalMaxSum ← res
        end if
    end for
end for
return globalMaxSum
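The full two-phase manager can be sketched in Java as follows (for brevity, plain Kadane scans stand in for the early-terminating worker of Algorithm 1; identifiers are illustrative):

```java
// Two-phase adaptive 2D search following Algorithm 2. Phase 1 scans rows and
// single columns to seed globalMax and the prefix arrays; Phase 2 enumerates
// multi-column strips, skipping any strip whose Theorem 1 bound cannot beat
// the current record.
class AdaptiveSearch {
    static long adaptiveMaxSum(int[][] a) {
        int n = a.length, m = a[0].length;
        long globalMax = Long.MIN_VALUE;

        // Summed area table: sat[i][j] = sum of a[0..i-1][0..j-1]
        long[][] sat = new long[n + 1][m + 1];
        for (int i = 0; i < n; i++)
            for (int j = 0; j < m; j++)
                sat[i + 1][j + 1] = a[i][j] + sat[i][j + 1] + sat[i + 1][j] - sat[i][j];

        // Phase 1: row scan (Kadane per row, accumulating rowMaxPrefix)
        long[] rowMaxPrefix = new long[n + 1];
        for (int i = 0; i < n; i++) {
            long curr = 0, best = Long.MIN_VALUE;
            for (int j = 0; j < m; j++) {
                curr += a[i][j];
                if (curr > best) best = curr;
                if (curr < 0) curr = 0;
            }
            rowMaxPrefix[i + 1] = rowMaxPrefix[i] + best;
            if (best > globalMax) globalMax = best;
        }

        // Phase 1: column scan (Kadane per column, accumulating colMaxPrefix)
        long[] colMaxPrefix = new long[m + 1];
        for (int j = 0; j < m; j++) {
            long curr = 0, best = Long.MIN_VALUE;
            for (int i = 0; i < n; i++) {
                curr += a[i][j];
                if (curr > best) best = curr;
                if (curr < 0) curr = 0;
            }
            colMaxPrefix[j + 1] = colMaxPrefix[j] + best;
            if (best > globalMax) globalMax = best;
        }

        // Phase 2: multi-column strips [L, R], pruned by the column-max bound
        long[] slice = new long[n];
        for (int L = 0; L < m; L++) {
            for (int R = L + 1; R < m; R++) {
                if (colMaxPrefix[R + 1] - colMaxPrefix[L] <= globalMax) continue;
                for (int i = 0; i < n; i++)
                    slice[i] = sat[i + 1][R + 1] - sat[i][R + 1] - sat[i + 1][L] + sat[i][L];
                long curr = 0, best = Long.MIN_VALUE;
                for (int i = 0; i < n; i++) {       // Kadane on the strip
                    curr += slice[i];
                    if (curr > best) best = curr;
                    if (curr < 0) curr = 0;
                }
                if (best > globalMax) globalMax = best;
            }
        }
        return globalMax;
    }
}
```

Because Theorem 1 guarantees the skipped strips cannot contain the optimum, the sketch returns the same answer as the exhaustive reduction on every input.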

5. Complexity Analysis

The computational complexity is data-dependent, ranging between a cubic worst case and a quadratic best case. By establishing a high global maximum and all prefix-sum bounds early in Phase 1, the solver achieves a best-case floor of O(M² + NM) when the majority of multi-column search regions are successfully pruned.
Table 1. Complexity Summary.

Case                  | Complexity     | Condition
Worst-Case            | O(M²N)         | Late discovery of global max or loose heuristics
Empirical Benchmark   | Pruned O(M²N)  | Uniformly distributed values (stress-test environment)
Best-Case             | O(M² + NM)     | All multi-column regions pruned after Phase 1

6. Experimental Results and Discussion

Empirical evaluation on matrices populated with values drawn from a uniform distribution on [−100, 100] demonstrates that the pruning heuristics are effective even in high-entropy environments, which can be considered a stress test. Java-based benchmarks on 1000 × 1000 matrices indicate a consistent 20% reduction in total execution time compared to a standard non-adaptive implementation. For sparse matrices (mostly negative values with a few positive "islands"), the expected reduction rises from 20% to over 80%. While the column-skipping rate is lower in perfectly uniform noise, the vertical worker's ability to terminate early via the row-maximum bounds still provides significant computational savings.

7. Performance Comparison

Compared to Takaoka's sub-cubic algorithm [3], this adaptive approach offers superior practical scalability and maintains O(NM) space efficiency. This is particularly critical for "tall" matrices where N ≫ M, as Takaoka's method requires O(M²) auxiliary memory.
Table 2. Algorithm Metric Comparison.

Metric      | Kadane 2D | Takaoka                    | Adaptive Pruned
Complexity  | O(M²N)    | O(M²N · log log M / log M) | O(M²N) to O(M² + NM)
Memory      | O(NM)     | O(M²)                      | O(NM)

8. Extensibility

8.1. Parallelization

The dual-loop structure over column indices L and R is naturally parallel. An atomic update mechanism for the record maximum allows parallel workers to immediately benefit from pruning triggers discovered by any thread.
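One possible realization, assuming Java's `AtomicLong` and a parallel stream over the left column index (a hypothetical sketch, not the benchmarked code):

```java
import java.util.concurrent.atomic.AtomicLong;
import java.util.stream.IntStream;

// Parallel Phase 2 sketch: each worker owns a left column L and shares the
// record maximum through an AtomicLong, so a tighter bound discovered by any
// thread immediately strengthens pruning for all others.
class ParallelPhase2Sketch {
    static long parallelPhase2(long[][] sat, long[] colMaxPrefix, int n, int m, long seed) {
        AtomicLong globalMax = new AtomicLong(seed);   // seeded with the Phase 1 result
        IntStream.range(0, m).parallel().forEach(L -> {
            long[] slice = new long[n];                // per-thread scratch buffer
            for (int R = L + 1; R < m; R++) {
                if (colMaxPrefix[R + 1] - colMaxPrefix[L] <= globalMax.get()) continue;
                for (int i = 0; i < n; i++)
                    slice[i] = sat[i + 1][R + 1] - sat[i][R + 1] - sat[i + 1][L] + sat[i][L];
                long curr = 0, best = Long.MIN_VALUE;
                for (int i = 0; i < n; i++) {          // Kadane on the strip
                    curr += slice[i];
                    if (curr > best) best = curr;
                    if (curr < 0) curr = 0;
                }
                globalMax.accumulateAndGet(best, Math::max);  // lock-free record update
            }
        });
        return globalMax.get();
    }
}
```

Pruning remains safe under any interleaving: a stale (lower) record can only cause a strip to be scanned unnecessarily, never skipped incorrectly, so the result is deterministic even though the skip pattern is not.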

8.2. Multi-Dimensional Case

The pruning logic can be generalized to d-dimensional arrays (d > 2). By nesting the (d − 1)-dimensional pruning bounds, an algorithm could navigate d-dimensional space with a best-case polynomial floor of O(N^(2d−2)).

8.3. Greedy Search Order

A “Greedy Search” optimization could involve using a Max-Heap to prioritize column ranges based on their pre-calculated bounds. Evaluating high-potential regions first maximizes the probability of early pruning in the remaining search space.
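A sketch of this ordering using `java.util.PriorityQueue` (hypothetical; the paper does not prescribe a specific data structure, and `Range`/`orderedRanges` are illustrative names):

```java
import java.util.PriorityQueue;

// Greedy search order: enumerate candidate column ranges [L, R] sorted by
// their precomputed Theorem 1 bound (descending), so high-potential strips
// are evaluated first and the record maximum rises as early as possible.
class GreedyOrder {
    record Range(int left, int right, long bound) {}

    static PriorityQueue<Range> orderedRanges(long[] colMaxPrefix, int m) {
        PriorityQueue<Range> heap =
            new PriorityQueue<>((x, y) -> Long.compare(y.bound(), x.bound()));  // max-heap
        for (int L = 0; L < m; L++)
            for (int R = L + 1; R < m; R++)
                heap.add(new Range(L, R, colMaxPrefix[R + 1] - colMaxPrefix[L]));
        return heap;
    }
}
```

Because the heap is ordered by bound, the search can terminate outright the first time the head's bound fails to exceed the current global maximum: every remaining range carries an equal or smaller bound, so the per-range `continue` becomes a single global cutoff.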

9. Conclusion

This adaptive approach bridges the gap between theoretical sub-cubic models and standard cubic implementations. By utilizing dual-level prefix sum heuristics and Summed Area Tables [5], the algorithm achieves a best-case floor of O(M² + NM) and significantly improves performance across a variety of input distributions.

Acknowledgments

The author acknowledges the use of artificial intelligence tools for assistance in technical writing and LaTeX document preparation. The algorithms and their verification remain the sole responsibility of the author.

References

  1. Bentley, J. Programming Pearls: Algorithm Design Techniques. Commun. ACM 1984, 27(9), 865–873.
  2. Kadane, J. B. Two Kadane Algorithms for the Maximum Sum Subarray Problem. Algorithms 2023, 16(11), 519.
  3. Takaoka, T. Efficient algorithms for the maximum subarray problem by distance matrix multiplication. ENTCS 2002, 66(1), 191–200.
  4. Land, A. H.; Doig, A. G. An automatic method of solving discrete programming problems. Econometrica 1960, 28(3), 497–520.
  5. Crow, F. C. Summed-area tables for texture mapping. ACM SIGGRAPH Comput. Graph. 1984, 18(3), 207–212.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.