Submitted:
07 January 2026
Posted:
08 January 2026
You are already at the latest version
Abstract
Keywords:
0. Introduction
1. Related Work
- Geometric Distortion from Global Warps: The dominant homography model [11,12,30] is frequently used because its 8 degrees of freedom allow it to model full perspective effects. However, when applied to real-world scenes with parallax and depth variation, the homography is forced to reconcile conflicting motions, which often results in non-uniform distortions like spindle-shaped warps, unnatural stretching, or spherical bulging. Advanced hybrid warps, such as the "as-projective-as-possible" (APAP) [9], and elastic warping improve flexibility but risk overfitting or over-flexibility, which can produce local stretching artifacts.
- Reliance on Complex Preprocessing: Many seamline optimization methods( [15,26,27]) formulate a cost function over the overlapping region and search for a seam that minimizes this cost, ideally passing through visually consistent areas. However, these approaches often depend on complex preprocessing steps, such as semantic segmentation or depth estimation, to guide the cost map, increasing computational cost and sensitivity to errors and inaccuracies.
-
Structural Assumptions and Computational Overhead:
- –
- Plane or Multi-Homography Models [12,18] attempt to reduce single homography distortion by fitting local projective models per plane or blending dual homographies. These methods, however, rely on strong assumptions about scene structure (e.g., two planes or projective-consistent regions) and can become brittle in complex or irregular depth geometries.
- –
- Structure-Preserving Warps successfully reduce distortions and preserve salient structures by incorporating complex optimization frameworks and constraints (e.g., collinearity constraints). These methods, however, are often computationally demanding and remain sensitive to poor feature distribution.
- –
- Learning-Based Approaches use deep learning for tasks like transformer-based warping and optical flow with inpainting [2,13,19,25]. While they generalize well and show strong performance, they demand heavy training requirements, reliance on large datasets, and risk hallucinating content or propagating errors, reducing interpretability compared to geometric models.
2. Seamless and Structurally Consistent Image Stitching
2.1. The Three Stage Framework
- Local Image Warping: We deliberately move beyond global homographies and spline-based deformations. The source image is coarsely aligned using a global affine transformation (estimated via RANSAC). This is followed by a refinement stage where the overlap is subdivided into local grids, and distinct affine models are fitted to local feature correspondences. This process generates a Smooth Free-Form Deformation (FFD) field, which is blended and adaptively smoothed using a seamguard strategy that utilizes a ramp mask and match density weighting to prevent discontinuities at boundaries. This preserves overall structural fidelity while reducing geometric distortion.
- Adequate Parallax Minimized Zone Identification: In contrast to seam selection methods relying on semantic segmentation, we introduce a model free strategy. This method analyzes disparity variations and geometric consistency among matched features to isolate a stable stitching zone that inherently minimizes parallax artifacts.
- Image Partitioning and Reconstruction: Within the identified stable zone, an ordered chain of refined keypoints defines the optimal stitching line. Both images are then partitioned into corresponding vertical slices anchored by these keypoints. This anchor-based segmentation enforces structural consistency between the two images by construction: every vertical slice in one image has a direct, aligned counterpart in the other. By tying the seamline to these geometric anchors, our method eliminates the duplication, ghosting, and misalignment issues often associated with blending.
2.2. Locally Adaptive Image Warping
Step 1: Global Affine Estimation via RANSAC
Step 2: Overlap Region and Local Grid Cell Generation
- A binary mask , derived from the intersection of the grid cell with .
- A centroid , calculated using image moments.
- A bounding box (), utilized for subsequent spatial weighting and diagnostics.
Step 3: Local Affine Refinement with Adaptive Transformation Selection & Confidence Scoring
Spatial Confidence Metric
Adaptive Transformation Selection via Composite Diagnostic Score
Step 4: Free-Form Deformation Field via Confidence-Weighted Local Transformation Blending
Confidence-Weighted Spatial Blending
Step 5: Seam Guarding via Dual-Channel Gating
- Geometric Ramp (): A smootherstep function applied to the signed distance field of the overlap region, with bandwidth proportional to image diagonal ().
- Match Density Map (): A Gaussian-blurred heatmap of inlier keypoint locations, normalized to , with kernel standard deviation ().
Dual-Gated Seam Suppression
Step 6: Final Warping and Output
2.3. Optimal Stitching Line
2.3.1. Determination of an Adequate Parallax-Free Zone
- First, , reflecting the relative frontal position of the camera of the source image.
- , for one camera physically positioned above the other in most cases.
| Algorithm 1: Threshold-Based Disparity Clustering. |
|
- Standard Deviation (): This measures the internal coherence of the cluster. A smaller standard deviation indicates that the individual class mean disparities are tightly grouped around the cluster mean, suggesting a more consistent depth plane.
- Cardinality (): This is the total number of keypoints contained within all classes of the cluster. High cardinality indicates that the cluster is supported by a large amount of data, which directly increases the reliability and robustness of its mean disparity calculation.
- Disparity Deviation (): This is defined as the absolute difference between the mean disparity of the cluster () and the global mean disparity of all classes ().
- : standard deviation of the disparities within the cluster,
- C: cardinality (number of keypoints in the cluster),
- : weighting factor controlling the influence of the disparity deviation,
- : absolute difference between the mean disparity of the cluster and the global mean disparity ,
- : a small constant added to avoid division by zero.
2.3.2. Keypoint Chain Refinement
2.4. Partitioning and Reconstruction
2.4.1. Image Partitioning
-
Valid Segments (Consistent Direction):
- Invalid Segments (Inconsistent Direction): If the direction is inconsistent between the two images (e.g., and or vice versa), the segments are rejected as they are not complementary (see Figure 6). This prevents structural inconsistencies from being introduced during the final reconstruction
2.4.2. Reconstruction
3. Experimentation and Results
3.1. Quantitative Evaluation
3.2. Qualitative evaluation








3.3. Ablation Studies






3.4. Limitations

4. Conclusion
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Fischler, M.A.; Bolles, R.C. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM 1981, 24, 381–395. [Google Scholar]
- Cai, W.; Yang, W. Object-level geometric structure preserving for natural image stitching. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, PA, USA, 2025; Vol. 39, pp. 1926–1934. [Google Scholar]
- Cai, W.; Yang, W. Object-level geometric structure preserving for natural image stitching. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, PA,USA, 2025; AAAI; Vol. 39, pp. 1926–1934. [Google Scholar]
- Du, P.; Ning, J.; Cui, J.; Huang, S.; Wang, X.; Wang, J. Geometric structure preserving warp for natural image stitching. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 2022; pp. 3688–3696. [Google Scholar]
- Herrmann, C.; Wang, C.; Bowen, R.S.; Keyder, E.; Zabih, R. Object-centered image stitching. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 2018; pp. 821–835. [Google Scholar]
- Bay, H.; Tuytelaars, T.; Van Gool, L. SURF: Speeded up robust features. In Computer Vision–ECCV 2006; Springer: Graz, Austria, 2006; pp. 404–417. [Google Scholar]
- Lowe, D.G. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 2004, 60, 91–110. [Google Scholar] [CrossRef]
- Juan, L.; Gwun, O. A comparison of SIFT, PCA-SIFT and SURF. International Journal of Image Processing 2009, 3, 143–152. [Google Scholar]
- Zaragoza, J.; Chin, T.-J.; Brown, M.S.; Suter, D. As-projective-as-possible image stitching with moving DLT. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Portland, OR, USA, 2013; pp. 2339–2346. [Google Scholar]
- Gao, J.; Kim, S.J.; Brown, M.S. Constructing image panoramas using dual-homography warping. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Colorado Springs, CO, USA, 2011; pp. 49–56. [Google Scholar]
- Yadav, S.; Choudhary, P.; Goel, S.; Parameswaran, S.; Bajpai, P.; Kim, J. Selfie Stitch: Dual homography based image stitching for wide-angle selfie. In Proceedings of the IEEE International Conference on Multimedia & Expo Workshops (ICMEW), San Diego, CA, USA, 2018; pp. 1–4. [Google Scholar]
- Jia, Q.; Li, Z.; Fan, X.; Zhao, H.; Teng, S.; Ye, X.; Latecki, L.J. Leveraging line-point consistence to preserve structures for wide parallax image stitching. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021; pp. 12186–12195. [Google Scholar]
- Nie, L.; Lin, C.; Liao, K.; Liu, S.; Zhao, Y. Parallax-tolerant unsupervised deep image stitching. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2023; pp. 7399–7408. [Google Scholar]
- Zhang, F.; Liu, F. Parallax-tolerant image stitching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014; pp. 3262–3269. [Google Scholar]
- Huang, H.; Chen, F.; Cheng, H.; Li, L.; Wang, M. Semantic segmentation guided feature point classification and seam fusion for image stitching. Journal of Algorithms & Computational Technology 2021, 15, 17483026211065399. [Google Scholar] [CrossRef]
- Li, X.; He, L.; He, X. Combined regional homography-affine warp for image stitching. Proceedings of the International Conference on Graphics and Image Processing (ICGIP) 2022, Vol. 12705, 258–264. [Google Scholar]
- Liao, T.; Li, N. Single-perspective warps in natural image stitching. IEEE Transactions on Image Processing 2019, 29, 724–735. [Google Scholar] [CrossRef] [PubMed]
- Wen, S.; Wang, X.; Zhang, W.; Wang, G.; Huang, M.; Yu, B. Structure preservation and seam optimization for parallax-tolerant image stitching. IEEE Access 2022, 10, 78713–78725. [Google Scholar] [CrossRef]
- Nie, L.; Lin, C.; Liao, K.; Liu, S.; Zhao, Y. Unsupervised deep image stitching: Reconstructing stitched features to images. IEEE Transactions on Image Processing 2021, 30, 6184–6197. [Google Scholar] [CrossRef] [PubMed]
- Sikka, P.; Asati, A.R.; Shekhar, C. Real time FPGA implementation of a high speed and area optimized Harris corner detection algorithm. Microprocessors and Microsystems 2021, 80, 103514. [Google Scholar] [CrossRef]
- Ji, X.; Yang, H.; Han, C. Research on image stitching method based on improved ORB and stitching line calculation. Journal of Electronic Imaging 2022, 31, 051404. [Google Scholar] [CrossRef]
- Liu, W.; Zhang, K.; Zhang, Y.; He, J.; Sun, B. Utilization of merge-sorting method to improve stitching efficiency in multi-scene image stitching. Applied Sciences 2023, 13, 2791. [Google Scholar] [CrossRef]
- Potje, G.; Cadar, F.; Araujo, A.; Martins, R.; Nascimento, E.R. XFeat: Accelerated features for lightweight image matching. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024; pp. 2682–2691. [Google Scholar]
- Nghonda, E. Enable 360-Degree Immersion for Outside-In Camera Systems for Live Events. Ph.D. Thesis, University of Florida, Gainesville, FL, USA, 2023. [Google Scholar]
- Chen, K.; Garg, A.; Wang, Y.-S. Seamless-through-breaking: Rethinking image stitching for optimal alignment. In Proceedings of the Asian Conference on Computer Vision (ACCV), 2024; pp. 4352–4367. [Google Scholar]
- Qin, Y.; Li, J.; Jiang, P.; Jiang, F. Image stitching by feature positioning and seam elimination. Multimedia Tools and Applications 2021, 80, 20869–20881. [Google Scholar] [CrossRef]
- Chai, X.; Chen, J.; Mao, Z.; Zhu, Q. An upscaling–downscaling optimal seamline detection algorithm for very large remote sensing image mosaicking. Remote Sensing 2022, 15, 89. [Google Scholar] [CrossRef]
- Garg, A.; Dung, L.-R. Stitching strip determination for optimal seamline search. In Proceedings of the International Conference on Imaging, Signal Processing and Communications (ICISPC), 2020; pp. 29–33. [Google Scholar]
- Li, J.; Wang, Z.; Lai, S.; Zhai, Y.; Zhang, M. Parallax-tolerant image stitching based on robust elastic warping. IEEE Transactions on Multimedia 2017, 20, 1672–1687. [Google Scholar] [CrossRef]
- Zheng, J.; Wang, Y.; Wang, H.; Li, B.; Hu, H.-M. A novel projective-consistent plane based image stitching method. IEEE Transactions on Multimedia 2019, 21, 2561–2575. [Google Scholar] [CrossRef]
- Li, J.; Lai, S.; Liu, Y.; Wang, Z.; Zhang, M. Local facet approximation for image stitching. Journal of Electronic Imaging 2018, 27, 013011. [Google Scholar] [CrossRef]
- Zhang, J.; Wang, C.; Liu, S.; Jia, L.; Ye, N.; Wang, J.; Zhou, J.; Sun, J. Content-aware unsupervised deep homography estimation. In Computer Vision–ECCV 2020; Springer: Glasgow, UK, 2020; pp. 653–669. [Google Scholar]
- Nie, L.; Lin, C.; Liao, K.; Zhao, Y. Learning edge-preserved image stitching from large-baseline deep homography. arXiv 2020, arXiv:2012.06194. [Google Scholar]
- Nie, L.; Lin, C.; Liao, K.; Liu, M.; Zhao, Y. A view-free image stitching network based on global homography. Journal of Visual Communication and Image Representation 2020, 73, 102950. [Google Scholar] [CrossRef]

| [b]0.28 | |
| APAP | [b]0.29 |
| ELA | [b]0.30 |
| UDIS | |
| [b]0.30 | |
| UDIS++ | [b]0.30 |
| SEAMLESS | [b]0.30 |
| OURS |






| Method | PSNR | SSIM | ||||||
|---|---|---|---|---|---|---|---|---|
| Easy | Mod | Hard | Avg | Easy | Mod | Hard | Avg | |
| 15.87 | 12.76 | 10.68 | 12.86 | 0.530 | 0.286 | 0.146 | 0.303 | |
| SIFT+RANSAC [1] | 28.75 | 24.08 | 18.55 | 23.27 | 0.916 | 0.833 | 0.636 | 0.779 |
| APAP [9] | 27.96 | 24.39 | 20.21 | 23.79 | 0.901 | 0.837 | 0.682 | 0.794 |
| ELA [29] | 29.36 | 25.10 | 19.19 | 24.01 | 0.917 | 0.855 | 0.691 | 0.808 |
| SPW [17] | 26.98 | 22.67 | 16.77 | 21.60 | 0.880 | 0.758 | 0.490 | 0.687 |
| LPC [12] | 26.94 | 22.63 | 19.31 | 22.59 | 0.878 | 0.764 | 0.610 | 0.736 |
| UDIS [19] | 25.16 | 20.96 | 18.36 | 21.17 | 0.834 | 0.669 | 0.495 | 0.648 |
| UDIS++ [13] | 30.19 | 25.84 | 21.57 | 25.43 | 0.933 | 0.875 | 0.739 | 0.838 |
| OURS | 24.92 | 26.00 | 27.89 | 26.27 | 0.823 | 0.839 | 0.882 | 0.848 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).