Submitted:
02 March 2025
Posted:
03 March 2025
Read the latest preprint version here
Abstract
Keywords:
1. Introduction
- Physics-Informed Integration and Cascaded Swin-UNet Architecture: This work incorporates thermal data as a physics-informed channel in the segmentation process to leverage the distinct thermal signatures of DCG, enhancing segmentation accuracy. This is a novel application in the context of Swin-UNet architectures. Furthermore, the method employs a cascaded architecture involving two Swin-UNet models. The first model generates initial segmentation masks, which are refined by a second Swin-UNet that processes inputs augmented with thermal imaging data. This cascaded approach boosts the precision and reliability of the segmentation outputs.
- SVD-Based Weight Pruning: Furthermore, it optimize the Swin-UNet using Singular Value Decomposition (SVD) to prune the model weights. This technique reduces the model’s complexity and computational demands, facilitating deployment on platforms with limited processing capabilities while maintaining high segmentation accuracy.
- Hybrid Dice-Focal Loss Function: In addition the use of a hybrid loss function combining Dice loss and focal loss addresses class imbalance effectively. This dual approach is particularly adept at improving the model’s ability to delineate under-represented classes and enhances the overlap accuracy between predicted and actual segmentation masks.
2. Location of Study Area

2.1. Dataset and Data Explanation
2.2. Data Characteristics and Pre-Processing
2.3. Dataset Details
- LE7 Bands: B1 (Blue), B2 (Green), B3 (Red), B4 (Near Infrared), B5 (Shortwave Infrared 1), B6_VCID_1 (Low-gain Thermal Infrared), B6_VCID_2 (High-gain Thermal Infrared), B7 (Shortwave Infrared 2) and B8 (Panchromatic).
- Quality Bitmask (BQA) to identify the quality and usability of each pixel.
- Vegetation Index (NDVI), Snow Index (NDSI) and Water Index (NDWI) for specialized environmental analyses.
- SRTM 90 Elevation and Slope to provide topographical context which is crucial for understanding glacier dynamics.
3. Swin-UNet Model Architecture
3.1. Integrating Swin Transformer as Backbone

3.2. Hierarchical Feature Representation
3.3. Shifted Window Self-Attention
3.4. SVD-Based Optimization
4. Proposed Segmentation Scheme
4.1. Data Pre-Processing
4.1.1. Initial NaN Value Replacement
4.1.2. Channel Selection Using Point-Biserial Correlation
4.2. Swin-UNet Model Training
4.3. SVD-Based Model Weight Pruning
4.3.1. Proof of Optimality of Weights Extracted by SVD
- denotes the Frobenius norm.
- is the approximation of A using the top k singular values.
- are the singular values of A.
- r is the rank of A.
4.4. Cascaded Swin-UNet Model
| Algorithm 1: Proposed Model for Debris Ice Segmentation |
|
5. Experiment
5.1. Comparison Metrics
- (1)
- Accuracy: This metric measures the proportion of true results (both true positives and true negatives) among the total number of cases examined. It is defined as:
- (2)
- F1 Score: The harmonic mean of Precision and Recall, the F1 Score provides a balance between them, particularly useful when dealing with an uneven class distribution. It is defined as:
- (3)
- Precision: Also known as positive predictive value, Precision measures the accuracy of positive predictions made by the model. It is computed as:
- (4)
- Recall: Also known as sensitivity, Recall indicates the ability of the model to identify all relevant instances. It is calculated as:
- (5)
- Intersection over Union (IoU): Also known as the Jaccard index, IoU measures the overlap between the predicted segmentation and the ground truth. It is defined as:
- (6)
- Area Under the Curve (AUC): Refers to the area under the Receiver Operating Characteristic (ROC) curve. It quantifies the overall ability of the model to discriminate between the classes across all thresholds. A higher AUC value indicates better model performance.
5.2. Experimental Setup
5.3. Pre-Processing Results
5.4. Results of Proposed Model


6. Results and Discussion

7. Conclusions
Author Contributions
Funding
Conflicts of Interest
Appendix A
| Band | Wavelength () | Resolution (m) | Primary Use |
|---|---|---|---|
| 1 | 0.45 - 0.52 | 30 | Visible (Blue) |
| 2 | 0.52 - 0.60 | 30 | Visible (Green) |
| 3 | 0.63 - 0.69 | 30 | Visible (Red) |
| 4 | 0.76 - 0.90 | 30 | Near-Infrared |
| 5 | 1.55 - 1.75 | 30 | Shortwave Infrared 1 |
| 7 | 2.08 - 2.35 | 30 | Shortwave Infrared 2 |
| 6 | 10.40 - 12.50 | 60 | Thermal Infrared |
| 8 | 0.52 - 0.90 | 15 | Panchromatic |
References
- Deutsche Welle. Swiss Glaciers Record Unprecedented Melt Rates. https://www.dw.com/en/swiss-glaciers-melt/a-57483958, 2023. Accessed: 2024-04-24.
- Sakai, A.; Fujita, K. Contrasting glacier responses to recent climate change in high-mountain Asia. Scientific Reports 2017, 7, 13717. [Google Scholar] [CrossRef] [PubMed]
- Quincey, D.J.; Glasser, N.F. Morphological and ice-dynamical changes on the tasman glacier, New Zealand, 1990–2007. Glob. Planet. Change 2009, 68, 185–197. [Google Scholar] [CrossRef]
- Jiang, S.; Nie, Y.; Liu, Q.; Wang, J.; Liu, L.; Hassan, J.; et al. Glacier change, supraglacial debris expansion and glacial lake evolution in the gyirong river basin, central Himalayas, between 1988 and 2015. Remote Sens. 2018, 10, 986. [Google Scholar] [CrossRef]
- Mölg, N.; Bolch, T.; Walter, A.; Vieli, A. Unravelling the evolution of zmuttgletscher and its debris cover since the end of the little ice age. Cryosphere 2019, 13, 1889–1909. [Google Scholar] [CrossRef]
- Tielidze, L.G.; Bolch, T.; Wheate, R.D.; Kutuzov, S.S.; Lavrentiev, I.I.; Zemp, M. Supra-glacial debris cover changes in the Greater Caucasus from 1986 to 2014. Cryosphere 2020, 14, 585–598. [Google Scholar] [CrossRef]
- Herreid, S.; Pellicciotti, F. The state of rock debris covering Earth’s glaciers. Nat. Geosci 2020, 13, 621–627. [Google Scholar] [CrossRef]
- Ragettli, S.; Immerzeel, W.W.; Pellicciotti, F. Contrasting climate change impact on river flows from high-altitude catchments in the Himalayan and Andes Mountains. Proc. Natl. Acad. Sci. U. S. A. 2016, 113, 9222–9227. [Google Scholar] [CrossRef]
- Rounce, D.R.; King, O.; Mccarthy, M.; Shean, D.E.; Salerno, F. Quantifying debris thickness of debris-covered glaciers in the everest region of Nepal through inversion of a subdebris melt model. J. Geophys. Res. Earth Surf. 2018, 123, 1094–1115. [Google Scholar] [CrossRef]
- Racoviteanu, A.; Nicholson, L.; Glasser, N. Surface composition of debris-covered glaciers across the Himalaya using linear spectral unmixing of Landsat 8 OLI imagery. Cryosphere 2021, 15, 4557–4588. [Google Scholar] [CrossRef]
- Kulkarni, S.C.; Rege, P.P. Pixel level fusion techniques for SAR and optical images: A review. Information Fusion 2020, 59, 13–29. [Google Scholar]
- Begam, S.; Sen, D. Mapping of moraine dammed glacial lakes and assessment of their areal changes in the central and eastern Himalayas using satellite data. Journal of Mountain Science 2019, 16, 77–94. [Google Scholar]
- Ardelean, F.; Ginzler, C.; Gartner-Roer, I.; Götz, J. Glacier classification and mapping by object-based image analysis (OBIA), using optical data from different sensors: A case study from the Suntar-Khayata Range, Russian Siberia. Remote Sensing 2011, 3, 782–800. [Google Scholar]
- Bajracharya, S.R.; Shrestha, B.R. Automated mapping of debris-covered glaciers using object-based image analysis of Landsat data: A case study of Yala Glacier in Langtang Valley, Nepal. Geocarto International 2015, 30, 636–651. [Google Scholar]
- Karimi, F.; Bahremand, A.; Roodposhti, M.S.; et al. . Detection of supraglacial lakes on debris-covered glaciers in the Himalaya/Karakoram using Landsat imagery. Journal of Applied Remote Sensing 2015, 9, 096098. [Google Scholar]
- Nie, Y.; Sheng, Y.; Wang, Y.; et al. . A semi-automated object-based approach for glacier mapping. Journal of Geographical Sciences 2010, 20, 415–428. [Google Scholar]
- Rastner, P.; Bolch, T.; Notarnicola, C.; Paul, F. A comparison of pixel- and object-based glacier classification with optical satellite images. IEEE J. Sel. Top. Appl. Earth Observations Remote Sens. 2014, 7, 853–862. [Google Scholar] [CrossRef]
- Smith, J.D.; Johnson, E.F. Glacier segmentation from satellite imagery. Remote Sensing Letters 2007, 3, 307–316. [Google Scholar]
- Jones, M.A.; Smith, J.D. Comparative analysis of clustering algorithms for glacier segmentation. International Journal of Remote Sensing 2010, 31, 5417–5434. [Google Scholar]
- Johnson, E.F.; Wang, L.; Liu, W. Automated glacier boundary extraction from aerial photographs using edge detection algorithms. Photogrammetric Engineering & Remote Sensing 2015, 81, 159–169. [Google Scholar]
- Wang, L.; Liu, W. Deep glacier segmentation using convolutional neural networks. IEEE Transactions on Geoscience and Remote Sensing 2018, 56, 218–233. [Google Scholar]
- Nagai, H.; Abe, T.; Ohki, M. SAR-based flood monitoring for flatland with frequently fluctuating water surfaces: Proposal for the normalized backscatter amplitude difference index (NoBADI). Remote Sensing 2021, 13, 4136. [Google Scholar]
- Muhammad, S.; Thapa, A. Daily Terra–Aqua MODIS cloud-free snow and Randolph Glacier Inventory 6.0 combined product (M* D10A1GL06) for high-mountain Asia between 2002 and 2019. Earth System Science Data 2021, 13, 767–776. [Google Scholar]
- Bhardwaj, A.; et al. Synthesizing optical and SAR imagery from land cover maps and auxiliary raster data. IEEE Transactions on Geoscience and Remote Sensing 2021, 60, 1–12. [Google Scholar]
- Singh, S.; et al. A sensitivity analysis on the spectral signatures of low-backscattering sea areas in Sentinel-1 SAR images. Remote Sensing 2021, 13, 1183. [Google Scholar]
- Corcione, V.; Buono, A.; Nunziata, F.; Migliaccio, M. A sensitivity analysis on the spectral signatures of low-backscattering sea areas in Sentinel-1 SAR images. Remote Sensing 2021, 13, 1183. [Google Scholar]
- Stonevicius, E.; Uselis, G.; Grendaite, D. Ice Detection with Sentinel-1 SAR Backscatter Threshold in Long Sections of Temperate Climate Rivers. Remote Sensing 2022, 14, 1627. [Google Scholar]
- Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. Journal of Computational Physics 2019, 378, 686–707. [Google Scholar]
- Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; Perdikaris, P.; Wang, S.; Yang, L. Physics-informed machine learning. Nature Reviews Physics 2021, 3, 422–440. [Google Scholar]
- Karpatne, A.; Atluri, G.; Faghmous, J.H.; Steinbach, M.; Banerjee, A.; Ganguly, A.; Shekhar, S.; Samatova, N.F.; Kumar, V. Theory-guided data science: A new paradigm for scientific discovery from data. IEEE Transactions on Knowledge and Data Engineering 2017, 29, 2318–2331. [Google Scholar]
- Karpatne, A.; Watkins, W.; Read, J.; Kumar, V. Physics-guided neural networks (PGNN): An application in lake temperature modeling. arXiv 2018, arXiv:1710.11431. [Google Scholar]
- Aslam, L.; Zou, R.; Awan, E.; Butt, S.A. Integrating Physics-Informed Vectors for Improved Wind Speed Forecasting with Neural Networks. In Proceedings of the 2024 14th Asian Control Conference (ASCC). IEEE, 2024, pp. 1902–1907.
- Morales, R.; Vidal, J. A physics-informed neural network for wind field retrieval from SAR images. Remote Sensing 2020, 12, 2507. [Google Scholar]
- Talukder, B.; Matthew, R.; Bunch, M.J.; Hipel, K.W.; Orbinski, J.; et al. Melting of Himalayan glaciers and planetary health. Current Opinion in Environmental Sustainability 2021, 50, 98–108. [Google Scholar] [CrossRef]
- Bajracharya, S.R.; Shrestha, B.R. The Status of Glaciers in the Hindu Kush-Himalayan Region 2011. International Centre for Integrated Mountain Development (ICIMOD). [CrossRef]
- Aryal, B.; Miles, K.E.; Vargas Zesati, S.A.; Fuentes, O. Boundary Aware U-Net for Glacier Segmentation. Journal of Remote Sensing or Conference Proceedings Name 2023, Volume Number, Page Range. [CrossRef]
- Vaswani, A.; et al. Attention Is All You Need. In Proceedings of the Advances in Neural Information Processing Systems 30, 2017.
- Devlin, J.; et al. BERT: Bidirectional Encoder Representations from Transformers. In Proceedings of the Proceedings of NAACL-HLT 2019, 2018.
- Dosovitskiy, A.; et al. An Image Is Worth 16x16 Words: Transformers for Image Recognition at Scale. arXiv 2020, arXiv:2010.11929. [Google Scholar]
- Liu, Z.; et al. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. arXiv 2021, arXiv:2103.14030. [Google Scholar]



| Data Split | Number of Images |
|---|---|
| Training | 564,823 |
| Testing | 105,942 |
| Validation | 35,314 |
| Model | Acc. | F1 | Pre. | Rec. | IoU | AUC | IT (s) |
|---|---|---|---|---|---|---|---|
| Swin-UNet | 94.12% | 93.65% | 93.09% | 94.21% | 86.73% | 93.93% | 0.1865 |
| SPSW-UNet | 92.34% | 93.35% | 92.92% | 93.80% | 84.89% | 91.20% | 0.1247 |
| UNet-ResNet34 | 89.9% | 92.05% | 92.2% | 91.91% | 81.24% | 87.6% | 0.3059 |
| PICSw-UNet | 95.67% | 95.37% | 94.23% | 96.53% | 91.65% | 96.4% | 0.2795 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).