Submitted:
23 July 2025
Posted:
06 August 2025
Read the latest preprint version here
Abstract
Keywords:
1. Introduction
2. Related Work
2.1. Conventional and Convolution-based Techniques
2.2. Attention-based Techniques
2.3. State-Space Based Techniques
3. Semantic-Injected State Modeling
3.1. Semantic Decomposition and Prototype Anchoring
3.2. 2D State Modeling Module
3.3. Semantic Injection State Modeling Block
3.4. Geographically-Chunked Parallel Processing
4. Methodology
4.1. Model Architecture
4.2. Semantic-Injected State-Space Block

4.3. Omni-Shift Mechanism

4.4. Channel Attention

4.5. Loss Function
5. Experiments
5.1. Datasets for UAV-Based Ecological Monitoring
5.2. Experiment Settings
5.3. Quantitative Results
| Datasets | Method | PSNR ↑ | SSIM ↑ | NIQE ↓ | LPIPS ↓ | RMSE ↓ | SAM ↓ |
|---|---|---|---|---|---|---|---|
| RSUAV-QH | SRCNN[27] | 21.1437 | 0.7093 | 9.8234 | 0.3373 | 7.7725 | 0.1506 |
| VDSR[28] | 21.3548 | 0.7346 | 10.3282 | 0.3338 | 7.5716 | 0.1602 | |
| SwinIR[17] | 22.6073 | 0.7891 | 8.8427 | 0.2909 | 7.6129 | 0.1528 | |
| HAT[31] | 23.8924 | 0.8617 | 8.3129 | 0.2189 | 6.4297 | 0.1583 | |
| MambaIR[39] | 24.8382 | 0.8365 | 9.3182 | 0.2469 | 6.6814 | 0.1478 | |
| MambaIRv2[40] | 23.6127 | 0.8173 | 9.5198 | 0.2504 | 7.1163 | 0.1476 | |
| SIMSR (Ours) | 24.9281 | 0.8665 | 7.9073 | 0.2135 | 6.2426 | 0.1463 | |
| UCM | SRCNN[27] | 21.0571 | 0.7018 | 9.9216 | 0.3467 | 7.8808 | 0.1542 |
| VDSR[28] | 21.2666 | 0.7280 | 10.4365 | 0.3422 | 7.6733 | 0.1644 | |
| SwinIR[17] | 22.5205 | 0.7812 | 8.9439 | 0.2983 | 7.7241 | 0.1565 | |
| HAT[31] | 24.7515 | 0.8540 | 9.6211 | 0.2254 | 6.5343 | 0.1625 | |
| MambaIR[39] | 23.8029 | 0.8291 | 9.4208 | 0.2531 | 6.7818 | 0.1514 | |
| MambaIRv2[40] | 23.5250 | 0.8104 | 8.4174 | 0.2568 | 7.2239 | 0.1514 | |
| SIMSR (Ours) | 24.8312 | 0.8598 | 8.1124 | 0.2198 | 6.3469 | 0.1501 | |
| WHU-RS19 | SRCNN[27] | 23.2700 | 0.7069 | 8.1523 | 0.3589 | 8.0819 | 0.1516 |
| VDSR[28] | 23.7775 | 0.7122 | 7.1796 | 0.3682 | 8.1232 | 0.1427 | |
| SwinIR[17] | 23.4011 | 0.5908 | 7.8451 | 0.4720 | 7.8556 | 0.1473 | |
| HAT[31] | 23.6580 | 0.5993 | 8.3204 | 0.4633 | 8.6052 | 0.1422 | |
| MambaIR[39] | 23.7002 | 0.7084 | 6.6657 | 0.4052 | 8.0293 | 0.1506 | |
| MambaIRv2[40] | 23.9886 | 0.7208 | 7.5345 | 0.3755 | 7.9644 | 0.1501 | |
| SIMSR (Ours) | 24.2634 | 0.7296 | 6.5012 | 0.3544 | 7.8764 | 0.1406 |
5.4. Qualitative Results and Feature Analysis
| Classes | Method | PSNR ↑ | SSIM ↑ | NIQE ↓ | LPIPS ↓ | RMSE ↓ | SAM ↓ |
|---|---|---|---|---|---|---|---|
| aGrass | SRCNN[27] | 31.6593 | 0.7717 | 7.3087 | 0.4076 | 5.4812 | 0.1617 |
| VDSR[28] | 31.7324 | 0.7729 | 7.1464 | 0.4024 | 5.4694 | 0.1614 | |
| SwinIR[17] | 31.7514 | 0.7647 | 7.5602 | 0.3620 | 5.3454 | 0.1579 | |
| HAT[31] | 31.7586 | 0.7733 | 7.2579 | 0.4036 | 5.4552 | 0.1565 | |
| MambaIR[39] | 32.5437 | 0.7951 | 7.3111 | 0.3375 | 4.9799 | 0.1548 | |
| MambaIRv2[40] | 32.8623 | 0.8202 | 7.1536 | 0.3357 | 6.1316 | 0.1587 | |
| SIMSR (Ours) | 32.9074 | 0.8257 | 7.0268 | 0.3312 | 4.8296 | 0.1539 | |
| bField | SRCNN[27] | 30.9887 | 0.6972 | 7.7422 | 0.4384 | 5.8524 | 0.1579 |
| VDSR[28] | 31.0113 | 0.6880 | 7.9297 | 0.4175 | 5.8004 | 0.1592 | |
| SwinIR[17] | 31.0884 | 0.6996 | 7.5414 | 0.4309 | 5.8324 | 0.1563 | |
| HAT[31] | 31.1053 | 0.6998 | 7.6590 | 0.4331 | 5.8254 | 0.1546 | |
| MambaIR[39] | 31.5853 | 0.7124 | 7.7017 | 0.3976 | 5.5454 | 0.1542 | |
| MambaIRv2[40] | 31.8543 | 0.7381 | 7.6077 | 0.3744 | 7.2795 | 0.1544 | |
| SIMSR (Ours) | 31.9009 | 0.7428 | 7.4724 | 0.3713 | 5.4483 | 0.1530 | |
| cIndustry | SRCNN[27] | 23.8071 | 0.6530 | 7.5993 | 0.3959 | 7.7559 | 0.1540 |
| VDSR[28] | 24.2170 | 0.6841 | 6.9130 | 0.4155 | 7.7071 | 0.1544 | |
| SwinIR[17] | 24.2811 | 0.6874 | 7.0884 | 0.3713 | 7.4907 | 0.1543 | |
| HAT[31] | 24.5127 | 0.6972 | 7.0539 | 0.3946 | 7.6396 | 0.1539 | |
| MambaIR[39] | 24.5198 | 0.6976 | 7.3548 | 0.3897 | 7.6512 | 0.1525 | |
| MambaIRv2[40] | 25.1669 | 0.7423 | 6.1057 | 0.3465 | 7.5097 | 0.1522 | |
| SIMSR (Ours) | 25.2485 | 0.7488 | 5.8562 | 0.3416 | 7.2029 | 0.1501 | |
| dRiverLake | SRCNN[27] | 26.0556 | 0.7788 | 6.8601 | 0.3210 | 6.1159 | 0.1512 |
| VDSR[28] | 28.9093 | 0.7741 | 6.7854 | 0.3498 | 5.7623 | 0.1517 | |
| SwinIR[17] | 28.9152 | 0.7847 | 6.8352 | 0.3793 | 5.8470 | 0.1517 | |
| HAT[31] | 29.0360 | 0.7872 | 6.8792 | 0.3729 | 5.8266 | 0.1632 | |
| MambaIR[39] | 29.0572 | 0.7875 | 6.8479 | 0.3748 | 5.8134 | 0.1675 | |
| MambaIRv2[40] | 29.4932 | 0.8008 | 6.8506 | 0.3075 | 5.4813 | 0.1698 | |
| SIMSR (Ours) | 29.5927 | 0.8072 | 6.7513 | 0.3035 | 5.2786 | 0.1508 | |
| eForest | SRCNN[27] | 26.3516 | 0.5835 | 9.0943 | 0.5012 | 7.9684 | 0.1637 |
| VDSR[28] | 26.3947 | 0.5854 | 8.9165 | 0.5087 | 7.9465 | 0.1537 | |
| SwinIR[17] | 26.4321 | 0.5852 | 8.8184 | 0.4994 | 7.9394 | 0.1514 | |
| HAT[31] | 26.4655 | 0.5713 | 8.8667 | 0.4525 | 7.9155 | 0.1509 | |
| MambaIR[39] | 26.8391 | 0.5948 | 9.2291 | 0.4448 | 7.7438 | 0.1625 | |
| MambaIRv2[40] | 30.2330 | 0.8339 | 6.4067 | 0.3120 | 8.2871 | 0.1586 | |
| SIMSR (Ours) | 30.2738 | 0.8414 | 6.3091 | 0.3061 | 7.6905 | 0.1502 | |
| fResident | SRCNN[27] | 22.9982 | 0.6361 | 8.5432 | 0.4148 | 8.2945 | 0.1669 |
| VDSR[28] | 23.2386 | 0.6630 | 8.3945 | 0.4454 | 8.2801 | 0.1571 | |
| SwinIR[17] | 23.4050 | 0.6661 | 8.4604 | 0.3976 | 8.0717 | 0.1560 | |
| HAT[31] | 23.4675 | 0.6743 | 8.1616 | 0.4290 | 8.2068 | 0.1543 | |
| MambaIR[39] | 23.4765 | 0.6749 | 8.4901 | 0.4248 | 8.2172 | 0.1541 | |
| MambaIRv2[40] | 27.6244 | 0.6900 | 9.4581 | 0.4196 | 8.1094 | 0.1564 | |
| SIMSR (Ours) | 27.6757 | 0.6955 | 8.0123 | 0.4127 | 7.8572 | 0.1508 | |
| gParking | SRCNN[27] | 23.2839 | 0.6139 | 7.5217 | 0.4232 | 7.7822 | 0.1560 |
| VDSR[28] | 23.5637 | 0.6429 | 6.8784 | 0.4400 | 7.7423 | 0.1573 | |
| SwinIR[17] | 23.6548 | 0.6469 | 7.0965 | 0.3974 | 7.5371 | 0.1558 | |
| HAT[31] | 23.8184 | 0.6568 | 6.7592 | 0.4190 | 7.6764 | 0.1553 | |
| MambaIR[39] | 23.8386 | 0.6578 | 6.9988 | 0.4155 | 7.6839 | 0.1552 | |
| MambaIRv2[40] | 25.0994 | 0.7680 | 7.3963 | 0.3538 | 7.4748 | 0.1545 | |
| SIMSR (Ours) | 25.1659 | 0.7766 | 6.6415 | 0.3489 | 7.3752 | 0.1533 |
5.5. Efficiency Study
5.6. Ablation Study and Comprehensive Analysis
| Component | Method | PSNR ↑ | SSIM ↑ | NIQE ↓ | LPIPS ↓ | RMSE ↓ | SAM ↓ |
|---|---|---|---|---|---|---|---|
| Token Shift | Uni-Shift | 26.1232 | 0.7133 | 6.5121 | 0.3211 | 6.4345 | 0.170245 |
| Quad-Shift | 26.3523 | 0.7299 | 6.3325 | 0.3189 | 6.4023 | 0.169702 | |
| Omni-Shift (Ours) | 26.7741 | 0.7493 | 6.0509 | 0.3086 | 6.3994 | 0.165984 | |
| Backbone | ResNet | 26.4526 | 0.8101 | 5.6533 | 0.2576 | 6.1755 | 0.168675 |
| Naive Attention | 26.5205 | 0.8313 | 5.5299 | 0.2398 | 6.1466 | 0.165219 | |
| SISM (Ours) | 26.6128 | 0.8469 | 5.2425 | 0.2167 | 6.1121 | 0.159541 | |
| MLP Variants | MLP(ReLU) | 30.5086 | 0.9066 | 5.6065 | 0.1752 | 4.7284 | 0.158447 |
| MLP(GELU) | 30.5653 | 0.9164 | 5.4276 | 0.1554 | 4.7006 | 0.156102 | |
| ChannelAtt (Ours) | 30.7739 | 0.9574 | 4.8475 | 0.1119 | 4.6229 | 0.144746 | |
| Scan Methods | 1D Scan | 27.4849 | 0.7789 | 6.1649 | 0.2913 | 6.2295 | 0.180285 |
| 2D Scan (Ours) | 27.5970 | 0.8023 | 5.8193 | 0.2635 | 6.1906 | 0.172631 |
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Platel, A.; Sandino, J.; Shaw, J.; Bollard, B.; Gonzalez, F. Advancing Sparse Vegetation Monitoring in the Arctic and Antarctic: A Review of Satellite and UAV Remote Sensing, Machine Learning, and Sensor Fusion. Remote Sensing 2025, 17. [CrossRef]
- Mathieu, R.; Freeman, C.; Aryal, J. Mapping private gardens in urban areas using object-oriented techniques and very high-resolution satellite imagery. Landscape and Urban Planning 2007, 81, 179–192. [CrossRef]
- Muhmad Kamarulzaman, A.M.; Wan Mohd Jaafar, W.S.; Mohd Said, M.N.; Saad, S.N.M.; Mohan, M. UAV Implementations in Urban Planning and Related Sectors of Rapidly Developing Nations: A Review and Future Perspectives for Malaysia. Remote Sensing 2023, 15. [CrossRef]
- Kumar, S.; Meena, R.S.; Sheoran, S.; Jangir, C.K.; Jhariya, M.K.; Banerjee, A.; Raj, A. Chapter 5 - Remote sensing for agriculture and resource management. In Natural Resources Conservation and Advances for Sustainability; Jhariya, M.K.; Meena, R.S.; Banerjee, A.; Meena, S.N., Eds.; Elsevier, 2022; pp. 91–135. [CrossRef]
- de França e Silva, N.R.; Chaves, M.E.D.; Luciano, A.C.d.S.; Sanches, I.D.; de Almeida, C.M.; Adami, M. Sugarcane Yield Estimation Using Satellite Remote Sensing Data in Empirical or Mechanistic Modeling: A Systematic Review. Remote Sensing 2024, 16. [CrossRef]
- Li, J.; Pei, Y.; Zhao, S.; Xiao, R.; Sang, X.; Zhang, C. A Review of Remote Sensing for Environmental Monitoring in China. Remote Sensing 2020, 12. [CrossRef]
- Lockhart, K.; Sandino, J.; Amarasingam, N.; Hann, R.; Bollard, B.; Gonzalez, F. Unmanned Aerial Vehicles for Real-Time Vegetation Monitoring in Antarctica: A Review. Remote Sensing 2025, 17. [CrossRef]
- Panagiotopoulou, A.; Grammatikopoulos, L.; El Saer, A.; Petsa, E.; Charou, E.; Ragia, L.; Karras, G. Super-Resolution Techniques in Photogrammetric 3D Reconstruction from Close-Range UAV Imagery. Heritage 2023, 6, 2701–2715. [CrossRef]
- Albanwan, H.; Qin, R.; Liu, J.K. Remote Sensing-Based 3D Assessment of Landslides: A Review of the Data, Methods, and Applications. Remote Sensing 2024, 16. [CrossRef]
- Stöcker, C.; Bennett, R.; Nex, F.; Gerke, M.; Zevenbergen, J. Review of the Current State of UAV Regulations. Remote Sensing 2017, 9. [CrossRef]
- Song, Y.; Sun, L.; Bi, J.; Quan, S.; Wang, X. DRGAN: A Detail Recovery-Based Model for Optical Remote Sensing Images Super-Resolution. IEEE Transactions on Geoscience and Remote Sensing 2025, 63, 1–13. [CrossRef]
- Zhang, Y.; Zheng, P.; Zeng, C.; Xiao, B.; Li, Z.; Gao, X. Jointly RS Image Deblurring and Super-Resolution With Adjustable-Kernel and Multi-Domain Attention. IEEE Transactions on Geoscience and Remote Sensing 2025, 63, 1–16. [CrossRef]
- Chung, M.; Jung, M.; Kim, Y. Enhancing Remote Sensing Image Super-Resolution Guided by Bicubic-Downsampled Low-Resolution Image. Remote Sensing 2023, 15. [CrossRef]
- Luo, J.; Han, L.; Gao, X.; Liu, X.; Wang, W. SR-FEINR: Continuous Remote Sensing Image Super-Resolution Using Feature-Enhanced Implicit Neural Representation. Sensors 2023, 23. [CrossRef]
- Luo, W.; Li, Y.; Urtasun, R.; Zemel, R. Understanding the Effective Receptive Field in Deep Convolutional Neural Networks. In Proceedings of the Advances in Neural Information Processing Systems; Lee, D.; Sugiyama, M.; Luxburg, U.; Guyon, I.; Garnett, R., Eds. Curran Associates, Inc., 2016, Vol. 29.
- Lim, B.; Son, S.; Kim, H.; Nah, S.; Lee, K.M. Enhanced deep residual networks for single image super-resolution. In Proceedings of the Proceedings of the IEEE conference on computer vision and pattern recognition workshops, 2017, pp. 136–144.
- Liang, J.; Cao, J.; Sun, G.; Zhang, K.; Van Gool, L.; Timofte, R. Swinir: Image restoration using swin transformer. In Proceedings of the Proceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 1833–1844.
- Zhang, Y.; Li, K.; Li, K.; Wang, L.; Zhong, B.; Fu, Y. Image super-resolution using very deep residual channel attention networks. In Proceedings of the Proceedings of the European conference on computer vision (ECCV), 2018, pp. 286–301.
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, .; Polosukhin, I. Attention is all you need. In Proceedings of the Advances in neural information processing systems, 2017, pp. 5998–6008.
- Somvanshi, S.; Monzurul Islam, M.; Sultana Mimi, M.; Bashar Polock, S.B.; Chhetri, G.; Das, S. From S4 to Mamba: A Comprehensive Survey on Structured State Space Models. arXiv e-prints 2025, p. arXiv:2503.18970, [arXiv:stat.ML/2503.18970]. [CrossRef]
- Gu, A.; Goel, K.; Ré, C. Efficiently modeling long sequences with structured state spaces. arXiv preprint arXiv:2111.00396 2021.
- Liu, Y.; Tian, Y.; Zhao, Y.; Yu, H.; Xie, L.; Wang, Y.; Ye, Q.; Jiao, J.; Liu, Y. VMamba: Visual State Space Model. In Proceedings of the Advances in Neural Information Processing Systems; Globerson, A.; Mackey, L.; Belgrave, D.; Fan, A.; Paquet, U.; Tomczak, J.; Zhang, C., Eds. Curran Associates, Inc., 2024, Vol. 37, pp. 103031–103063.
- Wang, Y.; Yuan, W.; Xie, F.; Lin, B. ESatSR: Enhancing Super-Resolution for Satellite Remote Sensing Images with State Space Model and Spatial Context. Remote Sensing 2024, 16. [CrossRef]
- Peng, B.; Alcaide, E.; Anthony, Q.; Al-Ghamdi, A.; Fan, B.; Gao, L.; Gao, Y.; He, J.; He, P.; He, X.; et al. RWKV: Reinventing RNNs for the Transformer Era. arXiv preprint arXiv:2305.13048 2023.
- Peng, B.; Goldstein, D.; Anthony, Q.; Albalak, A.; Alcaide, E.; Biderman, S.; Cheah, E.; Ferdinan, T.; Hou, H.; Kazienko, P.; et al. Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence. ArXiv 2024, abs/2404.05892.
- Keys, R. Cubic convolution interpolation for digital image processing. IEEE Transactions on Acoustics, Speech, and Signal Processing 1981, 29, 1153–1160.
- Dong, C.; Loy, C.C.; He, K.; Tang, X. Image super-resolution using deep convolutional networks. In Proceedings of the IEEE Transactions on Pattern Analysis and Machine Intelligence. IEEE, 2015, Vol. 38, pp. 295–307.
- Kim, J.; Lee, J.K.; Lee, K.M. Accurate image super-resolution using very deep convolutional networks. In Proceedings of the Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 1646–1654.
- Zhang, Z.; Liu, J.; Wang, L. Swinfir: Rethinking the swinir for image restoration and enhancement. In Proceedings of the 2022 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 2022, pp. 1–6.
- Chen, X.; Wang, Y.; Wu, G.; Chen, J.; Liu, J. Activating more pixels in image super-resolution transformer. In Proceedings of the Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2023, pp. 11612–11621.
- Chen, X.; Wang, X.; Zhang, W.; Kong, X.; Qiao, Y.; Zhou, J.; Dong, C. HAT: Hybrid Attention Transformer for Image Restoration, 2024, [arXiv:cs.CV/2309.05239].
- Chen, C.; Yang, H.; Chen, S.; Xi, F.; Liu, Z. A Data-Driven Motion Compensation Scheme for Compressed Sensing SAR Image Restoration. IEEE Transactions on Geoscience and Remote Sensing 2025, 63, 1–18. [CrossRef]
- Luan, X.; Fan, H.; Wang, Q.; Yang, N.; Liu, S.; Li, X.; Tang, Y. FMambaIR: A Hybrid State-Space Model and Frequency Domain for Image Restoration. IEEE Transactions on Geoscience and Remote Sensing 2025, 63, 1–14. [CrossRef]
- Zhou, Y.; Suo, J.; Wang, Y.; Su, J.; Xiao, W.; Hong, Z.; Ranjan, R.; Wang, L.; Wen, Z. MMCANet A Multimodal and Cross-Attention Network for Cloud Removal and Exploration of Progressive Remote Sensing Images Restoration Algorithm. IEEE Transactions on Geoscience and Remote Sensing 2025, 63, 1–13. [CrossRef]
- Zhang, W.; Qu, Q.; Qiu, A.; Li, Z.; Liu, X.; Li, Y. Efficient Denoising of Ultrasonic Logging While Drilling Images: Multinoise Diffusion Denoising and Distillation. IEEE Transactions on Geoscience and Remote Sensing 2025, 63, 1–17. [CrossRef]
- Huang, Z.; Yang, Y.; Yu, H.; Li, Q.; Shi, Y.; Zhang, Y.; Fang, H. RCST: Residual Context-Sharing Transformer Cascade to Approximate Taylor Expansion for Remote Sensing Image Denoising. IEEE Transactions on Geoscience and Remote Sensing 2025, 63, 1–15. [CrossRef]
- Cui, Y.; Bin Waheed, U.; Chen, Y. Unsupervised Deep Learning for DAS-VSP Denoising Using Attention-Based Deep Image Prior. IEEE Transactions on Geoscience and Remote Sensing 2025, 63, 1–14. [CrossRef]
- Zhang, Y.; Li, K.; Li, K.; Wang, L.; Zhong, B.; Fu, Y. Image Super-Resolution Using Very Deep Residual Channel Attention Networks. In Proceedings of the The European Conference on Computer Vision (ECCV), September 2018.
- Zhao, W.; Wang, L.; Zhang, K. MambaIR: A Simple and Efficient State Space Model for Image Restoration. arXiv preprint arXiv:2403.09963 2024.
- Guo, H.; Guo, Y.; Zha, Y.; Zhang, Y.; Li, W.; Dai, T.; Xia, S.T.; Li, Y. MambaIRv2: Attentive State Space Restoration, 2025, [arXiv:eess.IV/2411.15269].
- Zhu, L.; Liao, B.; Zhang, Q.; Wang, X.; Liu, W.; Wang, X. Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model, 2024, [arXiv:cs.CV/2401.09417].
- Zhang, H.; Zhu, Y.; Wang, D.; Zhang, L.; Chen, T.; Wang, Z.; Ye, Z. A Survey on Visual Mamba. Applied Sciences 2024, 14. [CrossRef]
- He, X.; Cao, K.; Zhang, J.; Yan, K.; Wang, Y.; Li, R.; Xie, C.; Hong, D.; Zhou, M. Pan-Mamba: Effective pan-sharpening with state space model. Information Fusion 2025, 115, 102779. [CrossRef]
- Zhu, Q.; Zhang, G.; Zou, X.; Wang, X.; Huang, J.; Li, X. ConvMambaSR: Leveraging State-Space Models and CNNs in a Dual-Branch Architecture for Remote Sensing Imagery Super-Resolution. Remote Sensing 2024, 16. [CrossRef]
- Zou, Q.; Ni, L.; Zhang, T.; Wang, Q. Deep Learning Based Feature Selection for Remote Sensing Scene Classification. IEEE Geoscience and Remote Sensing Letters 2015, 12, 2321–2325. [CrossRef]
- Yang, Y.; Newsam, S.D. Bag-of-visual-words and spatial extensions for land-use classification. In Proceedings of the 18th ACM SIGSPATIAL International Symposium on Advances in Geographic Information Systems, ACM-GIS 2010, November 3-5, 2010, San Jose, CA, USA, Proceedings; Agrawal, D.; Zhang, P.; Abbadi, A.E.; Mokbel, M.F., Eds. ACM, 2010, pp. 270–279. [CrossRef]
- Dai, D.; Yang, W. Satellite Image Classification via Two-Layer Sparse Coding With Biased Image Representation. IEEE Geoscience and Remote Sensing Letters 2011, 8, 173–176. [CrossRef]
- Wang, Z.; Bovik, A.; Sheikh, H.; Simoncelli, E. Image quality assessment: from error visibility to structural similarity. IEEE Transactions on Image Processing 2004, 13, 600–612. [CrossRef]
- Yuhas, R.; Goetz, A.; Boardman, J. Discrimination among semi-arid landscape endmembers using the Spectral Angle Mapper (SAM) algorithm 1992.
- Mittal, A.; Soundararajan, R.; Bovik, A.C. Making a “Completely Blind” Image Quality Analyzer. IEEE Signal Processing Letters 2013, 20, 209–212. [CrossRef]
- Zhang, R.; Isola, P.; Efros, A.A.; Shechtman, E.; Wang, O. The Unreasonable Effectiveness of Deep Features as a Perceptual Metric. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2018.
- Gu, J.; Dong, C. Interpreting Super-Resolution Networks with Local Attribution Maps. In Proceedings of the Computer Vision and Pattern Recognition, 2021.
- Sundararajan, M.; Taly, A.; Yan, Q. Axiomatic Attribution for Deep Networks. In Proceedings of the Proceedings of the 34th International Conference on Machine Learning; Precup, D.; Teh, Y.W., Eds. PMLR, 06–11 Aug 2017, Vol. 70, Proceedings of Machine Learning Research, pp. 3319–3328.
- Glorot, X.; Bordes, A.; Bengio, Y. Deep Sparse Rectifier Neural Networks. In Proceedings of the International Conference on Artificial Intelligence and Statistics, 2011.
- Hendrycks, D.; Gimpel, K. Gaussian Error Linear Units (GELUs), 2023, [arXiv:cs.LG/1606.08415].








| Implementation | FLOPs (G) | Training | Inference |
|---|---|---|---|
| Naive PyTorch | 89.23 | 12h 23m | 1h 21m 10s |
| Triton (Element-wise, FP32) | 71.23 | 4h 56m | 17m 57s |
| Triton (Element-wise, BF16) | 71.23 | 4h 45m | 13m 50s |
| SIMSR (Chunk-wise, BF16) | 60.78 | 1h 25m | 7m 29s |
| Method | FLOPs (G) | Training | Inference | Params (M) |
|---|---|---|---|---|
| SwinIR [17] | 154.95 | 5h 23m | 28m 10s | 2.78 |
| HAT [31] | 142.95 | 4h 56m | 27m 57s | 2.82 |
| MambaIR [39] | 121.34 | 4h 45m | 23m 50s | 2.87 |
| MambaIRv2 [40] | 132.34 | 5h 1m | 25m 12s | 2.92 |
| SIMSR | 60.78 | 1h 25m | 7m 29s | 2.12 |
| Model | L2 Hit Rate (%) | Memory Reads (GB/s) |
|---|---|---|
| SwinIR | 41.2 | 210 |
| HAT | 44.7 | 204 |
| MambaIR | 48.5 | 164 |
| MambaIRv2 | 54.1 | 169 |
| SIMSR | 86.7 | 78 |
| Components | Omni-Shift | SISM Backbone | Semantic Decomp. | 2D Scan | PSNR ↑ | SSIM ↑ |
|---|---|---|---|---|---|---|
| Baseline | 25.82 | 0.712 | ||||
| + Channel Attention + Omni-Shift | ✓ | 26.35 | 0.7304 | |||
| + SISM Backbone | ✓ | ✓ | 26.61 | 0.8479 | ||
| + Semantic Decomposition | ✓ | ✓ | ✓ | 30.77 | 0.8574 | |
| Full Model (SIMSR) | ✓ | ✓ | ✓ | ✓ | 31.21 | 0.8962 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).