Submitted:
22 January 2025
Posted:
23 January 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Principal of Proposed Method

2.1. Semantic Analysis Module
2.2. Adaptive Variable Compression Rate Module
2.3. Semantic Enhancement Image Compression Module
3. Experimental Results
3.1. Dataset and Implementation details
3.2. Comparison with traditional image compression methods
3.3. Comparison with semantic deep learning based image compression methods
3.4. Comparison with variable rate image compression methods
3.5. Visual comparison
3.6. Ablation study
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Conflicts of Interest
References
- Yamauchi, S.; Kawamura, M. A neural-network-based watermarking method approximating JPEG quantization. Journal of Imaging, 2024, 10, 138. [CrossRef]
- Jiang, Y.; Cui, R.; Liu, F. Multi-resolutional human visual perception optimized pathology image progressive coding based on JPEG2000. Signal Processing: Image Communication, 2023, 115, 116960. [CrossRef]
- Naumenko, V.; Kovalenko, B.; Lukin, V. BPG-based compression analysis of poisson-noisy medical images. Radioelectronic and Computer Systems, 2023, 3, 91-100. [CrossRef]
- Zhao, C.; Xiang, S.; Wang, Y.; Cai, Z.; Shen, J.; Zhou, S. et al. Context-aware network fusing transformer and V-Net for semi-supervised segmentation of 3D left atrium. Expert Systems with Applications, 2023, 214, 119105. [CrossRef]
- Li, J.; Wu, Q.; Wang, Y.; Zhou, S.; Zhang, L.; Wei, J.; Zhao, D. DiffCAS: diffusion based multi-attention network for segmentation of 3D coronary artery from CT angiography. Signal, Image and Video Processing, 2024, 18, 7487-7498. [CrossRef]
- Xiang, S.; Li, N.; Wang, Y.; Zhou, S.; Wei, J.; Li, S. Automatic delineation of the 3D left atrium from LGE-MRI: actor-critic based detection and semi-supervised segmentation. IEEE Journal of Biomedical and Health Informatics. 2024, 28, 3545-3556. [CrossRef]
- Loraksa, C.; Mongkolsomlit, S.; Nimsuk, N.; Uscharapong, M.; Kiatisevi, P. Effectiveness of learning systems from common image file types to detect osteosarcoma based on convolutional neural networks (CNNs) models. Journal of Imaging, 2021, 8, 2. [CrossRef]
- Sakurai, T.; Inoue, U. Lossless image set compression using animated FLIF. In Proceedings of the 8th International Virtual Conference on Applied Computing & Information Technology, Kanazawa, Japan, 20-22 June 2021.
- Yang, F.; Herranz, L.; Van De Weijer, J.; Guitián, J. A. I.; López, A. M.; Mozerov, M. G. Variable rate deep image compression with modulated autoencoder. IEEE Signal Processing Letters, 2020, 27, 331-335. [CrossRef]
- Hu, Y.; Yang, W.; Ma, Z.; Liu, J. Learning end-to-end lossy image compression: A benchmark. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 44, 4194-4211.
- He, D.; Yang, Z.; Peng, W.; Ma, R.; Qin, H.; Wang, Y. ELIC: efficient learned image compression with unevenly grouped space-channel contextual adaptive coding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18-24 June 2022.
- Tong, K.; Wu, Y.; Li, Y. et al. Qvrf: a quantization-error-aware variable rate framework for learned image compression. In Proceedings of the IEEE International Conference on Image Processing, Taipei, Taiwan, China, 15-18 October 2023.
- Sebai, D; Sehli, M.; Ghorbel, F. End-to-End variable-rate learning-based depth compression guided by deep correlation features. Journal of Signal Processing Systems, 2024, 96(1), 81-97. [CrossRef]
- Yang, R.; Mandt, S. Lossy image compression with conditional diffusion models. In Proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS), Vancouver, BC, Canada, 9-15 December , 2024.
- Prakash, A.; Moran, N.; Garber, S. et al. Semantic perceptual image compression using deep convolution networks. In Proceedings of the Data Compression Conference, Snowbird, UT, USA, 28-30 March 2017.
- Agustsson, E.; Tschannen, M.; Mentzer, F.; Timofte, R.; Gool, L. V. Generative adversarial networks for extreme learned image compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2019), Long Beach, CA, USA, 15-20 June 2019.
- Wang, C.; Han, Y.; Wang, W. An end-to-end deep learning image compression framework based on semantic analysis. Applied Sciences. 2019, 9, 3580. [CrossRef]
- Akbari, M.; Liang, J.; Han, J. DSSLIC: deep semantic segmentation-based layered image compression. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2019), Brighton, UK, 12-17 May 2019.
- Kingma, D.P.; Salimans, T.; Jozefowicz, R.; Chen, X.; Sutskever, I.; Welling, M. Improved variational inference with inverse autoregressive flow. In Proceedings of the 30th International Conference on Neural Information Processing Systems (NIPS 2016), Barcelona, Spain, 5-10 December 2016.
- Qi, L.; Ren, Y.; Fang, Y.; Zhou, J. Two-view LSTM variational auto-encoder for fault detection and diagnosis in multivariable manufacturing processes. Neural Computing and Applications, 2023, 35, 22007-22026. [CrossRef]
- Ballé, J.; Minnen, D.; Singh, S.; Hwang, S.J.; Johnston, N. Variational image compression with a scale hyperprior. In Proceedings of the International Conference on Learning Representations, Vancouver, BC, Canada, 30 April-3 May, 2018.
- Duan, Z.; Lu, M.; Ma, J.; Huang, Y.; Ma, Z.; Zhu, F. QARV: quantization-aware resnet VAE for lossy image compression. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 436-450. [CrossRef]
- Peng, P.; Li, Z.-N. Self-information weighting for image quality assessment. In Proceedings of the 4th International Congress on Image and Signal Processing (CISP 2011), Shanghai, China, 16-18 October 2011.
- Griffin, G.; Holub, A.; Perona, P. Caltech-256 object category dataset. Technical Report 7694, California Institute of Technology, Pasadena, CA, 2007.
- Lin, T. Y. ; Maire, M. ; Belongie, S. ; Hays, J. ; Zitnick, C. L. Microsoft coco: common objects in context. In Proceedings of the 13th European Conference on Computer Vision (ECCV 2014), Zurich, Switzerland, 6-12 September 2014. [CrossRef]
- Kwan, C.; Chou, B. Further improvement of debayering performance of RGBW color filter arrays using deep learning and pansharpening techniques. Journal of Imaging, 2019, 5, 68.
- Asuni, N.; Giachetti, A. TESTIMAGES: a large-scale archive for testing visual devices and basic image processing algorithms. In STAG 2014, 63-70.
- Kingma D.; Ba J. Adam: a method for stochastic optimization. arXiv 2014, arXiv:1412.6980.
- Liu, J.; Sun, H.; Katto, J. Learned image compression with mixed Transformer-CNN architectures. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023), Vancouver, Canada, 18-22 June 2023.
- Xie, Y.; Cheng, K. L.; Chen, Q. Enhanced invertible encoding for learned image compression. In Proceedings of the 29th ACM International Conference on Multimedia, Chengdu, China, 20-24 October 2021.
- Hoang, T. M.; Zhou, J.; Fan, Y. Image compression with encoder-decoder matched semantic segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020), Seattle, WA, USA, 13-19 June 2020.
- Luo, S.; Yang, Y.; Yin, Y.; Shen, C.; Zhao, Y.; Song, M. DeepSIC: deep semantic image compression. In Proceedings of the 25th International Conference on Neural Information Processing (ICONIP 2018), Siem Reap, Cambodia, 13-16 December 2018.
- Lei, Z.; Hong, X.; Shi, J.; Su, M.; Lin, C.; Xia, W. Quantization-Based adaptive deep image compression using semantic information. in IEEE Access, 2023, 11, 118061-11807. [CrossRef]














| Literature date | Bibliography | Module | PSNR | MS-SSIM |
|---|---|---|---|---|
| 1991 | [1] | JPEG | 33.2 | 0.950 |
| 2002 | [2] | JPEG2000 | 37.5 | 0.960 |
| 2017 | [3] | BPG | 38.5 | 0.973 |
| 2023 | [29] | TCM | 39.5 | 0.994 |
| 2021 | [30] | INN | 38.2 | 0.992 |
| Song method | 40.1 | 0.998 |
| Literature date | Bibliography | Module | PSNR | MS-SSIM |
|---|---|---|---|---|
| 2019 | [18] | DSSLIC | 39.8 | 0.991 |
| 2021 | [31] | EDMS | 34.2 | 0.993 |
| 2018 | [32] | DeepSIC | 37.3 | 0.985 |
| 2023 | [33] | ADAPTIVE DIC | 39.7 | 0.992 |
| Song method | 40.1 | 0.996 |
| Literature date | Bibliography | Module | PSNR | MS-SSIM |
|---|---|---|---|---|
| 2020 | [9] | MAE | 38 | 0.985 |
| 2021 | [10] | Coarse-to-Fine | 41.2 | 0.987 |
| 2022 | [11] | ELIC | 38.8 | 0.993 |
| 2023 | [12] | QVRF | 39.8 | 0.995 |
| Song method | 41.5 | 0.997 |
| Dataset | bpp | Metrics | Baseline | Baseline+semantic |
|---|---|---|---|---|
| Kodak | 1.124 | PSNR | 40.34 | 40.57 |
| MS-SSIM | 0.995 | 0.998 | ||
| Tecnick TESTIMAGES | 0.846 | PSNR | 40.40 | 40.42 |
| MS-SSIM | 0.993 | 0.996 | ||
| CLIC 2022 | 0.836 | PSNR | 39.42 | 39.41 |
| MS-SSIM | 0.992 | 0.994 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).