Submitted:
05 November 2025
Posted:
05 November 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- The DMAS framework is based on two essential sub-processes: The first stage involves adversarial pre-training of two segmentation networks and their respective discriminators to develop two preliminary pseudo-label annotation models. The second stage is dynamic mutual learning, which measures the discrepancies between different segmentation models through confidence maps to mitigate the effects of potential pseudo label labeling errors, thereby enhancing the accuracy of the training process.
- The adversarial training method is mainly used for training a segmentation model and a fully convolutional discriminator with labeled data to generate pseudo-labels. This allows the discriminator to differentiate between real label maps and their predicted counterparts by generating a confidence map. Also, it enables a quantitative assessment of segmentation accuracy in specific regions of the pseudo-labels.
- Dynamic mutual learning guides different models based on their varying prior knowledge. It leverages the divergence between these models to detect inaccuracies in pseudo-label generation. By employing a dynamically reweighted loss function, it reflects the discrepancies between two models trained with each other’s pseudo-labels, thereby assigning lower weights to pixels with a higher likelihood of error.
- We validate the effectiveness of our proposed method on various underwater datasets, namely the DUT dataset and the SUIM dataset, demonstrating that the proposed semi-supervised learning algorithm is capable of enhancing the performance of models trained with limited and noisy annotations to be comparable to models fully-supervisedly trained with large amounts of labeled data.
2. Related Work
2.1. Pseudo-Label Methods for Semi-Supervised Semantic Segmentation
2.2. Adversarial Learning for Semi-Supervised Semantic Segmentation
3. Methodology
3.1. Overview
3.2. Adversarial Pre-training
| Algorithm 1 Model adversarial training. |
|
3.2.1. Fully Supervised Training
3.2.2. Semi-Supervised Training
3.3. Dynamic Mutual Learning
3.3.1. Dynamic Mutual Iterative Framework
3.3.2. Dynamic Reweighting Loss
4. Experiments


4.1. Dataset
4.2. Implementation Details
4.3. Evaluation Metrics
4.4. Performance Comparison
4.4.1. Quantitative Analysis
4.4.2. Qualitative Analysis
4.5. Ablation Study
4.6. Discussion
5. Conclusions
References
- B. P. McNelly, “Advances in autonomous underwater vehicle technologies for enhanced harbor protection,” Ph.D. dissertation, Johns Hopkins University, 2023.
- Z. Li, S. Liang, M. Guo, H. Zhang, H. Wang, Z. Li, and H. Li, “Adrc-based underwater navigation control and parameter tuning of an amphibious multirotor vehicle,” IEEE Journal of Oceanic Engineering, 2024. [CrossRef]
- X. Yang, Z. Song, I. King, and Z. Xu, “A survey on deep semi-supervised learning,” IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 9, pp. 8934–8954, 2022. [CrossRef]
- H. Zeng, Z. Liu, and H. Cai, “Research on the application of deep learning in computer network information security,” in Journal of Physics: Conference Series, vol. 1650, no. 3. IOP Publishing, 2020, p. 032117. [CrossRef]
- M. Zhang, Y. Zhou, J. Zhao, Y. Man, B. Liu, and R. Yao, “A survey of semi-and weakly supervised semantic segmentation of images,” Artificial Intelligence Review, vol. 53, pp. 4259–4288, 2020. [CrossRef]
- Q. Li, X.-M. Wu, H. Liu, X. Zhang, and Z. Guan, “Label efficient semi-supervised learning via graph filtering,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 9582–9591. [CrossRef]
- Y. Wei, H. Xiao, H. Shi, Z. Jie, J. Feng, and T. S. Huang, “Revisiting dilated convolution: A simple approach for weakly-and semi-supervised semantic segmentation,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 7268–7277. [CrossRef]
- Y. Ouali, C. Hudelot, and M. Tami, “Semi-supervised semantic segmentation with cross-consistency training,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 12 674–12 684. [CrossRef]
- D.-H. Lee et al., “Pseudo-label: The simple and efficient semi-supervised learning method for deep neural networks,” in Workshop on challenges in representation learning, ICML, vol. 3, no. 2. Atlanta, 2013, p. 896.
- R. Li, S. Li, C. He, Y. Zhang, X. Jia, and L. Zhang, “Class-balanced pixel-level self-labeling for domain adaptive semantic segmentation,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2022, pp. 11 593–11 603. [CrossRef]
- I. Triguero, S. García, and F. Herrera, “Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study,” Knowledge and Information systems, vol. 42, pp. 245–284, 2015. [CrossRef]
- S.-S. Learning, “Semi-supervised learning,” CSZ2006. html, vol. 5, p. 2, 2006.
- L. Yang, W. Zhuo, L. Qi, Y. Shi, and Y. Gao, “St++: Make self-training work better for semi-supervised semantic segmentation,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2022, pp. 4268–4277. [CrossRef]
- E. W. Teh, T. DeVries, B. Duke, R. Jiang, P. Aarabi, and G. W. Taylor, “The gist and rist of iterative self-training for semi-supervised segmentation,” in 2022 19th Conference on Robots and Vision (CRV). IEEE, 2022, pp. 58–66. [CrossRef]
- H. Li and H. Zheng, “A residual correction approach for semi-supervised semantic segmentation,” in Pattern Recognition and Computer Vision: 4th Chinese Conference, PRCV 2021, Beijing, China, October 29–November 1, 2021, Proceedings, Part IV 4. Springer, 2021, pp. 90–102. [CrossRef]
- J. Yuan, Y. Liu, C. Shen, Z. Wang, and H. Li, “A simple baseline for semi-supervised semantic segmentation with strong data augmentation,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 8229–8238. [CrossRef]
- Y. Zhang, T. Xiang, T. M. Hospedales, and H. Lu, “Deep mutual learning,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 4320–4328. [CrossRef]
- Z. Feng, Q. Zhou, Q. Gu, X. Tan, G. Cheng, X. Lu, J. Shi, and L. Ma, “Dmt: Dynamic mutual training for semi-supervised learning,” Pattern Recognition, vol. 130, p. 108777, 2022. [CrossRef]
- Y. Zhou, R. Jiao, D. Wang, J. Mu, and J. Li, “Catastrophic forgetting problem in semi-supervised semantic segmentation,” IEEE Access, vol. 10, pp. 48 855–48 864, 2022. [CrossRef]
- A. Creswell, T. White, V. Dumoulin, K. Arulkumaran, B. Sengupta, and A. A. Bharath, “Generative adversarial networks: An overview,” IEEE signal processing magazine, vol. 35, no. 1, pp. 53–65, 2018. [CrossRef]
- J. Gui, Z. Sun, Y. Wen, D. Tao, and J. Ye, “A review on generative adversarial networks: Algorithms, theory, and applications,” IEEE transactions on knowledge and data engineering, vol. 35, no. 4, pp. 3313–3332, 2021. [CrossRef]
- Z. Wang, Q. She, and T. E. Ward, “Generative adversarial networks in computer vision: A survey and taxonomy,” ACM Computing Surveys (CSUR), vol. 54, no. 2, pp. 1–38, 2021. [CrossRef]
- N. Souly, C. Spampinato, and M. Shah, “Semi supervised semantic segmentation using generative adversarial network,” in Proceedings of the IEEE international conference on computer vision, 2017, pp. 5688–5696. [CrossRef]
- D. Li, J. Yang, K. Kreis, A. Torralba, and S. Fidler, “Semantic segmentation with generative models: Semi-supervised learning and strong out-of-domain generalization,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 8300–8311. [CrossRef]
- G. Jin, C. Liu, and X. Chen, “Adversarial network integrating dual attention and sparse representation for semi-supervised semantic segmentation,” Information Processing & Management, vol. 58, no. 5, p. 102680, 2021. [CrossRef]
- D. Xu and Z. Wang, “Semi-supervised semantic segmentation using an improved generative adversarial network,” Journal of Intelligent & Fuzzy Systems, vol. 40, no. 5, pp. 9709–9719, 2021. [CrossRef]
- R. Mendel, L. A. De Souza, D. Rauber, J. P. Papa, and C. Palm, “Semi-supervised segmentation based on error-correcting supervision,” in Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XXIX 16. Springer, 2020, pp. 141–157. [CrossRef]
- Z. Ke, D. Qiu, K. Li, Q. Yan, and R. W. Lau, “Guided collaborative training for pixel-wise semi-supervised learning,” in Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XIII 16. Springer, 2020, pp. 429–445. [CrossRef]
- W.-C. Hung, Y.-H. Tsai, Y.-T. Liou, Y.-Y. Lin, and M.-H. Yang, “Adversarial learning for semi-supervised semantic segmentation,” arXiv preprint arXiv:1802.07934, 2018. [CrossRef]
- J. Zhang, Z. Li, C. Zhang, and H. Ma, “Stable self-attention adversarial learning for semi-supervised semantic image segmentation,” Journal of Visual Communication and Image Representation, vol. 78, p. 103170, 2021. [CrossRef]
- P. Luc, C. Couprie, S. Chintala, and J. Verbeek, “Semantic segmentation using adversarial networks,” arXiv preprint arXiv:1611.08408, 2016. [CrossRef]
- S. C. Yurtkulu, Y. H. Şahin, and G. Unal, “Semantic segmentation with extended deeplabv3 architecture,” in 2019 27th Signal Processing and Communications Applications Conference (SIU). IEEE, 2019, pp. 1–4. [CrossRef]
- J. Long, E. Shelhamer, and T. Darrell, “Fully convolutional networks for semantic segmentation,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 3431–3440. [CrossRef]
- A. Howard, M. Sandler, G. Chu, L.-C. Chen, B. Chen, M. Tan, W. Wang, Y. Zhu, R. Pang, V. Vasudevan et al., “Searching for mobilenetv3,” in Proceedings of the IEEE/CVF international conference on computer vision, 2019, pp. 1314–1324. [CrossRef]
- G. French, S. Laine, T. Aila, M. Mackiewicz, and G. Finlayson, “Semi-supervised semantic segmentation needs strong, varied perturbations,” arXiv preprint arXiv:1906.01916, 2019. [CrossRef]
- S. Mittal, M. Tatarchenko, and T. Brox, “Semi-supervised semantic segmentation with high-and low-level consistency,” IEEE transactions on pattern analysis and machine intelligence, vol. 43, no. 4, pp. 1369–1379, 2019. [CrossRef]





Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).