Submitted:
19 March 2026
Posted:
23 March 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
- We propose a unified learning paradigm that simultaneously promotes compact, discriminative feature representations and mitigates the impact of label noise, addressing the dual challenges of missing and incorrect annotations in PMLL.
- By constructing a similarity graph among instances using the fuzzy C-means algorithm, our framework effectively propagates label information, enabling the correction of inaccurate annotations and the inference of missing labels.
- We design a strategy to separate reliable label-specific features from misleading or noisy signals, allowing the model to focus on informative representations and reduce the adverse effects of corrupted supervision.
- The proposed approach integrates feature learning and label refinement into a single optimization process, ensuring synergistic improvements in both representation quality and label reliability.
2. Related Work
2.1. Partial Multi-Label Learning (PMLL)
2.2. Multi-Label Learning with Missing Labels
2.3. Multi-Label Learning with Noisy and Incomplete Supervision
3. The Proposed Methodology
3.1. Problem Statement and Notations
3.2. Instance-level Graph Adjacency Matrix
3.3. Compact Feature-Label Collaboration
3.4. Label Recovery via Graph Propagation
3.5. Ambiguous Feature Identification
4. Solutions to the Optimization Problem
4.1. Updating by Fixing Other Variables
4.2. Updating by Fixing Other Variables
4.3. Updating by Fixing Other Variables
4.4. Updating by Fixing Other Variables
4.5. Updating by Fixing Other Variables
4.6. The Solution of Equation (14)
4.7. Convergence Analysis
| Algorithm 1 The Proposed WPML Algorithm |
|
4.8. Complexity Analysis
5. Experiments
5.1. Experimental Setup
5.2. Compared Methods and Evaluation Metrics
- WPML (Algorithm 1): The hyperparameters and are selected from , is tuned within , and controlling noisy label sparsity is chosen from . The neighbor size k is fixed at 10.
5.3. Performance Analysis
5.4. Statistical Analysis of Algorithm Performance
5.5. Sensitivity Analysis
5.6. Ablation Study
5.7. Convergence Analysis
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Conflicts of Interest
References
- Liu, W.; Wang, H.; Shen, X.; Tsang, I.W. The emerging trends of multi-label learning. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 44(11), 7955–7974. [Google Scholar] [CrossRef]
- Xie, M.; Huang, S. Partial multi-label learning. In Proc. AAAI Conf. Artif. Intell., 2018; pp. 4302–4309.
- Zhong, J.; Shang, R.; Zhao, F.; Zhang, W.; Xu, S. Negative label and noise information guided disambiguation for partial multi-label learning. IEEE Trans. Multim. 2024, 26, 9920–9935. [Google Scholar] [CrossRef]
- Chen, J.-Y.; Li, S.-Y.; Huang, S.-J.; Chen, S.; Wang, L.; Xie, M.-K. UNM: A universal approach for noisy multi-label learning. IEEE Trans. Knowl. Data Eng. 2024, 36(9), 4968–4980. [Google Scholar] [CrossRef]
- Zhang, M.; Yu, F.; Tang, C. Disambiguation-free partial label learning. IEEE Trans. Knowl. Data Eng. 2017, 29(10), 2155–2167. [Google Scholar] [CrossRef]
- Gibaja, E.; Ventura, S. A tutorial on multilabel learning. ACM Comput. Surv. 2015, 47(3), 1–38. [Google Scholar] [CrossRef]
- Zhang, M.; Fang, J. Partial multi-label learning via credible label elicitation. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 43, 3587–3599. [Google Scholar] [CrossRef] [PubMed]
- Sun, L.; Du, W.; Ding, W.; Long, Q.; Xu, J. Granular ball-based fuzzy multineighborhood rough set for feature selection via label enhancement. Eng. Appl. Artif. Intell. 2025, 145, 110191. [Google Scholar] [CrossRef]
- Yan, Y.; Li, S.; Feng, L. Partial multi-label learning with mutual teaching. Knowl.-Based Syst. 2021, 212, 106624. [Google Scholar] [CrossRef]
- Xie, M.; Huang, S. Semi-supervised partial multi-label learning. In IEEE Int. Conf. Data Mining (ICDM), 2020; pp. 691–700.
- Li, Z.; Jia, Y.; Yu, M.; Miao, Z. Calibrated disambiguation for partial multi-label learning. In Proc. AAAI Conf. Artif. Intell., vol. 39, 2025; pp. 18620–18628.
- Wang, C.; Zhang, Y.; An, S.; Deng, T. Adaptive feature selection based on fuzzy rough set fusion model with class variance. Pattern Recognit. 2026, 170, 112014. [Google Scholar] [CrossRef]
- Wang, C.; Wang, Y.; Deng, T.; Huang, Y. A nonlinear multi-label learning model based on tanh mapping. Eng. Appl. Artif. Intell. 2023, 126, 106837. [Google Scholar] [CrossRef]
- Han, Q.; Hu, L.; Gao, W. Integrating label confidence-based feature selection for partial multi-label learning. Pattern Recognit. 2025, 161, 111281. [Google Scholar] [CrossRef]
- Hang, J.-Y.; Zhang, M.-L. Partial multi-label learning via label-specific feature corrections. Sci. China Inf. Sci. 2025, 68(3), 132104. [Google Scholar] [CrossRef]
- Sun, Z.; Chen, Z.; Liu, J.; Chen, Y.; Yu, Y. Partial multi-label feature selection via low-rank and sparse factorization with manifold learning. Knowl.-Based Syst. 2024, 296, 111899. [Google Scholar] [CrossRef]
- Jalali, H.; Kasneci, G. Multilabel Classification for Entry-Dependent Expert Selection in Distributed Gaussian Processes. Entropy 2025, 27, 307. [Google Scholar] [CrossRef]
- Yang, F.; Jia, Y.; Liu, H.; Dong, Y.; Hou, J. Noisy label removal for partial multi-label learning. In Proc. 30th ACM SIGKDD Conf. Knowl. Discov. Data Min., 2024; pp. 3724–3735.
- Qian, W.; Tu, Y.; Huang, J.; Shu, W.; Cheung, Y.-M. Partial multilabel learning using noise-tolerant broad learning system with label enhancement and dimensionality reduction. IEEE Trans. Neural Netw. Learn. Syst. 2025, 36(2), 3758–3772. [Google Scholar] [CrossRef] [PubMed]
- Li, R. J.; Ma, Y. C.; Chen, H.; Yang, X. F.; Xing, Z. W. Coordinate descent for top-k multi-label feature selection with pseudo-label learning and manifold learning. Neurocomput. 2025, 658, 131640. [Google Scholar] [CrossRef]
- Bucak, S.S.; Jin, R.; Jain, A.K. Multi-label learning with incomplete class assignments. In Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., 2011; pp. 2801–2808.
- Yu, H.-F.; Jain, P.; Kar, P.; Dhillon, I. Large-scale multi-label learning with missing labels. In Int. Conf. Mach. Learn., PMLR, 2014; pp. 593–601.
- Chen, G.; Song, Y.; Wang, F.; Zhang, C. Semi-supervised multi-label learning by solving a sylvester equation. In Proc. 2008 SIAM Int. Conf. Data Mining, 2008; pp. 410–419.
- Jain, V.; Modhe, N.; Rai, P. Scalable generative models for multi-label learning with missing labels. In Int. Conf. Mach. Learn., PMLR, 2017; pp. 1636–1644.
- Kong, X.; Wu, Z.; Li, L.-J.; Zhang, R.; Yu, P.S.; Wu, H.; Fan, W. Large-scale multi-label learning with incomplete label assignments. In Proc. 2014 SIAM Int. Conf. Data Mining, 2014; pp. 920–928.
- Wang, C.; Wang, Y.; Deng, T.; Ding, W. Missing multi-label learning based on the fusion of two-level nonlinear mappings. Inf. Fusion 2024, 103, 102105. [Google Scholar] [CrossRef]
- Jiang, L.; Yu, G. X.; Guo, M. Z.; Wang, J. Feature selection with missing labels based on label compression and local feature correlation. Neurocomput. 2020, 395, 95–106. [Google Scholar] [CrossRef]
- Yang, H.; Zhou, J.T.; Cai, J. Improving multi-label learning with missing labels by structured semantic correlations. In Comput. Vis.–ECCV 2016: 14th Eur. Conf., Springer, 2016; pp. 835–851.
- Wu, B.; Jia, F.; Liu, W.; Ghanem, B.; Lyu, S. Multi-label learning with missing labels using mixed dependency graphs. Int. J. Comput. Vis. 2018, 126(8), 875–896. [Google Scholar] [CrossRef]
- Braytee, A.; Liu, W.; Anaissi, A.; Kennedy, P.J. Correlated multi-label classification with incomplete label space and class imbalance. ACM Trans. Intell. Syst. Technol. 2019, 10(5), 1–26. [Google Scholar] [CrossRef]
- Ma, J.; Chow, T.W. Label-specific feature selection and two-level label recovery for multi-label classification with missing labels. Neural Netw. 2019, 118, 110–126. [Google Scholar] [CrossRef]
- Yin, T.; Chen, H.; Wang, Z.; Liu, K.; Yuan, Z.; Horng, S.-J.; Li, T. Feature selection for multilabel classification with missing labels via multi-scale fusion fuzzy uncertainty measures. Pattern Recognit. 2024, 110580. [Google Scholar] [CrossRef]
- Dai, J.; Li, M.; Zhang, C. Multi-label feature selection with missing labels by weak-label fusion fuzzy discernibility pair. Inf. Fusion 2025, 117, 102921. [Google Scholar] [CrossRef]
- Sun, L.; Yin, T.; Ding, W.; Qian, Y.; Xu, J. Feature selection with missing labels using multilabel fuzzy neighborhood rough sets and maximum relevance minimum redundancy. IEEE Trans. Fuzzy Syst. 2021, 30(5), 1197–1211. [Google Scholar] [CrossRef]
- Sun, L.; Lyu, G.; Feng, S.; Huang, X. Beyond missing: Weakly-supervised multi-label learning with incomplete and noisy labels. Appl. Intell. 2021, 51(3), 1552–1564. [Google Scholar] [CrossRef]
- Ding, J.; Zhang, Y.; Jia, L.; Fu, X.; Jiang, Y. Noisy feature decomposition-based multi-label learning with missing labels. Inf. Sci. 2024, 662, 120228. [Google Scholar] [CrossRef]
- Wei, T.; Guo, L.; Li, Y.; Gao, W. Learning safe multi-label prediction for weakly labeled data. Mach. Learn. 2018, 107, 703–725. [Google Scholar] [CrossRef]
- Fang, Q.; Xiang, C.; Duan, J.; Soufiyan, B.; Shao, C.; Yang, X.; Xu, S.; Yu, H. OMAL: A Multi-Label Active Learning Approach from Data Streams. Entropy 2025, 27, 363–363. [Google Scholar] [CrossRef] [PubMed]
- Bezdek, J. C. Pattern recognition with fuzzy objective function algorithms; Springer Sci. & Bus. Media, 2013.
- Nie, F.; Xu, D.; Tsang, I.; Zhang, C. Flexible manifold embedding: A framework for semi-supervised and unsupervised dimension reduction. IEEE Trans. Image Process. 2010, 19(7), 1921–1932. [Google Scholar] [CrossRef]
- Liu, G.; Li, Q.; Yang, X.; Xing, Z.; Ma, Y. Partial multi-label feature selection based on label matrix decomposition. Neural Comput. Appl. 2025, 37(6), 4207–4227. [Google Scholar] [CrossRef]
- Xie, M.; Huang, S. Partial multi-label learning with noisy label identification. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 3676–3687. [Google Scholar]
- Yu, Z.; Zhang, M. Multi-label classification with label-specific feature generation: A wrapped approach. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 5199–5210. [Google Scholar] [CrossRef]
- Ma, Z.; Chen, C. Expand globally, shrink locally: Discriminant multilabel learning with missing labels. Pattern Recognit. 2021, 111, 107675. [Google Scholar] [CrossRef]
- Zhu, Y.; Kwok, J.; Zhou, Z. Multilabel learning with global and local label correlation. IEEE Trans. Knowl. Data Eng. 2018, 30(6), 1081–1094. [Google Scholar] [CrossRef]
- Akbarnejad, A.; Baghshah, M. An efficient semi-supervised multilabel classifier capable of handling missing labels. IEEE Trans. Knowl. Data Eng. 2019, 31(2), 229–242. [Google Scholar] [CrossRef]
- Zhang, J.; Li, S.; Jiang, M.; Tan, K. Learning from weakly labeled data based on manifold regularized sparse model. IEEE Trans. Cybern. 2022, 52, 3841–3854. [Google Scholar] [CrossRef] [PubMed]
- Friedman, M. A comparison of alternative tests of significance for the problem of m rankings. Ann. Math. Stat. 1940, 11(1), 86–92. [Google Scholar] [CrossRef]
- Demsar, J. Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 2006, 7, 1–30. [Google Scholar]
| 1 | |
| 2 | |
| 3 | |
| 4 | |
| 5 | |
| 6 | |
| 7 | |
| 8 |





| Notation | Meaning | |
|---|---|---|
| n | Instance number | |
| d | Feature size | |
| q | Label number | |
| m | Embedded feature size | |
| Feature vector | ||
| Feature matrix | ||
| Label vector | ||
| Observed label matrix, where `0’ means unknown annotation | ||
| Fuzzy similarity matrix | ||
| ith row vector of | ||
| jth column vector of | ||
| th entry of | ||
| Top k-nearest neighbors of | ||
| -norm | ||
| Frobenius norm | ||
| L2-norm | ||
| ∘ | Hadamard product (entry-wise product) | |
| ⊘ | Hadamard division (entry-wise division) | |
| Trace norm | ||
| T | Transposition | |
| Hilbert-Schmidt inner product | ||
| Diagonal matrix of vector | ||
| Predicted label matrix | ||
| Label occurrence indicator matrix | ||
| Noisy label matrix | ||
| Embedding projection matrix | ||
| Latent mapping matrix | ||
| Noisy mapping matrix |
| Data set | n | d | q | ||
|---|---|---|---|---|---|
| Birds | 645 | 260 | 19 | 1.014 | .053 |
| Business | 5000 | 438 | 30 | 1.588 | .053 |
| Bibtex | 7395 | 1836 | 159 | 2.402 | .015 |
| CAL500 | 502 | 68 | 174 | 26.044 | .150 |
| Computers | 5000 | 681 | 33 | 1.509 | .048 |
| Education | 5000 | 550 | 33 | 1.461 | .044 |
| Emotions | 593 | 72 | 6 | 1.868 | .311 |
| Entertainment | 5000 | 640 | 21 | 1.414 | .067 |
| Eron | 1702 | 1001 | 53 | 3.378 | .064 |
| Genbase | 662 | 1186 | 27 | 1.252 | .046 |
| Health | 5000 | 612 | 32 | 1.662 | .052 |
| Image | 2000 | 294 | 5 | 1.236 | .247 |
| Medical | 978 | 1449 | 45 | 1.245 | .028 |
| Reference | 5000 | 793 | 33 | 1.169 | .035 |
| Scene | 2407 | 294 | 6 | 1.074 | .179 |
| Yeast | 2417 | 103 | 14 | 4.237 | .303 |
| Methods | WPML | PML-NI | PML-VLS | PML-MAP | D2ML | Glocal | ESMC | MSWL |
|---|---|---|---|---|---|---|---|---|
| Birds | .846 | .840 | .692 | .729 | .765 | .806 | .728 | .813 |
| Business | .920 | .766 | .540 | .538 | .897 | .775 | .737 | .663 |
| Bibtex | .751 | .728 | .496 | .496 | .680 | .660 | .721 | .560 |
| CAL500 | .746 | .716 | .514 | .514 | .709 | .703 | .597 | .714 |
| Computer | .867 | .809 | .559 | .551 | .823 | .807 | .777 | .788 |
| Education | .890 | .822 | .676 | .661 | .843 | .801 | .778 | .826 |
| Emotions | .718 | .708 | .502 | .502 | .686 | .716 | .594 | .766 |
| Entertainment | .858 | .853 | .591 | .574 | .817 | .778 | .807 | .802 |
| Eron | .877 | .792 | .501 | .501 | .793 | .797 | .846 | .536 |
| Genbase | .986 | .985 | .502 | .502 | .972 | .980 | .958 | .984 |
| Health | .926 | .912 | .636 | .604 | .888 | .851 | .882 | .874 |
| Image | .793 | .781 | .790 | .786 | .743 | .797 | .600 | .762 |
| Medical | .958 | .918 | .896 | .896 | .914 | .919 | .874 | .650 |
| Reference | .851 | .874 | .626 | .620 | .868 | .857 | .834 | .857 |
| Scene | .545 | .357 | .504 | .504 | .387 | .559 | .453 | .767 |
| Yeast | .653 | .562 | .505 | .505 | .618 | .601 | .564 | .509 |
| Methods | WPML | PML-NI | PML-VLS | PML-MAP | D2ML-nl | Glocal | ESMC | MSWL |
|---|---|---|---|---|---|---|---|---|
| Birds | .121 | .126 | .156 | .137 | .192 | .157 | .235 | .152 |
| Business | .051 | .244 | .115 | .046 | .077 | .232 | .270 | .337 |
| Bibtex | .234 | .257 | .304 | .344 | .325 | .337 | .269 | .434 |
| CAL500 | .257 | .284 | .614 | .194 | .291 | .297 | .400 | .286 |
| Computer | .101 | .164 | .160 | .111 | .140 | .180 | .193 | .206 |
| Education | .104 | .165 | .093 | .108 | .150 | .180 | .209 | .157 |
| Emotions | .259 | .272 | .480 | .461 | .292 | .266 | .384 | .209 |
| Entertainment | .110 | .114 | .160 | .162 | .158 | .190 | .159 | .168 |
| Eron | .105 | .187 | .192 | .127 | .183 | .186 | .137 | .464 |
| Genbase | .003 | .004 | .009 | .140 | .012 | .007 | .022 | .004 |
| Health | .051 | .063 | .153 | .108 | .091 | .119 | .088 | .196 |
| Image | .165 | .176 | .198 | .186 | .214 | .186 | .369 | .350 |
| Medical | .037 | .070 | .140 | .141 | .074 | .070 | .116 | .178 |
| Reference | .124 | .105 | .159 | .101 | .117 | .125 | .143 | .125 |
| Scene | .426 | .629 | .427 | .498 | .593 | .417 | .531 | .209 |
| Yeast | .346 | .432 | .265 | .272 | .373 | .391 | .431 | .499 |
| Methods | WPML | PML-NI | PML-VLS | PML-MAP | D2ML-nl | Glocal | ESMC | MSWL |
|---|---|---|---|---|---|---|---|---|
| Birds | .723 | .707 | .695 | .688 | .569 | .653 | .563 | .651 |
| Business | .860 | .573 | .854 | .866 | .679 | .536 | .570 | .468 |
| Bibtex | .256 | .207 | .243 | .223 | .224 | .239 | .232 | .116 |
| CAL500 | .443 | .418 | .382 | .468 | .418 | .398 | .289 | .383 |
| Computer | .654 | .593 | .513 | .586 | .544 | .599 | .573 | .494 |
| Education | .565 | .541 | .583 | .543 | .427 | .543 | .500 | .556 |
| Emotions | .726 | .722 | .537 | .566 | .702 | .711 | .634 | .759 |
| Entertainment | .664 | .660 | .515 | .556 | .465 | .591 | .594 | .605 |
| Eron | .636 | .531 | .480 | .490 | .504 | .554 | .630 | .477 |
| Genbase | .994 | .994 | .936 | .945 | .931 | .990 | .893 | .994 |
| Health | .757 | .751 | .660 | .612 | .611 | .685 | .719 | .699 |
| Image | .795 | .779 | .777 | .782 | .735 | .554 | .595 | .763 |
| Medical | .864 | .776 | .714 | .797 | .759 | .794 | .653 | .606 |
| Reference | .666 | .658 | .555 | .595 | .558 | .660 | .617 | .662 |
| Scene | .512 | .348 | .426 | .418 | .405 | .507 | .393 | .759 |
| Yeast | .619 | .572 | .523 | .627 | .584 | .593 | .564 | .575 |
| Metric | Critical Value | |
|---|---|---|
| AUC | 49.3722 | 3.031 |
| Ranking Loss | 9.5255 | 3.031 |
| Coverage | 9.7286 | 3.031 |
| Average Precision | 11.2720 | 3.031 |
| Hamming Loss | 8.1667 | 3.031 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).