Submitted:
06 March 2026
Posted:
06 March 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Sample and Study Environment Description
2.2. Experimental Design and Control Settings
2.3. Measurement Methods and Quality Control
2.4. Data Processing and Model Formulation
2.5. Implementation and Training Procedure
3. Results and Discussion
3.1. Task Success under Multi-Budget Constraints
3.2. Cost Efficiency and Budget Violation Patterns
3.3. Tail-Failure Reduction and CVaR0.1_{0.1}0.1 Performance
3.4. Ablation Analysis and Relation to Existing Methods
4. Conclusions
References
- Qiu, Y., & Wang, J. (2023, October). A machine learning approach to credit card customer segmentation for economic stability. In Proceedings of the 4th International Conference on Economic Management and Big Data Applications, ICEMBDA (pp. 27-29).
- Chezelles, D.; Le Sellier, T.; Shayegan, S. O.; Jang, L. K.; Lù, X. H.; Yoran, O.; Lacoste, A. The browsergym ecosystem for web agent research. arXiv 2024, arXiv:2412.05467. [Google Scholar] [CrossRef]
- Ma, Q.; Yue, L.; Xu, S.; Shi, Y.; Liu, H. Web Agent Agentic Reinforcement Learning Decision Model Under Multi-Cost and Failure Risk Constraints. 2026. [Google Scholar] [CrossRef]
- Peddinti, S.R.; Katragadda, S.R.; Pandey, B.K.; Tanikonda, A. Utilizing large language models for advanced service management: potential applications and operational challenges. Journal of Science & Technology 2023, 4(2). [Google Scholar]
- Zhu, W.; Yao, Y.; Yang, J. Real-Time Risk Control Effects of Digital Compliance Dashboards: An Empirical Study Across Multiple Enterprises Using Process Mining, Anomaly Detection, and Interrupt Time Series. 2025. [Google Scholar] [PubMed]
- Li, T.; Xia, J.; Liu, S.; Jiang, Y. Digital Transformation of Human Resources: From Consulting Frameworks to AI-Enabled Learning Management Systems. 2025. [Google Scholar] [CrossRef]
- Kushwaha, A.; Ravish, K.; Lamba, P.; Kumar, P. A survey of safe reinforcement learning and constrained mdps: A technical survey on single-agent and multi-agent safety. arXiv 2025, arXiv:2505.17342. [Google Scholar]
- Gu, X.; Liu, M.; Yang, J. Application and Effectiveness Evaluation of Federated Learning Methods in Anti-Money Laundering Collaborative Modeling Across Inter-Institutional Transaction Networks. 2025. [Google Scholar]
- Lagaros, N. D.; Kournoutos, M.; Kallioras, N. A.; Nordas, A. N. Constraint handling techniques for metaheuristics: a state-of-the-art review and new variants. Optimization and Engineering 2023, 24(4), 2251–2298. [Google Scholar] [CrossRef]
- Gu, X.; Yang, J.; Liu, M. Research on a Green Money Laundering Identification Framework and Risk Monitoring Mechanism Integrating Artificial Intelligence and Environmental Governance Data. 2025. [Google Scholar] [CrossRef]
- Sener, N. Risk-Averse Green Hub Location Under Multi-Source Uncertainty: A CVaR-Based Model With Scenario Reduction. IEEE Access 2026, 14, 26621–26634. [Google Scholar] [CrossRef]
- Cai, B., Bai, W., Lu, Y., & Lu, K. (2024, June). Fuzz like a Pro: Using Auditor Knowledge to Detect Financial Vulnerabilities in Smart Contracts. In 2024 International Conference on Meta Computing (ICMC) (pp. 230-240). IEEE.
- Yaseen, M., Nizami, I. F., Aldajani, M. B., Raja, A. A., Haroon, F., & Abbas, Q. (2026). Resilient Constraint Energy Management for Microgrids: Integrating Wasserstein DRO and CVaR-Constrained MPC Under Renewable Uncertainty. IEEE Access.
- Wang, Y.; Feng, Y.; Fang, Y.; Zhang, S.; Jing, T.; Li, J.; Xu, R. HERO: Hierarchical Traversable 3D Scene Graphs for Embodied Navigation Among Movable Obstacles. arXiv 2025, arXiv:2512.15047. [Google Scholar] [CrossRef]
- Cai, Z.; Qiu, H.; Zhao, H.; Wan, K.; Li, J.; Gu, J.; Hu, J. From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models. arXiv 2025, arXiv:2510.17247. [Google Scholar] [CrossRef]
- Ashqar, H. I. (2025, July). A Critical Review of Benchmarking LLMs for Real-World Applications: Trends and Limitations. In 2025 Sixteenth International Conference on Ubiquitous and Future Networks (ICUFN) (pp. 344-346). IEEE.
- Dong, H.; Zhang, P.; Lu, M.; Shen, Y.; Ke, G. MachineLearningLM: Scaling Many-shot In-context Learning via Continued Pretraining. arXiv 2025, arXiv:2509.06806. [Google Scholar]
- Dolon, M.S.A. Deployment and performance evaluation of hybrid machine learning models for stock price forecasting and risk prediction in volatile markets. American Journal of Scholarly Research and Innovation 2025, 4, 287–319. [Google Scholar] [CrossRef]
- Liu, S., Feng, H., & Liu, X. (2025). A Study on the Mechanism of Generative Design Tools' Impact on Visual Language Reconstruction: An Interactive Analysis of Semantic Mapping and User Cognition. Authorea Preprints.
- Du, Y. Research on Deep Learning Models for Forecasting Cross-Border Trade Demand Driven by Multi-Source Time-Series Data. Journal of Science, Innovation & Social Impact 2025, 1, 63–70. [Google Scholar]
- Srivastava, K. K. (2025). S3: Stable Subgoal Selection by Constraining Uncertainty of Coarse Dynamics in Hierarchical Reinforcement Learning (Master's thesis, University of Massachusetts Lowell).
- Mao, Y.; Ma, X.; Li, J. Research on API Security Gateway and Data Access Control Model for Multi-Tenant Full-Stack Systems. 2025. [Google Scholar]
- Farooq, A., Raza, S., Karim, M. N., Iqbal, H., Vasilakos, A. V., & Emmanouilidis, C. (2025). Evaluating and regulating agentic ai: A study of benchmarks, metrics, and regulation. Metrics, and Regulation.
- Zhu, W., Yang, J., & Yao, Y. (2025, October). How Compliance Maturity Translates to Risk Reduction: A Multi-Case Comparison of Global Operations Using fsQCA and Hierarchical Bayesian Methods. In Proceedings of the 2025 2nd International Conference on Digital Economy and Computer Science (pp. 672-676).
- Akshathala, S.; Adnan, B.; Ramesh, M.; Vaidhyanathan, K.; Muhammed, B.; Parthasarathy, K. Beyond Task Completion: An Assessment Framework for Evaluating Agentic AI Systems. arXiv 2025, arXiv:2512.12791. [Google Scholar] [CrossRef]
- Li, T.; Xia, J.; Liu, S.; Hong, E. Strategic Human Resource Leadership in Global Biopharmaceutical Enterprises: Integrating HR Analytics and Cross-Cultural. 2025. [Google Scholar] [CrossRef]
- Borjigin, A.; He, C. Safe and Compliant Cross-Market Trade Execution via Constrained RL and Zero-Knowledge Audits. arXiv 2025, arXiv:2510.04952. [Google Scholar]
- Mao, Y.; Ma, X.; Li, J. Research on Web System Anomaly Detection and Intelligent Operations Based on Log Modeling and Self-Supervised Learning. 2025. [Google Scholar] [CrossRef]


Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).