Submitted:
03 October 2024
Posted:
04 October 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Review of Past Findings
2.1. Gradient-Based Methods
2.2. Escaping Saddle Points
2.3. Non-Convex Regularization
3. Gradient-Based Methods for Machine Learning
4. Escaping Saddle Points
5. Non-Convex Regularization
6. Machine Learning Models and Non-Convex Optimization
7. Challenges and Open Problems
8. Non-Convex Optimization and Reducing Computational Costs in Machine Learning
8.1. Sparse Solutions through Non-Convex Regularization
8.2. Escaping Flat Regions and Saddle Points Efficiently
8.3. Efficient Subsampling and Approximation Techniques
8.4. Model Pruning and Compression
8.5. Efficient Search for Good Local Minima
8.6. Structured Non-Convexity in Specific Models
9. Future Research Direction
10. Conclusions
References
- Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. International Conference on Learning Representations (ICLR) 2015. [Google Scholar]
- Welling, M.; Teh, Y.W. Bayesian learning via stochastic gradient Langevin dynamics. Proceedings of the 28th International Conference on Machine Learning (ICML-11) 2011. pp. 681–688. [Google Scholar]
- Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout: A simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research 2014, 15, 1929–1958. [Google Scholar]
- Elhedhli, S.; Li, Z.; Bookbinder, J.H. Airfreight forwarding under system-wide and double discounts. EURO Journal on Transportation and Logistics 2017, 6, 165–183. [Google Scholar] [CrossRef]
- Polyak, B.T. Some methods of speeding up the convergence of iteration methods. USSR Computational Mathematics and Mathematical Physics 1964, 4, 1–17. [Google Scholar] [CrossRef]
- Duchi, J.; Hazan, E.; Singer, Y. Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research 2011, 12, 2121–2159. [Google Scholar]
- Nesterov, Y.E. A method for solving the convex programming problem with convergence rate O(1/k²). Soviet Mathematics Doklady 1983, 27, 372–376. [Google Scholar]
- Martens, J. Deep learning via Hessian-free optimization. Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010, pp. 735–742.
- Nesterov, Y.; Polyak, B.T. Cubic regularization of Newton method and its global performance. Mathematical Programming 2006, 108, 177–205. [Google Scholar] [CrossRef]
- Ge, R.; Huang, F.; Jin, C.; Yuan, Y. Escaping from saddle points–online stochastic gradient for tensor decomposition. Conference on Learning Theory, 2015, pp. 797–842.
- Xu, Z.; Chang, Y.; Xu, F.; Zhang, H. L1/2 regularization: A thresholding representation theory and a fast solver. IEEE Transactions on Neural Networks and Learning Systems 2012, 23, 1013–1027. [Google Scholar]
- Fan, J.; Li, R. Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association 2001, 96, 1348–1360. [Google Scholar] [CrossRef]
- Zhang, C.H. Nearly unbiased variable selection under minimax concave penalty. The Annals of Statistics 2010, 38, 894–942. [Google Scholar] [CrossRef]
- Collobert, R.; Sinz, F.; Weston, J.; Bottou, L. Trading convexity for scalability. Proceedings of the 23rd International Conference on Machine Learning (ICML), 2006, pp. 201–208.
- Wang, Z.; Zhu, Y.; Li, Z.; Wang, Z.; Qin, H.; Liu, X. Graph neural network recommendation system for football formation. Applied Science and Biotechnology Journal for Advanced Research 2024, 3, 33–39. [Google Scholar]
- PEIRIS, M.S. TRANSFORMATIVE INTEGRATION OF ARTIFICIAL INTELLIGENCE IN TELEMEDICINE, REMOTE HEALTHCARE, AND VIRTUAL PATIENT MONITORING: ENHANCING DIAGNOSTIC ACCURACY, PERSONALIZING CARE. International Journal of Intelligent Healthcare Analytics 2024, 104, 1019–1030. [Google Scholar]
- Changeux, A.; Montagnier, S. Strategic Decision-Making Support Using Large Language Models (LLMs). Management Journal for Advanced Research 2024, 4, 102–108. [Google Scholar]
- Alom, N.B.; others. A Comprehensive Analysis of Customer Behavior Analytics, Privacy Concerns, and Data Protection Regulations in the Era of Big Data and Machine Learning. International Journal of Applied Machine Learning and Computational Intelligence 2024, 14, 21–40. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).