Submitted:
24 May 2024
Posted:
11 June 2024
Read the latest preprint version here
Abstract
Keywords:
MSC: 65Y05; 68W10; 65-04
1. Introduction
2. Parallel Algorithm






3. Description of the Full Version of the Software Implementation of the Parallel Algorithm (Using the MPI-4 Standard)
- Before the main loop while, it is necessary to generate persistent communication requests using MPI routines of the form request[] = Bcast_init() for all routines of collective communications of processes Bcast(), Scatterv(), Reduce() and Gatherv(), which are called multiple times with the same set of arguments.
- Replace MPI routine calls with a sequence of function calls Start(request[]) “+” Wait(request[]).










4. Efficiency and Scalability of Software Implementation of the Proposed Parallel Algorithm
5. Discussion
- Some minimization functions use the BLAS [32], LAPACK [33], Intel MKL etc. libraries. In this case, they can use all the cores of a multi-core processor. As a result, the efficiency of parallelization within a single computing node can drop. When using “core”-parallelization using MPI, it is recommended to disable parallelism within the libraries.
- The operation of sending a “flag” can be combined with sending data for calculations. We did not do this to avoid unnecessary complexity of the algorithm.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nature Methods 2020, 17, 261–272. [Google Scholar] [CrossRef]
- Nelder, J.A.; Mead, R. A Simplex Method for Function Minimization. The Computer Journal 1965, 7, 308–313. [Google Scholar] [CrossRef]
- Powell, M.J.D. An efficient method for finding the minimum of a function of several variables without calculating derivatives. The Computer Journal 1964, 7, 155–162. [Google Scholar] [CrossRef]
- Hestenes, M.R.; Stiefel, E. Methods of conjugate gradients for solving linear systems. J Res NIST 1952, 49, 409–436. [Google Scholar] [CrossRef]
- Broyden, C. A new double-rank minimisation algorithm. Preliminary report. American Mathematical Society, Notices 1969, 16, 670. [Google Scholar]
- Fletcher, R. A new approach to variable metric algorithms. The computer journal 1970, 13, 317–322. [Google Scholar] [CrossRef]
- Goldfarb, D. A family of variable-metric methods derived by variational means. Mathematics of Computation 1970, 24, 23–26. [Google Scholar] [CrossRef]
- Shanno, D.F. Conditioning of Quasi-Newton Methods for Function Minimization. j-MATH-COMPUT 1970, 24, 647–656. [Google Scholar] [CrossRef]
- Dembo, R.S.; Steihaug, T. Truncated-Newton algorithms for large-scale unconstrained optimization. Mathematical Programming 1983, 26, 190–212. [Google Scholar] [CrossRef]
- Byrd, R.H.; Lu, P.; Nocedal, J.; Zhu, C. A Limited Memory Algorithm for Bound Constrained Optimization. SIAM Journal on Scientific Computing 1995, 16, 1190–1208. [Google Scholar] [CrossRef]
- Nash, S.G. Newton-Type Minimization via the Lanczos Method. SIAM Journal on Numerical Analysis 1984, 21, 770–788. [Google Scholar] [CrossRef]
- Powell, M.J.D. A Direct Search Optimization Method That Models the Objective and Constraint Functions by Linear Interpolation. In Advances in Optimization and Numerical Analysis; Gomez, S., Hennart, J.P., Eds.; Springer Netherlands, 1994; pp. 51–67. [Google Scholar] [CrossRef]
- Kraft, D. A Software Package for Sequential Quadratic Programming; Deutsche Forschungs- und Versuchsanstalt für Luft- und Raumfahrt Köln: Forschungsbericht, Wiss. Berichtswesen d. DFVLR, 1988. [Google Scholar]
- Byrd, R.H.; Hribar, M.E.; Nocedal, J. An Interior Point Algorithm for Large-Scale Nonlinear Programming. SIAM Journal on Optimization 1999, 9, 877–900. [Google Scholar] [CrossRef]
- Lalee, M.; Nocedal, J.; Plantenga, T. On the Implementation of an Algorithm for Large-Scale Equality Constrained Optimization. SIAM Journal on Optimization 1998, 8, 682–706. [Google Scholar] [CrossRef]
- A hybrid method for nonlinear equations. In Numerical methods for nonlinear algebraic equations; Gordon and Breach, 1970; pp. 87–114.
- Large-Scale Unconstrained Optimization. Numerical Optimization; Springer New York: New York, NY, 2006; pp. 164–192. [Google Scholar] [CrossRef]
- Lenders, F.; Kirches, C.; Potschka, A. trlib: a vector-free implementation of the GLTR method for iterative solution of the trust region problem. Optimization Methods and Software 2018, 33, 420–449. [Google Scholar] [CrossRef]
- Gould, N.I.M.; Lucidi, S.; Roma, M.; Toint, P.L. Solving the Trust-Region Subproblem using the Lanczos Method. SIAM Journal on Optimization 1999, 9, 504–525. [Google Scholar] [CrossRef]
- Conn, A.R.; Gould, N.I.M.; Toint, P.L. Trust Region Methods; Society for Industrial and Applied Mathematics, 2000; pp. 169–200. [CrossRef]
- Wales, D.J.; Doye, J.P.K. Global Optimization by Basin-Hopping and the Lowest Energy Structures of Lennard-Jones Clusters Containing up to 110 Atoms. The Journal of Physical Chemistry A 1997, 101, 5111–5116. [Google Scholar] [CrossRef]
- Johansson, R. Optimization. In Numerical Python: A Practical Techniques Approach for Industry; Apress: Berkeley, CA, USA, 2015; pp. 147–168. [Google Scholar] [CrossRef]
- Storn, R.; Price, K. Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces. Journal of Global Optimization 1997, 11, 341–359. [Google Scholar] [CrossRef]
- Endres, S.; Sandrock, C.; Focke, W. A simplicial homology algorithm for Lipschitz optimisation. Journal of Global Optimization 2018, 72, 181–217. [Google Scholar] [CrossRef]
- Xiang, Y.; Gubian, S.; Suomela, B.; Hoeng, J. Generalized Simulated Annealing for Global Optimization: The GenSA Package. The R Journal 2013, 5, 13–28. [Google Scholar] [CrossRef]
- Jones, D.R.; Perttunen, C.D.; Stuckman, B.E. Lipschitzian optimization without the Lipschitz constant. Journal of Optimization Theory and Applications 1993, 79, 157–181. [Google Scholar] [CrossRef]
- Gablonsky, J.M.; Kelley, C.T. A Locally-Biased form of the DIRECT Algorithm. Journal of Global Optimization 2001, 21, 27–37. [Google Scholar] [CrossRef]
- Dalcin, L.; Fang, Y.L.L. mpi4py: Status Update After 12 Years of Development. Computing in Science & Engineering 2021, 23, 47–54. [Google Scholar] [CrossRef]
- Rogowski, M.; Aseeri, S.; Keyes, D.; Dalcin, L. mpi4py.futures: MPI-Based Asynchronous Task Execution for Python. IEEE Transactions on Parallel and Distributed Systems 2023, 34, 611–622. [Google Scholar] [CrossRef]
- Voevodin, V.; Antonov, A.; Nikitenko, D.; Shvets, P.; Sobolev, S.; Sidorov, I.; Stefanov, K.; Voevodin, V.; Zhumatiy, S. Supercomputer Lomonosov-2: Large Scale, Deep Monitoring and Fine Analytics for the User Community. Supercomputing Frontiers and Innovations 2019, 6, 4–11. [Google Scholar] [CrossRef]
- Lukyanenko, D. Parallel algorithm for solving overdetermined systems of linear equations, taking into account round-off errors. Algorithms 2023, 16, 242. [Google Scholar] [CrossRef]
- Blackford, L.S.; Petitet, A.; Pozo, R.; Remington, K.; Whaley, R.C.; Demmel, J.; Dongarra, J.; Duff, I.; Hammarling, S.; Henry, G.; et al. An updated set of basic linear algebra subprograms (BLAS). ACM Transactions on Mathematical Software 2002, 28, 135–151. [Google Scholar]
- Anderson, E.; Bai, Z.; Bischof, C.; Blackford, S.; Demmel, J.; Dongarra, J.; Du Croz, J.; Greenbaum, A.; Hammarling, S.; McKenney, A.; Sorensen, D. LAPACK Users’ Guide, 3rd ed.; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 1999. [Google Scholar]





Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).