Submitted:
30 June 2025
Posted:
02 July 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
3. Heuristic Techniques
- The lattice is defined by:
- For the 2D square lattice, the two encodings are denoted as follows:
- string – for absolute encoding, where R stands for right, L for left, U for up, and D for down;
- string – for relative encoding, where S stands for straight, R for right, and L for left.
- For the 3D cubic lattice, the two encodings are denoted as follows:
- string – for absolute encoding, where R stands for right, L for left, U for up, D for down, F for front, and B for back;
- string – for relative encoding, where S stands for straight, R for right, L for left, F for front, and B for back.
4. Optimisation Algorithms
4.1. Monte Carlo Simulation


4.2. Simulated Annealing


4.3. ACGA Algorithm
- Population Initialization: Firstly, a population of potential solutions is created. Each solution, referred to as a chromosome, is randomly generated, and it has an associated objective function called fitness.
- Exploration Stage: The mutation and crossover operators are applied to a certain percentage of the chromosomes in the population, which are chosen randomly. These operators ensure the dispersion of the population in the space of possible solutions, promoting exploration of the solution space.
- Through the selection operation, from a percentage of the individuals of the population, those with the best fitness are selected. In this way, a new population is created, usually statistically better, and this represents the next generation.
- Exploitation Stage: Through the selection operation, a certain percentage of individuals with the best fitness are chosen from the population. This process creates a new population, which is usually statistically better and represents the next generation of potential solutions. The selection operation helps exploit the combinatorial space by favoring the fitter individuals for reproduction.
- string
- string – absolute 2D square
- string – relative 2D square
- string –absolute 3D cubic
- string – relative 3D cubic
- Set i = 1. Initialize SRL[i] = ’S’.
- i = i + 1. if continue with the next step. Otherwise, the conformation is completely generated in the string.
- Choose a random direction ’d’ from the {S,R,L}.
- For the 2D square lattice:
- If ,
- For the 3D cubic lattice:
- If ,
- For 2D square lattice, if : ;
- For 3D cubic lattice, if :

| Algorithm 1 All Conformations Genetic Algorithm (ACGA) |
|

5. Results Analysis
| ID Seq | Length | Optimal 2D | ACGA 2D | Optimal 3D | ACGA 3D |
|---|---|---|---|---|---|
| 1 | 20 | -9 | -9 | -11 | -11 |
| 2 | 24 | -9 | -9 | -13 | -13 |
| 3 | 25 | -8 | -8 | -9 | -9 |
| 4 | 36 | -14 | -14 | -18 | -18 |
| 5 | 48 | -23 | -22 | -31 | -31 |
| 6 | 50 | -21 | -21 | -34 | -31 |
| 7 | 60 | -36 | -35 | -55 | -49 |
| 8 | 64 | -42 | -38 | -59 | -49 |
| 9 | 85 | -53 | -48 | - | -73 |
| ID Seq | Optimal conformations on 2D square HP model |
|---|---|
| 1 | ULLDRDLDLDRRURDRUUL |
| 2 | LULDLDRDRURRRDRURULULDL |
| 3 | RULURUUURDDRURDDLDRDLLUU |
| 4 | LDLDRDLDDLLURULLURRUULDLUULDDDLDRDD |
| 5 | RURDRURDRRRDLDLULDLULDLULDLDRRRDLDRRRRURULLLDR |
| 6 | RDLLUURURDRURDRDLLDDLLDLLLLLLUULURRURDDLDRRURUULD |
| 7 | URDDDRUURDDDLDRRUUUUULLURRURRRRDLDLULDDRRRURDDLLLLDRRDLLDRR |
| 8 | RDLLULDLULDLUURURDRURDRRRDRDLDRDLDRDLDLULDLULDDLUUURRRRULLLURRR |
| 9 | RRRRULLLURULLDDLULDLLDLUUURDRURRURRUUULDDLLDLULDLLDLUURRURRRRULLLLLLURURDRURDRURRDL |
| ID Seq | Optimal conformations on 3D square HP model |
|---|---|
| 1 | DLLURUFDLDRRUUURDBL |
| 2 | RRULUBDRDLDLUBUFULDLFRR |
| 3 | RBRULUURBLDDRDLLUULLFRRD |
| 4 | FLLLDRFLLLURURDRFDLULURRURDBLBULDLL |
| 5 | DRDRURRULURULLLDDRUFRDLDRFFLLURBRRULLULDLBRDFDB |
| 6 | RDLLULFDRURDRURULLUBDLLUFRDLFDRUURDRDLDLFRRUULLDR |
| 7 | RDLFRDLDRRURDDBLURULLLDRDLLUUUBRRDLDRRRULBDRDLLUUULDDDFRDLD |
| 8 | BRRULLFRDRURDDLLFURRULLULDLDRDDFURDRURULLLURRRULLFFDRDLDRBUULDD |
| 9 | FFLDLBRRDRRUULDBURDDDRFLFULLURURDRUBDRFFLULDLDLLURULURRDFRDLDLUULDDLFRULURRURDDRDLLU |

- 2D Grid View – Classic square lattice with H and P residues colour-coded.
- 3D Grid View – Cubic lattice with interactive 3D navigation.
- Ribbon View – Stylised representation highlighting the protein backbone.
- Space-filling View – Van der Waals surface model showing spatial occupation.
- Surface View – Solvent-accessible surface representation.
- MC achieved the best energy result () and demonstrated dynamic search behaviour with strong exploratory capability. However, due to its fixed temperature, it is prone to high fluctuations and occasional instability.
- SA displayed smoother convergence, better stability, and was more resilient to local optima thanks to its cooling schedule. Nonetheless, it resulted in a slightly higher final energy (), indicating slower convergence in practice.
- GA was effective at exploring diverse conformations and maintaining population diversity. Its performance depends strongly on the balance between exploration and exploitation parameters, such as mutation rate and selection pressure.
6. Discussion
7. Conclusion
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Baynes, J.W.; Dominiczak, M.H. Medical Biochemistry FIFTH EDITION; Elsevier, 2019. fifth edition.
- Felix-Saul, J.C.; García-Valdez, M.; Merelo Guervós, J.J.; Castillo, O. Extending Genetic Algorithms with Biological Life-Cycle Dynamics. Biomimetics 2024, 9. [CrossRef]
- Rashid, M.; Newton, M.A.H.; Hoque, M.; Sattar, A. Mixing Energy Models in Genetic Algorithms for On-Lattice Protein Structure Prediction. BioMed research international 2013, 2013, 924137. [CrossRef]
- Rashid, M.A.; Iqbal, S.; Khatib, F.; Hoque, M.T.; Sattar, A. Guided macro-mutation in a graded energy based genetic algorithm for protein structure prediction. Computational Biology and Chemistry 2016, 61, 162–177. [CrossRef]
- Hardy, J.; Selkoe, D.J. Amyloid hypothesis of Alzheimer’s disease: progress and problems on the road to therapeutics. Science 2002, 297, 353–356.
- Leaver-Fay, A.; .; et al. Rosetta3: an object-oriented software suite for the simulation and design of macromolecules. Methods in enzymology 2011, 487, 545–574.
- Abraham, M.J.; et al. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 2015, 1, 19–25.
- Anfinsen, C.B. Principles that govern the folding of protein chains. Science 1973, 181, 223–230.
- Unger, R.; Moult, J. Genetic algorithms for protein folding simulations. Journal of molecular biology 1993, 231 1, 75–81.
- Custódio, F.; Barbosa, H.; Dardenne, L. Investigation of the three-dimensional lattice HP protein folding model using a genetic algorithm. Genetics and Molecular Biology - GENET MOL BIOL 2004, 27. [CrossRef]
- Jiang, T.; Cui, Q.; Shi, G.; Ma, S. Protein folding simulations of the hydrophobic–hydrophilic model by combining tabu search with genetic algorithms. The Journal of Chemical Physics 2003, 119, 4592–4596. [CrossRef]
- Cox, G.; Mortimer-Jones, T.; Taylor, R.; Johnston, R. Development and optimisation of a novel genetic algorithm for studying model protein folding. Theoretical Chemistry Accounts 2004, 112, 163–178. [CrossRef]
- Cox, G.; Johnston, R. Analyzing energy landscapes for folding model proteins. The Journal of chemical physics 2006, 124, 204714. [CrossRef]
- Liang, F.; Wong, W.H. Evolutionary Monte Carlo for protein folding simulations. The Journal of Chemical Physics 2001, 115, 3374–3380. [CrossRef]
- Zhou, C.; et al. Enhanced hybrid search algorithm for protein structure prediction using the 3D-HP lattice model. Journal of Molecular Modeling 2013, 19, 3883–3891. [CrossRef]
- Huang, C.; Yang, X.; He, Z. Protein folding simulations of 2D HP model by the genetic algorithm based on optimal secondary structures. Computational biology and chemistry 2010, 34, 137–42. [CrossRef]
- Benitez, C.; Parpinelli, R.; Lopes, H. Parallelism, hybridism and coevolution in a multi-level ABC-GA approach for the protein structure prediction problem. Concurrency and Computation: Practice and Experience 2011, 24, 635 – 646. [CrossRef]
- Rashid, M.; Newton, M.A.H.; Hoque, M.; Sattar, A. A local search embedded genetic algorithm for simplified protein structure prediction. In Proceedings of the 2013 IEEE Congress on Evolutionary Computation, CEC 2013, 06 2013. [CrossRef]
- Rashid, M.A.; Newton, M.H.; Hoque, M.T.; Sattar, A. Collaborative Parallel Local Search for Simplified Protein Structure Prediction. In Proceedings of the 2013 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, 07 2013, pp. 966–973. [CrossRef]
- Dill, K.A. Theory for the folding and stability of globular proteins. Biochemistry 1985, 24, 1501.
- Lau, K.F.; Dill, K.A. A lattice statistical mechanics model of the conformational and sequence spaces of proteins. Macromolecules 1989, 22, 3986–3997. [CrossRef]
- Boumedine, N.; Bouroubi, S. A new hybrid genetic algorithm for protein structure prediction on the 2D triangular lattice. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES 2021, 29, 499–513. [CrossRef]
- Mazidi, A.; Roshanfar, F. PSPGA: A New Method for Protein Structure Prediction based on Genetic Algorithm. Journal of Applied Dynamic Systems and Control 2020, 3, 9–16.
- Rezaei, M.; Kheyrandish, M.; Mosleh, M. A novel algorithm based on a modified PSO to predict 3D structure for proteins in HP model using Transfer Learning. Expert Systems with Applications 2024, 235, 121233. [CrossRef]
- S., G.; E.R., V. Protein Secondary Structure Prediction Using Cascaded Feature Learning Model. Applied Soft Computing 2023, 140, 110242. [CrossRef]
- Berger, B.; Leighton, T. Protein Folding in the Hydrophobic-Hydrophilic (HP) Model is NP-Complete. Journal of Computational Biology 1998, 5, 27–40. [CrossRef]
- Burley, S.K.; et al. RCSB Protein Data Bank (RCSB.org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning. Nucleic Acids Research 2023, 51, D488–D508. [CrossRef]
- Chira, C.; Horvath, D.; Dumitrescu, D. Hill-Climbing search and diversification within an evolutionary approach to protein structure prediction. BioData mining 2011, 4, 23. [CrossRef]
- Rotar, C. An Evolutionary Technique for Multicriterial Optimization Based on Endocrine Paradigm. 06 2004, Vol. 3103, pp. 414–415. [CrossRef]
- Bahi, J.M.; Cote, N.; Guyeux, C. Chaos of Protein Folding. Neural Networks (IJCNN) 2011, pp. 1948–1954.
- Bahi, J.M.; Côté, N.; Guyeux, C.; Salomon, M. Protein Folding in the 2D Hydrophobic–Hydrophilic (HP) Square Lattice Model is Chaotic. Cognitive Computation 2011, 4, 98–114.
- Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Zidek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589.
- Callaway, E. What’s next for AlphaFold and the AI protein-folding revolution. Nature 2022, 604, 234–238.
- Krishna, R.; et al. Generalized biomolecular modeling and design with RoseTTAFold All-Atom. Science 2024, 384, eadl2528. [CrossRef]
- Baek, M.; DiMaio, F.; Anishchenko, I.; Dauparas, J.; Ovchinnikov, S.; Lee, G.R.; Wang, J.; Cong, Q.; Kinch, L.N.; Schaeffer, R.D.; et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science 2021, 373, 871–876.
- Cruz-Fernández, M.; et al. Model Parametrization-Based Genetic Algorithms Using Velocity Signal and Steady State of the Dynamic Response of a Motor. Biomimetics 2025, 10. [CrossRef]
- Crescenzi, P.; Goldman, D.; Papadimitriou, C.; Piccolboni, A.; Yannakakis, M. On the Complexity of Protein Folding. Journal of Computational Biology 1998, 5, 423–465. [CrossRef]
- Zhang Hongjin, W.H. Enhanced genetic algorithm for indoor low-illumination stereo matching energy function optimization. Alexandria Engineering Journal 2025, 121, 1–17. [CrossRef]
- Cristea, D. Hybrid Combinatorial Problems Used for Multimodal Optimisation. In Proceedings of the 2024 IEEE 24th International Conference on Bioinformatics and Bioengineering (BIBE), 2024, pp. 1–8.
- Sauer, R.T. Protein folding from a combinatorial perspective. Folding and Design 1996, 1, R27 – R30. [CrossRef]
- Metropolis, N.; Rosenbluth, A.W.; Rosenbluth, M.N.; Teller, A.H.; Teller, E. Equation of state calculations by fast computing machines. The journal of chemical physics 1953, 21, 1087–1092.
- Kirkpatrick, S.; Gelatt, C.; Vecchi, M. Optimization by simulated annealing. Science 1983, 220, 671–680.
- Sima, I.; Parv, B. Protein Folding Simulation Using Combinatorial Whale Optimization Algorithm. In Proceedings of the 2019 21st International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), 2019, pp. 159–166.
- Mann, M.; Backofen, R.; Will, S. Faster exact protein structure prediction with the CPSP approach. Constraints 2005, 10, 5–30.
- Milde, F.; Giegerich, R.; Will, S. CPSP-tools–Exact and complete algorithms for high-throughput 3D lattice protein studies. In Proceedings of the Bioinformatics Research and Applications. Springer, 2010, pp. 21–32.

| ID | No of | Sequence |
|---|---|---|
| Seq | AA | |
| 1 | 20 | HPHP PHHP HPPH PHHP PHPH |
| 2 | 24 | HHPP HPPH PPHP PHPP HPPH PPHH |
| 3 | 25 | PPHP PHHP PPPH HPPP PHHP PPPH H |
| 4 | 36 | PPPH HPPH HPPP PPHH HHHH HPPH HPPP PHHP PHPP |
| 5 | 48 | PPHP PHHP PHHP PPPP HHHH HHHH HHPP PPPP HHPP HHPP HPPH HHHH |
| 6 | 50 | HHPH PHPH PHHH HPHP PPHP PPHP PPPH PPPH PPPH PHHH HPHP HPHP HH |
| 7 | 60 | PPHH HPHH HHHH HHPP PHHH HHHH HHHP HPPP HHHH HHHH HHHH PPPP HHHH HHPH HPHP |
| 8 | 64 | HHHH HHHH HHHH PHPH PPHH PPHH PPHP PHHP PHHP PHPP HHPP HHPP HPHP HHHH HHHH HHHH |
| 9 | 85 | HHHH PPPP HHHH HHHH HHHH PPPP PPHH HHHH HHHH HHPP PHHH HHHH HHHH HPPP HHHH |
| HHHH HHHH PPPH PPHH PPHH PPHPH |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).