Submitted:
25 June 2025
Posted:
30 June 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
Experimental conditions
Downstream processing of Qβ VLPs
Analytical measurements
Dataset for hybrid model training
Hybrid process model structure and training
Hybrid model architecture screening
Hybrid model transfer learning
Model Predictive Control (MPC) architecture
3. Results
3.1. Optimal hybrid model architecture screening

3.2. Adapting the hybrid model to Qβ dataset with Transfer Learning
3.3. Hybrid MPC experimental validation
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A

References
- Bailey, J.E. Mathematical Modeling and Analysis in Biochemical Engineering: Past Accomplishments and Future Opportunities. Biotechnol Prog 1998, 14, 8–20. [Google Scholar] [CrossRef] [PubMed]
- Noll, P.; Henkel, M. History and Evolution of Modeling in Biotechnology: Modeling & Simulation, Application and Hardware Performance. Comput Struct Biotechnol J 2020, 18, 3309–3323. [Google Scholar] [CrossRef]
- Heinemann, M.; Panke, S. Synthetic Biology - Putting Engineering into Biology. Bioinformatics 2006, 22, 2790–2799. [Google Scholar] [CrossRef] [PubMed]
- Lee, J.; Kao, H.-A.; Bagheri, B. Recent Advances and Trends of Cyber-Physical Systems and Big Data Analytics in Industrial Informatics. Proceeding of Int. Conference on Industrial Informatics (INDIN). Porto Alegre, Brazil, July 2014. [CrossRef]
- Agharafeie, R.; Ramos, J.R.C.; Mendes, J.M.; Oliveira, R. From Shallow to Deep Bioprocess Hybrid Modeling: Advances and Future Perspectives. Fermentation 2023, 9, 1–22. [Google Scholar] [CrossRef]
- Carbonell, P.; Radivojevic, T.; García Martín, H. Opportunities at the Intersection of Synthetic Biology, Machine Learning, and Automation. ACS Synth Biol 2019, 8, 1474–1477. [Google Scholar] [CrossRef] [PubMed]
- Glassey, J.; Gernaey, K. V.; Clemens, C.; Schulz, T.W.; Oliveira, R.; Striedner, G.; Mandenius, C.F. Process Analytical Technology (PAT) for Biopharmaceuticals. Biotechnol J 2011, 6, 369–377. [Google Scholar] [CrossRef]
- Sun, Y.; Nathan-Roberts, W.; Pham, T.D.; Otte, E.; Aickelin, U. Multi-Fidelity Gaussian Process for Biomanufacturing Process Modeling with Small Data. arXiv 2022, arXiv:2211.14493. [Google Scholar]
- Pinto, J.; Ramos, J.R.C.; Costa, R.S.; Oliveira, R. Hybrid Deep Modeling of a GS115 (Mut+) Pichia Pastoris Culture with State–Space Reduction. Fermentation 2023, 9. [Google Scholar] [CrossRef]
- Pinto, J.; Mestre, M.; Ramos, J.; Costa, R.S.; Striedner, G.; Oliveira, R. A General Deep Hybrid Model for Bioreactor Systems: Combining First Principles with Deep Neural Networks. Comput Chem Eng 2022, 165, 107952. [Google Scholar] [CrossRef]
- Ramos, J.R.C.; Pinto, J.; Poiares-Oliveira, G.; Peeters, L.; Dumas, P.; Oliveira, R. Deep Hybrid Modeling of a HEK293 Process: Combining Long Short-Term Memory Networks with First Principles Equations. Biotechnol Bioeng 2024, 121, 1554–1568. [Google Scholar] [CrossRef]
- Ferreira, A.R.; Dias, J.M.L.; Von Stosch, M.; Clemente, J.; Cunha, A.E.; Oliveira, R. Fast Development of Pichia Pastoris GS115 Mut+ Cultures Employing Batch-to-Batch Control and Hybrid Semi-Parametric Modeling. Bioprocess Biosyst Eng 2014, 37, 629–639. [Google Scholar] [CrossRef] [PubMed]
- Narayanan, H.; von Stosch, M.; Feidl, F.; Sokolov, M.; Morbidelli, M.; Butté, A. Hybrid Modeling for Biopharmaceutical Processes: Advantages, Opportunities, and Implementation. Front chem eng 2023, 5. [Google Scholar] [CrossRef]
- Schweidtmann, A.M.; Zhang, D.; von Stosch, M. A Review and Perspective on Hybrid Modeling Methodologies. Digit Chem Eng 2024, 10, 100136. [Google Scholar] [CrossRef]
- von Stosch, M.; Oliveira, R.; Peres, J.; Feyo de Azevedo, S. Hybrid Semi-Parametric Modeling in Process Systems Engineering: Past, Present and Future. Comput Chem Eng 2014, 60, 86–101. [Google Scholar] [CrossRef]
- Psichogios, D.C.; Ungar, L.H. A Hybrid Neural Network-First Principles Approach to Process Modeling. AIChE J. 1992, 38, 1499–1511. [Google Scholar] [CrossRef]
- von Stosch, M.; Davy, S.; Francois, K.; Galvanauskas, V.; Hamelink, J.M.; Luebbert, A.; Mayer, M.; Oliveira, R.; O’Kennedy, R.; Rice, P.; et al. Hybrid Modeling for Quality by Design and PAT-Benefits and Challenges of Applications in Biopharmaceutical Industry. Biotechnol J 2014, 9, 719–726. [Google Scholar] [CrossRef]
- Mhaskar, H.N.; Poggio, T. Deep vs. Shallow Networks: An Approximation Theory Perspective. Anal Appl 2016, 14, 829–848. [Google Scholar] [CrossRef]
- Lecun, Y.; Bengio, Y.; Hinton, G. Deep Learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
- Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
- Helleckes, L.M.; Hemmerich, J.; Wiechert, W.; von Lieres, E.; Grünberger, A. Machine Learning in Bioprocess Development: From Promise to Practice. Trends Biotechnol 2023, 41, 817–835. [Google Scholar] [CrossRef]
- Alzubaidi, L.; Bai, J.; Al-Sabaawi, A.; Santamaría, J.; Albahri, A.S.; Al-dabbagh, B.S.N.; Fadhel, M.A.; Manoufali, M.; Zhang, J.; Al-Timemy, A.H.; et al. A Survey on Deep Learning Tools Dealing with Data Scarcity: Definitions, Challenges, Solutions, Tips, and Applications. J Big Data 2023, 10. [Google Scholar] [CrossRef]
- How Invert uses meta-learning to leverage old bioprocess data. Available online: https://invertbio.com/blogs/how-invert-uses-meta-learning-to-leverage-old-data-and-make-better-biomanufacturing-predictions (accessed on 25 June 2025).
- Duong-Trung, N.; Born, S.; Kim, J.W.; Schermeyer, M.-T.; Paulick, K.; Borisyak, M.; Cruz-Bournazou, M.N.; Werner, T.; Scholz, R.; Schmidt-Thieme, L.; et al. When Bioprocess Engineering Meets Machine Learning: A Survey from the Perspective of Automated Bioprocess Development. Biochem Eng J 2022, 190, 108764. [Google Scholar] [CrossRef]
- Pan, S.J.; Yang, Q. A Survey on Transfer Learning. IEEE Trans Knowl Data Eng 2010, 22, 1345–1359. [Google Scholar] [CrossRef]
- Weiss, K.; Khoshgoftaar, T.M.; Wang, D.D. A Survey of Transfer Learning. J Big Data 2016, 3. [Google Scholar] [CrossRef]
- Rogers, A.W.; Vega-Ramon, F.; Yan, J.; del Río-Chanona, E.A.; Jing, K.; Zhang, D. A Transfer Learning Approach for Predictive Modeling of Bioprocesses Using Small Data. Biotechnol Bioeng 2022, 119, 411–422. [Google Scholar] [CrossRef]
- Kensert, A.; Harrison, P.J.; Spjuth, O. Transfer Learning with Deep Convolutional Neural Networks for Classifying Cellular Morphological Changes. SLAS Discov 2019, 24, 466–475. [Google Scholar] [CrossRef] [PubMed]
- Iman, M.; Rasheed, K.; Arabnia, H.R. A Review of Deep Transfer Learning and Recent Advancements. Technologies 2022, 11. [Google Scholar] [CrossRef]
- Narayanan, H.; Luna, M.; Sokolov, M.; Butté, A.; Morbidelli, M. Hybrid Models Based on Machine Learning and an Increasing Degree of Process Knowledge: Application to Cell Culture Processes. Ind Eng Chem Res 2022, 61, 8658–8672. [Google Scholar] [CrossRef]
- Liashchynskyi, P.; Liashchynskyi, P. Grid Search, Random Search, Genetic Algorithm: A Big Comparison for NAS, arXiv 2019. arXiv:1912.06059v1.
- Snoek, J.; Larochelle, H.; Adams, R.P. Practical Bayesian Optimization of Machine Learning Algorithms, arXiv 2012. arXiv:1206.2944.
- Bolmanis, E.; Dubencovs, K.; Suleiko, A.; Vanags, J. Model Predictive Control—A Stand Out among Competitors for Fed-Batch Fermentation Improvement. Fermentation 2023, 9, 206. [Google Scholar] [CrossRef]
- Mears, L.; Stocks, S.M.; Sin, G.; Gernaey, K. V. A Review of Control Strategies for Manipulating the Feed Rate in Fed-Batch Fermentation Processes. J Biotechnol 2017, 245, 34–46. [Google Scholar] [CrossRef]
- Rashedi, M.; Khodabandehlou, H.; Demers, M.; Wang, T.; Garvin, C. Model Predictive Controller Design for Bioprocesses Based on Machine Learning Algorithms. In Proceedings of the 13th IFAC Symposium on Dynamics and Control of Process Systems, including Biosystems DYCOPS 2022, Busan, Republic of Korea, 14–17 June 2022; 45–50. [Google Scholar] [CrossRef]
- Eslami, T.; Jungbauer, A. Control Strategy for Biopharmaceutical Production by Model Predictive Control. Biotechnol Prog 2024, 40, 1–19. [Google Scholar] [CrossRef]
- Lyubenova, V.; Ignatova, M.; Zoteva, D.; Roeva, O. Model-Based Adaptive Control of Bioreactors—A Brief Review. Mathematics 2024, 12. [Google Scholar] [CrossRef]
- Mahanty, B. Hybrid Modeling in Bioprocess Dynamics: Structural Variabilities, Implementation Strategies, and Practical Challenges. Biotechnol Bioeng 2023, 120, 2072–2091. [Google Scholar] [CrossRef] [PubMed]
- De Jong, H.; Casagranda, S.; Giordano, N.; Cinquemani, E.; Ropers, D.; Geiselmann, J.; Gouzé, J.L. Mathematical Modelling of Microbes: Metabolism, Gene Expression and Growth. J R Soc Interface 2017, 14. [Google Scholar] [CrossRef] [PubMed]
- Tsopanoglou, A.; Jiménez del Val, I. Moving towards an Era of Hybrid Modelling: Advantages and Challenges of Coupling Mechanistic and Data-Driven Models for Upstream Pharmaceutical Bioprocesses. Curr Opin Chem Eng 2021, 32. [Google Scholar] [CrossRef]
- Galvanauskas, V.; Simutis, R.; Lübbert, A. Hybrid Modeling of Biochemical Processes. In Hybrid Modeling in Process Industries, 1st ed.; Glassey, J., von Stosch, M., Eds.; CRC Press: Boca Raton, USA, 2018; pp. 89–127. [Google Scholar]
- Cregg, J.M.; Cereghino, J.L.; Shi, J.; Higgins, D.R. Recombinant Protein Expression in Pichia Pastoris. Mol Biotechnol 2000, 16, 23–52. [Google Scholar] [CrossRef] [PubMed]
- Macauley-Patrick, S.; Fazenda, M.L.; McNeil, B.; Harvey, L.M. Heterologous Protein Production Using the Pichia Pastoris Expression System. Yeast 2005, 22, 249–270. [Google Scholar] [CrossRef]
- Bolmanis, E.; Grigs, O.; Kazaks, A.; Galvanauskas, V. High-Level Production of Recombinant HBcAg Virus-like Particles in a Mathematically Modelled P. Pastoris GS115 Mut+ Bioreactor Process under Controlled Residual Methanol Concentration. Bioprocess Biosyst Eng 2022, 45, 1447–1463. [Google Scholar] [CrossRef]
- Looser, V.; Bruhlmann, B.; Bumbak, F.; Stenger, C.; Costa, M.; Camattari, A.; Fotiadis, D.; Kovar, K. Cultivation Strategies to Enhance Productivity of Pichia Pastoris: A Review. Biotechnol Adv 2014, 33, 1177–1193. [Google Scholar] [CrossRef]
- Vuitika, L.; Côrtes, N.; Malaquias, V.B.; Silva, J.D.Q.; Lira, A.; Prates-Syed, W.A.; Schimke, L.F.; Luz, D.; Durães-Carvalho, R.; Balan, A.; et al. A Self-Adjuvanted VLPs-Based Covid-19 Vaccine Proven Versatile, Safe, and Highly Protective. Sci Rep 2024, 14. [Google Scholar] [CrossRef]
- Spohn, G.; Keller, I.; Beck, M.; Grest, P.; Jennings, G.T.; Bachmann, M.F. Active Immunization with IL-1 Displayed on Virus-like Particles Protects from Autoimmune Arthritis. Eur J Immunol 2008, 38, 877–887. [Google Scholar] [CrossRef] [PubMed]
- Freivalds, J.; Dislers, A.; Ose, V.; Skrastina, D.; Cielens, I.; Pumpens, P.; Sasnauskas, K.; Kazaks, A. Assembly of Bacteriophage Qβ Virus-like Particles in Yeast Saccharomyces Cerevisiae and Pichia Pastoris. J Biotechnol 2006, 123, 297–303. [Google Scholar] [CrossRef] [PubMed]
- Invitrogen Corporation Pichia Fermentation Process Guidelines. Available online: https://tools.thermofisher.com/content/sfs/manuals/pichiaferm_prot.pdf (accessed on day month year).
- Grigs, O.; Bolmanis, E.; Galvanauskas, V. Application of In-Situ and Soft-Sensors for Estimation of Recombinant P. Pastoris GS115 Biomass Concentration: A Case Analysis of HBcAg (Mut+) and HBsAg (MutS) Production Processes under Varying Conditions. Sensors 2021, 21, 1268. [Google Scholar] [CrossRef] [PubMed]
- Bolmanis, E.; Uhlendorff, S.; Pein-Hackelbusch, M.; Galvanauskas, V.; Grigs, O. Anomaly Detection and Removal Strategies for In-Line Permittivity Sensor Signal Used in Bioprocesses. Front. Bioeng. Biotechnol. 2025. (Article in press).
- Bolmanis, E.; Bogans, J.; Akopjana, I.; Suleiko, A.; Kazaka, T.; Kazaks, A. Production and Purification of Soy Leghemoglobin from Pichia Pastoris Cultivated in Different Expression Media. Processes 2023, 11, 3215. [Google Scholar] [CrossRef]
- White, C.; Ai, A.; Ai, C.; Neiswanger, W.; Savani, Y.; Ai, Y. BANANAS: Bayesian Optimization with Neural Architectures for Neural Architecture Search. arXiv 2020, arXiv:1910.11858. [Google Scholar] [CrossRef]
- Blume, S.; Benedens, T.; Schramm, D. Hyperparameter Optimization Techniques for Designing Software Sensors Based on Artificial Neural Networks. Sensors 2021, 21. [Google Scholar] [CrossRef]
- Gal, Y.; Ghahramani, Z. Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning. arXiv 2016, arXiv:1506.02142. [Google Scholar]
- Vien, B.S.; Kuen, T.; Rose, L.R.F.; Chiu, W.K. Optimisation and Calibration of Bayesian Neural Network for Probabilistic Prediction of Biogas Performance in an Anaerobic Lagoon. Sensors 2024, 24. [Google Scholar] [CrossRef]




| Exp No. | Strain, product | Induction time, h | DCW, g·L-1 | Feed rate, mL·min-1 | Vend, L | Reference |
|---|---|---|---|---|---|---|
| 1* | GS115, HBcAg | 65 | 37.5—101.6 | 0.12—0.78 | 2.85 | [44,50] |
| 2* | GS115, HBcAg | 45 | 40.6—113.5 | 0.12—1.00 | 3.09 | |
| 3* | GS115, HBcAg | 43 | 41.2—120.1 | 0.12—0.98 | 3.13 | |
| 4* | GS115, HBcAg | 50 | 59.2—120.1 | 0.12—0.36 | 2.54 | |
| 5* | GS115, HBcAg | 51 | 41.4—96.6 | 0.12—0.36 | 2.87 | |
| 6 | GS115, HBcAg | 48 | 49.1—120.0 | 0.12—0.50 | 2.88 | |
| 7 | GS115, HBcAg | 43 | 53.7—101.5 | 0.12—0.36 | 2.74 | Unpublished data |
| 8 | GS115, CA IX | 54 | 44.1—84.0 | 0.12—0.56 | 2.75 | |
| 9* | X-33, LegH | 65 | 55.4—123.2 | 0.12—0.36 | 2.57 | [52] |
| 10* | X-33, LegH | 46 | 49.5—95.4 | 0.12—0.60 | 2.98 | |
| 11 | X-33, LegH | 65 | 48.9—111.2 | 0.12—0.36 | 2.85 | |
| 12 | X-33, LegH | 48 | 56.4—105.3 | 0.12—0.50 | 2.63 | |
| 13 | X-33, LegH | 50 | 45.3—101.3 | 0.12—0.36 | 2.61 | |
| 14 | X-33, LegH | 45 | 52.9—103.1 | 0.12—0.36 | 2.55 | |
| 15 | X-33, LegH | 46 | 45.1—101.3 | 0.12—0.36 | 2.52 | |
| 16 | X-33, LegH | 65 | 51.0—101.7 | 0.12—0.36 | 2.66 | |
| 17 | X-33, LegH | 46 | 50.6—92.4 | 0.12—0.60 | 3.00 | |
| 18 | X-33, Qβ | 65 | 52.5—117.6 | 0.12—0.49 | 3.23 | This research |
| 19 | X-33, Qβ | 48 | 49.3—117.2 | 0.12—1.00 | 3.40 | |
| 20 | X-33, Qβ | 55 | 50.1—107.7 | 0.12—0.36 | 2.84 |
| First layer type | No. of FC layers | Hidden units | Nodes | Activation | Validation Loss (%) | No. of parameters | AICc |
|---|---|---|---|---|---|---|---|
| LSTM | 1 | 5 | 5 | Tanh | 9.73 | 268 | 2417 |
| 5 | 5 | LeakyReLu | 10.07 | 268 | 2431 | ||
| 5 | 4 | Tanh | 10.17 | 259 | 2312 | ||
| 1 | 5 | ReLu | 10.26 | 56 | 1126 | ||
| 5 | 5 | Tanh | 10.37 | 268 | 2444 | ||
| 3 | 5 | Tanh | 10.42 | 146 | 1448 | ||
| 4 | 5 | None | 10.51 | 203 | 1783 | ||
| 5 | 5 | Tanh | 10.68 | 268 | 2457 | ||
| 4 | 5 | ReLu | 10.75 | 203 | 1792 | ||
| 5 | 4 | Tanh | 11.30 | 259 | 2358 |
| Hidden units | Nodes | Activation | Validation Loss (%) | No. of parameters | AICc |
|---|---|---|---|---|---|
| 5 | 9 | LeakyReLu | 8.85 | 304 | 3050 |
| 4 | 10 | LeakyReLu | 9.27 | 243 | 2092 |
| 3 | 6 | LeakyReLu | 9.29 | 153 | 1427 |
| 3* | 5 | LeakyReLu | 9.37 | 146 | 1403 |
| 2* | 10 | LeakyReLu | 9.68 | 127 | 1334 |
| 2* | 9 | Tanh | 9.99 | 121 | 1316 |
| 2* | 8 | ReLu | 10.03 | 115 | 1302 |
| 1* | 9 | Tanh | 10.26 | 76 | 1189 |
| 1 | 6 | LeakyReLu | 10.21 | 61 | 1139 |
| 1 | 4 | ReLu | 10.25 | 51 | 1112 |
| 1 | 1 | Tanh | 10.36 | 36 | 1079 |
| Hidden units | Nodes | Activation | Validation Loss (%) | No. of parameters | AICc |
|---|---|---|---|---|---|
| 3 | 5 | LeakyReLu | 7.28 | 146 | 1294 |
| 2 | 10 | LeakyReLu | 6.37 | 127 | 1155 |
| 2 | 9 | Tanh | 8.14 | 121 | 1236 |
| 2 | 8 | ReLu | 4.93 | 115 | 998 |
| 1 | 9 | Tanh | 8.27 | 76 | 1090 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).