Submitted:
31 January 2023
Posted:
31 January 2023
You are already at the latest version
Abstract

Keywords:
1. Introduction
2. Materials and methods
2.1. General SBML hybrid model
2.1. Interfacing with SBML databases and SBML modeling tools
2.3. Case studies
- Step 1: The original systems biology model was retrieved from the JWS database in SBML format. The respective files are provided as supplementary material
- Step 2: A synthetic time series dataset was generated by simulating the original model in the JWS platform. The resulting data set is provided as supplementary material. This data is needed to train the hybrid model as proof-of-concept. No experimental data was used in this step. More details are provided in the results section.
- Step 3: A feedforward neural network (FFNN) was inserted in the mechanistic model and converted to the HMOD format using the SBML2HYB python tool, freely available in Pinto et al. [24]. The size of the FFNN and interface with the mechanistic model depended on the case study. More details are given in the results section.
- Step 4: the hybrid mechanistic/FFNN ensemble encoded in HMOD format was trained using a Octave/Matlab tool by applying the deep learning approach described by Pinto et al. [25] and the dataset generated in step 2. The ADAM algorithm with stochastic regularization and semidirect sensitivity equations was employed. Implementation details varied in the case studies (more to this in the results section). It should be noted that developing a robust hybrid model may require a deeper analysis than the one performed in this study. Factors such as the size/depth of the FFNN, data partitioning, weights initialization, sensitivity equations and ADAM parameters were investigated elsewhere [25]. The concern here was the proof-of-concept that such hybrid models may be efficiently trained to a comparable performance of the original mechanistic models. The final trained hybrid model with the updated FFNN weights was saved in HMOD format.
- Step 5: The trained hybrid model in HMOD format was reconverted to SBML using the SBML2HYB tool. In this step, the FFNN information is mapped as assignment rules in the SBML file. The hybrid model structure encoded in SBML was visualized using the freely available Cytoscape cy3sbml tool [30]. The respective hybrid model SBML files are provided as supplementary material.
- Step 6: The final trained hybrid model in SBML format is now freely available for the community to analyze. For proof-of-concept, the original SBML model (step 1) and the final trained SBML hybrid model (step 5) were simulated and compared using the JWS online simulator (https://jjj.bio.vu.nl/models/experiments/) showing that their output is practically coincident.
| Case Study | Number of species | Number of reactions | Number of parameters | JWS Online ID | Reference |
|---|---|---|---|---|---|
| E. coli threonine synthesis pathway | 11 | 7 | 47 | chassagnole1 | [26] |
| P58IPK signal transduction pathway | 9 (4 fixed) | 9 | 10 | goodman | [27] |
| Yeast glycolytic oscillations | 7 (1 fixed) | 11 | 31 | dano1 | [28] |

3. Results and discussion
3.1. Case study 2: P58IPK signal transduction pathway
3.1. Case study 3: yeast glycolytic oscillations
4. Conclusions
Supplementary Materials
Author Contributions
Funding
Conflicts of Interest
References
- Antonakoudis, A.; Barbosa, R.; Kotidis, P.; Kontoravdi, C. The era of big data: Genome-scale modelling meets machine learning. Comput Struct Biotec 2020, 18, 3287–3300. [Google Scholar] [CrossRef] [PubMed]
- von Stosch, M.; Oliveira, R.; Peres, J.; de Azevedo, S.F. Hybrid semi-parametric modeling in process systems engineering: Past, present and future. Computers & Chemical Engineering 2014, 60, 86–101. [Google Scholar] [CrossRef]
- Psichogios, D.C.; Ungar, L.H. A Hybrid Neural Network-1st Principles Approach to Process Modeling. Aiche Journal 1992, 38, 1499–1511. [Google Scholar] [CrossRef]
- Thompson, M.L.; Kramer, M.A. Modeling Chemical Processes Using Prior Knowledge and Neural Networks. Aiche Journal 1994, 40, 1328–1340. [Google Scholar] [CrossRef]
- Schubert, J.; Simutis, R.; Dors, M.; Havlik, I.; Lubbert, A. Hybrid Modeling of Yeast Production Processes - Combination of a-Priori Knowledge on Different Levels of Sophistication. Chem Eng Technol 1994, 17, 10–20. [Google Scholar] [CrossRef]
- Agharafeie, R.; Oliveira, R.; Ramos, J.; Mendes, J. Application of Hybrid Neural Models to Bioprocesses: A Systematic Literature Review. 2023. [Google Scholar] [CrossRef]
- Kim, Y.; Kim, G.B.; Lee, S.Y. Machine learning applications in genome-scale metabolic modeling. Current Opinion in Systems Biology 2021, 25. [Google Scholar] [CrossRef]
- Carinhas, N.; Bernal, V.; Teixeira, A.P.; Carrondo, M.J.T.; Alves, P.M.; Oliveira, R. Hybrid metabolic flux analysis: combining stoichiometric and statistical constraints to model the formation of complex recombinant products. Bmc Systems Biology 2011, 5. [Google Scholar] [CrossRef] [PubMed]
- Isidro, I.A.; Portela, R.M.; Clemente, J.J.; Cunha, A.E.; Oliveira, R. Hybrid metabolic flux analysis and recombinant protein prediction in Pichia pastoris X-33 cultures expressing a singlechain antibody fragment. Bioprocess and Biosystems Engineering 2016, 39, 1351–1363. [Google Scholar] [CrossRef]
- Ferreira, A.R.; Dias, J.M.L.; Teixeira, A.P.; Carinhas, N.; Portela, R.M.C.; Isidro, I.A.; von Stosch, M.; Oliveira, R. Projection to latent pathways (PLP): a constrained projection to latent variables (PLS) method for elementary flux modes discrimination. Bmc Systems Biology 2011, 5. [Google Scholar] [CrossRef]
- Teixeira, A.P.; Dias, J.M.L.; Carinhas, N.; Sousa, M.; Clemente, J.J.; Cunha, A.E.; von Stosch, M.; Alves, P.M.; Carrondo, M.J.T.; Oliveira, R. Cell functional enviromics: Unravelling the function of environmental factors. Bmc Systems Biology 2011, 5. [Google Scholar] [CrossRef]
- von Stosch, M.; Peres, J.; de Azevedo, S.F.; Oliveira, R. Modelling biochemical networks with intrinsic time delays: a hybrid semi-parametric approach. Bmc Systems Biology 2010, 4. [Google Scholar] [CrossRef] [PubMed]
- Folch-Fortuny, A.; Marques, R.; Isidro, I.A.; Oliveira, R.; Ferrer, A. Principal elementary mode analysis (PEMA). Mol Biosyst 2016, 12, 737–746. [Google Scholar] [CrossRef]
- von Stosch, M.; Hamelink, J.M.; Oliveira, R. Hybrid modeling as a QbD/PAT tool in process development: an industrial E-coli case study. Bioprocess and Biosystems Engineering 2016, 39, 773–784. [Google Scholar] [CrossRef] [PubMed]
- Folch-Fortuny, A.; Teusink, B.; Hoefsloot, H.C.J.; Smilde, A.K.; Ferrer, A. Dynamic elementary mode modelling of non-steady state flux data. Bmc Systems Biology 2018, 12. [Google Scholar] [CrossRef]
- Yang, J.H.; Wright, S.N.; Hamblin, M.; McCloskey, D.; Alcantar, M.A.; Schrubbers, L.; Lopatkin, A.J.; Satish, S.; Nili, A.; Palsson, B.O.; et al. A White-Box Machine Learning Approach for Revealing Antibiotic Mechanisms of Action. Cell 2019, 177, 1649–1661. [Google Scholar] [CrossRef]
- Lewis, J.E.; Kemp, M.L. Integration of machine learning and genome-scale metabolic modeling identifies multi-omics biomarkers for radiation resistance. Nat Commun 2021, 12. [Google Scholar] [CrossRef] [PubMed]
- Vijayakumar, S.; Rahman, P.K.S.M.; Angione, C. A Hybrid Flux Balance Analysis and Machine Learning Pipeline Elucidates Metabolic Adaptation in Cyanobacteria. Iscience 2020, 23. [Google Scholar] [CrossRef]
- Ramos, J.R.C.; Oliveira, G.P.; Dumas, P.; Oliveira, R. Genome-scale modeling of Chinese hamster ovary cells by hybrid semi-parametric flux balance analysis. Bioprocess Biosyst Eng 2022, 45, 1889–1904. [Google Scholar] [CrossRef]
- Le Novere, N.; Bornstein, B.; Broicher, A.; Courtot, M.; Donizelli, M.; Dharuri, H.; Li, L.; Sauro, H.; Schilstra, M.; Shapiro, B.; et al. BioModels Database: a free, centralized database of curated, published, quantitative kinetic models of biochemical and cellular systems. Nucleic Acids Research 2006, 34, D689–D691. [Google Scholar] [CrossRef]
- Olivier, B.G.; Snoep, J.L. Web-based kinetic modelling using JWS Online. Bioinformatics 2004, 20, 2143–2144. [Google Scholar] [CrossRef] [PubMed]
- Costa, R.S.; Verissimo, A.; Vinga, S. KiMoSys: a web-based repository of experimental data for KInetic MOdels of biological SYStems. BMC Syst Biol 2014, 8, 85. [Google Scholar] [CrossRef] [PubMed]
- Hucka, M.; Fineey, A.; Sauro, H.M.; al., e. The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics 2003, 19, 524–531. [CrossRef]
- Pinto, J.; Costa, R.S.; Alexandre, L.; Ramos, J.; Oliveira, R. SBML2HYB: a Python interface for SBML compatible hybrid modelling. Bioinformatics 2023. [Google Scholar] [CrossRef] [PubMed]
- Pinto, J.; Mestre, M.; Ramos, J.; Costa, R.S.; Striedner, G.; Oliveira, R. A general deep hybrid model for bioreactor systems: Combining first principles with deep neural networks. Computers & Chemical Engineering 2022, 165. [Google Scholar] [CrossRef]
- Chassagnole, C.; Fell, D.A.; Rais, B.; Kudla, B.; Mazat, J.P. Control of the threonine-synthesis pathway in Escherichia coli: a theoretical and experimental approach. Biochemical Journal 2001, 356, 433–444. [Google Scholar] [CrossRef] [PubMed]
- Goodman, A.G.; Tanner, B.C.W.; Chang, S.T.; Esteban, M.; Katze, M.G. Virus infection rapidly activates the P58(IPK) pathway, delaying peak kinase activation to enhance viral replication. Virology 2011, 417, 27–36. [Google Scholar] [CrossRef] [PubMed]
- Dano, S.; Madsen, M.F.; Schmidt, H.; Cedersund, G. Reduction of a biochemical model with preservation of its basic dynamic properties. Febs Journal 2006, 273, 4862–4877. [Google Scholar] [CrossRef] [PubMed]
- Hoops, S.; Sahle, S.; Gauges, R.; Lee, C.; Pahle, J.; Simus, N.; Singhhal, M.; Xu, L.; Mendes, P.; Kummer, U. COPASI — a COmplex PAthway SImulator. Bioinformatics 2006, 22, 3067–3074. [Google Scholar] [CrossRef]
- Konig, M.; Drager, A.; Holzhutter, H.G. CySBML: a Cytoscape plugin for SBML. Bioinformatics 2012, 28, 2402–2403. [Google Scholar] [CrossRef]







Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).