Submitted:
28 May 2026
Posted:
29 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Context and Motivation
1.2. State of the Art
1.2.1. Ecosystem Modeling and Digital Twins
1.2.2. Machine Learning for Biodiversity Monitoring and Prediction
1.2.3. Generative AI for Scenario Analysis and Decision Support
1.3. Research Gap and Proposed Solution
1.4. Contributions
- 1.
- Conversational configuration of ecological modeling workflows: We present a generative AI interface that allows users to configure the core of a conservation-oriented digital twin. It allows to inspect, modify and validate, and persist a structured YAML file to configure the system without directly editing code or complex files.
- 2.
- Configurable digital twin pipeline: We describe a modular prototype that connects data preprocessing, target and feature selection, model training, prediction generation, traceability metadata, and database-backed visualization in a reproducible workflow.
- 3.
- Explainability for conservation decision support: We incorporate SHAP-based feature attribution for the trained models, enabling users to complement predictions with interpretable information about the variables that most influence model outputs.
- 4.
- TRL-4 feasibility assessment in a National Park: We evaluate the system as a laboratory prototype applied to a real conservation context, focusing on functional capability, agent-assisted configuration, and qualitative usefulness rather than claiming operational predictive performance.
1.5. Paper Structure
2. Materials and Methods
2.1. Study Area: Doñana National Park
2.2. System Architecture Overview
2.3. Data Ingestion and Processing
2.3.1. Data Sources
- 1.
- Meteorological and environmental data: Historical climate records were obtained from the Spanish State Meteorological Agency (AEMET), including variables such as precipitation, temperature, relative humidity, and wind speed from monitoring stations within and around Doñana [43]. These data were complemented with measurements from the ICTS Doñana monitoring infrastructure [44], including maximum, mean, and minimum values for air pressure, relative humidity, air temperature, accumulated hail, hail duration, accumulated rainfall, rainfall duration, soil temperature, water level, and water temperature.
- 2.
- Waterbird census data: Long-term bird census data were collected from the ICTS Doñana infrastructure [45] for a set of iconic waterbird species selected for their ecological relevance and their potential to communicate conservation challenges to a broader audience. The selected species were Northern Pintail (Anas acuta), Greylag Goose (Anser anser), Black Stork (Ciconia nigra), Black-tailed Godwit (Limosa limosa), Eurasian Wigeon (Mareca penelope), Greater Flamingo (Phoenicopterus roseus), Eurasian Spoonbill (Platalea leucorodia), and Glossy Ibis (Plegadis falcinellus). These species provide a recognizable and policy-relevant biological layer for evaluating how the Digital Twin can support conservation-oriented exploration [5,46].
- 3.
- Iberian lynx population data: Population indicators for the Iberian lynx (Lynx pardinus) in Doñana were extracted from EBD-CSIC reports, including total individuals, reproductive females, and cubs born. Unlike many species in Doñana that show declining or unstable trends, the Iberian lynx is subject to an active reintroduction and recovery programme, which has produced an upward population trend. Including this species therefore allows the system to represent a contrasting conservation trajectory and to test how the Digital Twin handles species with different ecological and management dynamics.
- 4.
- Socio-demographic data: Population data for municipalities surrounding the park were obtained from the Spanish National Statistics Institute (INE) [47]. The selected municipalities were Almonte, Hinojos, La Puebla del Río, and Aznalcázar. These data were incorporated to complement environmental variables and to explore whether nearby human population size or population density, calculated using municipal surface area, may be associated with changes in the abundance of selected species.
2.3.2. Data Preprocessing Pipeline
2.4. Predictive Engine: Digital Twin Core
2.4.1. Model Library
2.4.2. Training and Model Selection
- Mean absolute error (MAE):
- Root mean squared error (RMSE):
- Mean absolute percentage Error (MAPE):
2.4.3. Prediction Generation and Traceability
2.5. Conversational AI Architecture and Configuration Management
2.6. Decision Support Dashboard and User Interaction Workflow
2.7. Implementation Details and Infrastructure
2.7.1. Prototype Deployment Architecture
2.7.2. Software Stack
- Backend: Python 3.11, FastAPI 0.120.1, Psycopg2 2.9.10, Pandas 2.3.0, NumPy 1.26.4.
- Machine learning: scikit-learn 1.7.0, XGBoost 3.0.2, statsmodels 0.14.4, matplotlib 3.10.3, SHAP 0.49.1.
- Conversational AI: Chainlit 2.6.2, Google Generative AI SDK 1.26.0.
- AI generated configuration validation: Pydantic 2.11.7.
- Containerization: Docker 29.0.2.
- Database: PostgreSQL 17.5.
- Deployment: AWS Copilot CLI for infrastructure-as-code (IaC) provisioning.
2.7.3. Evaluation Protocol
3. Results
3.1. Overview of Prototype Evaluation
3.2. Agent-Assisted Configuration Workflow
- Displaying the current configuration and explaining editable fields;
- Updating one or more configuration parameters, such as the target variable, feature set, time frequency, preprocessing method, or enabled models;
- Checking whether requested values are compatible with the known configuration structure;
- Asking for clarification when the request is ambiguous;
- Saving changes only after explicit user confirmation;
- Launching model training from the active configuration;
- Generating predictions from a trained model when the required model identifier and input information are available;
- Reporting tool errors directly to the user instead of hiding failed execution.
| Task category | Metric | Interpretation |
|---|---|---|
| Configuration inspection | Success rate: 90.0%, latency: 2.3 s | Ability to expose the current YAML state to non-technical users. |
| Single or multiple parameter updates | Correct update rate: 95.0%, clarification turns: 1.5 avg. | Ability to map natural language onto valid configuration fields. |
| Training requests | Execution success: 90.0%, returned metadata completeness: 100.0% | Ability to trigger the Digital Twin core and report model identifiers and diagnostics. |
| Prediction requests | Execution success: 90.0%, required inputs requested: 100.0% | Ability to route prediction requests according to model type and available data. |
| Error handling | Invalid changes blocked: 100.0%, errors reported: 100.0% | Ability to preserve human control and avoid silent failures. |
3.3. Representative Configuration Scenario
3.4. Selected Model Training and Prediction Runs
3.5. SHAP-Based Interpretability
3.6. Traceability and Dashboard Outputs
4. Discussion
4.1. Interpretation of Findings
4.2. Conversational Configuration as Access Infrastructure
4.3. The Role of SHAP in Conservation-Oriented Modeling
4.4. Implications for Ecosystem Management
4.5. Comparison with Existing Approaches
4.6. Current Limitations
4.7. Future Research Directions
- Expanded predictive validation: Predictive experiments should be reproduced on selected conservation variables using richer and more complex datasets. This would allow the system to be evaluated against specific conservation problems and would help determine where the current modeling pipeline is useful, where errors remain too high, and which variables or data sources are needed to improve performance.
- Integration of remote sensing data: Incorporating satellite-derived data, such as Sentinel-2 vegetation indices or Sentinel-1 soil moisture indicators, could improve spatial resolution and predictive capacity [75,76]. Recent advances in geospatial AI could also support automated feature extraction from imagery [77].
- End-user validation and participatory design: The system should be tested with park managers, conservation practitioners, and other end users to validate usability, trust, and practical value. Participatory modeling workshops could help refine the interface, identify real decision-making needs, and complement quantitative predictions with qualitative expertise from local stakeholders.
- Domain-aware conversational agent: The agent should be extended with domain-specific knowledge to improve its ability to guide users, interpret modeling outputs, and identify potential limitations in the results. Retrieval-Augmented Generation (RAG) could be used to incorporate technical documents, project documentation, ecological references, and operational guidelines into the agent’s responses.
- Controlled field deployment and long-term monitoring: Moving beyond TRL-4 will require deployment in a controlled operational setting within Doñana National Park. Longitudinal studies would be needed to assess how the system integrates into daily workflows, how users rely on its outputs over time, and whether it has measurable effects on decision-making processes or conservation outcomes [74].
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A. System Prompt and Configuration
Appendix A.1. Configuration Agent System Prompt
| Listing A1: Agent System Prompt |
|
Appendix A.2. Default Configuration File
| Listing A2: Digital twin core configuration file |
|
References
- IPBES. Global Assessment Report on Biodiversity and Ecosystem Services; IPBES Secretariat: Bonn, Germany, 2019. [CrossRef]
- Steffen, W.; Richardson, K.; Rockström, J.; Schellnhuber, H.J.; Dube, O.P.; Dutreuil, S.; Lenton, T.M.; Lubchenco, J. The emergence and evolution of Earth System Science. Nature Reviews Earth & Environment 2020, 1, 54–63. [CrossRef]
- Jetz, W.; McGeoch, M.A.; Guralnick, R.; Ferrier, S.; Beck, J.; Costello, M.J.; Fernandez, M.; Geller, G.N.; Keil, P.; Merow, C.; et al. Essential biodiversity variables for mapping and monitoring species populations. Nature Ecology & Evolution 2019, 3, 539–551. [CrossRef]
- González-García, A.; Palomo, I.; González, J.A.; García-Nieto, A.P.; Montes, C.; Martín-López, B. Quantifying spatial supply-demand mismatches in ecosystem services provides insights for land-use planning. Land Use Policy 2023, 124, 106397. [CrossRef]
- Rendón, M.A.; Green, A.J.; Aguilera, E.; Almaraz, P. Status, distribution and long-term changes in the waterbird community wintering in Doñana, south-west Spain. Biological Conservation 2008, 141, 1371–1388. [CrossRef]
- Green, A.J.; Alcorlo, P.; Peeters, E.T.; Morris, E.P.; Espinar, J.L.; Bravo-Utrera, M.A.; Bustamante, J.; Díaz-Delgado, R.; Koelmans, A.A.; Mateo, R.; et al. Creating a safe operating space for wetlands in a changing climate. Frontiers in Ecology and the Environment 2017, 15, 99–107. [CrossRef]
- Cabin, R.J. Intelligent Tinkering: Bridging the Gap Between Science and Practice; Island Press: Washington, DC, USA, 2018; ISBN 978-1610918688.
- Pennekamp, F.; Iles, A.C.; Garland, J.; Brennan, G.; Brose, U.; Gaedke, U.; Jacob, U.; Kratina, P.; Matthews, B.; Momo, F.; et al. The intrinsic predictability of ecological time series and its potential to guide forecasting. Ecological Monographs 2019, 89, e01359. [CrossRef]
- Mouquet, N.; Lagadeuc, Y.; Devictor, V.; Doyen, L.; Duputié, A.; Eveillard, D.; Faure, D.; Garnier, E.; Gimenez, O.; Huneman, P.; et al. Predictive ecology in a changing world. Journal of Applied Ecology 2015, 52, 1293–1310. [CrossRef]
- Addison, P.F.E.; Rumpff, L.; Bau, S.S.; Carey, J.M.; Chee, Y.E.; Jarrad, F.C.; McBride, M.F.; Burgman, M.A. Practical solutions for making models indispensable in conservation decision-making. Diversity and Distributions 2013, 19, 490–502. [CrossRef]
- Rose, D.C.; Sutherland, W.J.; Amano, T.; González-Varo, J.P.; Robertson, R.J.; Simmons, B.I.; Wauchope, H.S.; Kovacs, E.; Durán, A.P.; Vadrot, A.B.; et al. The major barriers to evidence-informed conservation policy and possible solutions. Conservation Letters 2020, 13, e12564. [CrossRef]
- Grieves, M.; Vickers, J. Digital Twin: Mitigating Unpredictable, Undesirable Emergent Behavior in Complex Systems. In Transdisciplinary Perspectives on Complex Systems; Kahlen, F.J., Flumerfelt, S., Alves, A., Eds.; Springer: Cham, Switzerland, 2016; pp. 85–113. [CrossRef]
- Batty, M. Digital twins. Environment and Planning B: Urban Analytics and City Science 2018, 45, 817–820. [CrossRef]
- VanDerHorn, E.; Mahadevan, S. Digital Twin: Generalization, characterization and implementation. Decision Support Systems 2021, 145, 113524. [CrossRef]
- Reichstein, M.; Camps-Valls, G.; Stevens, B.; Jung, M.; Denzler, J.; Carvalhais, N.; Prabhat. Deep learning and process understanding for data-driven Earth system science. Nature 2019, 566, 195–204. [CrossRef]
- Willard, J.; Jia, X.; Xu, S.; Steinbach, M.; Kumar, V. Integrating scientific knowledge with machine learning for engineering and environmental systems. ACM Computing Surveys 2022, 55, 1–37. [CrossRef]
- Moriasi, D.N.; Gitau, M.W.; Pai, N.; Daggupati, P. Hydrologic and water quality models: Performance measures and evaluation criteria. Transactions of the ASABE 2015, 58, 1763–1785. [CrossRef]
- Harris, D.J.; Taylor, S.D.; White, E.P. Forecasting biodiversity in breeding birds using best practices. PeerJ 2018, 6, e4278. [CrossRef]
- Christin, S.; Hervet, É.; Lecomte, N. Applications for deep learning in ecology. Methods in Ecology and Evolution 2019, 10, 1632–1644. [CrossRef]
- Thakur, M.P.; van der Putten, W.H.; Wilschut, R.A.; Veen, G.F.; Kardol, P.; van Ruijven, J.; Allan, E.; Roscher, C.; van Kleunen, M.; Bezemer, T.M. Plant-soil feedbacks and temporal dynamics of plant diversity-productivity relationships. Trends in Ecology & Evolution 2021, 36, 651–661. [CrossRef]
- Valavi, R.; Guillera-Arroita, G.; Lahoz-Monfort, J.J.; Elith, J. Predictive performance of presence-only species distribution models: A benchmark study with reproducible code. Ecological Monographs 2022, 92, e01486. [CrossRef]
- Ward, E.J.; Holmes, E.E.; Thorson, J.T.; Collen, B. Complexity is costly: A meta-analysis of parametric and non-parametric methods for short-term population forecasting. Oikos 2014, 123, 652–661. [CrossRef]
- Bellard, C.; Bertelsmeier, C.; Leadley, P.; Thuiller, W.; Courchamp, F. Impacts of climate change on the future of biodiversity. Ecology Letters 2012, 15, 365–377. [CrossRef]
- Cutler, D.R.; Edwards Jr, T.C.; Beard, K.H.; Cutler, A.; Hess, K.T.; Gibson, J.; Lawler, J.J. Random forests for classification in ecology. Ecology 2007, 88, 2783–2792. [CrossRef]
- Mi, C.; Huettmann, F.; Guo, Y.; Han, X.; Wen, L. Why choose Random Forest to predict rare species distribution with few samples in large undersampled areas? Three Asian crane species models provide supporting evidence. PeerJ 2017, 5, e2849. [CrossRef]
- Pullin, A.S.; Knight, T.M. Doing more good than harm - Building an evidence-base for conservation and environmental management. Biological Conservation 2009, 142, 931–934. [CrossRef]
- Cook, C.N.; Possingham, H.P.; Fuller, R.A. Contribution of systematic reviews to management decisions. Conservation Biology 2013, 27, 902–915. [CrossRef]
- Cook, C.N.; Mascia, M.B.; Schwartz, M.W.; Possingham, H.P.; Fuller, R.A. Achieving conservation science that bridges the knowledge-action boundary. Conservation Biology 2013, 27, 669–678. [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. In Advances in Neural Information Processing Systems 30; Curran Associates, Inc.: Red Hook, NY, USA, 2017; pp. 5998–6008.
- Brown, T.B.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A.; et al. Language models are few-shot learners. In Advances in Neural Information Processing Systems 33; Curran Associates, Inc.: Red Hook, NY, USA, 2020; pp. 1877–1901.
- Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1; Association for Computational Linguistics: Minneapolis, MN, USA, 2019; pp. 4171–4186. [CrossRef]
- Raffel, C.; Shazeer, N.; Roberts, A.; Lee, K.; Narang, S.; Matena, M.; Zhou, Y.; Li, W.; Liu, P.J. Exploring the limits of transfer learning with a unified text-to-text transformer. Journal of Machine Learning Research 2020, 21, 1–67.
- Toderas, M. Artificial Intelligence for Sustainability: A Systematic Review and Critical Analysis of AI Applications, Challenges, and Future Directions. Sustainability 2025, 17, 8049. [CrossRef]
- Liu, P.; Yuan, W.; Fu, J.; Jiang, Z.; Hayashi, H.; Neubig, G. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. ACM Computing Surveys 2023, 55, 1–35. [CrossRef]
- Kasneci, E.; Seßler, K.; Küchemann, S.; Bannert, M.; Dementieva, D.; Fischer, F.; Gasser, U.; Groh, G.; Günnemann, S.; Hüllermeier, E.; et al. ChatGPT for good? On opportunities and challenges of large language models for education. Learning and Individual Differences 2023, 103, 102274. [CrossRef]
- Tondeur, J.; Petko, D.; Christensen, R.; Drossel, K.; Starkey, L.; Knezek, G.; Schmidt-Crawford, D.A. Quality criteria for conceptual technology integration models in education: Bridging research and practice. Educational Technology Research and Development 2023, 71, 1915–1937.
- McIntosh, B.S.; Ascough, J.C.; Twery, M.; Chew, J.; Elmahdi, A.; Haase, D.; Harou, J.J.; Hepting, D.; Cuddy, S.; Jakeman, A.J.; et al. Environmental decision support systems (EDSS) development - Challenges and best practices. Environmental Modeling & Software 2011, 26, 1389–1402. [CrossRef]
- Matthies, M.; Giupponi, C.; Ostendorf, B. Environmental decision support systems: Current issues, methods and tools. Environmental Modeling & Software 2007, 22, 123–127. [CrossRef]
- Rodríguez-Caro, R.C.; Graciá, E.; Anadón, J.D.; Giménez, A. Maintained effects of fire on individual growth and survival rates in a spur-thighed tortoise population. European Journal of Wildlife Research 2019, 65, 1–9.
- Camacho, C.; Negro, J.J.; Elmberg, J.; Fox, A.D.; Nagy, S.; Pain, D.J.; Green, A.J. Groundwater extraction poses extreme threat to Doñana World Heritage Site. Nature Ecology & Evolution 2022, 6, 654–655. [CrossRef]
- Liu, Y.; Tang, Q.; Liu, X.; Wang, G.; Zhang, X.; Leng, G. Recent decrease in summer precipitation over the Iberian Peninsula closely linked to the weakening of local moisture recycling. Hydrology and Earth System Sciences 2022, 26, 1925–1936. [CrossRef]
- Newman, S. Building Microservices: Designing Fine-Grained Systems; O’Reilly Media: Sebastopol, CA, USA, 2015; ISBN 978-1491950357.
- Agencia Estatal de Meteorología (AEMET). Climate Data Portal. Available online: https://www.aemet.es/en/datos_abiertos (accessed on 15 December 2025).
- ICTS-RBD. Hydromet: Hydrometeorological Data; Singular Scientific and Technical Infrastructure – Doñana Biological Reserve: Seville, Spain, 2026. Available online: https://datos-automaticos.icts-donana.es/en/ (last accessed on 14 May 2026).
- ICTS-DOÑANA. Online Database: Aerial Waterbird Census in the Guadalquivir River Marshes; Doñana Biological Station (EBD-CSIC): Seville, Spain, 2026. Available online: https://censos-aereos.icts-donana.es/ (last accessed on 14 May 2026).
- Santoro, S.; Green, A.J.; Figuerola, J. Immigration enhances fast growth of a newly established source population. Ecology 2013, 94, 1058–1067.
- Instituto Nacional de Estadística (INE). Base de Datos de Indicadores Urbanos; INE: Madrid, Spain, 2026. Available online: https://www.ine.es/dyngs/DAB/index.htm?cid=1100 (last accessed on 15 May 2026).
- PostgreSQL Global Development Group. PostgreSQL 15 Documentation. Available online: https://www.postgresql.org/docs/15/ (last accessed on 15 December 2025).
- Hyndman, R.J.; Athanasopoulos, G. Forecasting: Principles and Practice, 3rd ed.; OTexts: Melbourne, Australia, 2018. Available online: https://otexts.com/fpp3/ (accessed on 15 December 2025).
- James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning with Applications in R; Springer: New York, NY, USA, 2013; ISBN 978-1461471370. [CrossRef]
- Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.J. Classification and Regression Trees; Wadsworth International Group: Belmont, CA, USA, 1984.
- Breiman, L. Random forests. Machine Learning 2001, 45, 5–32.
- Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; San Francisco, CA, USA, 13–17 August 2016; pp. 785–794.
- Box, G.E.P.; Jenkins, G.M. Time Series Analysis: Forecasting and Control; Holden-Day: San Francisco, CA, USA, 1970; ISBN 978-0816211043.
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 2011, 12, 2825–2830.
- Seabold, S.; Perktold, J. Statsmodels: Econometric and statistical modeling with Python. In Proceedings of the 9th Python in Science Conference (SciPy 2010); Austin, TX, USA, 28 June–3 July 2010; pp. 92–96. [CrossRef]
- Bergmeir, C.; Benítez, J.M. On the use of cross-validation for time series predictor evaluation. Information Sciences 2012, 191, 192–213. [CrossRef]
- Tashman, L.J. Out-of-sample tests of forecasting accuracy: An analysis and review. International Journal of Forecasting 2000, 16, 437–450. [CrossRef]
- Gemini Team; Anil, R.; Borgeaud, S.; Alayrac, J.-B.; Yu, J.; Soricut, R.; Schalkwyk, J.; Dai, A.M.; Hauth, A.; Millican, K.; et al. Gemini: A Family of Highly Capable Multimodal Models. arXiv 2024, arXiv:2312.11805. [CrossRef]
- Chainlit Development Team. Chainlit: Build Production-Ready Conversational AI. Available online: https://docs.chainlit.io/ (last accessed on 20 December 2025).
- Schick, T.; Dwivedi-Yu, J.; Dessì, R.; Raileanu, R.; Lomeli, M.; Zettlemoyer, L.; Cancedda, N.; Scialom, T. Toolformer: Language models can teach themselves to use tools. arXiv preprint arXiv:2302.04761 2023.
- Amershi, S.; Weld, D.; Vorvoreanu, M.; Fourney, A.; Nushi, B.; Collisson, P.; Suh, J.; Iqbal, S.; Bennett, P.N.; Inkpen, K.; et al. Guidelines for human-AI interaction. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems; Glasgow, UK, 4–9 May 2019; pp. 1–13. [CrossRef]
- Microsoft Corporation. Power BI Documentation. Available online: https://docs.microsoft.com/en-us/power-bi/ (last accessed on 20 December 2025).
- Lundberg, S.M.; Lee, S.I. A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems 30; Curran Associates, Inc.: Red Hook, NY, USA, 2017; pp. 4765–4774.
- Google Cloud. Vertex AI Documentation. Available online: https://docs.cloud.google.com/vertex-ai/docs (last accessed on 15 May 2026).
- Mankins, J.C. Technology readiness levels: A white paper. Advanced Concepts Office, Office of Space Access and Technology, NASA, April 1995.
- Laniak, G.F.; Olchin, G.; Goodall, J.; Voinov, A.; Hill, M.; Glynn, P.; Whelan, G.; Geller, G.; Quinn, N.; Blind, M.; et al. Integrated environmental modeling: A vision and roadmap for the future. Environmental Modeling & Software 2013, 39, 3–23. [CrossRef]
- Arrieta, A.B.; Díaz-Rodríguez, N.; Del Ser, J.; Bennetot, A.; Tabik, S.; Barbado, A.; García, S.; Gil-López, S.; Molina, D.; Benjamins, R.; et al. Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information Fusion 2020, 58, 82–115. [CrossRef]
- Saltelli, A.; Bammer, G.; Bruno, I.; Charters, E.; Di Fiore, M.; Didier, E.; Nelson Espeland, W.; Kay, J.; Lo Piano, S.; Mayo, D.; et al. Five ways to ensure that models serve society: A manifesto. Nature 2020, 582, 482–484. [CrossRef]
- Karpatne, A.; Atluri, G.; Faghmous, J.H.; Steinbach, M.; Banerjee, A.; Ganguly, A.; Shekhar, S.; Samatova, N.; Kumar, V. Theory-guided data science: A new paradigm for scientific discovery from data. IEEE Transactions on Knowledge and Data Engineering 2017, 29, 2318–2331. [CrossRef]
- Lütkepohl, H. New Introduction to Multiple Time Series Analysis; Springer: Berlin, Germany, 2005; ISBN 978-3540401728. [CrossRef]
- Chen, R.T.Q.; Rubanova, Y.; Bettencourt, J.; Duvenaud, D. Neural ordinary differential equations. In Advances in Neural Information Processing Systems 31; Curran Associates, Inc.: Red Hook, NY, USA, 2018; pp. 6571–6583.
- Gelman, A.; Carlin, J.B.; Stern, H.S.; Dunson, D.B.; Vehtari, A.; Rubin, D.B. Bayesian Data Analysis, 3rd ed.; CRC Press: Boca Raton, FL, USA, 2013; ISBN 978-1439840955.
- Stem, C.; Margoluis, R.; Salafsky, N.; Brown, M. Monitoring and evaluation in conservation: A review of trends and approaches. Conservation Biology 2005, 19, 295–309. [CrossRef]
- Turner, W.; Rondinini, C.; Pettorelli, N.; Mora, B.; Leidner, A.K.; Szantoi, Z.; Buchanan, G.; Dech, S.; Dwyer, J.; Herold, M.; et al. Free and open-access satellite data are key to biodiversity conservation. Biological Conservation 2015, 182, 173–176. [CrossRef]
- Pettorelli, N.; Schulte to Bühne, H.; Tulloch, A.; Dubois, G.; Macinnis-Ng, C.; Queirós, A.M.; Keith, D.A.; Wegmann, M.; Schrodt, F.; Stellmes, M.; et al. Satellite remote sensing of ecosystem functions: Opportunities, challenges and way forward. Remote Sensing in Ecology and Conservation 2018, 4, 71–93. [CrossRef]
- Zhu, X.X.; Tuia, D.; Mou, L.; Xia, G.S.; Zhang, L.; Xu, F.; Fraundorfer, F. Deep learning in remote sensing: A comprehensive review and list of resources. IEEE Geoscience and Remote Sensing Magazine 2017, 5, 8–36. [CrossRef]






| Target variable | Training configuration | Results |
|---|---|---|
| Iberian lynxes individuals | Features: target lags (1, 2, 5, and 10 years), cubs born, mean temperature, mean precipitation, and population of nearby municipalities. |
MAPE: 26.8%; RMSE: 0.80. Selected model: XGBoost. |
| Anas acuta individuals | Features: target lags (1, 2, 5, and 10 years), mean precipitation, mean water temperature, mean soil temperature, mean water level, mean water temperature, and mean rainfall duration. |
MAPE: 54.15%; RMSE: 1.33. Selected model: XGBoost. |
| Anser anser individuals | Features: target lags (1, 2, 5, and 10 years), mean precipitation, mean water temperature, mean soil temperature, mean water level, mean water temperature, and mean rainfall duration. | MAPE: 36.76%; RMSE: 0.79. Selected model: DecisionTreeRegressor. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).