Submitted:
19 July 2024
Posted:
22 July 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Problem Definition
1.2. Motivation
1.3. Aims and Objectives
2. Materials and Methods
2.1. The SSC Dataset
2.2. The Lagrangian Model
2.3. Time Series Modelling
2.4. Data Integration and Preprocessing
2.5. The Lagrangian Model Development
- CheckOutOfBounds: deletes particles from the simulation if they move beyond the defined boundaries. This is necessary because no data is available outside the boundary, causing particles to get stuck.
- CheckError: deletes particles encountering computational errors. This ensures the simulation proceeds without disrupted or incorrect particle data.
- UpdateElapsedTime: shows how long a particle has been in the simulation. This tracks the duration of the particle within the environment.
- UpdatePreviousPosition: captures the position of particles before they move. This is useful as it allows us to save all the previous positions of the particles.
- ReflectOnLand: applies a reflection behaviour when particles encounter land, as defined by the land-sea mask. It also introduces a probabilistic component where there is a 15% chance that particles will `beach’ and be removed from the simulation, while the remaining 85% chance allows particles to be reflected back into the sea. This probabilistic distribution is justified by the geographic characteristics of Malta, where the predominance of rocky coastlines over sandy beaches increases the likelihood of debris being deflected back into the sea rather than getting beached.
2.6. ML Model Selection
2.6.1. Data Preprocessing and Geospatial Filtering
2.6.2. The Main Loop
2.6.3. Making Real-World Predictions

2.7. Integrating the ML Models with the Lagrangian Model

2.8. Evaluation Strategy
- Mean, median, and standard deviation of centroids: we compute the geographical centroids of the merged predictions from both the LSTM and GRU models to assess the proximity of the final debris movement predictions generated by the two models. Smaller mean, median, and standard deviation values suggest a higher degree of consistency between the models’ predictions.
- Spread of LSTM and GRU: the spatial spread is determined by calculating the standard deviation of distances from each model’s centroid. A lower standard deviation indicated a tighter clustering around the centroid, reflecting more consistent model performance across the area.
- Longitudinal and latitudinal skewness of LSTM and GRU: to understand the directional tendencies of the models’ predictions, we calculate the skewness for the distribution of the prediction points’ longitude and latitude. A skewness close to zero indicates a symmetrical distribution of prediction errors, whereas a positive or negative skewness value points to a systematic bias in a particular direction.
3. Results
3.1. LSTM vs GRU
3.1.1. Error Metrics Results
3.1.2. Discussion of Error Metrics Results
3.2. Geospatial Analysis
3.2.1. A Hypothesis
3.2.2. Heat Maps Results and Analysis
3.2.3. Comparison of Lagrangian Simulations
3.2.4. Comparison of Final Lagrangian Visualisations

4. Discussion and Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| ANN | Artificial Neural Networks |
| GRU | Gated Recurrent Unit |
| IQR | Interquartile Range |
| LSTM | Long Short-Term Memory |
| MAE | Mean Absolute Error |
| ML | Machine Learning |
| MSE | Mean Squared Error |
| NetCDF | Network Common Data Form |
| NaN | Not a Number |
| RNN | Recurrent Neural Network |
| RMSE | Root Mean Squared Error |
| CSV | Comma-separated Value |
| SSC | Sea Surface Currents |
References
- Kehl, C.; Nooteboom, P.D.; Kaandorp, M.L.A.; van Sebille, E. Efficiently simulating Lagrangian particles in large-scale ocean flows — Data structures and their impact on geophysical applications. Computers and Geosciences 2023, 175, 105322. [Google Scholar] [CrossRef]
- Suaria, G.; Aliani, S. Floating debris in the Mediterranean Sea. Marine pollution bulletin 2014, 86, 494–504. [Google Scholar] [CrossRef] [PubMed]
- Compa, M.; Alomar, C.; Wilcox, C.; van Sebille, E.; Lebreton, L.; Hardesty, B.D.; Deudero, S. Risk assessment of plastic pollution on marine diversity in the Mediterranean Sea. Science of The Total Environment 2019, 678, 188–196. [Google Scholar] [CrossRef] [PubMed]
- Laist, D.W. Impacts of Marine Debris: Entanglement of Marine Life in Marine Debris Including a Comprehensive List of Species with Entanglement and Ingestion Records; Marine Debris: Sources, Impacts, and Solutions, Springer New York: New York, NY, 1997; pp. 99–139. [CrossRef]
- Rochman, C.M.; Browne, M.A.; Underwood, A.J.; van Franeker, J.A.; Thompson, R.C.; Amaral-Zettler, L.A. The ecological impacts of marine debris: unraveling the demonstrated evidence from what is perceived. Ecology 2016, 97, 302–312. [Google Scholar] [CrossRef] [PubMed]
- Agamuthu, P.; Mehran, S.B.; Norkhairah, A.; Norkhairiyah, A. Marine debris: A review of impacts and global initiatives. Waste management and research : the journal of the International Solid Wastes and Public Cleansing Association, ISWA 2019, 37, 987–1002. [Google Scholar] [CrossRef]
- Mansui, J.; Darmon, G.; Ballerini, T.; van Canneyt, O.; Ourmieres, Y.; Miaud, C. Predicting marine litter accumulation patterns in the Mediterranean basin: Spatio-temporal variability and comparison with empirical data. Progress in Oceanography 2020, 182, 102268. [Google Scholar] [CrossRef]
- Ryan, P.G. A Brief History of Marine Litter Research; Marine Anthropogenic Litter, Springer International Publishing: Cham, 2015; pp. 1–25. [CrossRef]
- Harlan, J.; Terrill, E.; Hazard, L.; Keen, C.; Barrick, D.; Whelan, C.; Howden, S.; Kohut, J. The Integrated Ocean Observing System High-Frequency Radar Network: Status and Local, Regional, and National Applications. Marine Technology Society journal 2010, 44, 122–132. [Google Scholar] [CrossRef]
- UNIDATA | NETCDF. https://www.unidata.ucar.edu/software/netcdf/. Accessed: 14-03-2024.
- van Sebille, E.; Griffies, S.M.; Abernathey, R.; Adams, T.P.; Berloff, P.; Biastoch, A.; Blanke, B.; Chassignet, E.P.; Cheng, Y.; Cotter, C.J.; et al. Lagrangian ocean analysis: Fundamentals and practices. Ocean Modelling 2018, 121, 49–75. [Google Scholar] [CrossRef]
- Lonin, S.A. Lagrangian model for oil spill diffusion at sea. Spill Science and Technology Bulletin 1999, 5, 331–336. [Google Scholar] [CrossRef]
- Lebreton, L.C.M.; Greer, S.D.; Borrero, J.C. Numerical modelling of floating debris in the world’s oceans. Marine pollution bulletin 2012, 64, 653–661. [Google Scholar] [CrossRef]
- Dawson, M.N.; Gupta, A.S.; England, M.H. Coupled biophysical global ocean model and molecular genetic analyses identify multiple introductions of cryptogenic species. Proceedings of the National Academy of Sciences 2005, 102, 11968–11973. [Google Scholar] [CrossRef] [PubMed]
- Hertwig, D.; Burgin, L.; Gan, C.; Hort, M.; Jones, A.; Shaw, F.; Witham, C.; Zhang, K. Development and demonstration of a Lagrangian dispersion modeling system for real-time prediction of smoke haze pollution from biomass burning in Southeast Asia. Journal of geophysical research. Atmospheres 2015, 120, 12605–12630. [Google Scholar] [CrossRef]
- Williams, R.G.; Follows, M.J. Ocean Dynamics and the Carbon Cycle: Principles and Mechanisms; Cambridge University Press: Cambridge, 2011. [Google Scholar] [CrossRef]
- OceanParcels. https://oceanparcels.org. Accessed: 2024-03-27.
- PyGNOME. https://gnome.orr.noaa.gov/doc/pygnome/index.html. Accessed: 2024-03-27.
- Pisso, I.; Sollum, E.; Grythe, H.; Kristiansen, N.I.; Cassiani, M.; Eckhardt, S.; Arnold, D.; Morton, D.; Thompson, R.L.; Zwaaftink, C.G.; et al. The Lagrangian particle dispersion model FLEXPART version 10.4. 2019. [Google Scholar] [CrossRef]
- Adhikari, R.; Agrawal, R.K. An Introductory Study on Time Series Modeling and Forecasting. ArXiv 2013. [Google Scholar]
- Raicharoen, T.; Lursinsap, C.; Sanguanbhokai, P. Application of critical support vector machine to time series prediction. 2003, Vol. 5, p. V. [CrossRef]
- Raksha, S.; Graceline, J.S.; Anbarasi, J.; Prasanna, M.; Kamaleshkumar, S. Weather Forecasting Framework for Time Series Data using Intelligent Learning Models. 2021, pp. 783–787. [CrossRef]
- Chatterjee, A.; Bhowmick, H.; Sen, J. Stock Price Prediction Using Time Series, Econometric, Machine Learning, and Deep Learning Models. 2021, pp. 289–296. [CrossRef]
- Wang, P.; Gurmani, S.H.; Tao, Z.; Liu, J.; Chen, H. Interval time series forecasting: A systematic literature review. Journal of forecasting 2024, 43, 249–285. [Google Scholar] [CrossRef]
- Jadon, S.; Milczek, J.; Patankar, A. Challenges and approaches to time-series forecasting in data center telemetry: A Survey. Technical report, Cornell University Library, arXiv.org, 2021. [CrossRef]
- Alsharef, A.; Sonia. ; Kumar, K.; Iwendi, C. Time Series Data Modeling Using Advanced Machine Learning and AutoML. Sustainability 2022, 14. [Google Scholar] [CrossRef]
- Hamayel, M.J.; Owda, A.Y. A Novel Cryptocurrency Price Prediction Model Using GRU, LSTM and bi-LSTM Machine Learning Algorithms. AI 2021, 2, 496. [Google Scholar] [CrossRef]
- Yamak, P.T.; Yujian, L.; Gadosey, P.K. A Comparison between ARIMA, LSTM, and GRU for Time Series Forecasting, 2020. [CrossRef]
- Eriksen, M.; Lebreton, L.C.M.; Carson, H.S.; Thiel, M.; Moore, C.J.; Borerro, J.C.; Galgani, F.; Ryan, P.G.; Reisser, J. Plastic Pollution in the World’s Oceans: More than 5 Trillion Plastic Pieces Weighing over 250,000 Tons Afloat at Sea. PloS one 2014, 9, e111913. [Google Scholar] [CrossRef] [PubMed]
- Ali, A.M.; Zhuang, H.; VanZwieten, J.; Ibrahim, A.K.; Chérubin, L. A Deep Learning Model for Forecasting Velocity Structures of the Loop Current System in the Gulf of Mexico. Forecasting 2021, 3, 953. [Google Scholar] [CrossRef]
- Zulfa, I.I.; Novitasari, D.C.R.; Setiawan, F.; Fanani, A.; Hafiyusholeh, M. Prediction of Sea Surface Current Velocity and Direction Using LSTM. IJEIS (Indonesian Journal of Electronics and Instrumentation Systems) (Online) 2021, 11, 93–102. [Google Scholar] [CrossRef]
- Choi, H.M.; Kim, M.K.; Yang, H. Deep-learning model for sea surface temperature prediction near the Korean Peninsula. Deep Sea Research Part II: Topical Studies in Oceanography 2023, 208, 105262. [Google Scholar] [CrossRef]
- van Sebille, E.; Aliani, S.; Law, K.L.; Maximenko, N.; Alsina, J.M.; Bagaev, A.; Bergmann, M.; Chapron, B.; Chubarenko, I.; Cózar, A.; et al. The physical oceanography of the transport of floating marine debris. Environmental research letters 2020, 15, 23003–32. [Google Scholar] [CrossRef]
- Aijaz, S.; Colberg, F.; Brassington, G.B. Lagrangian and Eulerian modelling of river plumes in the Great Barrier Reef system, Australia. Ocean Modelling 2024, 188, 102310. [Google Scholar] [CrossRef]
- Yadav, H.; Thakkar, A. NOA-LSTM: An efficient LSTM cell architecture for time series forecasting. Expert Systems with Applications 2024, 238, 122333. [Google Scholar] [CrossRef]
- Naderalvojoud, B.; Hernandez-Boussard, T. Improving machine learning with ensemble learning on observational healthcare data. AMIA Annu Symp Proc 2024, 2023, 521–529. [Google Scholar] [PubMed]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.u.; Polosukhin, I. Attention is All you Need. Advances in Neural Information Processing Systems; Guyon, I.; Luxburg, U.V.; Bengio, S.; Wallach, H.; Fergus, R.; Vishwanathan, S.; Garnett, R., Eds. Curran Associates, Inc., 2017, Vol. 30.
| 1 | |
| 2 | |
| 3 | |
| 4 | |
| 5 | This is distinct from the case where NaNs were ignored (Section 2.5) |
















| Metric | Mean | Std Dev | IQR |
|---|---|---|---|
| MAE | 0.141 | 0.226 | 0.058 |
| MSE | 0.116 | 0.513 | 0.010 |
| RMSE | 0.179 | 0.291 | 0.053 |
| Metric | Mean | Std Dev | IQR |
|---|---|---|---|
| MAE | 0.144 | 0.134 | 0.141 |
| MSE | 0.064 | 0.109 | 0.073 |
| RMSE | 0.183 | 0.175 | 0.212 |
| Metric | Mean | Std Dev | IQR |
|---|---|---|---|
| MAE | 0.148 | 0.222 | 0.067 |
| MSE | 0.116 | 0.503 | 0.016 |
| RMSE | 0.187 | 0.285 | 0.070 |
| Metric | Mean | Std Dev | IQR |
|---|---|---|---|
| MAE | 0.145 | 0.138 | 0.149 |
| MSE | 0.066 | 0.112 | 0.066 |
| RMSE | 0.184 | 0.179 | 0.202 |
| Metric | Mean | Std Dev | IQR |
|---|---|---|---|
| MAE | 1.031 | 2.118 | 0.299 |
| MSE | 14.765 | 40.758 | 0.243 |
| RMSE | 1.634 | 3.478 | 0.397 |
| Metric | Mean | Std Dev | IQR |
|---|---|---|---|
| MAE | 2.622 | 5.507 | 0.506 |
| MSE | 97.858 | 253.429 | 0.860 |
| RMSE | 4.239 | 8.938 | 0.844 |
| Metric | Mean | Std Dev | IQR |
|---|---|---|---|
| MAE | 1.051 | 2.112 | 0.568 |
| MSE | 14.772 | 40.773 | 0.466 |
| RMSE | 1.651 | 3.471 | 0.586 |
| Metric | Mean | Std Dev | IQR |
|---|---|---|---|
| MAE | 2.653 | 5.502 | 0.537 |
| MSE | 97.980 | 253.623 | 1.144 |
| RMSE | 4.268 | 8.931 | 0.991 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).