Submitted: 01 July 2025
Posted: 02 July 2025
Abstract
Keywords:
1. Introduction
1.1. Maritime Security Threats and Illegal Activities
1.1.1. Illegal Migration and Border Crossings
1.1.2. Drug Trafficking and Human Smuggling
1.1.3. Illegal Fishing
1.1.4. Environmental Threats and Marine Pollution
1.2. Limitations of Traditional Surveillance Approaches
- Coverage Gaps: Traditional surveillance systems offer limited coverage, particularly in deep-sea areas where patrol infrastructure is insufficient.
- Evasion Tactics: Malicious actors, often operating within organised crime groups, use technology to manipulate AIS data, operate without transponders, or exploit blind spots in satellite coverage.
- Data Overload and Latency: Sea-mounted sensors and satellite systems generate data at a volume and speed that human operators cannot process, which delays responses to threats or causes threats to be missed entirely.
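Evasion tactics such as switching off a transponder or falsifying positions often leave traces in the AIS message stream itself. The following is a minimal sketch, not a method taken from this paper, that flags two simple indicators in a time-ordered track: long reporting gaps and position jumps that imply an implausible speed. The record layout, the function names `haversine_nm` and `flag_ais_anomalies`, and the thresholds `max_gap_s` and `max_speed_kn` are illustrative assumptions.

```python
import math
from typing import List, Tuple

# One AIS fix as (unix_time_s, latitude_deg, longitude_deg).
# This layout is a hypothetical simplification of a real AIS message.
Fix = Tuple[float, float, float]


def haversine_nm(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance between two points, in nautical miles."""
    r_nm = 3440.065  # mean Earth radius expressed in nautical miles
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * r_nm * math.asin(math.sqrt(a))


def flag_ais_anomalies(
    track: List[Fix],
    max_gap_s: float = 3600.0,    # assumed threshold: one hour of silence
    max_speed_kn: float = 50.0,   # assumed threshold: implausible for most vessels
) -> List[Tuple[int, str]]:
    """Return (index, reason) pairs for suspicious consecutive fixes."""
    flags = []
    for i in range(1, len(track)):
        t0, lat0, lon0 = track[i - 1]
        t1, lat1, lon1 = track[i]
        dt = t1 - t0
        if dt <= 0:
            continue  # skip duplicate or out-of-order messages
        if dt > max_gap_s:
            flags.append((i, "gap"))
        speed_kn = haversine_nm(lat0, lon0, lat1, lon1) / (dt / 3600.0)
        if speed_kn > max_speed_kn:
            flags.append((i, "jump"))
    return flags


if __name__ == "__main__":
    demo = [(0.0, 0.0, 0.0), (600.0, 0.0, 0.05), (8000.0, 0.0, 0.1), (8600.0, 0.0, 2.0)]
    print(flag_ais_anomalies(demo))
```

Rule-based checks like these only cover the simplest cases; they are shown here to make the threat concrete, not as a substitute for the learned models surveyed later.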
1.3. Scope and Objectives of the Paper
2. Deep Learning for Maritime Object Detection and Tracking
2.1. Vessel Detection
2.1.1. Satellite Imagery
2.1.2. Aerial Imagery
2.1.3. Surface Imagery
2.1.4. Radar Data
2.1.5. AIS Data
2.1.6. Integration of Data from Different Sources
2.1.7. Challenges in Vessel Detection
2.2. Anomaly Detection in Vessel Behaviour
2.2.1. Analysing AIS Data for Unusual Patterns
2.2.2. Combining AIS with Contextual Information
2.2.3. Using Sequence-Based Models
3. Deep Learning for Maritime Surveillance and Situational Awareness
3.1. Maritime Image and Video Analysis
3.1.1. Event and Activity Recognition
3.1.2. Scene Understanding and Context
3.1.3. Video Surveillance for Tracking and Anomaly
3.2. Fusion of Multi-Sensor Data
3.2.1. Fusion Architecture
3.2.2. Sensor-Specific Examples
3.3. Maritime Domain Awareness Systems Using Deep Learning
3.3.1. Decision Support and Visualisation
4. Deep Learning for Specific Maritime Security Applications
4.1. Illegal Fishing Detection
4.2. Piracy and Armed Robbery Prevention
4.3. Smuggling and Trafficking Detection
4.4. Maritime Environmental Monitoring
4.5. Safety, Search and Rescue Operations
5. Key Sources for Deep Learning in Maritime Domain
6. Challenges and Future Directions
7. Discussion and Conclusion
Author Contributions
Conflicts of Interest
References
- European External Action Service. Maritime Security, 2025. Accessed: 2025-06-24.
- Kowalski, M.; Pałka, N.; Młyńczak, J.; Karol, M.; Czerwińska, E.; Życzkowski, M.; Ciurapiński, W.; Zawadzki, Z.; Brawata, S. Detection of inflatable boats and people in thermal infrared with deep learning methods. Sensors 2021, 21, 5330.
- Galdelli, A.; Narang, G.; Pietrini, R.; Zazzarini, M.; Fiorani, A.; Tassetti, A.N. Multimodal AI-enhanced ship detection for mapping fishing vessels and informing on suspicious activities. Pattern Recognition Letters 2025, 191, 15–22.
- Guan, Y.; Zhang, X.; Chen, S.; Liu, G.; Jia, Y.; Zhang, Y.; Gao, G.; Zhang, J.; Li, Z.; Cao, C. Fishing vessel classification in SAR images using a novel deep learning model. IEEE Transactions on Geoscience and Remote Sensing 2023, 61, 1–21.
- Ventikos, N.P.; Koimtzoglou, A.; Michelis, A.; Stouraiti, A.; Kopsacheilis, I.; Podimatas, V. A Bayesian network-based tool for crisis classification in piracy or armed robbery incidents on passenger ships. Proceedings of the Institution of Mechanical Engineers, Part M: Journal of Engineering for the Maritime Environment 2024, 238, 251–261.
- Trujillo-Acatitla, R.; Tuxpan-Vargas, J.; Ovando-Vázquez, C.; Monterrubio-Martínez, E. Marine oil spill detection and segmentation in SAR data with two steps deep learning framework. Marine Pollution Bulletin 2024, 204, 116549.
- Gamage, C.; Dinalankara, R.; Samarabandu, J.; Subasinghe, A. A comprehensive survey on the applications of machine learning techniques on maritime surveillance to detect abnormal maritime vessel behaviors. WMU Journal of Maritime Affairs 2023, 22, 447–477.
- Bentes, C.; Velotto, D.; Tings, B. Ship classification in TerraSAR-X images with convolutional neural networks. IEEE Journal of Oceanic Engineering 2017, 43, 258–266.
- Wang, S.; Kim, B. Scale-Sensitive Attention for Multi-Scale Maritime Vessel Detection Using EO/IR Cameras. Applied Sciences 2024, 14, 11604.
- Jiang, X.; Liu, T.; Song, T.; Cen, Q. Optimized Marine Target Detection in Remote Sensing Images with Attention Mechanism and Multi-Scale Feature Fusion. Information 2025, 16, 332.
- Mujtaba, D.F.; Mahapatra, N.R. Deep Learning for Spatiotemporal Modeling of Illegal, Unreported, and Unregulated Fishing Events. In Proceedings of the 2022 International Conference on Computational Science and Computational Intelligence (CSCI); 2022; pp. 423–425.
- Yang, D.; Solihin, M.I.; Ardiyanto, I.; Zhao, Y.; Li, W.; Cai, B.; Chen, C. A streamlined approach for intelligent ship object detection using EL-YOLO algorithm. Scientific Reports 2024, 14, 15254.
- Karst, J.; McGurrin, R.; Gavin, K.; Luttrell, J.; Rippy, W.; Coniglione, R.; McKenna, J.; Riedel, R. Enhancing Maritime Domain Awareness Through AI-Enabled Acoustic Buoys for Real-Time Detection and Tracking of Fast-Moving Vessels. Sensors 2025, 25, 1930.
- Yan, H.; Chen, C.; Jin, G.; Zhang, J.; Wang, X.; Zhu, D. Implementation of a Modified Faster R-CNN for Target Detection Technology of Coastal Defense Radar. Remote Sensing 2021, 13.
- Hu, H.; Zhou, W.; Jiang, B.; Zhang, J.; Cheng, T. Exploring deep learning techniques for the extraction of lit fishing vessels from Luojia1-01. Ecological Indicators 2024, 159, 111682.
- Ding, J.; Li, W.; Pei, L.; Yang, M.; Ye, C.; Yuan, B. Sw-YoloX: An anchor-free detector based transformer for sea surface object detection. Expert Systems with Applications 2023, 217, 119560.
- Walsh, P.W.; Cuibus, M.V. People crossing the English Channel in small boats. Briefing, Migration Observatory, University of Oxford, 2025. Accessed 2025-06-25.
- Government of Canada. Illegal, Unreported and Unregulated (IUU) Fishing, 2019.
- Cheng, X.; Wang, J.; Chen, X.; Zhang, F. Attention-enhanced and integrated deep learning approach for fishing vessel classification based on multiple features. Scientific Reports 2025, 15, 8642.
- Burgherr, P. In-depth analysis of accidental oil spills from tankers in the context of global spill trends from all sources. Journal of Hazardous Materials 2007, 140, 245–256.
- Bui, N.A.; Oh, Y.; Lee, I. Oil spill detection and classification through deep learning and tailored data augmentation. International Journal of Applied Earth Observation and Geoinformation 2024, 129, 103845.
- Qu, J.; Gao, Y.; Lu, Y.; Xu, W.; Liu, R.W. Deep learning-driven surveillance quality enhancement for maritime management promotion under low-visibility weathers. Ocean & Coastal Management 2023, 235, 106478.
- Baswaid, M.H.; Darir, F.F.F.; Qin, C.Y.; Sofian, A.P.; Amin, N. Deep Learning-Based Ship Detection: Enhancing Maritime Surveillance with Convolutional Neural Networks 2025.
- Dimitrov, T. Applying Artificial Intelligence for improving Situational awareness and Threat monitoring at sea as key factor for success in Naval operation. In Proceedings of the ENVIRONMENT. TECHNOLOGIES. RESOURCES. Proceedings of the International Scientific and Practical Conference, 2024, Vol. 4, pp. 49–55.
- Kanjir, U.; Greidanus, H.; Oštir, K. Vessel detection and classification from spaceborne optical images: A literature survey. Remote Sensing of Environment 2018, 207, 1–26.
- Ahmed, M.; El-Sheimy, N.; Leung, H. Dual-Modal Approach for Ship Detection: Fusing Synthetic Aperture Radar and Optical Satellite Imagery. Sensors 2025, 25, 329.
- Jeon, I.; Ham, S.; Cheon, J.; Klimkowska, A.M.; Kim, H.; Choi, K.; Lee, I. A real-time drone mapping platform for marine surveillance. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences 2019, 42, 385–391.
- Park, J.J.; Park, K.A.; Kim, T.S.; Oh, S.; Lee, M. Aerial hyperspectral remote sensing detection for maritime search and surveillance of floating small objects. Advances in Space Research 2023, 72, 2118–2136.
- Zhang, Z.; Lu, X.; Cao, G.; Yang, Y.; Jiao, L.; Liu, F. ViT-YOLO: Transformer-based YOLO for object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 2799–2808.
- PhiliP-Kpae, F.O.; Ogbondamati, L.E.; Ebri, K.F.E. Evaluating Marine Radar Object Detection System Using Yolo-Based Deep Learning Algorithm. Direct Research Journal of Engineering and Information Technology 2025, 13, 7–15.
- International Maritime Organization. Automatic Identification Systems (AIS) transponders. https://www.imo.org/en/OurWork/Safety/Pages/AIS.aspx, 2025. Accessed: 2025-06-09.
- Murray, B.; Perera, L.P. An AIS-based deep learning framework for regional ship behavior prediction. Reliability Engineering & System Safety 2021, 215, 107819.
- V7 Labs. Multimodal Deep Learning: Definition, Examples, Applications. https://www.v7labs.com/blog/multimodal-deep-learning-guide, 2025. Accessed: 2025-06-09.
- Zhang, Q.; Wang, L.; Meng, H.; Zhang, Z.; Yang, C. Ship Detection in Maritime Scenes under Adverse Weather Conditions. Remote Sensing 2024, 16.
- Chen, X.; Wei, C.; Xin, Z.; Zhao, J.; Xian, J. Ship Detection under Low-Visibility Weather Interference via an Ensemble Generative Adversarial Network. Journal of Marine Science and Engineering 2023, 11.
- Riveiro, M.J. Visual analytics for maritime anomaly detection. PhD thesis, Örebro universitet, 2011.
- Riveiro, M.; Pallotta, G.; Vespe, M. Maritime anomaly detection: A review. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 2018, 8, e1266.
- Nguyen, D.; Vadaine, R.; Hajduch, G.; Garello, R.; Fablet, R. GeoTrackNet—A maritime anomaly detector using probabilistic neural network representation of AIS tracks and a contrario detection. IEEE Transactions on Intelligent Transportation Systems 2021, 23, 5655–5667.
- Pradipta, G.A.; Wardoyo, R.; Musdholifah, A.; Sanjaya, I.N.H.; Ismail, M. SMOTE for Handling Imbalanced Data Problem: A Review. In Proceedings of the 2021 Sixth International Conference on Informatics and Computing (ICIC); 2021; pp. 1–8.
- Liu, F.T.; Ting, K.M.; Zhou, Z.H. Isolation Forest. In Proceedings of the 2008 Eighth IEEE International Conference on Data Mining; 2008; pp. 413–422.
- González-Muñiz, A.; Díaz, I.; Cuadrado, A.A.; García-Pérez, D.; Pérez, D. Two-step residual-error based approach for anomaly detection in engineering systems using variational autoencoders. Computers and Electrical Engineering 2022, 101, 108065.
- Wijaya, W.M.; Nakamura, Y. Loitering behavior detection by spatiotemporal characteristics quantification based on the dynamic features of Automatic Identification System (AIS) messages. PeerJ Computer Science 2023, 9, e1572.
- Nguyen, D.; Vadaine, R.; Hajduch, G.; Garello, R.; Fablet, R. GeoTrackNet—A maritime anomaly detector using probabilistic neural network representation of AIS tracks and a contrario detection. IEEE Transactions on Intelligent Transportation Systems 2021, 23, 5655–5667.
- Duan, H.; Ma, F.; Miao, L.; Zhang, C. A semi-supervised deep learning approach for vessel trajectory classification based on AIS data. Ocean & Coastal Management 2022, 218, 106015.
- Maganaris, C.; Protopapadakis, E.; Doulamis, N. Outlier detection in maritime environments using AIS data and deep recurrent architectures. In Proceedings of the 17th International Conference on PErvasive Technologies Related to Assistive Environments, 2024, pp. 420–427.
- Rong, H.; Teixeira, A.; Guedes Soares, C. A framework for ship abnormal behaviour detection and classification using AIS data. Reliability Engineering & System Safety 2024, 247, 110105.
- Wolsing, K.; Roepert, L.; Bauer, J.; Wehrle, K. Anomaly Detection in Maritime AIS Tracks: A Review of Recent Approaches. Journal of Marine Science and Engineering 2022, 10.
- Minßen, F.M.; Klemm, J.; Steidel, M.; Niemi, A. Predicting Vessel Tracks in Waterways for Maritime Anomaly Detection. Transactions on Maritime Science 2024, 13.
- Martinčič, T.; Štepec, D.; Costa, J.P.; Čagran, K.; Chaldeakis, A. Vessel and Port Efficiency Metrics through Validated AIS data. In Proceedings of the Global Oceans 2020: Singapore – U.S. Gulf Coast; 2020; pp. 1–6.
- Radon, A.N.; Wang, K.; Glässer, U.; Wehn, H.; Westwell-Roper, A. Contextual verification for false alarm reduction in maritime anomaly detection. In Proceedings of the 2015 IEEE International Conference on Big Data (Big Data); 2015; pp. 1123–1133.
- Su, L.; Zuo, X.; Li, R.; Wang, X.; Zhao, H.; Huang, B. A systematic review for transformer-based long-term series forecasting. Artificial Intelligence Review 2025, 58, 80.
- Capobianco, S.; Millefiori, L.M.; Forti, N.; Braca, P.; Willett, P. Deep learning methods for vessel trajectory prediction based on recurrent neural networks. IEEE Transactions on Aerospace and Electronic Systems 2021, 57, 4329–4346.
- Nguyen, D.; Vadaine, R.; Hajduch, G.; Garello, R.; Fablet, R. GeoTrackNet-A Maritime Anomaly Detector using Probabilistic Neural Network Representation of AIS Tracks and A Contrario Detection. CoRR 2019, abs/1912.00682, [1912.00682].
- Petković, M. ENHANCING MARITIME VIDEO SURVEILLANCE TROUGH DEEP LEARNING AND HYBRID DISTANCE ESTIMATION. PhD thesis, University of Split. Faculty of Maritime Studies. Department of maritime …, 2024.
- Seong, N.; Kim, J.; Lim, S. Graph-Based Anomaly Detection of Ship Movements Using CCTV Videos. Journal of Marine Science and Engineering 2023, 11, 1956.
- Xue, H.; Chen, X.; Zhang, R.; Wu, P.; Li, X.; Liu, Y. Deep learning-based maritime environment segmentation for unmanned surface vehicles using superpixel algorithms. Journal of Marine Science and Engineering 2021, 9, 1329.
- Bilous, N.; Malko, V.; Frohme, M.; Nechyporenko, A. Comparison of CNN-Based Architectures for Detection of Different Object Classes. AI 2024, 5, 2300–2320.
- Matasci, G.; Plante, J.; Kasa, K.; Mousavi, P.; Stewart, A.; Macdonald, A.; Webster, A.; Busler, J. Deep learning for vessel detection and identification from spaceborne optical imagery. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences 2021, 3, 303–310.
- Huang, Y.; Han, D.; Han, B.; Wu, Z. ADV-YOLO: improved SAR ship detection model based on YOLOv8. The Journal of Supercomputing 2025, 81, 34.
- Li, X. Ship segmentation via combined attention mechanism and efficient channel attention high-resolution representation network. Journal of Marine Science and Engineering 2024, 12, 1411.
- Xing, Z.; Ren, J.; Fan, X.; Zhang, Y. S-DETR: A transformer model for real-time detection of marine ships. Journal of Marine Science and Engineering 2023, 11, 696.
- Guo, L.; Wang, Y.; Guo, M.; Zhou, X. YOLO-IRS: Infrared Ship Detection Algorithm Based on Self-Attention Mechanism and KAN in Complex Marine Background. Remote Sensing 2024, 17, 20.
- Farahnakian, F.; Heikkonen, J. Deep learning based multi-modal fusion architectures for maritime vessel detection. Remote Sensing 2020, 12, 2509.
- Kalliovaara, J.; Jokela, T.; Asadi, M.; Majd, A.; Hallio, J.; Auranen, J.; Seppänen, M.; Putkonen, A.; Koskinen, J.; Tuomola, T.; et al. Deep learning test platform for maritime applications: Development of the em/s salama unmanned surface vessel and its remote operations center for sensor data collection and algorithm development. Remote Sensing 2024, 16, 1545.
- Lu, Y.; Yang, K.; Yang, D.; Ding, H.; Weng, J.; Liu, R.W. Graph Learning-Driven Multi-Vessel Association: Fusing Multimodal Data for Maritime Intelligence. arXiv preprint arXiv:2504.09197 2025.
- Lu, Y.; Ma, H.; Smart, E.; Vuksanovic, B.; Chiverton, J.; Prabhu, S.R.; Glaister, M.; Dunston, E.; Hancock, C. Fusion of camera-based vessel detection and ais for maritime surveillance. In Proceedings of the 2021 26th International Conference on Automation and Computing (ICAC). IEEE, 2021, pp. 1–6.
- MIT Sea Grant Autonomous Underwater Vehicles Lab. AUV Lab – Marine Perception Datasets (AUVLab). https://seagrant.mit.edu/auvlab-datasets-marine-perception-2-3/, 2022. Accessed: 2025-06-09.
- Protopapadakis, E.; Voulodimos, A.; Doulamis, A.; Doulamis, N.; Dres, D.; Bimpas, M. Stacked autoencoders for outlier detection in over-the-horizon radar signals. Computational Intelligence and Neuroscience 2017, 2017, 5891417.
- Yao, S.; Guan, R.; Wu, Z.; Ni, Y.; Huang, Z.; Liu, R.W.; Yue, Y.; Ding, W.; Lim, E.G.; Seo, H.; et al. Waterscenes: A multi-task 4d radar-camera fusion dataset and benchmarks for autonomous driving on water surfaces. IEEE Transactions on Intelligent Transportation Systems 2024.
- Varga, M.; Liggett, K.K.; Bivall, P.; Lavigne, V.; [other contributors]. Exploratory Visual Analytics (STO-TR-IST-141). Technical Report 2023.
- FAO. State Of Worlds Fisheries And Aquaculture 2002. Food and Agriculture Organization of the United Nations: Rome, Italy 2020.
- Jin, M.; Shi, W.; Lin, K.C.; Li, K.X. Marine piracy prediction and prevention: Policy implications. Marine Policy 2019, 108, 103528.
- Li, H.; Yang, Z. Towards safe navigation environment: The imminent role of spatio-temporal pattern mining in maritime piracy incidents analysis. Reliability Engineering & System Safety 2023, 238, 109422.
- Fahreza, M.I.; Hirata, E. Maritime piracy and armed robbery analysis in the Straits of Malacca and Singapore through the utilization of natural language processing. Maritime Policy & Management 2024, pp. 1–14.
- Hu, Z.; Sun, Y.; Zhao, Y.; Wu, W.; Gu, Y.; Chen, K. Msif-Sstr: A Ship Smuggling Trajectory Recognition Method Based on Multi-Source Information Fusion. Applied Ocean Research 2025.
- Sun, Z.; Yang, Q.; Yan, N.; Chen, S.; Zhu, J.; Zhao, J.; Sun, S. Utilizing deep learning algorithms for automated oil spill detection in medium resolution optical imagery. Marine Pollution Bulletin 2024, 206, 116777.
- Deo, R.; John, C.M.; Zhang, C.; Whitton, K.; Salles, T.; Webster, J.M.; Chandra, R. Deepdive: Leveraging Pre-trained Deep Learning for Deep-Sea ROV Biota Identification in the Great Barrier Reef. Scientific Data 2024, 11, 957.
- Taipalmaa, J.; Raitoharju, J.; Queralta, J.P.; Westerlund, T.; Gabbouj, M. On automatic person-in-water detection for marine search and rescue operations. IEEE Access 2024.
- Wang, S.; Han, Y.; Chen, J.; Zhang, Z.; Wang, G.; Du, N. A deep-learning-based sea search and rescue algorithm by UAV remote sensing. In Proceedings of the 2018 IEEE CSAA Guidance, Navigation and Control Conference (CGNCC). IEEE, 2018, pp. 1–5.
- Moosbauer, S.; Konig, D.; Jakel, J.; Teutsch, M. A benchmark for deep learning based object detection in maritime environments. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019, pp. 0–0.
- Kiefer, B.; Kristan, M.; Perš, J.; Žust, L.; Poiesi, F.; Andrade, F.; Bernardino, A.; Dawkins, M.; Raitoharju, J.; Quan, Y.; et al. 1st workshop on maritime computer vision (macvi) 2023: Challenge results. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023, pp. 265–302.
- AISHub. AIS data sharing and vessel tracking by AISHub. https://www.aishub.net/, 2025. Accessed: 2025-06-27.
- Al-Saad, M.; Aburaed, N.; Panthakkan, A.; Al Mansoori, S.; Al Ahmad, H.; Marshall, S. Airbus ship detection from satellite imagery using frequency domain learning. In Proceedings of the Image and Signal Processing for Remote Sensing XXVII. SPIE, 2021, Vol. 11862, pp. 279–285.
- Shao, Z.; Wu, W.; Wang, Z.; Du, W.; Li, C. Seaships: A large-scale precisely annotated dataset for ship detection. IEEE Transactions on Multimedia 2018, 20, 2593–2604.
- Huang, W.; Feng, H.; Xu, H.; Liu, X.; He, J.; Gan, L.; Wang, X.; Wang, S. Surface Vessels Detection and Tracking Method and Datasets with Multi-Source Data Fusion in Real-World Complex Scenarios. Sensors 2025, 25, 2179.
- Nanda, A.; Cho, S.W.; Lee, H.; Park, J.H. KOLOMVERSE: Korea Open Large-Scale Image Dataset for Object Detection in the Maritime Universe. IEEE Transactions on Intelligent Transportation Systems 2024.
- Wei, S.; Zeng, X.; Qu, Q.; Wang, M.; Su, H.; Shi, J. HRSID: A high-resolution SAR images dataset for ship detection and instance segmentation. IEEE Access 2020, 8, 120234–120254.
- Chen, S.Q.; Zhan, R.H.; Zhang, J. Robust single stage detector based on two-stage regression for SAR ship detection. In Proceedings of the 2nd International Conference on Innovation in Artificial Intelligence, 2018, pp. 169–174.
- Watson, R.A. A database of global marine commercial, small-scale, illegal and unreported fisheries catch 1950–2014. Scientific Data 2017, 4, 1–9.
- Blondeau-Patissier, D.; Schroeder, T.; Suresh, G.; Li, Z.; Diakogiannis, F.I.; Irving, P.; Witte, C.; Steven, A.D. Detection of marine oil-like features in Sentinel-1 SAR images by supplementary use of deep learning and empirical methods: Performance assessment for the Great Barrier Reef marine park. Marine Pollution Bulletin 2023, 188, 114598.



| Model | Architecture Type | Application | Strengths | Weaknesses | Reported Performance |
|---|---|---|---|---|---|
| YOLO (v4–v8) [57] | CNN (single-shot, anchor-based) | Ship/Object Detection | Very fast inference; high accuracy; real-time capable. | Anchor design requires tuning; may miss very small objects. | mAP=, F1= (on general object benchmarks) |
| RetinaNet [58] | CNN (one-stage, FPN) | Ship/Object Detection | Handles class imbalance with focal loss; high detection accuracy. | Comparatively heavy; slower than YOLO; may struggle on tiny targets. | F1= on multiscale spaceborne dataset |
| CNN-MR [8] | CNN (multi-resolution input) | SAR Ship Classification | Utilizes multi-scale SAR inputs for richer features; excellent classification. | Requires multi-resolution SAR data; more complex input. | F1=0.94 |
| EL-YOLO [12] | CNN (YOLOv8 variant) | Ship/Object Detection (RGB) | Lightweight YOLOv8 variant; improved bounding box regression (AWIoU, SMFN); better small object performance. | Still CNN-heavy; many components to tune. | = 0.672, = 0.348 on SeaShips (significant gain over YOLOv3-tiny) |
| ADV-YOLO [59] | CNN (YOLOv8 variant) | SAR Ship Detection | Enhanced for SAR: space-to-depth and dilation modules; uses WIoU loss. | May be heavyweight; specialised to SAR imagery. | HRSID: (+4.5% vs. YOLOv8n); SSDD: +1.1%. |
| CA2HRNet [60] | CNN (HRNet with attention) | Ship Segmentation/Detection | High-resolution feature extraction with combined channel/spatial attention; achieves very high accuracy and IoU. | Computationally heavy (segmentation network); specialised. | Accuracy=99.77%, F1=97.0%, IoU=96.97% |
| S-DETR [61] | Transformer (DETR-based) | Ship/Object Detection | End-to-end detection; built-in scale attention and dense queries for multi-scale ships; comparable speed to single-stage models. | Higher complexity; slow convergence; needs many epochs. | Achieves state-of-the-art multi-scale detection in trials (real-time capable) |
| YOLO-IRS [62] | CNN+Transformer (Swin) | IR Ship Detection | YOLOv10-based IR model with Swin transformer backbone; better small/weak target detection, anti-interference. | Slightly higher complexity; still emerging research. | +1.3% precision, +0.5% , +1.7% vs. YOLOv10 |
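
Several entries in the table above report IoU-based figures: mAP for detectors is computed against box-overlap thresholds, and the IoU listed for the segmentation network is the pixel-wise analogue. As a minimal sketch of the underlying quantity, the following computes intersection over union for two axis-aligned boxes. The `(x_min, y_min, x_max, y_max)` box format and the function name `box_iou` are assumptions made for illustration.

```python
from typing import Sequence


def box_iou(a: Sequence[float], b: Sequence[float]) -> float:
    """Intersection over union of two boxes given as (x_min, y_min, x_max, y_max)."""
    # Corners of the overlap rectangle (may be empty).
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


if __name__ == "__main__":
    # Two 2x2 boxes overlapping in a 1x1 square: IoU = 1 / (4 + 4 - 1) = 1/7.
    print(box_iou((0, 0, 2, 2), (1, 1, 3, 3)))
```

A prediction is usually counted as a true positive only when its IoU with a ground-truth box exceeds a chosen threshold, which is why small vessels, where a few pixels of offset change the IoU sharply, are harder to score well on.
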

| Fusion Type | Sensors Combined | Techniques Used | Applications | Performance Highlights |
|---|---|---|---|---|
| Early Fusion [63] | RGB (EO)+IR imagery | CNN (concatenate inputs) | Vessel detection in visible/thermal | Fusing raw pixel data allows CNN to learn combined features; robust in mixed lighting. |
| Mid Fusion [63] | RGB+IR imagery | CNN (feature-level fusion) | Vessel detection across modalities | Multi-modal mid-fusion gave highest accuracy: AP= (daytime) and 61.6% (night), outperforming uni-modal. |
| Late Fusion [63] | RGB+IR imagery | CNN (separate branches) | Ensemble detection/classification | Decision-level fusion improves robustness; effectively integrates complementary IR and RGB cues. |
| Mid Fusion [68] | AIS+Marine Radar | RNN, CNN | Vessel behaviour classification | Learns spatiotemporal patterns from trajectories and radar; showed moderate precision (data-limited) in identifying vessel status. |
| Association (graph) [65] | AIS+EO Video (CCTV) | GNN with attention | Multi-target vessel association | Graph-based fusion with spatiotemporal attention improved association accuracy and robustness. |
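
The early, mid, and late fusion rows above differ only in where the two modalities are combined: at the raw input, at the feature level, or at the decision level. The sketch below illustrates those three combination points with deliberately trivial stand-ins. `toy_features` and `toy_score` are hypothetical placeholders for a CNN backbone and a detection head, not the architectures evaluated in [63], and the inputs are flat lists of normalised pixel intensities.

```python
from typing import List, Sequence


def toy_features(pixels: Sequence[float]) -> List[float]:
    """Placeholder for a CNN backbone: returns [mean, max] of the input."""
    return [sum(pixels) / len(pixels), max(pixels)]


def toy_score(features: Sequence[float]) -> float:
    """Placeholder for a detection head: mean of the features, clipped to [0, 1]."""
    s = sum(features) / len(features)
    return min(1.0, max(0.0, s))


def early_fusion(rgb: Sequence[float], ir: Sequence[float]) -> float:
    # Concatenate the raw inputs, then run a single backbone and head.
    return toy_score(toy_features(list(rgb) + list(ir)))


def mid_fusion(rgb: Sequence[float], ir: Sequence[float]) -> float:
    # Run one backbone per modality, concatenate the feature vectors,
    # then run a shared head on the fused features.
    return toy_score(toy_features(rgb) + toy_features(ir))


def late_fusion(rgb: Sequence[float], ir: Sequence[float]) -> float:
    # Run a full pipeline per modality and average the resulting scores.
    return 0.5 * (toy_score(toy_features(rgb)) + toy_score(toy_features(ir)))


if __name__ == "__main__":
    night_rgb = [0.1, 0.1, 0.1, 0.1]   # dark visible-band patch
    night_ir = [0.2, 0.9, 0.8, 0.1]    # warm target visible in thermal
    for fuse in (early_fusion, mid_fusion, late_fusion):
        print(fuse.__name__, round(fuse(night_rgb, night_ir), 3))
```

In a real system the placeholders would be learned networks, and the choice of fusion point trades off how much cross-modal interaction the model can learn against how independently each branch can be trained and replaced.
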

| Model | Architecture Type | Application | Strengths | Weaknesses | Reported Performance |
|---|---|---|---|---|---|
| BiLSTM-CNN-Attention [19] | BiLSTM, CNN and attention mechanism | Illegal Fishing Detection | High accuracy; real-time capable; captures both past and future context in sequential data | Data bias problems; misclassifies stow-net vessels and gillnetters as illegal fishing trawlers | Accuracy≈74%, Precision=0.7562, Recall=0.7410, F1 Score=0.7408 |
| FishNet [4] | A combination of DenseNet, Feature Fusion (CNN-based module), and Multilevel Feature Aggregation | Fishing vessel classification | High accuracy | Longer training time | Accuracy≈90%, Precision=0.9017, Recall=0.8981, F1 Score=0.8971 |
| Stacked-YOLOv5 [15] | CNN (YOLOv5) | Lit fishing boat detection | Improved feature extraction and detection performance | Poor detection accuracy when lights from non-fishing vessels introduce noise | Precision=0.966, Recall=0.930, mAP@0.5=0.931, F1 Score=0.948 |
| YOLOv10s [3] | CNN (YOLOv10 small) | Dark vessel detection | Able to detect small ships; reduced architecture with unnecessary Conv and C2f layers removed | The proposed pipeline demands high computational resources. | Accuracy=0.8588, =0.6631, Precision=0.9370, Recall=0.9381, Specificity=0.9869 |
| YOLOv8m [75] | CNN (YOLOv8m) | Ship-to-ship smuggling detection | High accuracy; fusion of radar trajectories and the corresponding meteorological data | Higher complexity | F1=0.97, Accuracy=94% |
| Faster R-CNN with ResNet101 [2] | CNN, RNN (YOLOv2-v3, Faster R-CNN), feature extraction (GoogLeNet, ResNet18, ResNet50, and ResNet101) | Small inflatable smuggling boat detection | Faster R-CNN with ResNet101 achieves a high detection rate | Higher complexity; slow convergence; needs many epochs; reduced detection capability in varying environmental conditions | Accuracy=95%, mIoU=79% |
| Sw-YoloX [16] | CNN (Convolutional Block Attention Module, Atrous Spatial Pyramid Pooling) | Search and Rescue Operations | High accuracy | Requires pruning for lower weights to reduce memory overhead | F1=0.78, mAP=54, Recall=0.72 |
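
The table above reports precision, recall, and F1 side by side. For a single class, F1 is the harmonic mean of precision and recall, so the three figures can be cross-checked. The short sketch below does this; the function name `f1_score` is chosen here for illustration and is not tied to any particular library.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Lit fishing vessel detector [15]: reproduces the reported F1 of 0.948.
    print(round(f1_score(0.966, 0.930), 3))
    # BiLSTM-CNN-Attention [19]: gives about 0.7485, not the reported 0.7408.
    print(round(f1_score(0.7562, 0.7410), 4))
```

The figures for [15] are consistent with this formula. Those for [19] are not, which would be expected if the reported values are averaged over classes (an average of per-class F1 scores need not equal the harmonic mean of averaged precision and recall), although the table does not state how they were aggregated.
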

| Dataset Name | Sensor/Modality | Data Type | Annotations | Size/Scale | Limitations |
|---|---|---|---|---|---|
| WaterScenes [69] | Camera (RGB), 4D Radar, GPS/IMU | Image sequences (video) | 2D bounding boxes (camera), 3D point clusters (radar) | 54,120 RGB frames + radar scans; ∼200k object instances | Same locale (Singapore); weather range limited. |
| SeaDronesSee [81] | UAV RGB Video | Images & video | Bounding boxes (boats, people, flares); track IDs (multi/SOT) | 8,930 train + 3,750 test images (drones); includes full video clips for tracking | Mostly temperate marine conditions; daytime imagery |
| Airbus Ship Detection [83] | Satellite optical (SPOT) | Image chips | Pixel-wise ship masks (RLE) | 231,723 images, of which 81,723 contain at least one ship | Primarily daylight RGB; many empty frames; oriented masks |
| SeaShips [84] | Shore-based cameras (RGB) | Images | Bounding boxes + ship type (6 classes) | 31,455 images of coastal traffic | Fixed coastal perspectives; limited environmental diversity |
| SPSCD [85] | Port surveillance (RGB) | Images | Bounding boxes + ship class (12 types) | 19,337 images, 27,849 labeled ship instances | Focused on port environments; no AIS tracking |
| KOLOMVERSE [86] | UAV 4K images | Images | Bounding boxes (vessels) | 100,000+ 4K images of one class “boat” | Single object class (“boat”); access upon request |
| HRSID [87] | SAR imagery | Images | Bounding boxes (ships) | 5,604 high-res SAR images, 16,951 ship instances | SAR-only modality (requires specialised processing) |
| SSDD [88] | SAR imagery (Sentinel-1, TerraSAR-X) | Images | Bounding boxes (ships) | 2,752 SAR image chips (ships/non-ships) | Limited to SAR; chip-based (small images) |
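
The Airbus Ship Detection entry above lists pixel-wise masks stored as run-length encoding (RLE). The sketch below decodes such a string into a binary mask. It assumes the Kaggle-style convention of space-separated `start length` pairs with 1-indexed starts and pixels numbered down each column first; that convention, and the function name `rle_decode`, are assumptions to verify against the dataset documentation before use.

```python
from typing import List


def rle_decode(rle: str, height: int, width: int) -> List[List[int]]:
    """Decode 'start length start length ...' into a height x width 0/1 mask.

    Assumes 1-indexed starts and column-major pixel numbering
    (top to bottom within a column, then left to right).
    """
    flat = [0] * (height * width)  # column-major buffer
    tokens = [int(t) for t in rle.split()]
    for start, length in zip(tokens[0::2], tokens[1::2]):
        for k in range(start - 1, start - 1 + length):
            flat[k] = 1
    # In column-major order, flat index = col * height + row.
    return [[flat[c * height + r] for c in range(width)] for r in range(height)]


if __name__ == "__main__":
    for row in rle_decode("1 2 6 1", height=3, width=3):
        print(row)
```

An empty string decodes to an all-zero mask, which matches the table's note that many frames contain no ship.
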

| Dataset Name | Application | Size/Scale | Limitations |
|---|---|---|---|
| Global Fisheries Catch 1950–2014 [89] | A database of global marine commercial, small-scale, illegal and unreported fisheries catch 1950–2014 | Nearly 868 million records with 12 descriptive fields, structured in 5-year blocks starting from 1950 | Data can be heavily skewed toward certain regions or time periods, undermining representativeness |
| FishingVesselSAR [4] | SAR images for fishing vessel classification | 369 high-resolution SAR images (116 gillnetters, 72 seiners, and 181 trawlers) | Data can be heavily skewed toward certain regions or time periods, undermining representativeness |
| [15] | Nighttime SAR images for fishing vessel classification | 1,364 high-resolution SAR images of 1,281 lit fishing vessels | The sample dataset is relatively small, and lights from non-fishing vessels may introduce noise. |
| Maritime Piracy Incidents [72] | Structured data of piracy incidents | 8,369 records of piracy incidents from 1990–2021 | Dataset primarily focuses on high-risk areas, potentially overlooking other regions. |
| HS3-S2 [3] | SAR, Sentinel-2, and high-resolution optical images for detecting suspicious maritime activities | 69,331 images | Integrating multiple sources of satellite imagery increases the complexity of pre-processing and model training. Additionally, the varying resolutions of the images from different sources can pose challenges in standardising the input data for the detection model. |
| HN_BF [75] | Ship trajectories near Qiongzhou Strait in China from March to May 2024 | 5,337 labeled trajectories, including 1,473 labeled “Big flyer” and the rest “Normal” | Covers a single region, which may limit model generalisation outside that region. |
| CSIRO [90] | Oil spill detection dataset | 5,630 image chips: 3,725 chips of class 0 (no oil features) and 1,905 chips of class 1 (containing oil features) | Look-alike features such as wind shadows, reef structures, or biogenic slicks may increase the false positive rate of oil-like feature detection. |
| Oil spill [21] | Oil spill segmentation and classification dataset | 19,544 RGB images: 8,376 cropped images, 3,168 resized images, and 8,000 synthetic images | The dataset is imbalanced, with certain types of oil spills being underrepresented compared to others. The images come from various sources with different resolutions, which can affect the model’s performance. |
| Deepdive [77] | Deep-sea biota images captured by a remotely operated vehicle (ROV) | 4,158 images of deep-sea biota belonging to 62 different classes | The manual labeling process, despite rigorous quality control, may still introduce errors due to the complexity of deep-sea biota shapes and overlapping boundaries. |
| SeaDronesSee [81] | UAV videos for maritime surveillance, rescue operations, human detection in aquatic environments, drone-based vision research. | 54,000 images with 400,000 instances, with class labels such as boats, people, and buoys. | The dataset is synthetic, whereas the effectiveness of computer vision algorithms relies heavily on real-world training data. |
| SAR-HumanDetection-FinlandProper [78] | UAV images for maritime surveillance, rescue operations, human detection in aquatic environments, drone-based vision research. | 72,000 images with instances labeled with the positive class “swimming/floating person”. | The dataset lacks complex scenarios and varied weather, as all images were captured in daylight and clear summer weather, so models trained on it may perform poorly in real-world detection tasks. |
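
Several datasets in the table above are class-imbalanced; the CSIRO set, for example, has 3,725 chips without oil features against 1,905 with them. One common heuristic for compensating during training, shown here as an illustrative sketch and not as a technique reported by the cited papers, is to weight each class inversely to its frequency, w_c = N / (K · n_c), where N is the total sample count, K the number of classes, and n_c the count for class c. The function name `balanced_class_weights` is hypothetical.

```python
from typing import Dict, Hashable


def balanced_class_weights(counts: Dict[Hashable, int]) -> Dict[Hashable, float]:
    """Inverse-frequency weights w_c = N / (K * n_c) for each class c."""
    total = sum(counts.values())
    k = len(counts)
    return {label: total / (k * n) for label, n in counts.items()}


if __name__ == "__main__":
    # Class counts taken from the CSIRO row of the table.
    csiro = {"no_oil": 3725, "oil": 1905}
    for label, weight in balanced_class_weights(csiro).items():
        print(label, round(weight, 3))

    # HN_BF row: 5,337 trajectories, 1,473 of them labeled "Big flyer".
    hn_bf = {"big_flyer": 1473, "normal": 5337 - 1473}
    print(balanced_class_weights(hn_bf))
```

With these weights every class contributes the same total weight to the loss, so the minority class is no longer dominated by the majority class. Weighting does not address the other limitations listed, such as regional skew or look-alike features.
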

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).