Submitted: 09 May 2025
Posted: 12 May 2025
Abstract
Keywords:
1. Introduction
2. Fundamentals of Image Processing
2.1. Mathematical Representation and Processing Methods of Images
2.2. Feature Extraction and Representation of Images
3. Core Algorithms in Computer Vision
3.1. Image Segmentation and Edge Detection
3.2. Object Detection and Recognition
3.3. Feature Matching and Tracking
4. Applications of Image Processing and Computer Vision
5. Research Challenges and Development Trends
6. Conclusion
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).