Submitted:
27 May 2026
Posted:
28 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Study
3. Methodology
Data Representation and Preprocessing
Feature Extraction
Model Construction
Model Evaluation
Experimental Workflow
- Input Layer: Raw social media text is collected and fed into the preprocessing pipeline.
- Preprocessing: Tokenization, stopword removal, and lemmatization standardize text.
- Feature Extraction: TF-IDF and Word2Vec models convert text into numeric vectors.
- Classification: ML models (SVM, Logistic Regression, Random Forest) are trained and optimized.
- Prediction: The trained model predicts whether new input text contains hate speech.
- Evaluation: Metrics are computed to assess accuracy and robustness.
Mathematical Summary
Summary
4. Results and Discussion
5. Conclusions
Ethical Considerations
Acknowledgments
Conflicts of Interest
References
- Naseeb. Machine Learning- and Deep Learning-Based Multi-Model system for hate speech detection on Facebook. Algorithms 2025, 18, 331. [Google Scholar] [CrossRef]
- Hossain, M.A.; Traini, E.; Amenta, F. Machine Learning Applications for Diagnosing Parkinson’s Disease via Speech, Language, and Voice Changes: A Systematic review. Inventions 2025, 10, 48. [Google Scholar] [CrossRef]
- Pan, L. The importance of deep learning models in speech signal processing: Fundamentals, strategies, and future research directions. Int. J. Speech Technol. 2025. [Google Scholar] [CrossRef]
- Haboussi, S.; Oukas, N.; Zerrouki, T.; Djettou, H. Arabic speech recognition using neural networks: Concepts, literature review and challenges. J. Umm Al-Qura Univ. Appl. Sci. 2025. [Google Scholar] [CrossRef]
- Naz; Khan, H.U.; Bukhari, A.; Alshemaimri, B.; Daud, A.; Ramzan, M. Machine and deep learning for personality traits detection: A comprehensive survey and open research challenges. Artif. Intell. Rev. 2025, 58. [Google Scholar] [CrossRef]
- Zangl, M.; et al. A multidisciplinary analysis of transparent AI-driven toxicity detection tools for civic engagement platforms. In AI & Society; 2025. [Google Scholar] [CrossRef]
- Chowdhury, T. Decoding silent speech: A machine learning perspective on data, methods, and frameworks. In Neural Computing and Applications; 2025. [Google Scholar] [CrossRef]
- Shankar, R.; Bundele, A.; Mukhopadhyay, A. A Systematic review of Natural language processing Techniques for Early Detection of Cognitive Impairment. Mayo Clin. Proc. Digit. Health 2025, 3, 100205. [Google Scholar] [CrossRef]
- Najib, F.M. Sign language interpretation using machine learning and artificial intelligence. In Neural Computing and Applications; 2024. [Google Scholar] [CrossRef]
- Jalayer, R.; Jalayer, M.; Baniasadi, A. A review on sound source localization in Robotics: Focusing on deep learning methods. Appl. Sci. 2025, 15, 9354. [Google Scholar] [CrossRef]
- Amirgaliyev; Mussabek, M.; Rakhimzhanova, T.; Zhumadillayeva, A. A Review of Machine Learning and Deep Learning Methods for Person Detection, Tracking and Identification, and Face Recognition with Applications. Sensors 2025, 25, 1410. [Google Scholar] [CrossRef]
- Alsehaimi, A.; Babour; Alahmadi, D. Toward Transparent Modeling: A Scoping Review of Explainability for Arabic Sentiment Analysis. Appl. Sci. 2025, 15, 10659. [Google Scholar] [CrossRef]
- Dutta, S.; Banducci; Camargo, C.Q. Divided by discipline? A systematic literature review on the quantification of online sexism and misogyny using a semi-automated approach. Scientometrics 2025. [Google Scholar] [CrossRef] [PubMed]
- Rodrigo-Guillen, R.; Garcia-D’Urso, N.; Mora-Mora, H.; Azorin-Lopez, J. Detecting abnormal behavior events and gatherings in public spaces using Deep Learning: A review. J. Sens. Actuator Netw. 2025, 14, 69. [Google Scholar] [CrossRef]
- Rasool, N.; Bhat, J.I. Brain tumour detection using machine and deep learning: A systematic review. In Multimedia Tools and Applications; 2024. [Google Scholar] [CrossRef]
- Thapa, S. Large language models (LLM) in computational social science: Prospects, current state, and challenges. Soc. Netw. Anal. Min. 2025, 15. [Google Scholar] [CrossRef]






| Model | Accuracy (%) | Precision (%) | Recall (%) | F1-Score (%) |
| Logistic Regression | 91.2 | 90.4 | 89.7 | 90.0 |
| SVM | 93.4 | 92.8 | 93.1 | 92.9 |
| Random Forest | 89.8 | 88.9 | 88.1 | 88.5 |
| Naïve Bayes | 85.5 | 84.7 | 82.3 | 83.5 |
| Model | Accuracy (%) | Precision (%) | Recall (%) | F1-Score (%) |
| Logistic Regression | 90.5 | 89.9 | 91.2 | 90.4 |
| SVM | 92.1 | 91.7 | 92.5 | 92.0 |
| Random Forest | 88.7 | 87.4 | 88.9 | 88.1 |
| Naïve Bayes | 84.3 | 82.8 | 83.7 | 83.2 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.