Submitted:
02 October 2025
Posted:
03 October 2025
Read the latest preprint version here
Abstract

Keywords:
1. Introduction
1.1. Background and Motivation
1.2. Related Work
1.3. Contribution
- We offer a thorough examination of domains, datasets, and tasks specifically within the field of SA. Additionally, we provide insightful analyses derived from the compiled information.
- Our exploration both ML and DL algorithms, including Support Vector Machines (SVM), Convolutional Neural Networks (CNNs), and Recurrent Neural Networks (RNNs), played crucial roles. Subsequently, we present a comprehensive analysis of the advantages and disadvantages associated with these approaches in the context of SA.
- We delve into the challenges confronting SA models, addressing issues such as the dynamic nature of language, context-dependent interpretations, and the prospect of constructing a knowledge graph representation for semantic analytics or establishing sentiment scores for each entity.
- We draw from the findings in the SA research publication, we present and succinctly outline the essential steps involved in constructing a SA system.
2. Review Methodology
- Published from 01/01/2020 to 31/12/2024.
- Written in English, regardless of geographical region or dataset language.
- Paper title, keywords, or abstract must include: ("Sentiment Analysis" OR "Opinion Mining") AND ("Machine Learning" OR "Deep Learning" OR Classification) AND (Datasets OR Applications), adapted to each library’s Boolean search format.
3. Classification and Analysis
3.1. Overview
3.2. Domains
3.3. Tasks
3.4. Models
3.5. Datasets
3.6. Suggested Directions for Future Research
4. Results and Discussion
4.1. Domains
4.2. Tasks
4.3. Models and Algorithms
4.4. Datasets
4.5. Suggested Directions for Future Research
5. Conclusions
References
- Rodríguez-Ibánez, M.; et al. A review on sentiment analysis from social media platforms. Expert Systems with Applications 2023, 223, 119862. [CrossRef]
- Hande, A.; Priyadharshini, R.; Chakravarthi, B.R. KanCMD: Kannada CodeMixed Dataset for Sentiment Analysis and Offensive Language Detection. In Proceedings of the 3rd PEOPLES Workshop. ACL, 2020, pp. 54–63.
- Ngo, V.M.; Le-Khac, N.A.; Kechadi, M.T. Designing and Implementing Data Warehouse for Agricultural Big Data. In Proceedings of the Big Data – BigData 2019, 2019, pp. 1–17.
- Cao, T.H.; Ngo, V.M.; Hong, D.T.; Quan, T.T. A Named-Entity-Based Multi-Vector Space Model for Semantic Document Clustering. In Proceedings of the Proceedings of PAKDD Workshop (WMWA’2008, Japan), 2008.
- Duong, T.; Vu, T.; Ngo, V. Detecting Vietnamese Opinion Spam. In Proceedings of the Proceedings of Scientific Researches on the Information and Communication Technology in 2012 (ICTFIT’12), 2012.
- Tran, T.N.T.; Nguyen, L.K.N.; Ngo, V.M. Machine Learning based English Sentiment Analysis. Journal of Science and Technology, Vietnam Academy of Science and Technology 2014, 52, 142–155.
- Nguyen, L.H.; Pham, N.H.T.; Ngo, V.M. Opinion spam recognition method for online reviews using ontological features. Journal of Science, Special Issue: Natural Science and Technology 2014, 61, 44–59.
- Nguyen, P.T.; Le, L.T.; Ngo, V.M.; Nguyen, P.M. Using Entity Relations for Opinion Mining of Vietnamese Comments. Journal of Science and Technology, Vietnam Academy of Science and Technology 2014, 52, 120–132.
- Wankhade, M.; et al. A survey on sentiment analysis methods, applications, and challenges. Artificial Intelligence Review 2022, 55, 5731–5780. [CrossRef]
- Bordoloi, M.; Biswas, S.K. Sentiment analysis: A survey on design framework, applications and future scopes. Artif Intell Rev 2023, pp. 1–56. [CrossRef]
- Ezhilarasan, M.; et al. Sentiment Analysis On Product Review: A Survey. In Proceedings of the Proceedings of the 2019 ICCPEIC, 2019, pp. 180–192.
- Ngo, V.M.; et al. Investigation, Detection and Prevention of Online Child Sexual Abuse Materials: A Comprehensive Survey. In Proceedings of the 2022 RIVF International Conference on Computing and Communication Technologies, 2022, pp. 707–713.
- Dao, P.Q.; Nguyen-Tat, T.B.; Roantree, M.; Ngo, V.M. Exploring Multimodal Sentiment Analysis Models: A Comprehensive Survey. In Proceedings of the MAPR conf., 2024, pp. 1–7.
- Capuano, N.; et al. Sentiment analysis for customer relationship management: an incremental learning approach. Applied Intelligence 2020, 51, 3339–3352. [CrossRef]
- Nti, I.K.; Adekoya, A.F.; Weyori, B.A. A novel multi-source information-fusion predictive framework based on deep neural networks for accuracy enhancement in stock market prediction. Journal of Big Data 2021, 8. [CrossRef]
- Chakravarthi, B.R. Hope speech detection in YouTube comments. Soc Netw Anal Min 2022, 12, 75. [CrossRef]
- Dake, D.K.; Gyimah, E. Using sentiment analysis to evaluate qualitative students’ responses. Educ Inf Technol (Dordr) 2023, 28, 4629–4647. [CrossRef]
- Kaplan, H.; Weichselbraun, A.; Brasoveanu, A.M.P. Integrating Economic Theory, Domain Knowledge, and Social Knowledge into Hybrid Sentiment Models for Predicting Crude Oil Markets. Cognit Comput 2023, pp. 1–17. [CrossRef]
- Umair, A.; Masciari, E. Sentiment Analysis Using Improved CT-BERT_CONVLayer Fusion Model for COVID-19 Vaccine Recommendation. SN Computer Science 2024, 5. [CrossRef]
- Abimbola, B.; et al. Sentiment analysis of Canadian maritime case law: a sentiment case law and deep learning approach. International Journal of Information Technology 2024, 16, 3401–3409. [CrossRef]
- Kastrati, M.; et al. Leveraging distant supervision and deep learning for twitter sentiment and emotion classification. J. Intell. Inf. Syst. 2024, 62, 1045–1070. [CrossRef]
- Colon-Ruiz, C.; Segura-Bedmar, I. Comparing deep learning architectures for sentiment analysis on drug reviews. J Biomed Inform 2020, 110, 103539. [CrossRef]
- Rakhmanov, O. A Comparative Study on Vectorization and Classification Techniques in Sentiment Analysis to Classify Student-Lecturer Comments. Procedia Computer Science 2020, 178, 194–204. 9th YSC-2020,. [CrossRef]
- Iddrisu, A.M.; et al. A sentiment analysis framework to classify instances of sarcastic sentiments within the aviation sector. IJIM Data Insights 2023, 3. [CrossRef]
- Spinde, T.; et al. What do Twitter comments tell about news article bias? Assessing the impact of news article bias on its perception on Twitter. Online Social Networks and Media 2023, 37-38. [CrossRef]
- Meena, G.; et al. A hybrid DL approach for detecting sentiment polarities and KG representation on monkeypox tweets. Decision Analytics Journal 2023, 7.
- Atandoh, P.; et al. Integrated deep learning paradigm for document-based sentiment analysis. J. King Saud Univ. - Comput. Inform. Sci. 2023, 35.
- George, M.; Murugesan, R. Improving sentiment analysis of financial news headlines using hybrid Word2Vec-TFIDF feature extraction technique. Procedia Computer Science 2024, 244, 1–8. 6th Int. Conf. on AI in Computational Linguistics. [CrossRef]
- Tanoto, K.; et al. Investigation of challenges in aspect-based sentiment analysis enhanced using softmax function on twitter during the 2024 Indonesian presidential election. Procedia Computer Science 2024, 245, 989–997. 9th ICCSCI-2024. [CrossRef]
- Greeshma, M.; Simon, P. Bidirectional Gated Recurrent Unit with Glove Embedding and Attention Mechanism for Movie Review Classification. Procedia Computer Science 2024, 233, 528–536. 5th ICIDCA-2024. [CrossRef]
- Taneja, K.; Vashishtha, J.; Ratnoo, S. Transformer Based Unsupervised Learning Approach for Imbalanced Text Sentiment Analysis of E-Commerce Reviews. Procedia Computer Science 2024, 235, 2318–2331. ICMLDE-2023. [CrossRef]
- Zhai, G.; et al. Multi-attention fusion modeling for sentiment analysis of educational big data. Big Data Mining and Analytics 2020, 3, 311–319. [CrossRef]
- Mishev, K.; et al. Evaluation of Sentiment Analysis in Finance: From Lexicons to Transformers. IEEE Access 2020, 8, 131662–131682. [CrossRef]
- Fazil, M.; et al. Attentional Multi-Channel Convolution With Bidirectional LSTM Cell Toward Hate Speech Prediction. IEEE Access 2023, 11, 16801–16811. [CrossRef]
- Zhong, Q.; et al. Knowledge Graph Augmented Network Towards Multiview Representation Learning for Aspect-Based Sentiment Analysis. IEEE Transactions on Knowledge and Data Engineering 2023, 35, 10098–10111. [CrossRef]
- Xiang, C.; et al. Phrase-Aware Financial SA Based on Constituent Syntax. IEEE/ACM Tran. on Audio, Speech, and Lan. Processing 2024, 32, 1994–2005.
- Duan, G.; Yan, S.; Zhang, M. A Hybrid Neural Network Model for Sentiment Analysis of Financial Texts Using Topic Extraction, Pre-Trained Model, and Enhanced Attention Mechanism Methods. IEEE Access 2024, 12, 98207–98224. [CrossRef]
- Guan, M.; Li, F.; Xue, Y. Enhanced Syntactic and Semantic Graph Convolutional Network With Contrastive Learning for Aspect-Based Sentiment Analysis. IEEE Transactions on Computational Social Systems 2024, 11, 859–870. [CrossRef]
- Sherin, A.; Jasmine Selvakumari Jeya, I.; Deepa, S.N. Enhanced Aquila Optimizer Combined Ensemble Bi-LSTM-GRU With Fuzzy Emotion Extractor for Tweet Sentiment Analysis and Classification. IEEE Access 2024, 12, 141932–141951. [CrossRef]
- Xiao, H.; Luo, L. An Automatic Sentiment Analysis Method for Short Texts Based on Transformer-BERT Hybrid Model. IEEE Access 2024, 12, 93305–93317. [CrossRef]
- Perti, A.; Sinha, A.; Vidyarthi, A. Cognitive Hybrid Deep Learning-based Multi-modal Sentiment Analysis for Online Product Reviews. ACM Transactions on Asian and Low-Resource Language Information Processing 2023. [CrossRef]
- Dang, C.N.; et al. Sentiment Analysis for Vietnamese – Based Hybrid Deep Learning Models. In Proceedings of the 18th Hybrid Artificial Intelligent Systems (HAIS), 2023, pp. 293–303.
- Alzaidi, S.A.; et al. A Text-Inception-Based NLP for Sentiment Analysis of Drug Experiences. ACM Trans. Asian Low-Resource Lang. Infomation Process. 2024.
| 1 | |
| 2 | |
| 3 | |
| 4 | |
| 5 |







| Publisher | The number of literatures | ||
|---|---|---|---|
| with search | After | After reviewing | After reviewing |
| criteria | downloading | title and abstract | full text |
| Springer | 1,886 | 64 | 9 (Res[1]: 8, Sur[2]: 1) |
| Elsevier | 2,066 | 91 | 11 (Res: 10, Sur: 1) |
| IEEE | 465 | 39 | 9 (Res: 9, Sur: 0) |
| ACM | 292 | 12 | 3 (Res: 3, Sur: 0) |
| Total | 4,709 | 206 | 32 (Res: 30, Sur: 2) |
| Others with unlimited publisher and pub. year | 2 (Res: 0, Sur: 2) | ||
| Final Review | 34 (Res: 30, Sur: 4) | ||
| No | Models | Year | Domains | Dataset1 | Task2 | Algorithm and Accuracy |
| Springer | ||||||
| 1 | DL [14] | 2020 | Management | Self+NO | impr | HAN, Acc: 0.80 |
| 2 | FE (DL)+DL [15] | 2021 | Finance | Self+NO | new-m | IKN-ConvLSTM, Acc: 0.98 |
| 3 | Corpus+DL [16] | 2022 | Detection | Self+NO | new-m | T5-Sentence+CNN, F1: 0.75 |
| 4 | ML [17] | 2022 | Education | Self+NO | comp | SVM, Acc: 0.63 |
| 5 | DL [18] | 2023 | Finance | Self+O | new-m | CrudeBERT, Acc: 0.98 |
| 6 | DL [19] | 2024 | Health | 3rd+O | impr | CT-BERT, Acc: 0.87 |
| 7 | FE (DL)+DL [20] | 2024 | Politics | Self+O | new-m | CNN+LSTM, Acc: 0.92 |
| 8 | Corpus+DL [21] | 2024 | Technology | Self+NO | new-m | FastText+BiLSTM, F1: 0.71 |
| Elsevier | ||||||
| 1 | FE (DL)+DL [22] | 2020 | Health | 3rd+O | comp | BERT+LSTM, F1: 0.947 |
| 2 | Corpus+ML [23] | 2020 | Education | Self+NO | expe | TF-IDF+RF, Acc: 0.96 |
| 3 | Corpus+ML [24] | 2023 | Product | 3rd+O | impr | TF-IDF+SVM, Acc: 0.99 |
| 4 | FE (DL)+DL [25] | 2023 | Detection | Self+O | expe | fine-tuning XLNet, Acc: 0.95 |
| 5 | FE (DL)+DL [26] | 2023 | Health | 3rd+O | expe | CNN+LSTM, Acc: 0.94 |
| 6 | FE (DL)+DL [27] | 2023 | Product | 3rd+O | new-m | B-MLCNN, Acc: 0.95 |
| 7 | Corpus+ML [28] | 2024 | Finance | Self+NO | impr | TF-IDF+SVM, Acc: 0.82 |
| 8 | ML [29] | 2024 | Politics | 3rd+O | expe | SVM, Acc: 0.72 |
| 9 | Corpus+DL [30] | 2024 | Product | 3rd+O | new-m | GloVe+BiGRU, Acc: 0.98 |
| 10 | DL [31] | 2024 | Product | 3rd+O | comp | DistilBERT, Acc: 0.96 |
| IEEE | ||||||
| 1 | FE (DL)+DL [32] | 2020 | Education | Self+NO | new-m | Multi-AFM, Acc: 0.946 |
| 2 | DL [33] | 2020 | Finance | 3rd+O | expe | BART-Large, Acc: 0.947 |
| 3 | FE (DL)+DL [34] | 2023 | Detection | 3rd+O | new-m | CNN-BiLSTM+BERT, Acc: 0.94 |
| 4 | DL [35] | 2023 | Product | 3rd+O | impr | KGAN+RoBERTa, Acc: 0.9438 |
| 5 | FE (DL)+DL [36] | 2024 | Finance | 3rd+O | impr | MET-GAT, Acc: 0.8556 |
| 6 | FE (DL)+DL [37] | 2024 | Finance | 3rd+O | new-m | FinBERT-BiGRU, Acc: 0.95 |
| 7 | FE (DL)+DL [38] | 2024 | Product | 3rd+O | new-m | ESSGCN, Acc: 0.8786 |
| 8 | FE (DL)+DL [39] | 2024 | Product | 3rd+O | new-m | enBiLSTM-GRU, Acc: 0.94 |
| 9 | FE (DL)+DL [40] | 2024 | Product | 3rd+O | new-m | BERT-BiGRU, Acc: 0.94 |
| ACM | ||||||
| 1 | FE (DL)+ML [41] | 2023 | Product | Self+NO | expe | ConvNN+SVM, Acc: 0.845 |
| 2 | FE (DL)+ML [42] | 2023 | Education | Self+NO | new-m | CNN,LSTM+SVM, Acc: 0.93 |
| 3 | Corpus+DL [43] | 2024 | Health | Self+NO | new-m | TextInception+BiGRU, Acc: 0.99 |
| 1 Self = self-built; 3rd = third-party; NO = no Open for access; O = Open for access | ||||||
| 2 new-m = new model; comp = comparison; impr = improvement; expe = experiment | ||||||
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).