Submitted:
23 January 2026
Posted:
23 January 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
- We provide an overview of research on multilingual sentiment analysis over the past decade, categorizing it based on the classifiers utilized. For each paper reviewed, we provide a brief introduction covering aspects such as classification model, supported languages, preprocessing techniques, datasets used, feature extraction methods, and experimental results.
- We summarize several representative datasets for multilingual sentiment analysis and provide an overview of the evaluation metrics.
- We discuss the current limitations in the field of multilingual sentiment analysis and provide several possible perspectives for future research.
2. Algorithms
2.1. Machine Learning
2.2. Deep Learning
2.3. Ensemble Learning
3. Datasets
3.1. SemEval-2016 Task 5
3.2. Multilingual Amazon Reviews Corpus
3.3. SemEval-2017 Task 4
3.4. HECM
3.5. SemEval-2020 task 9
3.6. Sentiment Analysis in Indian Languages
4. Evaluation Metrics
4.1. Accuracy
4.2. Precision
4.3. Recall
4.4. F1-score
5. Limitations
6. Future Research Directions
7. Conclusions
Data Availability Statement
Conflicts of Interest
References
- Kruspe, A.; Häberle, M.; Kuhn, I.; Zhu, X.X. Cross-language sentiment analysis of european twitter messages duringthe covid-19 pandemic. arXiv arXiv:2008.12172. [CrossRef]
- Oliveira, A.S.; Renda, A.I.; Correia, M.B.; Antonio, N. Hotel customer segmentation and sentiment analysis through online reviews: An analysis of selected European markets. Tourism & Management Studies 2022, 18, 29–40. [Google Scholar]
- Vicente, I.S.; Saralegi, X.; Agerri, R. Real Time Monitoring of Social Media and Digital Press. arXiv arXiv:1810.00647. [CrossRef]
- Abdullah, N.A.S.; Rusli, N.I.A. Multilingual Sentiment Analysis: A Systematic Literature Review. Pertanika Journal of Science & Technology 2021, 29, 445–470. [Google Scholar] [CrossRef]
- Agüero-Torales, M.M.; Abreu Salas, J.I.; López-Herrera, A.G. Deep learning and multilingual sentiment analysis on social media data: An overview. Applied Soft Computing 2021, 107, 107373. [Google Scholar] [CrossRef]
- Mabokela, K.R.; Celik, T.; Raborife, M. Multilingual Sentiment Analysis for Under-Resourced Languages: A Systematic Review of the Landscape. IEEE Access 2023, 11, 15996–16020. [Google Scholar] [CrossRef]
- Aliyu, Y.; Sarlan, A.; Usman Danyaro, K.; Rahman, A.S.B.A.; Abdullahi, M. Sentiment Analysis in Low-Resource Settings: A Comprehensive Review of Approaches, Languages, and Data Sources. IEEE Access 2024, 12, 66883–66909. [Google Scholar] [CrossRef]
- Elawady, R.; Barakat, S.; Elrashidy, N. Sentimentanalysis for Arabic and English datasets. International Journal of Intelligent Computing and Information Sciences 2015, 15, 55–70. [Google Scholar] [CrossRef]
- Kumar, A.; Kohail, S.; Ekbal, A.; Biemann, C. IIT-TUDA: System for Sentiment Analysis in Indian Languages Using Lexical Acquisition. In Proceedings of the Mining Intelligence and Knowledge Exploration; Prasath, R., Vuppala, A.K., Kathirvalavakumar, T., Eds.; Cham, 2015; pp. 684–693. [Google Scholar]
- Makrynioti, N.; Vassalos, V. Sentiment Extraction from Tweets: Multilingual Challenges. In Proceedings of the Big Data Analytics and Knowledge Discovery; Madria, S., Hara, T., Eds.; Cham, 2015; pp. 136–148. [Google Scholar]
- Povoda, L.; Burget, R.; Dutta, M.K. Sentiment analysis based on Support Vector Machine and Big Data. In Proceedings of the 2016 39th International Conference on Telecommunications and Signal Processing (TSP), 2016; pp. 543–545. [Google Scholar] [CrossRef]
- Bobichev, V.; Kanishcheva, O.; Cherednichenko, O. Sentiment analysis in the Ukrainian and Russian news. In Proceedings of the 2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON), 2017; pp. 1050–1055. [Google Scholar] [CrossRef]
- Patel, S.; Nolan, B.; Hofmann, M.; Owende, P.; Patel, K. Sentiment analysis: Comparative analysis of multilingual sentiment and opinion classification techniques. World Academy of Science, Engineering and Technology, International Journal of Computer, Electrical, Automation, Control and Information Engineering 2017, 11, 565–571. [Google Scholar]
- Tellez, E.S.; Miranda-Jiménez, S.; Graff, M.; Moctezuma, D.; Suárez, R.R.; Siordia, O.S. A simple approach to multilingual polarity classification in Twitter. Pattern Recognition Letters 2017, 94, 68–74. [Google Scholar] [CrossRef]
- Abo, M.E.M.; Shah, N.A.K.; Balakrishnan, V.; Abdelaziz, A. Sentiment analysis algorithms: evaluation performance of the Arabic and English language. In Proceedings of the 2018 International Conference on Computer, Control, Electrical, and Electronics Engineering (ICCCEEE), 2018; pp. 1–5. [Google Scholar] [CrossRef]
- Pustulka-Hunt, E.; Hanne, T.; Blumer, E.; Frieder, M. Multilingual sentiment analysis for a swiss gig. In Proceedings of the 2018 6th International Symposium on Computational and Business Intelligence (ISCBI), 2018; IEEE; pp. 94–98. [Google Scholar]
- Arun, K.; Srinagesh, A. Multi-lingual Twitter sentiment analysis using machine learning. International Journal of Electrical & Computer Engineering Accessed. 2020, 10, 5992–6000. [Google Scholar] [CrossRef]
- Abubakar, A.I.; Roko, A.; Bui, A.M.; Saidu, I. An Enhanced Feature Acquisition for Sentiment Analysis of English and Hausa Tweets. International Journal of Advanced Computer Science and Applications 2021, 12. [Google Scholar] [CrossRef]
- Sumana; Kanchan, P. Hindi and Kannada Twitter Sentiment Analysis Using Machine Learning Algorithm. In Proceedings of the 2022 4th International Conference on Advances in Computing, Communication Control and Networking (ICAC3N), 2022; pp. 370–377. [Google Scholar] [CrossRef]
- Helmy, A.; Nassar, R.; Ramdan, N. Depression detection for twitter users using sentiment analysis in English and Arabic tweets. Artificial Intelligence in Medicine 2024, 147, 102716. [Google Scholar] [CrossRef]
- Maree, M.; Eleyat, M.; Mesqali, E. Optimizing Machine Learning-based Sentiment Analysis Accuracy in Bilingual Sentences via Preprocessing Techniques. The International Arab Journal of Information Technology (IAJIT) 2024, 21, 257–270. [Google Scholar] [CrossRef]
- Alharbi, Y.; Khan, S.S. Classifying Multi-Lingual Reviews Sentiment Analysis in Arabic and English Languages Using the Stochastic Gradient Descent Model. Computers, Materials and Continua 2025, 83, 1275–1290. [Google Scholar] [CrossRef]
- Zhou, X.; Wan, X.; Xiao, J. Attention-based LSTM network for cross-lingual sentiment classification. In Proceedings of the Proceedings of the 2016 conference on empirical methods in natural language processing, 2016; pp. 247–256. [Google Scholar]
- Becker, W.; Wehrmann, J.; Cagnini, H.E.L.; Barros, R.C. An efficient deep neural architecture for multilingual sentiment analysis in twitter. In Proceedings of the 30th FLAIRS, Brasil., 2017. [Google Scholar]
- Deriu, J.; Lucchi, A.; De Luca, V.; Severyn, A.; Müller, S.; Cieliebak, M.; Hofmann, T.; Jaggi, M. Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification. In Proceedings of the Proceedings of the 26th International Conference on World Wide Web, Republic and Canton of Geneva, CHE, 2017; WWW ’17, pp. 1045–1052. [Google Scholar] [CrossRef]
- Medrouk, L.; Pappa, A. Deep Learning Model for Sentiment Analysis in Multi-lingual Corpus. In Proceedings of the Neural Information Processing; Liu, D., Xie, S., Li, Y., Zhao, D., El-Alfy, E.S.M., Eds.; Cham, 2017; pp. 205–212. [Google Scholar]
- Can, E.F.; Ezen-Can, A.; Can, F. Multilingual Sentiment Analysis: An RNN-Based Framework for Limited Data. arXiv 2018, arXiv:1806.04511. [Google Scholar] [CrossRef]
- Medrouk, L.; Pappa, A. Do Deep Networks Really Need Complex Modules for Multilingual Sentiment Polarity Detection and Domain Classification? In Proceedings of the 2018 International Joint Conference on Neural Networks (IJCNN), 2018; pp. 1–6. [Google Scholar] [CrossRef]
- Wehrmann, J.; Becker, W.E.; Barros, R.C. A multi-task neural network for multilingual sentiment classification and language detection on Twitter. In Proceedings of the Proceedings of the 33rd Annual ACM Symposium on Applied Computing, New York, NY, USA, 2018; SAC ’18, pp. 1805–1812. [Google Scholar] [CrossRef]
- Akhtar, M.S.; Kumar, A.; Ekbal, A.; Biemann, C.; Bhattacharyya, P. Language-agnostic model for aspect-based sentiment analysis. In Proceedings of the Proceedings of the 13th international conference on computational semantics-long papers, 2019; pp. 154–164. [Google Scholar]
- Lal, Y.K.; Kumar, V.; Dhar, M.; Shrivastava, M.; Koehn, P. De-Mixing Sentiment from Code-Mixed Text. In Proceedings of the Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop; Alva-Manchego, F., Choi, E., Khashabi, D., Eds.; Florence, Italy, 2019; pp. 371–377. [Google Scholar] [CrossRef]
- Chundi, R.; Hulipalled, V.R.; Simha, J. SAEKCS: Sentiment Analysis for English – Kannada Code SwitchText Using Deep Learning Techniques. In Proceedings of the 2020 International Conference on Smart Technologies in Computing, Electrical and Electronics (ICSTCEE), 2020; pp. 327–331. [Google Scholar] [CrossRef]
- Dahiya, A.; Battan, N.; Shrivastava, M.; Sharma, D.M. Curriculum Learning Strategies for Hindi-English Code-Mixed Sentiment Analysis. In Proceedings of the Artificial Intelligence. IJCAI 2019 International Workshops, Cham; Seghrouchni, El Fallah, Sarne, A.D., Eds.; 2020; pp. 177–189. [Google Scholar]
- Shakeel, M.H.; Faizullah, S.; Alghamidi, T.; Khan, I. Language Independent Sentiment Analysis. In Proceedings of the 2019 International Conference on Advances in the Emerging Computing Technologies (AECT), 2020; pp. 1–5. [Google Scholar] [CrossRef]
- Shen, J.; Liao, X.; Lei, S. Cross-lingual Sentiment Analysis via AAE and BiGRU. In Proceedings of the 2020 Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC), 2020; pp. 237–241. [Google Scholar] [CrossRef]
- Younas, A.; Nasim, R.; Ali, S.; Wang, G.; Qi, F. Sentiment Analysis of Code-Mixed Roman Urdu-English Social Media Text using Deep Learning Approaches. In Proceedings of the 2020 IEEE 23rd International Conference on Computational Science and Engineering (CSE), 2020; pp. 66–71. [Google Scholar] [CrossRef]
- Dowlagar, S.; Mamidi, R. Graph Convolutional Networks with Multi-headed Attention for Code-Mixed Sentiment Analysis. In Proceedings of the Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages, Kyiv; Chakravarthi, B.R., Priyadharshini, R., Kumar, M, Krishnamurthy, A., Sherly, P.E., Eds.; 2021; pp. 65–72. [Google Scholar]
- Mboutayeb, S.; Majda, A.; Nikolov, N.S. Multilingual Sentiment Analysis: A Deep Learning Approach. In Proceedings of the Conference: International Conference On Big Data, Modelling And Machine Learning (BML’21), 2021; pp. 1–7. [Google Scholar]
- Barbieri, F.; Anke, L.E.; Camacho-Collados, J. XLM-T: Multilingual Language Models in Twitter for Sentiment Analysis and Beyond, 2022. arXiv arXiv:cs.
- Chaves, I.C.; Martins, A.D.F.; Praciano, F.D.; Brito, F.T.; Monteiro, J.M.; Machado, J.C. BPA: A Multilingual Sentiment Analysis Approach based on BiLSTM. In Proceedings of the ICEIS (1), 2022; pp. 553–560. [Google Scholar]
- Suresh Kumar, K.; Helen Sulochana, C. Local search five-element cycle optimized reLU-BiLSTM for multilingual aspect-based text classification. Concurrency and Computation: Practice and Experience 2022, 34, e7374. [Google Scholar] [CrossRef]
- Yang, Q.; Kadeer, Z.; Gu, W.; Sun, W.; Wumaier, A. Affective Knowledge Augmented Interactive Graph Convolutional Network for Chinese-Oriented Aspect-Based Sentiment Analysis. IEEE Access 2022, 10, 130686–130698. [Google Scholar] [CrossRef]
- Astuti, L.W.; Sari, Y.; Suprapto. Code-mixed sentiment analysis using transformer for twitter social media data. International Journal of Advanced Computer Science and Applications 2023, 14. [Google Scholar] [CrossRef]
- Choudhary, N.; Singh, R.; Bindlish, I.; Shrivastava, M. Sentiment Analysis of Code-Mixed Languages Leveraging Resource Rich Languages. In Proceedings of the Computational Linguistics and Intelligent Text Processing, Cham; Gelbukh, A., Ed.; 2023; pp. 104–114. [Google Scholar]
- Deole, Y.; Jagadish, T.; Shabrez, M.; M, P.; G, A. Multi-Lingual Sentiment Analysis of Urdu and English Tweets Using RNN with Bidirectional LSTM. In Proceedings of the 2023 6th International Conference on Recent Trends in Advance Computing (ICRTAC), 2023; pp. 813–817. [Google Scholar] [CrossRef]
- Kar, P.; Debbarma, S. Sentimental analysis & Hate speech detection on English and German text collected from social media platforms using optimal feature extraction and hybrid diagonal gated recurrent neural network. Engineering Applications of Artificial Intelligence 2023, 126, 107143. [Google Scholar] [CrossRef]
- Mercha, E.M.; Benbrahim, H.; Erradi, M. SlideGCN: Slightly Deep Graph Convolutional Network for Multilingual Sentiment Analysis. In Proceedings of the Advances in Computational Intelligence; Rojas, I., Joya, G., Catala, A., Eds.; Cham, 2023; pp. 91–103. [Google Scholar]
- Omran, T.M.; Sharef, B.T.; Grosan, C.; Li, Y. Transfer learning and sentiment analysis of Bahraini dialects sequential text data using multilingual deep learning approach. Data & Knowledge Engineering 2023, 143, 102106. [Google Scholar] [CrossRef]
- Gupta, N.; Bhattarai, M.; Chavan, A. Multilingual Sentiment Analysis using DistilBert. In Proceedings of the 2024 International Conference on Artificial Intelligence and Quantum Computation-Based Sensor Application (ICAIQSA), 2024; pp. 1–6. [Google Scholar] [CrossRef]
- Kathunia, A.; Kaif, M.; Arora, N.; Narotam, N. Sentiment Analysis Across Languages: Evaluation Before and After Machine Translation to English. arXiv 2024, arXiv:2405.02887. [Google Scholar] [CrossRef]
- Mamta; Ekbal, A. Transformer based multilingual joint learning framework for code-mixed and english sentiment analysis. Journal of Intelligent Information Systems 2024, 62, 231–253. [Google Scholar] [CrossRef]
- Yacoub, A.D.; Aboutabl, A.E.; Slim, S.O. Multilingual Sarcasm Detection for Enhancing Sentiment Analysis using Deep Learning Algorithms. Journal of Communications Software and Systems 2024, 20, 278–289. [Google Scholar] [CrossRef]
- Jain, V.; Malviya, L.; S, A. Optimized hybrid deep learning for cross-linguistic sentiment analysis: a novel approach. Journal of Cloud Computing 2025, 14, 30. [Google Scholar] [CrossRef]
- Jain, V.; Mohanani, G.; Gaur, A.; Patheja, P.S. Enhanced Multilingual Sentiment Analysis Using Hybrid CNN-GRU Deep Learning Architecture. In Proceedings of the Advanced Network Technologies and Intelligent Computing; Verma, A., Verma, P., Pattanaik, K.K., Buyya, R., Dasgupta, D., Eds.; Cham, 2025; pp. 357–370. [Google Scholar]
- Nazir, M.K.; Faisal, C.N.; Habib, M.A.; Ahmad, H. Leveraging Multilingual Transformer for Multiclass Sentiment Analysis in Code-Mixed Data of Low-Resource Languages. IEEE Access 2025, 13, 7538–7554. [Google Scholar] [CrossRef]
- Jhanwar, M.G.; Das, A. An Ensemble Model for Sentiment Analysis of Hindi-English Code-Mixed Data, 2018. arXiv arXiv:cs.
- Zhang, X.; Zhang, C.; Shi, H. Ensemble of Binary Classification for the Emotion Detection in Code-Switching Text. In Proceedings of the Natural Language Processing and Chinese Computing; Zhang, M., Ng, V., Zhao, D., Li, S., Zan, H., Eds.; Cham, 2018; pp. 178–189. [Google Scholar]
- Sarkar, K. Heterogeneous classifier ensemble for sentiment analysis of Bengali and Hindi tweets. Sādhanā 2020, 45, 196. [Google Scholar] [CrossRef]
- Al Shamsi, A.A.; Abdallah, S. Ensemble Stacking Model for Sentiment Analysis of Emirati and Arabic Dialects. Journal of King Saud University - Computer and Information Sciences 2023, 35, 101691. [Google Scholar] [CrossRef]
- Aryal, S.K.; Prioleau, H.; Washington, G.; Burge, L. Evaluating Ensembled Transformers for Multilingual Code-Switched Sentiment Analysis. In Proceedings of the 2023 International Conference on Computational Science and Computational Intelligence (CSCI), 2023; pp. 165–173. [Google Scholar] [CrossRef]
- Habbat, N.; Nouri, H.; Anoun, H.; Hassouni, L. Sentiment analysis of imbalanced datasets using BERT and ensemble stacking for deep learning. Engineering Applications of Artificial Intelligence 2023, 126, 106999. [Google Scholar] [CrossRef]
- Choudhary, C.; Thakur, J.; Singh, M.; Anurag. Sentiment Analysis in Code-Mixed Video Comments Using Machine Learning on a Hinglish Dataset. In Proceedings of the 2024 First International Conference on Software, Systems and Information Technology (SSITCON), 2024; pp. 1–6. [Google Scholar] [CrossRef]
- Hasan, M.A. Ensemble Language Models for Multilingual Sentiment Analysis. arXiv 2024, arXiv:2403.06060. [Google Scholar] [CrossRef]
- Hassan, M.E.; Maab, I.; Hussain, M.; Habib, U.; Matsuo, Y. Polarity Classification of Low Resource Roman Urdu and Movie Reviews Sentiments Using Machine Learning-Based Ensemble Approaches. IEEE Open Journal of the Computer Society 2024, 5, 599–611. [Google Scholar] [CrossRef]
- Miah, M.S.U.; Kabir, M.M.; Sarwar, T.B.; Safran, M.; Alfarhood, S.; Mridha, M. A multimodal approach to cross-lingual sentiment analysis with ensemble of transformer and LLM. Scientific Reports 2024, 14, 9603. [Google Scholar] [CrossRef] [PubMed]
- Dharini, N.; Madhuvanthi, M.; Aswini, C.S.; Lakshya, R.; Triumbika, M.; Saranya, N. Ensemble-Driven Multilingual Sentiment Analysis Framework for YouTube Comments with Dashboard. In Proceedings of the 2025 International Conference on Visual Analytics and Data Visualization (ICVADV); 2025; pp. 1524–1529. [Google Scholar] [CrossRef]
- Pontiki, M.; Galanis, D.; Papageorgiou, H.; Androutsopoulos, I.; Manandhar, S.; AL-Smadi, M.; Al-Ayyoub, M.; Zhao, Y.; Qin, B.; De Clercq, O.; et al. SemEval-2016 Task 5: Aspect Based Sentiment Analysis. In Proceedings of the Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016); Bethard, S., Carpuat, M., Cer, D., Jurgens, D., Nakov, P., Zesch, T., Eds.; San Diego, California, 2016; pp. 19–30. [Google Scholar] [CrossRef]
- Keung, P.; Lu, Y.; Szarvas, G.; Smith, N.A. The Multilingual Amazon Reviews Corpus. arXiv 2020, arXiv:2010.02573. [Google Scholar] [CrossRef]
- Rosenthal, S.; Farra, N.; Nakov, P. SemEval-2017 Task 4: Sentiment Analysis in Twitter, 2019. arXiv arXiv:cs.
- Joshi, A.; Prabhu, A.; Shrivastava, M.; Varma, V. Towards Sub-Word Level Compositions for Sentiment Analysis of Hindi-English Code Mixed Text. In Proceedings of the Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, Osaka, Japan; Matsumoto, Y., Prasad, R., Eds.; 2016; pp. 2482–2491. [Google Scholar]
- Patwa, P.; Aguilar, G.; Kar, S.; Pandey, S.; PYKL, S.; Gambäck, B.; Chakraborty, T.; Solorio, T.; Das, A. SemEval-2020 Task 9: Overview of Sentiment Analysis of Code-Mixed Tweets, 2020. arXiv arXiv:cs.
- Patra, B.G.; Das, D.; Das, A.; Prasath, R. Shared Task on Sentiment Analysis in Indian Languages (SAIL) Tweets - An Overview. In Proceedings of the Mining Intelligence and Knowledge Exploration; Prasath, R., Vuppala, A.K., Kathirvalavakumar, T., Eds.; Cham, 2015; pp. 650–655. [Google Scholar]
- Padmaja, S.; Fatima, S.S. Opinion mining and sentiment analysis-an assessment of peoples’ belief: A survey. International Journal of Ad hoc, Sensor & Ubiquitous Computing 2013, 4, 21. [Google Scholar]
| Literature | Dataset | Language(s) | Features | Classifier | Accuracy |
|---|---|---|---|---|---|
| [8] | Self-collected dataset (from Amazon and Youtube) | English, Arabic | TF-IDF | DT, NB, SVM | 0.877(English), 0.8934 (Arabic) |
| [9] | SAIL 2015 | Hindi, Bengali | - | SVM | 0.432 (Bengali), 0.4968 (Hindi) |
| [10] | Self-collected Greek dataset, SemEval 2013 | Greek and English | - | SVM | 0.686 (mF1 Greek), 0.642 (mF1 English) |
| [11] | Self-collected dataset(from websites) | Czech, German, English, Spanish | TF-IDF, 2-gram | SVM | 0.9531 (English), 0.9323 (Spanish), and 0.8905 (Czech) |
| [12] | Self-collected dataset(from news) | Ukrainian, Russian | TF-IDF, correlation-based feature subset selection, information gain, and bag of words | NB, DMNBtext,NB Multinomial, SVM | 0.82 (F1-score) |
| [13] | HindEnCorps, English movie reviews dataset, Hindi movie reviews dataset | Hindi, English | - | SVM, NB, ME, DT, RF, KNN | 0.81 (English), 0.78 (Hindi) |
| [14] | TASS-15, SemEval 15-16, SENTIPOLC-14, other datasets in different languages | English, Spanish, Italian, Arabic, German, Portuguese, Russian, and Swedish | TF-IDF, word based n-grams, and character q-grams | SVM | 0.637 (TASS-15 dataset), 0.456 (SemEval-16) |
| [15] | Self-collected dataset | MSA, DA, English | NB, DT | - | NB: 0.508 (DA), 0.895 (MSA) and 0.84 (English). DT: 0.544 (DA), 0.97 (MSA), 0.84 (English) |
| [16] | Self-collected dataset (Gigworker and Company Ratings, Twitter) | German, French, and English | TF-IDF, N-gram | LSVC, RSVC, MNB, KNN, Tree, LR, RF | LR: 0.869 |
| [17] | Self-collected dataset(from Twitter) | English, Telugu, Hindi, Bengali, Urdu, Arabic and other undefined languages | Bag-of-words | MNB, LR, SVM, DT, KNN, RF | SVM: 0.95; RF: 0.93 |
| [18] | HWN lexical resource, Self-collected dataset (from Twitter) | Hausa and English | TF-IDF, term frequency, n-gram | NB, SVM, ME | NB: 0.68 |
| [19] | Self-collected dataset(from Twitter) | Hindi, Kannada | - | NB, KNN, Tree, RF | RF: 0.997 (Hindi) 0.995 (Kannada) |
| [20] | Self-collected dataset (from Twitter) | Arabic, English | TF-IDF, BOW | LR, RF, NB,SVM | RF: 0.85; Rbf-SVM: 0.966(F1-score) |
| [21] | ASTD, AJGT, English translated datasets | Arabic, English | - | LR, RF, NB,SVM | SVM: 0.71(ASTD), 0.86 (AJGT translated into English) |
| [22] | Arabic Review Dataset, IMDB | Arabic, English | TF-IDF | SGD | 0.8489 (Arabic dataset); 0.8744(IMDB) |
| Literature | Classifier | Dataset | Language(s) | Features | Accuracy |
|---|---|---|---|---|---|
| [23] | attention-based LSTM | NLP&CC 2013 | English, Chinese | Word2Vec | 0.824 |
| [24] | CNN | 4-language tweets | English, Spanish, Portuguese, and German | - | 0.695 |
| [25] | CNN | Self-collected dataset(from websites) | Italian, German, French, English | skip-gram, word embedding | 0.6779(F1-score, Italian), 0.6509(F1-score, German), 0.6479(F1-score, French), 0.6226(F1-score, English) |
| [26] | CNN | Self-collected | English, French and Greek | - | 0.88 (F1-score, trilingual) |
| [27] | RNN, BI-GRU | restaurant reviews (from multiple sources) | Spanish, Turkish, Dutch and Russian | Pre-trained global vectors | 0.8561( Russian), 0.8421(Spanish), 0.7436 (Turkish), 0.8177 (Dutch) |
| [28] | CNN, LSTM | Self-collected dataset( reviews from website) | French, English, Greek | - | CNN: 0.9125; LSTM: 0.9127 |
| [29] | CNN | subset of a Twitter corpora | English, Spanish, Portuguese, and German | - | 0.72 |
| [30] | Bi-LSTM | SemEval-2016 task 5 and GermEval-2017 task on ABSA | English, Hindi, Spanish, French, Dutch, and German | word embedding, hand-crafted manual features | 0.834 (English), 0.669 (Hindi), 0.871 (Spanish), 0.753 (French), 0.819 (Dutch), 0.872 (German) |
| [31] | CNN, BiLSTM | HECM | Hindi, English | sub-word representation | 0.8354 |
| [32] | CNN, BiLSTM | Self-collected | English, Kannada | Sub word level embedding | 0.776 |
| [33] | Bi-LSTM | Hindi-English code-mixed datasets | Hindi, English | character trigram | 0.7251 |
| [34] | CNN, LSTM | Sanders Twitter corpus, GermEval dataset | English, German | - | 0.8094(English), 0.7475(German), 0.7508(mixed) |
| [35] | LSTM,Adversarial Auto Encoder, BiGRU | cross-lingual Amazon product dataset | Chinese, English, and German | word embedding | 0.78757 |
| [36] | mBERT, XLM-R | self-collected | English, Roman Urdu | - | mBERT: 0.69; XLM-R:0.71 |
| [37] | GCN | Dravidian code-mixed dataset | Tamil, Malayalam, English | word embedding | 0.73 (Malayalam-English), 0.71 (Tamil-English) |
| [38] | CNN, RNN, CNN-RNN | self-collected | English, Portuguese and Arabic | word embedding | 0.8591 |
| [39] | XLM-T | self-collected dataset | 8 languages (for evaluation) | - | 0.6953(average F1) |
| [40] | BiLSTM | SST-2, SemEval 2017 Subtask A, TASS 2017 Task 1 General Corpus, two self-collected datasets | English, Spanish, and Portuguese | - | 0.901(SST-2), 0.865(CCMD-ES), 0.923(CCMD-PT) |
| [41] | BiLSTM, | self-constructed datasets( from multiple sources) | multiple languages | BERT | 0.9725 |
| [42] | GCN | 4 Chinese datasets(Car, Phone, Notebook, Camera), 6 English datasets( Tshirt, Twitter, MAMS, REST14, REST15, REST16) | Chinese, English | BERT | 0.9771(camera); 0.9302(REST16) |
| [43] | pretrained BERT | self-collected | Indonesian, English | - | 0.7656 |
| [44] | Bi-LSTM | English Twitter dataset, SemEval 2013, and Hindi-English Code-Mixed (HECM) | Hindi, English | Skip-gram, character trigram embedding | 0.78(HECM) |
| [45] | RNN-BiLSTM | Pakistan Government Twitter Dataset | Urdu, English | - | 0.9564 |
| [46] | hybrid diagonal gated RNN | HASOC 2019 | English, German | - | 0.9687(English), 0.9331(German) |
| [47] | GCN | self-constructed (MARC, 3 movie review datasets) | English, German, French, Spanish | - | 0.87(FR-DE), 0.88(movie review) |
| [48] | LSTM | self-constructed | English, MSA, BD, BD dialect | - | 0.9704(English), 0.971(MSA) |
| [49] | DistilBERT | SST2, GLUE, self-collected multilingual Twitter datasets | English, French, German, Arabic, Italian, Indonesian | - | 0.989 (SST2), 0.911(GLUE) and 0.915 (Indonesian dataset) |
| [50] | BERT, XLM-RoBERTa | subset of MARC | French, German, Spanish, Japanese, Chinese | - | No MT:0.86715 (Chinese XLM), 0.90730 (German BERT) MT: 0.89965 (Spanish XLM), 0.82391 (Chinese BERT) |
| [51] | task-specific BERT | English Dataset, Hinglish code-mixed dataset(SemEval2020 Task 9 dataset), Punglish code-mixed dataset, HindiMD dataset | English, Hindi, Punjabi | - | 0.7431 (Hindi-English), 0.734 (Punjabi-English) |
| [52] | BiLSTM, LSTM | ArSarcasm-v2, iSarcasmEval, IMDB, SentiMixArEn | English, Arabic | word embedding | 0.8132 (Arsarcasm, BiLSTM with BERT), 0.9061 (IMDB, BiLSTM with BERT) |
| [53] | CNN, LSTM | self-collected | Czech, German, Spanish, French, Polish, and Slovak | word embedding | 0.9598 |
| [54] | CNN-GRU | Self-constructed(from Kaggle) | English, Portuguese, Tamil, and French | - | 0.98(English), 0.92(Portuguese), 0.91(Tamil), 0.89(French) |
| [55] | mBERT | self-collected (from Youtube and Whatsapp) | Roman Punjabi, Roman Urd | BERT | 0.7994 |
| Literature | Classifier | Dataset | Language(s) | Features | Accuracy |
|---|---|---|---|---|---|
| [56] | LSTMs, Multinomial Naive Bayes | HECM | Hindi, English | character-trigram, word-ngram | 0.708 |
| [57] | Fasttext, RCNN, CNN | NLPCC2018 Task1 | English and Chinese. | - | 0.734(happiness); 0.616(sadness) |
| [58] | Multinomial NB, SVM | SAIL 2015 | Bengali, Hindi | word n-gram, character n-gram, unigram | 0.5753 (Bengali), 0.6263 (Hindi) |
| [59] | Bi-LSTM-Bi-GRU model, CNN-Bi-GRU model, CNN-Bi-LSTM, Bi-LSTM-LSTM model, Bi-GRU model, BI-LSTM model, Deep CNN model | ESAAD, ASAD, ACRD, PSAD | Emirati, Arabic, English | AraBERT, MARBERT | 0.9462 (undersampled ESAAD), 0.946 (ACRD), 0.983 (balanced oversampled ACRD), 0.9779 (PSAD) |
| [60] | Cardiffnlp Base, Cardiffnlp XLM, distiluse | SemEval2020 Task 9 dataset, Tamil-English and Malayalam-English datasets, CMTET | Hindi, Spanish, Tamil, Malayalam, Telugu, English | - | Original dataset: 0.84(binary classification), 0.74(ternary classification) |
| [61] | LSTM, GRU, BiLSTM | Sentiment140, Twitter US Airline Sentiment, Arabic-Reviews of Hotels Dataset, Arabic Twitter dataset, MAC dataset, Dialect dataset | English, Arabic, Moroccan | BERT | 0.94 (Sentiment140 dataset), 0.94 (Arabic Twitter Dataset), 0.943(MAC dataset) |
| [62] | RF, LR | self-collected (from Youtube) | Hindi, English | TF-IDF, word embeddings | 0.92 |
| [63] | AraBERT or RoBERTa Model, multilingual BERT | SemEval-2017 task4, ASTD | English, Arabic | - | 0.6891 (English), 0.6767 (Arabic) |
| [64] | SVM, RF, LR, NB | UCL Roman Urdu dataset, IMDB dataset | Roman Urdu, English | BOW, TF-IDF, unigram and bigram | 0.803 (UCL dataset), 0.9092 (IMDB) |
| [65] | transformers and LLM | self-constructed | Arabic, Chinese, French, Italian | - | 0.8671(google translate) |
| [66] | LR, RF, SVM | self-collected ( from Youtube) | Tamil, English, Tanglish | mBERT and XLM-RoBERTa | 0.97 |
| Dataset | Languages | Sentiment Classes | Scale |
|---|---|---|---|
| SemEval-2016 Task 5 [67] | English, Arabic, Chinese, Dutch, French, Russian, Spanish and Turkish | 3,4 | 70,790 tuples |
| MARC [68] | English, Japanese, German, French, Spanish, and Chinese | 5 | 1.26 M reviews |
| SemEval-2017 Task 4 [69] | English, Arabic | 2,3,5 | 70 k+ tweets |
| HECM [70] | Hindi, English | 3 | 3,879 sentences |
| SemEval-2020 Task 9 [71] | Hindi, English, Spanish | 3 | 20k(Hindi-English), 19k(Spanish-English) |
| SAIL [72] | Bengali, Hindi, Tamil | 3 | 1,498 tweets (Bengali), 1,688 tweets (Hindi), 1,663 tweets (Tamil) |
| Predicted positive | Predicted negative | |
|---|---|---|
| Actually Positive | True Positives (TP) | False Negatives (FN) |
| Actually Negative | False Positives (FP) | True Negatives (TN) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
