Submitted:
08 April 2024
Posted:
09 April 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
3. Materials and Methods
3.1. Dataset
3.2. Data Preprocessing
3.2.1. Word Tokenization
3.2.2. Token Labelling
3.2.3. Stemming
3.2.4. Lemmatization

3.3. Deep Learning
3.3.1. Convolutional Neural Network
3.3.2. Long Short-Term Memory
3.3.3. Bidirectional Encoder Representations from Transformers
3.4. Evaluation Parameters
3.4.1. Accuracy
3.4.2. Precision
3.4.3. Recall
3.4.4. F1 Score
3.5. Modelling
3.5.1. CNN Model
3.5.2. LSTM Model
3.5.3. BERT Model
3.5.4. Word Cloud
4. Results
5. Discussion
6. Conclusions
Author Contributions
Funding
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| EVD | Ebola Virus Disease |
| CNN | Convolutional Neural Network |
| LSTM | Long-Short-Term Memory Model ( |
| BERT | Bidirectional Encoder Representations from Transformers |
References
- Jacob, Shevin T., Ian Crozier, William A. Fischer, Angela Hewlett, Colleen S. Kraft, Marc-Antoine de La Vega, Moses J. Soka, Victoria Wahl, Anthony Griffiths, Laura Bollinger, and others. "Ebola virus disease." Nature reviews Disease primers 6.1 (2020): 1-31.
- Denis Malvy, Anita K. McElroy, Hilde de Clerck, Stephan Günther, Johan van Griensven Ebola virus disease, The Lancet, vol. 393, no. 10174, pp. 936–948, 2019, Elsevier.
- JJ Muyembe-Tamfum, M Kipasa, C Kiyungu, R Colebunders, Ebola outbreak in Kikwit, Democratic Republic of the Congo: discovery and control measures,The Journal of infectious diseases, vol. 179, no. Supplement, pp. S259–S262, 1999, The University of Chicago Press.
- Silvia Dallatomasina, Rosa Crestani, James Sylvester Squire, Hilde Declerk, Grazia Marta Caleo, Anja Wolz, Kathryn Stinson, Gabriela Patten, Raphael Brechard, Osman Bamba-Moi Gbabai, and others, Ebola outbreak in rural West Africa: epidemiology, clinical features and outcomes, Tropical Medicine & International Health, vol. 20, no. 4, pp. 448–454, 2015, Wiley Online Library.
- Chih-Peng Tseng, Yu-Jiun Chan, Overview of Ebola virus disease in 2014, Journal of the Chinese Medical Association, vol. 78, no. 1, pp. 51–55, 2015, Elsevier.
- Sima Rugarabamu, Leonard Mboera, Mark Rweyemamu, Gaspary Mwanyika, Julius Lutwama, Janusz Paweska, Gerald Misinzo, Forty-two years of responding to Ebola virus outbreaks in Sub-Saharan Africa: a review, BMJ Global Health, vol. 5, no. 3, pp. e00 1955, 2020, BMJ Specialist Journals.
- P Bres, The epidemic of Ebola haemorrhagic fever in Sudan and Zaire, 1976: introductory note., Bulletin of the World Health Organization, vol. 56, no. 2, pp. 245, 1978, World Health Organization.
- Alicia Rosello, Mathias Mossoko, Stefan Flasche, Albert Jan Van Hoek, Placide Mbala, Anton Camacho, Sebastian Funk, Adam Kucharski, Benoit Kebela Ilunga, W John Edmunds, and others, Ebola virus disease in the Democratic Republic of the Congo, 1976-2014, Elife, vol. 4, pp. e0 9015, 2015, eLife Sciences Publications Limited.
- Marcel Salathé, Shashank Khandelwal, Assessing vaccination sentiments with online social media: implications for infectious disease dynamics and control, PLoS computational biology, vol. 7, no. 10, pp. e100 2199, 2011, Public Library of Science San Francisco, USA.
- Tanushree Mitra, Scott Counts, James W Pennebaker, Understanding anti-vaccination attitudes in social media, Tenth International AAAI Conference on Web and Social Media, 2016.
- Viju Raghupathi, Jie Ren, Wullianallur Raghupathi, Studying public perception about vaccination: A sentiment analysis of tweets, International journal of environmental research and public health, vol. 17, no. 10, pp. 3464, 2020, Multidisciplinary Digital Publishing Institute.
- Walaa Medhat, Ahmed Hassan, Hoda Korashy, Sentiment analysis algorithms and applications: A survey, Ain Shams engineering journal, vol. 5, no. 4, pp. 1093– 1113, 2014, Elsevier.
- Koyel Chakraborty, Surbhi Bhatia, Siddhartha Bhattacharyya, Jan Platos, Rajib Bag, Aboul Ella Hassanien, Sentiment Analysis of COVID-19 tweets by Deep Learning Classifiers—A study to show how popularity is affecting accuracy in social media, Applied Soft Computing, vol. 97, pp. 10 6754, 2020, Elsevier.
- Rohitash Chandra, Aswin Krishna, COVID-19 sentiment analysis via deep learning during the rise of novel cases, PLoS One, vol. 16, no. 8, pp. e025 5615, 2021, Public Library of Science San Francisco, CA USA.
- Ali Shariq Imran, Sher Muhammad Daudpota, Zenun Kastrati, Rakhi Batra, Cross-cultural polarity and emotion detection using sentiment analysis and deep learning on COVID-19 related tweets, Ieee Access, vol. 8, pp. 181074–18 1090, 2020, IEEE.
- CR Rao, Venkat N Gudivada, Computational analysis and understanding of natural languages: principles, methods and applications,2018, Elsevier.
- Furqan Rustam, Madiha Khalid, Waqar Aslam, Vaibhav Rupapara, Arif Mehmood, Gyu Sang Choi, A performance comparison of supervised machine learning models for Covid-19 tweets sentiment analysis, Plos one, vol. 16, no. 2, pp. e024 5909, 2021, Public Library of Science San Francisco, CA USA.
- Anjali Ganesh Jivani, and others, A comparative study of stemming algorithms, Int. J. Comp. Tech. Appl, vol. 2, no. 6, pp. 1930–1938, 2011.
- BP Pande, HS Dhami, Application of natural language processing tools in stemming, International Journal of Computer Applications, vol. 27, no. 6, pp. 14–19, 2011, Citeseer.
- Divya Khyani, BS Siddhartha, NM Niveditha, BM Divya, An Interpretation of Lemmatization and Stemming in Natural Language Processing, Journal of University of Shanghai for Science and Technology, vol. 22, no. 10, pp. 350–357, 2021.
- Mohammed Jabreel, Antonio Moreno, A deep learning-based approach for multi-label emotion classification in tweets, Applied Sciences, vol. 9, no. 6, p. 1123, 2019, MDPI.
- Nalini Chintalapudi, Gopi Battineni, Francesco Amenta, Sentimental analysis of COVID-19 tweets using deep learning models, Infectious Disease Reports, vol. 13, no. 2, pp. 329–339, 2021, MDPI.
- Harleen Kaur, Shafqat Ul Ahsaan, Bhavya Alankar, Victor Chang, A proposed sentiment analysis deep learning algorithm for analyzing COVID-19 tweets, Information Systems Frontiers, vol. 23, no. 6, pp. 1417– 1429, 2021, Springer.
- Mohammad Ehsan Basiri, Shahla Nemati, Moloud Abdar, Somayeh Asadi, U Rajendra Acharrya, A novel fusion-based deep learning model for sentiment analysis of COVID-19 tweets, Knowledge-Based Systems, vol. 228, p. 10 7242, 2021, Elsevier.
- Md Mahbubar Rahman, Muhammad Nazrul Islam, Exploring the performance of ensemble machine learning classifiers for sentiment analysis of COVID-19 tweets, in Sentimental Analysis and Deep Learning: Proceedings of ICSADL 2021, pp. 383–396, 2021, Springer.
- Hui Yin, Shuiqiao Yang, Jianxin Li, Detecting topic and sentiment dynamics due to COVID-19 pandemic using social media, in International conference on advanced data mining and applications, pp. 610–623, 2020, Springer.
- Natt Leelawat, Sirawit Jariyapongpaiboon, Arnon Promjun, Samit Boonyarak, Kumpol Saengtabtim, Ampan Laosunthara, Alfan Kurnia Yudha, Jing Tang, Twitter data sentiment analysis of tourism in Thailand during the COVID-19 pandemic using machine learning, Heliyon, vol. 8, no. 10, p. e1 0894, 2022, Elsevier.
- Zhikang Qin, Elisabetta Ronchieri, Exploring Pandemics Events on Twitter by Using Sentiment Analysis and Topic Modelling, Applied Sciences, vol. 12, no. 23, p. 1 1924, 2022, MDPI.
- Nainika Kaushik, Manjot Kaur Bhatia, Twitter Sentiment Analysis Using K-means and Hierarchical Clustering on COVID Pandemic, in International Conference on Innovative Computing and Communications, pp. 757–769, 2022, Springer.
- M. Odlum and S. Yoon, HIV/AIDS and the millennium development goals: a public sentiment analysis of World AIDS Day Twitter chat, Int J AIDS Res, vol. 3, no. 9, pp. 129–132, 2016.
- C. Song, X.-K. Wang, P.-F. Cheng, J.-q. Wang, and L. Li, SACPC: A framework based on probabilistic linguistic terms for short text sentiment analysis,bKnowledge-Based Systems, vol. 194, p. 105572, 2020, Elsevier.
- L. S. Martinez, M. W. Savage, E. Jones, E. Mikita, V. Yadav, and M.-H. Tsou, Examining Vaccine Sentiment on Twitter and Local Vaccine Deployment during the COVID-19 Pandemic, International Journal of Environmental Research and Public Health, vol. 20, no. 1, p. 354, 2022, MDPI.
- V. Balakrishnan and E. Lloyd-Yemoh, Stemming and lemmatization: A comparison of retrieval performances, UM Research Repository, vol. 170, no. 2, pp. 174–179, 2014.
- A. Jacovi, O. S. Shalom, and Y. Goldberg, Understanding convolutional neural networks for text classification, arXiv preprint arXiv:1809.08037, vol. 3, no. 1, p. 10, 2018.
- J. Wang, Z. Wang, D. Zhang, and J. Yan, Combining Knowledge with Deep Convolutional Neural Networks for Short Text Classification, in IJCAI, vol. 350, pp. 3172077–3172295, 2017.
- P. Song, C. Geng, and Z. Li, Research on text classification based on convolutional neural network, in 2019 International Conference on Computer Network, Electronic and Automation (ICCNEA), pp. 229–232, 2019, IEEE.
- Y. LeCun, Y. Bengio, and G. Hinton, Deep learning, nature, vol. 521, no. 7553, pp. 436–444, 2015, Nature Publishing Group.
- J. Wang, L.-C. Yu, K. R. Lai, and X. Zhang, Dimensional sentiment analysis using a regional CNN-LSTM model, in Proceedings of the 54th annual meeting of the association for computational linguistics (volume 2: Short papers), pp. 225–230, 2016.
- F. Huang, X. Li, C. Yuan, S. Zhang, J. Zhang, and S. Qiao, Attention-emotion-enhanced convolutional LSTM for sentiment analysis, IEEE transactions on neural networks and learning systems, vol. 33, no. 9, pp. 4332–4345, 2021, IEEE.
- S. Liao, J. S. Liao, J. Wang, R. Yu, K. Sato, and Z. Cheng, CNN for situations understanding based on sentiment analysis of twitter data, Procedia computer science, vol. 111, pp. 376–381, 2017, Elsevier.
- Sepp Hochreiter and Jürgen Schmidhuber, Long short-term memory, Neural computation, vol. 9, no. 8, pp. 1735– 1780, 1997, MIT Press.
- Yuandong Luan and Shaofu Lin, Research on text classification based on CNN and LSTM, 2019 IEEE international conference on artificial intelligence and computer applications (ICAICA), pp. 352–355, 2019, IEEE.
- Hu Xu, Bing Liu, Lei Shu, and Philip S. Yu, BERT post-training for review reading comprehension and aspect-based sentiment analysis, arXiv preprint arXiv:1904.02232, vol. 2, no. 1, p. 12, 2019.
- Xin Li, Lidong Bing, Wenxuan Zhang, and Wai Lam, Exploiting BERT for end-to-end aspect-based sentiment analysis, arXiv preprint arXiv:1910.00883, vol. 2, no. 1910, p. 8, 2019.
- Shivaji Alaparthi and Manit Mishra, BERT: A sentiment analysis odyssey, Journal of Marketing Analytics, vol. 9, no. 2, pp. 118–126, 2021,Springer.
- Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova, Bert: Pre-training of deep bidirectional transformers for language understanding, arXiv preprint arXiv:1810.04805, vol. 2, no. 1, p. 16, 2018.
- Moritz Laurer, W v Atteveldt, Andreu Casas, and Kasper Welbers, Less annotating, more classifying–addressing the data scarcity issue of supervised machine learning with deep transfer learning and bert-nli, May 2022.
- José Alberto Benítez-Andrades, Álvaro González-Jiménez, Álvaro López-Brea, Jose Aveleira-Mata, José-Manuel Alija-Pérez, and María Teresa García-Ordás, Detecting racism and xenophobia using deep learning models on Twitter data: CNN, LSTM and BERT, PeerJ Computer Science, vol. 8, p. e906, 2022,PeerJ Inc.
- Irwan Bello, Barret Zoph, Vijay Vasudevan, and Quoc V Le, Neural optimizer search with reinforcement learning, International Conference on Machine Learning, pp. 459–468, 2017, PMLR.
- M. P., Renuka, D. K, Improving the performance of aspect-based sentiment analysis using fine-tuned Bert Base Uncased model, International Journal of Intelligent Networks, 2, 64-69,2021.
- Ouyang, Xi and Zhou, Pan and Li, Cheng Hua and Liu, Lijun, Sentiment analysis using convolutional neural network, 2015 IEEE international conference on computer and information technology; ubiquitous computing and communications; dependable, autonomic and secure computing; pervasive intelligence and computing,pp. 2359–2364,2015.


| Parameter | Value |
|---|---|
| Learning Rate | |
| Epochs | 10 |
| Batch Size | 64 |
| Kernel Size | |
| Dropout | 0.5 |
| Activation | Softmax |
| Optimizer | Adam |
| Loss | sparse_categorical_crossentropy |
| Models | Accuracy | Precision | Recall | F1-Score | |
|---|---|---|---|---|---|
| CNN | 0.77 | 0.81 | 0.74 | 0.74 | |
| LSTM | 0.87 | 0.91 | 0.84 | 0.85 | |
| BERT | 0.95 | 0.96 | 0.93 | 0.94 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).