Submitted:
20 December 2023
Posted:
22 December 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Research Title:
1.2. String Development:
- For this SLR, keywords were identified for the research title.
- Three synonyms for each keyword were identified.
- Strings were developed according to three synonym for each keyword.
- In total 15 strings were formed.
1.3. Searching protocol:
- All research articles published in the years
- 2019,
- 2020,
- 2021,
- 2022,
- 2023 were searched.
-
Three research databases
- IEEE,
- ACM,
- Elsevier were used for searching.
1.4. Inclusion Criteria:
- All papers from years 2019, 2020, 2021, 2022, 2023 were included.
- Papers from three pages of research articles databases were included.
- Papers that were relevant to research strings were included.
- Papers that are not published yet were not included.
1.5. Screening:
2. Title Based screening:
3. Abstract Based Screening:
4. Objective Based Screening:

5. Detailed Literature:
3. Critical Analysis:
| Detection Algorithm | Effort Year | Technique | Performance Metrics | Shortcoming |
| [1] | 2021 | Abstractive and Extractive summarization of reviews with the use of PCA and SVD. | Precision, Recall, F-Measure | PCA cannot be used for larger datasets and SVD is appropriate only for linear datasets |
| [2] | 2023 | Term frequency-inverse document frequency (TF-IDF), n-gram features & emoticon polarities , Long Short Term Memory (LSTM) | Precision , accuracy, Recall and F-1 Score, AUC | LSTM requires a longer time to process. [14] |
| [3] | 2019 | Hybrid Technique, KNN | Accuracy | KNN is not efficient when training data increases. It has Poor performance on imbalanced data If Optimal value of K is chosen incorrectly, the model will be under or over fitted to the data |
| [4] | 2019 | Reviews classification with Naïve Bayes | High Precision, recall with low computational cost. |
|
| [5] | 2021 | Machine Learning Techniques for reviews classification | Provide novel ML-CSA framework and provides guidelines that have never been provided in earlier SLRs. [5] | The proposed framework is based on research till year 2020, some of the research is not tested. [5] |
| [6] | 2023 | multitask learning model with main focus on aspect polarity classification | Accuracy, Macroaverage F1 Score | In the process of training of an MTL network, multiple tasks can compete with each other to get a better learning representation and one or more tasks can dominate the training process. [16] Also the loss function of MTL can be complex because of multiple summed losses, therefore making the optimization more difficult. [17] |
| [7] | 2023 | Review of Machine Learning and Deep Learning Techniques | Avg. Recall, Avg. Accuracy, Avg. Precision | Classical machine learning methods cannot work better for larger and higher dimensions data[18]. Emotion detection datasets are often limited in size and quality, making it challenging to train accurate and generalizable models . |
| [8] | 2020 | Deep Learning based techniques are compared | Area under the receiver operating characteristics curve (AUROC) | In sentiment classification based on deep learning models , the best model structure depends on the characteristics of the dataset on which this model is trained. Also, model is manually selected based on the domain knowledge of an expert or selected from a grid search of possible candidates. [8] |
| [9] | 2020 | Naive Bayes Classifier, Logistic Regression, Support Vector Machine (SVM), and Random Forest algorithms are compared. Random Forest performed best. | Accuracy, precision, Recall, F-1 Score | A Random Forest can’t generalize. It can only make a prediction based on previously observed labels. [19] |
| [10] | 2022 | Random Forest , K-Nearest Neighbour (KNN) | Accuracy, precision, Recall, F-1 Score | A Random Forest can’t generalize. It can only make a prediction based on previously observed labels.[19] With KNN, classification is slow when dataset is larger. Also it cannot deal with missing values. [20] |
| [11] | 2019 | Bernoulli Naïve Bayes (BNB), Decision Tree (DE), Support Vector Machine (SVM), Maximum Entropy (ME), as well as Multinomial Naïve Bayes (MNB) | Accuracy, Precision and F-1score, Recall. | Early convergence and the cold start issues, encountered in the multinomial models [21]. |
| [12] | 2020 | Bag-of-Words for feature Extraction, Naïve Bayes for review classification, word2vec for feature extraction for summarization, Semantic Clustering for summarization generation | Accuracy, Precision and F-measure, Recall. | For larger data sets, bag of words technique cannot work best. As feature dimension is dependent on unique tokens. Also Bag of words does not preserve the relationships between tokens[22]. Word2Vec cannot understand the words that are not available in training data. Also it ignores the formation of words with same meaning[22]. |
| [13] | 2019 | Latent Dirichlet Allocation (E-LDA) | F-1 Score | Limited number of generated topics, unsupervised and sentence structure is not modeled[23]. |
4. Research Challenges:
6. Conclusions
References
- S. Bhatia, "A Comparative Study of Opinion Summarization Techniques," in IEEE Transactions on Computational Social Systems, vol. 8, no. 1, pp. 110-117, Feb. 2021. [CrossRef]
- Kaur, G., Sharma, A. “A deep learning-based model using hybrid feature extraction approach for consumer sentiment analysis” J Big Data 10, 5 (2023). [CrossRef]
- N. Yadav, R. Kumar, B. Gour and A. U. Khan, "Extraction-Based Text Summarization and Sentiment Analysis of Online Reviews Using Hybrid Classification Method," 2019 Sixteenth International Conference on Wireless and Optical Communication Networks (WOCN), Bhopal, India, 2019, pp. 1-6. [CrossRef]
- Manuel J. Sánchez-Franco, Antonio Navarro-García, Francisco Javier Rondán-Cataluña, “A naive Bayes strategy for classifying customer satisfaction: A study based on online reviews of hospitality services” , Journal of Business Research, Volume 101, 2019, Pages 499-506. [CrossRef]
- Praphula Kumar Jain, Rajendra Pamula, Gautam Srivastava, A systematic literature review on machine learning applications for consumer sentiment analysis using online reviews, Computer Science Review,Volume 41, 2021. [CrossRef]
- Zhao, Guoshuai & Luo, Yiling & Chen, Qiang & Qian, Xueming. Aspect-based sentiment analysis via multitask learning for online reviews. (2023) Knowledge-Based Systems. 264. 110326. [CrossRef]
- Hung, Lai & Alias, Suraya. Beyond Sentiment Analysis: A Review of Recent Trends in Text-Based Sentiment Analysis and Emotion Detection. (2023). Journal of Advanced Computational Intelligence and Intelligent Informatics. 27. 84-95. [CrossRef]
- S. Seo, C. Kim, H. Kim, K. Mo and P. Kang, "Comparative Study of Deep Learning-Based Sentiment Classification," in IEEE Access, vol. 8, pp. 6861-6875, 2020. [CrossRef]
- K. Zahoor, N. Z. Bawany and S. Hamid, "Sentiment Analysis and Classification of Restaurant Reviews using Machine Learning," 2020 21st International Arab Conference on Information Technology (ACIT), Giza, Egypt, 2020, pp. 1-6. [CrossRef]
- Dadhich, A., Thankachan, B. (2022). Sentiment Analysis of Amazon Product Reviews Using Hybrid Rule-Based Approach. In: Somani, A.K., Mundra, A., Doss, R., Bhattacharya, S. (eds) Smart Systems: Innovations in Computing. Smart Innovation, Systems and Technologies, vol 235. Springer, Singapore. [CrossRef]
- A. Rahman and M. S. Hossen, "Sentiment Analysis on Movie Review Data Using Machine Learning Approach," 2019 International Conference on Bangla Speech and Language Processing (ICBSLP), Sylhet, Bangladesh, 2019, pp. 1-4. [CrossRef]
- Atif Khan, Muhammad Adnan Gul, M. Irfan Uddin, Syed Atif Ali Shah, Shafiq zAhmad, Muhammad Dzulqarnain Al Firdausi, Mazen Zaindin, "Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics", Scientific Programming, vol. 2020, Article ID 5812715, 13 pages, 2020. [CrossRef]
- Sharan Srinivas, Suchithra Rajendran, Topic-based knowledge mining of online student reviews for strategic planning in universities, Computers & Industrial Engineering, Volume 128, 2019, Pages 974-984,. [CrossRef]
- N. C. Dang, M. N. Moreno-García, and F. De la Prieta, “Sentiment analysis based on deep learning: a comparative study,” Electronics, vol. 9, no. 3, p. 483, 2020. [CrossRef]
- Naïve Bayes Algorithm's Advantages and Disadvantages | Data Science and Machine Learning | Kaggle.
- Alex Kendall, Yarin Gal, Roberto Cipolla , “Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics”. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2018. [CrossRef]
- How to Do Multi-Task Learning Intelligently (thegradient.pub).
- Cai, Y.; Li, X.; Li, J. Emotion Recognition Using Different Sensors, Emotion Models, Methods and Datasets: A Comprehensive Review. Sensors, 2023; 23, 2455. [Google Scholar] [CrossRef]
- A limitation of Random Forest Regression | by Ben Thompson | Towards Data Science.
- Introduction to Classification Using K Nearest Neighbours | by Ashwin Raj | Towards Data Science.
- Multinomial Naїve Bayes’ For Documents Classification and Natural Language Processing (NLP) | by Arthur V. Ratz | Towards Data Science.
- All You Need to Know About Bag of Words and Word2Vec — Text Feature Extraction | by Albers Uzila | Towards Data Science.
- Latent Dirichlet Allocation. Latent Dirichlet Allocation (LDA) is… | by Harsh Bansal | Analytics Vidhya | Medium.
- Shahid, H. , Ashraf, H., Javed, H., Humayun, M., Jhanjhi, N. Z., & AlZain, M. A. (2021). Energy optimised security against wormhole attack in iot-based wireless sensor networks. Comput. Mater. Contin, 68(2), 1967-81. [CrossRef]
- Wassan, S. , Chen, X., Shen, T., Waqar, M., & Jhanjhi, N. Z. (2021). Amazon product sentiment analysis using machine learning techniques. Revista Argentina de Clínica Psicológica, 30(1), 695. [CrossRef]
- Almusaylim, Z. A. , Zaman, N., & Jung, L. T. (2018, August). Proposing a data privacy aware protocol for roadside accident video reporting service using 5G in Vehicular Cloud Networks Environment. In 2018 4th International conference on computer and information sciences (ICCOINS) (pp. 1-5). IEEE. [CrossRef]
- Ghosh, G. , Verma, S., Jhanjhi, N. Z., & Talib, M. N. (2020, December). Secure surveillance system using chaotic image encryption technique. In IOP conference series: materials science and engineering (Vol. 993, No. 1, p. 012062). IOP Publishing. [CrossRef]
- Humayun, M., Alsaqer, M. S., & Jhanjhi, N. (2022). Energy optimization for smart cities using iot. Applied Artificial Intelligence, 36(1), 2037255. [CrossRef]
- Hussain, S. J. , Ahmed, U., Liaquat, H., Mir, S., Jhanjhi, N. Z., & Humayun, M. (2019, April). IMIAD: intelligent malware identification for android platform. In 2019 International Conference on Computer and Information Sciences (ICCIS) (pp. 1-6). IEEE. [CrossRef]
- Diwaker, C., Tomar, P., Solanki, A., Nayyar, A., Jhanjhi, N. Z., Abdullah, A., & Supramaniam, M. (2019). A new model for predicting component-based software reliability using soft computing. IEEE Access, 7, 147191-147203. [CrossRef]
- Gaur, L., Afaq, A., Solanki, A., Singh, G., Sharma, S., Jhanjhi, N. Z., ... & Le, D. N. (2021). Capitalizing on big data and revolutionary 5G technology: Extracting and visualizing ratings and reviews of global chain hotels. Computers and Electrical Engineering, 95, 107374. [CrossRef]
- Nanglia, S., Ahmad, M., Khan, F. A., & Jhanjhi, N. Z. (2022). An enhanced Predictive heterogeneous ensemble model for breast cancer prediction. Biomedical Signal Processing and Control, 72, 103279. [CrossRef]
- Kumar, T., Pandey, B., Mussavi, S. H. A., & Zaman, N. (2015). CTHS based energy efficient thermal aware image ALU design on FPGA. Wireless Personal Communications, 85, 671-696. [CrossRef]
- Gaur, L., Singh, G., Solanki, A., Jhanjhi, N. Z., Bhatia, U., Sharma, S., ... & Kim, W. (2021). Disposition of youth in predicting sustainable development goals using the neuro-fuzzy and random forest algorithms. Human-Centric Computing and Information Sciences, 11, NA.
- Lim, M. , Abdullah, A., & Jhanjhi, N. Z. (2021). Performance optimization of criminal network hidden link prediction model with deep reinforcement learning. Journal of King Saud University-Computer and Information Sciences, 33(10), 1202-1210. [CrossRef]

| Keyword | Synonym1 | Synonym2 | Synonym3 |
| Classification | Categorizing | Grouping | Organization |
| Summarization | Abstract Generation | Encapsulation | Outline |
| Online Review | Online Evaluation | Online Audit | Online Survey |
| Approach | Framework | Perspective | Perception |
| No. | Strings |
| 1. | Classification and summarization of online reviews- A machine learning approach |
| 2. | Categorizing and summarization of online reviews – A machine learning approach |
| 3. | Grouping and summarization of online reviews- A machine learning approach |
| 4. | and summarization of online reviews- A machine learning approach |
| 5. | Organization and summarization of online reviews- A machine learning approach |
| 6. | Classification and Abstract Generation of online reviews- A machine learning approach |
| 7. | Classification and encapsulation of online reviews- A machine learning approach |
| 8. | Classification and summing up of online reviews- A machine learning approach |
| 9. | Classification and summarization of online Evaluations- A machine learning approach |
| 10. | Classification and summarization of online audits- A machine learning approach |
| 11. | Classification and summarization of online Surveys- A machine learning approach |
| 12. | Classification and summarization of online Surveys- A machine learning Framework |
| 13. | Classification and summarization of online Surveys- A machine learning Perspective |
| 14. | Classification and summarization of online Surveys- A machine learning Perception |
| Objective | Abbreviation |
| Precision | PR |
| Recall | RC |
| Accuracy | ACC |
| F-Score | F-1 SC |
| Summarization | SUM |
| Classification | CLS |
| Machine Leaning and Deep learning Techniques | ML &DL |
| Aspect Polarity Classification | APC |
| Emotion Detection | ED |
| Topic Modeling | TM |
| Reference. | PR. | RC | ACC | F-1 SC | SUM | CLS | ML&DL | APC | ED | TM |
| 1 | - | - | - | - | ✓ | - | - | - | - | - |
| 2 | - | - | - | - | ✓ | ✓ | ✓ | - | - | - |
| 3 | ✓ | ✓ | ✓ | ✓ | - | ✓ | ✓ | - | - | - |
| 4 | ✓ | ✓ | - | - | - | ✓ | - | - | - | - |
| 5 | - | - | - | - | - | - | ✓ | - | - | - |
| 6 | - | - | - | - | - | ✓ | - | ✓ | - | - |
| 7 | - | - | - | - | - | ✓ | - | - | ✓ | - |
| 8 | - | - | - | - | - | ✓ | ✓ | - | - | - |
| 9 | - | - | ✓ | - | - | ✓ | - | - | - | - |
| 10 | - | - | - | - | - | ✓ | ✓ | - | - | - |
| 11 | ✓ | ✓ | ✓ | ✓ | - | ✓ | ✓ | - | - | - |
| 12 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | - | - | - | - |
| 13 | - | - | - | ✓ | - | ✓ | - | - | - | ✓ |
| Article | Objectives | Inclusion Criteria | Decision |
| 1 | To proposes a novel graph-based technique for generating abstractive summaries of duplicate sentences | Paper is relevant to research where graph based technique is used to generate summaries of sentences. | Included |
| 2 | To perform sentiment classification from the point of view of the consumer review summarization model | Paper is relevant to research where hybrid approach is used for sentiment analysis. | Included |
| 3 | To develop a hybrid model for sentiment analysis using convolutional neural network-long short-term memory | Relevant to research where ML &DL techniques are used. | Included |
| 4 | A study is presented to identify relevant features that could be used to classify customer satisfaction. | Relevant to research | Included |
| 5 | Combining machine learning (ML) techniques for consumer sentiment analysis (CSA) in the domain of hospitality and tourism | ML and DL techniques combined and analyzed for sentiment analysis | Included |
| 6 | To identify aspect terms in online reviews and performs aspect polarity classification. | The paper is the most recent and related to research. | Included |
| 7 | Analyze recent trends in sentiment analysis and text-based emotion detection. | The paper is relevant to the research. | Included |
| 8 | Comparing multiple deep learning models for sentiment classification using different datasets | Deep Learning techniques for sentiment analysis. | Included |
| 9 | Analyze the sentiments as positive and negative and classify the text based on feedback. | Different machine learning techniques are used for sentiment analysis to find the best one. | Included |
| 10 | To present an approach for automatic comment analysis and classification | The paper is relevant to the research where Machine Learning techniques are used. | Included |
| 11 | Comparison of multiple machine learning algorithms for sentiment analysis and classification | The Paper is relevant to the research where Machine Learning techniques are used. | Included |
| 12 | Classification of sentiments and summarization of reviews with optimal values for Precision, recall, Accuracy, F-1 Score. | Paper is relevant to research where Machine learning techniques are used and paper is latest one. | Included |
| 13 | Analyzing online reviews by integrating multiple techniques including topic modeling | The paper is relevant to the research. | Included |
| Technique | Challenge |
| Machine Learning Based Algorithms [1,2,4,5,7,9,11,12] |
With some algorithms, longer time is required to process the data. Naive Bayes has less in real-world use cases. This technique does not provide best results for larger and high dimensional dataset. |
| Latent Dirichlet Allocation (E-LDA) [13] | Old technique for topic modeling where limited number of topics can be generated. It is unsupervised technique. |
| Deep Learning Based Techniques [8,10] |
The best model structure depends on the characteristics of the dataset on which this model is trained. Model is manually selected. With some models, classification is slow when dataset is larger. Also cannot deal with missing values. . |
| Multi Task Learning Models [6] | Multiple tasks can compete with each other while training of an MTL network and one or more tasks can dominate the training process. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).