Submitted:
29 August 2025
Posted:
01 September 2025
You are already at the latest version
Abstract
Keywords:
Introduction:
2. Literature Review

3. Methodology
- Dictionary creation: A custom dictionary of positive and negative sentiment words was developed. Corresponding tables were created in the sentiment database to store these classified terms.
- Preprocessing and cleaning– Reviews were manually cleaned to remove URLs, usernames, redundant information, duplicate sentences, and unnecessary symbols. Informal colloquial expressions and misspelled words were standardized. Stop words were filtered out, and special attention was given to handling negations, as they significantly alter sentiment polarity (e.g., “not good” → negative).
| Word | Sentiment | Word | Sentiment |
|---|---|---|---|
| Never | Negative | Good | Positive |
| choice | Neutral | Generally considered | Neutral |
| remove | Negative | wasting | Negative |
| do | Positive | mess up | Negative |
| don’t | Negative | Best | Positive |
| prefer | Neutral | Wasting | Negative |
- 2.
- Tools used: “R” programming packages such as textclean and tidytext were employed for preprocessing and sentiment analysis.
- 3.
- Lemmatization: Lemmatization was applied for sentiment scoring because it provided higher accuracy than stemming by ensuring that words were reduced to valid root forms found in the dictionary (Figure 4).
- 4.
- Sentiment scoring: Finally, sentiment scores were calculated by comparing the processed reviews with the entries in the sentiment dictionary. The sampled data was categorized as positive, negative, or neutral based on this matching process.
4. Sentiment Analysis Techniques
- Dataset Size – The dataset of reviews from the coaching industry was relatively small. Supervised machine learning models typically require large, balanced datasets for training, which were not available in this case.
- Data Imbalance – The presence of a limited number of fake or deceptive reviews would have posed a high risk of misclassification in machine learning models, which tend to struggle with skewed class distributions.
- Rapid Classification – The lexicon approach enabled efficient large-scale classification without the need for extensive manual labeling, making it practical for the exploratory scope of this study.
5. Mechanisms of Digital Deception:


6. Case Studies of Digital Deception in India’s Coaching Sector:
| Case No. | Type of Deception | Coaching Segment | Location | Mechanism / Method | Psychological Bias Exploited | Key Outcome / Impact |
|---|---|---|---|---|---|---|
| 1 | Misleading Advertisements | General | India | Single student’s success advertised as entire batch achievement | Halo Effect | Manufactured trust; inflated perception of overall quality |
| 2 | Bot-Generated Reviews | IIT-JEE | Delhi | Scripted reviews, fake profiles on Google/Facebook | Bandwagon Effect | Artificial reputation boost; misled aspirants |
| 3 | Fake AIR Claims | IIT JEE | Pune | Unverified rank advertised as AIR | FOMO, Bandwagon Effect | Students/parents influenced to enrol; psychological manipulation |
| 4 | Identity Forgery | NEET | North India | Same Topper for all Institutes. | FOMO, Local Trust Bias | Exploited local community trust; doctored success narratives |
| 5 | Misrepresentation Fined by Regulators | IIT-JEE | Delhi | Inflated admission claims: regulator notices issued | Halo Effect, Bandwagon Effect | Regulatory fines issued; public awareness raised, but enforcement limited |
7. Results and Analysis
| 1 | Never join Aakash. It's literally a trash kiddo Akash never ya choice is good for having offline studies but remove bc aaksh it's the worst thing happen in my life |
Negative |
| 2 | choose Allen ya vmc, both are great, have had interaction with both students | Positive |
| 3 | join Allen or fiitjee, don't join VMC they recently changed their curriculum, and it is very messed up | Negative |
| 4 | Never ever join FIITJEE punjabi bagh, and any Allen Delhi branch. | Negative bias |
| 5 | Vmc Pitampura is good | Positive |
| 6 | You can prefer Allen since they have given u 80% scholarship | Positive |
| 7 | take a risk and go for a dummy. You will be wasting time whole day in school | Negative |
| 8 | Allen for sure never join fiitjee | Negative bias |
| 9 | Join Allen. When they opened the institute in Delhi and NCR 2 years ago, they took some best teachers of other coaching into their own. | Positive |
| 10 | dont join fiitjee and akaash. it’s hard being disciplined in online classes, so also remove pw | Negative bias |
| 11 | Either allen or vmc. allen is generally considered better than vmc. so, join allen. | Positive |
| 12 | your best bet will be allen. and i'll say risk it and join dummy school | Negative |
| 13 | In delhi none of the above institute have competition enviroment like Fiitjee punjabibagh. It is the best. You will not regret. | Negative |
| 14 | For off-line in Delhi NCR, the Best is Vidya Mandir classes. | Positive |
| 15 | if ur getting 100% scholarship in fiitjee PB (which is arguably the best fiitjee rn) then ur probably get into the top batches. | Positive |
8. Sentiment Analysis Findings:
9. Regulatory Insights:


10. Recommended framework:
11. Conclusion
References
- Mite-Baidal, K. , Delgado-Vera, C., Solís-Avilés, E., Espinoza, A. H., Ortiz-Zambrano, J., & Varela-Tapia, E. (2018, October). Sentiment analysis in education domain: A systematic literature review. In International conference on technologies and innovation (pp. 285-297). Cham: Springer International Publishing.
- Kwek, C. L. , Lei, B., Leong, L. Y., & Peh, Y. X. (2020, June). The Impacts of Online Comments and Bandwagon Effect on the Perceived Credibility of the Information in Social Commerce: The Moderating Role of Perceived Acceptance. In 8th International Conference of Entrepreneurship and Business Management Untar (ICEBM 2019) (pp. 451-460). Atlantis Press.
- Molina, M. D. , Wang, J., Sundar, S. S., Le, T., & DiRusso, C. Reading, commenting and sharing of fake news: How online bandwagons and bots dictate user engagement. Communication Research 2023, 50, 667–694. [Google Scholar]
- Gandhi, A. , Hollenbeck, B., & Li, Z. (2024). Misinformation and Mistrust: The Equilibrium Effects of Fake Reviews on Amazon. com, Working Paper.
- Munusamy, S. , Syasyila, K., Shaari, A. A. H., Pitchan, M. A., Kamaluddin, M. R., & Jatnika, R. Psychological factors contributing to the creation and dissemination of fake news among social media users: a systematic review. BMC psychology 2024, 12, 673. [Google Scholar]
- Emre, İ. E. , & Köse, G. G. (). UNDERSTANDING FEAR OF MISSING OUT PHENOMENA AND SOCIAL MEDIA USING BIBLIOMETRIC ANALYSIS (2013-2023). Öneri Dergisi 2025, 20, 197–225. [Google Scholar]
- Gullipalli, V. S., & Dholey, M. K. (n.d.). Sentiment analysis of colleges in India. International Research Journal of Engineering and Technology (IRJET), 11(03).
- Chandurkar, T. , & Tijare, D. P. (2021). Sentiment analysis: A review and comparative analysis on colleges. International Journal of Scientific Research in Computer Science, Engineering, and Information Technology, 526-531.
- Kandhro, I. A. , Wasi, S., Kumar, K., Rind, M., & Ameen, M. Sentiment analysis of students’ comment using long-short term model. Indian Journal of Science and Technology 2019, 12, 1–16. [Google Scholar]
- Aung, K. Z. , & Myo, N. N. (2017, May). Sentiment analysis of students' comment using lexicon-based approach. In 2017 IEEE/ACIS 16th international conference on computer and information science (ICIS) (pp. 149-154). IEEE.
- Mamgain, N. , Mehta, E., Mittal, A., & Bhatt, G. (2016, March). Sentiment analysis of top colleges in India using Twitter data. In 2016 international conference on computational techniques in information and communication technologies (ICCTICT) (pp. 525-530). IEEE.
- https://www.ascionline.in/wp-content/uploads/2024/09/agm-report-20-Aug-2024.pdf.
- Varthana. (n.d.). Dissecting private educational expenditures in Indian households.https://varthana.com/school/dissecting-private-educational-expenditures-in-indian-households/.
- Rajesh, N. , & Ramachandra, A. C. (2023, November). Fake reviews detection based on sentiment analysis using ml classifiers. In 2023 International Conference on Ambient Intelligence, Knowledge Informatics and Industrial Electronics (AIKIIE) (pp. 1-6). IEEE.
- Kauffmann, E. , Peral, J., Gil, D., Ferrández, A., Sellers, R., & Mora, H. A framework for big data analytics in commercial social networks: A case study on sentiment analysis and fake review detection for marketing decision-making. Industrial Marketing Management 2020, 90, 523–537. [Google Scholar]
- Tap-a-gain. (n.d.). Narayana advertisement controversy: Misleading ads in India. https://tap-a-gain.com/narayana-advertisement-controversy-misleading-ads-india/.
- https://www.reddit.com/r/JEENEETards/comments/1gmeix2/which_institute_to_join/.
- IIT JEE Pune Wellwishers. (2011, ). IIT JEE Pune fake results by coaching institutes. https://iitjeepunewellwishers.wordpress.com/2011/08/05/iit-jee-pune-fake-results-by-coaching-institutes-2/. 5 August 2011.
- Reddit. (n.d.). The same girl studied at three coaching institutes. https://www.reddit.com/r/india/comments/xdvzia/the_same_girl_studied_at_three_coaching/.
- Business Standard. (n.d.). IITPK fined for misleading IIT-JEE results advertisements. https://www.business-standard.com/india-news/iitpk-fined-misleading-iit-jee-results-ads-125021400896_1.html.





Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).