Submitted:
18 March 2025
Posted:
18 March 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Literature Review
3. Materials and Methods
3. Results and Discussion
3.1. Two-Stage Classification
3.2. Genre Incorporated Model
3.3. Practical Implications
3.3. Deployment
4. Conclusions
References
- Wallin, N. L., Merker, B., & Brown, S., “The Origins of Music” in Google Books, MIT Press, (2001).
- Petrušić, D., “New Challenges to Education: Lessons from around the World” in BCES Conference Books, (2021).
- Ramadhan, L., “American Subculture: An Identity Transformation of Hip Hop” in Rubikon, (2023), pp. 190.
- Frisby, C. M., & Behm-morawitz, E., “Undressing the Words: Prevalence of Profanity, Misogyny, Violence, and Gender Role References in Popular Music from 2006-2016” in Media Watch, (2019).
- Steinke, J., Lapinski, M., Zietsman-Thomas, A., Nwulu, P., Crocker, N., Williams, Y., Higdon, S., & Kuchibhotla, S., “Middle School-Aged Children’s Attitudes toward Women in Science, Engineering, and Technology and the Effects of Media Literacy Training” in Journal of Women and Minorities in Science and Engineering, (2007), pp. 295–323. [CrossRef]
- Keenan-Kroff, S. L., Coyne, S. M., Shawcroft, J., Sheppard, J. A., James, S. L., Ehrenreich, S. E., & Underwood, M., “Associations between Sexual Music Lyrics and Sexting across Adolescence” in Computers in Human Behavior, (2023). [CrossRef]
- Bogt, T. T., Hale, W. W., & Becht, A., “Wild Years: Rock Music, Problem Behaviors and Mental Well-being in Adolescence and Young Adulthood” in Journal of Youth and Adolescence, (2021).
- Im, H., Song, H., & Jung, J., “The Effect of Streaming Services on the Concentration of Digital Music Consumption” in Information Technology & People, (2019). [CrossRef]
- RIAA, “Year-End 2022 RIAA Revenue Statistics” in RIAA, (2022).
- Anantrasirichai, N., & Bull, D., “Artificial Intelligence in the Creative Industries: A Review” in Artificial Intelligence Review, (2021). [CrossRef]
- Agrawal, Y., Shanker, R. G. R., & Alluri, V., “Transformer-Based Approach Towards Music Emotion Recognition from Lyrics” in Lecture Notes in Computer Science, (2021), pp. 167–175. [CrossRef]
- Incekas, A. B., & Asan, U., “Data Driven Positioning Analysis of Music Streaming Platforms” in Lecture Notes in Networks and Systems, (2023), pp. 634–641. [CrossRef]
- Fell, M., Cabrio, E., Corazza, M., & Gandon, F., “Comparing Automated Methods to Detect Explicit Content in Song Lyrics”, (2019).
- Salam, A., Ghosh, R., Gosh, K., & Sarker, I., “Toxicity Classification on Music Lyrics Using Machine Learning Algorithms” in International Conference on Computer and Information Technology (ICCIT), (2021). [CrossRef]
- Chin, H., Kim, J., Kim, Y., Shin, J., & Yi, Mun. Y., “Explicit Content Detection in Music Lyrics Using Machine Learning” in IEEE Xplore, (2018)sing Machine Learning” in IEEE Xplore, (2018). [CrossRef]
- Bergelid, L., “Classification of Explicit Music Content Using Lyrics and Music Metadata” in Unpublished, (2018).
- Mayerl, M., Vötter, M., Moosleitner, M., & Zangerle, E., “Comparing Lyrics Features for Genre Recognition” in ACLWeb, (2020).
- Egivenia, Setiawan, G. R., Mintara, S. S., & Suhartono, D., “Classification of Explicit Music Content Based on Lyrics, Music Metadata, and User Annotation” in 6th International Conference on Sustainable Information Engineering and Technology 2021, (2021).
- Bolla, B. K., Pattnaik, S. R., & Patra, S., “Detection of Objectionable Song Lyrics Using Weakly Supervised Learning and Natural Language Processing Techniques” in Procedia Computer Science, (2023). [CrossRef]
- Rospocher, M., “Explicit Song Lyrics Detection with Subword-Enriched Word Embeddings” in Expert Systems with Applications, (2021). [CrossRef]
- Rospocher, M., “On Exploiting Transformers for Detecting Explicit Song Lyrics” in Entertainment Computing, (2022). [CrossRef]
- Akalp, H., Furkan Cigdem, E., Yilmaz, S., Bolucu, N., & Can, B., “Language Representation Models for Music Genre Classification Using Lyrics” in 2021 International Symposium on Electrical, Electronics and Information Engineering, (2021).
- Li, Y., Zhang, Z., Ding, H., & Chang, L., “Music Genre Classification Based on Fusing Audio and Lyric Information” in Multimedia Tools and Applications, (2022). [CrossRef]
- Thompson, C., “Lyric-Based Classification of Music Genres Using Hand-Crafted Features” in Reinvention: An International Journal of Undergraduate Research, (2021).




| Author | Dataset | Class Task | Model + Embedding | Best Performance | |||
| Accuracy | Precision | Recall | F1 Score | ||||
| Salam et al. (2021) | Lyrics | Explicitness | XGB, RF, LR | 0.94 | 0.96 | 0.92 | 0.94 |
| Chin et al. (2018) | Lyrics | Explicitness | Adaboost, Bagging TF-IDF |
ns | 0.70 | 0.55 | 0.62 |
| Bergelid (2018) | Lyrics Other Features |
Explicitness | RF, SVM, NB, KNN TF-IDF, D2V |
0.85 | 0.93 | 0.76 | 0.83 |
| Egivenia et al. (2021) | Lyrics Other Features |
Explicitness | RF TF-IDF |
0.96 | 0.99 | 0.94 | 0.96 |
| Bolla et al. (2023) | Lyrics Other Features |
Explicitness | LR, RF, SVM CV, TF-IDF, DBert |
0.87 | 0.81 | 0.73 | 0.77 |
| Rospocher (2021) | Lyrics | Explicitness | LR TF-IDF, BOW, FT |
0.97 | 0.90 | 0.63 | 0.74 |
| Rospocher (2022) | Lyrics | Explicitness | LR TF-IDF, BOW, FT |
ns | 0.89 | 0.69 | 0.78 |
| Akalp et al. (2021) | Lyrics | Genre | BiLSTM, BERT, DBert | 0.71 | mv | mv | mv |
| Li et al. (2022) | Lyrics Other Features |
Genre | CNN BERT |
ns | 0.87 | 0.87 | 0.87 |
| Thompson (2021) | Lyrics Other Features |
Genre | KNN, SVM, RF, NB PCA |
0.56 | mv | mv | mv |
| Mayerl et al. (2020) | Lyrics Other Features |
Genre | NN, RF, SVM, KNN TF-IDF, LDA |
ns | ns | ns | mv |
| Attribute | Source |
| songTitle | Kaggle.com |
| artistName | Kaggle.com |
| Lyrics | Kaggle.com |
| genre | Last.fm API |
| explicit | Spotify API |
| Model | Accuracy | Precision | Recall | F1 Score |
| Baseline (RF.L) | 93.73 | 0.9518 | 0.4413 | 0.6030 |
| Two-stage classification | 82.51 | 0.3764 | 0.9441 | 0.5382 |
| Model | Accuracy | Precision | Recall | F1 Score |
| S.RF.LG | 0.9552 | 0.9840 | 0.9737 | 0.9788 |
| S.SVM.LG | 0.9946 | 0.9641 | 0.9895 | 0.9766 |
| S.SVM.L | 0.9940 | 0.9884 | 0.9553 | 0.9716 |
| S.RF.L | 0.9916 | 0.9882 | 0.9330 | 0.9598 |
| S.MNB.LG | 0.9367 | 0.7014 | 0.7789 | 0.7381 |
| S.MNB.L | 0.9204 | 0.5830 | 0.9218 | 0.7143 |
| SVM.LG | 0.9391 | 0.8450 | 0.5737 | 0.6834 |
| RF.LG | 0.9379 | 0.9485 | 0.4842 | 0.6411 |
| RF.L | 0.9373 | 0.9518 | 0.4413 | 0.6030 |
| SVM.L | 0.9361 | 0.9398 | 0.4358 | 0.5955 |
| MNB.LG | 0.9017 | 0.8857 | 0.1632 | 0.2756 |
| MNB.L | 0.8890 | 0.0000 | 0.0000 | 0.0000 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).