Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

GenCo: A Generative Learning Model for Heterogeneous Text Classification Based on Collaborative Partial Classifications

Version 1 : Received: 30 May 2023 / Approved: 1 June 2023 / Online: 5 June 2023 (02:57:36 CEST)

A peer-reviewed article of this Preprint also exists.

Ekolle, Z.E.; Kohno, R. GenCo: A Generative Learning Model for Heterogeneous Text Classification Based on Collaborative Partial Classifications. Appl. Sci. 2023, 13, 8211. Ekolle, Z.E.; Kohno, R. GenCo: A Generative Learning Model for Heterogeneous Text Classification Based on Collaborative Partial Classifications. Appl. Sci. 2023, 13, 8211.

Abstract

The use of artificial intelligence in natural language processing (NLP) has significantly contributed to the advancement of natural language applications such as sentimental analysis, topic modeling, text classification, chatbots, and spam filtering. With a large amount of text generated each day from different sources such as webpages, blogs, emails, social media, and articles, one of the most common tasks in natural language processing is the classification of a text corpus. This is important in many institutions for planning, decision-making, and archives of their projects. Many algorithms exist to automate text classification operations but the most intriguing of them is that which also learns these operations automatically. In this study, we present a new model to infer and learn from data using probabilistic logic and apply it to text classification. This model, called GenCo, is a multi-input single-output (MISO) learning model that uses a collaboration of partial classifications to generate the desired output. It provides a heterogeneity measure to explain its classification results and enables the reduction of the curse of dimensionality in text classification. The classification results are compared with those of conventional text classification models, and it shows that our proposed model has a higher classification performance than conventional models.

Keywords

Natural language processing; text classification; probabilistic models; machine learning; generative learning; collaborative learning; explainable AI

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.