Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

FutureCite: Predicting Research Articles Impact using Machine Learning and Text and Graph Mining Techniques

Version 1 : Received: 27 April 2024 / Approved: 28 April 2024 / Online: 29 April 2024 (04:22:54 CEST)

How to cite: Thafar, M. A.; Alsulami, M. M.; Albarade, S. FutureCite: Predicting Research Articles Impact using Machine Learning and Text and Graph Mining Techniques. Preprints 2024, 2024041854. https://doi.org/10.20944/preprints202404.1854.v1 Thafar, M. A.; Alsulami, M. M.; Albarade, S. FutureCite: Predicting Research Articles Impact using Machine Learning and Text and Graph Mining Techniques. Preprints 2024, 2024041854. https://doi.org/10.20944/preprints202404.1854.v1

Abstract

The growth rate in academic and scientific publications increased very fast. Researchers must choose a good representative and significant literature in publications, which has become challenging worldwide. Usually, the paper citation number indicates this paper's potential influence and importance. However, this standard metric of citation numbers cannot be used as a measurement to judge the popularity and significance of the recently published papers. To address this challenge, this paper presents an effective prediction method called FutureCite to predict the future citation level of research articles. FutureCite combines machine learning with text and graph mining techniques, leveraging their abilities in classification, datasets in-depth analysis, and feature extraction. FutureCite aims to predict future citation levels of research articles applying a multilabel classification approach. FutureCite can extract significant semantic features and capture the interconnection relationships found in scientific articles during feature extraction using textual content, citation networks, and metadata as feature resources. Our goal is to contribute to the advancement of substantial effective approaches impacting the citation counts in scientific publications by enhancing the precision of future citations. We conducted several experiments using a comprehensive publication dataset to evaluate our method and determine the impact of using a variety of machine learning algorithms. FutureCite demonstrated its robustness and efficiency and showed promising results based on different evaluation metrics. Using the FutureCite model has significant implications in improving the researchers' ability to determine targeted literature for their research and better understand the potential impact of research publications.

Keywords

 citation prediction; machine learning; data mining; graph mining; feature extraction; multilabel classification; pagerank; betweenness centrality 

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.