Preprint Article, Version 2. Preserved in Portico. This version is not peer-reviewed.

Multilingual Ranking of Wikipedia Articles with Quality and Popularity Assessment in Different Topics

Version 1 : Received: 10 May 2019 / Approved: 13 May 2019 / Online: 13 May 2019 (08:10:17 CEST)
Version 2 : Received: 2 August 2019 / Approved: 5 August 2019 / Online: 5 August 2019 (12:26:34 CEST)

A peer-reviewed article of this Preprint also exists.

Lewoniewski, W.; Węcel, K.; Abramowicz, W. Multilingual Ranking of Wikipedia Articles with Quality and Popularity Assessment in Different Topics. Computers 2019, 8, 60.

Abstract

In Wikipedia, articles about various topics can be created and edited independently in each language version. Therefore, the quality of information about the same topic depends on the language. Any interested user can improve an article, and that improvement may depend on the popularity of the article. The goal of this study is to show which topics are best represented in different language versions of Wikipedia, using the results of quality assessment for over 39 million articles in 55 languages. In this paper, we also analyze how popular selected topics are among readers and authors in various languages. We used two approaches to assign articles to topics. In the first, we selected 27 main multilingual categories and analyzed all their connections with sub-categories, based on information extracted from over 10 million categories in 55 language versions. To classify each article into one of the 27 main categories, we took into account over 400 million links from articles to over 10 million categories and over 26 million links between categories. In the second approach, we used data from DBpedia and Wikidata. We also show how the results of the study can be used to build local and global rankings of Wikipedia content.

Supplementary and Associated Material

http://data.lewoniewski.info/computers/vn1/: Article coverage between language versions of Wikipedia (over 150 thousand interactive combinations of Venn diagrams online)
http://data.lewoniewski.info/computers/vn2/: Coverage of articles between selected main topics in Wikipedia (over a million interactive combinations of Venn diagrams)
https://wikirank.net: Quality and popularity assessment of Wikipedia articles

Keywords

Wikipedia; Information quality; Popularity; Topics identification; Wikidata; DBpedia; WikiRank

Subject

Computer Science and Mathematics, Information Systems

Comments (1)

Comment 1
Received: 5 August 2019
Commenter: Włodzimierz Lewoniewski
Commenter's Conflict of Interests: Author
Comment: We changed the structure of the paper and added or edited some of its content (including figures). In the new version, the Introduction is followed by the section Topic Classifications of Wikipedia Articles. Next, we describe the current and proposed approaches to measuring quality and popularity. After that, we show the results of the article assessment based on the proposed methods. Finally, we expanded the Conclusion section.