Preprint Communication Version 3 Preserved in Portico This version is not peer-reviewed

MonkeyPox2022Tweets: The First Public Twitter Dataset on the 2022 MonkeyPox Outbreak

Version 1 : Received: 12 June 2022 / Approved: 13 June 2022 / Online: 13 June 2022 (08:23:36 CEST)
Version 2 : Received: 28 June 2022 / Approved: 28 June 2022 / Online: 28 June 2022 (07:53:18 CEST)
Version 3 : Received: 25 July 2022 / Approved: 25 July 2022 / Online: 25 July 2022 (09:41:19 CEST)

A peer-reviewed article of this Preprint also exists.

Thakur, N. MonkeyPox2022Tweets: A Large-Scale Twitter Dataset on the 2022 Monkeypox Outbreak, Findings from Analysis of Tweets, and Open Research Questions. Infect. Dis. Rep. 2022, 14, 855-883. https://doi.org/10.3390/idr14060087 Thakur, N. MonkeyPox2022Tweets: A Large-Scale Twitter Dataset on the 2022 Monkeypox Outbreak, Findings from Analysis of Tweets, and Open Research Questions. Infect. Dis. Rep. 2022, 14, 855-883. https://doi.org/10.3390/idr14060087

Abstract

The world is currently facing an outbreak of the monkeypox virus and confirmed cases have been reported from 74 countries. Following a recent “emergency meeting”, the World Health Organization just declared monkeypox a global health emergency. As a result, people from all over the world are using social media platforms, such as Twitter, for information seeking and sharing related to the outbreak, as well as for familiarizing themselves with the guidelines and protocols that are being recommended by various policy-making bodies to reduce the spread of the virus. This is resulting in the generation of tremendous amounts of Big Data related to such paradigms of social media behavior. Mining this Big Data and compiling it in the form of a dataset can serve a wide range of use-cases and applications such as analysis of public opinions, interests, views, perspectives, attitudes, and sentiment towards this outbreak. Therefore, this work presents MonkeyPox2022Tweets, an open-access dataset of more than 255,000 Tweets related to the 2022 monkeypox outbreak that were posted on Twitter since the first detected case of this outbreak on May 7, 2022. The dataset is compliant with the privacy policy, developer agreement, and guidelines for content redistribution of Twitter, as well as with the FAIR principles (Findability, Accessibility, Interoperability, and Reusability) principles for scientific data management.

Keywords

Monkeypox; monkey pox; Twitter; Dataset; Tweets; Social Media; Big Data; Data Mining; Data Science

Subject

Computer Science and Mathematics, Information Systems

Comments (1)

Comment 1
Received: 25 July 2022
Commenter: Nirmalya Thakur
Commenter's Conflict of Interests: Author
Comment: The following is an overview of the changes in the most recent version of the paper:
1. The associated dataset now contains more than 255,000 Tweet IDs 
2. The dataset has been updated to comprise Tweet IDs of relevant tweets posted up to July 23, 2022
3. The link to the most recent version of the dataset has been included in the paper
4. A new section – Section 3.3 has been added that comprises several research questions that may be investigated using this dataset
+ Respond to this comment

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 1
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.