PreprintData DescriptorVersion 2Preserved in Portico This version is not peer-reviewed
MonkeyPox2022Tweets: The First Public Twitter Dataset on the 2022 MonkeyPox Outbreak
Nirmalya Thakur
*
Version 1
: Received: 12 June 2022 / Approved: 13 June 2022 / Online: 13 June 2022 (08:23:36 CEST)
Version 2
: Received: 28 June 2022 / Approved: 28 June 2022 / Online: 28 June 2022 (07:53:18 CEST)
Version 3
: Received: 25 July 2022 / Approved: 25 July 2022 / Online: 25 July 2022 (09:41:19 CEST)
Thakur, N. MonkeyPox2022Tweets: A Large-Scale Twitter Dataset on the 2022 Monkeypox Outbreak, Findings from Analysis of Tweets, and Open Research Questions. Infect. Dis. Rep. 2022, 14, 855-883. https://doi.org/10.3390/idr14060087
Thakur, N. MonkeyPox2022Tweets: A Large-Scale Twitter Dataset on the 2022 Monkeypox Outbreak, Findings from Analysis of Tweets, and Open Research Questions. Infect. Dis. Rep. 2022, 14, 855-883. https://doi.org/10.3390/idr14060087
Thakur, N. MonkeyPox2022Tweets: A Large-Scale Twitter Dataset on the 2022 Monkeypox Outbreak, Findings from Analysis of Tweets, and Open Research Questions. Infect. Dis. Rep. 2022, 14, 855-883. https://doi.org/10.3390/idr14060087
Thakur, N. MonkeyPox2022Tweets: A Large-Scale Twitter Dataset on the 2022 Monkeypox Outbreak, Findings from Analysis of Tweets, and Open Research Questions. Infect. Dis. Rep. 2022, 14, 855-883. https://doi.org/10.3390/idr14060087
Abstract
The world is currently facing an outbreak of the monkeypox virus, and confirmed cases have been reported from 28 countries. Following a recent “emergency meeting”, the World Health Organization is considering whether the outbreak should be assessed as a “potential public health emergency of international concern” or PHEIC, as was done for the COVID-19 and Ebola outbreaks in the past. During this time, people from all over the world are using social media platforms, such as Twitter, for information seeking and sharing related to the outbreak, as well as for familiarizing themselves with the guidelines and protocols that are being recommended by various policy-making bodies to reduce the spread of the virus. This is resulting in the generation of tremendous amounts of Big Data related to such paradigms of social media behavior. Mining this Big Data and compiling it in the form of a dataset can serve as a data resource for a wide range of use-cases and applications such as analysis of public opinions, interests, views, perspectives, attitudes, and sentiment towards this outbreak. Therefore, this work presents MonkeyPox2022Tweets, an open-access dataset of Tweets related to the 2022 monkeypox outbreak that were posted on Twitter since the first detected case of this outbreak on May 7, 2022. The dataset is compliant with the privacy policy, developer agreement, and guidelines for content redistribution of Twitter, as well as with the FAIR principles (Findability, Accessibility, Interoperability, and Reusability) principles for scientific data management.
Keywords
Monkeypox; monkey pox; Twitter; Dataset; Tweets; Social Media; Big Data; Data Mining; Data Science
Subject
Computer Science and Mathematics, Information Systems
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Received:
28 June 2022
Commenter:
Nirmalya Thakur
Commenter's Conflict of Interests:
Author
Comment:
The following is an overview of the changes in the most recent version of the paper: 1. The associated dataset now contains more than 100,000 Tweet IDs 2. The dataset has been updated to comprise Tweet IDs of relevant tweets posted up to June 26, 2022 3. The link to the most recent version of the dataset has been included in the paper
Commenter: Nirmalya Thakur
Commenter's Conflict of Interests: Author
1. The associated dataset now contains more than 100,000 Tweet IDs
2. The dataset has been updated to comprise Tweet IDs of relevant tweets posted up to June 26, 2022
3. The link to the most recent version of the dataset has been included in the paper