Preprint Data Descriptor Version 1 Preserved in Portico This version is not peer-reviewed

A Dataset Containing Tweets and Their Meta Data for Understanding Social Media Conversations around Movies during Their Release

Version 1 : Received: 16 June 2022 / Approved: 17 June 2022 / Online: 17 June 2022 (04:39:16 CEST)

How to cite: Michielsen, J.; Lelli, F. A Dataset Containing Tweets and Their Meta Data for Understanding Social Media Conversations around Movies during Their Release. Preprints 2022, 2022060246. https://doi.org/10.20944/preprints202206.0246.v1 Michielsen, J.; Lelli, F. A Dataset Containing Tweets and Their Meta Data for Understanding Social Media Conversations around Movies during Their Release. Preprints 2022, 2022060246. https://doi.org/10.20944/preprints202206.0246.v1

Abstract

In this paper we intend to present a dataset that contain a collection of tweets generated as reactions of the release of 50 different movies. The dataset can be used for gaining useful insights regarding the conversation that is generated around a particular movie. It is particularly suitable for conducting sentiment analysis and other NLP techniques. The dataset contains approximately 2.5 million tweets with their related meta data and cover 50 movies. For each movie, its IMDb rating is included. The movies are the 25 releases with the highest number of votes during 2020 and 2021. The collected tweets represent the reactions of the twitter community during the first week of the release date in US of that particular movie. The tweets per movie ranged from 1.000 to approximately 200.000 tweets with an average of 50.000 per release. We used The Internet Archive Wayback Machine in order to retrieve the IMDb movie rating after one week of the US release date. The tweets and related metadata have been collected using the Tweet Downloader tool.

Supplementary and Associated Material

https://doi.org/10.34894/6WEAUR: Doi if the dataset
https://francescolelli.info/: Contact for the dataset

Keywords

dataset; twitter; tweets; IMDb ratings; movies; sentiment analysis; NLP

Subject

Computer Science and Mathematics, Information Systems

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.