Search | Preprints.org

Search Results

7 articles found

Order by Most Viewed Most Downloaded Newest Relevance

Preprint DATA DESCRIPTOR | doi:10.20944/preprints202206.0246.v1

A Dataset Containing Tweets and Their Meta Data for Understanding Social Media Conversations around Movies during Their Release

Joost Michielsen, Francesco Lelli

Subject: Computer Science And Mathematics, Information Systems Keywords: dataset; twitter; tweets; IMDb ratings; movies; sentiment analysis; NLP

Online: 17 June 2022 (04:39:16 CEST)

Show abstract| Download PDF| Share

Preprint COMMUNICATION | doi:10.20944/preprints202206.0172.v3

MonkeyPox2022Tweets: The First Public Twitter Dataset on the 2022 MonkeyPox Outbreak

Nirmalya Thakur

Subject: Computer Science And Mathematics, Information Systems Keywords: Monkeypox; monkey pox; Twitter; Dataset; Tweets; Social Media; Big Data; Data Mining; Data Science

Online: 25 July 2022 (09:41:19 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202406.0860.v1

Tweets Classification for Digital Epidemiology of Childhood Health Outcomes Using Pre-Trained Language Models

Dasun Athukoralage, Thushari Atapattu, Menasha Thilakaratne, Katrina Falkner

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: digital epidemiology, childhood health, pre-trained language models, ensemble models, natural language processing, tweets classification

Online: 13 June 2024 (09:42:13 CEST)

Show abstract| Download PDF| Share

Preprint DATA DESCRIPTOR | doi:10.20944/preprints202206.0146.v2

A Large-Scale Dataset of Twitter Chatter about Online Learning during the Current COVID-19 Omicron Wave

Nirmalya Thakur

Subject: Computer Science And Mathematics, Information Systems Keywords: COVID-19; COVID; Omicron; online learning; remote learning; online education; Twitter; dataset; Tweets; social media; Big Data

Online: 21 July 2022 (08:05:19 CEST)

Show abstract| Download PDF| Share

Preprint COMMUNICATION | doi:10.20944/preprints202206.0383.v2

Twitter Big Data as A Resource for Exoskeleton Research: A Large-Scale Dataset of about 140,000 Tweets and 100 Research Questions

Nirmalya Thakur

Subject: Computer Science And Mathematics, Information Systems Keywords: Exoskeleton; Twitter; Tweets; Big Data; social media; Data Mining; dataset; Data Science; Natural Language Processing; Information Retrieval

Online: 21 July 2022 (04:06:53 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201808.0269.v1

Sensing Earthquake Disaster Information: A Named Entity Recognition Approach Using Twitter Collaborative Data

Aldo Hernandez-Suarez, Gabriel Sanchez-Perez, Karina Toscano-Medina, Hector Perez-Meana, Jose Portillo-Portillo, Victor Sanchez

Subject: Computer Science And Mathematics, Information Systems Keywords: social sensing; supervised learning; statistical methods; social networks; twitter; tweets; natural disaster; random forest, kernel density estimation

Online: 15 August 2018 (11:34:43 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202205.0238.v2

An Exploratory Study of Tweets about the SARS-CoV-2 Omicron Variant: Insights from Sentiment Analysis, Language Interpretation, Source Tracking, Type Classification, and Embedded URL Detection

Nirmalya Thakur, Chia Y. Han

Subject: Computer Science And Mathematics, Information Systems Keywords: COVID-19; SARS-CoV-2; Omicron; Twitter; tweets; sentiment analysis; big data; Natural Language Processing; Data Science; Data Analysis

Online: 7 July 2022 (08:36:40 CEST)

Show abstract| Download PDF| Share

This paper presents the findings of an exploratory study on the continuously generating Big Data on Twitter related to the sharing of information, news, views, opinions, ideas, knowledge, feedback, and experiences about the COVID-19 pandemic, with a specific focus on the Omicron variant, which is the globally dominant variant of SARS-CoV-2 at this time. A total of 12028 tweets about the Omicron variant were studied, and the specific characteristics of tweets that were analyzed include - sentiment, language, source, type, and embedded URLs. The findings of this study are manifold. First, from sentiment analysis, it was observed that 50.5% of tweets had the ‘neutral’ emotion. The other emotions - ‘bad’, ‘good’, ‘terrible’, and ‘great’ were found in 15.6%, 14.0%, 12.5%, and 7.5% of the tweets, respectively. Second, the findings of language interpretation showed that 65.9% of the tweets were posted in English. It was followed by Spanish or Castillian, French, Italian, Japanese, and other languages, which were found in 10.5%, 5.1%, 3.3%, 2.5%, and <2% of the tweets, respectively. Third, the findings from source tracking showed that “Twitter for Android” was associated with 35.2% of tweets. It was followed by “Twitter Web App”, “Twitter for iPhone”, “Twitter for iPad”, “TweetDeck”, and all other sources that accounted for 29.2%, 25.8%, 3.8%, 1.6%, and <1% of the tweets, respectively. Fourth, studying the type of tweets revealed that retweets accounted for 60.8% of the tweets, it was followed by original tweets and replies that accounted for 19.8% and 19.4% of the tweets, respectively. Fifth, in terms of embedded URL analysis, the most common domains embedded in the tweets were found to be twitter.com, which was followed by biorxiv.org, nature.com, wapo.st, nzherald.co.nz, recvprofits.com, science.org, and other URLs. Finally, to support similar research and development in this field centered around the analysis of tweets, we have developed an open-access Twitter dataset that comprises tweets about the SARS-CoV-2 omicron variant since the first detected case of this variant on November 24, 2021.

We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.