Submitted:
12 March 2024
Posted:
13 March 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. What Is AI and Should We Be Afraid?
1.2. Fear of AI around Us
1.3. Are News Headlines Consequential?
1.4. News Impact—Emotions and Sensationalism
1.5. Motivation and Objectives
2. Literature Review
2.1. AI in News Headlines
2.2. Impact of News Media on Public Perception
2.3. Sentiment Analysis of News Headlines
2.4. Other NLP Methods
2.5. Machine Learning and LLMs
3. Data
3.1. Data Collection Methods
3.2. Data Processing and Preparation
4. Methodology
4.1. Exploratory Data Analysis
4.1.1. Temporal Distribution of Articles
4.1.2. Linguistic and Geographic Features
4.1.3. News Headline Textual Analysis
4.1.4. Publication Sources
4.1.5. N-Grams Analysis for Full Data
4.2. Sentiment Classification
4.2.1. Sentiment Analysis
- TextBlob: is a library that provides a simple API for diving into common NLP tasks such as part-of-speech tagging, noun phrase extraction, and sentiment analysis. The sentiment function of TextBlob [73] returns a polarity score within the range of -1 to 1, where -1 indicates a negative sentiment, 1 indicates a positive sentiment, and scores around 0 indicate neutrality.
- VADER, on the other hand, is a lexicon and rule-based sentiment analysis tool specifically attuned to sentiments expressed in social media. It uses a combination of a sentiment lexicon that is human- and machine-curated and considers factors such as intensity and context. VADER’s compound score, which we used, is a normalized, weighted composite score that also ranges from -1 (most extreme negative) to +1 (most extreme positive).
- Afinn sentiment analysis tool assigns scores to words based on a predefined list where scores range from -5 to +5, with negative scores indicating negative sentiment and positive scores indicating positive sentiment.
- FLAIR We used a pretrained model, FlairNLP, a comprehensive NLP framework. This model leverages sequence labeling in order to detect either positive or negative sentiment in a given text. Table 5 Showcases the results obtained using this model.
4.3. Large Language Models for Topic Modeling
4.3.1. Topic Modeling with BERT
4.3.2. Topic Modeling with Llama 2
- Model Optimization: Given the limitation of our hardware, we employed model optimization techniques to facilitate the execution of the 13 billion parameter Llama 2[2] model. The principal optimization technique was 4-bit quantization, significantly reducing the memory footprint by condensing the 64-bit representation to a 4-bit one. This approach is not only efficient but also maintains the model’s performance integrity for topic modeling tasks.
-
Prompt Engineering for Llama 2: To effectively utilize Llama 2 for topic labeling, we designed a structured prompt template, incorporating both system and user prompts. The system prompt positions Llama 2[2] as a specialized assistant for topic labeling, providing a consistent contextual foundation for all interactions. The user prompt, however, is more dynamic, consisting of an example prompt to demonstrate the desired output and a main prompt that includes placeholders for documents and keywords specific to each topic. This design facilitates the generation of concise topic labels, optimizing Llama 2’s output for our topic modeling objectives. Within our prompt, two specific tags from BERTopic[76] are of critical importance:[DOCUMENTS]: This tag encompasses the five most pertinent documents related to the topic.[KEYWORDS]: This tag includes the ten most crucial keywords associated with the topic, identified via c-TF-IDF. This format is designed to be populated with the relevant information for each topic under investigation.
-
Implementation with BERTopic: The integration with BERTopic[76] involves a two-step process. Initially, BERTopic[76] generates topics and their corresponding clusters using documents from our dataset. These topics, characterized by their most relevant documents and keywords identified through c-TF-IDF, serve as the input for Llama 2[2], following our prompt template. The template is populated with the top 5 most relevant documents and the top 10 keywords for each topic, guiding Llama 2[2] to generate a short, precise label for the topic.To enrich our topic representations, we incorporated additional models alongside Llama 2[2]. Specifically, we utilized c-TF-IDF for the primary representation and supplemented it with KeyBERT[82], MMR (Maximal Marginal Relevance), and Llama 2[2] for multi-faceted topic insights. This approach allows for a comprehensive view of each topic from multiple analytical perspectives. With the models and methodologies in place, we proceeded to train our topic model by supplying BERTopic[76] with the designated sub-models. The training process involved fitting the model to our dataset and transforming the data to extract topics. Through careful optimization and prompt engineering, we achieved an efficient and effective topic modeling process, suitable for environments with limited computational resources.
4.4. AI Fear Classification with LLMs
4.4.1. Fear Classification with DistilBert
4.4.2. Fear Classification with Llama 2
4.4.3. Fear Classification with Mistral
4.5. Machine Learning
4.5.1. Data Preparation
4.5.2. Problem Statement
4.5.3. Modeling Process
4.5.4. Logistic Regression
4.5.5. Support Vector Classifier and Gaussian Naive Bayes
4.5.6. Bagging and Boosting
5. Results
5.1. Sentiment Analysis
| VADER | AFINN | TextBlob | |
|---|---|---|---|
| Mean | 0.175 | 0.346 | -0.040 |
| Max | 0.949 | 14 | 1 |
| Min | -0.944 | -11 | -1 |
| Variance | 0.110 | 2.640 | 0.083 |
| Sentiment | Count | Percentage |
|---|---|---|
| Positive | 40425 | 60.25 |
| Negative | 26666 | 39.74 |
5.2. Results—Topic Modeling with BERTopic
5.3. Results—Topic Modeling with Llama 2
- AI’s Impending Dangers : This category includes topics that highlight potential risks and challenges posed by AI technology. Examples include "Emerging risks of AI technology," "Generative AI Risks in 2023," and "AI-generated child sexual abuse content."
- AI Advancements: Topics under this category are primarily focused on providing insights, explanations, and informative perspectives on various aspects of AI. Examples include "AI technology competition," "Impact of Artificial Intelligence on Education," and "Advancements in AI-assisted image and video editing."
- Negative Capabilities of AI: This category encompasses topics that discuss the adverse impacts or capabilities of AI and ChatGPT, shedding light on concerns such as bias, discrimination, and privacy issues. Examples are more nuanced in this category but could include discussions around "Bias and discrimination in AI systems" and "Privacy and security risks in AI-driven data management."
- Positive Capabilities of AI: Conversely, this category highlights the beneficial aspects and positive applications of AI and ChatGPT. Topics such as "Artificial Intelligence in Healthcare," "AI in Music Industry," and "Using AI to combat wildfires in California" exemplify the positive impact AI can have across different sectors.
- Experimental Reporting for AI: For the category of Experimental Reporting, this encompasses cutting-edge explorations and innovative uses of AI that are at the forefront of technology and research. Examples include the application of AI in predicting natural disasters with greater accuracy and timeliness, such as using machine learning algorithms to forecast earthquakes or volcanic eruptions. Another example is the use of AI in environmental conservation, like deploying AI-driven drones for monitoring wildlife populations or analyzing satellite imagery to track deforestation. Additionally, experimental applications in digital biology and genome editing highlight AI’s role in advancing medical science, such as using AI to decipher complex genetic codes or to personalize medicine by predicting an individual’s response to certain treatments.
5.4. Results—Fear Classification with LLMs
5.4.1. Fear Classification with DistilBERT
5.4.2. Fear Classification with Llama 2
- Class 0 (Not Fear-Inducing): Llama 2’s performance in identifying headlines that do not induce fear is marked by a precision of 0.7627, suggesting that when it classifies a headline as not fear-inducing, it is correct around 76% of the time. The recall rate of 0.9426 indicates that the model is highly effective, identifying approximately 94% of all not fear-inducing headlines in the dataset. The combination of these metrics leads to an F1 score of 0.8432, reflecting a strong balance between precision and recall for this class. This high recall rate is particularly significant, as it demonstrates Llama 2’s ability to conservatively identify content that is unlikely to cause fear, ensuring a cautious approach in marking headlines as fear-inducing.
- Class 1 (Fear-Inducing): For headlines that are classified as fear-inducing, Llama 2 shows a precision of 0.9261, meaning that it has a high likelihood of correctly identifying genuine instances of fear-inducing content. The recall of 0.7102, though lower than for Class 0, signifies that the model successfully captures a substantial proportion of fear-inducing headlines. However, there is room for improvement in recognizing every such instance within the dataset. The resulting F1 score of 0.8039 for Class 1 illustrates a robust performance, although the challenge remains to enhance recall without sacrificing the model’s high precision.
5.4.3. Fear Classification with Mistral
5.5. Results—Fear Classification with Machine Learning
- Precision is the ratio of correctly predicted fear-inducing headlines to the total predicted as fear-inducing. It is calculated as:
- Recall (or Sensitivity) measures the proportion of actual fear-inducing headlines that are correctly identified. It is calculated as:
- F1 Score provides a balance between Precision and Recall, offering a single metric to assess the model’s accuracy. It is especially useful in cases where the class distribution is imbalanced. The F1 Score is calculated as:
- Accuracy measures the overall correctness of the model, calculated by dividing the sum of true positives and true negatives by the total number of cases:
6. Discussion and Future Research
6.1. Limitations
6.2. Future Opportunities
7. Recommendations
8. Conclusion
References
- Albert Q Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lucile Saulnier, et al. Mistral 7b. arXiv preprint arXiv:2310.06825, 2023. [CrossRef]
- Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, et al. Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288, 2023. [CrossRef]
- Jim Samuel, Abhishek Tripathi, and Ensela Mema. A new era of artificial intelligence begins–where will it lead us? Journal of Big Data and Artificial Intelligence, 2(1), 2024. [CrossRef]
- VASILE GHERHEȘ. Why are we afraid of artificial intelligence (ai). European Review Of Applied Sociology, 11(17):6–15, 2018.
- Jungmin Kim. Fear of artificial intelligence on people’s attitudinal & behavioral attributes: An exploratory analysis of ai phobia. GSJ, 7(10), 2019.
- David L Altheide. The news media, the problem frame, and the production of fear. The sociological quarterly, 38(4):647–668, 1997.
- David L Altheide. Creating fear: News and the construction of crisis. Routledge, 2018.
- Frank Furedi. Culture of fear revisited. A&C Black, 2006.
- Jim Samuel. A call for proactive policies for informatics and artificial intelligence technologies. Scholars Strategy Network, 2021.
- Jim Samuel. The critical need for transparency and regulation amidst the rise of powerful artificial intelligence models. Scholars Strategy Network (SSN) Key Findings. Accessed November, 27:2023, 2023.
- Jim Samuel, Margaret Brennan-Tonetta, Yana Samuel, Pradeep Subedi, and Jack Smith. Strategies for democratization of supercomputing: Availability, accessibility and usability of high performance computing for education and practice of big data analytics. Journal of Big Data Theory and Practice, 2021. [CrossRef]
- Katarzyna Molek-Kozakowska. Towards a pragma-linguistic framework for the study of sensationalism in news headlines. Discourse & Communication, 7(2):173–197, 2013. [CrossRef]
- Hye-Jin Paek, Sang-Hwa Oh, and Thomas Hove. How fear-arousing news messages affect risk perceptions and intention to talk about risk. Health communication, 31(9):1051–1062, 2016. [CrossRef]
- Rachel C Adams, Petroc Sumner, Solveiga Vivian-Griffiths, Amy Barrington, Andrew Williams, Jacky Boivin, Christopher D Chambers, and Lewis Bott. How readers understand causal and correlational expressions used in news headlines. Journal of experimental psychology: applied, 23(1):1, 2017. [CrossRef]
- Celine Klemm, Tilo Hartmann, and Enny Das. Fear-mongering or fact-driven? illuminating the interplay of objective risk and emotion-evoking form in the response to epidemic news. Health communication, 34(1):74–83, 2019. [CrossRef]
- Valerie Hase and Katherine M Engelke. Emotions in crisis coverage: How uk news media used fear appeals to report on the coronavirus crisis. Journalism and Media, 3(4):633–649, 2022. [CrossRef]
- Sushree Panigrahi and Jeet Singh. Deadly combination of fake news and social media. Rajiv Gandhi Institute for Contemporary Studies, 4, 2017.
- Yuwei Chuai and Jichang Zhao. Anger makes fake news viral online. arXiv preprint arXiv:2004.10399, 2020. [CrossRef]
- Natascha de Hoog and Peter Verboon. Is the news making us unhappy? the influence of daily news exposure on emotional states. British Journal of Psychology, 111(2):157–173, 2020. [CrossRef]
- Stuart WG Derbyshire. Culture of fear: Risk taking and the morality of low expectation. BMJ, 315(7111):823, 1997.
- Junesoo Lee and Jaehyuk Park. Ai as “another i”: Journey map of working with artificial intelligence from ai-phobia to ai-preparedness. Organizational Dynamics, 52(3):100994, 2023. [CrossRef]
- Lucía Martín Holguín. Communicating artificial intelligence through newspapers: Where is the real danger, 2018.
- Hannes Cools, Baldwin Van Gorp, and Michael Opgenhaffen. Where exactly between utopia and dystopia? a framing analysis of ai and automation in us newspapers. Journalism, 25(1):3–21, 2024. [CrossRef]
- Dennis Nguyen and Erik Hekman. The news framing of artificial intelligence: a critical exploration of how media discourses make sense of automation. AI & SOCIETY, pages 1–15, 2022. [CrossRef]
- Shaojing Sun, Yujia Zhai, Bin Shen, and Yibei Chen. Newspaper coverage of artificial intelligence: A perspective of emerging technologies. Telematics and Informatics, 53:101433, 2020. [CrossRef]
- Ching-Hua Chuan, Wan-Hsiu Sunny Tsai, and Su Yeon Cho. Framing artificial intelligence in american newspapers. In Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, pages 339–344, 2019. [CrossRef]
- Albert C Gunther. The persuasive press inference: Effects of mass media on perceived public opinion. Communication Research, 25(5):486–504, 1998. [CrossRef]
- Diana C Mutz and Joe Soss. Reading public opinion: The influence of news coverage on perceptions of public sentiment. Public Opinion Quarterly, pages 431–451, 1997.
- David Giles and Rachel L Shaw. The psychology of news influence and the development of media framing analysis. Social and personality psychology compass, 3(4):375–393, 2009. [CrossRef]
- Jesper Strömbäck, Yariv Tsfati, Hajo Boomgaarden, Alyt Damstra, Elina Lindgren, Rens Vliegenthart, and Torun Lindholm. News media trust and its impact on media use: Toward a framework for future research. Annals of the International Communication Association, 44(2):139–156, 2020. [CrossRef]
- Wen Yang, Dongtong Lin, and Zelong Yi. Impacts of the mass media effect on investor sentiment. Finance Research Letters, 22:1–4, 2017. [CrossRef]
- Qing Li, TieJun Wang, Ping Li, Ling Liu, Qixu Gong, and Yuanzhu Chen. The effect of news and public mood on stock movements. Information Sciences, 278:826–840, 2014. [CrossRef]
- Md Mokhlesur Rahman, GG Md Nawaz Ali, Xue Jun Li, Jim Samuel, Kamal Chandra Paul, Peter HJ Chong, and Michael Yakubov. Socioeconomic factors analysis for covid-19 us reopening sentiment with twitter and census data. Heliyon, 7(2):e06200, 2021. [CrossRef]
- GG Ali, Md Mokhlesur Rahman, Amjad Hossain, Shahinoor Rahman, Kamal Chandra Paul, Jean-Claude Thill, Jim Samuel, et al. Public perceptions about covid-19 vaccines: Policy implications from us spatiotemporal sentiment analytics. Available at SSRN 3849138, 2021. [CrossRef]
- Myles D Garvey, Jim Samuel, and Alexander Pelaez. Would you please like my tweet?! an artificially intelligent, generative probabilistic, and econometric based system design for popularity-driven tweet content generation. Decision Support Systems, 144:113497, 2021. [CrossRef]
- Jim Samuel, GG Ali, Md Rahman, Ek Esawi, Yana Samuel, et al. Covid-19 public sentiment insights and machine learning for tweets classification. Information, 11(6):314, 2020a. [CrossRef]
- Jim Samuel, Md Mokhlesur Rahman, GG Md Nawaz Ali, Yana Samuel, Alexander Pelaez, Peter Han Joo Chong, and Michael Yakubov. Feeling positive about reopening? new normal scenarios from covid-19 us reopen sentiment analytics. IEEE Access, 8:142173–142190, 2020b. [CrossRef]
- Maite Taboada, Julian Brooke, Milan Tofiloski, Kimberly Voll, and Manfred Stede. Lexicon-based methods for sentiment analysis. Computational linguistics, 37(2):267–307, 2011. [CrossRef]
- Antony Samuels and John Mcgonical. News sentiment analysis. arXiv preprint arXiv:2007.02238, 2020.
- Yanghui Rao, Jingsheng Lei, Liu Wenyin, Qing Li, and Mingliang Chen. Building emotional dictionary for sentiment analysis of online news. World Wide Web, 17:723–742, 2014. [CrossRef]
- Lars Buitinck and et al. API design for machine learning software: experiences from the scikit-learn project. In ECML PKDD Workshop: Languages for Data Mining and Machine Learning, pages 108–122, 2013. [CrossRef]
- Rajkumar S Jagdale, Vishal S Shirsat, and Sachin N Deshmukh. Sentiment analysis on product reviews using machine learning techniques. In Cognitive Informatics and Soft Computing: Proceeding of CISC 2017, pages 639–647. Springer, 2019.
- MS Neethu and R Rajasree. Sentiment analysis in twitter using machine learning techniques. In 2013 fourth international conference on computing, communications and networking technologies (ICCCNT), pages 1–5. IEEE, 2013.
- Anurag Singh and Goonjan Jain. Sentiment analysis of news headlines using simple transformers. In 2021 Asian Conference on Innovation in Technology (ASIANCON), pages 1–6. IEEE, 2021. [CrossRef]
- David Rozado, Ruth Hughes, and Jamin Halberstadt. Longitudinal analysis of sentiment and emotion in news media headlines using automated labelling with transformer language models. Plos one, 17(10):e0276367, 2022. [CrossRef]
- László Nemes and Attila Kiss. Prediction of stock values changes using sentiment analysis of stock news headlines. Journal of Information and Telecommunication, 5(3):375–394, 2021. [CrossRef]
- Faheem Aslam, Tahir Mumtaz Awan, Jabir Hussain Syed, Aisha Kashif, and Mahwish Parveen. Sentiments and emotions evoked by news headlines of coronavirus disease (covid-19) outbreak. Humanities and Social Sciences Communications, 7(1), 2020. [CrossRef]
- Wafaa S El-Kassas, Cherif R Salama, Ahmed A Rafea, and Hoda K Mohamed. Automatic text summarization: A comprehensive survey. Expert systems with applications, 165:113679, 2021. [CrossRef]
- Dima Suleiman and Arafat A Awajan. Deep learning based extractive text summarization: approaches, datasets and evaluation measures. In 2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS), pages 204–210. IEEE, 2019. [CrossRef]
- Robiert Sepúlveda-Torres, Marta Vicente, Estela Saquete, Elena Lloret, and Manuel Palomar. Headlinestancechecker: Exploiting summarization to detect headline disinformation. Journal of Web Semantics, 71:100660, 2021. [CrossRef]
- Ike Vayansky and Sathish AP Kumar. A review of topic modeling methods. Information Systems, 94:101582, 2020. [CrossRef]
- Zara Nasar, Syed Waqar Jaffry, and Muhammad Kamran Malik. Named entity recognition and relation extraction: State-of-the-art. ACM Computing Surveys (CSUR), 54(1):1–39, 2021. [CrossRef]
- Jim Samuel, Gavin Rozzi, and Ratnakar Palle. The dark side of sentiment analysis: An exploratory review using lexicons, dictionaries, and a statistical monkey and chimp. Dictionaries, and a Statistical Monkey and Chimp (January 6, 2022), 2022a.
- Nourah Alswaidan and Mohamed El Bachir Menai. A survey of state-of-the-art approaches for emotion recognition in text. Knowledge and Information Systems, 62:2937–2987, 2020. [CrossRef]
- Huimin Xu, Man Lan, and Yuanbin Wu. Ecnu at semeval-2018 task 1: Emotion intensity prediction using effective features and machine learning models. In Proceedings of the 12th international workshop on semantic evaluation, pages 231–235, 2018.
- Lovejit Singh, Sarbjeet Singh, and Naveen Aggarwal. Two-stage text feature selection method for human emotion recognition. In Proceedings of 2nd International Conference on Communication, Computing and Networking: ICCCN 2018, NITTTR Chandigarh, India, pages 531–538. Springer, 2018.
- Zarrin Tasnim Sworna, Zahra Mousavi, and Muhammad Ali Babar. Nlp methods in host-based intrusion detection systems: A systematic review and future directions. Journal of Network and Computer Applications, page 103761, 2023. [CrossRef]
- Basemah Alshemali and Jugal Kalita. Improving the reliability of deep neural networks in nlp: A review. Knowledge-Based Systems, 191:105210, 2020. [CrossRef]
- Kian Long Tan, Chin Poo Lee, Kalaiarasi Sonai Muthu Anbananthen, and Kian Ming Lim. Roberta-lstm: a hybrid model for sentiment analysis with transformer and recurrent neural network. IEEE Access, 10:21517–21525, 2022. [CrossRef]
- Seunghak Yu, Sathish Reddy Indurthi, Seohyun Back, and Haejun Lee. A multi-stage memory augmented neural network for machine reading comprehension. In Proceedings of the workshop on machine reading for question answering, pages 21–30, 2018. [CrossRef]
- Diksha Khurana, Aditya Koli, Kiran Khatter, and Sukhdev Singh. Natural language processing: State of the art, current trends and challenges. Multimedia tools and applications, 82(3):3713–3744, 2023. [CrossRef]
- Jingfeng Yang, Hongye Jin, Ruixiang Tang, Xiaotian Han, Qizhang Feng, Haoming Jiang, Shaochen Zhong, Bing Yin, and Xia Hu. Harnessing the power of llms in practice: A survey on chatgpt and beyond. ACM Transactions on Knowledge Discovery from Data, 2023. [CrossRef]
- Surendrabikram Thapa, Usman Naseem, and Mehwish Nasim. From humans to machines: can chatgpt-like llms effectively replace human annotators in nlp tasks. In Workshop Proceedings of the 17th International AAAI Conference on Web and Social Media, 2023. [CrossRef]
- Akshay Goel, Almog Gueta, Omry Gilon, Chang Liu, Sofia Erell, Lan Huong Nguyen, Xiaohong Hao, Bolous Jaber, Shashir Reddy, Rupesh Kartha, et al. Llms accelerate annotation for medical information extraction. In Machine Learning for Health (ML4H), pages 82–100. PMLR, 2023. [CrossRef]
- Xinyi Wang, Wanrong Zhu, Michael Saxon, Mark Steyvers, and William Yang Wang. Large language models are implicitly topic models: Explaining and finding good demonstrations for in-context learning. In Workshop on Efficient Systems for Foundation Models@ ICML2023, 2023. [CrossRef]
- Yasir Gamieldien. Innovating the study of self-regulated learning: An exploration through nlp, generative ai, and llms. 2023.
- Lei Li, Yongfeng Zhang, and Li Chen. Prompt distillation for efficient llm-based recommendation. In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, pages 1348–1357, 2023. [CrossRef]
- Googlenewsrss. https://news.google.com/, 2024.
- Matthew Honnibal and Ines Montani. spaCy 2: Natural language understanding with Bloom embeddings, convolutional neural networks and incremental parsing. To appear, 2017.
- Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. Language models are few-shot learners. Advances in neural information processing systems, 33:1877–1901, 2020. [CrossRef]
- Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, et al. Scikit-learn: Machine learning in python. Journal of machine learning research, 12(Oct):2825–2830, 2011.
- Clayton Hutto and Eric Gilbert. Vader: A parsimonious rule-based model for sentiment analysis of social media text. In Proceedings of the international AAAI conference on web and social media, volume 8, pages 216–225, 2014. [CrossRef]
- Steven Loria et al. textblob documentation. Release 0.15, 2(8):269, 2018.
- Finn Årup Nielsen. A new ANEW: evaluation of a word list for sentiment analysis in microblogs. In Matthew Rowe, Milan Stankovic, Aba-Sah Dadzie, and Mariann Hardey, editors, Proceedings of the ESWC2011 Workshop, volume 718, pages 93–98, 2011. URL http://ceur-ws.org/Vol-718/paper_16.pdf.
- Alan Akbik, Duncan Blythe, and Roland Vollgraf. Contextual string embeddings for sequence labeling. In COLING 2018, 27th International Conference on Computational Linguistics, pages 1638–1649, 2018.
- Maarten Grootendorst. Bertopic: Neural topic modeling with a class-based tf-idf procedure. arXiv preprint arXiv:2203.05794, 2022. [CrossRef]
- Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805, 2018. [CrossRef]
- David M Blei, Andrew Y Ng, and Michael I Jordan. Latent dirichlet allocation. Journal of machine Learning research, 3(Jan):993–1022, 2003.
- Dimo Angelov. Top2vec: Distributed representations of topics. arXiv preprint arXiv:2008.09470, 2020. [CrossRef]
- Leland McInnes, John Healy, Steve Astels, et al. hdbscan: Hierarchical density based clustering. J. Open Source Softw., 2(11):205, 2017. [CrossRef]
- Leland McInnes, John Healy, and James Melville. Umap: Uniform manifold approximation and projection for dimension reduction. arXiv preprint arXiv:1802.03426, 2018.
- Maarten Grootendorst. Keybert: Minimal keyword extraction with bert., 2020. URL . [CrossRef]
- Victor Sanh, Lysandre Debut, Julien Chaumond, and Thomas Wolf. Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108, 2019. [CrossRef]
- Falconsai. Finetuned distilbert for fear mongering detection. https://huggingface.co/Falconsai/fear_mongering_detection, 2023. Accessed: 2024-03-11.
- Thomas Wolf, Lysandre Debut, Victor Sanh, Julien Chaumond, Clement Delangue, Anthony Moi, Pierric Cistac, Tim Rault, Rémi Louf, Morgan Funtowicz, et al. Huggingface’s transformers: State-of-the-art natural language processing. arXiv preprint arXiv:1910.03771, 2019. [CrossRef]
- Tim Dettmers, Mike Lewis, Younes Belkada, and Luke Zettlemoyer. Llm.int8(): 8-bit matrix multiplication for transformers at scale. arXiv preprint arXiv:2208.07339, 2022. [CrossRef]
- David R Cox. The regression analysis of binary sequences. Journal of the Royal Statistical Society: Series B (Methodological), 20(2):215–232, 1958.
- Corinna Cortes and Vladimir Vapnik. Support-vector networks. Machine learning, 20(3):273–297, 1995. [CrossRef]
- Tianqi Chen and Carlos Guestrin. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16, pages 785–794, New York, NY, USA, 2016. ACM. ISBN 978-1-4503-4232-2. doi:10.1145/2939672.2939785. URL. [CrossRef]
- Tin Kam Ho. Random decision forests. In Proceedings of 3rd international conference on document analysis and recognition, volume 1, pages 278–282. IEEE, 1995. [CrossRef]
- Karl Pearson F.R.S. Liii. on lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, 2(11):559–572, 1901. [CrossRef]
- Peter I Frazier. A tutorial on bayesian optimization. arXiv preprint arXiv:1807.02811, 2018. [CrossRef]
- Nils Reimers and Iryna Gurevych. Sentence-BERT: Sentence embeddings using Siamese BERT-networks. In Proceedings of the 2019 Conference on EMNLP-IJCNLP, pages 3982–3992, Hong Kong, China, November 2019. ACM. [CrossRef]
- Richard Anderson, Carmela Scala, Jim Samuel, Vivek Kumar, and Parth Jain. Are emotions conveyed across machine translations? establishing an analytical process for the effectiveness of multilingual sentiment analysis with italian text. Journal of Big Data and Artificial Intelligence, 2(1), 2024. [CrossRef]
- Parth Hasmukh Jain, Vivek Kumar, Jim Samuel, Sushmita Singh, Abhinay Mannepalli, and Richard Anderson. Artificially intelligent readers: An adaptive framework for original handwritten numerical digits recognition with ocr methods. Information, 14(6):305, 2023. [CrossRef]
- Yana Samuel, Margaret Brennan-Tonetta, Jim Samuel, Rajiv Kashyap, Vivek Kumar, Sri Krishna Kaashyap, Nishitha Chidipothu, Irawati Anand, and Parth Jain. Cultivation of human centered artificial intelligence: culturally adaptive thinking in education (cate) for ai. Frontiers in Artificial Intelligence, 6:1198180, 2023. [CrossRef]
- Joseph Jones. Don’t fear artificial intelligence, question the business model: How surveillance capitalists use media to invade privacy, disrupt moral autonomy, and harm democracy. Journal of Communication Inquiry, page 01968599241235209, 2024.
- Jim Samuel, Rajiv Kashyap, Yana Samuel, and Alexander Pelaez. Adaptive cognitive fit: Artificial intelligence augmented management of information facets and representations. International journal of information management, 65:102505, 2022b. [CrossRef]

















| Data Source | Google News RSS Feed |
| Search Date | November 1, 2020 to February 16, 2024 |
| Search Terms | AI; A.I.; Artificial Intelligence |
| Retrieval Tool | Custom URL generation |
| Additional Tool | ScrapingBee API (conditional use) |
| Sentiment | Headline |
|---|---|
| Positive | 4 ways AI can help with climate change, from detecting methane to preventing fires Beneficial AI: Safe, Secure, and Trustworthy Artificial Intelligence for Food Safety How AI can help the education of blind and visually impaired people |
| Negative | Elon Musk warns AI could cause ‘civilization destruction’ even as he invests in it Bias in AI is a real problem Stupid Artificial Intelligence |
| Neutral | 3 Artificial Intelligence (AI) Stocks With More Upside Than Nvidia NVIDIA Announces Jetson Platform Expansion for Robotics and Edge Stanford Releases Report on the Current State of AI |
| Emotion | Headlines |
|---|---|
| Fear | The Dystopia is Here, AI is Taking over Data Science Jobs in 2021 AI doomsday warnings a distraction from the danger it already poses, warns expert The Godfather of Artificial Intelligence warns of a dark future |
| Absence of Fear | Artificial intelligence could help doctors predict breast cancer risks How AI is helping to save the Amazon - Positive News AI Predicts Future Pancreatic Cancer | Harvard Medical School |
| Algorithm | Accuracy (%) | Precision | Recall | F1 Score |
|---|---|---|---|---|
| DistilBERT | 66 | 0.69 | 0.58 | 0.63 |
| LLama 2 | 82.5 | 0.93 | 0.71 | 0.80 |
| Mistral | 81.1 | 0.96 | 0.63 | 0.77 |
| Headline | Actual | Predicted |
|---|---|---|
| How generative AI is boosting the spread of disinformation and propaganda | 1 | Fear-Mongering |
| Will definitely replace me’: Americans fear artificial intelligence will steal their jobs | 1 | Fear-Mongering |
| 5 EU FinTechs Using AI to Support Consumers, Businesses | 0 | Non-Fear-Mongering |
| Meta’s new learning algorithm can teach AI to multi-task | 0 | Non-Fear-Mongering |
| Headline | Actual | Predicted |
|---|---|---|
| Artificial Intelligence Begins to Enter the World of Stock Investment | 0 | Fear-Mongering |
| Generative AI-nxiety | 1 | Non-Fear-Mongering |
| Algorithm | Accuracy (%) | Precision | Recall | F1 Score |
|---|---|---|---|---|
| Logistic Regression | 83.0 | 0.841 | 0.822 | 0.832 |
| SVC | 81.4 | 0.809 | 0.823 | 0.816 |
| Gaussian NB | 80.5 | 0.799 | 0.816 | 0.807 |
| Random Forest | 81.1 | 0.814 | 0.808 | 0.811 |
| XGBoost | 83.3 | 0.823 | 0.797 | 0.810 |
| Model | Class | Precision | Recall | F1-Score |
|---|---|---|---|---|
| Logistic Regression | 0 | 0.84 | 0.82 | 0.83 |
| XGBoost | 0 | 0.79 | 0.91 | 0.84 |
| Logistic Regression | 1 | 0.82 | 0.84 | 0.83 |
| XGBoost | 1 | 0.89 | 0.76 | 0.82 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).