Submitted:
03 June 2024
Posted:
04 June 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
- 1
- False cause fallacy (non causa pro causa)[4]: This fallacy consists of mislocating the cause of one phenomenon to another that is only seemingly related. A media outlet may incorrectly causally link two events, such as attributing a rise in crime to a recent policy change without adequately exploring other factors that contributed.
- 2
- Missing cause bias (information omission) [5]: a special type of omission that consistently omits attributing responsibility, placing blame or giving praise to specific acts or actors that caused an event, such as passively describing a violent attack by using sentences that do not contain a subject.
2. Related Work
2.1. Bias Detection
2.2. Causal Language Modelling
2.3. Applications of Causal Language Modelling in Media Analysis
3. Experimental Setup
3.1. Dataset Collection
3.2. Causal Extraction Models to Detect Media Bias
- Causal Sequence Classification: Returns a causal label (either causal or non-causal)
- Cause-Effect Span Detection: Identifies the text related to the Cause-Effect spans, i.e., which words correspond to to the cause and effect arguments.
- Causal Pair Classification: Identifies whether a marked entity pair is causally related.

4. Results
4.1. BBC vs AlJazeera Coverage
4.2. BBC’s Coverage of the Russia-Ukraine Conflict
5. Discussion
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgements
Conflicts of Interest
Abbreviations
| NLP | Natural Language Processing |
| CE | Causal Extraction |
| SVM | Support Vector Machines |
| CNN | Convoluted Neural Network |
| CRF | Conditional Random Fields |
| LSTM | Long Short-Term Memory |
| LLM | Large Language Model |
| GPT | Generative Pre-Trained Transformer |
| BERT | Bidirectional Encoder Representations from Transformers |
References
- Bernhardt, D.; Krasa, S.; Polborn, M. Political polarization and the electoral effects of media bias. Journal of Public Economics 2008, 92, 1092–1104. [Google Scholar] [CrossRef]
- Saez-Trumper, D.; Castillo, C.; Lalmas, M. Social media news communities: Gatekeeping, coverage, and statement bias. Proceedings of the 22nd ACM international conference on Information & Knowledge Management, 2013, pp. 1679–1684.
- Karlova, N.; Fisher, K.E. A social diffusion model of misinformation and disinformation for understanding human information behaviour. Inf. Res. 2013, 18. [Google Scholar]
- Manninen, B.A. False Cause. In Bad Arguments; Arp, R.; Barbone, S.; Bruce, M., Eds.; Wiley, 2018. [CrossRef]
- Gentzkow, M.; Shapiro, J.M. Media Bias and Reputation. Journal of Political Economy 2006, 114, 280–316. [Google Scholar] [CrossRef]
- Waldmann, M.; Hagmayer, Y. Causal reasoning. In Oxford Handbook of Cognitive Psychology; Oxford University Press, 2013; pp. 733–752.
- Caselli, T.; Vossen, P. The Event StoryLine Corpus: A New Benchmark for Causal and Temporal Relation Extraction. Proceedings of the Events and Stories in the News Workshop; Caselli, T., Miller, B., van Erp, M., Vossen, P., Palmer, M., Hovy, E., Mitamura, T., Caswell, D., Eds.; Association for Computational Linguistics: Vancouver, Canada, 2017; pp. 77–86. [Google Scholar] [CrossRef]
- Drury, B.; Gonçalo Oliveira, H.; de Andrade Lopes, A. A survey of the extraction and applications of causal relations. Natural Langauge Engineering 2022. [Google Scholar] [CrossRef]
- Lin, Y.R.; Bagrow, J.; Lazer, D. More voices than ever? quantifying media bias in networks. Proceedings of the international AAAI conference on web and social media, 2011, Vol. 5, pp. 193–200.
- Best, C.; van der Goot, E.; Blackler, K.; Garcia, T.; Horby, D. Europe media monitor. Technical Report EUR221 73 EN, European Commission, 2005. [Google Scholar]
- Hube, C.; Fetahu, B. Detecting biased statements in wikipedia. Companion proceedings of the the web conference 2018, 2018, pp. 1779–1786. [Google Scholar]
- Horne, B.; Khedr, S.; Adali, S. Sampling the news producers: A large news and feature data set for the study of the complex media landscape. Proceedings of the International AAAI Conference on Web and Social Media, 2018, Vol. 12.
- Cremisini, A.; Aguilar, D.; Finlayson, M.A. A challenging dataset for bias detection: The case of the crisis in the ukraine. Social, Cultural, and Behavioral Modeling: 12th International Conference, SBP-BRiMS 2019, Washington, DC, USA, 9–12 July 2019, Proceedings 12. Springer, 2019, pp. 173–183.
- Baly, R.; Karadzhov, G.; An, J.; Kwak, H.; Dinkov, Y.; Ali, A.; Glass, J.; Nakov, P. What was written vs. who read it: News media profiling using text analysis and social media context. arXiv, 2020; arXiv:2005.04518. [Google Scholar]
- Al-Sarraj, W.F.; Lubbad, H.M. Bias Detection of Palestinian/Israeli Conflict in Western Media: A Sentiment Analysis Experimental Study. 2018 International Conference on Promising Electronic Technologies (ICPET), 2018, pp. 98–103. [CrossRef]
- Wei, J.; Santos, E. Narrative Origin Classification of Israeli-Palestinian Conflict Texts. The Thirty-Third International FLAIRS Conference (FLAIRS-33), 2020.
- Cremisini, A.; Aguilar, D.; Finlayson, M.A. A Challenging Dataset for Bias Detection: The Case of the Crisis in the Ukraine. Social, Cultural, and Behavioral Modeling. SBP-BRiMS 2019; Thomson, R.; Bisgin, H.; Dancy, C.; Hyder, A., Eds. Springer, Cham, 2019, Vol. 11549, Lecture Notes in Computer Science. [CrossRef]
- Potash, P.; Romanov, A.; Gronas, M.; Rumshisky, A.; Gronas, M. Tracking Bias in News Sources Using Social Media: The Russia-Ukraine Maidan Crisis of 2013–2014. Proceedings of the 2017 EMNLP Workshop: Natural Language Processing meets Journalism; Popescu, O., Strapparava, C., Eds.; Association for Computational Linguistics: Copenhagen, Denmark, 2017; pp. 13–18. [Google Scholar] [CrossRef]
- Solstad, T.; Bott, O. 619Causality and Causal Reasoning in Natural Language. The Oxford Handbook of Causal Reasoning, Oxford University Press, 2017. [Google Scholar] [CrossRef]
- Neeleman, A.; Van de Koot, H. The Linguistic Expression of Causation. In The Theta System: Argument Structure at the Interface; Oxford University Press: Oxford, 2012. [Google Scholar] [CrossRef]
- Dunietz, J.; Levin, L.; Carbonell, J. The BECauSE Corpus 2.0: Annotating Causality and Overlapping Relations. Proceedings of the 11th Linguistic Annotation Workshop; Association for Computational Linguistics: Valencia, Spain, 2017; pp. 95–104. [Google Scholar]
- Tan, F.A.; Zuo, X.; Ng, S. UniCausal: Unified Benchmark and Repository for Causal Text Mining. Big Data Analytics and Knowledge Discovery - 25th International Conference, DaWaK 2023, Penang, Malaysia, 28-30 August 2023, Proceedings; Wrembel, R.; Gamper, J.; Kotsis, G.; Tjoa, A.M.; Khalil, I., Eds. Springer, 2023, Vol. 14148, Lecture Notes in Computer Science, pp. 248–262.
- Hidey, C.; Mckeown, K. Identifying Causal Relations Using Parallel Wikipedia Articles. 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers 2016, 3, 1424–1433. [Google Scholar] [CrossRef]
- Zhao, S.; Wang, Q.; Massung, S.; Qin, B.; Liu, T.; Wang, B.; Zhai, C. Constructing and Embedding Abstract Event Causality Networks from Text Snippets. Proceedings of the Tenth ACM International Conference on Web Search and Data Mining. ACM, 2017, pp. 335–344. [CrossRef]
- Kruengkrai, C.; Torisawa, K.; Hashimoto, C.; Kloetzer, J.; Oh, J.H.; Tanaka, M. Improving Event Causality Recognition with Multiple Background Knowledge Sources Using Multi-Column Convolutional Neural Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 2017, Vol. 31. [CrossRef]
- de Silva, T.N.; Zhibo, X.; Rui, Z.; Kezhi, M. Causal Relation Identification Using Convolutional Neural Networks and Knowledge Based Features. International Journal of Computer and Systems Engineering 2017, 11. [Google Scholar]
- Fu, J.F.; Liu, Z.T.; Liu, W.; Zhou, W. Event Causal Relation Extraction Based on Cascaded Conditional Random Fields. Pattern Recognition and Artificial Intelligence 2011, 24, 567. [Google Scholar]
- Li, Z.; Li, Q.; Zou, X.; Ren, J. Causality Extraction based on Self-Attentive BiLSTM-CRF with Transferred Embeddings. ArXiv, 2019; arXiv:abs/1904.07629. [Google Scholar]
- Dasgupta, T.; Saha, R.; Dey, L.; Naskar, A. Automatic Extraction of Causal Relations from Text using Linguistically Informed Deep Neural Networks. Proceedings of the SIGDIAL 2018 Conference; Association for Computational Linguistics: Melbourne, Australia, 2018; pp. 12–14. [Google Scholar]
- Yang, J.; Xiong, H.; Zhang, H.; Hu, M.; An, N. Causal Pattern Representation Learning for Extracting Causality from Literature. Proceedings of the 2022 5th International Conference on Machine Learning and Natural Language Processing; Association for Computing Machinery: New York, NY, USA, 2023; MLNLP ’22; pp. 229–233. [Google Scholar] [CrossRef]
- Khetan, V.; Ramnani, R.R.; Anand, M.; Sengupta, S.; Fano, A.E. Causal BERT: Language Models for Causality Detection Between Events Expressed in Text. Intelligent Computing. Lecture Notes in Networks and Systems; Arai, K., Ed., 2022, Vol. vol 283. [CrossRef]
- Shukla, N.K.; Katikeri, R.; Raja, M.; Sivam, G.; Yadav, S.; Vaid, A.; Prabhakararao, S. Investigating Large Language Models for Financial Causality Detection in Multilingual Setup. 2023 IEEE International Conference on Big Data (BigData), 2023, pp. 2866–2871. [CrossRef]
- Hosseini, P.; Diab, M.; Broniatowski, D.A. Does Causal Coherence Predict Online Spread of Social Media? Social, Cultural, and Behavioral Modeling. SBP-BRiMS 2019; Thomson, R.; Bisgin, H.; Dancy, C.; Hyder, A., Eds. Springer, Cham, 2019, Vol. 11549, Lecture Notes in Computer Science. [CrossRef]
- Mariko, D.; Abi-Akl, H.; Trottier, K.; El-Haj, M. The Financial Causality Extraction Shared Task (FinCausal 2022). Proceedings of the 4th Financial Narrative Processing Workshop @LREC2022; El-Haj, M., Rayson, P., Zmandar, N., Eds.; European Language Resources Association: Marseille, France, 2022; pp. 105–107. [Google Scholar]
- Devlin, J.; Chang, M.; Lee, K.; Toutanova, K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. CoRR, 2018; abs/1810.04805. [Google Scholar]
- Hidey, C.; McKeown, K. Identifying Causal Relations Using Parallel Wikipedia Articles. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers); Erk, K., Smith, N.A., Eds.; Association for Computational Linguistics: Berlin, Germany, 2016; pp. 1424–1433. [Google Scholar] [CrossRef]
- Dunietz, J.; Levin, L.; Carbonell, J. The BECauSE Corpus 2.0: Annotating Causality and Overlapping Relations. Proceedings of the 11th Linguistic Annotation Workshop; Schneider, N., Xue, N., Eds.; Association for Computational Linguistics: Valencia, Spain, 2017; pp. 95–104. [Google Scholar] [CrossRef]
- Mirza, P.; Sprugnoli, R.; Tonelli, S.; Speranza, M. Annotating Causality in the TempEval-3 Corpus. Proceedings of the EACL 2014 Workshop on Computational Approaches to Causality in Language (CAtoCL); Kolomiyets, O., Moens, M.F., Palmer, M., Pustejovsky, J., Bethard, S., Eds.; Association for Computational Linguistics: Gothenburg, Sweden, 2014; pp. 10–19. [Google Scholar] [CrossRef]
- Webber, B.; Prasad, R.; Lee, A.; Joshi, A. The penn discourse treebank 3.0 annotation manual. Philadelphia, University of Pennsylvania 2019, 35, 108. [Google Scholar]
- Hendrickx, I.; Kim, S.N.; Kozareva, Z.; Nakov, P.; Ó Séaghdha, D.; Padó, S.; Pennacchiotti, M.; Romano, L.; Szpakowicz, S. SemEval-2010 Task 8: Multi-Way Classification of Semantic Relations between Pairs of Nominals. Proceedings of the 5th International Workshop on Semantic Evaluation; Erk, K., Strapparava, C., Eds.; Association for Computational Linguistics: Uppsala, Sweden, 2010; pp. 33–38. [Google Scholar]
| 1 | |
| 2 | |
| 3 | |
| 4 |

| Source | Number of Articles |
|---|---|
| BBC Middle East2 | 793 |
| AlJazeera Israel-Palestine Conflict 3 | 1465 |
| BBC Ukraine | 784 |
| AlJazeera Ukraine - Russia War | 1018 |
| Conflict | Number of Articles |
|---|---|
| Israel - Palestine | 98 |
| Ukraine - Russia | 65 |
| BBC | Al Jazeera |
|---|---|
| Israel - Palestine Conflict Headlines | |
| Refaat Alareer: Palestinians mourn writer killed in air strike | Palestinians mourn poet Refaat Alareer killed in Israeli air strike |
| ICJ says Israel must prevent genocide in Gaza | ICJ orders Israel to prevent acts of genocide in Gaza |
| Hamas deputy leader Saleh al-Arouri killed in Beirut blast | Senior Hamas official Saleh al-Arouri killed in Beirut suburb |
| Ukraine - Russia Conflict Headlines | |
| Rustem Umerov: Who is Ukraine’s next defence minister? | Who is Rustem Umerov, Ukraine’s next defence minister? |
| Ukraine and Russia complete first prisoner swap since plane crash | Russia and Ukraine complete first prisoner exchange since plane crash |
| Ukraine celebrates first Christmas on 25 December | Ukraine officially celebrates Christmas on December 25 for the first time |
| Corpus | Source |
|---|---|
| AltLex [36] | News |
| BECAUSE 2.0 [37] | News, Congress Hearings |
| CausalTimeBank (CTB) [38] | News |
| EventStoryLine V1.0 (ESL) [7] | News |
| Penn Discourse Treebank V3.0 (PDTB) [39] | News |
| SemEval 2010 Task 8 (SemEval) [40] | Web |
| Headlines | BBC Middle East | Al Jazeera | BBC Russia-Ukraine |
|---|---|---|---|
| Number of Causal Sentences | |||
| Total Headlines | 793 | 1465 | 784 |
| Causal Headlines | 225 (28.4%) | 350 (24%) | 236 (30%) |
| Causal Sentences with cause spans that include terms related to Israel/Russia | |||
| Number of Headlines | 20 (8%) | 206 (57%) | 98 (41.5%) |
| Causal Sentences with effect spans that include terms related to killed or dead | |||
| Number of Headlines | 78 | 112 | 47 |
| Combined | 8 (1%) | 54 (48%) | 16 (34%) |
| BBC | Al Jazeera |
|---|---|
| Israel-Palestine Conflict Headlines | |
| Samer Abudaqa: Al Jazeera cameraman killed in Gaza drone strike | Al Jazeera journalist Samer Abudaqa killed in Israeli attack in Gaza |
| Wael Al-Dahdouh: Al Jazeera reporter’s family killed in Gaza strike | Al Jazeera condemns Israeli killing of journalist Wael Al-Dahdouh’s family |
| Al Jazeera bureau chief’s son Hamza al-Dahdouh among journalists killed in Gaza | Hamza son of Al Jazeera’s Wael Dahdouh killed in Israeli attack in Gaza |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).