Machine Learning Techniques for Detecting Identifying Linguistic Patterns in News Media

A Samuel Pottinger

doi:10.20944/preprints201906.0051.v1

Submitted:

04 June 2019

Posted:

06 June 2019

You are already at the latest version

Abstract

An article's tone and framing not only influence an audience's perception of a story but may also reveal attributes of author identity and bias. Building upon prior media, psychological, and machine learning research, this neural network-based system detects those writing characteristics in ten news agencies' reporting, discovering patterns that, intentional or not, may reveal an agency's topical perspectives or common contextualization patterns. Specifically, learning linguistic markers of different organizations through a newly released open database, this probabilistic classifier predicts an article's publishing agency with 74% hidden test set accuracy given only a short snippet of text. The resulting model demonstrates how unintentional 'filter bubbles' can emerge in machine learning systems and, by comparing agencies' patterns and highlighting outlets' prototypical articles through an open source exemplar search engine, this paper offers new insight into news media bias.

Keywords:

NLP

;

news media

;

bias

;

neural networking

;

LSTM

;

information retrieval

;

filter bubble

Subject:

Computer Science and Mathematics - Artificial Intelligence and Machine Learning

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Machine Learning Techniques for Detecting Identifying Linguistic Patterns in News Media

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe