<em>What Do We Learn from Word Associations</em>? Evaluating Machine Learning Algorithms for the Extraction of Contextual Word Meaning in Natural Language Processing

Epaminondas Kapetanios; Saad Alshahrani; Anastasia Angelopoulou; Mark Baldwin

doi:10.20944/preprints201805.0102.v2

Submitted:

09 May 2018

Posted:

10 May 2018

You are already at the latest version

Abstract

“You should know the words by the company they keep!” has been one of the most famous slogans attributed to John Rubert Firth, 1957. This has ignited a whole school in linguistic research known as the British empiricist contextualism. Sixty years later, many un- or semi-supervised machine learning algorithms have been successfully designed and implemented aiming at extracting word meaning from within the context of a text corpus. These algorithms treat words, more or less, as vectors of real numbers representing frequencies of word occurrences within context and word meaning as positions of words in a high-dimensional vector space model. Word associations, in turn, are treated as calculated distances among them. With the rise of Deep Learning (DL) and other artificial neural networks based architectures, learning the positioning of words and extracting word associations as measured by their distances has further improved. In this paper, however, we revisited the main stream of algorithmic approaches and set the stage for a partly cross-disciplinary evaluation framework to judge about the nature of the extracted word associations by state-of-the-art machine learning algorithms. Our preliminary results are based on word associations extracted from the application of DL framework on a Google News text corpus, as well as on comparisons with human created word association lists such as word collocation dictionaries and psycholinguistic experiments. The results and conclusions provide some insights into the inherited limitations in interpreting the type of word associations and underpinning relations between words with inevitable consequences in other areas, such as extraction of knowledge graphs or image understanding.

Keywords:

machine learning

;

algorithms

;

natural language processing

;

deep learning

;

vector space models

;

semantic similarity

;

distributional semantics

;

latent semantic analysis

;

word2vec

Subject:

Computer Science and Mathematics - Artificial Intelligence and Machine Learning

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

What Do We Learn from Word Associations? Evaluating Machine Learning Algorithms for the Extraction of Contextual Word Meaning in Natural Language Processing

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe