Preprint Article Version 1 This version is not peer-reviewed

Towards a Universal Semantic Dictionary

Version 1 : Received: 25 July 2019 / Approved: 29 July 2019 / Online: 29 July 2019 (11:05:16 CEST)

How to cite: Castro-Bleda, M.J.; Iklodi, E.; Recski, G.; Borbely, G. Towards a Universal Semantic Dictionary. Preprints 2019, 2019070336 (doi: 10.20944/preprints201907.0336.v1). Castro-Bleda, M.J.; Iklodi, E.; Recski, G.; Borbely, G. Towards a Universal Semantic Dictionary. Preprints 2019, 2019070336 (doi: 10.20944/preprints201907.0336.v1).

Abstract

A novel method for finding linear mappings among word embeddings for several languages, taking as pivot a shared, universal embedding space, is proposed in this paper. Previous approaches learn translation matrices between two specific languages, but this method learn translation matrices between a given language and a shared, universal space. The system was first trained on bilingual, and later on multilingual corpora as well. In the first case two different training data were applied; Dinu’s English-Italian benchmark data, and English-Italian translation pairs extracted from the PanLex database. In the second case only the PanLex database was used. The system performs on English-Italian languages with the best setting significantly better than the baseline system of Mikolov et al. [1], and it provides a comparable performance with the more sophisticated systems of Faruqui and Dyer [2] and Dinu et al. [3]. Exploiting the richness of the PanLex database, the proposed method makes it possible to learn linear mappings among an arbitrary number of languages.

Subject Areas

natural language processing; semantics; word embeddings; multilingual embeddings; translation; artificial neural networks

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our diversity statement.

Leave a public comment
Send a private comment to the author(s)
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.