Top 2 papers published by Aitor Soroa from University of the Basque Country in 2019

Proceedings Article•DOI•

Analyzing the Limitations of Cross-lingual Word Embedding Mappings.

[...]

Aitor Ormazabal, Mikel Artetxe¹, Gorka Labaka¹, Aitor Soroa¹, Eneko Agirre¹ - Show less +1 more•Institutions (1)

01 Jun 2019

TL;DR: Joint learning yields to more isomorphic embeddings, is less sensitive to hubness, and obtains stronger results in bilingual lexicon induction, concluding that current mapping methods do have strong limitations.

...read moreread less

Abstract: Recent research in cross-lingual word embeddings has almost exclusively focused on offline methods, which independently train word embeddings in different languages and map them to a shared space through linear transformations. While several authors have questioned the underlying isomorphism assumption, which states that word embeddings in different languages have approximately the same structure, it is not clear whether this is an inherent limitation of mapping approaches or a more general issue when learning cross-lingual embeddings. So as to answer this question, we experiment with parallel corpora, which allows us to compare offline mapping to an extension of skip-gram that jointly learns both embedding spaces. We observe that, under these ideal conditions, joint learning yields to more isomorphic embeddings, is less sensitive to hubness, and obtains stronger results in bilingual lexicon induction. We thus conclude that current mapping methods do have strong limitations, calling for further research to jointly learn cross-lingual embeddings with a weaker cross-lingual signal.

...read moreread less

60 citations

Posted Content•

Analyzing the Limitations of Cross-lingual Word Embedding Mappings

[...]

Aitor Ormazabal, Mikel Artetxe¹, Gorka Labaka¹, Aitor Soroa¹, Eneko Agirre¹ - Show less +1 more•Institutions (1)

University of the Basque Country¹

12 Jun 2019-arXiv: Computation and Language

TL;DR: The authors compare offline mapping to an extension of skip-gram that jointly learns both embedding spaces, and conclude that joint learning yields to more isomorphic embeddings, is less sensitive to hubness, and obtains stronger results in bilingual lexicon induction.

...read moreread less

7 citations

Showing papers by "Aitor Soroa published in 2019"