Earth Mover’s Distance Minimization for Unsupervised Bilingual Lexicon Induction
Citations
1,068 citations
538 citations
Cites background from "Earth Mover’s Distance Minimization..."
...Later approaches reduced the amount of supervision required using self-training (Artetxe et al., 2017) and unsupervised strategies such as adversarial training (Conneau et al., 2018a), heuristic initialisation (Artetxe et al., 2018), and optimal transport (Zhang et al., 2017)....
[...]
414 citations
Cites background or methods or result from "Earth Mover’s Distance Minimization..."
...We report the results in the dataset of Zhang et al. (2017a) at Table 1....
[...]
...Given that Zhang et al. (2017a) report using a different value of their hyperparameter λ for different language pairs (λ = 10 for English-Turkish and λ = 1 for the rest), we test both values in all our experiments to… [Footnote 4: The test dictionaries were obtained through personal communication with the…]...
[...]
...[Footnote 5] Despite our efforts, Zhang et al. (2017b) was left out because: 1) it does not create a one-to-one dictionary, thus difficulting direct comparison, 2) it depends on expensive proprietary software 3) its computational cost is orders of magnitude higher (running the experiments would have taken…...
[...]
...The method of Zhang et al. (2017a) does not work at all in this more challenging scenario, which is in line with the negative results reported by the authors themselves for similar conditions (only 2.53% accuracy in their large Gigaword dataset)....
[...]
...Together with it, we also test the methods of Zhang et al. (2017a) and Conneau et al. (2018) using the publicly available implementations from the authors (footnote 5)....
[...]
270 citations
Cites result from "Earth Mover’s Distance Minimization..."
...…dictionary, typically in the range of a few thousand entries, although a recent line of work has managed to achieve comparable results in a fully unsupervised manner based on either self-learning (Artetxe et al., 2017, 2018b) or adversarial training (Zhang et al., 2017a,b; Conneau et al., 2018)....
[...]
239 citations
References
38,211 citations
"Earth Mover’s Distance Minimization..." refers background in this paper
...Generative adversarial nets (GANs) are originally proposed to generate natural images (Goodfellow et al., 2014)....
[...]
24,012 citations
"Earth Mover’s Distance Minimization..." refers background or methods or result in this paper
...This is exactly the supervised scenario, and previous works typically resort to gradient-based solvers (Mikolov et al., 2013a)....
[...]
...This idea has led to previous supervised methods: • Translation matrix (TM) (Mikolov et al., 2013a): the pioneer of this type of methods, using linear transformation....
[...]
...Interestingly, as computational models of word semantics, monolingual word embeddings also exhibit isomorphism across languages (Mikolov et al., 2013a)....
[...]
...As we aim to eliminate the need for crosslingual supervision from word translation pairs, the measure cannot be defined at the word level as in previous work (Mikolov et al., 2013a)....
[...]
...…et al., 2016), or the word level (i.e. in the form of seed lexicon) (Gouws and Søgaard, 2015; Wick et al., 2016; Duong et al., 2016; Shi et al., 2015; Mikolov et al., 2013a; Faruqui and Dyer, 2014; Lu et al., 2015; Dinu et al., 2015; Lazaridou et al., 2015; Ammar et al., 2016; Zhang et al.,…...
[...]
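The snippets above describe the supervised translation-matrix setting: a linear transformation between two embedding spaces, fitted with a gradient-based solver. The following is a minimal sketch of that idea on synthetic data, assuming a mean squared error objective. The function name `fit_translation_matrix`, the learning rate, the step count, and the random "embeddings" are all illustrative assumptions, not taken from any of the cited papers.

```python
import numpy as np

def fit_translation_matrix(X, Y, lr=0.1, steps=500):
    """Fit W so that X @ W approximates Y, using plain gradient descent.

    X: (n, d) source-language vectors for the seed translation pairs.
    Y: (n, d) target-language vectors for the same pairs.
    Objective (an assumption of this sketch): (1/n) * ||X W - Y||_F^2.
    """
    n, d = X.shape
    W = np.zeros((d, Y.shape[1]))
    for _ in range(steps):
        # Gradient of the mean squared error with respect to W.
        grad = (2.0 / n) * X.T @ (X @ W - Y)
        W -= lr * grad
    return W

# Synthetic stand-in for a seed lexicon: Y is an exact linear image of X.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 5))
W_true = rng.standard_normal((5, 5))
Y = X @ W_true

W = fit_translation_matrix(X, Y)
max_error = np.abs(W - W_true).max()  # how far the fit is from the true map
```

In practice the rows of `X` and `Y` would be monolingual word embeddings paired through a seed lexicon. That word-level supervision is exactly what the unsupervised approaches discussed on this page aim to remove.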
6,759 citations
"Earth Mover’s Distance Minimization..." refers background in this paper
...Therefore, a lot of research efforts have been dedicated to the investigation into stabler training (Radford et al., 2015; Salimans et al., 2016; Nowozin et al., 2016; Metz et al., 2016; Poole et al., 2016; Arjovsky and Bottou, 2017), and the recently proposed Wasserstein GAN (Arjovsky et al.,…...
[...]
5,711 citations
5,593 citations
"Earth Mover’s Distance Minimization..." refers methods in this paper
...As neural networks are universal function approximators (Hornik, 1991), we can attempt to approximate f with a neural network, called the critic D, with weight clipping to ensure the function family is K-Lipschitz....
[...]
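The last snippet mentions a critic D whose weights are clipped so that the function family stays K-Lipschitz. A rough sketch of the clipping step follows. It assumes a linear critic f(x) = w·x as a stand-in for the neural network in the quoted text, and the clip value, learning rate, and toy Gaussian samples are illustrative choices only.

```python
import numpy as np

def train_clipped_critic(real, fake, clip=0.01, lr=0.05, steps=100):
    """Train a linear critic f(x) = w @ x with weight clipping.

    The critic is pushed to maximize mean(f(real)) - mean(f(fake)).
    Clipping every weight to [-clip, clip] after each update bounds the
    slope of f, which keeps it Lipschitz with a constant set by `clip`.
    """
    w = np.zeros(real.shape[1])
    for _ in range(steps):
        # Gradient of the objective with respect to w (constant for a linear critic).
        grad = real.mean(axis=0) - fake.mean(axis=0)
        w = w + lr * grad             # gradient ascent step
        w = np.clip(w, -clip, clip)   # weight clipping
    return w

# Toy samples: two Gaussians whose means differ by 1 in every dimension.
rng = np.random.default_rng(0)
real = rng.normal(loc=1.0, scale=1.0, size=(500, 3))
fake = rng.normal(loc=0.0, scale=1.0, size=(500, 3))

w = train_clipped_critic(real, fake)
# Critic's estimate of the gap between the two sample sets (a scaled distance proxy).
estimate = (real @ w).mean() - (fake @ w).mean()
```

With a neural-network critic the same `np.clip`-style step would be applied to every parameter tensor after each optimizer update. The linear case is shown here only because it keeps the example self-contained.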