scispace - formally typeset
A

Armand Joulin

Researcher at Facebook

Publications -  136
Citations -  36652

Armand Joulin is an academic researcher from Facebook. The author has contributed to research in topics: Computer science & Word (computer architecture). The author has an hindex of 55, co-authored 125 publications receiving 25130 citations. Previous affiliations of Armand Joulin include Microsoft & École Normale Supérieure.

Papers
More filters
Proceedings ArticleDOI

Loss in Translation: Learning Bilingual Word Mapping with a Retrieval Criterion

TL;DR: This paper proposes an unified formulation that directly optimizes a retrieval criterion in an end-to-end fashion for word translation, and shows that this approach outperforms the state of the art on word translation.
Proceedings ArticleDOI

A graph-matching kernel for object categorization

TL;DR: This algorithm is used to construct a kernel appropriate for SVM-based image classification, and experiments with the Caltech 101, Caltech 256, and Scenes datasets demonstrate performance that matches or exceeds the state of the art for methods using a single type of features.
Proceedings ArticleDOI

Unsupervised Pre-Training of Image Features on Non-Curated Data

TL;DR: This work proposes a new unsupervised approach which leverages self-supervision and clustering to capture complementary statistics from large-scale data and validates its approach on 96 million images from YFCC100M, achieving state-of-the-art results among unsuper supervised methods on standard benchmarks.
Posted Content

Improving Neural Language Models with a Continuous Cache

TL;DR: This article propose an extension to neural network language models to adapt their prediction to the recent history, which stores past hidden activations as memory and accesses them through a dot product with the current hidden activation.
Proceedings Article

Unsupervised Learning of Visual Features by Contrasting Cluster Assignments

TL;DR: SwAV as discussed by the authors uses a "swapped" prediction mechanism where they predict the cluster assignment of a view from the representation of another view, instead of comparing features directly as in contrastive learning.