scispace - formally typeset
A

Armand Joulin

Researcher at Facebook

Publications -  136
Citations -  36652

Armand Joulin is an academic researcher from Facebook. The author has contributed to research in topics: Computer science & Word (computer architecture). The author has an hindex of 55, co-authored 125 publications receiving 25130 citations. Previous affiliations of Armand Joulin include Microsoft & École Normale Supérieure.

Papers
More filters
Book ChapterDOI

Efficient Image and Video Co-localization with Frank-Wolfe Algorithm

TL;DR: This paper shows how the Frank-Wolfe algorithm is able to naturally incorporate temporal terms and constraints for video co-localization into a quadratic programming framework, leading to increased efficiency in both memory and speed.
Proceedings Article

Reducing Transformer Depth on Demand with Structured Dropout

TL;DR: LayerDrop, a form of structured dropout, is explored, which has a regularization effect during training and allows for efficient pruning at inference time, and shows that it is possible to select sub-networks of any depth from one large network without having to finetune them and with limited impact on performance.
Book ChapterDOI

Learning Visual Features from Large Weakly Supervised Data

TL;DR: In this paper, the authors explore the potential of leveraging massive, weakly-labeled image collections for learning good visual features, and train convolutional networks on a dataset of 100 million Flickr photos and comments.
Proceedings ArticleDOI

Libri-Light: A Benchmark for ASR with Limited or No Supervision

TL;DR: In this article, the authors introduce a new collection of spoken English audio suitable for training speech recognition systems under limited or no supervision, which is derived from open-source audio books from the LibriVox project.
Proceedings ArticleDOI

Co-localization in Real-World Images

TL;DR: An extensive evaluation of the method compared to previous state-of-the-art approaches on the challenging PASCAL VOC 2007 and Object Discovery datasets and a large-scale study of co-localization on ImageNet, involving ground-truth annotations for 3, 624 classes and approximately 1 million images.