Institution

Facebook

Company•Tel Aviv, Israel•

About: Facebook is a company organization based out in Tel Aviv, Israel. It is known for research contribution in the topics: Computer science & Artificial neural network. The organization has 7856 authors who have published 10906 publications receiving 570123 citations. The organization is also known as: facebook.com & FB.

...read moreread less

Topics: Computer science, Artificial neural network, Language model, Context (language use), Reinforcement learning ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Self-Training for End-to-End Speech Recognition

[...]

Jacob Kahn¹, Ann B. Lee¹, Awni Hannun¹•Institutions (1)

Facebook¹

04 May 2020

TL;DR: This article showed that training with pseudo-labels can substantially improve the accuracy of a baseline model, using a strong baseline acoustic and language model, filtering mechanisms tailored to common errors from sequence-to-sequence models, and a novel ensemble approach to increase pseudo-label diversity.

...read moreread less

Abstract: We revisit self-training in the context of end-to-end speech recognition. We demonstrate that training with pseudo-labels can substantially improve the accuracy of a baseline model. Key to our approach are a strong baseline acoustic and language model used to generate the pseudo-labels, filtering mechanisms tailored to common errors from sequence-to-sequence models, and a novel ensemble approach to increase pseudo-label diversity. Experiments on the LibriSpeech corpus show that with an ensemble of four models and label filtering, self-training yields a 33.9% relative improvement in WER compared with a baseline trained on 100 hours of labelled data in the noisy speech setting. In the clean speech setting, self-training recovers 59.3% of the gap between the baseline and an oracle model, which is at least 93.8% relatively higher than what previous approaches can achieve.

...read moreread less

110 citations

Posted Content•

Very Deep Convolutional Networks for Text Classification

[...]

Alexis Conneau¹, Holger Schwenk, Loïc Barrault, Yann LeCun•Institutions (1)

Facebook¹

06 Jun 2016-arXiv: Computation and Language

TL;DR: Very deep convolutional neural networks (VDCNN) as mentioned in this paper have been applied to text classification tasks and have achieved state-of-the-art performance on several text classification problems.

...read moreread less

Abstract: The dominant approach for many NLP tasks are recurrent neural networks, in particular LSTMs, and convolutional neural networks. However, these architectures are rather shallow in comparison to the deep convolutional networks which have pushed the state-of-the-art in computer vision. We present a new architecture (VDCNN) for text processing which operates directly at the character level and uses only small convolutions and pooling operations. We are able to show that the performance of this model increases with depth: using up to 29 convolutional layers, we report improvements over the state-of-the-art on several public text classification tasks. To the best of our knowledge, this is the first time that very deep convolutional nets have been applied to text processing.

...read moreread less

110 citations

Posted Content•

Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model

[...]

Jiasen Lu¹, Anitha Kannan², Jianwei Yang¹, Devi Parikh³, Dhruv Batra¹ - Show less +1 more•Institutions (3)

Virginia Tech¹, Facebook², Georgia Institute of Technology³

05 Jun 2017-arXiv: Computer Vision and Pattern Recognition

TL;DR: A novel training framework for neural sequence models, particularly for grounded dialog generation, that leverages the recently proposed Gumbel-Softmax approximation to the discrete distribution, and introduces a stronger encoder for visual dialog, and employs a self-attention mechanism for answer encoding.

...read moreread less

Abstract: We present a novel training framework for neural sequence models, particularly for grounded dialog generation. The standard training paradigm for these models is maximum likelihood estimation (MLE), or minimizing the cross-entropy of the human responses. Across a variety of domains, a recurring problem with MLE trained generative neural dialog models (G) is that they tend to produce 'safe' and generic responses ("I don't know", "I can't tell"). In contrast, discriminative dialog models (D) that are trained to rank a list of candidate human responses outperform their generative counterparts; in terms of automatic metrics, diversity, and informativeness of the responses. However, D is not useful in practice since it cannot be deployed to have real conversations with users. Our work aims to achieve the best of both worlds -- the practical usefulness of G and the strong performance of D -- via knowledge transfer from D to G. Our primary contribution is an end-to-end trainable generative visual dialog model, where G receives gradients from D as a perceptual (not adversarial) loss of the sequence sampled from G. We leverage the recently proposed Gumbel-Softmax (GS) approximation to the discrete distribution -- specifically, an RNN augmented with a sequence of GS samplers, coupled with the straight-through gradient estimator to enable end-to-end differentiability. We also introduce a stronger encoder for visual dialog, and employ a self-attention mechanism for answer encoding along with a metric learning loss to aid D in better capturing semantic similarities in answer responses. Overall, our proposed model outperforms state-of-the-art on the VisDial dataset by a significant margin (2.67% on recall@10). The source code can be downloaded from this https URL.

...read moreread less

110 citations

Posted Content•

Unsupervised Learning of Dense Visual Representations

[...]

Pedro O. Pinheiro¹, Amjad Almahairi², Ryan Y. Benmalek³, Florian Golemo⁴, Aaron Courville⁴ - Show less +1 more•Institutions (4)

Facebook¹, New York University², Cornell University³, Université de Montréal⁴

11 Nov 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: View-Agnostic Dense Representation (VADeR) is proposed for unsupervised learning of dense representations of pixelwise representations by forcing local features to remain constant over different viewing conditions through pixel-level contrastive learning.

...read moreread less

Abstract: Contrastive self-supervised learning has emerged as a promising approach to unsupervised visual representation learning. In general, these methods learn global (image-level) representations that are invariant to different views (i.e., compositions of data augmentation) of the same image. However, many visual understanding tasks require dense (pixel-level) representations. In this paper, we propose View-Agnostic Dense Representation (VADeR) for unsupervised learning of dense representations. VADeR learns pixelwise representations by forcing local features to remain constant over different viewing conditions. Specifically, this is achieved through pixel-level contrastive learning: matching features (that is, features that describes the same location of the scene on different views) should be close in an embedding space, while non-matching features should be apart. VADeR provides a natural representation for dense prediction tasks and transfers well to downstream tasks. Our method outperforms ImageNet supervised pretraining (and strong unsupervised baselines) in multiple dense prediction tasks.

...read moreread less

110 citations

Journal Article•DOI•

Original Sin: A Cross-National Study of the Legality of Homosexual Acts

[...]

Victor Asal¹, Udi Sommer², Paul G. Harwood³•Institutions (3)

University at Albany, SUNY¹, Tel Aviv University², Facebook³

01 Mar 2013-Comparative Political Studies

TL;DR: This article examined the legality of homosexual acts quantitatively in a cross-national perspective with a large sample of countries from 1972 to 2002, employing path dependence as its theoretical framework, and found that path dependence can be used to predict the legal status of same-sex relations.

...read moreread less

Abstract: This article examines the legality of homosexual acts quantitatively in a cross-national perspective with a large sample of countries from 1972 to 2002. Employing path dependence as its theoretical...

...read moreread less

110 citations

Collapse

Authors

Showing all 7875 results

Name	H-index	Papers	Citations
Yoshua Bengio	202	1033	420313
Xiang Zhang	154	1733	117576
Jitendra Malik	151	493	165087
Trevor Darrell	148	678	181113
Christopher D. Manning	138	499	147595
Robert W. Heath	128	1049	73171
Pieter Abbeel	126	589	70911
Yann LeCun	121	369	171211
Li Fei-Fei	120	420	145574
Jon Kleinberg	117	444	87865
Sergey Levine	115	652	59769
Richard Szeliski	113	359	72019
Sanjeev Kumar	113	1325	54386
Bruce Neal	108	561	87213
Larry S. Davis	107	693	49714

Network Information

Related Institutions (5)

Google

39.8K papers, 2.1M citations

98% related

Microsoft

86.9K papers, 4.1M citations

96% related

Adobe Systems

8K papers, 214.7K citations

94% related

Carnegie Mellon University

104.3K papers, 5.9M citations

38.6K papers, 1.3M citations

90% related

Performance

Metrics

10,939

Papers

851,954

Citations

No. of papers from the Institution in previous years
Year	Papers
2024	1
2022	37
2021	1,738
2020	2,017
2019	1,607
2018	1,229