Institution

Facebook

Company•Tel Aviv, Israel•

About: Facebook is a company organization based out in Tel Aviv, Israel. It is known for research contribution in the topics: Artificial neural network & Language model. The organization has 7856 authors who have published 10906 publications receiving 570123 citations. The organization is also known as: facebook.com & FB.

...read moreread less

Topics: Artificial neural network, Language model, Reinforcement learning, Machine translation, Social network ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Predicting Deeper into the Future of Semantic Segmentation

[...]

Pauline Luc¹, Natalia Neverova¹, Camille Couprie¹, Jakob Verbeek, Yann LeCun² - Show less +1 more•Institutions (2)

Facebook¹, New York University²

23 Mar 2017

TL;DR: In this paper, an autoregressive convolutional neural network was proposed to predict semantic segmentation maps of future frames, which lie up to half a second or further in the future.

...read moreread less

Abstract: The ability to predict and therefore to anticipate the future is an important attribute of intelligence. It is also of utmost importance in real-time systems, e.g. in robotics or autonomous driving, which depend on visual scene understanding for decision making. While prediction of the raw RGB pixel values in future video frames has been studied in previous work, here we introduce the novel task of predicting semantic segmentations of future frames. Given a sequence of video frames, our goal is to predict segmentation maps of not yet observed video frames that lie up to a second or further in the future. We develop an autoregressive convolutional neural network that learns to iteratively generate multiple frames. Our results on the Cityscapes dataset show that directly predicting future segmentations is substantially better than predicting and then segmenting future RGB frames. Prediction results up to half a second in the future are visually convincing and are much more accurate than those of a baseline based on warping semantic segmentations using optical flow.

...read moreread less

171 citations

Proceedings Article•DOI•

Latent credibility analysis

[...]

Jeff Pasternack¹, Dan Roth²•Institutions (2)

Facebook¹, University of Illinois at Urbana–Champaign²

13 May 2013

TL;DR: A new approach to information credibility, Latent Credibility Analysis (LCA), is introduced, constructing strongly principled, probabilistic models where the truth of each claim is a latent variable and the credibility of a source is captured by a set of model parameters.

...read moreread less

Abstract: A frequent problem when dealing with data gathered from multiple sources on the web (ranging from booksellers to Wikipedia pages to stock analyst predictions) is that these sources disagree, and we must decide which of their (often mutually exclusive) claims we should accept. Current state-of-the-art information credibility algorithms known as "fact-finders" are transitive voting systems with rules specifying how votes iteratively flow from sources to claims and then back to sources. While this is quite tractable and often effective, fact-finders also suffer from substantial limitations; in particular, a lack of transparency obfuscates their credibility decisions and makes them difficult to adapt and analyze: knowing the mechanics of how votes are calculated does not readily tell us what those votes mean, and finding, for example, that a source has a score of 6 is not informative. We introduce a new approach to information credibility, Latent Credibility Analysis (LCA), constructing strongly principled, probabilistic models where the truth of each claim is a latent variable and the credibility of a source is captured by a set of model parameters. This gives LCA models clear semantics and modularity that make extending them to capture additional observed and latent credibility factors straightforward. Experiments over four real-world datasets demonstrate that LCA models can outperform the best fact-finders in both unsupervised and semi-supervised settings.

...read moreread less

170 citations

Proceedings Article•

Fixing the train-test resolution discrepancy

[...]

Hugo Touvron¹, Andrea Vedaldi², Matthijs Douze³, Hervé Jégou³•Institutions (3)

University of Strasbourg¹, University of Oxford², Facebook³

14 Jun 2019

TL;DR: In this article, the authors proposed a simple strategy to optimize the classifier performance, that employs different train and test resolutions, and achieved state-of-the-art performance on ImageNet.

...read moreread less

Abstract: Data-augmentation is key to the training of neural networks for image classification. This paper first shows that existing augmentations induce a significant discrepancy between the size of the objects seen by the classifier at train and test time: in fact, a lower train resolution improves the classification at test time! We then propose a simple strategy to optimize the classifier performance, that employs different train and test resolutions. It relies on a computationally cheap fine-tuning of the network at the test resolution. This enables training strong classifiers using small training images, and therefore significantly reduce the training time. For instance, we obtain 77.1% top-1 accuracy on ImageNet with a ResNet-50 trained on 128x128 images, and 79.8% with one trained at 224x224. A ResNeXt-101 32x48d pre-trained with weak supervision on 940 million 224x224 images and further optimized with our technique for test resolution 320x320 achieves 86.4% top-1 accuracy (top-5: 98.0%). To the best of our knowledge this is the highest ImageNet single-crop accuracy to date.

...read moreread less

169 citations

Journal Article•DOI•

Operator growth in the SYK model

[...]

Daniel A. Roberts¹, Douglas Stanford, Alexandre Streicher², Alexandre Streicher³•Institutions (3)

Facebook¹, Stanford University², University of California³

07 Feb 2018-Journal of High Energy Physics

TL;DR: In this paper, the size distribution of a time-evolving operator in the SYK model is discussed and the authors evaluate the distribution numerically for N = 30, and show how to compute it in the large-N theory using the dressed fermion propagator.

...read moreread less

Abstract: We discuss the probability distribution for the “size” of a time-evolving operator in the SYK model. Scrambling is related to the fact that as time passes, the distribution shifts towards larger operators. Initially, the rate is exponential and determined by the infinite-temperature chaos exponent. We evaluate the size distribution numerically for N = 30, and show how to compute it in the large-N theory using the dressed fermion propagator. We then evaluate the distribution explicitly at leading nontrivial order in the large-q expansion.

...read moreread less

169 citations

Posted Content•DOI•

Language models enable zero-shot prediction of the effects of mutations on protein function

[...]

Joshua Meier¹, Joshua Meier², Roshan Rao³, Robert Verkuil¹, Jason Liu¹, Tom Sercu¹, Alexander Rives², Alexander Rives¹ - Show less +4 more•Institutions (3)

Facebook¹, New York University², University of California, Berkeley³

10 Jul 2021-bioRxiv

TL;DR: This paper used zero-shot inference to capture the functional effects of sequence variation, and achieved state-of-the-art performance on protein language models without any supervision from experimental data or additional training.

...read moreread less

Abstract: Modeling the effect of sequence variation on function is a fundamental problem for understanding and designing proteins. Since evolution encodes information about function into patterns in protein sequences, unsupervised models of variant effects can be learned from sequence data. The approach to date has been to fit a model to a family of related sequences. The conventional setting is limited, since a new model must be trained for each prediction task. We show that using only zero-shot inference, without any supervision from experimental data or additional training, protein language models capture the functional effects of sequence variation, performing at state-of-the-art.

...read moreread less

169 citations

Collapse

Authors

Showing all 7875 results

Name	H-index	Papers	Citations
Yoshua Bengio	202	1033	420313
Xiang Zhang	154	1733	117576
Jitendra Malik	151	493	165087
Trevor Darrell	148	678	181113
Christopher D. Manning	138	499	147595
Robert W. Heath	128	1049	73171
Pieter Abbeel	126	589	70911
Yann LeCun	121	369	171211
Li Fei-Fei	120	420	145574
Jon Kleinberg	117	444	87865
Sergey Levine	115	652	59769
Richard Szeliski	113	359	72019
Sanjeev Kumar	113	1325	54386
Bruce Neal	108	561	87213
Larry S. Davis	107	693	49714

Network Information

Related Institutions (5)

Google

39.8K papers, 2.1M citations

98% related

Microsoft

86.9K papers, 4.1M citations

96% related

Adobe Systems

8K papers, 214.7K citations

94% related

Carnegie Mellon University

104.3K papers, 5.9M citations

38.6K papers, 1.3M citations

90% related

Performance

Metrics

10,939

Papers

851,954

Citations

No. of papers from the Institution in previous years
Year	Papers
2024	1
2022	37
2021	1,738
2020	2,017
2019	1,607
2018	1,229