Institution

Facebook

Company•Tel Aviv, Israel•

About: Facebook is a company organization based out in Tel Aviv, Israel. It is known for research contribution in the topics: Computer science & Artificial neural network. The organization has 7856 authors who have published 10906 publications receiving 570123 citations. The organization is also known as: facebook.com & FB.

...read moreread less

Topics: Computer science, Artificial neural network, Language model, Context (language use), Reinforcement learning ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Patent•

Systems and methods for a symbol-adaptable keyboard

[...]

Jenny Yuen¹, Luke St. Clair¹•Institutions (1)

Facebook¹

05 Dec 2012

TL;DR: In this article, a method for detecting a communication session between a first user and one or more second users is presented. But the method does not consider the social context of the communication session.

...read moreread less

Abstract: In one embodiment, a method includes detecting a communication session between a first user and one or more second users. The method also includes determining a social context of the communication session, and determining based at least in part on the social context a set of symbols for communication by the first user in the communication session with the second users. The method further includes providing for display to the first user a set of keys corresponding to the set of symbols. The keys indicate symbols for input by the first user in the communication session.

...read moreread less

130 citations

Posted Content•

Self-Supervised Exploration via Disagreement

[...]

Deepak Pathak¹, Dhiraj Gandhi², Abhinav Gupta²•Institutions (2)

University of California, Berkeley¹, Facebook²

10 Jun 2019-arXiv: Learning

TL;DR: In this article, an ensemble of dynamics models is used to incentivize the agent to explore such that the disagreement of those ensembles is maximized, which results in a sample-efficient exploration.

...read moreread less

Abstract: Efficient exploration is a long-standing problem in sensorimotor learning. Major advances have been demonstrated in noise-free, non-stochastic domains such as video games and simulation. However, most of these formulations either get stuck in environments with stochastic dynamics or are too inefficient to be scalable to real robotics setups. In this paper, we propose a formulation for exploration inspired by the work in active learning literature. Specifically, we train an ensemble of dynamics models and incentivize the agent to explore such that the disagreement of those ensembles is maximized. This allows the agent to learn skills by exploring in a self-supervised manner without any external reward. Notably, we further leverage the disagreement objective to optimize the agent's policy in a differentiable manner, without using reinforcement learning, which results in a sample-efficient exploration. We demonstrate the efficacy of this formulation across a variety of benchmark environments including stochastic-Atari, Mujoco and Unity. Finally, we implement our differentiable exploration on a real robot which learns to interact with objects completely from scratch. Project videos and code are at this https URL

...read moreread less

130 citations

Patent•

Intelligent identification of multimedia content for synchronization

[...]

David Biderman¹•Institutions (1)

Facebook¹

30 Dec 2004

TL;DR: In this paper, an intelligent synchronization tool ensures access to desired content in a manner that automatically keeps the content current on the portable media device by introducing a variation threshold or user-specified degree of content variation among content downloaded to a user's mobile device to prevent the user from becoming bored.

...read moreread less

Abstract: An intelligent synchronization tool ensures access to desired content in a manner that automatically keeps the content current on the portable media device. A variation threshold or user-specified degree of content variation may be introduced among content downloaded to a user's mobile device to prevent the user from becoming bored. Furthermore, intelligent synchronization may automatically populate the portable media device with popular content to save a user time and/or use passive monitoring techniques to ascertain a user's preferences for subsequent population.

...read moreread less

130 citations

Proceedings Article•DOI•

Semantic Amodal Segmentation

[...]

Yan Zhu¹, Yuandong Tian², Dimitris N. Metaxas¹, Piotr Dollár²•Institutions (2)

Rutgers University¹, Facebook²

01 Jul 2017

TL;DR: A detailed image annotation that captures information beyond the visible pixels and requires complex reasoning about full scene structure is proposed, and it is shown that the proposed full scene annotation is surprisingly consistent between annotators, including for regions and edges.

...read moreread less

Abstract: Common visual recognition tasks such as classification, object detection, and semantic segmentation are rapidly reaching maturity, and given the recent rate of progress, it is not unreasonable to conjecture that techniques for many of these problems will approach human levels of performance in the next few years. In this paper we look to the future: what is the next frontier in visual recognition? We offer one possible answer to this question. We propose a detailed image annotation that captures information beyond the visible pixels and requires complex reasoning about full scene structure. Specifically, we create an amodal segmentation of each image: the full extent of each region is marked, not just the visible pixels. Annotators outline and name all salient regions in the image and specify a partial depth order. The result is a rich scene structure, including visible and occluded portions of each region, figure-ground edge information, semantic labels, and object overlap. We create two datasets for semantic amodal segmentation. First, we label 500 images in the BSDS dataset with multiple annotators per image, allowing us to study the statistics of human annotations. We show that the proposed full scene annotation is surprisingly consistent between annotators, including for regions and edges. Second, we annotate 5000 images from COCO. This larger dataset allows us to explore a number of algorithmic ideas for amodal segmentation and depth ordering. We introduce novel metrics for these tasks, and along with our strong baselines, define concrete new challenges for the community.

...read moreread less

130 citations

Posted Content•

Support-set bottlenecks for video-text representation learning

[...]

Mandela Patrick¹, Po-Yao Huang¹, Yuki M. Asano¹, Florian Metze¹, Alexander G. Hauptmann², João F. Henriques³, Andrea Vedaldi³ - Show less +3 more•Institutions (3)

Facebook¹, Carnegie Mellon University², University of Oxford³

06 Oct 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper proposes a novel method that leverages a generative model to naturally push related samples together, and results in representations that explicitly encode semantics shared between samples, unlike noise contrastive learning.

...read moreread less

Abstract: The dominant paradigm for learning video-text representations -- noise contrastive learning -- increases the similarity of the representations of pairs of samples that are known to be related, such as text and video from the same sample, and pushes away the representations of all other pairs. We posit that this last behaviour is too strict, enforcing dissimilar representations even for samples that are semantically-related -- for example, visually similar videos or ones that share the same depicted action. In this paper, we propose a novel method that alleviates this by leveraging a generative model to naturally push these related samples together: each sample's caption must be reconstructed as a weighted combination of other support samples' visual representations. This simple idea ensures that representations are not overly-specialized to individual samples, are reusable across the dataset, and results in representations that explicitly encode semantics shared between samples, unlike noise contrastive learning. Our proposed method outperforms others by a large margin on MSR-VTT, VATEX and ActivityNet, and MSVD for video-to-text and text-to-video retrieval.

...read moreread less

129 citations

Collapse

Authors

Showing all 7875 results

Name	H-index	Papers	Citations
Yoshua Bengio	202	1033	420313
Xiang Zhang	154	1733	117576
Jitendra Malik	151	493	165087
Trevor Darrell	148	678	181113
Christopher D. Manning	138	499	147595
Robert W. Heath	128	1049	73171
Pieter Abbeel	126	589	70911
Yann LeCun	121	369	171211
Li Fei-Fei	120	420	145574
Jon Kleinberg	117	444	87865
Sergey Levine	115	652	59769
Richard Szeliski	113	359	72019
Sanjeev Kumar	113	1325	54386
Bruce Neal	108	561	87213
Larry S. Davis	107	693	49714

Network Information

Related Institutions (5)

Google

39.8K papers, 2.1M citations

98% related

Microsoft

86.9K papers, 4.1M citations

96% related

Adobe Systems

8K papers, 214.7K citations

94% related

Carnegie Mellon University

104.3K papers, 5.9M citations

38.6K papers, 1.3M citations

90% related

Performance

Metrics

10,939

Papers

851,954

Citations

No. of papers from the Institution in previous years
Year	Papers
2024	1
2022	37
2021	1,738
2020	2,017
2019	1,607
2018	1,229