Institution

Facebook

Company•Tel Aviv, Israel•

About: Facebook is a company organization based out in Tel Aviv, Israel. It is known for research contribution in the topics: Computer science & Artificial neural network. The organization has 7856 authors who have published 10906 publications receiving 570123 citations. The organization is also known as: facebook.com & FB.

...read moreread less

Topics: Computer science, Artificial neural network, Language model, Context (language use), Reinforcement learning ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

KILT: a Benchmark for Knowledge Intensive Language Tasks

[...]

Fabio Petroni¹, Aleksandra Piktus², Angela Fan², Patrick S. H. Lewis³, Majid Yazdani², Nicola De Cao⁴, James Thorne⁵, Yacine Jernite⁶, Vladimir Karpukhin², Jean Maillard², Vassilis Plachouras², Tim Rocktäschel², Sebastian Riedel² - Show less +9 more•Institutions (6)

University of California, Berkeley¹, Facebook², University College London³, University of Amsterdam⁴, University of Cambridge⁵, New York University⁶

01 Jun 2021

TL;DR: It is found that a shared dense vector index coupled with a seq2seq model is a strong baseline, outperforming more tailor-made approaches for fact checking, open-domain question answering and dialogue, and yielding competitive results on entity linking and slot filling, by generating disambiguated text.

...read moreread less

Abstract: Challenging problems such as open-domain question answering, fact checking, slot filling and entity linking require access to large, external knowledge sources. While some models do well on individual tasks, developing general models is difficult as each task might require computationally expensive indexing of custom knowledge sources, in addition to dedicated infrastructure. To catalyze research on models that condition on specific information in large textual resources, we present a benchmark for knowledge-intensive language tasks (KILT). All tasks in KILT are grounded in the same snapshot of Wikipedia, reducing engineering turnaround through the re-use of components, as well as accelerating research into task-agnostic memory architectures. We test both task-specific and general baselines, evaluating downstream performance in addition to the ability of the models to provide provenance. We find that a shared dense vector index coupled with a seq2seq model is a strong baseline, outperforming more tailor-made approaches for fact checking, open-domain question answering and dialogue, and yielding competitive results on entity linking and slot filling, by generating disambiguated text. KILT data and code are available at https://github.com/facebookresearch/KILT.

...read moreread less

227 citations

Journal Article•DOI•

DIGIT: A Novel Design for a Low-Cost Compact High-Resolution Tactile Sensor With Application to In-Hand Manipulation

[...]

Mike Lambeta¹, Po-Wei Chou¹, Stephen Tian¹, Brian Yang¹, Benjamin Maloon¹, Victoria Rose Most¹, Dave Stroud¹, Raymond Santos¹, Ahmad Byagowi¹, Gregg Kammerer¹, Dinesh Jayaraman¹, Roberto Calandra¹ - Show less +8 more•Institutions (1)

Facebook¹

28 Feb 2020

TL;DR: The DIGIT sensor is introduced, an inexpensive, compact, and high-resolution tactile sensor geared towards in-hand manipulation that is demonstrated by training deep neural network model-based controllers to manipulate glass marbles in- hand with a multi-finger robotic hand.

...read moreread less

Abstract: Despite decades of research, general purpose in-hand manipulation remains one of the unsolved challenges of robotics. One of the contributing factors that limit current robotic manipulation systems is the difficulty of precisely sensing contact forces – sensing and reasoning about contact forces are crucial to accurately control interactions with the environment. As a step towards enabling better robotic manipulation, we introduce DIGIT, an inexpensive, compact, and high-resolution tactile sensor geared towards in-hand manipulation. DIGIT improves upon past vision-based tactile sensors by miniaturizing the form factor to be mountable on multi-fingered hands, and by providing several design improvements that result in an easier, more repeatable manufacturing process, and enhanced reliability. We demonstrate the capabilities of the DIGIT sensor by training deep neural network model-based controllers to manipulate glass marbles in-hand with a multi-finger robotic hand. To provide the robotic community access to reliable and low-cost tactile sensors, we open-source the DIGIT design at www.digit.ml .

...read moreread less

227 citations

Posted Content•

Learning from Noisy Labels with Deep Neural Networks

[...]

Sainbayar Sukhbaatar¹, Rob Fergus²•Institutions (2)

New York University¹, Facebook²

09 Jun 2014

TL;DR: In this article, an extra noise layer is added to the network to adapt the network outputs to match the noisy label distribution, which can be estimated as part of the training process and involve simple modifications to current training infrastructures.

...read moreread less

Abstract: The availability of large labeled datasets has allowed Convolutional Network models to achieve impressive recognition results. However, in many settings manual annotation of the data is impractical; instead our data has noisy labels, i.e. there is some freely available label for each image which may or may not be accurate. In this paper, we explore the performance of discriminatively-trained Convnets when trained on such noisy data. We introduce an extra noise layer into the network which adapts the network outputs to match the noisy label distribution. The parameters of this noise layer can be estimated as part of the training process and involve simple modifications to current training infrastructures for deep networks. We demonstrate the approaches on several datasets, including large scale experiments on the ImageNet classification benchmark.

...read moreread less

227 citations

Posted Content•

Abductive Commonsense Reasoning

[...]

Chandra Bhagavatula¹, Ronan Le Bras¹, Chaitanya Malaviya¹, Keisuke Sakaguchi¹, Ari Holtzman¹, Hannah Rashkin¹, Doug Downey¹, Scott Wen-tau Yih², Yejin Choi³ - Show less +5 more•Institutions (3)

Allen Institute for Artificial Intelligence¹, University of Washington², Facebook³

15 Aug 2019-arXiv: Computation and Language

TL;DR: This study introduces a challenge dataset, ART, that consists of over 20k commonsense narrative contexts and 200k explanations, and conceptualizes two new tasks -- Abductive NLI: a multiple-choice question answering task for choosing the more likely explanation, and Abduction NLG: a conditional generation task for explaining given observations in natural language.

...read moreread less

Abstract: Abductive reasoning is inference to the most plausible explanation. For example, if Jenny finds her house in a mess when she returns from work, and remembers that she left a window open, she can hypothesize that a thief broke into her house and caused the mess, as the most plausible explanation. While abduction has long been considered to be at the core of how people interpret and read between the lines in natural language (Hobbs et al., 1988), there has been relatively little research in support of abductive natural language inference and generation. We present the first study that investigates the viability of language-based abductive reasoning. We introduce a challenge dataset, ART, that consists of over 20k commonsense narrative contexts and 200k explanations. Based on this dataset, we conceptualize two new tasks -- (i) Abductive NLI: a multiple-choice question answering task for choosing the more likely explanation, and (ii) Abductive NLG: a conditional generation task for explaining given observations in natural language. On Abductive NLI, the best model achieves 68.9% accuracy, well below human performance of 91.4%. On Abductive NLG, the current best language generators struggle even more, as they lack reasoning capabilities that are trivial for humans. Our analysis leads to new insights into the types of reasoning that deep pre-trained language models fail to perform--despite their strong performance on the related but more narrowly defined task of entailment NLI--pointing to interesting avenues for future research.

...read moreread less

226 citations

Patent•

Providing Personalized Platform Application Content

[...]

Adam D'Angelo¹, Charles Duplain Cheever¹, Arieh Steinberg¹, James Wang¹, Mark Slee¹ - Show less +1 more•Institutions (1)

Facebook¹

12 Jun 2008

TL;DR: In this paper, the authors present a privacy settings associated with a user profile allowing the social networking website to limit the profile data communicated to the third-party application server, allowing the third party application server to use this profile data to personalize the application performed for the user.

...read moreread less

Abstract: A social networking website maintains a profile for each user of the website. The profile includes data associated with a user, such as a connection to one or more plurality of other users of the social networking website or user preferences. The social networking website communicates with one or more third-party application servers to provide one or more applications to social networking website users. When a social networking website user requests an application provided by a third-party application server, the social networking website communicates a subset of the user's profile to the third-party application server, allowing the third-party application server to use this profile data to personalize the application performed for the user. A privacy settings associated with a user profile allows the social networking website to limit the profile data communicated to the third-party application server.

...read moreread less

226 citations

Collapse

Authors

Showing all 7875 results

Name	H-index	Papers	Citations
Yoshua Bengio	202	1033	420313
Xiang Zhang	154	1733	117576
Jitendra Malik	151	493	165087
Trevor Darrell	148	678	181113
Christopher D. Manning	138	499	147595
Robert W. Heath	128	1049	73171
Pieter Abbeel	126	589	70911
Yann LeCun	121	369	171211
Li Fei-Fei	120	420	145574
Jon Kleinberg	117	444	87865
Sergey Levine	115	652	59769
Richard Szeliski	113	359	72019
Sanjeev Kumar	113	1325	54386
Bruce Neal	108	561	87213
Larry S. Davis	107	693	49714

Network Information

Related Institutions (5)

Google

39.8K papers, 2.1M citations

98% related

Microsoft

86.9K papers, 4.1M citations

96% related

Adobe Systems

8K papers, 214.7K citations

94% related

Carnegie Mellon University

104.3K papers, 5.9M citations

38.6K papers, 1.3M citations

90% related

Performance

Metrics

10,939

Papers

851,954

Citations

No. of papers from the Institution in previous years
Year	Papers
2024	1
2022	37
2021	1,738
2020	2,017
2019	1,607
2018	1,229