Institution

Facebook

Company•Tel Aviv, Israel

About: Facebook is a company based in Tel Aviv, Israel. It is known for research contributions in the topics Artificial neural network and Language model. The organization has 7,856 authors who have published 10,906 publications receiving 570,123 citations. The organization is also known as facebook.com and FB.

Topics: Artificial neural network, Language model, Reinforcement learning, Machine translation, Social network

Papers

Proceedings Article

Hard Negative Mixing for Contrastive Learning

Yannis Kalantidis¹, Mert Bulent Sariyildiz², Noe Pion, Philippe Weinzaepfel³, Diane Larlus³•Institutions (3)

Facebook¹, Bilkent University², Naver Corporation³

02 Oct 2020

TL;DR: The authors argue that an important aspect of contrastive learning, the effect of hard negatives, has so far been neglected, and they propose hard negative mixing strategies at the feature level that can be computed on-the-fly with minimal computational overhead.

Abstract: Contrastive learning has become a key component of self-supervised learning approaches for computer vision. By learning to embed two augmented versions of the same image close to each other and to push the embeddings of different images apart, one can train highly transferable visual representations. As revealed by recent studies, heavy data augmentation and large sets of negatives are both crucial in learning such representations. At the same time, data mixing strategies either at the image or the feature level improve both supervised and semi-supervised learning by synthesizing novel examples, forcing networks to learn more robust features. In this paper, we argue that an important aspect of contrastive learning, i.e., the effect of hard negatives, has so far been neglected. To get more meaningful negative samples, current top contrastive self-supervised learning approaches either substantially increase the batch sizes, or keep very large memory banks; increasing the memory size, however, leads to diminishing returns in terms of performance. We therefore start by delving deeper into a top-performing framework and show evidence that harder negatives are needed to facilitate better and faster learning. Based on these observations, and motivated by the success of data mixing, we propose hard negative mixing strategies at the feature level, that can be computed on-the-fly with a minimal computational overhead. We exhaustively ablate our approach on linear classification, object detection and instance segmentation and show that employing our hard negative mixing procedure improves the quality of visual representations learned by a state-of-the-art self-supervised learning method.
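The feature-level mixing idea in this abstract can be illustrated with a minimal sketch. The abstract does not spell out the procedure, so everything here is an assumption: "hardest" is taken to mean the negatives with the largest dot product against the query, and synthetic negatives are taken to be L2-normalized convex combinations of random pairs of those hardest negatives. The function names and the `top_k`, `num_synthetic`, and `seed` parameters are hypothetical and are not taken from the paper's code.

```python
import math
import random


def l2_normalize(v):
    """Scale a vector to unit length (returned unchanged if its norm is zero)."""
    norm = math.sqrt(sum(x * x for x in v))
    return list(v) if norm == 0 else [x / norm for x in v]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def mix_hard_negatives(query, negatives, top_k=4, num_synthetic=3, seed=0):
    """Synthesize extra negatives by mixing the hardest existing ones.

    Assumption: a negative is "hard" when its embedding is similar to the query.
    """
    rng = random.Random(seed)
    # Rank negatives by similarity to the query and keep the top_k hardest.
    hardest = sorted(negatives, key=lambda n: dot(query, n), reverse=True)[:top_k]
    synthetic = []
    for _ in range(num_synthetic):
        a, b = rng.sample(hardest, 2)   # two distinct hard negatives
        alpha = rng.random()            # mixing coefficient in [0, 1)
        mixed = [alpha * x + (1 - alpha) * y for x, y in zip(a, b)]
        synthetic.append(l2_normalize(mixed))
    return synthetic


query = l2_normalize([1.0, 0.2, 0.0])
negatives = [l2_normalize(v) for v in (
    [0.9, 0.3, 0.1], [0.8, 0.1, 0.4], [0.0, 1.0, 0.0],
    [-1.0, 0.0, 0.2], [0.7, 0.6, 0.1],
)]
extra_negatives = mix_hard_negatives(query, negatives)
```

In this sketch the synthetic vectors would simply be appended to the existing negatives before computing the contrastive loss, which is why the extra cost is limited to a sort and a few vector additions per query.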

311 citations

Proceedings Article

TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data

Pengcheng Yin¹, Graham Neubig², Wen-tau Yih³, Sebastian Riedel⁴•Institutions (4)

Carnegie Mellon University¹, University of California, San Diego², Facebook³, University College London⁴

01 Jul 2020

TL;DR: TaBERT is a pretrained LM that jointly learns representations for NL sentences and (semi-)structured tables that achieves new best results on the challenging weakly-supervised semantic parsing benchmark WikiTableQuestions, while performing competitively on the text-to-SQL dataset Spider.

Abstract: Recent years have witnessed the burgeoning of pretrained language models (LMs) for text-based natural language (NL) understanding tasks. Such models are typically trained on free-form NL text, hence may not be suitable for tasks like semantic parsing over structured data, which require reasoning over both free-form NL questions and structured tabular data (e.g., database tables). In this paper we present TaBERT, a pretrained LM that jointly learns representations for NL sentences and (semi-)structured tables. TaBERT is trained on a large corpus of 26 million tables and their English contexts. In experiments, neural semantic parsers using TaBERT as feature representation layers achieve new best results on the challenging weakly-supervised semantic parsing benchmark WikiTableQuestions, while performing competitively on the text-to-SQL dataset Spider.

310 citations

Patent

Social Advertisements and Other Informational Messages on a Social Networking Website, and Advertising Model for Same

Timothy A. Kendall¹, Matthew R. Cohler¹, Mark E. Zuckerberg¹, Yun-Fang Juan¹, Robert Kang-Xing Jin¹, Justin M. Rosenstein¹, Andrew G. Bosworth¹, Yishan Wong¹, Adam D'Angelo¹, Chamath M. Palihapitiya¹•Institutions (1)

Facebook¹

18 Aug 2008

TL;DR: A social networking website logs information about actions taken by its members and generates socially relevant ads for a member based on the actions logged for other members to whom that member is connected (i.e., the member's online friends).

Abstract: A social networking website logs information about actions taken by members of the website. For a particular member of the website, the website generates socially relevant ads for the member based on the actions logged for other members on the website to whom the member is connected (i.e., the member's online friends). The advertiser associated with the social ad may compensate the social networking website for publishing the ad on the website. When presenting a member with a social ad, the website may optimize advertising revenue by selecting an ad from the received ads that will maximize the expected value of the social ad. The expected value may be computed according to a function that includes the member's affinity for the ad content and the bid amount. The technique is also applied for providing socially relevant information off the social networking website.
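The ad-selection step described in this abstract can be sketched as follows. The abstract only says the expected value is a function that includes the member's affinity and the bid amount, so the product `affinity * bid` used here is an assumption, and the `SocialAd` class and its field names are hypothetical illustrations rather than anything defined in the patent.

```python
from dataclasses import dataclass


@dataclass
class SocialAd:
    ad_id: str
    affinity: float  # assumed: estimated affinity of the member for the ad content, in [0, 1]
    bid: float       # advertiser's bid amount


def expected_value(ad: SocialAd) -> float:
    """Assumed scoring function: affinity weighted by bid."""
    return ad.affinity * ad.bid


def select_ad(ads):
    """Pick the candidate ad with the highest expected value."""
    return max(ads, key=expected_value)


candidates = [
    SocialAd("a", affinity=0.9, bid=1.0),   # expected value 0.9
    SocialAd("b", affinity=0.2, bid=10.0),  # expected value 2.0
    SocialAd("c", affinity=0.5, bid=3.0),   # expected value 1.5
]
best = select_ad(candidates)  # ad "b" under this assumed scoring function
```

Under this scoring, a high bid can outweigh a low affinity, which is the trade-off the abstract points to when it describes optimizing advertising revenue.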

309 citations

Proceedings Article

LinkBench: a database benchmark based on the Facebook social graph

Timothy G. Armstrong¹, Vamsi Ponnekanti², Dhruba Borthakur², Mark Callaghan²•Institutions (2)

University of Chicago¹, Facebook²

22 Jun 2013

TL;DR: LinkBench provides a realistic and challenging test for persistent storage of social and web service data, filling a gap in the available tools for researchers, developers and administrators.

Abstract: Database benchmarks are an important tool for database researchers and practitioners that ease the process of making informed comparisons between different database hardware, software and configurations. Large scale web services such as social networks are a major and growing database application area, but currently there are few benchmarks that accurately model web service workloads. In this paper we present a new synthetic benchmark called LinkBench. LinkBench is based on traces from production databases that store "social graph" data at Facebook, a major social network. We characterize the data and query workload in many dimensions, and use the insights gained to construct a realistic synthetic benchmark. LinkBench provides a realistic and challenging test for persistent storage of social and web service data, filling a gap in the available tools for researchers, developers and administrators.

309 citations

Posted Content

Unifying distillation and privileged information

David Lopez-Paz¹, Léon Bottou¹, Bernhard Schölkopf, Vladimir Vapnik¹,²•Institutions (2)

Facebook¹, Columbia University²

11 Nov 2015 - arXiv: Machine Learning

TL;DR: The authors unify distillation and privileged information into generalized distillation, a framework to learn from multiple machines and data representations, and demonstrate its efficacy on a variety of numerical simulations on both synthetic and real-world data.

Abstract: Distillation (Hinton et al., 2015) and privileged information (Vapnik & Izmailov, 2015) are two techniques that enable machines to learn from other machines. This paper unifies these two techniques into generalized distillation, a framework to learn from multiple machines and data representations. We provide theoretical and causal insight about the inner workings of generalized distillation, extend it to unsupervised, semisupervised and multitask learning scenarios, and illustrate its efficacy on a variety of numerical simulations on both synthetic and real-world data.

308 citations

Authors

Showing all 7875 results

Name	H-index	Papers	Citations
Yoshua Bengio	202	1033	420313
Xiang Zhang	154	1733	117576
Jitendra Malik	151	493	165087
Trevor Darrell	148	678	181113
Christopher D. Manning	138	499	147595
Robert W. Heath	128	1049	73171
Pieter Abbeel	126	589	70911
Yann LeCun	121	369	171211
Li Fei-Fei	120	420	145574
Jon Kleinberg	117	444	87865
Sergey Levine	115	652	59769
Richard Szeliski	113	359	72019
Sanjeev Kumar	113	1325	54386
Bruce Neal	108	561	87213
Larry S. Davis	107	693	49714

Network Information

Related Institutions (5)

Google

39.8K papers, 2.1M citations

98% related

Microsoft

86.9K papers, 4.1M citations

96% related

Adobe Systems

8K papers, 214.7K citations

94% related

Carnegie Mellon University

104.3K papers, 5.9M citations

38.6K papers, 1.3M citations

90% related

Performance

Metrics

Papers: 10,939

Citations: 851,954

No. of papers from the Institution in previous years
Year	Papers
2024	1
2022	37
2021	1,738
2020	2,017
2019	1,607
2018	1,229