scispace - formally typeset
Search or ask a question
Institution

Facebook

CompanyTel Aviv, Israel
About: Facebook is a company organization based out in Tel Aviv, Israel. It is known for research contribution in the topics: Computer science & Artificial neural network. The organization has 7856 authors who have published 10906 publications receiving 570123 citations. The organization is also known as: facebook.com & FB.


Papers
More filters
Posted Content
Gautier Izacard1, Edouard Grave1
TL;DR: This paper proposes a technique to learn retriever models for downstream tasks, inspired by knowledge distillation, and which does not require annotated pairs of query and documents.
Abstract: The task of information retrieval is an important component of many natural language processing systems, such as open domain question answering. While traditional methods were based on hand-crafted features, continuous representations based on neural networks recently obtained competitive results. A challenge of using such methods is to obtain supervised data to train the retriever model, corresponding to pairs of query and support documents. In this paper, we propose a technique to learn retriever models for downstream tasks, inspired by knowledge distillation, and which does not require annotated pairs of query and documents. Our approach leverages attention scores of a reader model, used to solve the task based on retrieved documents, to obtain synthetic labels for the retriever. We evaluate our method on question answering, obtaining state-of-the-art results.

114 citations

Posted Content
TL;DR: In this article, a purely bilinear model is trained to learn a metric between an image representation and phrases that are used to describe the image, and the model is then able to infer phrases from a given image sample.
Abstract: Generating a novel textual description of an image is an interesting problem that connects computer vision and natural language processing. In this paper, we present a simple model that is able to generate descriptive sentences given a sample image. This model has a strong focus on the syntax of the descriptions. We train a purely bilinear model that learns a metric between an image representation (generated from a previously trained Convolutional Neural Network) and phrases that are used to described them. The system is then able to infer phrases from a given image sample. Based on caption syntax statistics, we propose a simple language model that can produce relevant descriptions for a given test image using the phrases inferred. Our approach, which is considerably simpler than state-of-the-art models, achieves comparable results in two popular datasets for the task: Flickr30k and the recently proposed Microsoft COCO.

113 citations

Proceedings ArticleDOI
18 Apr 2016
TL;DR: It is shown that Flash disaggregation allows scaling CPU and Flash resources independently in a cost effective manner through resource-efficient scale-out and is used to draw conclusions about data and control plane issues in remote storage.
Abstract: PCIe-based Flash is commonly deployed to provide datacenter applications with high IO rates. However, its capacity and bandwidth are often underutilized as it is difficult to design servers with the right balance of CPU, memory and Flash resources over time and for multiple applications. This work examines Flash disaggregation as a way to deal with Flash overprovisioning. We tune remote access to Flash over commodity networks and analyze its impact on workloads sampled from real datacenter applications. We show that, while remote Flash access introduces a 20% throughput drop at the application level, disaggregation allows us to make up for these overheads through resource-efficient scale-out. Hence, we show that Flash disaggregation allows scaling CPU and Flash resources independently in a cost effective manner. We use our analysis to draw conclusions about data and control plane issues in remote storage.

113 citations

Journal ArticleDOI
TL;DR: The Alcator C-Mod tokamak as discussed by the authors is a high-field toroidal confinement device that uses high-power radio frequency (RF) waves for heating and current drive with innovative launching structures.
Abstract: The object of this review is to summarize the achievements of research on the Alcator C-Mod tokamak [Hutchinson et al., Phys. Plasmas 1, 1511 (1994) and Marmar, Fusion Sci. Technol. 51, 261 (2007)] and to place that research in the context of the quest for practical fusion energy. C-Mod is a compact, high-field tokamak, whose unique design and operating parameters have produced a wealth of new and important results since it began operation in 1993, contributing data that extends tests of critical physical models into new parameter ranges and into new regimes. Using only high-power radio frequency (RF) waves for heating and current drive with innovative launching structures, C-Mod operates routinely at reactor level power densities and achieves plasma pressures higher than any other toroidal confinement device. C-Mod spearheaded the development of the vertical-target divertor and has always operated with high-Z metal plasma facing components—approaches subsequently adopted for ITER. C-Mod has made ground-breaking discoveries in divertor physics and plasma-material interactions at reactor-like power and particle fluxes and elucidated the critical role of cross-field transport in divertor operation, edge flows and the tokamak density limit. C-Mod developed the I-mode and the Enhanced Dα H-mode regimes, which have high performance without large edge localized modes and with pedestal transport self-regulated by short-wavelength electromagnetic waves. C-Mod has carried out pioneering studies of intrinsic rotation and demonstrated that self-generated flow shear can be strong enough in some cases to significantly modify transport. C-Mod made the first quantitative link between the pedestal temperature and the H-mode's performance, showing that the observed self-similar temperature profiles were consistent with critical-gradient-length theories and followed up with quantitative tests of nonlinear gyrokinetic models. RF research highlights include direct experimental observation of ion cyclotron range of frequency (ICRF) mode-conversion, ICRF flow drive, demonstration of lower-hybrid current drive at ITER-like densities and fields and, using a set of novel diagnostics, extensive validation of advanced RF codes. Disruption studies on C-Mod provided the first observation of non-axisymmetric halo currents and non-axisymmetric radiation in mitigated disruptions. A summary of important achievements and discoveries are included.

113 citations

Proceedings ArticleDOI
14 Jun 2016
TL;DR: This paper identifies five important design decisions that affect their ease of use, performance, fault tolerance, scalability, and correctness in the realtime stream processing systems Puma, Swift, and Stylus and illustrates how these decisions and systems satisfy the requirements for multiple use cases at Facebook.
Abstract: Realtime data processing powers many use cases at Facebook, including realtime reporting of the aggregated, anonymized voice of Facebook users, analytics for mobile applications, and insights for Facebook page administrators. Many companies have developed their own systems; we have a realtime data processing ecosystem at Facebook that handles hundreds of Gigabytes per second across hundreds of data pipelines. Many decisions must be made while designing a realtime stream processing system. In this paper, we identify five important design decisions that affect their ease of use, performance, fault tolerance, scalability, and correctness. We compare the alternative choices for each decision and contrast what we built at Facebook to other published systems. Our main decision was targeting seconds of latency, not milliseconds. Seconds is fast enough for all of the use cases we support and it allows us to use a persistent message bus for data transport. This data transport mechanism then paved the way for fault tolerance, scalability, and multiple options for correctness in our stream processing systems Puma, Swift, and Stylus. We then illustrate how our decisions and systems satisfy our requirements for multiple use cases at Facebook. Finally, we reflect on the lessons we learned as we built and operated these systems.

113 citations


Authors

Showing all 7875 results

NameH-indexPapersCitations
Yoshua Bengio2021033420313
Xiang Zhang1541733117576
Jitendra Malik151493165087
Trevor Darrell148678181113
Christopher D. Manning138499147595
Robert W. Heath128104973171
Pieter Abbeel12658970911
Yann LeCun121369171211
Li Fei-Fei120420145574
Jon Kleinberg11744487865
Sergey Levine11565259769
Richard Szeliski11335972019
Sanjeev Kumar113132554386
Bruce Neal10856187213
Larry S. Davis10769349714
Network Information
Related Institutions (5)
Google
39.8K papers, 2.1M citations

98% related

Microsoft
86.9K papers, 4.1M citations

96% related

Adobe Systems
8K papers, 214.7K citations

94% related

Carnegie Mellon University
104.3K papers, 5.9M citations

91% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
20241
202237
20211,738
20202,017
20191,607
20181,229