Institution

Facebook

Company•Tel Aviv, Israel•

About: Facebook is a company organization based out in Tel Aviv, Israel. It is known for research contribution in the topics: Computer science & Artificial neural network. The organization has 7856 authors who have published 10906 publications receiving 570123 citations. The organization is also known as: facebook.com & FB.

...read moreread less

Topics: Computer science, Artificial neural network, Language model, Context (language use), Reinforcement learning ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Posted Content•

Distilling Knowledge from Reader to Retriever for Question Answering

[...]

Gautier Izacard¹, Edouard Grave¹•Institutions (1)

Facebook¹

08 Dec 2020-arXiv: Computation and Language

TL;DR: This paper proposes a technique to learn retriever models for downstream tasks, inspired by knowledge distillation, and which does not require annotated pairs of query and documents.

...read moreread less

Abstract: The task of information retrieval is an important component of many natural language processing systems, such as open domain question answering. While traditional methods were based on hand-crafted features, continuous representations based on neural networks recently obtained competitive results. A challenge of using such methods is to obtain supervised data to train the retriever model, corresponding to pairs of query and support documents. In this paper, we propose a technique to learn retriever models for downstream tasks, inspired by knowledge distillation, and which does not require annotated pairs of query and documents. Our approach leverages attention scores of a reader model, used to solve the task based on retrieved documents, to obtain synthetic labels for the retriever. We evaluate our method on question answering, obtaining state-of-the-art results.

...read moreread less

114 citations

Posted Content•

Phrase-based Image Captioning

[...]

Rémi Lebret¹, Rémi Lebret², Pedro O. Pinheiro¹, Pedro O. Pinheiro², Ronan Collobert³ - Show less +1 more•Institutions (3)

Idiap Research Institute¹, École Polytechnique Fédérale de Lausanne², Facebook³

12 Feb 2015-arXiv: Computation and Language

TL;DR: In this article, a purely bilinear model is trained to learn a metric between an image representation and phrases that are used to describe the image, and the model is then able to infer phrases from a given image sample.

...read moreread less

Abstract: Generating a novel textual description of an image is an interesting problem that connects computer vision and natural language processing. In this paper, we present a simple model that is able to generate descriptive sentences given a sample image. This model has a strong focus on the syntax of the descriptions. We train a purely bilinear model that learns a metric between an image representation (generated from a previously trained Convolutional Neural Network) and phrases that are used to described them. The system is then able to infer phrases from a given image sample. Based on caption syntax statistics, we propose a simple language model that can produce relevant descriptions for a given test image using the phrases inferred. Our approach, which is considerably simpler than state-of-the-art models, achieves comparable results in two popular datasets for the task: Flickr30k and the recently proposed Microsoft COCO.

...read moreread less

113 citations

Proceedings Article•DOI•

Flash storage disaggregation

[...]

Ana Klimovic¹, Christos Kozyrakis¹, Eno Thereska², Binu John³, Sanjeev Kumar³ - Show less +1 more•Institutions (3)

Stanford University¹, Imperial College London², Facebook³

18 Apr 2016

TL;DR: It is shown that Flash disaggregation allows scaling CPU and Flash resources independently in a cost effective manner through resource-efficient scale-out and is used to draw conclusions about data and control plane issues in remote storage.

...read moreread less

Abstract: PCIe-based Flash is commonly deployed to provide datacenter applications with high IO rates. However, its capacity and bandwidth are often underutilized as it is difficult to design servers with the right balance of CPU, memory and Flash resources over time and for multiple applications. This work examines Flash disaggregation as a way to deal with Flash overprovisioning. We tune remote access to Flash over commodity networks and analyze its impact on workloads sampled from real datacenter applications. We show that, while remote Flash access introduces a 20% throughput drop at the application level, disaggregation allows us to make up for these overheads through resource-efficient scale-out. Hence, we show that Flash disaggregation allows scaling CPU and Flash resources independently in a cost effective manner. We use our analysis to draw conclusions about data and control plane issues in remote storage.

...read moreread less

113 citations

Journal Article•DOI•

20 years of research on the Alcator C-Mod tokamak

[...]

Martin Greenwald¹, Aaron Bader², Seung Gyou Baek¹, Mohammad Reza Bakhtiari², Harold Barnard¹, W. Beck¹, W. Bergerson³, Igor Bespamyatnov⁴, P.T. Bonoli¹, D. L. Brower³, Dan Brunner¹, W. Burke¹, Jeff Candy⁵, M. Churchill⁶, Istvan Cziegler⁷, Ahmed Diallo⁶, Arturo Dominguez⁶, B. P. Duval⁸, E. Edlund⁶, Paul Ennever¹, D.R. Ernst¹, Ian Faust¹, C.L. Fiore¹, Thomas W. Fredian¹, Odd Erik Garcia⁹, C. Gao¹, John Goetz², Theodore Golfinopoulos¹, Robert Granetz¹, Olaf Grulke¹⁰, Z.S. Hartwig¹, S. Horne, Nathan Howard¹¹, Amanda Hubbard¹, Jerry Hughes¹, Ian H. Hutchinson¹, J. H. Irby¹, V.A. Izzo⁷, C.E. Kessel⁶, Brian LaBombard¹, Cornwall Lau¹², C. K. Li¹, Yu-Ming Lin¹, Bruce Lipschultz¹³, A. Loarte¹⁴, Earl Marmar¹, A. Mazurenko, G. McCracken¹⁵, R. M. McDermott¹⁰, Orso Meneghini⁵, D. R. Mikkelsen⁶, D. A. Mossessian, Robert Mumgaard¹, James Myra, E. Nelson-Melby¹⁶, R. Ochoukov¹⁷, G.M. Olynyk¹⁸, R.R. Parker¹, S. Pitcher¹⁴, Yuri Podpaly¹⁹, Miklos Porkolab¹, Matthew Reinke¹³, John Rice¹, W. L. Rowan⁴, Andrea Schmidt²⁰, S. D. Scott⁶, S. Shiraiwa¹, J. M. Sierchio¹, N. Smick, J. A. Snipes¹⁴, P. B. Snyder⁵, Brandon Sorbom¹, Joshua Stillerman¹, Choongki Sung¹, Yuichi Takase²¹, Vincent Tang²⁰, J.L. Terry¹, D. Terry¹, Christian Theiler⁸, A. Tronchin-James²², Naoto Tsujii²¹, R.F. Vieira¹, J.R. Walk¹, Gregory Wallace¹, Anne White¹, D.G. Whyte¹, James R. Wilson⁶, S.M. Wolfe¹, G.M. Wright¹, John Wright¹, S.J. Wukitch¹, Stewart Zweben⁶ - Show less +88 more•Institutions (22)

25 Nov 2014-Physics of Plasmas

TL;DR: The Alcator C-Mod tokamak as discussed by the authors is a high-field toroidal confinement device that uses high-power radio frequency (RF) waves for heating and current drive with innovative launching structures.

...read moreread less

Abstract: The object of this review is to summarize the achievements of research on the Alcator C-Mod tokamak [Hutchinson et al., Phys. Plasmas 1, 1511 (1994) and Marmar, Fusion Sci. Technol. 51, 261 (2007)] and to place that research in the context of the quest for practical fusion energy. C-Mod is a compact, high-field tokamak, whose unique design and operating parameters have produced a wealth of new and important results since it began operation in 1993, contributing data that extends tests of critical physical models into new parameter ranges and into new regimes. Using only high-power radio frequency (RF) waves for heating and current drive with innovative launching structures, C-Mod operates routinely at reactor level power densities and achieves plasma pressures higher than any other toroidal confinement device. C-Mod spearheaded the development of the vertical-target divertor and has always operated with high-Z metal plasma facing components—approaches subsequently adopted for ITER. C-Mod has made ground-breaking discoveries in divertor physics and plasma-material interactions at reactor-like power and particle fluxes and elucidated the critical role of cross-field transport in divertor operation, edge flows and the tokamak density limit. C-Mod developed the I-mode and the Enhanced Dα H-mode regimes, which have high performance without large edge localized modes and with pedestal transport self-regulated by short-wavelength electromagnetic waves. C-Mod has carried out pioneering studies of intrinsic rotation and demonstrated that self-generated flow shear can be strong enough in some cases to significantly modify transport. C-Mod made the first quantitative link between the pedestal temperature and the H-mode's performance, showing that the observed self-similar temperature profiles were consistent with critical-gradient-length theories and followed up with quantitative tests of nonlinear gyrokinetic models. RF research highlights include direct experimental observation of ion cyclotron range of frequency (ICRF) mode-conversion, ICRF flow drive, demonstration of lower-hybrid current drive at ITER-like densities and fields and, using a set of novel diagnostics, extensive validation of advanced RF codes. Disruption studies on C-Mod provided the first observation of non-axisymmetric halo currents and non-axisymmetric radiation in mitigated disruptions. A summary of important achievements and discoveries are included.

...read moreread less

113 citations

Proceedings Article•DOI•

Realtime Data Processing at Facebook

[...]

Guoqiang Jerry Chen¹, Janet L. Wiener¹, Shridhar Iyer¹, Anshul Jaiswal¹, Ran Lei¹, Nikhil Simha¹, Wei Wang¹, Kevin Wilfong¹, Tim Williamson¹, Serhat Yilmaz¹ - Show less +6 more•Institutions (1)

Facebook¹

14 Jun 2016

TL;DR: This paper identifies five important design decisions that affect their ease of use, performance, fault tolerance, scalability, and correctness in the realtime stream processing systems Puma, Swift, and Stylus and illustrates how these decisions and systems satisfy the requirements for multiple use cases at Facebook.

...read moreread less

Abstract: Realtime data processing powers many use cases at Facebook, including realtime reporting of the aggregated, anonymized voice of Facebook users, analytics for mobile applications, and insights for Facebook page administrators. Many companies have developed their own systems; we have a realtime data processing ecosystem at Facebook that handles hundreds of Gigabytes per second across hundreds of data pipelines. Many decisions must be made while designing a realtime stream processing system. In this paper, we identify five important design decisions that affect their ease of use, performance, fault tolerance, scalability, and correctness. We compare the alternative choices for each decision and contrast what we built at Facebook to other published systems. Our main decision was targeting seconds of latency, not milliseconds. Seconds is fast enough for all of the use cases we support and it allows us to use a persistent message bus for data transport. This data transport mechanism then paved the way for fault tolerance, scalability, and multiple options for correctness in our stream processing systems Puma, Swift, and Stylus. We then illustrate how our decisions and systems satisfy our requirements for multiple use cases at Facebook. Finally, we reflect on the lessons we learned as we built and operated these systems.

...read moreread less

113 citations

Collapse

Authors

Showing all 7875 results

Name	H-index	Papers	Citations
Yoshua Bengio	202	1033	420313
Xiang Zhang	154	1733	117576
Jitendra Malik	151	493	165087
Trevor Darrell	148	678	181113
Christopher D. Manning	138	499	147595
Robert W. Heath	128	1049	73171
Pieter Abbeel	126	589	70911
Yann LeCun	121	369	171211
Li Fei-Fei	120	420	145574
Jon Kleinberg	117	444	87865
Sergey Levine	115	652	59769
Richard Szeliski	113	359	72019
Sanjeev Kumar	113	1325	54386
Bruce Neal	108	561	87213
Larry S. Davis	107	693	49714

Network Information

Related Institutions (5)

Google

39.8K papers, 2.1M citations

98% related

Microsoft

86.9K papers, 4.1M citations

96% related

Adobe Systems

8K papers, 214.7K citations

94% related

Carnegie Mellon University

104.3K papers, 5.9M citations

38.6K papers, 1.3M citations

90% related

Performance

Metrics

10,939

Papers

851,954

Citations

No. of papers from the Institution in previous years
Year	Papers
2024	1
2022	37
2021	1,738
2020	2,017
2019	1,607
2018	1,229