Open Access · Proceedings Article (DOI)

Deep Neural Networks for YouTube Recommendations

Paul Covington, +2 more
pp. 191–198
TL;DR
This paper details a deep candidate generation model, then describes a separate deep ranking model, and provides practical lessons and insights derived from designing, iterating and maintaining a massive recommendation system with enormous user-facing impact.
Abstract
YouTube represents one of the largest scale and most sophisticated industrial recommendation systems in existence. In this paper, we describe the system at a high level and focus on the dramatic performance improvements brought by deep learning. The paper is split according to the classic two-stage information retrieval dichotomy: first, we detail a deep candidate generation model and then describe a separate deep ranking model. We also provide practical lessons and insights derived from designing, iterating and maintaining a massive recommendation system with enormous user-facing impact.
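
The two-stage split described above (a candidate generation model that narrows millions of videos to a few hundred, followed by a ranking model that orders them) can be sketched in a few lines. The sketch below is a minimal illustration only; the dot-product retrieval, the tiny MLP ranker, and all dimensions are assumptions for exposition, not the paper's actual architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy corpus: item embeddings as a candidate-generation network might learn them
# (in the paper these come from a softmax trained over millions of videos).
NUM_ITEMS, DIM = 10_000, 32
item_emb = rng.normal(size=(NUM_ITEMS, DIM))

def candidate_generation(user_emb, k=200):
    """Stage 1: retrieve the top-k items by dot product with the user vector
    (at serving time an approximate nearest-neighbor index would do this)."""
    scores = item_emb @ user_emb
    return np.argsort(-scores)[:k]

def rank(user_emb, candidate_ids):
    """Stage 2: score only the retrieved candidates with a richer model.
    A tiny hypothetical MLP stands in for the deep ranking network here."""
    feats = np.concatenate(
        [item_emb[candidate_ids], np.tile(user_emb, (len(candidate_ids), 1))], axis=1
    )
    w1 = rng.normal(size=(feats.shape[1], 16))
    w2 = rng.normal(size=(16, 1))
    scores = np.maximum(feats @ w1, 0) @ w2          # ReLU -> linear head
    order = np.argsort(-scores.ravel())
    return candidate_ids[order]

user_emb = rng.normal(size=DIM)
candidates = candidate_generation(user_emb)
print("top 5 ranked items:", rank(user_emb, candidates)[:5])
```
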


Citations
Journal Article (DOI)

McDRAM v2: In-Dynamic Random Access Memory Systolic Array Accelerator to Address the Large Model Problem in Deep Neural Networks on the Edge

TL;DR: The proposed McDRAM v2 is a novel in-dynamic-random-access-memory (DRAM) systolic array accelerator architecture that can handle large DNN models without off-chip memory accesses, in a fast and efficient manner, by exposing the large DRAM capacity and large in-DRAM bandwidth directly to the input systolic array of a processing-element matrix.
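
As a rough software analogy for the dataflow such an accelerator relies on (not the McDRAM v2 hardware or its DRAM integration), a systolic array of processing elements can be emulated cycle by cycle; the output-stationary schedule and toy sizes below are assumptions for illustration.

```python
import numpy as np

def systolic_matmul(A, B):
    """Cycle-level toy of an output-stationary systolic array computing A @ B.
    Each processing element (i, j) holds one accumulator; A streams in from the
    left, B from the top, each skewed by one cycle per row/column."""
    n, k = A.shape
    k2, m = B.shape
    assert k == k2
    C = np.zeros((n, m))
    total_cycles = n + m + k - 2            # enough cycles to drain the pipeline
    for cycle in range(total_cycles):
        for i in range(n):
            for j in range(m):
                t = cycle - i - j           # which element of the reduction
                if 0 <= t < k:              # reaches PE (i, j) on this cycle
                    C[i, j] += A[i, t] * B[t, j]
    return C

A = np.arange(6).reshape(2, 3).astype(float)
B = np.arange(12).reshape(3, 4).astype(float)
assert np.allclose(systolic_matmul(A, B), A @ B)
print(systolic_matmul(A, B))
```
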
Posted Content

One Person, One Model, One World: Learning Continual User Representation without Forgetting

TL;DR: This paper presents Conure, the first continual (lifelong) user representation learner, and proposes iteratively removing less important weights of old tasks in a deep user representation model, motivated by the fact that neural network models are usually over-parameterized.
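
The mechanism summarized above (freeing capacity in an over-parameterized network by pruning the least important weights of earlier tasks so later tasks can train in the freed positions without forgetting) can be illustrated with simple magnitude pruning; the keep ratio, the quantile threshold, and the per-task mask below are illustrative assumptions rather than Conure's exact procedure.

```python
import numpy as np

rng = np.random.default_rng(0)

def prune_for_next_task(weights, keep_ratio=0.5):
    """Keep only the largest-magnitude weights of the current task;
    the zeroed positions become free capacity for the next task."""
    threshold = np.quantile(np.abs(weights).ravel(), 1.0 - keep_ratio)
    mask = np.abs(weights) >= threshold      # True where a weight survives
    return weights * mask, mask

W_task1 = rng.normal(size=(8, 8))
W_pruned, mask = prune_for_next_task(W_task1, keep_ratio=0.3)
print("weights kept for task 1:", int(mask.sum()), "of", mask.size)
# Later tasks would update only positions where mask == False,
# so the behavior learned for task 1 is preserved (no forgetting).
```
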
Journal Article (DOI)

Predicting yield performance of parents in plant breeding: A neural collaborative filtering approach

TL;DR: A collaborative filtering method, an ensemble of a matrix factorization model and a neural network, for identifying the best parent combinations for crossing; the results suggest that the proposed model significantly outperforms other models such as deep factorization machines (DeepFM) and neural networks.
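
The kind of ensemble described (a matrix-factorization term plus a neural network over the two parent embeddings) can be sketched as a single forward pass; the embedding sizes, the concatenation, and the one hidden layer are assumptions for illustration, not the paper's exact model.

```python
import numpy as np

rng = np.random.default_rng(0)
N_PARENTS, DIM, HIDDEN = 100, 8, 16

# One embedding table per parent role in a cross.
emb_a = rng.normal(size=(N_PARENTS, DIM))
emb_b = rng.normal(size=(N_PARENTS, DIM))
W1 = rng.normal(size=(2 * DIM, HIDDEN))
W2 = rng.normal(size=HIDDEN)

def predict_yield(i, j):
    """Predicted yield of crossing parent i with parent j:
    matrix-factorization dot product + MLP over concatenated embeddings."""
    mf_term = emb_a[i] @ emb_b[j]
    h = np.maximum(np.concatenate([emb_a[i], emb_b[j]]) @ W1, 0)   # ReLU
    mlp_term = h @ W2
    return mf_term + mlp_term

print("predicted yield for cross (3, 7):", round(predict_yield(3, 7), 3))
```
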
Proceedings Article (DOI)

Contrastive Learning for Debiased Candidate Generation in Large-Scale Recommender Systems

TL;DR: The authors theoretically prove that the contrastive loss is equivalent to reducing exposure bias via inverse propensity weighting, which provides a new perspective for understanding the effectiveness of contrastive learning.
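
One hedged way to picture the stated equivalence is a sampled-softmax / InfoNCE-style loss whose logits are corrected by each item's estimated exposure propensity, so frequently exposed items are down-weighted; the popularity-style propensity estimate and the log correction below are illustrative assumptions, not the paper's derivation.

```python
import numpy as np

rng = np.random.default_rng(0)

def debiased_contrastive_loss(user_vec, pos_item, neg_items, item_emb, propensity):
    """InfoNCE-style loss where each item's logit is corrected by its
    (estimated) exposure propensity: subtracting log q(i) down-weights
    over-exposed items, analogous to inverse propensity weighting."""
    items = np.concatenate([[pos_item], neg_items])
    logits = item_emb[items] @ user_vec - np.log(propensity[items])
    logits -= logits.max()                       # numerical stability
    probs = np.exp(logits) / np.exp(logits).sum()
    return -np.log(probs[0])                     # positive item sits at index 0

NUM_ITEMS, DIM = 1_000, 16
item_emb = rng.normal(size=(NUM_ITEMS, DIM))
propensity = rng.uniform(0.01, 1.0, size=NUM_ITEMS)   # e.g. item exposure rate
user_vec = rng.normal(size=DIM)
negatives = rng.integers(0, NUM_ITEMS, 20)
loss = debiased_contrastive_loss(user_vec, 42, negatives, item_emb, propensity)
print("debiased contrastive loss:", round(float(loss), 3))
```
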
Journal Article (DOI)

Near-Memory Processing in Action: Accelerating Personalized Recommendation with AxDIMM

TL;DR: This work develops a scalable, practical DIMM-based near-memory processing (NMP) solution tailored to accelerating the inference serving of personalized recommendation systems, and experimentally validates the performance of a two-rank AxDIMM prototype using an industry-representative recommendation framework.
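
The workload such DIMM-based near-memory processing typically targets is the memory-bound embedding gather-and-pool at the front of recommendation models; the table size, index lists, and sum pooling below are a host-side reference sketch of that operation under assumed shapes, not AxDIMM's programming interface.

```python
import numpy as np

rng = np.random.default_rng(0)

# A large embedding table; in the real system it would reside in (Ax)DIMM memory.
NUM_ROWS, DIM = 100_000, 64
table = rng.normal(size=(NUM_ROWS, DIM)).astype(np.float32)

def sparse_length_sum(indices):
    """Gather a handful of embedding rows and sum-pool them.
    Bandwidth-bound and independent per lookup, which is why it is a natural
    target for processing near the memory ranks instead of on the host CPU."""
    return table[indices].sum(axis=0)

batch_lookups = [rng.integers(0, NUM_ROWS, size=30) for _ in range(8)]
pooled = np.stack([sparse_length_sum(idx) for idx in batch_lookups])
print("pooled batch shape:", pooled.shape)   # (8, 64)
```
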
References

Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
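
For reference, the per-feature transform that batch normalization applies to a training mini-batch is only a few lines; the epsilon value and the identity initialization of the learned scale and shift below follow common defaults rather than any specific framework.

```python
import numpy as np

def batch_norm(x, gamma, beta, eps=1e-5):
    """Normalize each feature over the mini-batch, then scale and shift.
    x: (batch, features); gamma, beta: learned per-feature parameters."""
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mean) / np.sqrt(var + eps)
    return gamma * x_hat + beta

x = np.random.default_rng(0).normal(loc=5.0, scale=3.0, size=(4, 3))
out = batch_norm(x, gamma=np.ones(3), beta=np.zeros(3))
print(out.mean(axis=0).round(6), out.std(axis=0).round(3))   # ~0 mean, ~1 std
```
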
Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: This paper presents a simple method for finding phrases in text, shows that learning good vector representations for millions of phrases is possible, and describes a simple alternative to the hierarchical softmax called negative sampling.
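
Both ingredients mentioned here are compact enough to write out: the skip-gram negative-sampling objective and the bigram score used to merge frequent co-occurrences into phrases. The sampled vectors, counts, and discount value below are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def negative_sampling_loss(v_center, v_context, v_negatives):
    """Skip-gram with negative sampling: pull the true (center, context) pair
    together and push k sampled 'noise' words apart, instead of a full softmax."""
    pos = np.log(sigmoid(v_context @ v_center))
    neg = np.log(sigmoid(-(v_negatives @ v_center))).sum()
    return -(pos + neg)

def phrase_score(count_ab, count_a, count_b, delta=5.0):
    """Bigram score used to merge frequent co-occurrences into phrases
    (e.g. 'new_york'); pairs scoring above a threshold become one token."""
    return (count_ab - delta) / (count_a * count_b)

DIM = 16
loss = negative_sampling_loss(rng.normal(size=DIM), rng.normal(size=DIM),
                              rng.normal(size=(5, DIM)))
print("negative-sampling loss:", round(float(loss), 3))
print("phrase score('new', 'york'):", phrase_score(800, 2000, 1200))
```
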
Posted Content

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

TL;DR: Batch Normalization normalizes layer inputs for each training mini-batch to reduce internal covariate shift in deep neural networks, and achieves state-of-the-art performance on ImageNet.
Posted Content

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: The Skip-gram model is used to learn high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships; the paper's extensions improve both the quality of the vectors and the training speed.