Open Access Proceedings Article

Deep Neural Networks for YouTube Recommendations

Paul Covington, +2 more
pp. 191–198
TL;DR
This paper details a deep candidate generation model and then describes a separate deep ranking model and provides practical lessons and insights derived from designing, iterating and maintaining a massive recommendation system with enormous user-facing impact.
Abstract
YouTube represents one of the largest scale and most sophisticated industrial recommendation systems in existence. In this paper, we describe the system at a high level and focus on the dramatic performance improvements brought by deep learning. The paper is split according to the classic two-stage information retrieval dichotomy: first, we detail a deep candidate generation model and then describe a separate deep ranking model. We also provide practical lessons and insights derived from designing, iterating and maintaining a massive recommendation system with enormous user-facing impact.
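
The two-stage split is the transferable idea here: a cheap candidate generator narrows millions of videos to a few hundred, and a heavier ranking model orders only those. Below is a minimal Python sketch of that shape; the random embeddings, the linear stand-in for the deep ranker, and all names are illustrative assumptions, not the paper's models.

    import numpy as np

    rng = np.random.default_rng(0)

    # Stage 1: candidate generation. Assume embeddings were already learned
    # (e.g., by a softmax model over watch histories); random placeholders here.
    NUM_VIDEOS, DIM = 100_000, 64
    video_emb = rng.normal(size=(NUM_VIDEOS, DIM)).astype(np.float32)

    def generate_candidates(user_emb, k=500):
        # Dot-product nearest neighbors; a production system would use an
        # approximate nearest-neighbor index instead of this full scan.
        scores = video_emb @ user_emb
        return np.argpartition(-scores, k)[:k]

    def rank(user_emb, candidates, w):
        # Stage 2: a separate model scores only the few hundred candidates,
        # so it can afford richer features. A linear scorer stands in for
        # the deep ranker.
        feats = np.concatenate(
            [video_emb[candidates], np.tile(user_emb, (len(candidates), 1))],
            axis=1)
        return candidates[np.argsort(-(feats @ w))]

    user_emb = rng.normal(size=DIM).astype(np.float32)
    w = rng.normal(size=2 * DIM).astype(np.float32)
    print(rank(user_emb, generate_candidates(user_emb), w)[:10])
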


Citations
Posted Content

Personalized Context-aware Re-ranking for E-commerce Recommender Systems

TL;DR: The proposed re-ranking model directly optimizes the whole recommendation list by employing a transformer structure to encode the information of all items in the list, and introduces a personalized embedding to model the differences in feature distributions across users.
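
As a rough illustration of the idea in this TL;DR, the sketch below encodes the whole candidate list with a transformer so items are scored jointly, with a user embedding attached to every list slot. This is a hedged PyTorch sketch under assumed dimensions and feature layout, not the paper's architecture.

    import torch
    import torch.nn as nn

    class ListwiseReRanker(nn.Module):
        """Re-scores an initially ranked list jointly, so each item's score
        can depend on the other items shown alongside it."""
        def __init__(self, item_dim=32, user_dim=16, d_model=48, nhead=4):
            super().__init__()
            self.proj = nn.Linear(item_dim + user_dim, d_model)
            layer = nn.TransformerEncoderLayer(d_model=d_model, nhead=nhead,
                                               batch_first=True)
            self.encoder = nn.TransformerEncoder(layer, num_layers=2)
            self.score = nn.Linear(d_model, 1)

        def forward(self, item_feats, user_emb):
            # item_feats: (batch, list_len, item_dim) from the initial ranker
            # user_emb:   (batch, user_dim), broadcast to every list slot
            user = user_emb.unsqueeze(1).expand(-1, item_feats.size(1), -1)
            x = self.proj(torch.cat([item_feats, user], dim=-1))
            return self.score(self.encoder(x)).squeeze(-1)  # (batch, list_len)

    model = ListwiseReRanker()
    scores = model(torch.randn(2, 10, 32), torch.randn(2, 16))
    print(scores.argsort(dim=-1, descending=True))  # re-ranked positions
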
Posted Content

CryptoRec: Privacy-preserving Recommendation as a Service

TL;DR: This paper proposes CryptoRec, a secure two-party computation protocol for Recommendation as a Service, which encompasses a novel recommender system and possesses two interesting properties: it models user-item interactions in an item-only latent feature space, in which personalized user representations are automatically captured by an aggregation of pre-learned item features.
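
The "item-only latent feature space" can be made concrete in a few lines: the user never gets a trained vector of their own; their representation is an aggregation of pre-learned item features. The rating-weighted mean below is an assumed aggregation chosen for illustration, not necessarily the one CryptoRec uses.

    import numpy as np

    rng = np.random.default_rng(1)
    NUM_ITEMS, DIM = 1_000, 16
    item_feats = rng.normal(size=(NUM_ITEMS, DIM))  # pre-learned item features

    def user_representation(rated_items, ratings):
        # No per-user parameters: the user is a rating-weighted mean of the
        # features of the items they interacted with.
        weights = ratings / ratings.sum()
        return weights @ item_feats[rated_items]

    def predict(rated_items, ratings, target_item):
        return user_representation(rated_items, ratings) @ item_feats[target_item]

    print(predict(np.array([3, 42, 7]), np.array([5.0, 3.0, 4.0]), 10))
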
Proceedings Article

Lambda Learner: Fast Incremental Learning on Data Streams

TL;DR: This paper proposes Lambda Learner, a new framework for training models via incremental updates in response to mini-batches from data streams, and provides a theoretical proof that the incremental updates improve the loss function over a stale batch model.
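
For intuition, the core loop is ordinary gradient descent applied to mini-batches as they arrive, starting from the stale batch model. The logistic-regression update below is a simplified stand-in chosen for illustration; Lambda Learner's actual update rule is more sophisticated, so treat this only as the shape of incremental learning on a stream.

    import numpy as np

    rng = np.random.default_rng(2)
    DIM = 10
    w = np.zeros(DIM)  # pretend this is the stale batch model's weight vector

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def incremental_update(w, X, y, lr=0.1, l2=1e-4):
        # One step on a fresh mini-batch: logistic-loss gradient plus L2.
        grad = X.T @ (sigmoid(X @ w) - y) / len(y) + l2 * w
        return w - lr * grad

    true_w = rng.normal(size=DIM)
    for _ in range(200):  # mini-batches arriving from the stream
        X = rng.normal(size=(32, DIM))
        y = (rng.random(32) < sigmoid(X @ true_w)).astype(float)
        w = incremental_update(w, X, y)
    print(np.round(w - true_w, 2))  # gap shrinks as updates accumulate
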
Journal Article

Revisiting Negative Sampling vs. Non-sampling in Implicit Recommendation

TL;DR: The role of negative sampling and non-sampling in implicit recommendation is analyzed, and the results empirically show that although negative sampling has been widely applied in recent recommendation models, it is non-trivial for uniform sampling methods to match the performance of non-sampling learning methods.
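
The contrast is easy to state in code: negative sampling touches one positive and a few sampled items per update, while non-sampling treats every non-interacted item as a down-weighted negative. The uniform sampler and the fixed negative weight below are illustrative assumptions, not the exact objectives compared in the paper.

    import numpy as np

    rng = np.random.default_rng(3)
    NUM_ITEMS, DIM = 200, 8
    U = rng.normal(size=(50, DIM)) * 0.1          # user factors
    V = rng.normal(size=(NUM_ITEMS, DIM)) * 0.1   # item factors

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def sampled_loss(u, i, num_neg=4):
        # Negative sampling: one positive plus a few uniformly drawn items.
        negs = rng.integers(NUM_ITEMS, size=num_neg)
        return (-np.log(sigmoid(U[u] @ V[i]))
                - np.log(sigmoid(-(V[negs] @ U[u]))).sum())

    def non_sampling_loss(u, i, neg_weight=0.1):
        # Non-sampling: every non-interacted item counts as a weighted
        # negative, so each update touches the whole catalog.
        scores = V @ U[u]
        mask = np.ones(NUM_ITEMS, dtype=bool)
        mask[i] = False
        return (-np.log(sigmoid(scores[i]))
                - neg_weight * np.log(sigmoid(-scores[mask])).sum())

    print(sampled_loss(0, 5), non_sampling_loss(0, 5))
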
Proceedings Article

A Dynamic Neural Network Model for Click-Through Rate Prediction in Real-Time Bidding

TL;DR: A dynamic CTR prediction model designed for the Samsung demand-side platform (DSP), built on a dynamic neural network that captures the dynamic evolution of both users and ads and integrates auxiliary data sources to better model users' preferences.
References
Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
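
The transform itself is small. The sketch below shows the training-time forward pass over one mini-batch; at inference, running averages of the batch statistics replace the per-batch mean and variance. The toy shapes are arbitrary.

    import numpy as np

    def batch_norm(x, gamma, beta, eps=1e-5):
        # Normalize each feature over the mini-batch, then let the network
        # rescale (gamma) and shift (beta) the result if that helps.
        mean = x.mean(axis=0)
        var = x.var(axis=0)
        x_hat = (x - mean) / np.sqrt(var + eps)
        return gamma * x_hat + beta

    rng = np.random.default_rng(4)
    x = rng.normal(loc=5.0, scale=3.0, size=(32, 8))  # one mini-batch
    out = batch_norm(x, gamma=np.ones(8), beta=np.zeros(8))
    print(out.mean(axis=0).round(6), out.std(axis=0).round(3))
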
Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: This paper presents a simple method for finding phrases in text, shows that learning good vector representations for millions of phrases is possible, and describes a simple alternative to the hierarchical softmax called negative sampling.
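
Negative sampling replaces the full (or hierarchical) softmax over the vocabulary with a binary discrimination between the observed context word and a few noise words. A minimal sketch, assuming uniform noise for simplicity (word2vec actually samples from a smoothed unigram distribution):

    import numpy as np

    rng = np.random.default_rng(5)
    VOCAB, DIM = 10_000, 50
    W_in = rng.normal(size=(VOCAB, DIM)) * 0.01   # center-word vectors
    W_out = rng.normal(size=(VOCAB, DIM)) * 0.01  # context-word vectors

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def neg_sampling_loss(center, context, num_neg=5):
        # Score the true context word against a handful of noise words
        # instead of normalizing over the whole vocabulary.
        noise = rng.integers(VOCAB, size=num_neg)
        pos = -np.log(sigmoid(W_out[context] @ W_in[center]))
        neg = -np.log(sigmoid(-(W_out[noise] @ W_in[center]))).sum()
        return pos + neg

    print(neg_sampling_loss(center=42, context=7))
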