Open Access · Proceedings ArticleDOI

Deep Neural Networks for YouTube Recommendations

Paul Covington, Jay Adams, Emre Sargin
- RecSys 2016, pp. 191-198
TLDR
This paper details a deep candidate generation model, then describes a separate deep ranking model, and provides practical lessons and insights derived from designing, iterating on, and maintaining a massive recommendation system with enormous user-facing impact.
Abstract: 
YouTube represents one of the largest scale and most sophisticated industrial recommendation systems in existence. In this paper, we describe the system at a high level and focus on the dramatic performance improvements brought by deep learning. The paper is split according to the classic two-stage information retrieval dichotomy: first, we detail a deep candidate generation model and then describe a separate deep ranking model. We also provide practical lessons and insights derived from designing, iterating and maintaining a massive recommendation system with enormous user-facing impact.
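
As a rough sketch of the two-stage dichotomy the abstract describes: candidate generation winnows millions of videos to a few hundred, and a separate ranking model orders that small set. Everything below (the embedding sizes, the dot-product scorer, the function names) is an illustrative assumption, not the production system.

```python
import numpy as np

# Illustrative stand-ins for learned embeddings; shapes and scoring are assumptions.
rng = np.random.default_rng(0)
VIDEO_EMB = rng.normal(size=(10_000, 64))            # one row per video
VIDEO_EMB /= np.linalg.norm(VIDEO_EMB, axis=1, keepdims=True)

def generate_candidates(user_emb, k=100):
    """Stage 1 (candidate generation): reduce the full corpus to k videos,
    here via a brute-force dot-product top-k standing in for an approximate
    nearest-neighbor lookup."""
    scores = VIDEO_EMB @ user_emb
    return np.argpartition(-scores, k)[:k]

def rank(user_emb, candidates):
    """Stage 2 (ranking): re-score only the candidates with a richer model
    (here the same dot product, for brevity) and sort best-first."""
    scores = VIDEO_EMB[candidates] @ user_emb
    return candidates[np.argsort(-scores)]

user_emb = rng.normal(size=64)
print(rank(user_emb, generate_candidates(user_emb))[:10])
```

The point of the split is cost: the cheap first stage touches every item, while the expensive second stage touches only the survivors.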


Citations
Proceedings ArticleDOI

Aspect-Aware Latent Factor Model: Rating Prediction with Ratings and Reviews

TL;DR: This paper applies a proposed aspect-aware topic model (ATM) to review text to model user preferences and item features from different aspects, estimates the aspect importance of a user towards an item, and introduces a weighted matrix that associates those latent factors with the same set of aspects discovered by ATM.
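
A minimal sketch of the prediction step this TL;DR describes, with made-up values standing in for quantities the model would learn; the variable names are illustrative, not the paper's notation:

```python
import numpy as np

rng = np.random.default_rng(1)
n_aspects, n_factors = 5, 16

# Hypothetical learned quantities for one user-item pair.
importance = rng.dirichlet(np.ones(n_aspects))   # aspect importance, sums to 1
W = rng.uniform(size=(n_aspects, n_factors))     # weights tying latent factors to aspects
p_u = rng.normal(size=n_factors)                 # user latent factors
q_i = rng.normal(size=n_factors)                 # item latent factors

# Each aspect's score gates the factor interaction p_u * q_i through the
# weight matrix; the overall rating combines aspect scores by importance.
aspect_scores = W @ (p_u * q_i)
r_hat = float(importance @ aspect_scores)
print(round(r_hat, 3))
```
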
Proceedings ArticleDOI

Recommending what video to watch next: a multitask ranking system

TL;DR: This paper introduces a large-scale multi-objective ranking system for recommending what video to watch next on an industrial video-sharing platform, and explores a variety of soft-parameter-sharing techniques such as Multi-gate Mixture-of-Experts to efficiently optimize for multiple ranking objectives.
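
A tiny sketch of the Multi-gate Mixture-of-Experts idea named above, with toy dimensions and random weights standing in for trained parameters:

```python
import numpy as np

rng = np.random.default_rng(2)
d_in, d_expert, n_experts, n_tasks = 32, 16, 4, 2

# Hypothetical parameters for a tiny MMoE layer.
expert_W = rng.normal(size=(n_experts, d_in, d_expert)) * 0.1
gate_W = rng.normal(size=(n_tasks, d_in, n_experts)) * 0.1

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def mmoe(x):
    """All tasks share the expert networks; each task's own gate produces
    the mixture weights, giving soft parameter sharing across objectives."""
    experts = np.stack([np.tanh(x @ expert_W[e]) for e in range(n_experts)])
    return [softmax(x @ gate_W[t]) @ experts for t in range(n_tasks)]

outputs = mmoe(rng.normal(size=d_in))
print([o.shape for o in outputs])   # one representation per ranking objective
```
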
Posted Content

MOReL : Model-Based Offline Reinforcement Learning

TL;DR: Theoretically, it is shown that MOReL is minimax optimal (up to log factors) for offline RL; experimentally, it matches or exceeds state-of-the-art results on widely studied offline RL benchmarks.
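
The TL;DR reports the guarantees; the construction behind them, per the MOReL paper, is a pessimistic MDP built from a learned dynamics-model ensemble. Below is a minimal sketch of that idea with toy models; the threshold and penalty values are assumptions:

```python
import numpy as np

rng = np.random.default_rng(3)

# Toy ensemble of learned dynamics models (near-identity linear maps).
ensemble = [np.eye(4) + 0.05 * rng.normal(size=(4, 4)) for _ in range(5)]
DISAGREEMENT_THRESHOLD = 0.1   # assumed; tuned in practice
HALT_PENALTY = -100.0          # large negative reward for unknown regions

def pessimistic_step(state):
    """One step of the pessimistic MDP: where the ensemble disagrees, the
    state is treated as unknown and the episode halts with a penalty,
    discouraging the planner from exploiting model error."""
    preds = np.stack([m @ state for m in ensemble])
    if preds.std(axis=0).max() > DISAGREEMENT_THRESHOLD:
        return None, HALT_PENALTY            # absorbing HALT state
    return preds.mean(axis=0), 0.0           # reward model omitted for brevity

print(pessimistic_step(np.ones(4)))
```
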
Journal ArticleDOI

Spatial-Aware Hierarchical Collaborative Deep Learning for POI Recommendation

TL;DR: The extensive experimental analysis shows that the proposed Spatial-Aware Hierarchical Collaborative Deep Learning model outperforms the state-of-the-art recommendation models, especially in out-of-town and cold-start recommendation scenarios.
Posted Content

Product-based Neural Networks for User Response Prediction

TL;DR: Product-based Neural Networks (PNN), as described in this paper, use an embedding layer to learn a distributed representation of the categorical data, a product layer to capture interactive patterns between inter-field categories, and further fully connected layers to explore high-order feature interactions.
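
A short sketch of the inner-product variant of PNN's product layer, with random vectors standing in for the learned field embeddings:

```python
import numpy as np

rng = np.random.default_rng(4)
n_fields, d_emb = 4, 8

# Hypothetical embedded categorical fields for one example; in PNN these
# come from a learned embedding layer over the raw categorical features.
fields = rng.normal(size=(n_fields, d_emb))

# Product layer: pairwise dot products between field embeddings expose
# second-order inter-field interactions.
pairs = [(i, j) for i in range(n_fields) for j in range(i + 1, n_fields)]
products = np.array([fields[i] @ fields[j] for i, j in pairs])

# First-order (flattened embeddings) and second-order (products) signals are
# concatenated and fed to fully connected layers for higher-order interactions.
hidden_in = np.concatenate([fields.ravel(), products])
print(hidden_in.shape)   # (n_fields*d_emb + n_fields*(n_fields-1)//2,)
```
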
References

Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
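
A minimal, self-contained sketch of the batch normalization transform the TL;DR refers to (training-time statistics only; a full layer would also track running statistics for inference):

```python
import numpy as np

def batch_norm(x, gamma=1.0, beta=0.0, eps=1e-5):
    """Training-time batch normalization: standardize each feature using the
    mini-batch mean and variance, then apply the learned scale (gamma) and
    shift (beta)."""
    mu = x.mean(axis=0)
    var = x.var(axis=0)
    return gamma * (x - mu) / np.sqrt(var + eps) + beta

batch = np.random.default_rng(5).normal(loc=3.0, scale=2.0, size=(32, 10))
normed = batch_norm(batch)
print(normed.mean(axis=0).round(6).max(), normed.std(axis=0).round(3).max())
```
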
Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: This paper presents a simple method for finding phrases in text, shows that learning good vector representations for millions of phrases is possible, and describes a simple alternative to the hierarchical softmax called negative sampling.
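
A small sketch of the negative-sampling objective for one training pair, with random vectors standing in for word embeddings:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def negative_sampling_loss(center, context, negatives):
    """Skip-gram with negative sampling for one (center, context) pair:
    raise the true pair's score and lower the scores of k sampled noise
    words, avoiding a softmax over the entire vocabulary."""
    pos = np.log(sigmoid(center @ context))
    neg = sum(np.log(sigmoid(-center @ n)) for n in negatives)
    return -(pos + neg)

rng = np.random.default_rng(6)
d = 16
center, context = rng.normal(size=d), rng.normal(size=d)
noise = [rng.normal(size=d) for _ in range(5)]    # k = 5 negative samples
print(round(negative_sampling_loss(center, context, noise), 3))
```
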
Posted Content

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy
- 11 Feb 2015
TL;DR: Batch Normalization, as described in this paper, normalizes layer inputs for each training mini-batch to reduce internal covariate shift in deep neural networks, and achieves state-of-the-art performance on ImageNet.
Posted Content

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: In this paper, the Skip-gram model is used to learn high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships, together with extensions that improve both the quality of the vectors and the training speed.