Open Access · Proceedings ArticleDOI

Deep Neural Networks for YouTube Recommendations

Paul Covington, +2 more
pp. 191-198
TL;DR: This paper details a deep candidate generation model, describes a separate deep ranking model, and provides practical lessons and insights derived from designing, iterating on, and maintaining a massive recommendation system with enormous user-facing impact.
Abstract
YouTube represents one of the largest scale and most sophisticated industrial recommendation systems in existence. In this paper, we describe the system at a high level and focus on the dramatic performance improvements brought by deep learning. The paper is split according to the classic two-stage information retrieval dichotomy: first, we detail a deep candidate generation model and then describe a separate deep ranking model. We also provide practical lessons and insights derived from designing, iterating and maintaining a massive recommendation system with enormous user-facing impact.
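The two-stage dichotomy the abstract describes can be sketched as follows. This is a minimal illustration, not the paper's actual models: the dot-product scoring, embedding sizes, and function names are assumptions, and the ranking stage here is a trivial stand-in for the paper's separate deep ranking network.

```python
import numpy as np

def candidate_generation(user_vec, item_vecs, k=100):
    """Stage 1: retrieve the top-k candidates from the full corpus by
    dot-product similarity (a stand-in for the candidate model)."""
    scores = item_vecs @ user_vec
    return np.argsort(-scores)[:k]

def rank(user_vec, item_vecs, candidates, n=10):
    """Stage 2: re-score only the small candidate set with a (here
    trivial) ranking function and return the top-n to surface."""
    scores = item_vecs[candidates] @ user_vec
    return candidates[np.argsort(-scores)][:n]

rng = np.random.default_rng(0)
items = rng.normal(size=(100_000, 16))  # one row per video embedding
user = rng.normal(size=16)              # user embedding

cands = candidate_generation(user, items, k=100)
top = rank(user, items, cands, n=10)
```

The point of the split is that stage 1 only needs high recall over a huge corpus, while stage 2 can afford a much richer model because it scores only a few hundred items.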


Citations
Journal ArticleDOI

MGAT: Multimodal Graph Attention Network for Recommendation

TL;DR: A new Multimodal Graph Attention Network (MGAT) is proposed that disentangles personal interests at the granularity of modality, capturing more complex interaction patterns hidden in user behaviors and providing more accurate recommendations.
Proceedings ArticleDOI

Fake Co-visitation Injection Attacks to Recommender Systems

TL;DR: New attacks on recommender systems are proposed that can spoof a system into making the recommendations an attacker desires; the attacks are modeled as constrained linear optimization problems whose solutions let the attacker inject fake co-visitations with maximal effect.
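A minimal sketch of the co-visitation signal such attacks target, assuming a simple session-based pair counter; the `covisitation_counts` helper and the toy sessions are hypothetical, not from the paper:

```python
from collections import defaultdict
from itertools import combinations

def covisitation_counts(sessions):
    """Co-visitation recommenders count how often item pairs appear in
    the same session; 'viewers of X also viewed Y' is then the top
    co-visited neighbor of X."""
    counts = defaultdict(int)
    for s in sessions:
        for a, b in combinations(set(s), 2):
            counts[frozenset((a, b))] += 1
    return counts

organic = [["x", "a"], ["x", "a"], ["x", "b"]]
# Injection sketch: fake sessions pairing a target item "t" with the
# popular item "x" inflate their pair count until "t" outranks the
# organically co-visited items.
attacked = organic + [["x", "t"]] * 3
counts = covisitation_counts(attacked)
```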
Proceedings ArticleDOI

SlateQ: A Tractable Decomposition for Reinforcement Learning with Recommendation Sets

TL;DR: SlateQ is developed, a decomposition of value-based temporal-difference and Q-learning that renders RL tractable with slates by showing that the long-term value of a slate can be decomposed into a tractable function of its component item-wise LTVs.
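The decomposition described above can be sketched as follows, assuming a conditional-logit user choice model with a no-click option; the affinity and LTV numbers are illustrative, not from the paper:

```python
import numpy as np

def slate_q_value(item_ltvs, item_affinities):
    """SlateQ-style decomposition (sketch): the long-term value of a
    slate is the sum of item-wise LTVs weighted by the probability the
    user consumes each item, here a softmax over item affinities with
    a 'no click' option of affinity 0."""
    logits = np.append(item_affinities, 0.0)   # last entry = no click
    p = np.exp(logits) / np.exp(logits).sum()  # choice probabilities
    return float(np.dot(p[:-1], item_ltvs))

q = slate_q_value(item_ltvs=np.array([3.0, 1.0]),
                  item_affinities=np.array([0.5, 0.2]))
```

Because the slate value is a simple weighted sum of item-wise values, the learning problem reduces from the combinatorial space of slates to per-item value estimates.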
Proceedings Article

Learning Disentangled Representations for Recommendation

TL;DR: In this article, the authors present the MACRo-mIcro Disentangled Variational Auto-Encoder (MacridVAE) for learning disentangled representations from user behavior by inferring high-level concepts associated with user intentions (e.g., to buy a shirt or a cellphone), while capturing the preference of a user regarding the different concepts separately.
Book ChapterDOI

Recommender systems for health informatics: state-of-the-art and future perspectives

TL;DR: This work provides a three-part research framework to assess health recommender systems, suggesting the incorporation of domain understanding, evaluation, and specific methodology into the development process.
References
Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
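The training-mode transformation can be sketched as follows; this is a minimal NumPy version that omits the running-statistics tracking used at inference time:

```python
import numpy as np

def batch_norm(x, gamma, beta, eps=1e-5):
    """Batch Normalization (training mode, sketch): normalize each
    feature over the mini-batch, then scale and shift with the learned
    parameters gamma and beta."""
    mu = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mu) / np.sqrt(var + eps)
    return gamma * x_hat + beta

x = np.random.default_rng(1).normal(loc=5.0, scale=3.0, size=(64, 8))
y = batch_norm(x, gamma=np.ones(8), beta=np.zeros(8))
```

With gamma = 1 and beta = 0, each output feature has (approximately) zero mean and unit variance over the batch, which is what reduces the internal covariate shift the title refers to.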
Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.