Deep Neural Networks for YouTube Recommendations
Paul Covington,Jay Adams,Emre Sargin +2 more
- pp 191-198
Reads0
Chats0
TLDR
This paper details a deep candidate generation model and then describes a separate deep ranking model and provides practical lessons and insights derived from designing, iterating and maintaining a massive recommendation system with enormous user-facing impact.Abstract:
YouTube represents one of the largest scale and most sophisticated industrial recommendation systems in existence. In this paper, we describe the system at a high level and focus on the dramatic performance improvements brought by deep learning. The paper is split according to the classic two-stage information retrieval dichotomy: first, we detail a deep candidate generation model and then describe a separate deep ranking model. We also provide practical lessons and insights derived from designing, iterating and maintaining a massive recommendation system with enormous user-facing impact.read more
Citations
More filters
Posted Content
Do Offline Metrics Predict Online Performance in Recommender Systems
Karl Krauth,Sarah Dean,Alex Zhao,Wenshuo Guo,Mihaela Curmei,Benjamin Recht,Michael I. Jordan +6 more
TL;DR: This work investigates the extent to which offline metrics predict online performance by evaluating eleven recommenders across six controlled simulated environments and study the impact of adding exploration strategies, and observes that their effectiveness is highly dependent on the recommendation algorithm.
Journal ArticleDOI
Learning and Fusing Multiple User Interest Representations for Micro-Video and Movie Recommendations
TL;DR: This paper considers efficient representations of four aspects of user interest and proposes item-level representation, which is learned from and integrates the features of a user's historical items, and investigates neighbor-assisted representation, i.e. using neighboring users’ information to characterize user interest collaboratively.
Proceedings ArticleDOI
ATBRG: Adaptive Target-Behavior Relational Graph Network for Effective Recommendation
TL;DR: A new framework named Adaptive Target-Behavior Relational Graph network (ATBRG) is proposed to effectively capture structural relations of target user-item pairs over KG, and empirical results show that ATBRG consistently and significantly outperforms state-of-the-art methods.
Proceedings ArticleDOI
Top-N Recommendation with Counterfactual User Preference Simulation
TL;DR: Zhang et al. as mentioned in this paper propose to reformulate the recommendation task within the causal inference framework, which enables them to counterfactually simulate user ranking-based preferences to handle the data scarce problem.
Posted Content
Understanding Capacity-Driven Scale-Out Neural Recommendation Inference
Michael Lui,Yavuz Yetim,Özgür Özkan,Zhuoran Zhao,Shin-Yeh Tsai,Carole-Jean Wu,Mark Hempstead +6 more
TL;DR: This work specifically explores latency-bounded inference systems, compared to the throughput-oriented training systems of other recent works, and finds that the latency and compute overheads of distributed inference are largely attributed to a model's static embedding table distribution and sparsity of inference request inputs.
References
More filters
Proceedings Article
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe,Christian Szegedy +1 more
TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
Proceedings Article
Distributed Representations of Words and Phrases and their Compositionality
TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.
Posted Content
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe,Christian Szegedy +1 more
TL;DR: Batch Normalization as mentioned in this paper normalizes layer inputs for each training mini-batch to reduce the internal covariate shift in deep neural networks, and achieves state-of-the-art performance on ImageNet.
Posted Content
Distributed Representations of Words and Phrases and their Compositionality
TL;DR: In this paper, the Skip-gram model is used to learn high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships and improve both the quality of the vectors and the training speed.