Deep Neural Networks for YouTube Recommendations
Paul Covington, Jay Adams, Emre Sargin
- pp 191-198
TLDR
This paper details a deep candidate generation model, describes a separate deep ranking model, and provides practical lessons and insights derived from designing, iterating on, and maintaining a massive recommendation system with enormous user-facing impact.
Abstract:
YouTube represents one of the largest scale and most sophisticated industrial recommendation systems in existence. In this paper, we describe the system at a high level and focus on the dramatic performance improvements brought by deep learning. The paper is split according to the classic two-stage information retrieval dichotomy: first, we detail a deep candidate generation model and then describe a separate deep ranking model. We also provide practical lessons and insights derived from designing, iterating and maintaining a massive recommendation system with enormous user-facing impact.
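The two-stage split described in the abstract can be illustrated with a toy sketch. Everything here is an assumption made for illustration and is not taken from the paper's text: the dot-product retrieval in stage one, the function names (`generate_candidates`, `rank`, `recommend`), the freshness feature, and the toy data are all hypothetical. The point is only the shape of the pipeline: a cheap pass narrows the whole corpus to a short list, and a costlier scorer orders that short list.

```python
import heapq
from typing import Callable, Dict, List, Sequence

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def generate_candidates(user_vec: Vector,
                        item_vecs: Dict[str, Vector],
                        n: int) -> List[str]:
    """Stage 1 (hypothetical): keep the n items most similar to the user.

    Dot-product similarity is an assumption of this sketch.
    """
    return heapq.nlargest(
        n, item_vecs, key=lambda vid: dot(user_vec, item_vecs[vid])
    )


def rank(candidates: List[str],
         score: Callable[[str], float],
         k: int) -> List[str]:
    """Stage 2 (hypothetical): order the short list with a richer scorer."""
    return sorted(candidates, key=score, reverse=True)[:k]


def recommend(user_vec: Vector,
              item_vecs: Dict[str, Vector],
              score: Callable[[str], float],
              n: int = 3,
              k: int = 2) -> List[str]:
    """Run candidate generation, then ranking, and return the top k ids."""
    return rank(generate_candidates(user_vec, item_vecs, n), score, k)


# Toy data, invented for this example.
ITEMS = {"a": (1.0, 0.0), "b": (0.9, 0.1), "c": (0.0, 1.0), "d": (0.5, 0.5)}
USER = (1.0, 0.2)
FRESHNESS = {"a": 0.1, "b": 0.9, "c": 0.5, "d": 0.7}


def toy_score(vid: str) -> float:
    """Ranking score: similarity plus an extra (made-up) freshness feature."""
    return dot(USER, ITEMS[vid]) + FRESHNESS[vid]


if __name__ == "__main__":
    print(recommend(USER, ITEMS, toy_score))
```

In this toy run the ranking stage reorders the candidates because it sees a feature the retrieval stage ignores, which is the usual motivation for keeping the two models separate.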
Citations
Posted Content
Hybrid Sequential Recommender via Time-aware Attentive Memory Network.
TL;DR: In this article, a multi-hop time-aware attentive memory network (MTAM) was proposed to integrate long-term and short-term preferences for top-k recommendation, which can be viewed as a nonlinear generalization of latent factorization for dot-product based top-K recommendation.
Proceedings Article
Learning Compositional, Visual and Relational Representations for CTR Prediction in Sponsored Search
TL;DR: This paper proposes an approach to improve the accuracy of CTR prediction by learning supplementary representations from three new aspects: the compositional components, the visual appearance and the relational structure of ads.
Posted Content
FLEN: Leveraging Field for Scalable CTR Prediction.
TL;DR: This paper describes a novel Field-Leveraged Embedding Network (FLEN), which has been deployed in the commercial recommender system at Meitu and serves the main traffic. FLEN devises a field-wise bi-interaction pooling technique, and the paper shows that a variety of state-of-the-art CTR models can be expressed under this technique.
Posted Content
Energy-Based Sequence GANs for Recommendation and Their Connection to Imitation Learning
Jaeyoon Yoo, Heonseok Ha, Jihun Yi, J. Jon Ryu, Chanju Kim, Jung-Woo Ha, Young-Han Kim, Sungroh Yoon
TL;DR: Energy-based sequence generative adversarial nets are adopted for recommendation by learning a generative model for the time series of user-preferred items by recasting the energy function as the feature function.
Proceedings Article
Representation Learning-Assisted Click-Through Rate Prediction
TL;DR: DeepMCP models other types of relationships in order to learn more informative and statistically reliable feature representations and, in consequence, to improve the performance of click-through rate prediction.
References
Proceedings Article
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe, Christian Szegedy
TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
Proceedings Article
Distributed Representations of Words and Phrases and their Compositionality
TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.
Posted Content
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe, Christian Szegedy
TL;DR: Batch Normalization normalizes layer inputs for each training mini-batch to reduce the internal covariate shift in deep neural networks, and achieves state-of-the-art performance on ImageNet.
Posted Content
Distributed Representations of Words and Phrases and their Compositionality
TL;DR: In this paper, the Skip-gram model is used to learn high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships and improve both the quality of the vectors and the training speed.