Open Access Proceedings Article DOI

Deep Neural Networks for YouTube Recommendations

Paul Covington, Jay Adams, Emre Sargin
pp. 191-198 (RecSys 2016)
TL;DR: This paper details a deep candidate generation model, describes a separate deep ranking model, and provides practical lessons and insights derived from designing, iterating on, and maintaining a massive recommendation system with enormous user-facing impact.
Abstract
YouTube represents one of the largest scale and most sophisticated industrial recommendation systems in existence. In this paper, we describe the system at a high level and focus on the dramatic performance improvements brought by deep learning. The paper is split according to the classic two-stage information retrieval dichotomy: first, we detail a deep candidate generation model and then describe a separate deep ranking model. We also provide practical lessons and insights derived from designing, iterating and maintaining a massive recommendation system with enormous user-facing impact.
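
The two-stage dichotomy is the heart of the design: candidate generation cheaply narrows millions of videos down to a few hundred, and a ranking model then scores only those survivors with richer features. Below is a minimal numpy sketch of that split; the embeddings, sizes, and scoring functions are invented placeholders, not the paper's actual networks.

import numpy as np

rng = np.random.default_rng(0)

# Toy corpus: learned item embeddings and one user embedding
# (in the paper these come from a deep candidate generation network).
n_items, dim = 10_000, 32
item_emb = rng.normal(size=(n_items, dim)).astype(np.float32)
user_emb = rng.normal(size=dim).astype(np.float32)

# Stage 1: candidate generation. Nearest neighbours in embedding
# space keep only a few hundred of the many possible items.
scores = item_emb @ user_emb
candidates = np.argsort(-scores)[:200]

# Stage 2: ranking. A richer per-candidate score using extra
# features; a real system would run a deep ranking network here.
extra_features = rng.normal(size=n_items).astype(np.float32)
rank_scores = scores[candidates] + 0.1 * extra_features[candidates]
top_k = candidates[np.argsort(-rank_scores)[:10]]
print(top_k)
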


Citations
Proceedings Article DOI

Adversarial Feature Translation for Multi-domain Recommendation

TL;DR: This paper proposes a novel adversarial feature translation (AFT) model that improves all recommendation domains simultaneously; the key point is capturing informative domain-specific features from all domains.
Posted Content

Zero-Shot Knowledge Distillation from a Decision-Based Black-Box Model

Zi Wang
07 Jun 2021
TL;DR: The concept of decision-based black-box (DB3) knowledge distillation is proposed, in which the student is trained by distilling knowledge from a black-box teacher (whose parameters are not accessible) that returns only classes rather than softmax outputs.
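
The defining constraint is that the teacher exposes only a class decision per query: no logits, no softmax scores, no gradients. The actual DB3 method goes further and recovers soft-label information from the teacher's decision boundaries; the PyTorch sketch below shows only the basic setting, a student trained directly on the black-box teacher's hard decisions (all networks and sizes are invented for illustration).

import torch
import torch.nn as nn

# Stand-in black-box teacher: we may query it, but it returns only
# a class index, never logits, softmax outputs, or gradients.
teacher = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 5))

def query_teacher(x):
    with torch.no_grad():
        return teacher(x).argmax(dim=1)  # decisions only

student = nn.Sequential(nn.Linear(20, 32), nn.ReLU(), nn.Linear(32, 5))
opt = torch.optim.Adam(student.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

for step in range(200):
    x = torch.randn(64, 20)      # transfer-set queries
    y = query_teacher(x)         # hard labels from the black box
    loss = loss_fn(student(x), y)
    opt.zero_grad()
    loss.backward()
    opt.step()
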
Journal Article DOI

Modeling Embedding Dimension Correlations via Convolutional Neural Collaborative Filtering

TL;DR: This work proposes a novel and general neural collaborative filtering framework, ConvNCF, featuring two designs: (1) applying an outer product to the user embedding and item embedding to explicitly model pairwise correlations between embedding dimensions, and (2) employing a convolutional neural network above the outer product to learn high-order correlations among embedding dimensions.
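
Concretely, the two designs amount to building a dim x dim "interaction map" from the outer product of the user and item embeddings, then stacking 2-D convolutions on top of it. A minimal PyTorch sketch under assumed sizes (an illustration of the idea, not the authors' exact architecture):

import torch
import torch.nn as nn

class ConvNCFSketch(nn.Module):
    def __init__(self, n_users, n_items, dim=64):
        super().__init__()
        self.user_emb = nn.Embedding(n_users, dim)
        self.item_emb = nn.Embedding(n_items, dim)
        # Small CNN over the dim x dim interaction map; each layer can
        # capture correlations among larger groups of dimensions.
        self.cnn = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=2, stride=2), nn.ReLU(),
            nn.Conv2d(32, 32, kernel_size=2, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.out = nn.Linear(32, 1)

    def forward(self, u, i):
        p, q = self.user_emb(u), self.item_emb(i)
        # Outer product: a (batch, dim, dim) map of pairwise dimension
        # correlations, treated as a one-channel "image".
        interaction = torch.einsum("bd,be->bde", p, q).unsqueeze(1)
        h = self.cnn(interaction).flatten(1)
        return self.out(h).squeeze(-1)

model = ConvNCFSketch(1000, 5000)
score = model(torch.tensor([3]), torch.tensor([42]))
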
Proceedings Article DOI

dTrust: A Simple Deep Learning Approach for Social Recommendation

TL;DR: This paper presents dTrust, a simple social recommendation approach that avoids using personal user information and relies solely on the topology of an anonymized trust-user-item network combining user trust relations with user rating scores.
Proceedings Article DOI

A Semi-Personalized System for User Cold Start Recommendation on Music Streaming Apps

TL;DR: This paper proposes a semi-personalized recommendation strategy, based on a deep neural network architecture and on a clustering of users from heterogeneous sources of information, for predicting the future musical preferences of cold-start users on Deezer.
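
The idea sits between non-personalized and fully personalized recommendation: cluster existing users, then serve each cold-start user the aggregated preferences of the nearest cluster, using only information available at registration time. A rough scikit-learn sketch with invented features and sizes (not Deezer's actual pipeline):

import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)

# Registration-time features for existing ("warm") users, standing in
# for heterogeneous sources such as demographics and onboarding choices.
warm_features = rng.normal(size=(5000, 16))
kmeans = KMeans(n_clusters=50, n_init=10, random_state=0).fit(warm_features)

# Precompute top items per cluster from warm users' listening history
# (random placeholders here, standing in for real aggregated counts).
top_items_per_cluster = {c: rng.choice(10_000, size=20, replace=False)
                         for c in range(50)}

# A cold-start user gets the recommendations of their nearest cluster.
new_user = rng.normal(size=(1, 16))
cluster = int(kmeans.predict(new_user)[0])
recommendations = top_items_per_cluster[cluster]
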
References
Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
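
What batch normalization computes per feature over a mini-batch is simple to state: standardize using the batch mean and variance, then rescale and shift with learned parameters gamma and beta. A numpy rendering of the training-time forward pass (inference would use running averages instead):

import numpy as np

def batch_norm_train(x, gamma, beta, eps=1e-5):
    """x: (batch, features). Returns gamma * x_hat + beta."""
    mean = x.mean(axis=0)                    # per-feature batch mean
    var = x.var(axis=0)                      # per-feature batch variance
    x_hat = (x - mean) / np.sqrt(var + eps)  # zero mean, unit variance
    return gamma * x_hat + beta              # learned scale and shift

x = np.random.randn(128, 64) * 3.0 + 5.0
gamma, beta = np.ones(64), np.zeros(64)
y = batch_norm_train(x, gamma, beta)
print(y.mean(axis=0)[:3], y.std(axis=0)[:3])  # approximately 0 and 1
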
Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: This paper presents a simple method for finding phrases in text, shows that learning good vector representations for millions of phrases is possible, and describes a simple alternative to the hierarchical softmax called negative sampling.
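
Negative sampling replaces the full softmax over the vocabulary with a few binary decisions: raise the score of the observed (word, context) pair and lower the scores of k randomly drawn negative contexts. A small numpy sketch of the per-pair objective, with random placeholder vectors:

import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

vocab, dim, k = 1000, 50, 5
W_in = rng.normal(scale=0.1, size=(vocab, dim))   # word vectors
W_out = rng.normal(scale=0.1, size=(vocab, dim))  # context vectors

center, context = 7, 42                      # one observed skip-gram pair
negatives = rng.integers(0, vocab, size=k)   # sampled negative contexts

v = W_in[center]
# Maximize log sigma(u_pos . v) + sum_j log sigma(-u_neg_j . v)
pos = np.log(sigmoid(W_out[context] @ v))
neg = np.log(sigmoid(-W_out[negatives] @ v)).sum()
loss = -(pos + neg)
print(loss)
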