scispace - formally typeset
Open AccessProceedings ArticleDOI

DeepFM: a factorization-machine based neural network for CTR prediction

TLDR
This paper shows that it is possible to derive an end-to-end learning model that emphasizes both low- and high-order feature interactions, and combines the power of factorization machines for recommendation and deep learning for feature learning in a new neural network architecture.
Abstract
Learning sophisticated feature interactions behind user behaviors is critical in maximizing CTR for recommender systems. Despite great progress, existing methods seem to have a strong bias towards low- or high-order interactions, or require expertise feature engineering. In this paper, we show that it is possible to derive an end-to-end learning model that emphasizes both low- and high-order feature interactions. The proposed model, DeepFM, combines the power of factorization machines for recommendation and deep learning for feature learning in a new neural network architecture. Compared to the latest Wide & Deep model from Google, DeepFM has a shared input to its "wide" and "deep" parts, with no need of feature engineering besides raw features. Comprehensive experiments are conducted to demonstrate the effectiveness and efficiency of DeepFM over the existing models for CTR prediction, on both benchmark data and commercial data.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Always Strengthen Your Strengths: A Drift-Aware Incremental Learning Framework for CTR Prediction

TL;DR: In this article , a drift-aware incremental learning framework based on ensemble learning is proposed to address catastrophic forgetting in CTR prediction in industrial data streams, where the model simply adapts to new data distribution all the time.
Journal ArticleDOI

PME: pruning-based multi-size embedding for recommender systems

TL;DR: Zhang et al. as discussed by the authors proposed a pruning-based multi-size embedding (PME) framework, which prunes the dimensions that have the least impact on model performance in the embedding to reduce its capacity.
Journal ArticleDOI

Boosting Factorization Machines via Saliency-Guided Mixup

TL;DR: Through theoretical analysis, it is proved that the proposed methods minimize the upper bound of the generalization error, which hold a beneficial effect on enhancing FMs and gives the first generalization bound of FM.
Proceedings ArticleDOI

PaddleBox: Communication-Efficient TeraByte-Scale Model Training Framework for Online Advertising

TL;DR: Wang et al. as mentioned in this paper proposed a hardware-aware training workflow that couples the hardware topology into the algorithm design, and introduced a k-step model merging algorithm for Adam and provided its convergence rate in non-convex optimization.
Proceedings ArticleDOI

When Newer is Not Better: Does Deep Learning Really Benefit Recommendation From Implicit Feedback?

TL;DR: This article performed a large-scale, systematic study to compare recent neural recommendation models against traditional ones in top-n recommendation from implicit data and proposed a set of evaluation strategies for measuring memorization performance, generalization performance and subgroup-specific performance of recommendation models.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Journal Article

Dropout: a simple way to prevent neural networks from overfitting

TL;DR: It is shown that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology, obtaining state-of-the-art results on many benchmark data sets.
Proceedings ArticleDOI

Deep Neural Networks for YouTube Recommendations

TL;DR: This paper details a deep candidate generation model and then describes a separate deep ranking model and provides practical lessons and insights derived from designing, iterating and maintaining a massive recommendation system with enormous user-facing impact.
Proceedings ArticleDOI

Factorization Machines

TL;DR: Factorization Machines (FM) are introduced which are a new model class that combines the advantages of Support Vector Machines (SVM) with factorization models and can mimic these models just by specifying the input data (i.e. the feature vectors).
Proceedings ArticleDOI

Restricted Boltzmann machines for collaborative filtering

TL;DR: This paper shows how a class of two-layer undirected graphical models, called Restricted Boltzmann Machines (RBM's), can be used to model tabular data, such as user's ratings of movies, and demonstrates that RBM's can be successfully applied to the Netflix data set.
Related Papers (5)