Deep Crossing: Web-Scale Modeling without Manually Crafted Combinatorial Features

doi:10.1145/2939672.2939704

Proceedings ArticleDOI

Deep Crossing: Web-Scale Modeling without Manually Crafted Combinatorial Features

Ying Shan, +5 more

- pp 255-262

Chats0

TLDR

The Deep Crossing model is proposed which is a deep neural network that automatically combines features to produce superior models and was able to build, from scratch, two web-scale models for a major paid search engine, and achieve superior results with only a sub-set of the features used in the production models.

Abstract:

Manually crafted combinatorial features have been the "secret sauce" behind many successful models. For web-scale applications, however, the variety and volume of features make these manually crafted features expensive to create, maintain, and deploy. This paper proposes the Deep Crossing model which is a deep neural network that automatically combines features to produce superior models. The input of Deep Crossing is a set of individual features that can be either dense or sparse. The important crossing features are discovered implicitly by the networks, which are comprised of an embedding and stacking layer, as well as a cascade of Residual Units. Deep Crossing is implemented with a modeling tool called the Computational Network Tool Kit (CNTK), powered by a multi-GPU platform. It was able to build, from scratch, two web-scale models for a major paid search engine, and achieve superior results with only a sub-set of the features used in the production models. This demonstrates the potential of using Deep Crossing as a general modeling paradigm to improve existing products, as well as to speed up the development of new models with a fraction of the investment in feature engineering and acquisition of deep domain knowledge.

Deep Crossing: Web-Scale Modeling without Manually Crafted Combinatorial Features

Citations

Deep Interest Network for Click-Through Rate Prediction

Deep Learning Based Recommender System: A Survey and New Perspectives

Neural Factorization Machines for Sparse Predictive Analytics

KGAT: Knowledge Graph Attention Network for Recommendation

KGAT: Knowledge Graph Attention Network for Recommendation

References

Deep Residual Learning for Image Recognition

ImageNet Classification with Deep Convolutional Neural Networks

Gradient-based learning applied to document recognition

Distributed Representations of Words and Phrases and their Compositionality

Object recognition from local scale-invariant features

Related Papers (5)

Wide & Deep Learning for Recommender Systems

Factorization Machines

Deep Interest Network for Click-Through Rate Prediction

DeepFM: a factorization-machine based neural network for CTR prediction

Deep Neural Networks for YouTube Recommendations