Showing papers on "Recommender system published in 2017"

PDF

Open Access

Proceedings Article•DOI•

[...]

Xiangnan He¹, Lizi Liao¹, Hanwang Zhang², Liqiang Nie³, Xia Hu⁴, Tat-Seng Chua¹ - Show less +2 more•Institutions (4)

National University of Singapore¹, Columbia University², Shandong University³, Texas A&M University⁴

03 Apr 2017

TL;DR: This work strives to develop techniques based on neural networks to tackle the key problem in recommendation --- collaborative filtering --- on the basis of implicit feedback, and presents a general framework named NCF, short for Neural network-based Collaborative Filtering.

...read moreread less

Abstract: In recent years, deep neural networks have yielded immense success on speech recognition, computer vision and natural language processing. However, the exploration of deep neural networks on recommender systems has received relatively less scrutiny. In this work, we strive to develop techniques based on neural networks to tackle the key problem in recommendation --- collaborative filtering --- on the basis of implicit feedback. Although some recent work has employed deep learning for recommendation, they primarily used it to model auxiliary information, such as textual descriptions of items and acoustic features of musics. When it comes to model the key factor in collaborative filtering --- the interaction between user and item features, they still resorted to matrix factorization and applied an inner product on the latent features of users and items. By replacing the inner product with a neural architecture that can learn an arbitrary function from data, we present a general framework named NCF, short for Neural network-based Collaborative Filtering. NCF is generic and can express and generalize matrix factorization under its framework. To supercharge NCF modelling with non-linearities, we propose to leverage a multi-layer perceptron to learn the user-item interaction function. Extensive experiments on two real-world datasets show significant improvements of our proposed NCF framework over the state-of-the-art methods. Empirical evidence shows that using deeper layers of neural networks offers better recommendation performance.

...read moreread less

4,419 citations

Proceedings Article•DOI•

DeepFM: a factorization-machine based neural network for CTR prediction

[...]

Huifeng Guo¹, Ruiming Tang², Yunming Ye¹, Zhenguo Li², Xiuqiang He² - Show less +1 more•Institutions (2)

Harbin Institute of Technology¹, Huawei²

19 Aug 2017

TL;DR: This paper shows that it is possible to derive an end-to-end learning model that emphasizes both low- and high-order feature interactions, and combines the power of factorization machines for recommendation and deep learning for feature learning in a new neural network architecture.

...read moreread less

Abstract: Learning sophisticated feature interactions behind user behaviors is critical in maximizing CTR for recommender systems. Despite great progress, existing methods seem to have a strong bias towards low- or high-order interactions, or require expertise feature engineering. In this paper, we show that it is possible to derive an end-to-end learning model that emphasizes both low- and high-order feature interactions. The proposed model, DeepFM, combines the power of factorization machines for recommendation and deep learning for feature learning in a new neural network architecture. Compared to the latest Wide & Deep model from Google, DeepFM has a shared input to its "wide" and "deep" parts, with no need of feature engineering besides raw features. Comprehensive experiments are conducted to demonstrate the effectiveness and efficiency of DeepFM over the existing models for CTR prediction, on both benchmark data and commercial data.

...read moreread less

1,695 citations

Posted Content•

Graph Convolutional Matrix Completion

[...]

Rianne van den Berg, Thomas Kipf, Max Welling

25 Oct 2017-arXiv: Machine Learning

TL;DR: A graph auto-encoder framework based on differentiable message passing on the bipartite interaction graph that shows competitive performance on standard collaborative filtering benchmarks and outperforms recent state-of-the-art methods.

...read moreread less

Abstract: We consider matrix completion for recommender systems from the point of view of link prediction on graphs. Interaction data such as movie ratings can be represented by a bipartite user-item graph with labeled edges denoting observed ratings. Building on recent progress in deep learning on graph-structured data, we propose a graph auto-encoder framework based on differentiable message passing on the bipartite interaction graph. Our model shows competitive performance on standard collaborative filtering benchmarks. In settings where complimentary feature information or structured data such as a social network is available, our framework outperforms recent state-of-the-art methods.

...read moreread less

910 citations

Proceedings Article•DOI•

Joint Deep Modeling of Users and Items Using Reviews for Recommendation

[...]

Lei Zheng¹, Vahid Noroozi¹, Philip S. Yu¹•Institutions (1)

University of Illinois at Chicago¹

02 Feb 2017

TL;DR: A deep model to learn item properties and user behaviors jointly from review text, named Deep Cooperative Neural Networks (DeepCoNN), consists of two parallel neural networks coupled in the last layers.

...read moreread less

Abstract: A large amount of information exists in reviews written by users. This source of information has been ignored by most of the current recommender systems while it can potentially alleviate the sparsity problem and improve the quality of recommendations. In this paper, we present a deep model to learn item properties and user behaviors jointly from review text. The proposed model, named Deep Cooperative Neural Networks (DeepCoNN), consists of two parallel neural networks coupled in the last layers. One of the networks focuses on learning user behaviors exploiting reviews written by the user, and the other one learns item properties from the reviews written for the item. A shared layer is introduced on the top to couple these two networks together. The shared layer enables latent factors learned for users and items to interact with each other in a manner similar to factorization machine techniques. Experimental results demonstrate that DeepCoNN significantly outperforms all baseline recommender systems on a variety of datasets.

...read moreread less

828 citations

Proceedings Article•DOI•

Deep matrix factorization models for recommender systems

[...]

Hong-Jian Xue, Xinyu Dai¹, Jianbing Zhang¹, Shujian Huang¹, Jiajun Chen¹ - Show less +1 more•Institutions (1)

Nanjing University¹

01 Aug 2017

TL;DR: A novel matrix factorization model with neural network architecture is proposed that outperformed other state-of-the-art methods on several benchmark datasets and considers both explicit ratings and implicit feedback for a better optimization.

...read moreread less

Abstract: Recommender systems usually make personalized recommendation with user-item interaction ratings, implicit feedback and auxiliary information. Matrix factorization is the basic idea to predict a personalized ranking over a set of items for an individual user with the similarities among users and items. In this paper, we propose a novel matrix factorization model with neural network architecture. Firstly, we construct a user-item matrix with explicit ratings and non-preference implicit feedback. With this matrix as the input, we present a deep structure learning architecture to learn a common low dimensional space for the representations of users and items. Secondly, we design a new loss function based on binary cross entropy, in which we consider both explicit ratings and implicit feedback for a better optimization. The experimental results show the effectiveness of both our proposed model and the loss function. On several benchmark datasets, our model outperformed other stateof-the-art methods. We also conduct extensive experiments to evaluate the performance within different experimental settings.

...read moreread less

749 citations

Proceedings Article•DOI•

Recurrent Recommender Networks

[...]

Chao-Yuan Wu¹, Amr Ahmed², Alex Beutel², Alexander J. Smola³, How Jing⁴ - Show less +1 more•Institutions (4)

University of Texas at Austin¹, Google², Carnegie Mellon University³, LinkedIn⁴

02 Feb 2017

TL;DR: Recurrent Recommender Networks (RRN) are proposed that are able to predict future behavioral trajectories by endowing both users and movies with a Long Short-Term Memory (LSTM) autoregressive model that captures dynamics, in addition to a more traditional low-rank factorization.

...read moreread less

Abstract: Recommender systems traditionally assume that user profiles and movie attributes are static. Temporal dynamics are purely reactive, that is, they are inferred after they are observed, e.g. after a user's taste has changed or based on hand-engineered temporal bias corrections for movies. We propose Recurrent Recommender Networks (RRN) that are able to predict future behavioral trajectories. This is achieved by endowing both users and movies with a Long Short-Term Memory (LSTM) autoregressive model that captures dynamics, in addition to a more traditional low-rank factorization. On multiple real-world datasets, our model offers excellent prediction accuracy and it is very compact, since we need not learn latent state but rather just the state transition function.

...read moreread less

650 citations

Journal Article•DOI•

Deep Learning based Recommender System: A Survey and New Perspectives.

[...]

Shuai Zhang¹, Lina Yao¹, Aixin Sun²•Institutions (2)

University of New South Wales¹, Nanyang Technological University²

24 Jul 2017-arXiv: Information Retrieval

TL;DR: A taxonomy of deep learning-based recommendation models is provided and a comprehensive summary of the state of the art is provided, along with new perspectives pertaining to this new and exciting development of the field.

...read moreread less

Abstract: With the ever-growing volume of online information, recommender systems have been an effective strategy to overcome such information overload. The utility of recommender systems cannot be overstated, given its widespread adoption in many web applications, along with its potential impact to ameliorate many problems related to over-choice. In recent years, deep learning has garnered considerable interest in many research fields such as computer vision and natural language processing, owing not only to stellar performance but also the attractive property of learning feature representations from scratch. The influence of deep learning is also pervasive, recently demonstrating its effectiveness when applied to information retrieval and recommender systems research. Evidently, the field of deep learning in recommender system is flourishing. This article aims to provide a comprehensive review of recent research efforts on deep learning based recommender systems. More concretely, we provide and devise a taxonomy of deep learning based recommendation models, along with providing a comprehensive summary of the state-of-the-art. Finally, we expand on current trends and provide new perspectives pertaining to this new exciting development of the field.

...read moreread less

560 citations

Journal Article•DOI•

Collaborative filtering and deep learning based recommendation system for cold start items

[...]

Jian Wei¹, Jianhua He¹, Kai Chen², Yi Zhou², Zuoyin Tang¹ - Show less +1 more•Institutions (2)

Aston University¹, Shanghai Jiao Tong University²

01 Mar 2017-Expert Systems With Applications

TL;DR: Two recommendation models to solve the CCS and ICS problems for new items are proposed, which are based on a framework of tightly coupled CF approach and deep learning neural network.

...read moreread less

Abstract: Recommender system is a specific type of intelligent systems, which exploits historical user ratings on items and/or auxiliary information to make recommendations on items to the users. It plays a critical role in a wide range of online shopping, e-commercial services and social networking applications. Collaborative filtering (CF) is the most popular approaches used for recommender systems, but it suffers from complete cold start (CCS) problem where no rating record are available and incomplete cold start (ICS) problem where only a small number of rating records are available for some new items or users in the system. In this paper, we propose two recommendation models to solve the CCS and ICS problems for new items, which are based on a framework of tightly coupled CF approach and deep learning neural network. A specific deep neural network SADE is used to extract the content features of the items. The state of the art CF model, timeSVD++, which models and utilizes temporal dynamics of user preferences and item features, is modified to take the content features into prediction of ratings for cold start items. Extensive experiments on a large Netflix rating dataset of movies are performed, which show that our proposed recommendation models largely outperform the baseline models for rating prediction of cold start items. The two proposed recommendation models are also evaluated and compared on ICS items, and a flexible scheme of model retraining and switching is proposed to deal with the transition of items from cold start to non-cold start status. The experiment results on Netflix movie recommendation show the tight coupling of CF approach and deep learning neural network is feasible and very effective for cold start item recommendation. The design is general and can be applied to many other recommender systems for online shopping and social networking applications. The solution of cold start item problem can largely improve user experience and trust of recommender systems, and effectively promote cold start items.

...read moreread less

538 citations

Posted Content•

DeepFM: A Factorization-Machine based Neural Network for CTR Prediction

[...]

Huifeng Guo¹, Ruiming Tang², Yunming Ye¹, Zhenguo Li², Xiuqiang He² - Show less +1 more•Institutions (2)

Harbin Institute of Technology¹, Huawei²

13 Mar 2017-arXiv: Information Retrieval

TL;DR: DeepFM as mentioned in this paper combines the power of factorization machines for recommendation and deep learning for feature learning in a new neural network architecture, which has a shared input to its "wide" and "deep" parts.

...read moreread less

Abstract: Learning sophisticated feature interactions behind user behaviors is critical in maximizing CTR for recommender systems. Despite great progress, existing methods seem to have a strong bias towards low- or high-order interactions, or require expertise feature engineering. In this paper, we show that it is possible to derive an end-to-end learning model that emphasizes both low- and high-order feature interactions. The proposed model, DeepFM, combines the power of factorization machines for recommendation and deep learning for feature learning in a new neural network architecture. Compared to the latest Wide \& Deep model from Google, DeepFM has a shared input to its "wide" and "deep" parts, with no need of feature engineering besides raw features. Comprehensive experiments are conducted to demonstrate the effectiveness and efficiency of DeepFM over the existing models for CTR prediction, on both benchmark data and commercial data.

...read moreread less

504 citations

Journal Article•DOI•

Social Collaborative Filtering by Trust

[...]

Bo Yang¹, Yu Lei², Jiming Liu³, Wenjie Li²•Institutions (3)

Jilin University¹, Hong Kong Polytechnic University², Hong Kong Baptist University³

01 Aug 2017-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: In this paper, a model-based method that adopts matrix factorization technique that maps users into low-dimensional latent feature spaces in terms of their trust relationship, and aims to more accurately reflect the users reciprocal influence on the formation of their own opinions and to learn better preferential patterns of users for high-quality recommendations.

...read moreread less

Abstract: Recommender systems are used to accurately and actively provide users with potentially interesting information or services. Collaborative filtering is a widely adopted approach to recommendation, but sparse data and cold-start users are often barriers to providing high quality recommendations. To address such issues, we propose a novel method that works to improve the performance of collaborative filtering recommendations by integrating sparse rating data given by users and sparse social trust network among these same users. This is a model-based method that adopts matrix factorization technique that maps users into low-dimensional latent feature spaces in terms of their trust relationship, and aims to more accurately reflect the users reciprocal influence on the formation of their own opinions and to learn better preferential patterns of users for high-quality recommendations. We use four large-scale datasets to show that the proposed method performs much better, especially for cold start users, than state-of-the-art recommendation algorithms for social collaborative filtering based on trust.

...read moreread less

479 citations

Proceedings Article•DOI•

Collaborative Metric Learning

[...]

Cheng-Kang Hsieh¹, Longqi Yang², Yin Cui², Tsung-Yi Lin², Serge Belongie², Deborah Estrin² - Show less +2 more•Institutions (2)

University of California, Los Angeles¹, Cornell University²

03 Apr 2017

TL;DR: The proposed algorithm outperforms state-of-the-art collaborative filtering algorithms on a wide range of recommendation tasks and uncovers the underlying spectrum of users' fine-grained preferences.

...read moreread less

Abstract: Metric learning algorithms produce distance metrics that capture the important relationships among data. In this work, we study the connection between metric learning and collaborative filtering. We propose Collaborative Metric Learning (CML) which learns a joint metric space to encode not only users' preferences but also the user-user and item-item similarity. The proposed algorithm outperforms state-of-the-art collaborative filtering algorithms on a wide range of recommendation tasks and uncovers the underlying spectrum of users' fine-grained preferences. CML also achieves significant speedup for Top-K recommendation tasks using off-the-shelf, approximate nearest-neighbor search, with negligible accuracy reduction.

...read moreread less

Proceedings Article•DOI•

Personalizing Session-based Recommendations with Hierarchical Recurrent Neural Networks

[...]

Massimo Quadrana¹, Alexandros Karatzoglou², Balázs Hidasi, Paolo Cremonesi¹•Institutions (2)

Polytechnic University of Milan¹, Telefónica²

27 Aug 2017

TL;DR: A seamless way to personalize RNN models with cross-session information transfer is proposed and a Hierarchical RNN model that relays end evolves latent hidden states of the RNNs across user sessions is devised.

...read moreread less

Abstract: Session-based recommendations are highly relevant in many modern on-line services (e.g. e-commerce, video streaming) and recommendation settings. Recently, Recurrent Neural Networks have been shown to perform very well in session-based settings. While in many session-based recommendation domains user identifiers are hard to come by, there are also domains in which user profiles are readily available. We propose a seamless way to personalize RNN models with cross-session information transfer and devise a Hierarchical RNN model that relays end evolves latent hidden states of the RNNs across user sessions. Results on two industry datasets show large improvements over the session-only RNNs.

...read moreread less

Proceedings Article•DOI•

Meta-Graph Based Recommendation Fusion over Heterogeneous Information Networks

[...]

Huan Zhao¹, Quanming Yao¹, Jianda Li¹, Yangqiu Song¹, Dik Lun Lee¹ - Show less +1 more•Institutions (1)

Hong Kong University of Science and Technology¹

04 Aug 2017

TL;DR: This paper introduces the concept of meta-graph to HIN-based recommendation, and solves the information fusion problem with a "matrix factorization + factorization machine (FM)" approach, and proposes to use FM with Group lasso (FMG) to automatically learn from the observed ratings to effectively select useful meta- graph based features.

...read moreread less

Abstract: Heterogeneous Information Network (HIN) is a natural and general representation of data in modern large commercial recommender systems which involve heterogeneous types of data. HIN based recommenders face two problems: how to represent the high-level semantics of recommendations and how to fuse the heterogeneous information to make recommendations. In this paper, we solve the two problems by first introducing the concept of meta-graph to HIN-based recommendation, and then solving the information fusion problem with a "matrix factorization (MF) + factorization machine (FM)" approach. For the similarities generated by each meta-graph, we perform standard MF to generate latent features for both users and items. With different meta-graph based features, we propose to use FM with Group lasso (FMG) to automatically learn from the observed ratings to effectively select useful meta-graph based features. Experimental results on two real-world datasets, Amazon and Yelp, show the effectiveness of our approach compared to state-of-the-art FM and other HIN-based recommendation algorithms.

...read moreread less

Journal Article•DOI•

Two Decades of Recommender Systems at Amazon.com

[...]

Brent R. Smith¹, Greg Linden²•Institutions (2)

Amazon.com¹, Microsoft²

01 May 2017-IEEE Internet Computing

TL;DR: This update to their original paper discusses some of the changes as Amazon has grown, which help customers discover items they might otherwise not have found.

...read moreread less

Abstract: Amazon is well-known for personalization and recommendations, which help customers discover items they might otherwise not have found. In this update to their original paper, the authors discuss some of the changes as Amazon has grown.

...read moreread less

Proceedings Article•DOI•

Collaborative Variational Autoencoder for Recommender Systems

[...]

Xiaopeng Li¹, James She¹•Institutions (1)

Hong Kong University of Science and Technology¹

04 Aug 2017

TL;DR: A Bayesian generative model called collaborative variational autoencoder (CVAE) that considers both rating and content for recommendation in multimedia scenario that is able to significantly outperform the state-of-the-art recommendation methods with more robust performance is proposed.

...read moreread less

Abstract: Modern recommender systems usually employ collaborative filtering with rating information to recommend items to users due to its successful performance. However, because of the drawbacks of collaborative-based methods such as sparsity, cold start, etc., more attention has been drawn to hybrid methods that consider both the rating and content information. Most of the previous works in this area cannot learn a good representation from content for recommendation task or consider only text modality of the content, thus their methods are very limited in current multimedia scenario. This paper proposes a Bayesian generative model called collaborative variational autoencoder (CVAE) that considers both rating and content for recommendation in multimedia scenario. The model learns deep latent representations from content data in an unsupervised manner and also learns implicit relationships between items and users from both content and rating. Unlike previous works with denoising criteria, the proposed CVAE learns a latent distribution for content in latent space instead of observation space through an inference network and can be easily extended to other multimedia modalities other than text. Experiments show that CVAE is able to significantly outperform the state-of-the-art recommendation methods with more robust performance.

...read moreread less

Proceedings Article•DOI•

Interpretable Convolutional Neural Networks with Dual Local and Global Attention for Review Rating Prediction

[...]

Sungyong Seo¹, Jing Huang, Hao Yang, Yan Liu¹•Institutions (1)

University of Southern California¹

27 Aug 2017

TL;DR: The proposed convolutional neural networks with dual attention model outperforms HFT and ConvMF+ in terms of mean square errors (MSE) and the superior quality of user/item embeddings learned from the model is compared.

...read moreread less

Abstract: Recently, many e-commerce websites have encouraged their users to rate shopping items and write review texts. This review information has been very useful for understanding user preferences and item properties, as well as enhancing the capability to make personalized recommendations of these websites. In this paper, we propose to model user preferences and item properties using convolutional neural networks (CNNs) with dual local and global attention, motivated by the superiority of CNNs to extract complex features. By using aggregated review texts from a user and aggregated review text for an item, our model can learn the unique features (embedding) of each user and each item. These features are then used to predict ratings. We train these user and item networks jointly which enable the interaction between users and items in a similar way as matrix factorization. The local attention provides us insight on a user's preferences or an item's properties. The global attention helps CNNs focus on the semantic meaning of the whole review text. Thus, the combined local and global attentions enable an interpretable and better-learned representation of users and items. We validate the proposed models by testing on popular review datasets in Yelp and Amazon and compare the results with matrix factorization (MF), the hidden factor and topical (HFT) model, and the recently proposed convolutional matrix factorization (ConvMF+). Our proposed CNNs with dual attention model outperforms HFT and ConvMF+ in terms of mean square errors (MSE). In addition, we compare the user/item embeddings learned from these models for classification and recommendation. These results also confirm the superior quality of user/item embeddings learned from our model.

...read moreread less

Proceedings Article•DOI•

When Recurrent Neural Networks meet the Neighborhood for Session-Based Recommendation

[...]

Dietmar Jannach¹, Malte Ludewig¹•Institutions (1)

Technical University of Dortmund¹

27 Aug 2017

TL;DR: This work shows based on a comprehensive empirical evaluation that a heuristics-based nearest neighbor (kNN) scheme for sessions outperforms GRU4REC in the large majority of the tested configurations and datasets and ensures the scalability of the kNN method.

...read moreread less

Abstract: Deep learning methods have led to substantial progress in various application fields of AI, and in recent years a number of proposals were made to improve recommender systems with artificial neural networks. For the problem of making session-based recommendations, i.e., for recommending the next item in an anonymous session, Hidasi et al.~recently investigated the application of recurrent neural networks with Gated Recurrent Units (GRU4REC). Assessing the true effectiveness of such novel approaches based only on what is reported in the literature is however difficult when no standard evaluation protocols are applied and when the strength of the baselines used in the performance comparison is not clear. In this work we show based on a comprehensive empirical evaluation that a heuristics-based nearest neighbor (kNN) scheme for sessions outperforms GRU4REC in the large majority of the tested configurations and datasets. Neighborhood sampling and efficient in-memory data structures ensure the scalability of the kNN method. The best results in the end were often achieved when we combine the kNN approach with GRU4REC, which shows that RNNs can leverage sequential signals in the data that cannot be detected by the co-occurrence-based kNN method.

...read moreread less

Posted Content•

Data Decisions and Theoretical Implications when Adversarially Learning Fair Representations

[...]

Alex Beutel, Ed H. Chi, Jilin Chen, Zhe Zhao

01 Jul 2017-arXiv: Learning

TL;DR: An adversarial training procedure is used to remove information about the sensitive attribute from the latent representation learned by a neural network, and the data distribution empirically drives the adversary's notion of fairness.

...read moreread less

Abstract: How can we learn a classifier that is "fair" for a protected or sensitive group, when we do not know if the input to the classifier belongs to the protected group? How can we train such a classifier when data on the protected group is difficult to attain? In many settings, finding out the sensitive input attribute can be prohibitively expensive even during model training, and sometimes impossible during model serving. For example, in recommender systems, if we want to predict if a user will click on a given recommendation, we often do not know many attributes of the user, e.g., race or age, and many attributes of the content are hard to determine, e.g., the language or topic. Thus, it is not feasible to use a different classifier calibrated based on knowledge of the sensitive attribute. Here, we use an adversarial training procedure to remove information about the sensitive attribute from the latent representation learned by a neural network. In particular, we study how the choice of data for the adversarial training effects the resulting fairness properties. We find two interesting results: a small amount of data is needed to train these adversarial models, and the data distribution empirically drives the adversary's notion of fairness.

...read moreread less

Journal Article•DOI•

Diversity in recommender systems A survey

[...]

Matev Kunaver¹, Toma Porl¹•Institutions (1)

University of Ljubljana¹

01 May 2017-Knowledge Based Systems

TL;DR: An overview of research done on this topic from one of the first mentions of diversity in 2001 until now is provided to offer a good overview to a researcher looking for the state-of-the-art on thistopic and to help a new developer get familiar with the topic.

...read moreread less

Abstract: Diversification has become one of the leading topics of recommender system research not only as a way to solve the over-fitting problem but also an approach to increasing the quality of the users experience with the recommender system. This article aims to provide an overview of research done on this topic from one of the first mentions of diversity in 2001 until now. The articles ,and research, have been divided into three sub-topics for a better overview of the work done in the field of recommendation diversification: the definition and evaluation of diversity; the impact of diversification on the quality of recommendation results and the development of diversification algorithms themselves. In this way, the article aims both to offer a good overview to a researcher looking for the state-of-the-art on this topic and to help a new developer get familiar with the topic.

...read moreread less

Proceedings Article•

Geometric Matrix Completion with Recurrent Multi-Graph Neural Networks

[...]

Federico Monti¹, Michael M. Bronstein², Xavier Bresson³•Institutions (3)

Polytechnic University of Milan¹, Intel², École Polytechnique Fédérale de Lausanne³

01 Jan 2017

TL;DR: In this paper, a multi-graph convolutional neural network (CNN) was used to learn graph-structured patterns from users and items, and a recurrent neural network applied a learnable diffusion on score matrix.

...read moreread less

Abstract: Matrix completion models are among the most common formulations of recommender systems. Recent works have showed a boost of performance of these techniques when introducing the pairwise relationships between users/items in the form of graphs, and imposing smoothness priors on these graphs. However, such techniques do not fully exploit the local stationary structures on user/item graphs, and the number of parameters to learn is linear w.r.t. the number of users and items. We propose a novel approach to overcome these limitations by using geometric deep learning on graphs. Our matrix completion architecture combines a novel multi-graph convolutional neural network that can learn meaningful statistical graph-structured patterns from users and items, and a recurrent neural network that applies a learnable diffusion on the score matrix. Our neural network system is computationally attractive as it requires a constant number of parameters independent of the matrix size. We apply our method on several standard datasets, showing that it outperforms state-of-the-art matrix completion techniques.

...read moreread less

Proceedings Article•DOI•

Bridging Collaborative Filtering and Semi-Supervised Learning: A Neural Approach for POI Recommendation

[...]

Carl Yang¹, Lanxiao Bai¹, Chao Zhang¹, Quan Yuan¹, Jiawei Han¹ - Show less +1 more•Institutions (1)

University of Illinois at Urbana–Champaign¹

04 Aug 2017

TL;DR: This work proposes to devise a general and principled SSL (semi-supervised learning) framework, to alleviate data scarcity via smoothing among neighboring users and POIs, and treat various context by regularizing user preference based on context graphs.

...read moreread less

Abstract: Recommender system is one of the most popular data mining topics that keep drawing extensive attention from both academia and industry. Among them, POI (point of interest) recommendation is extremely practical but challenging: it greatly benefits both users and businesses in real-world life, but it is hard due to data scarcity and various context. While a number of algorithms attempt to tackle the problem w.r.t. specific data and problem settings, they often fail when the scenarios change. In this work, we propose to devise a general and principled SSL (semi-supervised learning) framework, to alleviate data scarcity via smoothing among neighboring users and POIs, and treat various context by regularizing user preference based on context graphs. To enable such a framework, we develop PACE (Preference And Context Embedding), a deep neural architecture that jointly learns the embeddings of users and POIs to predict both user preference over POIs and various context associated with users and POIs. We show that PACE successfully bridges CF (collaborative filtering) and SSL by generalizing the de facto methods matrix factorization of CF and graph Laplacian regularization of SSL. Extensive experiments on two real location-based social network datasets demonstrate the effectiveness of PACE.

...read moreread less

Proceedings Article•DOI•

Joint Representation Learning for Top-N Recommendation with Heterogeneous Information Sources

[...]

Yongfeng Zhang¹, Qingyao Ai¹, Xu Chen², W. Bruce Croft¹•Institutions (2)

University of Massachusetts Amherst¹, Tsinghua University²

06 Nov 2017

TL;DR: This work analyzes how information propagates among different information sources in a gradient-descent learning paradigm, and proposes an extendable version of the JRL framework (eJRL), which is rigorously extendable to new information sources to avoid model re-training in practice.

...read moreread less

Abstract: The Web has accumulated a rich source of information, such as text, image, rating, etc, which represent different aspects of user preferences. However, the heterogeneous nature of this information makes it difficult for recommender systems to leverage in a unified framework to boost the performance. Recently, the rapid development of representation learning techniques provides an approach to this problem. By translating the various information sources into a unified representation space, it becomes possible to integrate heterogeneous information for informed recommendation. In this work, we propose a Joint Representation Learning (JRL) framework for top-N recommendation. In this framework, each type of information source (review text, product image, numerical rating, etc) is adopted to learn the corresponding user and item representations based on available (deep) representation learning architectures. Representations from different sources are integrated with an extra layer to obtain the joint representations for users and items. In the end, both the per-source and the joint representations are trained as a whole using pair-wise learning to rank for top-N recommendation. We analyze how information propagates among different information sources in a gradient-descent learning paradigm, based on which we further propose an extendable version of the JRL framework (eJRL), which is rigorously extendable to new information sources to avoid model re-training in practice. By representing users and items into embeddings offline, and using a simple vector multiplication for ranking score calculation online, our framework also has the advantage of fast online prediction compared with other deep learning approaches to recommendation that learn a complex prediction network for online calculation.

...read moreread less

Posted Content•

Neural Collaborative Filtering

[...]

Xiangnan He¹, Lizi Liao¹, Hanwang Zhang², Liqiang Nie³, Xia Hu⁴, Tat-Seng Chua¹ - Show less +2 more•Institutions (4)

National University of Singapore¹, Columbia University², Shandong University³, Texas A&M University⁴

16 Aug 2017-arXiv: Information Retrieval

TL;DR: Neural network-based collaborative filtering (NCF) as discussed by the authors is a general framework for deep learning-based recommender systems, which can express and generalize matrix factorization under its framework and leverage a multi-layer perceptron to learn the user-item interaction function.

...read moreread less

Abstract: In recent years, deep neural networks have yielded immense success on speech recognition, computer vision and natural language processing. However, the exploration of deep neural networks on recommender systems has received relatively less scrutiny. In this work, we strive to develop techniques based on neural networks to tackle the key problem in recommendation -- collaborative filtering -- on the basis of implicit feedback. Although some recent work has employed deep learning for recommendation, they primarily used it to model auxiliary information, such as textual descriptions of items and acoustic features of musics. When it comes to model the key factor in collaborative filtering -- the interaction between user and item features, they still resorted to matrix factorization and applied an inner product on the latent features of users and items. By replacing the inner product with a neural architecture that can learn an arbitrary function from data, we present a general framework named NCF, short for Neural network-based Collaborative Filtering. NCF is generic and can express and generalize matrix factorization under its framework. To supercharge NCF modelling with non-linearities, we propose to leverage a multi-layer perceptron to learn the user-item interaction function. Extensive experiments on two real-world datasets show significant improvements of our proposed NCF framework over the state-of-the-art methods. Empirical evidence shows that using deeper layers of neural networks offers better recommendation performance.

...read moreread less

Proceedings Article•DOI•

Translation-based Recommendation

[...]

Ruining He¹, Wang-Cheng Kang¹, Julian McAuley¹•Institutions (1)

University of California, San Diego¹

08 Jul 2017-arXiv: Information Retrieval

TL;DR: This paper proposes a unified method, TransRec, to model such third-order relationships for large-scale sequential prediction, and embeds items into a 'transition space' where users are modeled as translation vectors operating on item sequences.

...read moreread less

Abstract: Modeling the complex interactions between users and items as well as amongst items themselves is at the core of designing successful recommender systems. One classical setting is predicting users' personalized sequential behavior (or `next-item' recommendation), where the challenges mainly lie in modeling `third-order' interactions between a user, her previously visited item(s), and the next item to consume. Existing methods typically decompose these higher-order interactions into a combination of pairwise relationships, by way of which user preferences (user-item interactions) and sequential patterns (item-item interactions) are captured by separate components. In this paper, we propose a unified method, TransRec, to model such third-order relationships for large-scale sequential prediction. Methodologically, we embed items into a `transition space' where users are modeled as translation vectors operating on item sequences. Empirically, this approach outperforms the state-of-the-art on a wide spectrum of real-world datasets. Data and code are available at this https URL.

...read moreread less

Proceedings Article•DOI•

Sequential User-based Recurrent Neural Network Recommendations

[...]

Tim Donkers¹, Benedikt Loepp¹, Jürgen Ziegler¹•Institutions (1)

University of Duisburg-Essen¹

27 Aug 2017

TL;DR: This paper extends Recurrent Neural Networks by considering unique characteristics of the Recommender Systems domain and shows how individual users can be represented in addition to sequences of consumed items in a new type of Gated Recurrent Unit to effectively produce personalized next item recommendations.

...read moreread less

Abstract: Recurrent Neural Networks are powerful tools for modeling sequences. They are flexibly extensible and can incorporate various kinds of information including temporal order. These properties make them well suited for generating sequential recommendations. In this paper, we extend Recurrent Neural Networks by considering unique characteristics of the Recommender Systems domain. One of these characteristics is the explicit notion of the user recommendations are specifically generated for. We show how individual users can be represented in addition to sequences of consumed items in a new type of Gated Recurrent Unit to effectively produce personalized next item recommendations. Offline experiments on two real-world datasets indicate that our extensions clearly improve objective performance when compared to state-of-the-art recommender algorithms and to a conventional Recurrent Neural Network.

...read moreread less

Journal Article•DOI•

On Deep Learning for Trust-Aware Recommendations in Social Networks

[...]

Shuiguang Deng¹, Longtao Huang², Guandong Xu³, Xindong Wu⁴, Zhaohui Wu¹ - Show less +1 more•Institutions (4)

Zhejiang University¹, Chinese Academy of Sciences², University of Technology, Sydney³, University of Vermont⁴

01 May 2017-IEEE Transactions on Neural Networks

TL;DR: A two-phase recommendation process is proposed to utilize deep learning to determinate the initialization in MF for trust-aware social recommendations and to differentiate the community effect in user’s trusted friendships.

...read moreread less

Abstract: With the emergence of online social networks, the social network-based recommendation approach is popularly used. The major benefit of this approach is the ability of dealing with the problems with cold-start users. In addition to social networks, user trust information also plays an important role to obtain reliable recommendations. Although matrix factorization (MF) becomes dominant in recommender systems, the recommendation largely relies on the initialization of the user and item latent feature vectors. Aiming at addressing these challenges, we develop a novel trust-based approach for recommendation in social networks. In particular, we attempt to leverage deep learning to determinate the initialization in MF for trust-aware social recommendations and to differentiate the community effect in user’s trusted friendships. A two-phase recommendation process is proposed to utilize deep learning in initialization and to synthesize the users’ interests and their trusted friends’ interests together with the impact of community effect for recommendations. We perform extensive experiments on real-world social network data to demonstrate the accuracy and effectiveness of our proposed approach in comparison with other state-of-the-art methods.

...read moreread less

Journal Article•DOI•

An experimental evaluation of point-of-interest recommendation in location-based social networks

[...]

Yiding Liu¹, Tuan-Anh Nguyen Pham¹, Gao Cong¹, Quan Yuan²•Institutions (2)

Nanyang Technological University¹, University of Illinois at Urbana–Champaign²

01 Jun 2017

TL;DR: An all-around evaluation of 12 state-of-the-art POI recommendation models to provide readers with an overall picture of the cutting-edge research onPOI recommendation and obtain several important findings.

...read moreread less

Abstract: Point-of-interest (POI) recommendation is an important service to Location-Based Social Networks (LBSNs) that can benefit both users and businesses. In recent years, a number of POI recommender systems have been proposed, but there is still a lack of systematical comparison thereof. In this paper, we provide an all-around evaluation of 12 state-of-the-art POI recommendation models. From the evaluation, we obtain several important findings, based on which we can better understand and utilize POI recommendation models in various scenarios. We anticipate this work to provide readers with an overall picture of the cutting-edge research on POI recommendation.

...read moreread less

Proceedings Article•

A Hybrid Collaborative Filtering Model with Deep Structure for Recommender Systems

[...]

Xin Dong, Lei Yu, Zhonghuo Wu, Yuxia Sun, Lingfeng Yuan, Fangxi Zhang - Show less +2 more

12 Feb 2017

TL;DR: This work utilizes advances of learning effective representations in deep learning, and proposes a hybrid model which jointly performs deep users and items’ latent factors learning from side information and collaborative filtering from the rating matrix.

...read moreread less

Abstract: Collaborative filtering (CF) is a widely used approach in recommender systems to solve many real-world problems. Traditional CF-based methods employ the user-item matrix which encodes the individual preferences of users for items for learning to make recommendation. In real applications, the rating matrix is usually very sparse, causing CF-based methods to degrade significantly in recommendation performance. In this case, some improved CF methods utilize the increasing amount of side information to address the data sparsity problem as well as the cold start problem. However, the learned latent factors may not be effective due to the sparse nature of the user-item matrix and the side information. To address this problem, we utilize advances of learning effective representations in deep learning, and propose a hybrid model which jointly performs deep users and items’ latent factors learning from side information and collaborative filtering from the rating matrix. Extensive experimental results on three real-world datasets show that our hybrid model outperforms other methods in effectively utilizing side information and achieves performance improvement.

...read moreread less

Proceedings Article•DOI•

Cross-domain recommendation: an embedding and mapping approach

[...]

Tong Man¹, Huawei Shen¹, Xiaolong Jin¹, Xueqi Cheng¹•Institutions (1)

Chinese Academy of Sciences¹

19 Aug 2017

TL;DR: The proposed EMCDR framework distinguishes itself from existing crossdomain recommendation models in two aspects: a multi-layer perceptron is used to capture the nonlinear mapping function across domains, and only the entities with sufficient data are used to learn the mapping function, guaranteeing its robustness to noise caused by data sparsity in single domain.

...read moreread less

Abstract: Data sparsity is one of the most challenging problems for recommender systems. One promising solution to this problem is cross-domain recommendation, i.e., leveraging feedbacks or ratings from multiple domains to improve recommendation performance in a collective manner. In this paper, we propose an Embedding and Mapping framework for Cross-Domain Recommendation, called EMCDR. The proposed EMCDR framework distinguishes itself from existing crossdomain recommendation models in two aspects. First, a multi-layer perceptron is used to capture the nonlinear mapping function across domains, which offers high flexibility for learning domain-specific features of entities in each domain. Second, only the entities with sufficient data are used to learn the mapping function, guaranteeing its robustness to noise caused by data sparsity in single domain. Extensive experiments on two cross-domain recommendation scenarios demonstrate that EMCDR significantly outperforms state-of-the-art cross-domain recommendation methods.

...read moreread less

Proceedings Article•DOI•

TransNets: Learning to Transform for Recommendation

[...]

Rose Catherine¹, William W. Cohen¹•Institutions (1)

Carnegie Mellon University¹

27 Aug 2017

TL;DR: TransNets as mentioned in this paper extends the DeepCoNN model by introducing an additional latent layer representing the target user-target item pair, which is used to regularize this layer, at training time, to be similar to another latent representation of a target user's review of the target item.

...read moreread less

Abstract: Recently, deep learning methods have been shown to improve the performance of recommender systems over traditional methods, especially when review text is available. For example, a recent model, DeepCoNN, uses neural nets to learn one latent representation for the text of all reviews written by a target user, and a second latent representation for the text of all reviews for a target item, and then combines these latent representations to obtain state-of-the-art performance on recommendation tasks. We show that (unsurprisingly) much of the predictive value of review text comes from reviews of the target user for the target item. We then introduce a way in which this information can be used in recommendation, even when the target user's review for the target item is not available. Our model, called TransNets, extends the DeepCoNN model by introducing an additional latent layer representing the target user-target item pair. We then regularize this layer, at training time, to be similar to another latent representation of the target user's review of the target item. We show that TransNets and extensions of it improve substantially over the previous state-of-the-art.

...read moreread less

Collapse