scispace - formally typeset
Open AccessProceedings ArticleDOI

Improving Collaborative Filtering Based Recommenders Using Topic Modelling

TLDR
This paper uses latent Dirichlet allocation (LDA) to learn latent properties of items, expressed in terms of topic proportions, derived from their textual description, and infer user's topic preferences or user profile in the same latent space, based on her historical ratings.
Abstract
Standard Collaborative Filtering (CF) algorithms make use of interactions between users and items in the form of implicit or explicit ratings alone for generating recommendations. Similarity among users or items is calculated purely based on rating overlap in this case, without considering explicit properties of users or items involved, limiting their applicability in domains with very sparse rating spaces. In many domains such as movies, news or electronic commerce recommenders, considerable contextual data in text form describing item properties is available along with the rating data, which could be utilized to improve recommendation quality. In this paper, we propose a novel approach to improve standard CF based recommenders by utilizing latent Dirichlet allocation (LDA) to learn latent properties of items, expressed in terms of topic proportions, derived from their textual description. We infer user's topic preferences or user profile in the same latent space, based on her historical ratings. While computing similarity between users, we make use of a combined similarity measure involving rating overlap as well as similarity in the latent topic space. This approach alleviates sparsity problem as it allows calculation of similarity between users even if they have not rated any items in common. Our experiments on multiple public datasets indicate that the proposed hybrid approach significantly outperforms standard User Based and Item Based CF recommenders in terms of classification accuracy metrics such as precision, recall and F-measure.

read more

Citations
More filters
Journal ArticleDOI

Rating LDA model for collaborative filtering

TL;DR: A Rating LDA (RLDA) Model for collaborative filtering by adding rating information to the Latent Dirichlet Allocation (LDA) is proposed, assuming that for similar interests, the higher the proportion of high ratings, the more popular the items.
Journal ArticleDOI

Similarity measures for Collaborative Filtering-based Recommender Systems: Review and experimental comparison

TL;DR: In this paper, the authors provide an in-depth review on similarity measures used for collaborative filtering-based recommender systems, and test their performance through an experimental study on three standard datasets (MovieLens100k, MovieLens1M and Jester).
Journal ArticleDOI

Sparse Online Learning for Collaborative Filtering

TL;DR: The results show that, the proposed approach is able to effectively online update the recommendation model from a sequence of rating observation and outperforms other baseline methods in terms of RMSE.
Journal ArticleDOI

Towards a knowledge-based probabilistic and context-aware social recommender system:

TL;DR: A knowledge-based probabilistic collaborative filtering (CF) recommendation approach using both an ontology-based semantic similarity metric and a latent Dirichlet allocation model-based recommendation technique and a context-aware software architecture and system with the objective of validating the recommendation approach in the eating domain.
Journal ArticleDOI

How to measure information similarity in online social networks

TL;DR: This paper considered users self-defined online social connections, specifically in Citeulike, which were built around an object-centered sociality as the gold standard of shared interests amongOnline social connections to computed the effectiveness of various similarity measures in their capabilities to estimate shared interests.
References
More filters
Journal ArticleDOI

Latent dirichlet allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Proceedings Article

Latent Dirichlet Allocation

TL;DR: This paper proposed a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hof-mann's aspect model, also known as probabilistic latent semantic indexing (pLSI).
Proceedings ArticleDOI

Item-based collaborative filtering recommendation algorithms

TL;DR: This paper analyzes item-based collaborative ltering techniques and suggests that item- based algorithms provide dramatically better performance than user-based algorithms, while at the same time providing better quality than the best available userbased algorithms.
Journal ArticleDOI

Evaluating collaborative filtering recommender systems

TL;DR: The key decisions in evaluating collaborative filtering recommender systems are reviewed: the user tasks being evaluated, the types of analysis and datasets being used, the ways in which prediction quality is measured, the evaluation of prediction attributes other than quality, and the user-based evaluation of the system as a whole.
Proceedings ArticleDOI

Factorization meets the neighborhood: a multifaceted collaborative filtering model

TL;DR: The factor and neighborhood models can now be smoothly merged, thereby building a more accurate combined model and a new evaluation metric is suggested, which highlights the differences among methods, based on their performance at a top-K recommendation task.
Related Papers (5)