Book Chapter

Evaluating Recommender Systems

TLDR
This paper discusses how to compare recommenders based on a set of properties that are relevant for the application, and focuses on comparative studies, where a few algorithms are compared using some evaluation metric, rather than absolute benchmarking of algorithms.
Abstract
Recommender systems are now popular both commercially and in the research community, where many approaches have been suggested for providing recommendations. In many cases a system designer who wishes to employ a recommender system must choose among a set of candidate approaches. A first step towards selecting an appropriate algorithm is to decide which properties of the application to focus upon when making this choice. Indeed, recommender systems have a variety of properties that may affect user experience, such as accuracy, robustness, scalability, and so forth. In this paper we discuss how to compare recommenders based on a set of properties that are relevant for the application. We focus on comparative studies, where a few algorithms are compared using some evaluation metric, rather than on absolute benchmarking of algorithms. We describe experimental settings appropriate for making choices between algorithms. We review three types of experiments: we start with an offline setting, where recommendation approaches are compared without user interaction; then review user studies, where a small group of subjects experiments with the system and reports on the experience; and finally describe large-scale online experiments, where real user populations interact with the system. In each of these cases we describe the types of questions that can be answered, and suggest protocols for experimentation. We also discuss how to draw trustworthy conclusions from the conducted experiments. We then review a large set of properties, and explain how to evaluate systems given the relevant properties. Finally, we survey a large set of evaluation metrics in the context of the property that they evaluate.
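The offline setting mentioned in the abstract, where approaches are compared without user interaction, can be sketched as a minimal hold-out protocol. This is an illustrative toy only, not the chapter's specific protocol: the ratings, the global-mean baseline, and the choice of RMSE as the metric are all assumptions for the sake of the example.

```python
import math

# Toy offline evaluation: hold out some ratings, predict them with a
# global-mean baseline, and score the predictions with RMSE.
train = {("u1", "i1"): 4.0, ("u1", "i2"): 3.0, ("u2", "i1"): 5.0}
heldout = {("u2", "i2"): 4.0, ("u3", "i1"): 4.5}

# Baseline recommender: predict the mean training rating for every pair.
global_mean = sum(train.values()) / len(train)

def rmse(predictions, truth):
    """Root mean squared error over held-out (user, item) pairs."""
    errs = [(predictions[k] - truth[k]) ** 2 for k in truth]
    return math.sqrt(sum(errs) / len(errs))

preds = {k: global_mean for k in heldout}
print(round(rmse(preds, heldout), 3))
```

A real offline study would replace the baseline with the candidate recommenders and repeat the split several times before comparing scores.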



Citations
Journal Article

The use of machine learning algorithms in recommender systems: A systematic review

TL;DR: The study concludes that Bayesian and decision tree algorithms are widely used in recommender systems because of their relative simplicity, and that the requirements and design phases of recommender system development appear to offer opportunities for further research.
Posted Content

The Use of Machine Learning Algorithms in Recommender Systems: A Systematic Review

TL;DR: In this paper, the authors present a systematic review of the literature on the use of machine learning algorithms in recommender systems and identify research opportunities for software engineering. They conclude that Bayesian and decision tree algorithms are widely used in recommender systems because of their relative simplicity, and that the requirements and design phases of recommender system development appear to offer opportunities for further research.
Journal Article

Current challenges and visions in music recommender systems research

TL;DR: In this article, the authors identify and shed light on what they believe are the most pressing challenges in recommender systems from both academic and industry perspectives, and detail possible future directions and visions for the further evolution of the field.
Journal Article

Facebook single and cross domain data for recommendation systems

TL;DR: This paper investigates the feasibility and effectiveness of utilizing existing available data from social networks for the recommendation process, specifically from Facebook, and highlights the benefits of utilizing cross domain Facebook data to achieve improvement in recommendation performance.
Proceedings Article

Let Me Explain: Impact of Personal and Impersonal Explanations on Trust in Recommender Systems

TL;DR: It is suggested that RS should provide richer explanations in order to increase their perceived recommendation quality and trustworthiness.
References
Journal Article

Controlling the false discovery rate: a practical and powerful approach to multiple testing

TL;DR: In this paper, a different approach to problems of multiple significance testing is presented, which calls for controlling the expected proportion of falsely rejected hypotheses: the false discovery rate (FDR), which equals the familywise error rate (FWER) when all null hypotheses are true but is smaller otherwise.
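The FDR-controlling procedure this reference introduces (Benjamini-Hochberg) is relevant when one evaluation compares many algorithm pairs at once. A minimal sketch, with illustrative p-values; it returns the indices of the hypotheses rejected at level alpha:

```python
def benjamini_hochberg(pvals, alpha=0.05):
    """Indices of hypotheses rejected while controlling the FDR at alpha."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    # Find the largest rank k with p_(k) <= (k / m) * alpha,
    # then reject the k hypotheses with the smallest p-values.
    k_max = 0
    for rank, i in enumerate(order, start=1):
        if pvals[i] <= rank / m * alpha:
            k_max = rank
    return sorted(order[:k_max])

print(benjamini_hochberg([0.001, 0.008, 0.039, 0.041, 0.6]))
```

Note that the third p-value (0.039) is below 0.05 yet not rejected: the step-up thresholds are stricter than a single-test cutoff, which is how the expected proportion of false rejections stays bounded.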
Journal Article

Statistical Comparisons of Classifiers over Multiple Data Sets

TL;DR: A set of simple, yet safe and robust non-parametric tests for statistical comparisons of classifiers is recommended: the Wilcoxon signed ranks test for comparison of two classifiers and the Friedman test with the corresponding post-hoc tests for comparisons of more classifiers over multiple data sets.
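The Wilcoxon signed-rank test recommended above compares two algorithms paired across datasets. A sketch of the test statistic with made-up accuracy scores; ties in the absolute differences are ignored here, and in practice a statistics library (e.g. scipy.stats.wilcoxon) would supply the p-value:

```python
# Paired comparison of two recommenders' accuracy over six datasets.
a = [0.81, 0.79, 0.84, 0.90, 0.77, 0.83]   # algorithm A, per dataset
b = [0.78, 0.80, 0.80, 0.85, 0.75, 0.79]   # algorithm B, per dataset

# Rank the nonzero paired differences by absolute size (rank 1 = smallest),
# then sum the ranks of positive and negative differences separately.
diffs = [x - y for x, y in zip(a, b) if x != y]
ranked = sorted(range(len(diffs)), key=lambda i: abs(diffs[i]))
w_plus = sum(r + 1 for r, i in enumerate(ranked) if diffs[i] > 0)
w_minus = sum(r + 1 for r, i in enumerate(ranked) if diffs[i] < 0)
print(w_plus, w_minus)
```

A lopsided split between the two rank sums (here 20 vs. 1) is what the test converts into evidence that one algorithm consistently outperforms the other.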
Proceedings Article

Item-based collaborative filtering recommendation algorithms

TL;DR: This paper analyzes item-based collaborative filtering techniques and suggests that item-based algorithms provide dramatically better performance than user-based algorithms, while at the same time providing better quality than the best available user-based algorithms.
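The item-based approach analyzed in this reference scores items by their similarity to items the user has already rated. A minimal sketch using cosine similarity over co-rating users and a similarity-weighted average; the tiny rating matrix is invented for illustration:

```python
import math

# Item-based CF sketch on a tiny user -> {item: rating} matrix.
ratings = {
    "u1": {"i1": 4, "i2": 5, "i3": 1},
    "u2": {"i1": 5, "i2": 4},
    "u3": {"i1": 1, "i3": 5},
}

def cosine(item_a, item_b):
    """Cosine similarity over users who rated both items."""
    common = [u for u in ratings if item_a in ratings[u] and item_b in ratings[u]]
    if not common:
        return 0.0
    dot = sum(ratings[u][item_a] * ratings[u][item_b] for u in common)
    na = math.sqrt(sum(ratings[u][item_a] ** 2 for u in common))
    nb = math.sqrt(sum(ratings[u][item_b] ** 2 for u in common))
    return dot / (na * nb)

def predict(user, item):
    """Similarity-weighted average of the user's ratings on other items."""
    num = den = 0.0
    for other, r in ratings[user].items():
        s = cosine(item, other)
        num += s * r
        den += abs(s)
    return num / den if den else 0.0

print(round(predict("u2", "i3"), 2))
```

The performance claim in the TL;DR comes from the fact that item-item similarities can be precomputed offline, whereas user-based neighborhoods must be found at prediction time.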
Journal Article

A new measure of rank correlation

Maurice G. Kendall, 01 Jun 1938
TL;DR: Rank correlation as mentioned in this paper is a measure of similarity between two rankings of the same set of individuals, and it has been used in psychological work to compare two different rankings of individuals in order to indicate similarity of taste.
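The measure this reference introduces, Kendall's tau, is commonly used in recommender evaluation to compare a system's ranking against a reference ranking. A minimal sketch of the tau-a variant (no tie correction), counting concordant and discordant pairs directly; the two example rankings are invented:

```python
from itertools import combinations

def kendall_tau(x, y):
    """Kendall's tau-a: (concordant - discordant) / total pairs."""
    conc = disc = 0
    for i, j in combinations(range(len(x)), 2):
        s = (x[i] - x[j]) * (y[i] - y[j])
        if s > 0:
            conc += 1      # the pair is ordered the same way in both rankings
        elif s < 0:
            disc += 1      # the pair is ordered oppositely
    n_pairs = len(x) * (len(x) - 1) // 2
    return (conc - disc) / n_pairs

# Two judges ranking the same four items, agreeing on all but one swap.
print(kendall_tau([1, 2, 3, 4], [1, 3, 2, 4]))
```

Tau ranges from 1 (identical rankings) to -1 (exactly reversed), which makes it a natural similarity-of-taste score in the sense the TL;DR describes.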
Journal Article

Evaluating collaborative filtering recommender systems

TL;DR: The key decisions in evaluating collaborative filtering recommender systems are reviewed: the user tasks being evaluated, the types of analysis and datasets being used, the ways in which prediction quality is measured, the evaluation of prediction attributes other than quality, and the user-based evaluation of the system as a whole.
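Among the ways of measuring prediction quality that this review covers, top-N metrics such as precision at k are among the simplest. A minimal sketch with an invented recommendation list and relevance set:

```python
def precision_at_k(recommended, relevant, k):
    """Fraction of the top-k recommended items that are relevant."""
    top_k = recommended[:k]
    return sum(1 for item in top_k if item in relevant) / k

# One relevant item appears in the top 3 of this toy ranked list.
print(precision_at_k(["i3", "i1", "i7", "i2"], {"i1", "i2"}, k=3))
```

Error metrics such as RMSE score predicted rating values, while precision at k scores the ranked list the user actually sees; the review's point is that the right choice depends on the user task being evaluated.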