scispace - formally typeset
Open Access

A collaborative filtering approach to predict web pages of interest from navigation patterns of past users within an academic website

Reads0
Chats0
TLDR
It is demonstrated that association rules that show the correlation relationships between user navigation patterns and web pages they find interesting can be transformed into collaborative filtering data, and the weighted averages scheme more accurately computes predictions of user interests than the simple averages scheme does.
Abstract
This dissertation is a simulation study of factors and techniques involved in designing hyperlink recommender systems that recommend to users, web pages that past users with similar navigation behaviors found interesting. The methodology involves identification of pertinent factors or techniques, and for each one, addresses the following questions: (a) room for improvement; (b) better approach, if any; and (c) performance characteristics of the technique in environments that hyperlink recommender systems operate in. The following four problems are addressed: Web page classification. A new metric (PageRank × Inverse Links-to-Word count ratio) is proposed for classifying web pages as content or navigation, to help in the discovery of user navigation behaviors from web user access logs. Results of a small user study suggest that this metric leads to desirable results. Data mining. A new apriori algorithm for mining association rules from large databases is proposed. The new algorithm addresses the problem of scaling of the classical apriori algorithm by eliminating an expensive join step, and applying the apriori property to every row of the database. In this study, association rules show the correlation relationships between user navigation behaviors and web pages they find interesting. The new algorithm has better space complexity than the classical one, and better time efficiency under some conditions and comparable time efficiency under other conditions. Prediction models for user interests. We demonstrate that association rules that show the correlation relationships between user navigation patterns and web pages they find interesting can be transformed into collaborative filtering data. We investigate collaborative filtering prediction models based on two approaches for computating prediction scores: using simple averages and weighted averages. Our findings suggest that the weighted averages scheme more accurately computes predictions of user interests than the simple averages scheme does. Clustering. Clustering techniques are frequently applied in the design of personalization systems. We studied the performance of the CLARANS clustering algorithm in high dimensional space in relation to the PAM and CLARA clustering algorithms. While CLARA had the best time performance, CLARANS resulted in clusters with the lowest intra-cluster dissimilarities, and so was most effective in this regard.

read more

Content maybe subject to copyright    Report

Citations
More filters
Patent

Systems and methods for recommending a list of urls

Hilary Mason, +1 more
TL;DR: In this paper, the authors present a method for providing a recommended list of uniform resource locators (URLs) responsive to a Uniform Resource Locator (URL) by identifying, by a server, a plurality of users that clicked on an encoded URL link corresponding to a URL.
Patent

Systems and methods for influence of a user on content shared via 7 encoded uniform resource locator (url) link

Hilary Mason, +1 more
TL;DR: In this article, the authors present systems and methods for measuring influence of a user to identify what level of engagement the user drives to content when the user shares content with other users, such as via encoded links.
Patent

Method and system for autocompletion of multiple fields in electronic forms

TL;DR: In this paper, a method and system for autocompletion of multiple fields in electronic forms that generates "inference networks" (Bayesian networks), and evaluates them to suggest likely options for user selection while filling out fields in corresponding electronic forms is presented.
References
More filters
Book

Data Mining: Concepts and Techniques

TL;DR: This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.
Book ChapterDOI

Learning internal representations by error propagation

TL;DR: This chapter contains sections titled: The Problem, The Generalized Delta Rule, Simulation Results, Some Further Generalizations, Conclusion.
Proceedings ArticleDOI

Mining association rules between sets of items in large databases

TL;DR: An efficient algorithm is presented that generates all significant association rules between items in the database of customer transactions and incorporates buffer management and novel estimation and pruning techniques.
Book

Learning internal representations by error propagation

TL;DR: In this paper, the problem of the generalized delta rule is discussed and the Generalized Delta Rule is applied to the simulation results of simulation results in terms of the generalized delta rule.
Journal ArticleDOI

Indexing by Latent Semantic Analysis

TL;DR: A new method for automatic indexing and retrieval to take advantage of implicit higher-order structure in the association of terms with documents (“semantic structure”) in order to improve the detection of relevant documents on the basis of terms found in queries.
Related Papers (5)