Learning and Revising User Profiles: The Identification of Interesting Web Sites

doi:10.1023/A:1007369909943

Journal Article•DOI•

Michael J. Pazzani¹, Daniel Billsus¹•Institutions (1)

University of California, Irvine¹

01 Jun 1997-Machine Learning (Kluwer Academic Publishers)-Vol. 27, Iss: 3, pp 313-331

TL;DR: The use of a naive Bayesian classifier is described, and it is demonstrated that it can incrementally learn profiles from user feedback on the interestingness of Web sites and may easily be extended to revise user provided profiles.

Abstract: We discuss algorithms for learning and revising user profiles that can determine which World Wide Web sites on a given topic would be interesting to a user. We describe the use of a naive Bayesian classifier for this task, and demonstrate that it can incrementally learn profiles from user feedback on the interestingness of Web sites. Furthermore, the Bayesian classifier may easily be extended to revise user provided profiles. In an experimental evaluation we compare the Bayesian classifier to computationally more intensive alternatives, and show that it performs at least as well as these approaches throughout a range of different domains. In addition, we empirically analyze the effects of providing the classifier with background knowledge in form of user defined profiles and examine the use of lexical knowledge for feature selection. We find that both approaches can substantially increase the prediction accuracy.
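The incremental learning described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes Boolean word-presence features, add-one (Laplace) smoothing, and a fixed vocabulary chosen in advance, and the class name `IncrementalNaiveBayes`, the labels, and the example words are all hypothetical. Revising a user-provided profile and lexical feature selection are not covered here.

```python
import math
from collections import defaultdict


class IncrementalNaiveBayes:
    """Naive Bayes over Boolean word-presence features, updated one page at a time.

    Illustrative sketch only; smoothing and feature choice are assumptions.
    """

    def __init__(self, vocabulary):
        self.vocabulary = set(vocabulary)
        # Number of rated pages seen per class.
        self.class_counts = defaultdict(int)
        # class -> word -> number of pages of that class containing the word.
        self.word_counts = defaultdict(lambda: defaultdict(int))

    def update(self, words, label):
        """Fold one rated page into the counts (the incremental step)."""
        self.class_counts[label] += 1
        for w in self.vocabulary & set(words):
            self.word_counts[label][w] += 1

    def predict_proba(self, words):
        """Return P(class | page) for every class seen so far."""
        present = self.vocabulary & set(words)
        total = sum(self.class_counts.values())
        log_scores = {}
        for label, n in self.class_counts.items():
            score = math.log(n / total)  # class prior
            for w in self.vocabulary:
                # Laplace-smoothed estimate of P(word present | class).
                p = (self.word_counts[label][w] + 1) / (n + 2)
                score += math.log(p if w in present else 1.0 - p)
            log_scores[label] = score
        # Normalize in log space for numerical stability.
        m = max(log_scores.values())
        z = sum(math.exp(s - m) for s in log_scores.values())
        return {label: math.exp(s - m) / z for label, s in log_scores.items()}


# Usage: two rated pages, then a prediction for an unrated one.
nb = IncrementalNaiveBayes(["goat", "sheep", "stock", "market"])
nb.update(["goat", "sheep", "farm"], "interesting")
nb.update(["stock", "market"], "not interesting")
probs = nb.predict_proba(["goat", "cheese"])
print(round(probs["interesting"], 3))  # 0.8
```

Because the model is just a table of counts, each new rating is absorbed by `update` without retraining on earlier pages, which is what makes the incremental setting cheap.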

Citations

Journal Article•DOI•

Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions

Gediminas Adomavicius¹, Alexander Tuzhilin•Institutions (1)

University of Minnesota¹

01 Jun 2005-IEEE Transactions on Knowledge and Data Engineering

TL;DR: This paper presents an overview of the field of recommender systems and describes the current generation of recommendation methods that are usually classified into the following three main categories: content-based, collaborative, and hybrid recommendation approaches.

Abstract: This paper presents an overview of the field of recommender systems and describes the current generation of recommendation methods that are usually classified into the following three main categories: content-based, collaborative, and hybrid recommendation approaches. This paper also describes various limitations of current recommendation methods and discusses possible extensions that can improve recommendation capabilities and make recommender systems applicable to an even broader range of applications. These extensions include, among others, an improvement of understanding of users and items, incorporation of the contextual information into the recommendation process, support for multicriteria ratings, and a provision of more flexible and less intrusive types of recommendations.

9,873 citations

Cites background or methods from "Learning and Revising User Profiles..."

...On the other hand, [77] use a Bayesian classifier in order to estimate the probability that a document...
[...]
...accuracy [77]....
[...]
...For example, based on a set of Web pages that were rated as “relevant” or “irrelevant” by the user, [77] use the naïve Bayesian classifier [31] to classify unrated Web pages....
[...]
...In addition to using traditional profile features, such as keywords and simple user demographics [69, 77], more advanced profiling techniques based on data mining rules [1, 34], sequences [63], and signatures [26] that describe user’s interests can be used to build user profiles....
[...]
...(7) Moreover, [77] use the assumption that keywords are independent and, therefore, the above probability is proportional to...
[...]

Journal Article•DOI•

A survey of collaborative filtering techniques

Xiaoyuan Su¹, Taghi M. Khoshgoftaar¹•Institutions (1)

Florida Atlantic University¹

01 Jan 2009-Advances in Artificial Intelligence

TL;DR: From basic techniques to the state-of-the-art, this paper attempts to present a comprehensive survey for CF techniques, which can be served as a roadmap for research and practice in this area.

Abstract: As one of the most successful approaches to building recommender systems, collaborative filtering (CF) uses the known preferences of a group of users to make recommendations or predictions of the unknown preferences for other users. In this paper, we first introduce CF tasks and their main challenges, such as data sparsity, scalability, synonymy, gray sheep, shilling attacks, privacy protection, etc., and their possible solutions. We then present three main categories of CF techniques: memory-based, model-based, and hybrid CF algorithms (that combine CF with other recommendation techniques), with examples for representative algorithms of each category, and analysis of their predictive performance and their ability to address the challenges. From basic techniques to the state-of-the-art, we attempt to present a comprehensive survey for CF techniques, which can be served as a roadmap for research and practice in this area.

3,406 citations

Cites methods from "Learning and Revising User Profiles..."

...A content-based recommender then uses heuristic methods or classification algorithms to make recommendations [112]....
[...]

Journal Article•DOI•

Recommender systems survey

Jesús Bobadilla¹, Fernando Ortega¹, Antonio Hernando¹, Abraham Gutiérrez¹•Institutions (1)

Technical University of Madrid¹

01 Jul 2013-Knowledge Based Systems

TL;DR: An overview of recommender systems as well as collaborative filtering methods and algorithms is provided, which explains their evolution, provides an original classification for these systems, identifies areas of future implementation and develops certain areas selected for past, present or future importance.

Abstract: Recommender systems have developed in parallel with the web. They were initially based on demographic, content-based and collaborative filtering. Currently, these systems are incorporating social information. In the future, they will use implicit, local and personal information from the Internet of things. This article provides an overview of recommender systems as well as collaborative filtering methods and algorithms; it also explains their evolution, provides an original classification for these systems, identifies areas of future implementation and develops certain areas selected for past, present or future importance.

2,639 citations

Cites methods from "Learning and Revising User Profiles..."

...This task is resolved traditionally by using heuristic methods [198,15,79] or classification algorithms, such us: rule induction [65,119], nearest neighbors methods [236,27], Rocchio’s algorithm [131,16], linear classifiers [113], and probabilistic methods [175,160,84]....
[...]

Book Chapter•DOI•

Content-based recommendation systems

Michael J. Pazzani¹, Daniel Billsus²•Institutions (2)

Rutgers University¹, FX Palo Alto Laboratory²

01 Jan 2007

TL;DR: This chapter discusses content-based recommendation systems, i.e., systems that recommend an item to a user based upon a description of the item and a profile of the user's interests, which are used in a variety of domains ranging from recommending web pages, news articles, restaurants, television programs, and items for sale.

Abstract: This chapter discusses content-based recommendation systems, i.e., systems that recommend an item to a user based upon a description of the item and a profile of the user's interests. Content-based recommendation systems may be used in a variety of domains ranging from recommending web pages, news articles, restaurants, television programs, and items for sale. Although the details of various systems differ, content-based recommendation systems share in common a means for describing the items that may be recommended, a means for creating a profile of the user that describes the types of items the user likes, and a means of comparing items to the user profile to determine what to recommend. The profile is often created and updated automatically in response to feedback on the desirability of items that have been presented to the user.

2,428 citations

Cites background or methods from "Learning and Revising User Profiles..."

...Some representative systems included WebWatcher [16] and Syskill & Webert [29]....
[...]
..., for learning a user profile from unstructured text ([15], [3], [29])....
[...]
...The naïve Bayes classifier has been used in several content-based recommendation systems including Syskill & Webert [29]....
[...]
...Arguably, the decision tree bias is not ideal for unstructured text classification tasks [29]....
[...]

Book Chapter•DOI•

Content-based Recommender Systems: State of the Art and Trends

Pasquale Lops¹, Marco de Gemmis¹, Giovanni Semeraro¹•Institutions (1)

University of Bari¹

01 Jan 2011

TL;DR: The chapter describes the role of User Generated Content as a way of taking into account evolving vocabularies, and the challenge of feeding users with serendipitous recommendations, that is to say surprisingly interesting items that they might not have otherwise discovered.

Abstract: Recommender systems have the effect of guiding users in a personalized way to interesting objects in a large space of possible options. Content-based recommendation systems try to recommend items similar to those a given user has liked in the past. Indeed, the basic process performed by a content-based recommender consists in matching up the attributes of a user profile in which preferences and interests are stored, with the attributes of a content object (item), in order to recommend to the user new interesting items. This chapter provides an overview of content-based recommender systems, with the aim of imposing a degree of order on the diversity of the different aspects involved in their design and implementation. The first part of the chapter presents the basic concepts and terminology of content-based recommender systems, a high level architecture, and their main advantages and drawbacks. The second part of the chapter provides a review of the state of the art of systems adopted in several application domains, by thoroughly describing both classical and advanced techniques for representing items and user profiles. The most widely adopted techniques for learning user profiles are also presented. The last part of the chapter discusses trends and future research which might lead towards the next generation of systems, by describing the role of User Generated Content as a way for taking into account evolving vocabularies, and the challenge of feeding users with serendipitous recommendations, that is to say surprisingly interesting items that they might not have otherwise discovered.

1,582 citations

Cites background or methods from "Learning and Revising User Profiles..."

...The naïve Bayes classifier has been used in several content-based recommendation systems, such as Syskill & Webert [70, 68], NewsDude [12], Daily Learner [13], LIBRA [65] and ITR [27, 83]....
[...]
...used in the Syskill & Webert [70, 68] recommender system....
[...]
...Personal WebWatcher [62, 63], Syskill & Webert [70, 68], ifWeb [4], Amalthea [66], and WebMate [23]....
[...]

References

Book Chapter•DOI•

Learning internal representations by error propagation

David E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams

01 Jan 1988

TL;DR: This chapter contains sections titled: The Problem, The Generalized Delta Rule, Simulation Results, Some Further Generalizations, Conclusion.

Abstract: This chapter contains sections titled: The Problem, The Generalized Delta Rule, Simulation Results, Some Further Generalizations, Conclusion

17,604 citations

"Learning and Revising User Profiles..." refers to methods in this paper

...We also use multi-layer networks trained with error backpropagation (Rumelhart et al., 1986)....
[...]

Journal Article•DOI•

Induction of Decision Trees

J. R. Quinlan

25 Mar 1986-Machine Learning

TL;DR: This paper summarizes an approach to synthesizing decision trees that has been used in a variety of systems, describes one such system, ID3, in detail, and discusses a reported shortcoming of the basic algorithm.

Abstract: The technology for building knowledge-based systems by inductive inference from examples has been demonstrated successfully in several practical applications. This paper summarizes an approach to synthesizing decision trees that has been used in a variety of systems, and it describes one such system, ID3, in detail. Results from recent studies show ways in which the methodology can be modified to deal with information that is noisy and/or incomplete. A reported shortcoming of the basic algorithm is discussed and two means of overcoming it are compared. The paper concludes with illustrations of current research directions.

17,177 citations

Journal Article•DOI•

Pattern Classification and Scene Analysis.

Ulf Grenander, Richard O. Duda, Peter E. Hart

01 Sep 1974-Journal of the American Statistical Association

14,948 citations

Book•

Pattern classification and scene analysis

Richard O. Duda, Peter E. Hart

01 Jan 1973

TL;DR: A unified, comprehensive and up-to-date treatment of both statistical and descriptive methods for pattern recognition, covering Bayesian decision theory, supervised and unsupervised learning, nonparametric techniques, discriminant analysis, clustering, preprocessing of pictorial data, spatial filtering, shape description techniques, perspective transformations, projective invariants, linguistic procedures, and artificial intelligence techniques for scene analysis.

Abstract: Provides a unified, comprehensive and up-to-date treatment of both statistical and descriptive methods for pattern recognition. The topics treated include Bayesian decision theory, supervised and unsupervised learning, nonparametric techniques, discriminant analysis, clustering, preprocessing of pictorial data, spatial filtering, shape description techniques, perspective transformations, projective invariants, linguistic procedures, and artificial intelligence techniques for scene analysis.

13,647 citations

Book•

Learning internal representations by error propagation

David E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams

03 Jan 1986

TL;DR: This chapter contains sections titled: The Problem, The Generalized Delta Rule, Simulation Results, Some Further Generalizations, Conclusion.

Abstract: This chapter contains sections titled: The Problem, The Generalized Delta Rule, Simulation Results, Some Further Generalizations, Conclusion

13,579 citations