scispace - formally typeset
Journal ArticleDOI

Interval Set Clustering of Web Users with Rough K -Means

Pawan Lingras, +1 more
- Vol. 23, Iss: 1, pp 5-16
Reads0
Chats0
TLDR
A variation of the K-means clustering algorithm based on properties of rough sets is proposed, which represents clusters as interval or rough sets.
Abstract
Data collection and analysis in web mining faces certain unique challenges. Due to a variety of reasons inherent in web browsing and web logging, the likelihood of bad or incomplete data is higher than conventional applications. The analytical techniques in web mining need to accommodate such data. Fuzzy and rough sets provide the ability to deal with incomplete and approximate information. Fuzzy set theory has been shown to be useful in three important aspects of web and data mining, namely clustering, association, and sequential analysis. There is increasing interest in research on clustering based on rough set theory. Clustering is an important part of web mining that involves finding natural groupings of web resources or web users. Researchers have pointed out some important differences between clustering in conventional applications and clustering in web mining. For example, the clusters and associations in web mining do not necessarily have crisp boundaries. As a result, researchers have studied the possibility of using fuzzy sets in web mining clustering applications. Recent attempts have used genetic algorithms based on rough set theory for clustering. However, the genetic algorithms based clustering may not be able to handle the large amount of data typical in a web mining application. This paper proposes a variation of the K-means clustering algorithm based on properties of rough sets. The proposed algorithm represents clusters as interval or rough sets. The paper also describes the design of an experiment including data collection and the clustering process. The experiment is used to create interval set representations of clusters of web visitors.

read more

Citations
More filters
Journal ArticleDOI

Rudiments of rough sets

TL;DR: The basic concepts of rough set theory are presented and some rough set-based research directions and applications are pointed out, indicating that the rough set approach is fundamentally important in artificial intelligence and cognitive sciences.
Journal ArticleDOI

Rough sets and Boolean reasoning

TL;DR: Methods based on the combination of rough sets and Boolean reasoning with applications in pattern recognition, machine learning, data mining and conflict analysis are discussed.
Book

Computational Intelligence and Feature Selection: Rough and Fuzzy Approaches

Richard Jensen, +1 more
TL;DR: Computational Intelligence and Feature Selection provides a high level audience with both the background and fundamental ideas behind feature selection with an emphasis on those techniques based on rough and fuzzy sets, including their hybridizations.
Journal ArticleDOI

Rough–Fuzzy Collaborative Clustering

TL;DR: A novel clustering architecture is introduced, in which several subsets of patterns can be processed together with an objective of finding a common structure, and the required communication links are established at the level of cluster prototypes and partition matrices.
Journal ArticleDOI

Rough Set Based Generalized Fuzzy $C$ -Means Algorithm and Quantitative Indices

TL;DR: The RFPCM comprises a judicious integration of the principles of rough and fuzzy sets that incorporates both probabilistic and possibilistic memberships simultaneously to avoid the problems of noise sensitivity of fuzzy C-means and the coincident clusters of PCM.
References
More filters

Some methods for classification and analysis of multivariate observations

TL;DR: The k-means algorithm as mentioned in this paper partitions an N-dimensional population into k sets on the basis of a sample, which is a generalization of the ordinary sample mean, and it is shown to give partitions which are reasonably efficient in the sense of within-class variance.
Book

Rough Sets: Theoretical Aspects of Reasoning about Data

TL;DR: Theoretical Foundations.
Journal ArticleDOI

Rough sets

TL;DR: This approach seems to be of fundamental importance to artificial intelligence (AI) and cognitive sciences, especially in the areas of machine learning, knowledge acquisition, decision analysis, knowledge discovery from databases, expert systems, decision support systems, inductive reasoning, and pattern recognition.
Journal ArticleDOI

A possibilistic approach to clustering

TL;DR: An appropriate objective function whose minimum will characterize a good possibilistic partition of the data is constructed, and the membership and prototype update equations are derived from necessary conditions for minimization of the criterion function.