scispace - formally typeset
Journal ArticleDOI

Semantic preprocessing for mining sensor streams from heterogeneous environments

Jason J. Jung
- 01 May 2011 - 
- Vol. 38, Iss: 5, pp 6107-6111
Reads0
Chats0
TLDR
An ontology-based data preprocessing scheme, which is composed of two main phases: (i) session identification and (ii) error detection, which can find out not only relationships between sensor streams but also temporal dynamics of a data stream.
Abstract
? A number of heterogeneous sensor streams have been efficiently pre-processed for better services. Many studies have tried to employ data mining methods to discover useful patterns and knowledge from data streams on sensor networks. However, it is difficult to apply such data mining methods to the sensor streams intermixed from heterogeneous sensor networks. In this paper, to improve the performance of conventional data mining methods, we propose an ontology-based data preprocessing scheme, which is composed of two main phases: (i) session identification and (ii) error detection. The ontology can provide and describe semantics of data measured by each sensor. Thus, by comparing the semantics, we can find out not only relationships between sensor streams but also temporal dynamics of a data stream. To evaluate the proposed method, we have collected sensor streams from in our building during 30days. By using two well-known data mining methods (i.e., co-occurrence pattern and sequential pattern), the results from raw sensor streams and ones from sensor streams with preprocessing were compared with respect to two measurements recall and precision.

read more

Citations
More filters
Journal ArticleDOI

MuDi-Stream

TL;DR: The proposed MuDi-Stream algorithm improves clustering quality in multi-density environments and is evaluated on various synthetic and real-world datasets using different quality metrics and further, scalability results are compared.
Journal ArticleDOI

CUDT: A CUDA Based Decision Tree Algorithm

TL;DR: This paper designs and implements a new parallelized decision tree algorithm on a CUDA (compute unified device architecture), which is a GPGPU solution provided by NVIDIA, and shows that CUDT is 5∼55 times faster than Weka-j48 and is 18 times speedup than SPRINT for large data set.

Semantic Sensor Web

TL;DR: The semantic sensor Web (SSW) proposes that sensor data be annotated with semantic metadata that will both increase interoperability and provide contextual information essential for situational knowledge.
Journal ArticleDOI

Real-Time Traffic Flow Management Based on Inter-Object Communication: a Case Study at Intersection

TL;DR: This study designs a model which communicating among objects to improve traffic flow at intersection with real time problem, and shows the potential results comparing with the existing traffic management system.
Journal ArticleDOI

Ubiquitous conference management system for mobile recommendation services based on mobilizing social networks: A case study of u-conference

TL;DR: This paper mobilize all possible on- and off-line social networks to build an ego-centric social network and implemented the proposed system, which is called u-conference system, by collecting the social network dataset from online and offline.
References
More filters
Book

Data Mining: Concepts and Techniques

TL;DR: This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.
Proceedings ArticleDOI

Mining sequential patterns

TL;DR: Three algorithms are presented to solve the problem of mining sequential patterns over databases of customer transactions, and empirically evaluating their performance using synthetic data shows that two of them have comparable performance.
Journal Article

Data Mining Concepts and Techniques

TL;DR: Data mining is the search for new, valuable, and nontrivial information in large volumes of data, a cooperative effort of humans and computers that is possible to put data-mining activities into one of two categories: Predictive data mining, which produces the model of the system described by the given data set, or Descriptive data mining which produces new, nontrivials information based on the available data set.
Book ChapterDOI

Mining Sequential Patterns: Generalizations and Performance Improvements

TL;DR: This work adds time constraints that specify a minimum and/or maximum time period between adjacent elements in a pattern, and relax the restriction that the items in an element of a sequential pattern must come from the same transaction.
Journal ArticleDOI

Guest Editors' Introduction: Overview of Sensor Networks

TL;DR: Wireless sensor networks could advance many scientific pursuits while providing a vehicle for enhancing various forms of productivity, including manufacturing, agriculture, construction, and transportation.
Related Papers (5)