scispace - formally typeset
Open AccessProceedings Article

Using Probabilistic Models for Data Management in Acquisitional Environments

Reads0
Chats0
TLDR
A suite of techniques based on probabilistic models that are designed to allow database to tolerate noise and loss are discussed, based on exploiting correlations to predict missing values and identify outliers.
Abstract
Traditional database systems, particularly those focused on capturing and managing data from the real world, are poorly equipped to deal with the noise, loss, and uncertainty in data. We discuss a suite of techniques based on probabilistic models that are designed to allow database to tolerate noise and loss. These techniques are based on exploiting correlations to predict missing values and identify outliers. Interestingly, correlations also provide a way to give approximate answers to users at a significantly lower cost and enable a range of new types of queries over the correlation structure itself. We illustrate a host of applications for our new techniques and queries, ranging from sensor networks to network monitoring to data stream management. We also present a unified architecture for integrating such models into database systems, focusing in particular on acquisitional systems where the cost of capturing data (e.g., from sensors) is itself a significant part of the query processing cost.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Longitudinal study of a building-scale RFID ecosystem

TL;DR: It is found that a building-scale EPC Class-1 Generation-2 RFID deployment produces a very manageable amount of data overall, but with orders of magnitude difference among various participants and objects, and the tag detection rates tend to be low with high variance across the type of tag, participant and object.
Journal ArticleDOI

A Bayesian Inference-Based Framework for RFID Data Cleansing

TL;DR: A Bayesian inference-based framework for cleaning RFID raw data, an n-state detection model, and a Metropolis-Hastings sampler with constraints, which incorporates constraint management to clean RFID data with high efficiency and accuracy are proposed.

Probabilistic RFID Data Management

TL;DR: It is demonstrated, through experiments with real RFID traces collected on a small antenna deployment, that PEEX significantly improves event detection rates compared with deterministic techniques, and provides applications a flexible trade-off between event recall and precision.
Proceedings ArticleDOI

Distributed Real-Time Detection and Tracking of Homogeneous Regions in Sensor Networks

TL;DR: This paper proposes distributed algorithms to detect a group of sensors having similar underlying distribution over a period of time as a homogeneous region, approximate their boundary with a piece-wise linear curve and track the boundary in real-time.
Journal ArticleDOI

Resource-Optimized Quality-Assured Ambiguous Context Mediation Framework in Pervasive Environments

TL;DR: A resource-optimized, quality-assured context mediation framework for sensor networks based on efficient context-aware data fusion, information-theoretic reasoning, and selection of sensor parameters, leading to an optimal state estimation is proposed.
References
More filters
Book

Artificial Intelligence: A Modern Approach

TL;DR: In this article, the authors present a comprehensive introduction to the theory and practice of artificial intelligence for modern applications, including game playing, planning and acting, and reinforcement learning with neural networks.
Book

Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference

TL;DR: Probabilistic Reasoning in Intelligent Systems as mentioned in this paper is a complete and accessible account of the theoretical foundations and computational methods that underlie plausible reasoning under uncertainty, and provides a coherent explication of probability as a language for reasoning with partial belief.
Book

Dynamic Programming

TL;DR: The more the authors study the information processing aspects of the mind, the more perplexed and impressed they become, and it will be a very long time before they understand these processes sufficiently to reproduce them.
Journal ArticleDOI

Machine learning

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
MonographDOI

Causality: models, reasoning, and inference

TL;DR: The art and science of cause and effect have been studied in the social sciences for a long time as mentioned in this paper, see, e.g., the theory of inferred causation, causal diagrams and the identification of causal effects.
Related Papers (5)