scispace - formally typeset
Open AccessProceedings Article

Using Probabilistic Models for Data Management in Acquisitional Environments

Reads0
Chats0
TLDR
A suite of techniques based on probabilistic models that are designed to allow database to tolerate noise and loss are discussed, based on exploiting correlations to predict missing values and identify outliers.
Abstract
Traditional database systems, particularly those focused on capturing and managing data from the real world, are poorly equipped to deal with the noise, loss, and uncertainty in data. We discuss a suite of techniques based on probabilistic models that are designed to allow database to tolerate noise and loss. These techniques are based on exploiting correlations to predict missing values and identify outliers. Interestingly, correlations also provide a way to give approximate answers to users at a significantly lower cost and enable a range of new types of queries over the correlation structure itself. We illustrate a host of applications for our new techniques and queries, ranging from sensor networks to network monitoring to data stream management. We also present a unified architecture for integrating such models into database systems, focusing in particular on acquisitional systems where the cost of capturing data (e.g., from sensors) is itself a significant part of the query processing cost.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Management of probabilistic data: foundations and challenges

TL;DR: The foundations of managing data where the uncertainties are quantified as probabilities are described, and some fundamental theoretical result for query evaluation on probabilistic databases is presented.
BookDOI

Managing and Mining Sensor Data

TL;DR: Managing and Mining Sensor Data is a contributed volume by prominent leaders in this field, targeting advanced-level students in computer science as a secondary text book or reference, and practitioners and researchers working in the field will also find this book useful.
Journal ArticleDOI

ASAP: An Adaptive Sampling Approach to Data Collection in Sensor Networks

TL;DR: In this article, an adaptive sampling approach to energy-efficient periodic data collection in sensor networks is proposed, which uses a dynamically changing subset of the nodes as samplers such that the sensor readings of the sampler nodes are directly collected, whereas the values of the nonsampler nodes were predicted through the use of probabilistic models that are locally and periodically constructed.
Journal ArticleDOI

A unified approach to ranking in probabilistic databases

TL;DR: This paper contends that a single, specific ranking function may not suffice for probabilistic databases, and proposes two parameterized ranking functions, called PRFω and PRFe, that generalize or can approximate many of the previously proposed ranking functions.
Proceedings ArticleDOI

Towards correcting input data errors probabilistically using integrity constraints

TL;DR: It is shown that StreamClean handles a large class of input data errors, and corrects them sufficiently fast to keep-up with input rates of many mobile and pervasive applications, and that the probabilities assigned by StreamClean correspond to a user's intuitive notion of correctness.
References
More filters
Book

Artificial Intelligence: A Modern Approach

TL;DR: In this article, the authors present a comprehensive introduction to the theory and practice of artificial intelligence for modern applications, including game playing, planning and acting, and reinforcement learning with neural networks.
Book

Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference

TL;DR: Probabilistic Reasoning in Intelligent Systems as mentioned in this paper is a complete and accessible account of the theoretical foundations and computational methods that underlie plausible reasoning under uncertainty, and provides a coherent explication of probability as a language for reasoning with partial belief.
Book

Dynamic Programming

TL;DR: The more the authors study the information processing aspects of the mind, the more perplexed and impressed they become, and it will be a very long time before they understand these processes sufficiently to reproduce them.
Journal ArticleDOI

Machine learning

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
MonographDOI

Causality: models, reasoning, and inference

TL;DR: The art and science of cause and effect have been studied in the social sciences for a long time as mentioned in this paper, see, e.g., the theory of inferred causation, causal diagrams and the identification of causal effects.
Related Papers (5)