scispace - formally typeset
Open AccessProceedings Article

Expiring Data in a Warehouse

Reads0
Chats0
TLDR
A framework for incrementally removing warehouse data (without a need to fully recompute) is presented, and how the system should compensate when data is expired or other parameters changed is shown.
Abstract
Data warehouses collect data into materialized views for analysis. After some time, some of the data may no longer be needed or may not be of interest. In this paper, we handle this by expiring or removing unneeded materialized view tuples. A framework supporting such expiration is presented. Within it, a user or administrator can declaratively request expirations and can specify what type of modifications are expected from external sources. The latter can significantly increase the amount of data that can be expired. We present efficient algorithms for determining what data can be expired (data not needed for maintenance of other views), taking into account the types of updates that may occur

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Models and issues in data stream systems

TL;DR: The need for and research issues arising from a new model of data processing, where data does not take the form of persistent relations, but rather arrives in multiple, continuous, rapid, time-varying data streams are motivated.
Journal ArticleDOI

Continuous queries over data streams

TL;DR: A general and flexible architecture for query processing in the presence of data streams is specified, which captures most previous work on continuous queries and data streams, as well as related concepts such as triggers and materialized views.
Book

Data Streams: Models and Algorithms

TL;DR: This volume covers mining aspects of data streams comprehensively: each contributed chapter contains a survey on the topic, the key ideas in the field for that particular topic, and future research directions.
Book

Fundamentals of Data Warehouses

TL;DR: This book presents a comparative review of the state of the art and best current practice of data warehouses and offers a conceptual framework by which the architecture and quality of data warehouse efforts can be assessed and improved using enriched metadata management combined with advanced techniques from databases, business modeling, and artificial intelligence.
Journal Article

Designing data marts for data warehouses

TL;DR: This paper presents a method to support the identification and design of data marts by exploiting a goal-oriented process based on the Goal/Question/Metric paradigm developed at the University of Maryland.
References
More filters
Book

Foundations of databases

TL;DR: This book discusses Languages, Computability, and Complexity, and the Relational Model, which aims to clarify the role of Semantic Data Models in the development of Query Language Design.
Proceedings ArticleDOI

Optimal implementation of conjunctive queries in relational data bases

TL;DR: It is shown that while answering conjunctive queries is NP complete (general queries are PSPACE complete), one can find an implementation that is within a constant of optimal.
Journal ArticleDOI

Incomplete Information in Relational Databases

TL;DR: There are precise conditions that should be satisfied in a semantically meaningful extension of the usual relational operators, such as projection, selection, union, and join, from operators on relations to operators on tables with “null values” of various kinds allowed.
Proceedings ArticleDOI

Maintaining views incrementally

TL;DR: A counting algorithm that tracks the number of alternative derivations (counts) for each derived tuple in a view, and shows that the count for a tuple can be computed at little or no cost above the cost of deriving the tuple.