scispace - formally typeset
Journal ArticleDOI

NiagaraCQ: a scalable continuous query system for Internet databases

TLDR
The design of NiagaraCQ system is presented, some experimental results on the system's performance and scalability are given and other techniques including incremental evaluation of continuous queries, use of both pull and push models for detecting heterogeneous data source changes, and memory caching are employed.
Abstract
Continuous queries are persistent queries that allow users to receive new results when they become available. While continuous query systems can transform a passive web into an active environment, they need to be able to support millions of queries due to the scale of the Internet. No existing systems have achieved this level of scalability. NiagaraCQ addresses this problem by grouping continuous queries based on the observation that many web queries share similar structures. Grouped queries can share the common computation, tend to fit in memory and can reduce the I/O cost significantly. Furthermore, grouping on selection predicates can eliminate a large number of unnecessary query invocations. Our grouping technique is distinguished from previous group optimization approaches in the following ways. First, we use an incremental group optimization strategy with dynamic re-grouping. New queries are added to existing query groups, without having to regroup already installed queries. Second, we use a query-split scheme that requires minimal changes to a general-purpose query engine. Third, NiagaraCQ groups both change-based and timer-based queries in a uniform way. To insure that NiagaraCQ is scalable, we have also employed other techniques including incremental evaluation of continuous queries, use of both pull and push models for detecting heterogeneous data source changes, and memory caching. This paper presents the design of NiagaraCQ system and gives some experimental results on the system's performance and scalability.

read more

Content maybe subject to copyright    Report

Citations
More filters
Book

Data Mining: Concepts and Techniques

TL;DR: This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.
Proceedings ArticleDOI

Models and issues in data stream systems

TL;DR: The need for and research issues arising from a new model of data processing, where data does not take the form of persistent relations, but rather arrives in multiple, continuous, rapid, time-varying data streams are motivated.

Data Mining: Concepts and Techniques (2nd edition)

TL;DR: There have been many data mining books published in recent years, including Predictive Data Mining by Weiss and Indurkhya [WI98], Data Mining Solutions: Methods and Tools for Solving Real-World Problems by Westphal and Blaxton [WB98], Mastering Data Mining: The Art and Science of Customer Relationship Management by Berry and Linofi [BL99].
Journal ArticleDOI

TinyDB: an acquisitional query processing system for sensor networks

TL;DR: This work evaluates issues in the context of TinyDB, a distributed query processor for smart sensor devices, and shows how acquisitional techniques can provide significant reductions in power consumption on the authors' sensor devices.
Journal ArticleDOI

Data streams: algorithms and applications

TL;DR: Data Streams: Algorithms and Applications surveys the emerging area of algorithms for processing data streams and associated applications, which rely on metric embeddings, pseudo-random computations, sparse approximation theory and communication complexity.
References
More filters
Journal ArticleDOI

Multiple-query optimization

TL;DR: The results show that using multiple- query processing algorithms may reduce execution cost considerably, and the presentation and analysis of algorithms that can be used for multiple-query optimization are presented.
Proceedings ArticleDOI

Continuous queries over append-only databases

TL;DR: The techniques used in Tapestry are described, which do not depend on triggers and thus be implemented on any commercial database that supports SQL and are applicable to any append-only database.
Journal ArticleDOI

The architecture of an active database management system

TL;DR: This paper proposes an architecture for an active DBMS that supports Event-Condition-Action rules and develops an execution model that specifies how these rules are processed in the context of database transactions.
Journal ArticleDOI

Continual queries for Internet scale event-driven information delivery

TL;DR: The concept of continual queries, the design of a distributed event-driven continual query system-OpenCQ, and the initial implementation of OpenCQ on top of the distributed interoperable information mediation system DIOM are outlined.
Proceedings Article

On rules, procedures, caching and views in database systems

TL;DR: It is demonstrated that a simple rule system can be constructed that supports a more powerful view system than available in current commercial systems and that a rule system is a fundamental concept in a next generation DBMS, and it subsumes both views and procedures as special cases.
Related Papers (5)