NiagaraCQ: a scalable continuous query system for Internet databases

doi:10.1145/335191.335432

Vol. 29, Iss. 2, pp. 379-390

TLDR

This paper presents the design of the NiagaraCQ system and reports experimental results on its performance and scalability. To scale, the system also employs incremental evaluation of continuous queries, both pull and push models for detecting changes in heterogeneous data sources, and memory caching.

Abstract:

Continuous queries are persistent queries that allow users to receive new results when they become available. While continuous query systems can transform a passive web into an active environment, they need to be able to support millions of queries due to the scale of the Internet. No existing systems have achieved this level of scalability. NiagaraCQ addresses this problem by grouping continuous queries based on the observation that many web queries share similar structures. Grouped queries can share common computation, tend to fit in memory, and can reduce I/O cost significantly. Furthermore, grouping on selection predicates can eliminate a large number of unnecessary query invocations. Our grouping technique is distinguished from previous group optimization approaches in the following ways. First, we use an incremental group optimization strategy with dynamic re-grouping. New queries are added to existing query groups, without having to regroup already installed queries. Second, we use a query-split scheme that requires minimal changes to a general-purpose query engine. Third, NiagaraCQ groups both change-based and timer-based queries in a uniform way. To ensure that NiagaraCQ is scalable, we have also employed other techniques including incremental evaluation of continuous queries, use of both pull and push models for detecting heterogeneous data source changes, and memory caching. This paper presents the design of the NiagaraCQ system and gives some experimental results on the system's performance and scalability.
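The grouping idea in the abstract can be illustrated with a minimal Python sketch. It is not NiagaraCQ's implementation: the `Query` and `GroupManager` classes, the equality-only predicate form, and the stock-quote data are all assumptions made for illustration. Queries that differ only in the constant of a selection predicate share one group, a new query joins its group without touching installed queries, and a data change triggers only the queries whose constant matches.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Query:
    """Hypothetical continuous query: notify `user` when `attribute == constant` on `source`."""
    user: str
    source: str
    attribute: str
    constant: str

    def signature(self):
        # The structure shared by similar queries: everything except the constant.
        return (self.source, self.attribute)


class GroupManager:
    """Illustrative grouping of queries by shared selection-predicate structure."""

    def __init__(self):
        # signature -> constant -> list of users waiting on that constant
        self.groups = defaultdict(lambda: defaultdict(list))

    def install(self, query):
        # Incremental grouping: the query joins an existing group (or starts
        # a new one); already installed queries are never regrouped.
        self.groups[query.signature()][query.constant].append(query.user)

    def on_change(self, source, record):
        """Return the users to notify for one changed record (a dict)."""
        notified = []
        for (src, attribute), constants in self.groups.items():
            if src != source or attribute not in record:
                continue
            # One lookup per group replaces one invocation per query.
            notified.extend(constants.get(record[attribute], []))
        return notified


manager = GroupManager()
manager.install(Query("alice", "quotes.xml", "symbol", "INTC"))
manager.install(Query("bob", "quotes.xml", "symbol", "MSFT"))
manager.install(Query("carol", "quotes.xml", "symbol", "INTC"))
print(manager.on_change("quotes.xml", {"symbol": "INTC", "price": "42"}))
```

In this sketch the three queries form a single group, and a change to the hypothetical INTC record reaches only the two queries registered for that constant; the MSFT query is never invoked.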
