The Design of the Borealis Stream Processing Engine

Open Access Proceedings Article

pp. 277-289

TLDR

This paper outlines the basic design and functionality of Borealis, and presents a highly flexible and scalable QoS-based optimization model that operates across server and sensor networks and a new fault-tolerance model with flexible consistency-availability trade-offs.

Abstract:

Borealis is a second-generation distributed stream processing engine that is being developed at Brandeis University, Brown University, and MIT. Borealis inherits core stream processing functionality from Aurora [14] and distribution functionality from Medusa [51]. Borealis modifies and extends both systems in non-trivial and critical ways to provide advanced capabilities that are commonly required by newly-emerging stream processing applications. In this paper, we outline the basic design and functionality of Borealis. Through sample real-world applications, we motivate the need for dynamically revising query results and modifying query specifications. We then describe how Borealis addresses these challenges through an innovative set of features, including revision records, time travel, and control lines. Finally, we present a highly flexible and scalable QoS-based optimization model that operates across server and sensor networks and a new fault-tolerance model with flexible consistency-availability trade-offs.

Citations

Journal Article

Aurora: a new model and architecture for data stream management

Daniel J. Abadi, +8 more

TL;DR: Describes the basic processing model and architecture of Aurora, a new system for managing data streams for monitoring applications, along with its stream-oriented set of operators.

Journal Article

Security, privacy and trust in Internet of Things

Sabrina Sicari, +3 more

- 15 Jan 2015 -

Computer Networks

TL;DR: In this article, the authors present the main research challenges and the existing solutions in the field of IoT security, identify open issues, and suggest directions for future research.

Journal Article

Apache Flink: Stream and batch processing in a single engine

Paris Carbone, +9 more

- 01 Jan 2015 -

IEEE Data Engineering Bulletin

TL;DR: This paper discusses the approach to achieve high throughput for transactional query processing while allowing concurrent analytical queries, and presents its approach to distributed snapshot isolation and optimized two-phase commit protocols.

Proceedings Article

S4: Distributed Stream Computing Platform

Leonardo Neumeyer, +3 more

TL;DR: The architecture resembles the Actors model, providing semantics of encapsulation and location transparency, thus allowing applications to be massively concurrent while exposing a simple programming interface to application developers.

References

Proceedings Article

Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the World Wide Web

David R. Karger, +5 more

TL;DR: Presents a family of caching protocols for distributed networks that can be used to decrease or eliminate the occurrence of hot spots in the network, based on a special kind of hashing called consistent hashing.

Journal Article

Aurora: a new model and architecture for data stream management

Daniel J. Abadi, +8 more

TL;DR: Describes the basic processing model and architecture of Aurora, a new system for managing data streams for monitoring applications, along with its stream-oriented set of operators.

Proceedings Article

The dangers of replication and a solution

Jim Gray, +3 more

TL;DR: In this article, a two-tier replication algorithm is proposed that allows mobile (disconnected) applications to propose tentative update transactions that are later applied to a master copy to avoid the instability of other replication schemes.

Proceedings Article

TelegraphCQ: Continuous Dataflow Processing for an Uncertain World.

Sirish Chandrasekaran, +10 more

TL;DR: The next generation Telegraph system, called TelegraphCQ, is focused on meeting the challenges that arise in handling large streams of continuous queries over high-volume, highly-variable data streams and leverages the PostgreSQL open source code base.

Proceedings Article

Online aggregation

Joseph M. Hellerstein, +2 more

TL;DR: In this article, the authors propose an online aggregation interface that allows users to both observe the progress of their aggregation queries and control execution on the fly, and present a suite of techniques that extend a database system to meet these requirements.
