scispace - formally typeset
Open AccessProceedings Article

The Design of the Borealis Stream Processing Engine

TLDR
This paper outlines the basic design and functionality of Borealis, and presents a highly flexible and scalable QoS-based optimization model that operates across server and sensor networks and a new fault-tolerance model with flexible consistency-availability trade-offs.
Abstract
Borealis is a second-generation distributed stream processing engine that is being developed at Brandeis University, Brown University, and MIT. Borealis inherits core stream processing functionality from Aurora [14] and distribution functionality from Medusa [51]. Borealis modifies and extends both systems in non-trivial and critical ways to provide advanced capabilities that are commonly required by newly-emerging stream processing applications. In this paper, we outline the basic design and functionality of Borealis. Through sample real-world applications, we motivate the need for dynamically revising query results and modifying query specifications. We then describe how Borealis addresses these challenges through an innovative set of features, including revision records, time travel, and control lines. Finally, we present a highly flexible and scalable QoS-based optimization model that operates across server and sensor networks and a new fault-tolerance model with flexible consistency-availability trade-offs.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Aurora: a new model and architecture for data stream management

TL;DR: The basic processing model and architecture of Aurora, a new system to manage data streams for monitoring applications, are described and a stream-oriented set of operators are described.
Journal ArticleDOI

Security, privacy and trust in Internet of Things

TL;DR: In this article, the authors present the main research challenges and the existing solutions in the field of IoT security, identifying open issues and suggesting some hints for future research, and suggest some hints to future research.
Journal Article

Apache flink : Stream and batch processing in a single engine

TL;DR: This paper discusses the approach to achieve high throughput for transactional query processing while allowing concurrent analytical queries, and presents its approach to distributed snapshot isolation and optimized two-phase commit protocols.
Book ChapterDOI

C-store: a column-oriented DBMS

TL;DR: Preliminary performance data on a subset of TPC-H is presented and it is shown that the system the team is building, C-Store, is substantially faster than popular commercial products.
Proceedings ArticleDOI

S4: Distributed Stream Computing Platform

TL;DR: The architecture resembles the Actors model, providing semantics of encapsulation and location transparency, thus allowing applications to be massively concurrent while exposing a simple programming interface to application developers.
References
More filters
Proceedings ArticleDOI

Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the World Wide Web

TL;DR: A family of caching protocols for distrib-uted networks that can be used to decrease or eliminate the occurrence of hot spots in the network, based on a special kind of hashing that is called consistent hashing.
Journal ArticleDOI

Aurora: a new model and architecture for data stream management

TL;DR: The basic processing model and architecture of Aurora, a new system to manage data streams for monitoring applications, are described and a stream-oriented set of operators are described.
Proceedings ArticleDOI

The dangers of replication and a solution

TL;DR: In this article, a two-tier replication algorithm is proposed that allows mobile (disconnected) applications to propose tentative update transactions that are later applied to a master copy to avoid the instability of other replication schemes.
Proceedings Article

TelegraphCQ: Continuous Dataflow Processing for an Uncertain World.

TL;DR: The next generation Telegraph system, called TelegraphCQ, is focused on meeting the challenges that arise in handling large streams of continuous queries over high-volume, highly-variable data streams and leverages the PostgreSQL open source code base.
Proceedings ArticleDOI

Online aggregation

TL;DR: In this article, the authors propose an online aggregation interface that allows users to both observe the progress of their aggregation queries and control execution on the fly, and present a suite of techniques that extend a database system to meet these requirements.
Related Papers (5)