Horus: a flexible group communication system

doi:10.1145/227210.227229

Home
/
Papers
/
Horus: a flexible group communication system

Journal Article•DOI•

Horus: a flexible group communication system

Robbert van Renesse¹, Kenneth P. Birman¹, Silvano Maffeis¹•Institutions (1)

Cornell University¹

01 Apr 1996-Communications of The ACM (Cornell University)-Vol. 39, Iss: 4, pp 76-83

TL;DR: The Horus system offers flexible group communication support for distributed applications, allowing applications to only pay for services they use, and for groups with different communication needs to coexist in a single system.

read less

Abstract: The Horus system offers flexible group communication support for distributed applications. It is extensively layered and highly reconfigurable, allowing applications to only pay for services they use, and for groups with different communication needs to coexist in a single system. The approach encourages experimentation with new communication properties and incremental extension of the system, and enables us to support a variety of application-oriented interfaces.

...read moreread less

Content maybe subject to copyright Report

Citations

PDF

Open Access

More filters

Proceedings Article•

ZooKeeper: wait-free coordination for internet-scale systems

[...]

Patrick David Hunt¹, Mahadev Konar¹, Flavio Junqueira¹, Benjamin Reed¹•Institutions (1)

Yahoo!¹

23 Jun 2010

TL;DR: ZooKeeper provides a per client guarantee of FIFO execution of requests and linearizability for all requests that change the ZooKeeper state to enable the implementation of a high performance processing pipeline with read requests being satisfied by local servers.

...read moreread less

Abstract: In this paper, we describe ZooKeeper, a service for coordinating processes of distributed applications Since ZooKeeper is part of critical infrastructure, ZooKeeper aims to provide a simple and high performance kernel for building more complex coordination primitives at the client It incorporates elements from group messaging, shared registers, and distributed lock services in a replicated, centralized service The interface exposed by Zoo-Keeper has the wait-free aspects of shared registers with an event-driven mechanism similar to cache invalidations of distributed file systems to provide a simple, yet powerful coordination service The ZooKeeper interface enables a high-performance service implementation In addition to the wait-free property, ZooKeeper provides a per client guarantee of FIFO execution of requests and linearizability for all requests that change the ZooKeeper state These design decisions enable the implementation of a high performance processing pipeline with read requests being satisfied by local servers We show for the target workloads, 2:1 to 100:1 read to write ratio, that ZooKeeper can handle tens to hundreds of thousands of transactions per second This performance allows ZooKeeper to be used extensively by client applications

...read moreread less

1,637 citations

Cites background from "Horus: a flexible group communicati..."

...Horus [30] and Ensemble [31] are systems that evolved from ISIS....
[...]

Journal Article•DOI•

Group communication specifications: a comprehensive study

[...]

Gregory Chockler¹, Idit Keidar², Roman Vitenberg³•Institutions (3)

Hebrew University of Jerusalem¹, Massachusetts Institute of Technology², Technion – Israel Institute of Technology³

01 Dec 2001-ACM Computing Surveys

TL;DR: The specification framework presented in this article will help builders of group communication systems understand andspecify their service semantics; the extensive survey will allow them to compare their service to others, and serve as a unified framework for the classification, analysis, and comparison of group Communication systems.

...read moreread less

Abstract: View-oriented group communication is an important and widely used building block for many distributed applications. Much current research has been dedicated to specifying the semantics and services of view-oriented group communication systems (GCSs). However, the guarantees of different GCSs are formulated using varying terminologies and modeling techniques, and the specifications vary in their rigor. This makes it difficult to analyze and compare the different systems. This survey provides a comprehensive set of clear and rigorous specifications, which may be combined to represent the guarantees of most existing GCSs. In the light of these specifications, over 30 published GCS specifications are surveyed. Thus, the specifications serve as a unifying framework for the classification, analysis, and comparison of group communication systems. The survey also discusses over a dozen different applications of group communication systems, shedding light on the usefulness of the presented specifications. This survey is aimed at both system builders and theoretical researchers. The specification framework presented in this article will help builders of group communication systems understand and specify their service semantics; the extensive survey will allow them to compare their service to others. Application builders will find a guide here to the services provided by a large variety of GCSs, which could help them choose the GCS appropriate for their needs. The formal framework may provide a basis for interesting theoretical work, for example, analyzing relative strengths of different properties and the costs of implementing them.

...read moreread less

734 citations

Cites background from "Horus: a flexible group communicati..."

...All the GCSs that we are aware of satisfy Self Delivery; some examples are: Isis, Transis, Totem, Horus, and Newtop....
[...]
...Several group communication systems (e.g., Ensemble, Horus, and RMP) provide a reliable FIFO service type that satis.es Property 6.2 and does not impose additional ordering constraints. xAMp provides several service levels that satisfy Property 6.1 but vary by their reliability guarantees....
[...]
...Instead, the Horus STABLE layer maintains a more general stability matrix at each process....
[...]
...Strong Total Order is provided by Totem and by some of the implementations of totally ordered multicast in Transis, Ensemble, Phoenix, RMP, and Horus....
[...]
...Horus does not deliver safe pre.x noti.cations....
[...]

Journal Article•DOI•

Bimodal multicast

[...]

Kenneth P. Birman¹, Mark Hayden, Oznur Ozkasap¹, Zhen Xiao¹, Mihai Budiu², Yaron Minsky¹ - Show less +2 more•Institutions (2)

Cornell University¹, Carnegie Mellon University²

01 May 1999-ACM Transactions on Computer Systems

TL;DR: This article introduces the protocol, provides a theoretical analysis of its behavior, review experimental results, and discusses some candidate applications, confirming that bimodal multicast is reliable, scalable, and that the protocol provides remarkably stable delivery throughput.

...read moreread less

Abstract: There are many methods for making a multicast protocol “reliable.” At one end of the spectrum, a reliable multicast protocol might offer tomicity guarantees, such as all-or-nothing delivery, delivery ordering, and perhaps additional properties such as virtually synchronous addressing. At the other are protocols that use local repair to overcome transient packet loss in the network, offering “best effort” reliability. Yet none of this prior work has treated stability of multicast delivery as a basic reliability property, such as might be needed in an internet radio, television, or conferencing application. This article looks at reliability with a new goal: development of a multicast protocol which is reliable in a sense that can be rigorously quantified and includes throughput stability guarantees. We characterize this new protocol as a “bimodal multicast” in reference to its reliability model, which corresponds to a family of bimodal probability distributions. Here, we introduce the protocol, provide a theoretical analysis of its behavior, review experimental results, and discuss some candidate applications. These confirm that bimodal multicast is reliable, scalable, and that the protocol provides remarkably stable delivery throughput.

...read moreread less

693 citations

Cites background or methods from "Horus: a flexible group communicati..."

...In addition to the work reported here, our group at Cornell also explored other uses of gossip, such as gossip-based membership tracking [van Renesse et al. 1996] and gossipbased stability detection [Guo 1998]....
[...]
...Ensemble supports group-communication protocol stacks that are constructed by composing microprotocols, an idea that originated in the Horus project [Birman 1997; van Renesse et al. 1996]....
[...]
...Spinglass uses gossip to track membership as well as to do communication [van Renesse et al. 1996], but the behavior of the bimodal protocol is unaffected (formal analysis of the combined gossip mechanisms is, however, beyond our current ability)....
[...]

Proceedings Article•DOI•

Peer-to-peer support for massively multiplayer games

[...]

Björn Knutsson¹, Honghui Lu¹, Wei Xu¹, B. Hopkins¹•Institutions (1)

University of Pennsylvania¹

07 Mar 2004

TL;DR: This work has designed scalable mechanisms to distribute the game state to the participating players and to maintain consistency in the face of node failures, and has implemented a simple game called SimMud, and experimented with up to 4000 players to demonstrate the applicability of this approach.

...read moreread less

Abstract: We present an approach to support massively multiplayer games on peer-to-peer overlays. Our approach exploits the fact that players in MMGs display locality of interest, and therefore can form self-organizing groups based on their locations in the virtual world. To this end, we have designed scalable mechanisms to distribute the game state to the participating players and to maintain consistency in the face of node failures. The resulting system dynamically scales with the number of online players. It is more flexible and has a lower deployment cost than centralized games servers. We have implemented a simple game we call SimMud, and experimented with up to 4000 players to demonstrate the applicability of this approach.

...read moreread less

578 citations

Cites background from "Horus: a flexible group communicati..."

...0 20 40 60 80 100 Percentage of nodes [0-10] [10-20] [20-30] [30-40] [40-50] [50-60] [60-70] [70-80] [80-90] [90-100] [100-110] [110-120] [120-130] [130-140] [140-150] [150-160] [160-170] [170-180] [180-190] [190-200] [200-210] [210-220]...
[...]
...Fault tolerant consistent data services can also be built on top of view-synchronous group communication [15], [30] using...
[...]
...0 10 20 30 40 Percentage of nodes [0-10] [10-20] [20-30] [30-40] [40-50] [50-60] [60-70] [70-80] [80-90] [90-100] [100-110] [110-120]...
[...]

Journal Article•DOI•

Process migration

[...]

Dejan Milojicic¹, Fred Douglis, Yves Paindaveine, Richard Wheeler², Songnian Zhou³ - Show less +1 more•Institutions (3)

Hewlett-Packard¹, EMC Corporation², University of Toronto³

01 Sep 2000-ACM Computing Surveys

TL;DR: In this article, the authors present a survey of the field of process migration by summarizing the key concepts and giving an overview of the most important implementations, including MOSIX, Sprite, Mach, and Load Sharing Facility.

...read moreread less

Abstract: Process migration is the act of transferring a process between two machines. It enables dynamic load distribution, fault resilience, eased system administration, and data access locality. Despite these goals and ongoing research efforts, migration has not achieved widespread use. With the increasing deployment of distributed systems in general, and distributed operating systems in particular, process migration is again receiving more attention in both research and product development. As high-performance facilities shift from supercomputers to networks of workstations, and with the ever-increasing role of the World Wide Web, we expect migration to play a more important role and eventually to be widely adopted.This survey reviews the field of process migration by summarizing the key concepts and giving an overview of the most important implementations. Design and implementation issues of process migration are analyzed in general, and then revisited for each of the case studies described: MOSIX, Sprite, Mach, and Load Sharing Facility. The benefits and drawbacks of process migration depend on the details of implementation and, therefore, this paper focuses on practical matters. This survey will help in understanding the potentials of process migration and why it has not caught on.

...read moreread less

551 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151

Collapse

References

PDF

Open Access

More filters

Proceedings Article•DOI•

Architectural considerations for a new generation of protocols

[...]

David D. Clark¹, D.L. Tennenhouse¹•Institutions (1)

Massachusetts Institute of Technology¹

01 Aug 1990

TL;DR: This paper identifies two new design principles, Application Level Framing and Integrated Layer Processing, and identifies the presentation layer as a key aspect of overall protocol performance.

...read moreread less

Abstract: The current generation of protocol architectures, such as TCP/IP or the ISO suite, seem successful at meeting the demands of todays networks. However, a number of new requirements have been proposed for the networks of tomorrow, and some innovation in protocol structuring may be necessary. In this paper, we review some key requirements for tomorrow's networks, and propose some architectural principles to structure a new generation of protocols. In particular, this paper identifies two new design principles, Application Level Framing and Integrated Layer Processing. Additionally, it identifies the presentation layer as a key aspect of overall protocol performance.

...read moreread less

1,083 citations

Proceedings Article•DOI•

A reliable multicast framework for light-weight sessions and application level framing

[...]

Sally Floyd¹, Van Jacobson¹, Steve McCanne¹, Ching-Gung Liu², Lixia Zhang³ - Show less +1 more•Institutions (3)

University of California, Berkeley¹, University of Southern California², PARC³

01 Oct 1995

TL;DR: An adaptive algorithm that uses the results of previous loss recovery events to adapt the control parameters used for future loss recovery is demonstrated, and the reliable multicast delivery algorithm provides good performance over a wide range of underlying topologies.

...read moreread less

Abstract: This paper describes SRM (Scalable Reliable Multicast), a reliable multicast framework for application level framing and light-weight sessions. The algorithms of this framework are efficient, robust, and scale well to both very large networks and very large sessions. The framework has been prototyped in wb, a distributed whiteboard application, and has been extensively tested on a global scale with sessions ranging from a few to more than 1000 participants. The paper describes the principles that have guided our design, including the IP multicast group delivery model, an end-to-end, receiver-based model of reliability, and the application level framing protocol model. As with unicast communications, the performance of a reliable multicast delivery algorithm depends on the underlying topology and operational environment. We investigate that dependence via analysis and simulation, and demonstrate an adaptive algorithm that uses the results of previous loss recovery events to adapt the control parameters used for future loss recovery. With the adaptive algorithm, our reliable multicast delivery algorithm provides good performance over a wide range of underlying topologies.

...read moreread less

753 citations

Book•

Reliable Distributed Computing with the Isis Toolkit

[...]

Kenneth P. Birman, Robert van Renesse, Robbert van Renesse

30 Mar 1994

639 citations

Journal Article•DOI•

Probabilistic clock synchronization

[...]

Flaviu Cristian¹•Institutions (1)

IBM¹

01 Sep 1989-Distributed Computing

TL;DR: A probabilistic method is proposed for reading remote clocks in distributed systems subject to unbounded random communication delays and can achieve clock synchronization precisions superior to those attainable by previously published clock synchronization algorithms.

...read moreread less

Abstract: A probabilistic method is proposed for reading remote clocks in distributed systems subject to unbounded random communication delays. The method can achieve clock synchronization precisions superior to those attainable by previously published clock synchronization algorithms. Its use is illustrated by presenting a time service which maintains externally (and hence, internally) synchronized clocks in the presence of process, communication and clock failures.

...read moreread less

620 citations

Journal Article•DOI•

Hypervisor-based fault tolerance

[...]

Thomas Bressoud, Fred B. Schneider¹•Institutions (1)

Cornell University¹

01 Feb 1996-ACM Transactions on Computer Systems

TL;DR: In this article, the authors describe protocols to implement a fault-tolerant computing system, which augment the hypervisor of a virtual machine manager and coordinate a primary virtual machine with its backup.

...read moreread less

Abstract: Protocols to implement a fault-tolerant computing system are described. These protocols augment the hypervisor of a virtual-machine manager and coordinate a primary virtual machine with its backup. No modifications to the hardware, operating system, or application programs are required. A prototype system was constructed for HP's PA-RISC instruction-set architecture. Even though the prototype was not carefully tuned, it ran programs about a factor of 2 slower than a bare machine would.

...read moreread less

480 citations