scispace - formally typeset
Journal ArticleDOI

Practical byzantine fault tolerance and proactive recovery

Reads0
Chats0
TLDR
A new replication algorithm, BFT, is described that can be used to build highly available systems that tolerate Byzantine faults and is used to implement the first Byzantine-fault-tolerant NFS file system, BFS.
Abstract
Our growing reliance on online services accessible on the Internet demands highly available systems that provide correct service without interruptions. Software bugs, operator mistakes, and malicious attacks are a major cause of service interruptions and they can cause arbitrary behavior, that is, Byzantine faults. This article describes a new replication algorithm, BFT, that can be used to build highly available systems that tolerate Byzantine faults. BFT can be used in practice to implement real services: it performs well, it is safe in asynchronous environments such as the Internet, it incorporates mechanisms to defend against Byzantine-faulty clients, and it recovers replicas proactively. The recovery mechanism allows the algorithm to tolerate any number of faults over the lifetime of the system provided fewer than 1/3 of the replicas become faulty within a small window of vulnerability. BFT has been implemented as a generic program library with a simple interface. We used the library to implement the first Byzantine-fault-tolerant NFS file system, BFS. The BFT library and BFS perform well because the library incorporates several important optimizations, the most important of which is the use of symmetric cryptography to authenticate messages. The performance results show that BFS performs 2p faster to 24p slower than production implementations of the NFS protocol that are not replicated. This supports our claim that the BFT library can be used to build practical systems that tolerate Byzantine faults.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings Article

Deconstructing Stellar Consensus.

TL;DR: This paper rigorously proves correct the Stellar Consensus Protocol (SCP), with the proof giving insights into the protocol structure and its use of lower-level abstractions, and establishes a refinement between the abstract protocol and the concrete SCP that uses only finite state.
Journal ArticleDOI

Quantitative survivability evaluation of three virtual machine-based server architectures

TL;DR: Analyzing and evaluating the survivability of three virtual machine-based architectures shows that BFTSA has better survivability than LBSA and ICSA, but with longer time to reach the steady states and higher communication costs.
Book ChapterDOI

A Decentralized Sharding Service Network Framework with Scalability

TL;DR: This paper proposed a sharding blockchain framework with linear scalability, which needs no centralized organization to assemble messages from subcommittees and redesigned the block-generating algorithm to accelerate generating block.

Coping with dependent failures in distributed systems

TL;DR: This dissertation presents a model of dependent failures based on two abstractions: cores and survivor sets, and develops techniques for selecting replicas and forming quorums that do have optimal availability in multi-site systems.
Proceedings ArticleDOI

Towards Trustworthy Integrated Clinical Environments

TL;DR: These mechanisms prevent faulty replicas from launching stealth denial-of-service attacks, which is important for the liveness of the system, and the overhead of the mechanisms is sufficiently low to warrant their use in practical ICEs.
References
More filters
Book ChapterDOI

Time, clocks, and the ordering of events in a distributed system

TL;DR: In this paper, the concept of one event happening before another in a distributed system is examined, and a distributed algorithm is given for synchronizing a system of logical clocks which can be used to totally order the events.
Journal ArticleDOI

Time, clocks, and the ordering of events in a distributed system

TL;DR: In this article, the concept of one event happening before another in a distributed system is examined, and a distributed algorithm is given for synchronizing a system of logical clocks which can be used to totally order the events.
Journal ArticleDOI

The Byzantine Generals Problem

TL;DR: The Albanian Generals Problem as mentioned in this paper is a generalization of Dijkstra's dining philosophers problem, where two generals have to come to a common agreement on whether to attack or retreat, but can communicate only by sending messengers who might never arrive.
Book ChapterDOI

The Byzantine generals problem

TL;DR: In this article, a group of generals of the Byzantine army camped with their troops around an enemy city are shown to agree upon a common battle plan using only oral messages, if and only if more than two-thirds of the generals are loyal; so a single traitor can confound two loyal generals.
Journal ArticleDOI

Impossibility of distributed consensus with one faulty process

TL;DR: In this paper, it is shown that every protocol for this problem has the possibility of nontermination, even with only one faulty process.
Related Papers (5)