scispace - formally typeset
Journal ArticleDOI

Practical byzantine fault tolerance and proactive recovery

Reads0
Chats0
TLDR
A new replication algorithm, BFT, is described that can be used to build highly available systems that tolerate Byzantine faults and is used to implement the first Byzantine-fault-tolerant NFS file system, BFS.
Abstract
Our growing reliance on online services accessible on the Internet demands highly available systems that provide correct service without interruptions. Software bugs, operator mistakes, and malicious attacks are a major cause of service interruptions and they can cause arbitrary behavior, that is, Byzantine faults. This article describes a new replication algorithm, BFT, that can be used to build highly available systems that tolerate Byzantine faults. BFT can be used in practice to implement real services: it performs well, it is safe in asynchronous environments such as the Internet, it incorporates mechanisms to defend against Byzantine-faulty clients, and it recovers replicas proactively. The recovery mechanism allows the algorithm to tolerate any number of faults over the lifetime of the system provided fewer than 1/3 of the replicas become faulty within a small window of vulnerability. BFT has been implemented as a generic program library with a simple interface. We used the library to implement the first Byzantine-fault-tolerant NFS file system, BFS. The BFT library and BFS perform well because the library incorporates several important optimizations, the most important of which is the use of symmetric cryptography to authenticate messages. The performance results show that BFS performs 2p faster to 24p slower than production implementations of the NFS protocol that are not replicated. This supports our claim that the BFT library can be used to build practical systems that tolerate Byzantine faults.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

The Efficient Server Audit Problem, Deduplicated Re-execution, and the Web

TL;DR: The Efficient Server Audit Problem as mentioned in this paper abstracts real-world scenarios, including running a web application on an untrusted provider, and presents a solution based on several new techniques, including simultaneous replay and efficient verification of concurrent executions.
Journal ArticleDOI

Impact of Adaptive Consistency on Distributed SDN Applications: An Empirical Study

TL;DR: In this article, the authors investigate the effect of the deployed consistency model on scalability and correctness metrics of the SDN control plane and make a case for a novel adaptive consistency approach.

Building Fault-Tolerant Consistency Protocols for an Adaptive Grid Data-Sharing Service

TL;DR: This work proposes a software architecture which decouples consistency management from fault-tolerance management, and illustrates this architecture with a case study showing how to design a consistency protocol using fault-Tolerant building blocks.
Book ChapterDOI

Intrusion-tolerant group management in Enclaves

TL;DR: It is shown how the improved protocol used by Enclaves can be modified to guarantee proper service in the presence of nontrustworthy group members and formally specified and proven correct.
Journal ArticleDOI

Blockchain-Incentivized D2D and Mobile Edge Caching: A Deep Reinforcement Learning Approach

TL;DR: A Blockchain-based CDM is proposed as an incentive mechanism for the distributed caching system, and the pPBFT protocol is proposed to minimize the latency of reaching consensus while guaranteeing its confidence level.
References
More filters
Book ChapterDOI

Time, clocks, and the ordering of events in a distributed system

TL;DR: In this paper, the concept of one event happening before another in a distributed system is examined, and a distributed algorithm is given for synchronizing a system of logical clocks which can be used to totally order the events.
Journal ArticleDOI

Time, clocks, and the ordering of events in a distributed system

TL;DR: In this article, the concept of one event happening before another in a distributed system is examined, and a distributed algorithm is given for synchronizing a system of logical clocks which can be used to totally order the events.
Journal ArticleDOI

The Byzantine Generals Problem

TL;DR: The Albanian Generals Problem as mentioned in this paper is a generalization of Dijkstra's dining philosophers problem, where two generals have to come to a common agreement on whether to attack or retreat, but can communicate only by sending messengers who might never arrive.
Book ChapterDOI

The Byzantine generals problem

TL;DR: In this article, a group of generals of the Byzantine army camped with their troops around an enemy city are shown to agree upon a common battle plan using only oral messages, if and only if more than two-thirds of the generals are loyal; so a single traitor can confound two loyal generals.
Journal ArticleDOI

Impossibility of distributed consensus with one faulty process

TL;DR: In this paper, it is shown that every protocol for this problem has the possibility of nontermination, even with only one faulty process.
Related Papers (5)