scispace - formally typeset
Journal ArticleDOI

Recovery Time and Fault Tolerance Improvement for Circuits mapped on SRAM-based FPGAs

Anees Ullah, +1 more
- 01 Aug 2014 - 
- Vol. 30, Iss: 4, pp 425-442
TLDR
The results demonstrate an evident reduction of the recovery time due to fast error detection time and selective partial reconfiguration of faulty domains, and the methodology drastically reduces Cross-Domain Errors in Look-Up Tables and routing resources.
Abstract
The rapid adoption of FPGA-based systems in space and avionics demands dependability rules from the design to the layout phases to protect against radiation effects. Triple Modular Redundancy is a widely used fault tolerance methodology to protect circuits against radiation-induced Single Event Upsets implemented on SRAM-based FPGAs. The accumulation of SEUs in the configuration memory can cause the TMR replicas to fail, requiring a periodic write-back of the configuration bit-stream. The associated system downtime due to scrubbing and the probability of simultaneous failures of two TMR domains are increasing with growing device densities. We propose a methodology to reduce the recovery time of TMR circuits with increased resilience to Cross-Domain Errors. Our methodology consists of an automated tool-flow for fine-grain error detection, error flags convergence and non-overlapping domain placement. The fine-grain error detection logic identifies the faulty domain using gate-level functions while the error flag convergence logic reduces the overwhelming number of flag signals. The non-overlapping placement enables selective domain reconfiguration and greatly reduces the number of Cross-Domain Errors. Our results demonstrate an evident reduction of the recovery time due to fast error detection time and selective partial reconfiguration of faulty domains. Moreover, the methodology drastically reduces Cross-Domain Errors in Look-Up Tables and routing resources. The improvements in recovery time and fault tolerance are achieved at an area overhead of a single LUT per majority voter in TMR circuits.

read more

Citations
More filters
Journal ArticleDOI

An ALU Protection Methodology for Soft Processors on SRAM-Based FPGAs

TL;DR: This paper proposes an application-based methodology, to protect a soft processor implemented in an SRAM-based FPGA, against the effect of soft errors, by creating a library of adaptive protection configurations, based on the profiling of the application.
Book ChapterDOI

Resilience and Fault Tolerance in Electrical Engineering

TL;DR: It is found that while significant and sustained attention has been dedicated to enhance the resilience of engineering electrical systems, substantial work remains to fully address resilience challenges that instill confidence in the ability to engineer resilient systems.
Journal ArticleDOI

Repair of FPGA-Based Real-Time Systems With Variable Slacks

TL;DR: A novel approach is proposed to use a deadline-aware scrubbing scheme with negligible area costs that dynamically chooses the scrubbing starting position to avoid missing real-time deadlines while maximizing the repair probability given a bounded repair time.
Journal ArticleDOI

Speed-area optimized VLSI architecture of multi-bit cellular automaton cell based random number generator on FPGA with testable logic support

TL;DR: In this paper, a speed-area efficient VLSI implementation of a cellular automaton (CA) based random number generator (RNG) on Field Programmable Gate Arrays (FPGAs) is presented.
Journal ArticleDOI

Testable Architecture Design for Programmable Cellular Automata on FPGA Using Run-Time Dynamically Reconfigurable Look-Up Tables

TL;DR: In this article, the authors have achieved run-time dynamic reconfiguration by employing a category of logic cells equipped to realize programmability in cellular automata (CA) architectures on Field Programmable Gate Arrays (FPGAs).
References
More filters
Book ChapterDOI

Boost C++ Libraries

TL;DR: This chapter discusses the Boost C++ API, a peer-reviewed C++ class library which implements many interesting and useful data structures and algorithms, and the use of Boost smart pointers, Boost asynchronous IO, and IO Streams.
BookDOI

Soft Errors in Modern Electronic Systems

TL;DR: In this article, the authors provide a comprehensive presentation of the most advanced research results and technological developments enabling understanding, qualifying and mitigating the soft errors effect in advanced electronics, including the fundamental physical mechanisms of radiation induced soft errors, the various steps that lead to a system failure, the modelling and simulation of soft error at various levels (including physical, electrical, netlist, event driven, RTL, and system level modeling and simulation), hardware fault injection, accelerated radiation testing and natural environment testing, soft error oriented test structures, process-level, device level, celllevel, circuit
Journal ArticleDOI

Effectiveness of Internal Versus External SEU Scrubbing Mitigation Strategies in a Xilinx FPGA: Design, Test, and Analysis

TL;DR: In this article, a comparison of two scrubbing mitigation schemes for Xilinx field programmable gate array (FPGA) devices is presented, along with an examination of mitigation limitations.
Proceedings ArticleDOI

FPGA partial reconfiguration via configuration scrubbing

TL;DR: This paper presents a novel technique that allows partial reconfiguration to be used with configuration scrubbing, a self scrubber that performs the necessary operations to reconfigure a portion of the design while continuously scrubbing the entire FPGA.
Related Papers (5)