On the optimal reconfiguration times for TMR circuits on SRAM based FPGAs
TL;DR: This work proposes a novel circuit instrumentation method for probing Triple Modular Redundancy (TMR) circuits for error detection at the granularity of individual domains and then use selective run-time dynamic reconfiguration for recovery.
Abstract: Unreliable and harsh environmental conditions in avionics and space applications demand run-time adaptation capabilities to withstand environmental changes and radiation-induced faults. Modern SRAM-based FPGAs integrating high computational power with partial and dynamic reconfiguration abilities are a usual candidate for such systems. However, due to the vulnerability of these devices to Single Event Upsets (SEUs), designs need proper fault-handling mechanisms. In this work we propose a novel circuit instrumentation method for probing Triple Modular Redundancy (TMR) circuits for error detection at the granularity of individual domains and then use selective run-time dynamic reconfiguration for recovery. Error detection logic is inserted in the physical net-list to identify and localize faults. Moreover, selective domain reconfiguration is achieved by careful considerations in the placement phase on the FPGA reconfigurable area. The proposed technique is suitable for systems having hard real-time constraints. Our results demonstrate that this approach has an overhead of 2 LUTs per majority voter in internal partitions in terms of area when compared to the standard TMR circuits. In addition, it brings down the reconfiguration times of TMR circuits to a single domain and ensures a 100% availability of the device assuming the Single Event Upset fault model.
...read more
Citations
14 citations
Cites background from "On the optimal reconfiguration time..."
...So, the scrubbing can be limited to the bitstream region implementing the faulty module [18], [19]....
[...]
13 citations
Cites background from "On the optimal reconfiguration time..."
...Finally, it differs from [30, 32] in that it targets permanent as well as transient faults....
[...]
...In [30], a modified implementation of TMR circuits was proposed allowing for the insertion of fault detection logic, mainly minority voters, on the granularity of a domain....
[...]
11 citations
Cites background or methods from "On the optimal reconfiguration time..."
...(1) To improve the reliability of electronics in space, the multi-objective evolutionary design of STMR system against SEUs is proposed in this paper....
[...]
...(1) The input is considered to be sensitive only if its value is dominant over other inputs....
[...]
...(1) Obtaining the required topologies using EHW....
[...]
...Each optimal chromosome evolved in (1) is translated into VHDL format to design a new VRC structure capable of implementing STMR....
[...]
...The basic concept is as follows: (1) a set of the primary input probabilities has been generated and propagated through the combinational circuit; (2) the output signal probabilities of all the lines in the circuit are calculated; (3) SEU sensitive gates are identified; (4) TMR is introduced to these sensitive gates....
[...]
11 citations
Cites methods from "On the optimal reconfiguration time..."
...In [10], TMR is used along with DPR for recovery of transient faults in [10]....
[...]
10 citations
Cites background from "On the optimal reconfiguration time..."
...the optimal granularity of DMR/TMR domains [4], its voting mechanism or even optimal reconfiguration time after the detection of the fault [11]....
[...]
References
189 citations
"On the optimal reconfiguration time..." refers background or methods in this paper
...TMR with configuration scrubbing [3][4] has reconfiguration time on the orders of milliseconds which can be intolerable for systems with hard real-time constraints....
[...]
...However, exploiting the reconfiguration abilities of FPGA, TMR life can be extended by writing the configuration bit-stream periodically in order to stop the accumulation of multiple independent single event upsets [3] [4]....
[...]
136 citations
"On the optimal reconfiguration time..." refers background in this paper
...Similarly, a system level partitioning based approach is proposed in [8] in order to detect and localize faults and use dynamic reconfiguration for recovery....
[...]
130 citations
"On the optimal reconfiguration time..." refers methods in this paper
...A method on how to resolve this situation is presented in [9]....
[...]
34 citations
"On the optimal reconfiguration time..." refers background in this paper
...An interesting work in presented in [10] where the authors suggest to remove all the internal voters and implement TMR as a single partition i-e voters are used only at the output....
[...]
33 citations
"On the optimal reconfiguration time..." refers methods in this paper
...In [7] the authors present a method to recover a system to health by dynamic partial reconfiguration....
[...]