ABSTRACT
Fault injection is the de facto standard for evaluating the sensitivity of digital systems to transient errors. Due to various masking effects only a very small portion of the injected faults lead to system-level failures, and hence, too many faults have to be injected for achieving statistically meaningful results. At the same time, since the majority of injected faults will be masked, lots of simulation cycles will be wasted for tracking each and every injected fault separately. In this paper, we propose an opportunistic acceleration technique which evaluates the impact of multiple non-interacting faults in one workload execution. In case no failure is observed, this technique skips the evaluation of those individual faults which leads to a significant speedup. The experimental results on the Leon3 processor show that our proposed technique shortens the fault injection runtime by two orders of magnitude.
- E. Ibe et al. Impact of scaling on neutron-induced soft error in SRAMs from a 250 nm to a 22 nm design rule. IEEE Transactions on Electron Devices, 57(7):1527--1538, 2010.Google ScholarCross Ref
- A. Dixit and A. Wood. The impact of new technology on soft error rates. In International Reliability Physics Symposium, pages 5B--4, 2011.Google ScholarCross Ref
- M. Ebrahimi et al. Comprehensive analysis of sequential and combinational soft errors in an embedded processor. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2015.Google ScholarDigital Library
- J.-L. Autran and D. Munteanu. Soft Errors: From Particles to Circuits, volume 39. CRC Press, 2015.Google ScholarCross Ref
- K. Reick et al. Fault-tolerant design of the ibm power6 microprocessor. IEEE Micro, 28(2):30--38, 2008. Google ScholarDigital Library
- C.-Y. Cher et al. Soft error resiliency characterization and improvement on ibm bluegene/q processor using accelerated proton irradiation. In International Test Conference, pages 1--6, 2014.Google ScholarCross Ref
- R. Leveugle, A. Calvez, P. Maistri, and P. Vanhauwaert. Statistical fault injection: quantified error and confidence. In Design, Automation & Test in Europe Conference, pages 502--506, 2009. Google ScholarDigital Library
- M. Ebrahimi et al. A fast, flexible, and easy-to-develop fpga-based fault injection technique. Microelectronics Reliability, 54(5):1000--1008, 2014.Google ScholarCross Ref
- N. A. Harward, M. R. Gardiner, L. W. Hsiao, and M. J. Wirthlin. Estimating soft processor soft error sensitivity through fault injection. In Field-Programmable Custom Computing Machines (FCCM), pages 143--150, 2015. Google ScholarDigital Library
- S. Mirkhani, H. Cho, S. Mitra, and J. A. Abraham. Rethinking error injection for effective resilience. In Asia and South Pacific Design Automation Conference (ASP-DAC), pages 390--393, 2014.Google ScholarCross Ref
- V. B. Kleeberger, D. Mueller-Gritschneder, and U. Schlichtmann. Technology-aware system failure analysis in the presence of soft errors by mixture importance sampling. In Defect and Fault Tolerance in VLSI and Nanotechnology Systems, pages 118--124, 2013.Google ScholarCross Ref
- M. Ebrahimi et al. Fault injection acceleration by architectural importance sampling. In International Conference on Hardware-Software Codesign and System Synthesis, pages 1--6, 2014.Google Scholar
- The Leon3 Processor User Manual, http://www.gaisler.com, 2015.Google Scholar
- M. R. Guthaus et al. Mibench: A free, commercially representative embedded benchmark suite. In IEEE International Workshop on Workload Characterization, pages 3--14, 2001. Google ScholarDigital Library
- S. Mukherjee. Architecture design for soft errors. Morgan Kaufmann, 2011. Google ScholarDigital Library
- M. Ebrahimi et al. Layout-based modeling and mitigation of multiple event transients. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2016.Google ScholarDigital Library
- D. Alexandrescu. A comprehensive soft error analysis methodology for SoCs/ASICs memory instances. In International On-Line Testing Symposium, pages 175--176, 2011. Google ScholarDigital Library
- L. Berrojo et al. New techniques for speeding-up fault-injection campaigns. In Design, Automation and Test in Europe Conference, pages 847--852, 2002. Google ScholarDigital Library
- P. Maistri and R. Leveugle. Towards automated fault pruning with petri nets. In International On-Line Testing Symposium, pages 41--46, 2009.Google ScholarCross Ref
- Vishwani D Agrawal, AVSS Prasad, and Madhusudan V Atre. Fault collapsing via functional dominance. In International Test Conference, pages 274--274, 2003.Google ScholarCross Ref
- A. Benso et al. Fault-list collapsing for fault-injection experiments. In Annual Reliability and Maintainability Symposium, pages 383--388. IEEE, 1998.Google Scholar
- M. Maniatakos, C. Tirumurti, A. Jas, and Y. Makris. Avf analysis acceleration via hierarchical fault pruning. In European Test Symposium (ETS), pages 87--92, 2011. Google ScholarDigital Library
- Fault injection acceleration by simultaneous injection of non-interacting faults
Recommendations
Fault Injection into VHDL Models: Experimental Validation of a Fault Tolerant Microcomputer System
EDCC-3: Proceedings of the Third European Dependable Computing Conference on Dependable ComputingThis work presents a campaign of fault injection to validate the dependability of a fault tolerant microcomputer system. The system is duplex with cold stand-by sparing, parity detection and a watchdog timer. The faults have been injected on a chip-...
Combining Software-Implemented and Simulation-Based Fault Injection into a Single Fault Injection Method
FTCS '95: Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant ComputingAbstract: Fault/error injection has emerged as a valuable means for evaluating the dependability of a system. In particular, software-based techniques (which can be described as software-implemented and simulation-based techniques) have become very ...
Fault Injection and Dependability Evaluation of Fault-Tolerant Systems
The authors describe a dependability evaluation method based on fault injection that establishes the link between the experimental evaluation of the fault tolerance process and the fault occurrence process. The main characteristics of a fault injection ...
Comments