Structured random differential testing of instruction decoders

doi:10.1109/SANER.2018.8330199

Proceedings ArticleDOI

Structured random differential testing of instruction decoders

- pp 84-94

TLDR

A testing methodology that automatically infers structural information for an instruction set and uses the inferred structure to efficiently generate structured-random test cases independent of the instruction set being tested is presented.

Abstract:

Decoding binary executable files is a critical facility for software analysis, including debugging, performance monitoring, malware detection, cyber forensics, and sandboxing, among other techniques. As a foundational capability, binary decoding must be consistently correct for the techniques that rely on it to be viable. Unfortunately, modern instruction sets are huge and the encodings are complex, so as a result, modern binary decoders are buggy. In this paper, we present a testing methodology that automatically infers structural information for an instruction set and uses the inferred structure to efficiently generate structured-random test cases independent of the instruction set being tested. Our testing methodology includes automatic output verification using differential analysis and reassembly to generate error reports. This testing methodology requires little instruction-set-specific knowledge, allowing rapid testing of decoders for new architectures and extensions to existing ones. We have implemented our testing procedure in a tool name Fleece and used it to test multiple binary decoders (Intel XED, libopcodes, LLVM, Dyninst and Capstone) on multiple architectures (x86, ARM and PowerPC). Our testing efficiently covered thousands of instruction format variations for each instruction set and uncovered decoding bugs in every decoder we tested.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Differential analysis of x86-64 instruction decoders

William Woodruff, +2 more

TL;DR: Differential fuzzing has been applied successfully to cryptography software and complex application format parsers like PDF and ELF as discussed by the authors, where an implementation of a specification is said to be potentially erroneous if its behavior differs from another implementation's on the same input.

...read moreread less

Proceedings ArticleDOI

iDEV: exploring and exploiting semantic deviations in ARM instruction processing

Shisong Qin, +3 more

TL;DR: Li et al. as discussed by the authors conducted an empirical study on the ARM Instruction Semantic Deviation (ISDev) issue, and developed a framework iDEV to systematically explore the ISDev issue in existing ARM instructions processing tools and platforms via differential testing.

...read moreread less

Journal ArticleDOI

AnICA: analyzing inconsistencies in microarchitectural code analyzers

Fabian Ritter, +1 more

- 13 Sep 2022 -

Proceedings of the ACM on programming la...

TL;DR: This paper presents AnICA, a tool taking inspiration from differential testing and abstract interpretation to systematically analyze inconsistencies among microarchitectural code analyzers, and shows that AnICA can summarize thousands of inconsistencies in a few dozen descriptions that directly lead to high-level insights into the different behavior of the tools.

...read moreread less

Journal ArticleDOI

In-depth Testing of x86 Instruction Disassemblers with Feedback Controlled DFS Algorithm

Guang Xing Wang, +3 more

- 01 Oct 2022 -

International Conference on Community De...

TL;DR: FedDFS as mentioned in this paper leveraged a feedback controlled DFS algorithm, which is controlled by comparing its search depth with essential search depth, and the feedback mechanism promptly increases the search depth until it reaches the proper search depth.

...read moreread less

Proceedings ArticleDOI

In-depth Testing of x86 Instruction Disassemblers with Feedback Controlled DFS Algorithm

TL;DR: FedDFS as discussed by the authors leveraged a feedback controlled DFS algorithm, which is controlled by comparing its search depth with essential search depth, and the feedback mechanism promptly increases the search depth until it reaches the proper search depth.

...read moreread less

References

PDF

Open Access

More filters

Proceedings ArticleDOI

LLVM: a compilation framework for lifelong program analysis & transformation

Chris Lattner, +1 more

TL;DR: The design of the LLVM representation and compiler framework is evaluated in three ways: the size and effectiveness of the representation, including the type information it provides; compiler performance for several interprocedural problems; and illustrative examples of the benefits LLVM provides for several challenging compiler problems.

...read moreread less

Book ChapterDOI

BitBlaze: A New Approach to Computer Security via Binary Analysis

Dawn Song, +9 more

TL;DR: An overview of the BitBlaze project, a new approach to computer security via binary analysis that focuses on building a unified binary analysis platform and using it to provide novel solutions to a broad spectrum of different security problems.

...read moreread less

Journal ArticleDOI

HPCTOOLKIT: tools for performance analysis of optimized parallel programs

Laksono Adhianto, +6 more

- 01 Jan 2009 -

Concurrency and Computation: Practice an...

TL;DR: An overview of HPCTOOLKIT is provided and its utility for performance analysis of parallel applications is illustrated.

...read moreread less

Proceedings ArticleDOI

Grammar-based whitebox fuzzing

Patrice Godefroid, +2 more

TL;DR: Results of the experiments show that grammar-based whitebox fuzzing explores deeper program paths and avoids dead-ends due to non-parsable inputs and increased coverage of the code generation module of the IE7 JavaScript interpreter from 53% to 81% while using three times fewer tests.

...read moreread less

Proceedings Article

Instrumentation and optimization of Win32/intel executables using Etch

Ted Romer, +7 more

TL;DR: Etch is a general-purpose tool for rewriting arbitrary Win32/x86 binaries without requiring source code and some of the tools that are built using it are described, including a hierarchical call graph profiler and an instruction layout optimization tool.

...read moreread less

Structured random differential testing of instruction decoders

Citations

Differential analysis of x86-64 instruction decoders

iDEV: exploring and exploiting semantic deviations in ARM instruction processing

AnICA: analyzing inconsistencies in microarchitectural code analyzers

In-depth Testing of x86 Instruction Disassemblers with Feedback Controlled DFS Algorithm

In-depth Testing of x86 Instruction Disassemblers with Feedback Controlled DFS Algorithm

References

LLVM: a compilation framework for lifelong program analysis & transformation

BitBlaze: A New Approach to Computer Security via Binary Analysis

HPCTOOLKIT: tools for performance analysis of optimized parallel programs

Grammar-based whitebox fuzzing

Instrumentation and optimization of Win32/intel executables using Etch

Related Papers (5)

Differential analysis of x86-64 instruction decoders

Instruction trace analysis and enhanced debugging in embedded systems

ConTesa : Directed Test Suite Augmentation for Concurrent Software

XEMU: an efficient QEMU based binary mutation testing framework for embedded software

Random testing for security: blackbox vs. whitebox fuzzing