Automatic Testing of Design Faults in MapReduce Applications

doi:10.1109/TR.2018.2802047

Open AccessJournal ArticleDOI

Automatic Testing of Design Faults in MapReduce Applications

Jesus Moran, +3 more

- 16 Mar 2018 -

IEEE Transactions on Reliability

- Vol. 67, Iss: 3, pp 717-732

Chats0

TLDR

New testing techniques that aimed to detect design faults by simulating different infrastructure configurations that as whole are more likely to reveal failures using random testing, and partition testing together with combinatorial testing are proposed.

Abstract:

New processing models are being adopted in Big Data engineering to overcome the limitations of traditional technology. Among them, MapReduce stands out by allowing for the processing of large volumes of data over a distributed infrastructure that can change during runtime. The developer only designs the functionality of the program and its execution is managed by a distributed system. As a consequence, a program can behave differently at each execution because it is automatically adapted to the resources available at each moment. Therefore, when the program has a design fault, this could be revealed in some executions and masked in others. However, during testing, these faults are usually masked because the test infrastructure is stable, and they are only revealed in production because the environment is more aggressive with infrastructure failures, among other reasons. This paper proposes new testing techniques that aimed to detect these design faults by simulating different infrastructure configurations. The testing techniques generate a representative set of infrastructure configurations that as whole are more likely to reveal failures using random testing, and partition testing together with combinatorial testing. The techniques are automated by using a test execution engine called MRTest that is able to detect these faults using only the test input data, regardless of the expected output. Our empirical evaluation shows that MRTest can automatically detect these design faults within a reasonable time.

Automatic Testing of Design Faults in MapReduce Applications

Citations

TEA- Cloud : A Formal Framework for Testing Cloud Computing Systems

Automated testing in robotic process automation projects

Quality Assurance Technologies of Big Data Applications: A Systematic Literature Review

FSM Modeling of Testing Security Policies for MapReduce Frameworks

Optimization Driven Constraints Handling in Combinatorial Interaction Testing

References

MapReduce: simplified data processing on large clusters

MapReduce: simplified data processing on large clusters

Spark: cluster computing with working sets

Art of Software Testing

The Art of Software Testing

Related Papers (5)

Infrastructure-Aware Functional Testing of MapReduce Programs

Towards Ex Vivo Testing of MapReduce Applications

Testing data transformations in MapReduce programs

Separating passing and failing test executions by clustering anomalies

Guided Test Generation for Finding Worst-Case Stack Usage in Embedded Systems