Proceedings Article

Towards Ex Vivo Testing of MapReduce Applications

TLDR
This work proposes an automatic test framework that implements a novel testing approach called Ex Vivo. The framework can identify a fault in a few seconds, after which the program can be stopped, which avoids an incorrect output and saves the money, time, and energy of production resources.
Abstract
Big Data programs process data volumes that exceed the capabilities of traditional technologies. Among newly proposed processing models, MapReduce stands out because it allows the analysis of schema-less data in large distributed environments with frequent infrastructure failures. Functional faults in MapReduce programs are hard to detect in a testing or preproduction environment because of the distributed nature of the execution. We propose an automatic test framework implementing a novel testing approach called Ex Vivo. The framework uses data from production but executes the tests in a laboratory to avoid side effects on the application. Faults are detected automatically, without human intervention, by checking whether the same data would generate different outputs under different infrastructure configurations. The framework (MrExist) is validated with a real-world program. MrExist can identify a fault in a few seconds, after which the program can be stopped; this not only avoids an incorrect output but also saves the money, time, and energy of production resources.
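
The detection idea in the abstract (same data, different infrastructure configurations, compare outputs) can be illustrated with a minimal sketch. This is not MrExist: the in-memory `run_job` simulator, the choice of configurations (number of map splits, combiner on or off), the MD5 fingerprint, and all function names are assumptions made for illustration only.

```python
import hashlib
from collections import defaultdict
from itertools import product


def run_job(records, mapper, reducer, combiner=None, num_splits=1):
    """Toy in-memory MapReduce run under one hypothetical configuration."""
    # Distribute the input round-robin across simulated map tasks.
    splits = [records[i::num_splits] for i in range(num_splits)]
    shuffled = defaultdict(list)
    for split in splits:
        local = defaultdict(list)
        for record in split:
            for key, value in mapper(record):
                local[key].append(value)
        for key, values in local.items():
            # A combiner, if enabled, pre-aggregates each map task's output.
            if combiner is not None:
                values = [combiner(key, values)]
            shuffled[key].extend(values)
    return {key: reducer(key, values) for key, values in shuffled.items()}


def digest(output):
    """Fingerprint of a job output that ignores key order."""
    canonical = repr(sorted(output.items())).encode("utf-8")
    return hashlib.md5(canonical).hexdigest()


def is_configuration_sensitive(records, mapper, reducer, combiner,
                               split_options=(1, 2, 3)):
    """Return True if at least two configurations disagree on the output."""
    digests = set()
    for num_splits, use_combiner in product(split_options, (False, True)):
        output = run_job(records, mapper, reducer,
                         combiner if use_combiner else None, num_splits)
        digests.add(digest(output))
    return len(digests) > 1


def map_reading(record):
    sensor, value = record
    yield sensor, value


def mean(key, values):
    # Faulty when reused as a combiner: a mean of partial means is not the mean.
    return sum(values) / len(values)


def total(key, values):
    # Safe as a combiner: addition is associative and commutative.
    return sum(values)


readings = [("s1", 1), ("s1", 2), ("s1", 6), ("s2", 4)]
faulty = is_configuration_sensitive(readings, map_reading, mean, mean)
correct = is_configuration_sensitive(readings, map_reading, total, total)
print(faulty, correct)
```

In this sketch the program that reuses `mean` as its combiner produces different results depending on how the input is split, so it is flagged, while the `total` program yields the same fingerprint under every configuration. No expected output has to be written by hand, which mirrors the abstract's point that faults are detected without human intervention.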


Citations
Journal Article

A Survey of Field-based Testing Techniques

TL;DR: In this article, the authors present a systematic survey of field testing approaches over a body of 80 collected studies, and propose their categorization based on the environment and the system on which field testing is performed.
Proceedings Article

Automatic Ex-Vivo Regression Testing of Microservices

TL;DR: This paper presents an approach that analyzes the execution of deployed services at runtime in the field to generate test cases for future versions of the same services, and shows that ExVivoMicroTest can effectively reveal faults based on automatically collected data.
Proceedings Article

Toward In-Vivo Testing of Mobile Applications

TL;DR: The main idea is to run test cases in the field, where the intrinsic heterogeneity and variety of the end-user environment is exploited to dramatically increase the range of validated configurations.
Journal Article

ExVivoMicroTest: ExVivo Testing of Microservices

TL;DR: ExVivoMicroTest analyzes the execution of deployed services at runtime in the field to generate test cases for future versions of the same services.
Journal Article

Testing anticipatory systems: A systematic mapping study on the state of the art

TL;DR: In this article, a systematic mapping study was conducted to classify and analyze the literature on testing anticipatory systems, highlighting the most relevant topics and potential gaps in this field.