Proceedings ArticleDOI
Towards Ex Vivo Testing of MapReduce Applications
Jesus Moran,Antonia Bertolino,Claudio de la Riva,Javier Tuya +3 more
- pp 73-80
TLDR
This work proposes an automatic test framework implementing a novel testing approach called Ex Vivo that can identify a fault in a few seconds, then the program can be stopped, not only avoiding an incorrect output, but also saving money, time and energy of production resources.Abstract:
Big Data programs are those that process large data exceeding the capabilities of traditional technologies. Among newly proposed processing models, MapReduce stands out as it allows the analysis of schema-less data in large distributed environments with frequent infrastructure failures. Functional faults in MapReduce are hard to detect in a testing/preproduction environment due to its distributed characteristics. We propose an automatic test framework implementing a novel testing approach called Ex Vivo. The framework employs data from production but executes the tests in a laboratory to avoid side-effects on the application. Faults are detected automatically without human intervention by checking if the same data would generate different outputs with different infrastructure configurations. The framework (MrExist) is validated with a real-world program. MrExist can identify a fault in a few seconds, then the program can be stopped, not only avoiding an incorrect output, but also saving money, time and energy of production resources.read more
Citations
More filters
Journal ArticleDOI
A Survey of Field-based Testing Techniques
Antonia Bertolino,Pietro Braione,Guglielmo De Angelis,Luca Gazzola,Fitsum Meshesha Kifetew,Leonardo Mariani,Matteo Orru,Mauro Pezzè,Roberto Pietrantuono,Stefano Russo,Paolo Tonella +10 more
TL;DR: In this article, the authors present a systematic survey of field testing approaches over a body of 80 collected studies, and propose their categorization based on the environment and the system on which field testing is performed.
Proceedings ArticleDOI
Automatic Ex-Vivo Regression Testing of Microservices
TL;DR: An approach that analyzes the execution of deployed services at runtime in the field, in order to generate test cases for future versions of the same services, and shows that ExVivoMicroTest can effectively reveal faults based on automatically collected data.
Proceedings ArticleDOI
Toward In-Vivo Testing of Mobile Applications
Mariano Ceccato,Luca Gazzola,Fitsum Meshesha Kifetew,Leonardo Mariani,Matteo Orru,Paolo Tonella +5 more
TL;DR: The main idea is to run test cases in the field, where the intrinsic heterogeneity and variety of the end-user environment is exploited to dramatically increase the range of validated configurations.
Journal ArticleDOI
ExVivoMicroTest: ExVivo Testing of Microservices
Luca Gazzola,Maayan Goldstein,Leonardo Mariani,Marco Mobilio,Itai Segall,Alessandro Tundo,Luca Ussi +6 more
TL;DR: ExVivoMicroTest as mentioned in this paper analyzes the execution of deployed services at run-time in the field, in order to generate test cases for future versions of the same services.
Journal ArticleDOI
Testing anticipatory systems: A systematic mapping study on the state of the art
TL;DR: In this article , a systematic mapping study was conducted to classify and analyze the literature on testing anticipatory systems, enabling us to highlight the most relevant topics and potential gaps in this field.
References
More filters
Journal ArticleDOI
MapReduce: simplified data processing on large clusters
Jeffrey Dean,Sanjay Ghemawat +1 more
TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.
Journal ArticleDOI
MapReduce: simplified data processing on large clusters
Jeffrey Dean,Sanjay Ghemawat +1 more
TL;DR: This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.
Proceedings ArticleDOI
The Hadoop Distributed File System
TL;DR: The architecture of HDFS is described and experience using HDFS to manage 25 petabytes of enterprise data at Yahoo! is reported on.
Proceedings Article
The MD5 Message-Digest Algorithm
TL;DR: This document describes the MD5 message-digest algorithm, which takes as input a message of arbitrary length and produces as output a 128-bit "fingerprint" or "message digest" of the input.