
Showing papers on "Benchmark (computing)" published in 2012


Journal ArticleDOI
TL;DR: A comparison study of the basic data-driven methods for process monitoring and fault diagnosis (PM–FD); their original ideas, implementation conditions, off-line design and on-line computation algorithms, as well as computational complexity, are discussed in detail.

1,116 citations


Proceedings ArticleDOI
18 Jun 2012
TL;DR: A new dataset - recorded from 18 activities performed by 9 subjects, wearing 3 IMUs and an HR-monitor - is created and made publicly available, showing the difficulty of the classification tasks and exposing new challenges for physical activity monitoring.
Abstract: This paper addresses the lack of a commonly used, standard dataset and established benchmarking problems for physical activity monitoring. A new dataset -- recorded from 18 activities performed by 9 subjects, wearing 3 IMUs and an HR-monitor -- is created and made publicly available. Moreover, 4 classification problems are benchmarked on the dataset, using a standard data processing chain and 5 different classifiers. The benchmark shows the difficulty of the classification tasks and exposes new challenges for physical activity monitoring.

902 citations
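To make the benchmarked "standard data processing chain" concrete, below is a minimal Python sketch of such a chain: sliding-window segmentation of the sensor signals, simple time-domain features, and an off-the-shelf classifier. The window length, features, classifier choice, and the synthetic data are illustrative assumptions, not the paper's exact setup.

```python
# Minimal sketch of a sliding-window activity-recognition chain. Window length,
# features and classifier are illustrative assumptions, not the paper's configuration.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score

def windows(signal, labels, win=512, step=256):
    """Segment a (num_samples, num_channels) signal into fixed-length windows."""
    X, y = [], []
    for start in range(0, len(signal) - win + 1, step):
        seg = signal[start:start + win]
        lab = labels[start:start + win]
        # simple time-domain features per channel: mean and standard deviation
        X.append(np.concatenate([seg.mean(axis=0), seg.std(axis=0)]))
        # majority label of the window
        y.append(np.bincount(lab).argmax())
    return np.array(X), np.array(y)

# toy stand-in for 3 IMUs plus heart rate (40 channels) with 5 activity classes
rng = np.random.default_rng(0)
signal = rng.normal(size=(20000, 40))
labels = rng.integers(0, 5, size=20000)

X, y = windows(signal, labels)
split = len(X) // 2
clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X[:split], y[:split])
print("accuracy:", accuracy_score(y[split:], clf.predict(X[split:])))
```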


Proceedings ArticleDOI
03 Mar 2012
TL;DR: This work identifies the key micro-architectural needs of scale-out workloads, calling for a change in the trajectory of server processors that would lead to improved computational density and power efficiency in data centers.
Abstract: Emerging scale-out workloads require extensive amounts of computational resources. However, data centers using modern server hardware face physical constraints in space and power, limiting further expansion and calling for improvements in the computational density per server and in the per-operation energy. Continuing to improve the computational resources of the cloud while staying within physical constraints mandates optimizing server efficiency to ensure that server hardware closely matches the needs of scale-out workloads. In this work, we introduce CloudSuite, a benchmark suite of emerging scale-out workloads. We use performance counters on modern servers to study scale-out workloads, finding that today's predominant processor micro-architecture is inefficient for running these workloads. We find that inefficiency comes from the mismatch between the workload needs and modern processors, particularly in the organization of instruction and data memory systems and the processor core micro-architecture. Moreover, while today's predominant micro-architecture is inefficient when executing scale-out workloads, we find that continuing the current trends will further exacerbate the inefficiency in the future. In this work, we identify the key micro-architectural needs of scale-out workloads, calling for a change in the trajectory of server processors that would lead to improved computational density and power efficiency in data centers.

860 citations


01 Jan 2012
TL;DR: By including versions of varying levels of optimization of the same fundamental algorithm, the Parboil benchmarks present opportunities to demonstrate tools and architectures that help programmers get the most out of their parallel hardware.
Abstract: The Parboil benchmarks are a set of throughput computing applications useful for studying the performance of throughput computing architecture and compilers. The name comes from the culinary term for a partial cooking process, which represents our belief that useful throughput computing benchmarks must be “cooked”, or preselected to implement a scalable algorithm with fine-grained parallel tasks. But useful benchmarks for this field cannot be “fully cooked”, because the architectures and programming models and supporting tools are evolving rapidly enough that static benchmark codes will lose relevance very quickly. We have collected benchmarks from throughput computing application researchers in many different scientific and commercial fields including image processing, biomolecular simulation, fluid dynamics, and astronomy. Each benchmark includes several implementations. Some implementations we provide as readable base implementations from which new optimization efforts can begin, and others as examples of the current state-of-the-art targeting specific CPU and GPU architectures. As we continue to optimize these benchmarks for new and existing architectures ourselves, we will also gladly accept new implementations and benchmark contributions from developers to recognize those at the frontier of performance optimization on each architecture. Finally, by including versions of varying levels of optimization of the same fundamental algorithm, the benchmarks present opportunities to demonstrate tools and architectures that help programmers get the most out of their parallel hardware. Less optimized versions are presented as challenges to the compiler and architecture research communities: to develop the technology that automatically raises the performance of simpler implementations to the performance level of sophisticated programmer-optimized implementations, or demonstrate any other performance or programmability improvements. We hope that these benchmarks will facilitate effective demonstrations of such technology.

695 citations


13 Jan 2012
TL;DR: A benchmark data set containing 300 natural images with eye tracking data from 39 observers is proposed to compare model performances and it is shown that human performance increases with the number of humans to a limit.
Abstract: Many computational models of visual attention have been created from a wide variety of different approaches to predict where people look in images. Each model is usually introduced by demonstrating performances on new images, and it is hard to make immediate comparisons between models. To alleviate this problem, we propose a benchmark data set containing 300 natural images with eye tracking data from 39 observers to compare model performances. We calculate the performance of 10 models at predicting ground truth fixations using three different metrics. We provide a way for people to submit new models for evaluation online. We find that the Judd et al. and Graph-based visual saliency models perform best. In general, models with blurrier maps and models that include a center bias perform well. We add and optimize a blur and center bias for each model and show improvements. We compare performances to baseline models of chance, center and human performance. We show that human performance increases with the number of humans to a limit. We analyze the similarity of different models using multidimensional scaling and explore the relationship between model performance and fixation consistency. Finally, we offer observations about how to improve saliency models in the future.

564 citations
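The abstract notes that blurrier maps and a center bias improve scores, and that models are evaluated against ground-truth fixations. The sketch below illustrates that post-processing and a simple AUC-style score; the Gaussian widths, mixing weight, and negative-sampling scheme are assumptions rather than the benchmark's exact protocol.

```python
# Hedged sketch: blur a saliency map, add a center-bias prior, and score it against
# fixation points with a simple ROC-AUC. Parameter choices are assumptions.
import numpy as np
from scipy.ndimage import gaussian_filter
from sklearn.metrics import roc_auc_score

def add_blur_and_center_bias(sal_map, blur_sigma=10.0, center_weight=0.5):
    h, w = sal_map.shape
    yy, xx = np.mgrid[0:h, 0:w]
    center = np.exp(-(((yy - h / 2) ** 2) / (2 * (h / 4) ** 2)
                      + ((xx - w / 2) ** 2) / (2 * (w / 4) ** 2)))
    blurred = gaussian_filter(sal_map, blur_sigma)
    mixed = (1 - center_weight) * blurred + center_weight * center
    return (mixed - mixed.min()) / (np.ptp(mixed) + 1e-12)

def auc_score(sal_map, fixations, num_negatives=1000, seed=0):
    """fixations: array of (row, col) ground-truth fixation locations."""
    rng = np.random.default_rng(seed)
    pos = sal_map[fixations[:, 0], fixations[:, 1]]
    neg = sal_map[rng.integers(0, sal_map.shape[0], num_negatives),
                  rng.integers(0, sal_map.shape[1], num_negatives)]
    scores = np.concatenate([pos, neg])
    labels = np.concatenate([np.ones(len(pos)), np.zeros(len(neg))])
    return roc_auc_score(labels, scores)

sal = np.random.default_rng(1).random((240, 320))
fix = np.random.default_rng(2).integers(0, [240, 320], size=(50, 2))
print("AUC:", auc_score(add_blur_and_center_bias(sal), fix))
```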


Journal ArticleDOI
TL;DR: The results indicate that, while some approaches perform better than others in a statistically significant manner, external validity in defect prediction is still an open problem, as generalizing results to different contexts/learners proved to be a partially unsuccessful endeavor.
Abstract: Reliably predicting software defects is one of the holy grails of software engineering. Researchers have devised and implemented a plethora of defect/bug prediction approaches varying in terms of accuracy, complexity and the input data they require. However, the absence of an established benchmark makes it hard, if not impossible, to compare approaches. We present a benchmark for defect prediction, in the form of a publicly available dataset consisting of several software systems, and provide an extensive comparison of well-known bug prediction approaches, together with novel approaches we devised. We evaluate the performance of the approaches using different performance indicators: classification of entities as defect-prone or not, ranking of the entities, with and without taking into account the effort to review an entity. We performed three sets of experiments aimed at (1) comparing the approaches across different systems, (2) testing whether the differences in performance are statistically significant, and (3) investigating the stability of approaches across different learners. Our results indicate that, while some approaches perform better than others in a statistically significant manner, external validity in defect prediction is still an open problem, as generalizing results to different contexts/learners proved to be a partially unsuccessful endeavor.

536 citations
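The benchmark evaluates approaches both as classifiers of defect-prone entities and as rankers, with and without review effort. The following sketch shows that style of evaluation on synthetic data; the learner and the two indicators used here (AUC, and defects found within a 20% effort budget) are common choices, not necessarily the paper's exact indicators.

```python
# Hedged sketch of defect-prediction evaluation: classification AUC plus a simple
# effort-aware ranking measure. Data and metric details are illustrative assumptions.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
n = 400
metrics = rng.normal(size=(n, 5))              # per-entity code/change metrics
defective = (metrics[:, 0] + rng.normal(size=n) > 1).astype(int)
effort = rng.integers(50, 2000, size=n)        # e.g. lines of code to review

train, test = slice(0, 200), slice(200, n)
model = LogisticRegression(max_iter=1000).fit(metrics[train], defective[train])
prob = model.predict_proba(metrics[test])[:, 1]

# 1) classification of entities as defect-prone or not: rank-based AUC
print("AUC:", roc_auc_score(defective[test], prob))

# 2) effort-aware ranking: defects found when reviewing entities by predicted score
#    until 20% of the total review effort is spent
order = np.argsort(-prob)
cum_effort = np.cumsum(effort[test][order])
budget = 0.2 * effort[test].sum()
found = defective[test][order][cum_effort <= budget].sum()
print("defects found at 20% effort:", found, "of", defective[test].sum())
```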


Proceedings ArticleDOI
19 Sep 2012
TL;DR: This paper presents Multi2Sim, an open-source, modular, and fully configurable toolset that enables ISA-level simulation of an x86 CPU and an AMD Evergreen GPU, and addresses program emulation correctness, as well as architectural simulation accuracy, using AMD's OpenCL benchmark suite.
Abstract: Accurate simulation is essential for the proper design and evaluation of any computing platform. Upon the current move toward the CPU-GPU heterogeneous computing era, researchers need a simulation framework that can model both kinds of computing devices and their interaction. In this paper, we present Multi2Sim, an open-source, modular, and fully configurable toolset that enables ISA-level simulation of an x86 CPU and an AMD Evergreen GPU. Focusing on a model of the AMD Radeon 5870 GPU, we address program emulation correctness, as well as architectural simulation accuracy, using AMD's OpenCL benchmark suite. Simulation capabilities are demonstrated with a preliminary architectural exploration study, and workload characterization examples. The project source code, benchmark packages, and a detailed user's guide are publicly available at www.multi2sim.org.

440 citations


Posted Content
TL;DR: Heuristic search value iteration (HSVI) as mentioned in this paper is an anytime algorithm that returns a policy and a provable bound on its regret with respect to the optimal policy, which can be used to solve POMDP problems.
Abstract: We present a novel POMDP planning algorithm called heuristic search value iteration (HSVI). HSVI is an anytime algorithm that returns a policy and a provable bound on its regret with respect to the optimal policy. HSVI gets its power by combining two well-known techniques: attention-focusing search heuristics and piecewise linear convex representations of the value function. HSVI's soundness and convergence have been proven. On some benchmark problems from the literature, HSVI displays speedups of greater than 100 with respect to other state-of-the-art POMDP value iteration algorithms. We also apply HSVI to a new rover exploration problem 10 times larger than most POMDP problems in the literature.

439 citations


Journal ArticleDOI
TL;DR: This paper presents an efficient implementation of the particle–particle particle-mesh method based on the work by Harvey and De Fabritiis, and provides a performance comparison of the same kernels compiled with both CUDA and OpenCL.

381 citations


Proceedings ArticleDOI
25 Mar 2012
TL;DR: This work proposes an efficient online algorithm for joint tenant placement and routing; evaluation on real data center traffic traces, under a spectrum of elephant and mice flows, demonstrates a consistent and significant improvement over the benchmark achieved by common heuristics.
Abstract: Today's data centers need efficient traffic management to improve resource utilization in their networks. In this work, we study a joint tenant (e.g., server or virtual machine) placement and routing problem to minimize traffic costs. These two complementary degrees of freedom—placement and routing—are mutually dependent; however, they are often optimized separately in today's data centers. Leveraging and expanding the technique of Markov approximation, we propose an efficient online algorithm in a dynamic environment under changing traffic loads. The algorithm requires a very small number of virtual machine migrations and is easy to implement in practice. Performance evaluation that employs real data center traffic traces under a spectrum of elephant and mice flows demonstrates a consistent and significant improvement over the benchmark achieved by common heuristics.

377 citations


Proceedings ArticleDOI
16 Jun 2012
TL;DR: This work proposes a new technique to extend an existing training set that allows us to explicitly control pose and shape variations, and defines a new challenge of combined articulated human detection and pose estimation in real-world scenes.
Abstract: State-of-the-art methods for human detection and pose estimation require many training samples for best performance. While large, manually collected datasets exist, the captured variations w.r.t. appearance, shape and pose are often uncontrolled thus limiting the overall performance. In order to overcome this limitation we propose a new technique to extend an existing training set that allows us to explicitly control pose and shape variations. For this we build on recent advances in computer graphics to generate samples with realistic appearance and background while modifying body shape and pose. We validate the effectiveness of our approach on the task of articulated human detection and articulated pose estimation. We report close to state-of-the-art results on the popular Image Parsing [25] human pose estimation benchmark and demonstrate superior performance for articulated human detection. In addition we define a new challenge of combined articulated human detection and pose estimation in real-world scenes.

Journal ArticleDOI
TL;DR: This article proposes a distributed technique to perform materialization under the RDFS and OWL ter Horst semantics using the MapReduce programming model and shows that it scales linearly and vastly outperforms current systems in terms of maximum data size and inference speed.

Proceedings ArticleDOI
Jung-Won Kim, Sangmin Seo, Jun Lee, Jeongho Nah, Gangwon Jo, Jaejin Lee
25 Jun 2012
TL;DR: It is shown that the original OpenCL semantics naturally fits to the heterogeneous cluster programming environment, and the framework achieves high performance and ease of programming.
Abstract: In this paper, we propose SnuCL, an OpenCL framework for heterogeneous CPU/GPU clusters. We show that the original OpenCL semantics naturally fits to the heterogeneous cluster programming environment, and the framework achieves high performance and ease of programming. The target cluster architecture consists of a designated, single host node and many compute nodes. They are connected by an interconnection network, such as Gigabit Ethernet and InfiniBand switches. Each compute node is equipped with multicore CPUs and multiple GPUs. A set of CPU cores or each GPU becomes an OpenCL compute device. The host node executes the host program in an OpenCL application. SnuCL provides a system image running a single operating system instance for heterogeneous CPU/GPU clusters to the user. It allows the application to utilize compute devices in a compute node as if they were in the host node. No communication API, such as the MPI library, is required in the application source. SnuCL also provides collective communication extensions to OpenCL to facilitate manipulating memory objects. With SnuCL, an OpenCL application becomes portable not only between heterogeneous devices in a single node, but also between compute devices in the cluster environment. We implement SnuCL and evaluate its performance using eleven OpenCL benchmark applications.

Proceedings ArticleDOI
16 Jun 2012
TL;DR: This paper proposes a fusion algorithm which outputs enhanced metrics by combining multiple given metrics (similarity measures) through a diffusion process in an unsupervised way and has a wide range of applications in machine learning and computer vision.
Abstract: Metric learning is a fundamental problem in computer vision. Different features and algorithms may tackle a problem from different angles, and thus often provide complementary information. In this paper, we propose a fusion algorithm which outputs enhanced metrics by combining multiple given metrics (similarity measures). Unlike traditional co-training style algorithms where multi-view features or multiple data subsets are used for classification or regression, we focus on fusing multiple given metrics through a diffusion process in an unsupervised way. Our algorithm has its particular advantage when the input similarity matrices are the outputs from diverse algorithms. We provide both theoretical and empirical explanations to our method. Significant improvements over the state-of-the-art results have been observed on various benchmark datasets. For example, we have achieved 100% accuracy (no longer the bull's eye measure) on the MPEG-7 shape dataset. Our method has a wide range of applications in machine learning and computer vision.
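A rough sketch of the general idea, fusing similarity matrices by a cross-diffusion process in which each matrix is repeatedly propagated through the other, is given below. The normalization, iteration count, and final averaging are assumptions and should not be read as the paper's exact update rule.

```python
# Hedged sketch of fusing two similarity matrices by cross-diffusion: each matrix is
# diffused through the other and the results are averaged. Normalization, iteration
# count and the final fusion are assumptions, not the paper's exact formulation.
import numpy as np

def row_normalize(W):
    return W / W.sum(axis=1, keepdims=True)

def cross_diffuse(W1, W2, iterations=10):
    S1, S2 = row_normalize(W1), row_normalize(W2)
    P1, P2 = S1.copy(), S2.copy()
    for _ in range(iterations):
        P1_next = S1 @ P2 @ S1.T     # diffuse metric 1 through metric 2
        P2_next = S2 @ P1 @ S2.T     # diffuse metric 2 through metric 1
        P1, P2 = P1_next, P2_next
    return (P1 + P2) / 2.0

# toy example: two noisy similarity matrices over the same 6 items
rng = np.random.default_rng(0)
base = np.eye(6) + 0.3 * np.ones((6, 6))
W1 = base + 0.05 * rng.random((6, 6))
W2 = base + 0.05 * rng.random((6, 6))
W1, W2 = (W1 + W1.T) / 2, (W2 + W2.T) / 2   # keep them symmetric
print(np.round(cross_diffuse(W1, W2), 3))
```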

Journal ArticleDOI
TL;DR: The Mario AI benchmark is described, a game-based benchmark for reinforcement learning algorithms and game AI techniques developed by the authors, intended as the definitive point of reference for those using the benchmark for research or teaching.
Abstract: This paper describes the Mario AI benchmark, a game-based benchmark for reinforcement learning algorithms and game AI techniques developed by the authors. The benchmark is based on a public domain clone of Nintendo's classic platform game Super Mario Bros, and completely open source. During the last two years, the benchmark has been used in a number of competitions associated with international conferences, and researchers and students from around the world have contributed diverse solutions to try to beat the benchmark. The paper summarizes these contributions, gives an overview of the state of the art in Mario-playing AIs, and chronicles the development of the benchmark. This paper is intended as the definitive point of reference for those using the benchmark for research or teaching.

Journal ArticleDOI
TL;DR: This paper compares the performance of RCCRO with a large number of optimization techniques on a large set of standard continuous benchmark functions and finds that RCCRO outperforms all the others on the average, showing that CRO is suitable for solving problems in the continuous domain.
Abstract: Optimization problems can generally be classified as continuous and discrete, based on the nature of the solution space. A recently developed chemical-reaction-inspired metaheuristic, called chemical reaction optimization (CRO), has been shown to perform well in many optimization problems in the discrete domain. This paper is dedicated to proposing a real-coded version of CRO, namely, RCCRO, to solve continuous optimization problems. We compare the performance of RCCRO with a large number of optimization techniques on a large set of standard continuous benchmark functions. We find that RCCRO outperforms all the others on the average. We also propose an adaptive scheme for RCCRO which can improve the performance effectively. This shows that CRO is suitable for solving problems in the continuous domain.

Journal ArticleDOI
TL;DR: In this article, an algorithm that can tackle time dependent vehicle routing problems with hard or soft time windows without any alteration in its structure is presented, and experimental results indicate that average computational time increases proportionally to the number of customers squared.
Abstract: An algorithm that can tackle time dependent vehicle routing problems with hard or soft time windows without any alteration in its structure is presented. Analytical and experimental results indicate that average computational time increases proportionally to the number of customers squared. New replicable test problems that capture the typical speed variations of congested urban settings are proposed. Solution quality, time window perturbations, and computational time results are discussed as well as a method to study the impact of perturbations by problem type. The algorithm's efficiency and simplicity are well suited for urban areas where fast running times may be required.

Journal ArticleDOI
TL;DR: A Variable Neighborhood Search (VNS) procedure based on the idea of exploring, most of the time, granular instead of complete neighborhoods in order to improve the algorithm’s efficiency without losing effectiveness is proposed.

Journal ArticleDOI
TL;DR: The proposed hybrid algorithm is composed by an Iterated Local Search (ILS) based heuristic and a Set Partitioning (SP) formulation, which is solved by means of a Mixed Integer Programming solver that interactively calls the ILS heuristic during its execution.

Journal ArticleDOI
TL;DR: This paper aims at providing a picture – as complete as possible – of the present state of the art in the semi-active suspension control field in terms of comfort and road-holding performance evaluation and trade-off.

Book ChapterDOI
11 Nov 2012
TL;DR: SRBench is introduced, a general-purpose benchmark primarily designed for streaming RDF/SPARQL engines, completely based on real-world data sets from the Linked Open Data cloud, which defines a concise, yet comprehensive set of queries that cover the major aspects of strRS processing.
Abstract: We introduce SRBench, a general-purpose benchmark primarily designed for streaming RDF/SPARQL engines, completely based on real-world data sets from the Linked Open Data cloud. With the increasing problem of too much streaming data but not enough tools to gain knowledge from them, researchers have set out for solutions in which Semantic Web technologies are adapted and extended for publishing, sharing, analysing and understanding streaming data. To help researchers and users compare streaming RDF/SPARQL (strRS) engines in a standardised application scenario, we have designed SRBench, with which one can assess the abilities of a strRS engine to cope with a broad range of use cases typically encountered in real-world scenarios. The data sets used in the benchmark have been carefully chosen, such that they represent a realistic and relevant usage of streaming data. The benchmark defines a concise, yet comprehensive set of queries that cover the major aspects of strRS processing. Finally, our work is complemented with a functional evaluation on three representative strRS engines: SPARQLStream, C-SPARQL and CQELS. The presented results are meant to give a first baseline and illustrate the state-of-the-art.
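For readers unfamiliar with streaming RDF/SPARQL, the sketch below imitates, in plain Python, the kind of sliding-window selection over timestamped triples that engines such as SPARQLStream, C-SPARQL and CQELS evaluate declaratively. The triples and the window width are invented for illustration; this is not SRBench query syntax.

```python
# Hedged sketch: a sliding-window selection over a stream of timestamped RDF-style
# triples. The triples and the 10-second window are illustrative assumptions.
from collections import deque

class SlidingWindow:
    def __init__(self, width_seconds):
        self.width = width_seconds
        self.buffer = deque()            # (timestamp, subject, predicate, object)

    def push(self, timestamp, s, p, o):
        self.buffer.append((timestamp, s, p, o))
        while self.buffer and self.buffer[0][0] <= timestamp - self.width:
            self.buffer.popleft()        # expire triples that fell out of the window

    def match(self, predicate):
        """Return (subject, object) pairs for triples in the window with this predicate."""
        return [(s, o) for (_, s, p, o) in self.buffer if p == predicate]

window = SlidingWindow(width_seconds=10)
stream = [
    (1, "station/42", "hasWindSpeed", 7.2),
    (4, "station/17", "hasWindSpeed", 12.5),
    (9, "station/42", "hasTemperature", 18.0),
    (14, "station/42", "hasWindSpeed", 9.1),
]
for t, s, p, o in stream:
    window.push(t, s, p, o)
    print(f"t={t}: wind readings in last 10s -> {window.match('hasWindSpeed')}")
```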

Proceedings ArticleDOI
11 Jun 2012
TL;DR: This paper presents a dynamic program analysis that supports the programmer in finding accuracy problems; it uses binary translation to perform every floating-point computation side by side in higher precision, and a lightweight slicing approach to track the evolution of errors.
Abstract: Programs using floating-point arithmetic are prone to accuracy problems caused by rounding and catastrophic cancellation. These phenomena provoke bugs that are notoriously hard to track down: the program does not necessarily crash and the results are not necessarily obviously wrong, but often subtly inaccurate. Further use of these values can lead to catastrophic errors. In this paper, we present a dynamic program analysis that supports the programmer in finding accuracy problems. Our analysis uses binary translation to perform every floating-point computation side by side in higher precision. Furthermore, we use a lightweight slicing approach to track the evolution of errors. We evaluate our analysis by demonstrating that it catches well-known floating-point accuracy problems and by analyzing the SPEC CFP2006 floating-point benchmark. In the latter, we show how our tool tracks down a catastrophic cancellation that causes a complete loss of accuracy leading to a meaningless program result. Finally, we apply our program to a complex, real-world bioinformatics application in which our program detected a serious cancellation. Correcting the instability led not only to improved quality of the result, but also to an improvement of the program's run time.
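The core mechanism, executing every floating-point operation both natively and in higher precision and flagging large relative divergence, can be illustrated with a small shadow-value class. The sketch uses Python's decimal module as the shadow precision; the actual tool works on binaries via binary translation, so this is only a conceptual analogue with an assumed error threshold.

```python
# Conceptual analogue of the shadow-value idea: each value carries both its native
# double-precision result and a high-precision "shadow"; arithmetic is done on both,
# and a large relative divergence is reported. Threshold and precision are assumptions.
from decimal import Decimal, getcontext

getcontext().prec = 50

class Shadow:
    def __init__(self, value, shadow=None):
        self.v = float(value)                                   # native double
        self.s = Decimal(value) if shadow is None else shadow   # high precision

    def _check(self, result, label):
        if result.s != 0:
            rel = abs((Decimal(result.v) - result.s) / result.s)
            if rel > Decimal("1e-6"):
                print(f"[warn] {label}: relative error {rel:.2E}")
        return result

    def __add__(self, other):
        return self._check(Shadow(self.v + other.v, self.s + other.s), "add")

    def __sub__(self, other):
        return self._check(Shadow(self.v - other.v, self.s - other.s), "sub")

# classic catastrophic cancellation: (1 + eps) - 1 with eps below double precision
eps = Shadow("1e-20")
one = Shadow(1)
result = (one + eps) - one
print("native:", result.v, " shadow:", result.s)
```

Running this prints a warning on the subtraction: the native result is exactly 0.0 while the shadow retains 1e-20, which is precisely the kind of silent accuracy loss the analysis is designed to surface.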

Proceedings ArticleDOI
24 Jun 2012
TL;DR: This work presents the SmartScale automated scaling framework, a combination of vertical and horizontal scaling that ensures that the application is scaled in a manner that optimizes both resource usage and the reconfiguration cost incurred due to scaling.
Abstract: Enterprise clouds today support an on demand resource allocation model and can provide resources requested by applications in a near online manner using virtual machine resizing or cloning. However, in order to take advantage of an on demand resource model, enterprise applications need to be automatically scaled in a way that makes the most efficient use of resources. In this work, we present the SmartScale automated scaling framework. SmartScale uses a combination of vertical (adding more resources to existing VM instances) and horizontal (adding more VM instances) scaling to ensure that the application is scaled in a manner that optimizes both resource usage and the reconfiguration cost incurred due to scaling. The SmartScale methodology is proactive and ensures that the application converges quickly to the desired scaling level even when the workload intensity changes significantly. We evaluate SmartScale using real production traces on Olio, an emerging cloud benchmark, running on a ???-based cloud testbed. We present both theoretical and experimental evidence that comprehensively establish the effectiveness of SmartScale.
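The trade-off SmartScale optimizes, meeting demand while balancing resource usage against the reconfiguration cost of scaling, can be shown with a toy search over (instance count, instance size) configurations. The cost model and constants below are invented for illustration and are not the paper's formulation.

```python
# Toy illustration of combined vertical + horizontal scaling: pick the number of VM
# instances and the per-instance size that meet demand while balancing resource usage
# against reconfiguration cost. The cost model and constants are invented assumptions.
from itertools import product

def scaling_decision(current_instances, current_size, demand,
                     capacity_per_unit=100, migration_cost=5.0, resize_cost=2.0):
    best, best_cost = None, float("inf")
    for instances, size in product(range(1, 11), range(1, 9)):   # candidate configs
        if instances * size * capacity_per_unit < demand:
            continue                                             # cannot serve demand
        resource_cost = instances * size                         # resource usage
        reconfig_cost = (migration_cost * abs(instances - current_instances)
                         + resize_cost * abs(size - current_size))
        cost = resource_cost + reconfig_cost
        if cost < best_cost:
            best, best_cost = (instances, size), cost
    return best, best_cost

config, cost = scaling_decision(current_instances=2, current_size=2, demand=900)
print("scale to (instances, size):", config, "with cost", cost)
```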

Proceedings ArticleDOI
17 Sep 2012
TL;DR: The PRISM benchmark suite is presented: a collection of probabilistic models and property specifications, designed to facilitate testing, benchmarking and comparisons of Probabilistic verification tools and implementations.
Abstract: We present the PRISM benchmark suite: a collection of probabilistic models and property specifications, designed to facilitate testing, benchmarking and comparisons of probabilistic verification tools and implementations.

Journal ArticleDOI
01 Feb 2012
TL;DR: SharedDB as mentioned in this paper is a new database architecture based on batching queries and shared computation across possibly hundreds of concurrent queries and updates, and is shown to be robust across a wide range of dynamic workloads.
Abstract: Traditional database systems are built around the query-at-a-time model. This approach tries to optimize performance in a best-effort way. Unfortunately, best effort is not good enough for many modern applications. These applications require response time guarantees in high load situations. This paper describes the design of a new database architecture that is based on batching queries and shared computation across possibly hundreds of concurrent queries and updates. Performance experiments with the TPC-W benchmark show that the performance of our implementation, SharedDB, is indeed robust across a wide range of dynamic workloads.
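The batching-and-sharing idea can be sketched compactly: queries that arrive in the same batch are answered from a single shared pass over the data instead of one scan per query. The toy table, query shapes, and batching policy below are illustrative assumptions, not SharedDB's actual execution model.

```python
# Conceptual sketch of batched, shared query processing: queries that arrive in the
# same batch are answered from one shared scan of the table instead of one scan each.
TABLE = [  # toy "orders" table
    {"id": 1, "customer": "alice", "total": 120.0},
    {"id": 2, "customer": "bob",   "total": 35.5},
    {"id": 3, "customer": "alice", "total": 78.0},
    {"id": 4, "customer": "carol", "total": 220.0},
]

def shared_scan(batch):
    """batch: list of (query_id, predicate) pairs; one pass over TABLE serves all."""
    results = {qid: [] for qid, _ in batch}
    for row in TABLE:                      # single shared scan
        for qid, predicate in batch:       # route each row to every matching query
            if predicate(row):
                results[qid].append(row)
    return results

# three concurrent queries answered by one scan
batch = [
    ("q1", lambda r: r["customer"] == "alice"),
    ("q2", lambda r: r["total"] > 100),
    ("q3", lambda r: r["id"] == 2),
]
for qid, rows in shared_scan(batch).items():
    print(qid, "->", [r["id"] for r in rows])
```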

Posted Content
TL;DR: This paper describes the design of a new database architecture that is based on batching queries and shared computation across possibly hundreds of concurrent queries and updates, and shows that the implementation, SharedDB, is robust across a wide range of dynamic workloads.
Abstract: Traditional database systems are built around the query-at-a-time model. This approach tries to optimize performance in a best-effort way. Unfortunately, best effort is not good enough for many modern applications. These applications require response time guarantees in high load situations. This paper describes the design of a new database architecture that is based on batching queries and shared computation across possibly hundreds of concurrent queries and updates. Performance experiments with the TPC-W benchmark show that the performance of our implementation, SharedDB, is indeed robust across a wide range of dynamic workloads.

Journal ArticleDOI
TL;DR: This paper proposes an analysis and comparison of state-of-the-art algorithms for full search equivalent pattern matching and proposes extensions of the evaluated algorithms that show that they outperform the original formulations.
Abstract: Pattern matching is widely used in signal processing, computer vision, and image and video processing. Full search equivalent algorithms accelerate the pattern matching process and, in the meantime, yield exactly the same result as the full search. This paper proposes an analysis and comparison of state-of-the-art algorithms for full search equivalent pattern matching. Our intention is that the data sets and tests used in our evaluation will be a benchmark for testing future pattern matching algorithms, and that the analysis concerning state-of-the-art algorithms could inspire new fast algorithms. We also propose extensions of the evaluated algorithms and show that they outperform the original formulations.
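A full-search-equivalent algorithm must return exactly the same best match as exhaustive search, only faster. The brute-force SSD reference below is that baseline; pruning-based equivalents may skip candidates using bounds but must not change its argmin. Toy data, no claim about the paper's code.

```python
# Reference full-search template matching by sum of squared differences (SSD).
# Full-search-equivalent algorithms must return exactly this argmin, only faster
# (e.g. by pruning candidates with bounds).
import numpy as np

def full_search_ssd(image, template):
    ih, iw = image.shape
    th, tw = template.shape
    best, best_pos = np.inf, None
    for y in range(ih - th + 1):           # every candidate position is evaluated
        for x in range(iw - tw + 1):
            window = image[y:y + th, x:x + tw]
            ssd = np.sum((window - template) ** 2)
            if ssd < best:
                best, best_pos = ssd, (y, x)
    return best_pos, best

rng = np.random.default_rng(0)
image = rng.random((64, 64))
template = image[20:28, 33:41].copy()      # plant the template so the answer is known
pos, score = full_search_ssd(image, template)
print("best match at", pos, "with SSD", score)   # expect (20, 33) with SSD 0.0
```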

Journal ArticleDOI
TL;DR: In this paper, a new heuristic random search algorithm named the state transition algorithm is proposed; for continuous function optimization problems, four special transformation operators called rotation, translation, expansion and axesion are designed.
Abstract: In terms of the concepts of state and state transition, a new heuristic random search algorithm named the state transition algorithm is proposed. For continuous function optimization problems, four special transformation operators called rotation, translation, expansion and axesion are designed. Adjusting measures of the transformations are mainly studied to keep the balance of exploration and exploitation. Convergence analysis of the algorithm is also discussed based on random search theory. Meanwhile, to strengthen the search ability in high-dimensional space, a communication strategy is introduced into the basic algorithm and intermittent exchange is presented to prevent premature convergence. Finally, experiments are carried out on 10 common unconstrained continuous benchmark functions; the results show that state transition algorithms are promising due to their good global search capability and convergence property when compared with some popular algorithms.
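The four operators can be written down compactly. The sketch below follows the general form usually given for the state transition algorithm: rotation within a hypersphere around the current state, translation along the previous search direction, and expansion and axesion as random stretches of all or one coordinate. The exact scaling factors, random-matrix definitions, and parameter values are assumptions rather than a verbatim transcription of the paper.

```python
# Hedged sketch of the four state transition operators (rotation, translation,
# expansion, axesion) in their commonly stated general form. Scaling constants,
# random-matrix definitions and parameter values are assumptions.
import numpy as np

rng = np.random.default_rng(0)

def rotation(x, alpha=1.0):
    """Search within a hypersphere of radius alpha around x."""
    n = len(x)
    R = rng.uniform(-1, 1, size=(n, n))
    return x + alpha * (R @ x) / (n * np.linalg.norm(x) + 1e-12)

def translation(x, x_prev, beta=1.0):
    """Line search along the direction from the previous state to the current one."""
    d = x - x_prev
    return x + beta * rng.uniform(0, 1) * d / (np.linalg.norm(d) + 1e-12)

def expansion(x, gamma=1.0):
    """Stretch every coordinate by an independent Gaussian factor (global search)."""
    return x + gamma * rng.normal(size=len(x)) * x

def axesion(x, delta=1.0):
    """Stretch a single, randomly chosen coordinate (search along one axis)."""
    step = np.zeros(len(x))
    i = rng.integers(len(x))
    step[i] = delta * rng.normal() * x[i]
    return x + step

x_prev = np.array([2.0, -1.0, 0.5])
x = np.array([1.5, -0.8, 0.7])
print(np.round(rotation(x), 3))
print(np.round(translation(x, x_prev), 3))
print(np.round(expansion(x), 3))
print(np.round(axesion(x), 3))
```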

Proceedings ArticleDOI
05 Sep 2012
TL;DR: This work introduces an open benchmark dataset to investigate inertial sensor displacement effects in activity recognition, and introduces a concept of gradual sensor displacement conditions, including ideal, self-placement of a user, and mutual displacement deployments.
Abstract: This work introduces an open benchmark dataset to investigate inertial sensor displacement effects in activity recognition. While sensor position displacements such as rotations and translations have been recognised as a key limitation for the deployment of wearable systems, a realistic dataset is lacking. We introduce a concept of gradual sensor displacement conditions, including ideal, self-placement of a user, and mutual displacement deployments. These conditions were analysed in the dataset considering 33 fitness activities, recorded using 9 inertial sensor units from 17 participants. Our statistical analysis of acceleration features quantified relative effects of the displacement conditions. We expect that the dataset can be used to benchmark and compare recognition algorithms in the future.

Journal ArticleDOI
Ling Wang, Shengyao Wang, Ye Xu, Gang Zhou, Min Liu
TL;DR: Comparisons between BEDA and some existing algorithms as well as the single-population based EDA demonstrate the effectiveness of the proposed BEDA in solving the FJSP.