
Showing papers on "Program transformation published in 2016"


Book ChapterDOI
04 Mar 2016
TL;DR: Hakaru is a new probabilistic programming system that allows composable reuse of distributions, queries, and inference algorithms, all expressed in a single language of measures, and implements two automatic and semantics-preserving program transformations.
Abstract: We present Hakaru, a new probabilistic programming system that allows composable reuse of distributions, queries, and inference algorithms, all expressed in a single language of measures. The system implements two automatic and semantics-preserving program transformations—disintegration, which calculates conditional distributions, and simplification, which subsumes exact inference by computer algebra. We show how these features work together by describing the ideal workflow of a Hakaru user on two small problems. We highlight our composition of transformations and types in design and implementation.
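In measure-theoretic terms, the disintegration transformation computes conditional distributions in the following sense (a standard formulation of disintegration, not Hakaru's concrete syntax): given a joint measure mu on X x Y, it produces the marginal mu_X and a kernel kappa such that

```latex
\int_{X \times Y} f(x,y)\, \mu\bigl(d(x,y)\bigr)
  \;=\; \int_{X}\!\int_{Y} f(x,y)\, \kappa(x)(dy)\, \mu_X(dx)
```

When mu has a density p(x, y), the kernel kappa(x) is exactly the conditional distribution p(. | x), which is what a query for a posterior asks for.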

116 citations


Journal ArticleDOI
TL;DR: This paper singles out classes of canonical models that are amenable to customary bottom-up evaluation of answer set programs, with an option to switch to a paracoherent mode when the lack of an answer set is detected.

43 citations


Journal ArticleDOI
TL;DR: A principled modular approach to the development of construction and verification tools for imperative programs, in which the control flow and the data flow are cleanly separated, is presented.
Abstract: We present a principled modular approach to the development of construction and verification tools for imperative programs, in which the control flow and the data flow are cleanly separated. Our simplest verification tool uses Kleene algebra with tests for the control flow of while-programs and their standard relational semantics for the data flow. It is expanded to a basic program construction tool by adding an operation for the specification statement and one single axiom. To include recursive procedures, Kleene algebras with tests are expanded further to quantales with tests. In this more expressive setting, iteration and the specification statement can be defined explicitly and stronger program transformation rules can be derived. Programming our approach in the Isabelle/HOL interactive theorem prover yields simple lightweight mathematical components as well as program construction and verification tools that are correct by construction themselves. Verification condition generation and program construction rules are based on equational reasoning and supported by powerful Isabelle tactics and automated theorem proving. A number of examples show our tools at work.
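For readers unfamiliar with Kleene algebra with tests (KAT), the standard encodings of structured control flow and of Hoare triples that this line of work builds on are the following (following Kozen's KAT; the paper's exact axiomatization may differ):

```latex
\mathbf{if}\ b\ \mathbf{then}\ p\ \mathbf{else}\ q \;=\; b\,p + \bar{b}\,q,
\qquad
\mathbf{while}\ b\ \mathbf{do}\ p \;=\; (b\,p)^{*}\,\bar{b},
\qquad
\{b\}\; p \;\{c\} \iff b\,p\,\bar{c} = 0 .
```

Here tests b, c are elements below 1, juxtaposition is sequential composition, + is choice, and * is iteration; the equational form of the Hoare triple is what makes verification condition generation a matter of equational reasoning.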

37 citations


Book ChapterDOI
09 Nov 2016
TL;DR: It is proved that TSO is equivalent to a set of two local transformations over sequential consistency, but that non-multi-copy-atomic models (such as C11, Power and ARM) cannot be explained in terms of local transformations over sequential consistency.
Abstract: Weak memory models determine the behavior of concurrent programs. While they are often understood in terms of reorderings that the hardware or the compiler may perform, their formal definitions are typically given in a very different style—either axiomatic or operational. In this paper, we investigate to what extent weak behaviors of existing memory models can be fully explained in terms of reorderings and other program transformations. We prove that TSO is equivalent to a set of two local transformations over sequential consistency, but that non-multi-copy-atomic models (such as C11, Power and ARM) cannot be explained in terms of local transformations over sequential consistency. We then show that transformations over a basic non-multi-copy-atomic model account for the relaxed behaviors of (a large fragment of) Power, but that ARM’s relaxed behaviors cannot be explained in a similar way. Our positive results may be used to simplify correctness of compilation proofs from a high-level language to TSO or Power.
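The classic store-buffering litmus test makes the reordering view concrete. Below is a minimal C sketch (plain relaxed atomics with pthreads, purely for illustration, not a formal memory-model encoding): the final outcome r1 == 0 && r2 == 0 is impossible under sequential consistency but allowed on TSO, and it is explained by a local transformation such as reordering each thread's independent store past its subsequent load.

```c
#include <pthread.h>
#include <stdatomic.h>
#include <stdio.h>

static atomic_int x = 0, y = 0;
static int r1, r2;

static void *thread0(void *arg) {
    (void)arg;
    atomic_store_explicit(&x, 1, memory_order_relaxed);  /* store x */
    r1 = atomic_load_explicit(&y, memory_order_relaxed); /* load y  */
    return NULL;
}

static void *thread1(void *arg) {
    (void)arg;
    atomic_store_explicit(&y, 1, memory_order_relaxed);  /* store y */
    r2 = atomic_load_explicit(&x, memory_order_relaxed); /* load x  */
    return NULL;
}

int main(void) {
    pthread_t t0, t1;
    pthread_create(&t0, NULL, thread0, NULL);
    pthread_create(&t1, NULL, thread1, NULL);
    pthread_join(t0, NULL);
    pthread_join(t1, NULL);
    /* r1 == 0 && r2 == 0 cannot happen under SC; on TSO it can, and the
     * reordered program (load before store in each thread) reproduces it
     * even under a sequentially consistent execution. */
    printf("r1=%d r2=%d\n", r1, r2);
    return 0;
}
```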

30 citations


Proceedings ArticleDOI
14 May 2016
TL;DR: This position paper presents the vision that, by 2025, the use of more rigorous analyses to check the reliability of program analysers will be commonplace, building upon techniques such as cross-checking, program transformation and program generation.
Abstract: The reliability of program analysis tools is clearly important if such tools are to play a serious role in improving the quality and integrity of software systems, and the confidence which users place in such systems. Yet our experience is that, currently, little attention is paid to analysing the correctness of program analysers themselves, beyond regression testing. In this position paper we present our vision that, by 2025, the use of more rigorous analyses to check the reliability of program analysers will be commonplace. Inspired by recent advances in compiler testing, we set out initial steps towards this vision, building upon techniques such as cross-checking, program transformation and program generation.

22 citations


Proceedings ArticleDOI
17 Mar 2016
TL;DR: FuseT as discussed by the authors is a source-to-source code transformation framework to automatically generate fused composite recursive operators from an input program containing a sequence of primitive recursive operators, which can offer significant performance improvement.
Abstract: Loop fusion is a key program transformation for data locality optimization that is implemented in production compilers. But optimizing compilers for imperative languages currently cannot exploit fusion opportunities across a set of recursive tree traversal computations with producer-consumer relationships. In this paper, we develop a compile-time approach to dependence characterization and program transformation to enable fusion across recursively specified traversals over k-d trees. We present the FuseT source-to-source code transformation framework to automatically generate fused composite recursive operators from an input program containing a sequence of primitive recursive operators. We use our framework to implement fused operators for MADNESS, Multi-resolution Adaptive Numerical Environment for Scientific Simulation. We show that locality optimization through fusion can offer significant performance improvement.
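The producer-consumer fusion idea can be pictured on a toy binary tree rather than the k-d trees and MADNESS operators targeted by the paper; the sketch below (hand-written, not FuseT output) shows two primitive recursive operators and the fused composite operator that performs a single traversal instead of two.

```c
typedef struct node {
    double value;
    struct node *left, *right;
} node;

/* Producer: scale every node in place. */
static void scale(node *t, double a) {
    if (!t) return;
    t->value *= a;
    scale(t->left, a);
    scale(t->right, a);
}

/* Consumer: sum every node. */
static double sum(const node *t) {
    if (!t) return 0.0;
    return t->value + sum(t->left) + sum(t->right);
}

/* Fused composite operator: scales and sums in one traversal, so the
 * intermediate (scaled) tree is never re-traversed and stays in cache. */
static double scale_then_sum(node *t, double a) {
    if (!t) return 0.0;
    t->value *= a;
    return t->value + scale_then_sum(t->left, a) + scale_then_sum(t->right, a);
}
```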

21 citations


Proceedings ArticleDOI
20 Oct 2016
TL;DR: This work develops a general symbolic execution technique that handles programs written in expressive high-level transformation languages, and uses logical constraints to describe structured symbolic values, including containment, acyclicity, simple unordered collections (sets) and to handle deep type-based querying of syntax hierarchies.
Abstract: Transformations form an important part of developing domain specific languages, where they are used to provide semantics for typing and evaluation. Yet, few solutions exist for verifying transformations written in expressive high-level transformation languages. We take a step towards that goal by developing a general symbolic execution technique that handles programs written in these high-level transformation languages. We use logical constraints to describe structured symbolic values, including containment, acyclicity, and simple unordered collections (sets), and to handle deep type-based querying of syntax hierarchies. We evaluate this symbolic execution technique on a collection of refactoring and model transformation programs, showing that the white-box test generation tool based on symbolic execution obtains better code coverage than a black-box test generator for such programs in almost all tested cases.

19 citations


Book ChapterDOI
08 Sep 2016
TL;DR: It is shown that deciding a posteriori whether a new leak has been introduced as a result of eliminating dead stores is difficult: it is PSPACE-hard for finite-state programs and undecidable in general.
Abstract: A compiler can be correct and yet be insecure. That is, a compiled program may have the same input-output behavior as the original, and yet leak more information. An example is the commonly applied optimization which removes dead (i.e., useless) stores. It is shown that deciding a posteriori whether a new leak has been introduced as a result of eliminating dead stores is difficult: it is PSPACE-hard for finite-state programs and undecidable in general. In contrast, deciding the correctness of dead store removal is in polynomial time. In response to the hardness result, a sound but approximate polynomial-time algorithm for secure dead store elimination is presented and proved correct. Furthermore, it is shown that for several other compiler transformations, security follows from correctness.
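The optimization discussed in the abstract is easy to picture with the classic secret-scrubbing pattern; read_password and authenticate below are hypothetical helpers declared only to make the sketch self-contained.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helpers, assumed for illustration. */
void read_password(char *buf, size_t len);
void authenticate(const char *password);

void handle_request(void) {
    char password[64];
    read_password(password, sizeof password);
    authenticate(password);
    /* Dead store: `password` is never read again, so a correct optimizer may
     * delete this memset. Input-output behavior is unchanged, but the
     * plaintext secret now lingers in memory, i.e., a new leak appears. */
    memset(password, 0, sizeof password);
}
```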

18 citations


Proceedings ArticleDOI
18 Jan 2016
TL;DR: This paper formally verifies in Coq an advanced code obfuscation called control-flow graph flattening that is used in state-of-the-art program obfuscators; automatic extraction of the verified obfuscator into OCaml yields a program with competitive results.
Abstract: Code obfuscation is emerging as a key asset in security by obscurity. It aims at hiding sensitive information in programs so that they become more difficult to understand and reverse engineer. Since the results on the impossibility of perfect and universal obfuscation, many obfuscation techniques have been proposed in the literature, ranging from simple variable encoding to hiding the control-flow of a program. In this paper, we formally verify in Coq an advanced code obfuscation called control-flow graph flattening, that is used in state-of-the-art program obfuscators. Our control-flow graph flattening is a program transformation operating over C programs, that is integrated into the CompCert formally verified compiler. The semantics preservation proof of our program obfuscator relies on a simulation proof performed on a realistic language, the Clight language of CompCert. The automatic extraction of our program obfuscator into OCaml yields a program with competitive results.
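Control-flow graph flattening replaces structured control flow with a dispatcher loop over explicitly numbered basic blocks. The sketch below is an illustrative hand-flattened example, not output of the verified CompCert-integrated obfuscator described in the paper.

```c
/* Original: structured control flow. */
int gcd(int a, int b) {
    while (b != 0) {
        int t = a % b;
        a = b;
        b = t;
    }
    return a;
}

/* Flattened: a dispatcher switch drives numbered blocks, hiding the loop. */
int gcd_flat(int a, int b) {
    int state = 0, t = 0;
    for (;;) {
        switch (state) {
        case 0:                      /* loop head: test */
            state = (b != 0) ? 1 : 2;
            break;
        case 1:                      /* loop body */
            t = a % b;
            a = b;
            b = t;
            state = 0;
            break;
        case 2:                      /* exit */
            return a;
        }
    }
}
```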

17 citations


Proceedings ArticleDOI
01 Feb 2016
TL;DR: This paper presents a technical study review of code obfuscation, and analyzes the different techniques which are used to thwart reverse engineers and to protect against malicious code injection and attacks.
Abstract: This paper presents a technical study review of code obfuscation. To address this, we discuss the need for and methods of code obfuscation. We analyze the different techniques which are used to thwart reverse engineers and to protect against malicious code injection and attacks. Obfuscation, in software technology, is the deliberate act of creating obfuscated code that is difficult for humans to understand. Code obfuscation is a protective mechanism used to reduce attack activities on a software system. It is a behavior-preserving program transformation which aims to make a program unintelligible to automated program comprehension tools. Code obfuscation is useful in situations where relying on cryptographic techniques alone is not enough; this is common in remote execution scenarios where the software runs in an unforeseen, exposed, hostile environment, such as newer computing platforms like the cloud-computing paradigm and smartphones.
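One of the simplest techniques surveyed is variable encoding. The toy sketch below (a generic illustration, not taken from the paper) stores a loop counter i only in the affine-encoded form e = 7*i + 3, preserving behavior while making the code harder to read; overflow for very large n is ignored in this sketch.

```c
/* Original. */
int sum_to(int n) {
    int s = 0;
    for (int i = 0; i < n; i++)
        s += i;
    return s;
}

/* Obfuscated: i is never stored directly, only e = 7*i + 3. */
int sum_to_obf(int n) {
    int s = 0;
    for (int e = 3; e < 7 * n + 3; e += 7)  /* i < n  <=>  e < 7n + 3 */
        s += (e - 3) / 7;                   /* decode i from e */
    return s;
}
```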

16 citations


Proceedings ArticleDOI
05 Sep 2016
TL;DR: In this article, a denotational semantics for an abstract effect system for a higher-order, shared-variable concurrent language is presented, including sufficient conditions for replacing sequential composition with parallel composition.
Abstract: We describe a denotational semantics for an abstract effect system for a higher-order, shared-variable concurrent language. The semantics validates general effect-based program equivalences, including sufficient conditions for replacing sequential composition with parallel composition. Effect annotations refer to abstract locations, specified by contracts, rather than physical footprints, allowing us to also show soundness of some transformations involving fine-grained concurrent data structures, such as Michael-Scott queues. We build on a trace-based semantics for first-order programs due to Brookes. By moving from concrete to abstract locations, and adding type refinements capturing possible side-effects of both expressions and their environments, we can validate many equivalences that do not hold in an unrefined model. Refined types are interpreted using a game-based logical relation over sets of traces.
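The kind of sufficient condition mentioned above can be written schematically as an effect-indexed equivalence over abstract locations (an illustrative rule shape, not the authors' precise formulation): if neither computation writes an abstract location that the other reads or writes, sequential and parallel composition are interchangeable.

```latex
\frac{\Gamma \vdash e_1 : \tau_1 \,\&\, \varepsilon_1
      \qquad \Gamma \vdash e_2 : \tau_2 \,\&\, \varepsilon_2
      \qquad \mathrm{wr}(\varepsilon_1) \cap \bigl(\mathrm{rd}(\varepsilon_2)\cup\mathrm{wr}(\varepsilon_2)\bigr) = \emptyset
      \qquad \mathrm{wr}(\varepsilon_2) \cap \bigl(\mathrm{rd}(\varepsilon_1)\cup\mathrm{wr}(\varepsilon_1)\bigr) = \emptyset}
     {e_1 \,;\, e_2 \;\simeq\; e_1 \parallel e_2}
```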

Book ChapterDOI
04 Mar 2016
TL;DR: A novel resource analysis whose aim is instead the static profiling of accumulated cost, i.e., to discover, for selected parts of the program, an estimate or bound of the resource usage accumulated in each of those parts.
Abstract: Traditional static resource analyses estimate the total resource usage of a program, without executing it. In this paper we present a novel resource analysis whose aim is instead the static profiling of accumulated cost, i.e., to discover, for selected parts of the program, an estimate or bound of the resource usage accumulated in each of those parts. Traditional resource analyses are parametric in the sense that the results can be functions on input data sizes. Our static profiling is also parametric, i.e., our accumulated cost estimates are also parameterized by input data sizes. Our proposal is based on the concept of cost centers and a program transformation that allows the static inference of functions that return bounds on these accumulated costs depending on input data sizes, for each cost center of interest. Such information is much more useful to the software developer than the traditional resource usage functions, as it allows identifying the parts of a program that should be optimized, because of their greater impact on the total cost of program executions. We also report on our implementation of the proposed technique using the CiaoPP program analysis framework, and provide some experimental results.

DOI
01 Jan 2016
TL;DR: This paper introduces a conservative extension of term rewriting that becomes reversible, a computation model that underlies most rule-based programming languages, and defines a transformation to make a rewrite system reversible using standard term rewriting.
Abstract: Essentially, in a reversible programming language, for each forward computation step from state S to state S', there exists a constructive and deterministic method to go backwards from state S' to state S. Besides its theoretical interest, reversible computation is a fundamental concept which is relevant in many different areas like cellular automata, bidirectional program transformation, or quantum computing, to name a few. In this paper, we focus on term rewriting, a computation model that underlies most rule-based programming languages. In general, term rewriting is not reversible, even for injective functions; namely, given a rewrite step t1 -> t2, we do not always have a decidable and deterministic method to get t1 from t2. Here, we introduce a conservative extension of term rewriting that becomes reversible. Furthermore, we also define a transformation to make a rewrite system reversible using standard term rewriting.
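The flavor of the construction can be conveyed with addition on Peano numerals (a schematic illustration of instrumenting steps with a trace, not the authors' exact definitions): the plain rules lose the information of which rule fired, while the instrumented rules pair each term with a trace whose entries record the applied rule (together with its position and any bindings it erases), making the backward relation deterministic.

```latex
\begin{array}{ll}
\text{plain:} &
  \mathit{add}(0, y) \to y, \qquad
  \mathit{add}(s(x), y) \to s(\mathit{add}(x, y)) \\[6pt]
\text{instrumented:} &
  \langle \mathit{add}(0, y),\ \pi \rangle \rightharpoonup \langle y,\ \beta_1 : \pi \rangle, \qquad
  \langle \mathit{add}(s(x), y),\ \pi \rangle \rightharpoonup \langle s(\mathit{add}(x, y)),\ \beta_2 : \pi \rangle
\end{array}
```

Reading each instrumented rule from right to left, guided by the head of the trace, gives the backward system.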

Book ChapterDOI
01 Jan 2016
TL;DR: A relational semantics for the effect system establishes the soundness of both the analysis and its use in effect-based program transformations.
Abstract: We give a simple effect system for non-deterministic programs, tracking static approximations to the number of results that may be produced by each computation. A relational semantics for the effect system establishes the soundness of both the analysis and its use in effect-based program transformations.

Posted Content
TL;DR: This work proposes a novel, general, and flexible framework for setting up cost equations/relations which can be instantiated for performing a wide range of resource usage analyses, including both static profiling and the inference of the standard notion of cost.
Abstract: Traditional static resource analyses estimate the total resource usage of a program, without executing it. In this paper we present a novel resource analysis whose aim is instead the static profiling of accumulated cost, i.e., to discover, for selected parts of the program, an estimate or bound of the resource usage accumulated in each of those parts. Traditional resource analyses are parametric in the sense that the results can be functions on input data sizes. Our static profiling is also parametric, i.e., our accumulated cost estimates are also parameterized by input data sizes. Our proposal is based on the concept of cost centers and a program transformation that allows the static inference of functions that return bounds on these accumulated costs depending on input data sizes, for each cost center of interest. Such information is much more useful to the software developer than the traditional resource usage functions, as it allows identifying the parts of a program that should be optimized, because of their greater impact on the total cost of program executions. We also report on our implementation of the proposed technique using the CiaoPP program analysis framework, and provide some experimental results. This paper is under consideration for acceptance in TPLP.

Book ChapterDOI
08 Sep 2016
TL;DR: This work addresses the problem of verifying concurrent programs under store-buffer-based weakly consistent memory models, such as TSO or PSO, using the abstract interpretation framework, and obtains a sound abstraction of program states parameterised by a numerical domain.
Abstract: We address the problem of verifying concurrent programs under store-buffer-based weakly consistent memory models, such as TSO or PSO. Using the abstract interpretation framework, we adapt existing domains for arrays to model store buffers and obtain a sound abstraction of program states (including the case of programs with infinite state space) parameterised by a numerical domain. Whereas the usual method for this kind of programs implements a program transformation to come back to an analysis under a sequentially consistent model, the novelty of our work consists in applying abstract interpretation directly on the source program, setting a clean foundation for special dedicated domains keeping information difficult to express with program transformations. We demonstrate the precision of this method on a few examples, tar-getting the TSO model and incidentally being also sound for PSO due to some specific abstraction choice. We discuss an application to fence removal and show that our implementation is usually able to remove as many or more fences, with respect to the state of the art, on concurrent algorithms designed for sequential consistency while still remaining precise enough to verify them.

Journal ArticleDOI
TL;DR: In this article, the authors propose a framework for setting up cost equations/relations which can be instantiated for performing a wide range of resource usage analyses, including both static profiling and the inference of the standard notion of cost.
Abstract: For some applications, standard resource analyses do not provide the information required. Such analyses estimate the total resource usage of a program (without executing it) as functions on input data sizes. However, some applications require knowing how such total resource usage is distributed over selected parts of a program. We propose a novel, general, and flexible framework for setting up cost equations/relations which can be instantiated for performing a wide range of resource usage analyses, including both static profiling and the inference of the standard notion of cost. We extend and generalize standard resource analysis techniques, so that the relations generated include additional Boolean control variables for switching on or off different terms in the relations, as required by the desired resource usage profile. We also instantiate our framework to perform static profiling of accumulated cost (also parameterized by input data sizes). Such information is much more useful to the software developer than the standard notion of cost: it identifies the parts of the program that have the greatest impact on the total program cost, and which therefore should be optimized first. We also report on an implementation of our framework within the CiaoPP system, and its instantiation for accumulated cost, and provide some experimental results. In addition to generality, our new method brings important advantages over our previous approach based on a program transformation, including support for non-deterministic programs, better and easier integration in the compiler, and higher efficiency.
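As a schematic illustration of the role of the Boolean control variables (our notation, not the paper's), a predicate p that performs one unit of its own work per call and calls q on each of its n inputs could give rise to cost relations of the form

```latex
C_p(0) = b_p\, c_0, \qquad
C_p(n) = b_p\, c_1 + C_q(n) + C_p(n-1) \quad (n > 0)
```

Setting b_p = 1 and the control variables of all other cost centers to 0 yields the cost accumulated in p alone, while setting every control variable to 1 recovers the standard notion of total cost.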

Proceedings ArticleDOI
01 Nov 2016
TL;DR: An intra-procedural data-flow analysis is presented to track where each array variable may point and which variables are possibly aliased; the resulting JavaScript program respects MATLAB's value semantics and is shown to perform fewer run-time copies than some alternative approaches.
Abstract: Compiling MATLAB---a dynamic, array-based language---to JavaScript is an attractive proposal: the output code can be deployed on a platform used by billions and can leverage the countless hours that have gone into making JavaScript JIT engines fast. But before that can happen, the original MATLAB code must be properly translated, making sure to bridge the semantic gaps of the two languages. An important area where MATLAB and JavaScript differ is in their handling of arrays: for example, in MATLAB, arrays are one-indexed and writing at an index beyond the end of an array extends it; in JavaScript, typed arrays are zero-indexed and writing out of bounds is a no-op. A MATLAB-to-JavaScript compiler must address these mismatches. Another salient and pervasive difference between the two languages is the assignment of arrays to variables: in MATLAB, this operation has value semantics, while in JavaScript it has reference semantics. In this paper, we present MatJuice --- a source-to-source, ahead-of-time compiler back-end for MATLAB --- and how it deals efficiently with this last issue. We present an intra-procedural data-flow analysis to track where each array variable may point to and which variables are possibly aliased. We also present the associated copy insertion transformation that uses the points-to information to insert explicit copies when necessary. The resulting JavaScript program respects the MATLAB value semantics and we show that it performs fewer run-time copies than some alternative approaches.
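The value-versus-reference mismatch and the copy insertion idea can be illustrated in C (MatJuice itself emits JavaScript; this is only a conceptual sketch): returning the struct as-is shares the underlying storage, which is what a plain reference assignment would do, while the transformed version inserts an explicit copy wherever the alias analysis cannot rule out a later mutation of the source.

```c
#include <stdlib.h>
#include <string.h>

typedef struct { double *data; size_t len; } array;

/* Reference semantics: the result aliases `a`; a later write through `a`
 * would be visible through the copy, violating MATLAB's value semantics. */
static array assign_by_reference(array a) { return a; }

/* Value semantics restored by an inserted explicit copy; the points-to and
 * alias analysis decides at which assignments such a copy is required.
 * (Allocation failure is not checked in this sketch.) */
static array assign_with_copy(array a) {
    array b;
    b.len = a.len;
    b.data = malloc(a.len * sizeof *a.data);
    memcpy(b.data, a.data, a.len * sizeof *a.data);
    return b;
}
```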

Proceedings ArticleDOI
01 Feb 2016
TL;DR: This paper presents a transformation method for functional programs defined over any recursive data types that encodes the inputs of a program so that the transformed program is more likely to contain instances of polytypic fold skeletons and less likely to contain inefficient intermediate data structures.
Abstract: Programs that operate over recursive data structures may contain potential parallel computations. Writing parallel programs, even when aided by parallel skeletons, is very challenging, requires intricate analysis of the underlying algorithm and often uses inefficient intermediate data structures. Very few automated parallelisation methods that address a wide range of programs and data types exist. In this paper, we present a transformation method for functional programs defined over any recursive data types. Our method encodes the inputs of a program so that the transformed program is more likely to contain instances of polytypic fold skeletons, and less likely to contain inefficient intermediate data structures. With parallel implementations for these skeletons, the transformed programs can potentially be evaluated on hardware such as multi-core CPUs and/or GPUs.

Journal ArticleDOI
TL;DR: A technique is presented to model ping-pong protocols by prefix grammars in such a way that the security of the protocol models becomes decidable by a general-purpose program transformation tool with unfolding.

Proceedings ArticleDOI
01 Jan 2016
TL;DR: A declarative program transformation language, PATL, for adapting Java programs between different APIs that has simple syntax and basic semantics to handle transformations only considering consecutive statements inside basic blocks, while with guided-normalization, it can be extended to handle complex forms of invocations.
Abstract: Transforming programs between two APIs or different versions of the same API is a common software engineering task. However, existing languages supporting such transformations cannot satisfactorily handle the cases where the relations between elements in the old API and the new API are many-to-many mappings: multiple invocations to the old API are supposed to be replaced by multiple invocations to the new API. Since the multiple invocations of the original APIs may not appear consecutively and the variables in these calls may have different names, writing a tool that correctly covers all such invocation cases is not an easy task. In this paper we propose a novel guided-normalization approach to address this problem. Our core insight is that programs in different forms can be semantics-equivalently normalized into a basic form guided by transformation goals, and developers only need to write rules for the basic form to address the transformation. Based on this approach, we design a declarative program transformation language, PATL, for adapting Java programs between different APIs. PATL has simple syntax and basic semantics to handle transformations only considering consecutive statements inside basic blocks, while with guided-normalization it can be extended to handle complex forms of invocations. Furthermore, PATL ensures that the user-written rules would not accidentally break def-use relations in the program. We formalize the semantics of PATL on Middleweight Java and prove the semantics-preserving property of guided-normalization. We also evaluated our language with three non-trivial case studies: updating Google Calendar API, switching from JDom to Dom4j, and switching from Swing to SWT. The result is encouraging; it shows that our language allows successful transformations of real-world programs with a small number of rules and little manual resolution.

Book ChapterDOI
06 Sep 2016
TL;DR: In this article, the authors present a transformation that enables bounded model checkers to verify a certain class of array properties, which reduces the resource requirements of a model checker significantly.
Abstract: Bounded Model Checking is one of the most successful techniques for finding bugs in programs. However, model checkers are resource hungry and are often unable to verify programs with loops iterating over large arrays. We present a transformation that enables bounded model checkers to verify a certain class of array properties. Our technique transforms an array-manipulating (Ansi-C) program to an array-free and loop-free (Ansi-C) program, thereby reducing the resource requirements of a model checker significantly. Model checking of the transformed program using an off-the-shelf bounded model checker simulates the loop iterations efficiently. Thus, our transformed program is a sound abstraction of the original program and is also precise in a large number of cases; we formally characterize the class of programs for which it is guaranteed to be precise. We demonstrate the applicability and usefulness of our technique on both industry code and academic benchmarks.
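One common shape such a transformation can take is sketched below (an illustrative rendition, not the authors' exact translation scheme): the array and its loops are replaced by a single nondeterministically chosen witness index, so the model checker neither unrolls the loops nor models every array cell. nondet_uint and __VERIFIER_assume stand for verifier-style primitives assumed here for the sketch.

```c
#include <assert.h>

extern unsigned nondet_uint(void);        /* assumed verifier primitive */
extern void __VERIFIER_assume(int cond);  /* assumed verifier primitive */

#define N 100000

/* Original: loops over a large array. */
void original(void) {
    int a[N];
    for (unsigned i = 0; i < N; i++)
        a[i] = 2 * i;
    for (unsigned i = 0; i < N; i++)
        assert(a[i] >= 0);
}

/* Transformed: array-free and loop-free.  Only the cell selected by the
 * witness index k is tracked; proving the assertion for an arbitrary k
 * covers every cell of the original array. */
void transformed(void) {
    unsigned k = nondet_uint();
    __VERIFIER_assume(k < N);
    int a_k = 2 * k;      /* value the original loop would store at index k */
    assert(a_k >= 0);
}
```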

Journal ArticleDOI
TL;DR: It is shown how the semantics of most basic components of an efficient virtual machine for Prolog can be described using (a variant of) Prolog, and how these descriptions are compiled to C and assembled to build a complete bytecode emulator.
Abstract: In order to achieve competitive performance, abstract machines for Prolog and related languages end up being large and intricate, and incorporate sophisticated optimizations, both at the design and at the implementation levels. At the same time, efficiency considerations make it necessary to use low-level languages in their implementation. This makes them laborious to code, optimize, and, especially, maintain and extend. Writing the abstract machine (and ancillary code) in a higher-level language can help tame this inherent complexity. We show how the semantics of most basic components of an efficient virtual machine for Prolog can be described using (a variant of) Prolog. These descriptions are then compiled to C and assembled to build a complete bytecode emulator. Thanks to the high level of the language used and its closeness to Prolog, the abstract machine description can be manipulated using standard Prolog compilation and optimization techniques with relative ease. We also show how, by applying program transformations selectively, we obtain abstract machine implementations whose performance can match and even exceed that of state-of-the-art, highly-tuned, hand-crafted emulators.

Proceedings ArticleDOI
23 Sep 2016
TL;DR: This work uses a refactoring approach to choose and introduce transformations for specific algorithmic skeletons, structured forms of parallelism that capture common patterns of parallelisation.
Abstract: As part of ongoing research into programmer-in-the-loop parallelisation, we are studying the problem of automatically introducing alternative data structures to support parallelism. Our goal is to make it easier to produce the best parallelisation for some given program, or even to make parallelisation feasible. We use a refactoring approach to choose and introduce these transformations for specific algorithmic skeletons, structured forms of parallelism that capture common patterns of parallelism.

Journal ArticleDOI
TL;DR: This work presents a program transformation environment, implemented in Haskell, where architecture-agnostic scientific C code with semantic annotations is transformed into functionally equivalent code better suited for a given platform.
Abstract: Obtaining good performance when programming heterogeneous computing platforms poses significant challenges for the programmer. We present a program transformation environment, implemented in Haskell, where architecture-agnostic scientific C code with semantic annotations is transformed into functionally equivalent code better suited for a given platform. The transformation steps are formalized (and implemented) as rules which can be fired when certain syntactic and semantic conditions are met. These conditions are to be fulfilled by program properties which can be automatically inferred or, alternatively, stated as annotations in the source code. Rule selection can be guided by heuristics derived from a machine learning procedure which tries to capture how run-time characteristics (e.g., resource consumption or performance) are affected by the transformation steps.

Journal ArticleDOI
TL;DR: An abstract interpretation-based bound inference approach with symbolic analysis for Whiley programs that combines the bound consistency technique with a widening operator to solve program constraints quickly and to converge to the fixed point.
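For reference, the widening operator that this kind of bound inference typically relies on is, in the classical interval domain, defined as follows (the textbook definition, not necessarily the exact operator used for Whiley): bounds that grow between iterations are pushed to infinity so that the fixed-point computation terminates quickly.

```latex
[a, b]\ \nabla\ [c, d] \;=\; [\,l,\ u\,],
\qquad
l = \begin{cases} a & \text{if } c \ge a \\ -\infty & \text{if } c < a \end{cases},
\qquad
u = \begin{cases} b & \text{if } d \le b \\ +\infty & \text{if } d > b \end{cases}
```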

Proceedings ArticleDOI
16 May 2016
TL;DR: This paper proposes to only specify the essence of the computation on the level of individual matrix elements, which allows code that performs scatter/gather operations, matrix reordering, matrix data structure handling, handling of fill-in, etc., to be generated automatically.
Abstract: Existing high-performance implementations of sparse matrix codes are intricate and result in large code bases. In fact, a single floating-point operation requires 400 to 600 lines of additional code to "prepare" this operation. This imbalance severely obscures code development, thereby complicating maintenance and portability. In this paper, we propose a drastically different approach in order to continue to effectively handle these codes. We propose to only specify the essence of the computation on the level of individual matrix elements. All additional source code to embed these computations is then generated and optimized automatically by the compiler. This approach is far superior to existing library approaches and allows code that performs scatter/gather operations, matrix reordering, matrix data structure handling, handling of fill-in, etc., to be generated automatically. Experiments show that very efficient data structures can be generated and the resulting codes can be very competitive.
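The contrast between the "essence" and the surrounding data-structure code can be seen on sparse matrix-vector multiplication. The sketch below is a generic illustration (not output of the system described): the first routine is the per-element essence written as if the matrix were dense; the second is the kind of compressed-sparse-row traversal code that the compiler is meant to generate automatically around that essence.

```c
/* Per-element essence: y[i] += A[i][j] * x[j], written as if A were dense. */
void spmv_essence(int n, double A[n][n], const double *x, double *y) {
    for (int i = 0; i < n; i++)
        for (int j = 0; j < n; j++)
            y[i] += A[i][j] * x[j];
}

/* Generated-style code: A stored in compressed sparse row (CSR) format,
 * with row_ptr/col_idx/val holding only the nonzero structure. */
void spmv_csr(int n, const int *row_ptr, const int *col_idx,
              const double *val, const double *x, double *y) {
    for (int i = 0; i < n; i++)
        for (int k = row_ptr[i]; k < row_ptr[i + 1]; k++)
            y[i] += val[k] * x[col_idx[k]];
}
```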

Posted Content
TL;DR: This work presents a technique for checking the validity of Java assertions using an arbitrary automated test-case generator, and demonstrates not just the applicability of the proposal but also that it can get better coverage than the same test-case generators without the transformation.
Abstract: We present a technique for checking the validity of Java assertions using an arbitrary automated test-case generator. Our framework transforms the program by introducing code that detects whether the assertion conditions are met by every direct and indirect method call within a certain depth level. Then, any automated test-case generator can be used to look for input examples that falsify the conditions. The program transformation ensures that the value obtained for these inputs represents a path of method calls that ends with a violation of some assertion. We show experiments with two different automatic test-case generators that demonstrate not just the applicability of our proposal but also that we can get a better coverage than the same test-case generators without our transformation.

Proceedings ArticleDOI
01 Nov 2016
TL;DR: This paper introduces a verification-guided method to automatically identify program blocks which lend themselves to easy approximations, while not compromising significantly on program correctness, based on identifying regions of code which are less influential for the computation of the program outputs.
Abstract: In recent times, approximate computing is being looked at as a viable alternative for reducing the energy consumption of programs, while marginally compromising on the correctness of their computation. The idea behind approximate computing is to introduce approximations at various levels of the execution stack, with an attempt to realize the resource-hungry computations on low-resource approximate hardware blocks. However, approximate computing for program transformation faces a serious challenge: automatically identifying core program areas/statements where approximations can be introduced, with a quantifiable measure of the resulting compromise in program correctness. Introducing approximations randomly can cause performance deterioration without much energy advantage, which is undesirable. In this paper, we introduce a verification-guided method to automatically identify program blocks which lend themselves to easy approximations, while not compromising significantly on program correctness. Our method is based on identifying regions of code which are less influential for the computation of the program outputs and can therefore be approximated, while still offering the potential for significant resource reduction. We use assertions to quantify the effect of the resulting transformations on program outputs. We show experimental results to support our proposal.
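The role of the assertion can be pictured with a simple sketch (illustrative only, not the authors' tool flow): a candidate block is approximated, here by loop perforation, and an assertion bounds how far the approximate output may drift from the exact one; the verification step checks whether that assertion can be violated, and if so the block is left unapproximated. The perforation step and the TOLERANCE bound are hypothetical choices made for this sketch.

```c
#include <assert.h>
#include <math.h>
#include <stddef.h>

double mean_exact(const double *a, size_t n) {      /* assumes n > 0 */
    double s = 0.0;
    for (size_t i = 0; i < n; i++)
        s += a[i];
    return s / (double)n;
}

/* Approximate candidate: the loop is perforated (every other element). */
double mean_approx(const double *a, size_t n) {      /* assumes n > 0 */
    double s = 0.0;
    size_t used = 0;
    for (size_t i = 0; i < n; i += 2) {
        s += a[i];
        used++;
    }
    return s / (double)used;
}

/* Assertion quantifying the correctness compromise of the transformed block;
 * TOLERANCE is an application-chosen bound, hypothetical here. */
#define TOLERANCE 0.05
void check(const double *a, size_t n) {
    assert(fabs(mean_exact(a, n) - mean_approx(a, n)) <= TOLERANCE);
}
```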

Journal ArticleDOI
TL;DR: A machine learning-based approach to learn heuristics for defining transformation strategies of a program transformation system and a novel combination of reinforcement learning and classification methods to efficiently tackle the problems inherent to heterogeneous systems.
Abstract: The current trend in next-generation exascale systems goes towards integrating a wide range of specialized (co-)processors into traditional supercomputers. However, the integration of different specialized devices increases the degree of heterogeneity and the complexity in programming such type of systems. Due to the efficiency of heterogeneous systems in terms of Watt and FLOPS per surface unit, opening the access of heterogeneous platforms to a wider range of users is an important problem to be tackled. In order to bridge the gap between heterogeneous systems and programmers, in this paper we propose a machine learning-based approach to learn heuristics for defining transformation strategies of a program transformation system. Our approach proposes a novel combination of reinforcement learning and classification methods to efficiently tackle the problems inherent to this type of systems. Preliminary results demonstrate the suitability of the approach for easing the programmability of heterogeneous systems.