Home
/
Authors
/
Aviral Goel

Author

Aviral Goel

Other affiliations: Netaji Subhas Institute of Technology

Bio: Aviral Goel is an academic researcher from Northeastern University. The author has contributed to research in topics: Compiler & R Programming Language. The author has an hindex of 3, co-authored 10 publications receiving 27 citations. Previous affiliations of Aviral Goel include Netaji Subhas Institute of Technology.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Correctness of speculative optimizations with dynamic deoptimization

[...]

Olivier Flückiger¹, Gabriel Scherer², Ming-Ho Yee¹, Aviral Goel¹, Amal Ahmed¹, Jan Vitek³ - Show less +2 more•Institutions (3)

Northeastern University¹, French Institute for Research in Computer Science and Automation², Czech Technical University in Prague³

27 Dec 2017

TL;DR: In this paper, the authors show that reasoning about speculation is surprisingly easy when assumptions are made explicit in the program representation, and show that traditional compiler optimizations such as constant folding, unreachable code elimination, and function inlining are correct in the presence of assumptions.

...read moreread less

Abstract: High-performance dynamic language implementations make heavy use of speculative optimizations to achieve speeds close to statically compiled languages. These optimizations are typically performed by a just-in-time compiler that generates code under a set of assumptions about the state of the program and its environment. In certain cases, a program may execute code compiled under assumptions that are no longer valid. The implementation must then deoptimize the program on-the-fly; this entails finding semantically equivalent code that does not rely on invalid assumptions, translating program state to that expected by the target code, and transferring control. This paper looks at the interaction between optimization and deoptimization, and shows that reasoning about speculation is surprisingly easy when assumptions are made explicit in the program representation. This insight is demonstrated on a compiler intermediate representation, named sourir, modeled after the high-level representation for a dynamic language. Traditional compiler optimizations such as constant folding, unreachable code elimination, and function inlining are shown to be correct in the presence of assumptions. Furthermore, the paper establishes the correctness of compiler transformations specific to deoptimization: namely unrestricted deoptimization, predicate hoisting, and assume composition.

...read moreread less

17 citations

Journal Article•DOI•

Correctness of Speculative Optimizations with Dynamic Deoptimization

[...]

Olivier Flückiger¹, Gabriel Scherer¹, Ming-Ho Yee¹, Aviral Goel¹, Amal Ahmed¹, Jan Vitek¹ - Show less +2 more•Institutions (1)

Northeastern University¹

08 Nov 2017-arXiv: Programming Languages

TL;DR: In this article, the interaction between optimization and deoptimization is investigated, and it is shown that reasoning about speculation is surprisingly easy when assumptions are made explicit in the program representation.

...read moreread less

Abstract: High-performance dynamic language implementations make heavy use of speculative optimizations to achieve speeds close to statically compiled languages. These optimizations are typically performed by a just-in-time compiler that generates code under a set of assumptions about the state of the program and its environment. In certain cases, a program may execute code compiled under assumptions that are no longer valid. The implementation must then deoptimize the program on-the-fly; this entails finding semantically equivalent code that does not rely on invalid assumptions, translating program state to that expected by the target code, and transferring control. This paper looks at the interaction between optimization and deoptimization, and shows that reasoning about speculation is surprisingly easy when assumptions are made explicit in the program representation. This insight is demonstrated on a compiler intermediate representation, named \sourir, modeled after the high-level representation for a dynamic language. Traditional compiler optimizations such constant folding, dead code elimination, and function inlining are shown to be correct in the presence of assumptions. Furthermore, the paper establishes the correctness of compiler transformations specific to deoptimization: namely unrestricted deoptimization, predicate hoisting, and assume composition.

...read moreread less

9 citations

Proceedings Article•

Correctness of Speculative Optimizations with Dynamic Deoptimization

[...]

Olivier Flückiger¹, Gabriel Scherer¹, Ming-Ho Yee¹, Aviral Goel¹, Amal Ahmed¹, Jan Vitek¹ - Show less +2 more•Institutions (1)

Northeastern University¹

07 Jan 2018

TL;DR: In this paper, the authors show that reasoning about speculation is surprisingly easy when assumptions are made explicit in the program representation, and show that traditional compiler optimizations such as constant folding, dead code elimination and function inlining are correct in the presence of assumption.

...read moreread less

Abstract: High-performance dynamic language implementations make heavy use of speculative optimizations to achieve speeds close to statically compiled languages. These optimizations are typically performed by a just-in-time compiler that generates code under a set of assumptions about the state of the program and its environment. In certain cases, a program may execute code compiled under assumptions that are no longer valid. The implementation must then deoptimize the program on-the-fly; this entails finding a semantically equivalent code fragment that does not rely on invalid assumptions, translating program state to that expected by the target code, and transferring control. This paper looks at the interaction between optimization and deoptimization, and shows that reasoning about speculation is surprisingly easy when assumptions are made explicit in the program representation. This insight is demonstrated on a compiler intermediate representation, named sourir, modeled after the high-level representation for a dynamic language. Traditional compiler optimizations such constant folding, dead code elimination and function inlining are shown to be correct in the presence of assumption. Furthermore, the paper establishes the correctness of compiler transformations specific to deoptimization: namely unrestricted deoptimization, predicate hoisting and assume composition.

...read moreread less

9 citations

Journal Article•DOI•

On the design, implementation, and use of laziness in R

[...]

Aviral Goel¹, Jan Vitek²•Institutions (2)

Northeastern University¹, Czech Technical University in Prague²

10 Oct 2019

TL;DR: The authors present a review of the design and implementation of call-by-need in R, and a data-driven study of how generations of programmers have put laziness to use in their code.

...read moreread less

Abstract: The R programming language has been lazy for over twenty-five years. This paper presents a review of the design and implementation of call-by-need in R, and a data-driven study of how generations of programmers have put laziness to use in their code. We analyze 16,707 packages and observe the creation of 270.9 B promises. Our data suggests that there is little supporting evidence to assert that programmers use laziness to avoid unnecessary computation or to operate over infinite data structures. For the most part R code appears to have been written without reliance on, and in many cases even knowledge of, delayed argument evaluation. The only significant exception is a small number of packages which leverage call-by-need for meta-programming.

...read moreread less

6 citations

Journal Article•DOI•

On the Design, Implementation, and Use of Laziness in R.

[...]

Aviral Goel¹, Jan Vitek²•Institutions (2)

Northeastern University¹, Czech Technical University in Prague²

19 Sep 2019-arXiv: Programming Languages

TL;DR: This paper present a review of the design and implementation of call-by-need in R, and a data-driven study of how generations of programmers have put laziness to use in their code.

...read moreread less

4 citations

Cited by

PDF

Open Access

More filters

Proceedings Article•DOI•

What's Wrong with Computational Notebooks? Pain Points, Needs, and Design Opportunities

[...]

Souti Chattopadhyay¹, Ishita Prasad², Austin Z. Henley³, Anita Sarma¹, Titus Barik² - Show less +1 more•Institutions (3)

Oregon State University¹, Microsoft², University of Tennessee³

21 Apr 2020

TL;DR: It is suggested that data scientists face numerous pain points throughout the entire workflow - from setting up notebooks to deploying to production - across many notebook environments.

...read moreread less

Abstract: Computational notebooks - such as Azure, Databricks, and Jupyter - are a popular, interactive paradigm for data scientists to author code, analyze data, and interleave visualizations, all within a single document. Nevertheless, as data scientists incorporate more of their activities into notebooks, they encounter unexpected difficulties, or pain points, that impact their productivity and disrupt their workflow. Through a systematic, mixed-methods study using semi-structured interviews (n=20) and survey (n=156) with data scientists, we catalog nine pain points when working with notebooks. Our findings suggest that data scientists face numerous pain points throughout the entire workflow - from setting up notebooks to deploying to production - across many notebook environments. Our data scientists report essential notebook requirements, such as supporting data exploration and visualization. The results of our study inform and inspire the design of computational notebooks.

...read moreread less

100 citations

Proceedings Article•DOI•

On-stack replacement, distilled

[...]

Daniele Cono D'Elia¹, Camil Demetrescu¹•Institutions (1)

Sapienza University of Rome¹

11 Jun 2018

TL;DR: A constructive and provably correct OSR framework is proposed, allowing a class of general-purpose transformation functions to yield a special-purpose replacement, and a feasibility study on debugging of optimized code is presented, showing how the techniques can be used to fix variables holding incorrect values at breakpoints due to optimizations.

...read moreread less

Abstract: On-stack replacement (OSR) is essential technology for adaptive optimization, allowing changes to code actively executing in a managed runtime. The engineering aspects of OSR are well-known among VM architects, with several implementations available to date. However, OSR is yet to be explored as a general means to transfer execution between related program versions, which can pave the road to unprecedented applications that stretch beyond VMs. We aim at filling this gap with a constructive and provably correct OSR framework, allowing a class of general-purpose transformation functions to yield a special-purpose replacement. We describe and evaluate an implementation of our technique in LLVM. As a novel application of OSR, we present a feasibility study on debugging of optimized code, showing how our techniques can be used to fix variables holding incorrect values at breakpoints due to optimizations.

...read moreread less

15 citations

Proceedings Article•DOI•

R melts brains: an IR for first-class environments and lazy effectful arguments

[...]

Olivier Flückiger¹, Guido Chari², Jan Ječmen², Ming-Ho Yee¹, Jakob Hain¹, Jan Vitek² - Show less +2 more•Institutions (2)

Northeastern University¹, Czech Technical University in Prague²

20 Oct 2019

TL;DR: This work presents PIR, an intermediate representation with explicit support for first-class environments and effectful lazy evaluation, and describes two dataflow analyses on PIR that enables reasoning about variables and their environments and infers where arguments are evaluated.

...read moreread less

Abstract: The R programming language combines a number of features considered hard to analyze and implement efficiently: dynamic typing, reflection, lazy evaluation, vectorized primitive types, first-class closures, and extensive use of native code. Additionally, variable scopes are reified at runtime as first-class environments. The combination of these features renders most static program analysis techniques impractical, and thus, compiler optimizations based on them ineffective. We present our work on PIR, an intermediate representation with explicit support for first-class environments and effectful lazy evaluation. We describe two dataflow analyses on PIR: the first enables reasoning about variables and their environments, and the second infers where arguments are evaluated. Leveraging their results, we show how to elide environment creation and inline functions.

...read moreread less

13 citations

Journal Article•DOI•

Faster Algorithms for Dynamic Algebraic Queries in Basic RSMs with Constant Treewidth

[...]

Krishnendu Chatterjee¹, Amir Kafshdar Goharshady¹, Prateesh Goyal², Rasmus Ibsen-Jensen³, Andreas Pavlogiannis⁴ - Show less +1 more•Institutions (4)

Institute of Science and Technology Austria¹, Massachusetts Institute of Technology², University of Liverpool³, Aarhus University⁴

13 Nov 2019-ACM Transactions on Programming Languages and Systems

TL;DR: This work considers possible multiple queries as required in many applications such as in alias analysis, and proposes simple and implementable algorithms that support multiple queries for algebraic path properties for RSMs that have constant treewidth.

...read moreread less

Abstract: Interprocedural analysis is at the heart of numerous applications in programming languages, such as alias analysis, constant propagation, and so on. Recursive state machines (RSMs) are standard models for interprocedural analysis. We consider a general framework with RSMs where the transitions are labeled from a semiring and path properties are algebraic with semiring operations. RSMs with algebraic path properties can model interprocedural dataflow analysis problems, the shortest path problem, the most probable path problem, and so on. The traditional algorithms for interprocedural analysis focus on path properties where the starting point is fixed as the entry point of a specific method. In this work, we consider possible multiple queries as required in many applications such as in alias analysis. The study of multiple queries allows us to bring in an important algorithmic distinction between the resource usage of the one-time preprocessing vs for each individual query. The second aspect we consider is that the control flow graphs for most programs have constant treewidth. Our main contributions are simple and implementable algorithms that support multiple queries for algebraic path properties for RSMs that have constant treewidth. Our theoretical results show that our algorithms have small additional one-time preprocessing but can answer subsequent queries significantly faster as compared to the current algorithmic solutions for interprocedural dataflow analysis. We have also implemented our algorithms and evaluated their performance for performing on-demand interprocedural dataflow analysis on various domains, such as for live variable analysis and reaching definitions, on a standard benchmark set. Our experimental results align with our theoretical statements and show that after a lightweight preprocessing, on-demand queries are answered much faster than the standard existing algorithmic approaches.

...read moreread less

12 citations

Journal Article•DOI•

Contextual dispatch for function specialization

[...]

Olivier Flückiger¹, Guido Chari, Ming-Ho Yee¹, Jan Ječmen², Jakob Hain¹, Jan Vitek² - Show less +2 more•Institutions (2)

Northeastern University¹, Czech Technical University in Prague²

13 Nov 2020

TL;DR: This paper proposes an approach to further the specialization of dynamic language compilers, by disentangling classes of behaviors into separate optimization units, and describes a compiler for the R language which uses this approach.

...read moreread less

Abstract: In order to generate efficient code, dynamic language compilers often need information, such as dynamic types, not readily available in the program source. Leveraging a mixture of static and dynamic information, these compilers speculate on the missing information. Within one compilation unit, they specialize the generated code to the previously observed behaviors, betting that past is prologue. When speculation fails, the execution must jump back to unoptimized code. In this paper, we propose an approach to further the specialization, by disentangling classes of behaviors into separate optimization units. With contextual dispatch, functions are versioned and each version is compiled under different assumptions. When a function is invoked, the implementation dispatches to a version optimized under assumptions matching the dynamic context of the call. As a proof-of-concept, we describe a compiler for the R language which uses this approach. Our implementation is, on average, 1.7× faster than the GNU R reference implementation. We evaluate contextual dispatch on a set of benchmarks and measure additional speedup, on top of traditional speculation with deoptimization techniques. In this setting contextual dispatch improves the performance of 18 out of 46 programs in our benchmark suite.

...read moreread less

8 citations

1
2
3
4
…
5
6
7
8

Collapse