scispace - formally typeset
V

V. Krishna Nandivada

Researcher at Indian Institute of Technology Madras

Publications -  43
Citations -  411

V. Krishna Nandivada is an academic researcher from Indian Institute of Technology Madras. The author has contributed to research in topics: Compiler & Task (computing). The author has an hindex of 11, co-authored 43 publications receiving 356 citations. Previous affiliations of V. Krishna Nandivada include University of California, Los Angeles & Purdue University.

Papers
More filters
Proceedings ArticleDOI

Chunking parallel loops in the presence of synchronization

TL;DR: A transformation framework is presented that uses a combination of transformations from past work to obtain an equivalent set of parallel loops that chunk together statements from multiple iterations while preserving the semantics of the original parallel program, thereby improving performance and scalability.
Journal ArticleDOI

A Transformation Framework for Optimizing Task-Parallel Programs

TL;DR: A transformation framework for optimizing task-parallel programs with a focus on task creation and task termination operations is introduced, and three different but interrelated optimizations are covered: finish-elimination, forall-coarsening, and loop-chunking.
Proceedings ArticleDOI

Fault localization for data-centric programs

TL;DR: An automated technique for localizing faults in data-centric programs using a novel, precise slicing algorithm to break the execution trace into multiple slices, such that each slice maps to an entry in the output collection.
Proceedings ArticleDOI

Reducing task creation and termination overhead in explicitly parallel programs

TL;DR: A transformation framework to optimize task-parallel programs with finish, forall and next statements is introduced and a significant improvement obtained by the transformation framework underscores the importance of the compiler optimizations introduced in this paper.
Proceedings ArticleDOI

Efficient, portable implementation of asynchronous multi-place programs

TL;DR: The design, implementation and evaluation of a compiler and runtime system for Flat X10, designed to permit efficient execution across multiple single-threaded places with a simple runtime and without compromising on the productivity of X10 are presented.