R
Rupesh Nasre
Researcher at Indian Institute of Technology Madras
Publications - 67
Citations - 1059
Rupesh Nasre is an academic researcher from Indian Institute of Technology Madras. The author has contributed to research in topics: Speedup & Pointer analysis. The author has an hindex of 13, co-authored 60 publications receiving 917 citations. Previous affiliations of Rupesh Nasre include Indian Institute of Science & NetApp.
Papers
More filters
Proceedings ArticleDOI
A quantitative study of irregular programs on GPUs
TL;DR: This paper defines two measures of irregularity called control-flow irregularity and memory-access irregularity, and investigates, using performance-counter measurements, how irregular GPU kernels differ from regular kernels with respect to these measures.
Proceedings ArticleDOI
Data-Driven Versus Topology-driven Irregular Computations on GPUs
TL;DR: Data-driven and topology-driven implementations of six important graph algorithms on GPUs are studied to understand the tradeoffs between these implementations and how to optimize them and devise hybrid approaches that combine the two techniques and outperform both of them.
Proceedings ArticleDOI
Morph algorithms on GPUs
TL;DR: This work proposes efficient techniques to perform concurrent subgraph addition, subgraph deletion, conflict detection and several optimizations to improve the scalability of morph algorithms and provides several insights into how other morph algorithms can be efficiently implemented on GPUs.
Proceedings ArticleDOI
Atomic-free irregular computations on GPUs
TL;DR: This paper presents two high-level methods to eliminate atomics in irregular programs by exploiting algebraic properties of algorithms to elide atomics, and illustrates the generality of the two methods by applying them to five irregular graph applications.
Journal ArticleDOI
Falcon: A Graph Manipulation Language for Heterogeneous Systems
TL;DR: A domain-specific language (DSL) is proposed, Falcon, for implementing graph algorithms that abstracts the hardware, provides constructs to write explicitly parallel programs at a higher level, and can work with general algorithms that may change the graph structure.