R
Rashid Kaleem
Researcher at University of Texas at Austin
Publications - 7
Citations - 594
Rashid Kaleem is an academic researcher from University of Texas at Austin. The author has contributed to research in topics: Data structure & Stochastic gradient descent. The author has an hindex of 6, co-authored 7 publications receiving 552 citations. Previous affiliations of Rashid Kaleem include Intel.
Papers
More filters
Journal ArticleDOI
The tao of parallelism in algorithms
Keshav Pingali,Donald Nguyen,Milind Kulkarni,Martin Burtscher,M. Amber Hassaan,Rashid Kaleem,Tsung-Hsien Lee,Andrew Lenharth,Roman Manevich,Mario Méndez-Lojo,Dimitrios Prountzos,Xin Sui +11 more
TL;DR: It is suggested that the operator formulation and tao-analysis of algorithms can be the foundation of a systematic approach to parallel programming.
Proceedings ArticleDOI
Adaptive heterogeneous scheduling for integrated GPUs
TL;DR: The asymmetric scheduling algorithm uses low-overhead online profiling to automatically partition the work of dataparallel kernels between the CPU and GPU without input from application developers, underscoring the feasibility of online profile-based heterogeneous scheduling on integrated CPU-GPU processors.
Proceedings ArticleDOI
Stochastic gradient descent on GPUs
TL;DR: This work examines several synchronization strategies for SGD, ranging from simple locking to conflict-free scheduling, and finds that the best schedule for some problems can be up to two orders of magnitude faster than the worst one.
Proceedings ArticleDOI
Efficient Mapping of Irregular C++ Applications to Integrated GPUs
Rajkishore Barik,Rashid Kaleem,Deepak Majeti,Brian T. Lewis,Tatiana Shpeisman,Chunling Hu,Yang Ni,Ali-Reza Adl-Tabatabai +7 more
TL;DR: This work presents a compiler framework with support for C++ features that enables GPU acceleration of a wide range of C++ applications with minimal changes and includes compiler optimizations to improve irregular application performance on GPUs.
Proceedings ArticleDOI
Synchronization Trade-Offs in GPU Implementations of Graph Algorithms
TL;DR: This work studied how the choice of synchronization mechanism affects the end-to-end performance of complex graph algorithms, using stochastic gradient descent (SGD) as an exemplar.