Bharath Ramesh
Researcher at Ohio State University
Publications - 7
Citations - 25
Bharath Ramesh is an academic researcher at Ohio State University. The author has contributed to research on the topics Message Passing Interface and InfiniBand. The author has an h-index of 2 and has co-authored 7 publications receiving 12 citations.
Papers
Proceedings Article
Designing a Profiling and Visualization Tool for Scalable and In-depth Analysis of High-Performance GPU Clusters
Pouya Kousha, Bharath Ramesh, Kaushik Kandadi Suresh, Ching-Hsiang Chu, Arpan Jain, Nick Sarkauskas, Hari Subramoni, Dhabaleswar K. Panda
TL;DR: This paper proposes and designs a real-time, in-depth analysis, profiling, and visualization tool for high-performance GPU-enabled clusters with NVLinks. It is the first such tool capable of presenting a unified and holistic view of MPI-level and fabric-level information for emerging NVLink-enabled high-performance GPU clusters.
Book Chapter
Communication-Aware Hardware-Assisted MPI Overlap Engine
Mohammadreza Bayatpour, Jahanzeb Maqbool Hashmi, Sourav Chakraborty, Kaushik Kandadi Suresh, Seyedeh Mahdieh Ghazimirsaeed, Bharath Ramesh, Hari Subramoni, Dhabaleswar K. Panda
TL;DR: This paper designs a communication-aware overlap engine for MPI that uses novel hardware-assisted and software-based solutions to extract overlap for both expected and unexpected messages.
Proceedings Article
Machine-agnostic and Communication-aware Designs for MPI on Emerging Architectures
Jahanzeb Maqbool Hashmi, Shulei Xu, Bharath Ramesh, Mohammadreza Bayatpour, Hari Subramoni, Dhabaleswar K. Panda
TL;DR: This paper proposes a set of low-level benchmarking-based approaches and MPI-level designs to infer vendor-specific machine characteristics (e.g., physical to virtual machine topologies) and the dynamic communication patterns of applications.
Proceedings Article
Leveraging Network-level parallelism with Multiple Process-Endpoints for MPI Broadcast
Amit Ruhela, Bharath Ramesh, Sourav Chakraborty, Hari Subramoni, Jahanzeb Maqbool Hashmi, Dhabaleswar K. Panda
TL;DR: A Scalable Multi-Endpoint broadcast algorithm that combines hierarchical communication with multiple endpoints per node for high performance and scalability is proposed and evaluated against state-of-the-art designs in other MPI libraries.
Proceedings Article
Performance Characterization of Network Mechanisms for Non-Contiguous Data Transfers in MPI
Kaushik Kandadi Suresh, Bharath Ramesh, Seyedeh Mahdieh Ghazimirsaeed, Mohammadreza Bayatpour, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. Panda
TL;DR: These evaluations show why MPI runtimes may not meet the expectations placed on derived datatypes (DDT), and when DDT-based implementations should be used.