scispace - formally typeset
E

Eiman Ebrahimi

Researcher at Nvidia

Publications -  35
Citations -  2387

Eiman Ebrahimi is an academic researcher from Nvidia. The author has contributed to research in topics: Shared memory & Cache. The author has an hindex of 19, co-authored 35 publications receiving 1953 citations. Previous affiliations of Eiman Ebrahimi include University of Tehran & University of Texas at Austin.

Papers
More filters
Journal ArticleDOI

Fairness via source throttling: a configurable and high-performance fairness substrate for multi-core memory systems

TL;DR: The technique, Fairness via Source Throttling (FST), estimates the unfairness in the entire shared memory system and throttles down cores causing unfairness, thereby eliminating the need for and complexity of developing fairness mechanisms for each individual resource.
Journal ArticleDOI

Transparent offloading and mapping (TOM): enabling programmer-transparent near-data processing in GPU systems

TL;DR: Extensive evaluations across a variety of modern memory-intensive GPU workloads show that TOM significantly improves performance compared to a baseline GPU system that cannot offload computation to 3D-stacked memories.
Proceedings ArticleDOI

Coordinated control of multiple prefetchers in multi-core systems

TL;DR: This paper proposes a low-cost mechanism to control prefetcher-caused inter-core interference by dynamically adjusting the aggressiveness of multiple cores' prefetchers in a coordinated fashion and improves system performance by 14% compared to a state-of-the-artPrefetcher aggressiveness control technique on an eight-core system.
Proceedings ArticleDOI

MCM-GPU: Multi-Chip-Module GPUs for Continued Performance Scalability

TL;DR: It is demonstrated that package-level integration of multiple GPU modules to build larger logical GPUs can enable continuous performance scaling beyond Moore's law and solve the need for higher performing GPUs in many domains.
Proceedings ArticleDOI

Prefetch-aware shared resource management for multi-core systems

TL;DR: This paper is the first to propose mechanisms that both manage the shared resources of a multi-core chip to obtain high-performance and fairness, and also exploit prefetching, and it is shown that these mechanisms improve the performance of a 4-core system that uses network fair queuing, parallelism-aware batch scheduling, and fairness via source throttling.