scispace - formally typeset
T

Tayo Oguntebi

Researcher at Stanford University

Publications -  7
Citations -  869

Tayo Oguntebi is an academic researcher from Stanford University. The author has contributed to research in topics: Transactional memory & Memory bandwidth. The author has an hindex of 7, co-authored 7 publications receiving 825 citations.

Papers
More filters
Proceedings ArticleDOI

Accelerating CUDA graph algorithms at maximum warp

TL;DR: A novel virtual warp-centric programming method that exposes the traits of underlying GPU architectures to users and significantly improves the performance of applications with heavily imbalanced workloads, and enables trade-offs between workload imbalance and ALU underutilization for fine-tuning the performance.
Proceedings ArticleDOI

Efficient Parallel Graph Exploration on Multi-Core CPU and GPU

TL;DR: A new method for implementing the parallel BFS algorithm on multi-core CPUs which exploits a fundamental property of randomly shaped real-world graph instances and shows improved performance over the current state-of-the-art implementation and increases its advantage as the size of the graph increases.
Proceedings ArticleDOI

GraphOps: A Dataflow Library for Graph Analytics Acceleration

TL;DR: Results show that the GraphOps-based accelerators are able to operate close to the bandwidth limit of the hardware platform, the limiting constraint in graph analytics computation.
Proceedings ArticleDOI

Eigenbench: A simple exploration tool for orthogonal TM characteristics

TL;DR: EigenBench, a lightweight yet powerful microbenchmark for fully evaluating a transactional memory system, is presented and it is shown that EigenBench is useful for thoroughly exploring the orthogonal space of TM application characteristics.
Proceedings ArticleDOI

Hardware acceleration of transactional memory on commodity systems

TL;DR: It is demonstrated that hardware can substantially accelerate the performance of an STM on unmodified commodity processors, and it is shown that, for all but short transactions, it is not necessary to modify the processor to obtain substantial improvement in TM performance.