M
Maciej Besta
Researcher at ETH Zurich
Publications - 87
Citations - 2481
Maciej Besta is an academic researcher from ETH Zurich. The author has contributed to research in topics: Computer science & Graph (abstract data type). The author has an hindex of 26, co-authored 75 publications receiving 1710 citations. Previous affiliations of Maciej Besta include University of Zurich & Helsinki Institute of Physics.
Papers
More filters
Proceedings ArticleDOI
Slim fly: a cost effective low-diameter network topology
Maciej Besta,Torsten Hoefler +1 more
TL;DR: Slim Fly as mentioned in this paper is based on graphs that approximate the solution to the degree-diameter problem, which has significant advantages over other topologies in latency, bandwidth, resiliency, cost, and power consumption.
Proceedings ArticleDOI
To Push or To Pull: On Reducing Communication and Synchronization in Graph Computations
TL;DR: In this paper, the authors investigate the applicability of push-pull dichotomy to various algorithms and its impact on complexity, performance, and the amount of used locks, atomics, and reads/writes.
Proceedings ArticleDOI
Enabling highly-scalable remote memory access programming with MPI-3 one sided
TL;DR: In this paper, the authors develop scalable bufferless protocols that implement the MPI-3.0 specification, which support scaling to millions of cores with negligible memory consumption while providing highest performance and minimal overheads.
Journal ArticleDOI
Transformations of High-Level Synthesis Codes for High-Performance Computing
TL;DR: A collection of optimizing transformations for HLS, targeting scalable and efficient architectures for high-performance computing (HPC) applications, is presented, aiming to establish a common toolbox to guide both performance engineers and compiler engineers in tapping into the performance potential offered by spatial computing architectures using HLS.
Proceedings ArticleDOI
Evaluating the Cost of Atomic Operations on Modern Architectures
TL;DR: In this paper, the authors investigate the performance tradeoffs between compare-and-swap (CAS) operations and various characteristics of such systems, such as the structure of caches, and present a set of detailed benchmarks for latency and bandwidth of different atomics.