scispace - formally typeset
Z

Zeke Wang

Researcher at Zhejiang University

Publications -  45
Citations -  726

Zeke Wang is an academic researcher from Zhejiang University. The author has contributed to research in topics: Speedup & Computer science. The author has an hindex of 14, co-authored 36 publications receiving 535 citations. Previous affiliations of Zeke Wang include ETH Zurich & Northeastern University (China).

Papers
More filters
Proceedings ArticleDOI

A performance analysis framework for optimizing OpenCL applications on FPGAs

TL;DR: This paper presents an FPGA-based performance analysis framework that can shed light on the performance bottleneck and thus guide the code tuning for OpenCL applications on FPGAs, and demonstrates that its analytical performance model can accurately predict the performance of OpenCL programs with different optimization combinations onFPGAs.
Proceedings ArticleDOI

StRoM: smart remote memory

TL;DR: StRoM is presented, a programmable, FPGA-based RoCE v2 NIC supporting the offloading of application level kernels that bypasses the CPU entirely and extends the semantics of RDMA to enable multi-step data access operations and in-network processing ofRDMA streams.
Journal ArticleDOI

A Combined SDC-SDF Architecture for Normal I/O Pipelined Radix-2 FFT

TL;DR: An efficient combined single-path delay commutator-feedback (SDC-SDF) radix-2 pipelined fast Fourier transform architecture, which includes log2 N - 1 SDC stages, and 1 SDF stage, is presented.
Proceedings ArticleDOI

A study of data partitioning on OpenCL-based FPGAs

TL;DR: This paper investigates the performance of relational database applications on OpenCL-based FPGAs, and develops a simple yet efficient multi-kernel approach to leverage two emerging features of Altera OpenCL SDK, namely task kernel and channel.
Proceedings Article

G-NET: Effective GPU Sharing in NFV Systems

TL;DR: G-NET is proposed, an NFV system with a GPU virtualization scheme that supports spatial GPU sharing, a service chain based GPU scheduler, and a scheme to guarantee data isolation in the GPU, and an abstraction for building efficient network functions on G-NET, which significantly reduces development efforts.