scispace - formally typeset
S

Shijie Zhou

Researcher at University of Southern California

Publications -  30
Citations -  644

Shijie Zhou is an academic researcher from University of Southern California. The author has contributed to research in topics: Network packet & Speedup. The author has an hindex of 14, co-authored 29 publications receiving 506 citations. Previous affiliations of Shijie Zhou include Microsoft.

Papers
More filters
Proceedings ArticleDOI

High-Throughput and Energy-Efficient Graph Processing on FPGA

TL;DR: The design uses large external memory for storing massive graph data and FPGA for acceleration, and leverages edge-centric computing principles to accelerate several classic graph algorithms, including single-source shortest path, weakly connected component, and minimum spanning tree.
Journal ArticleDOI

HitGraph: High-throughput Graph Processing Framework on FPGA

TL;DR: HitGraph takes in an edge-centric graph algorithm and hardware resource constraints, determines design parameters and then generates a Register Transfer Level (RTL) FPGA design that makes accelerator design for various graph analytics transparent and user-friendly by masking internal details of the accelerator design process.
Proceedings ArticleDOI

High-performance packet classification on GPU

TL;DR: A high-performance packet classifier on GPU is presented, which can achieve the throughput of 85 million packets per second and the average processing latency of 4.9 μs per packet.
Proceedings ArticleDOI

Accelerating Graph Analytics on CPU-FPGA Heterogeneous Platform

TL;DR: This paper introduces the notion of active vertex ratio, based on which a simple but efficient paradigm selection approach is developed and a hybrid data structure is developed to concurrently support vertex-centric and edge-centric paradigms and proposes a graph partitioning scheme.
Proceedings ArticleDOI

Optimizing memory performance for FPGA implementation of pagerank

TL;DR: This paper presents an FPGA implementation of the classic PageRank algorithm, and dramatically reduces the number of random memory accesses and improves the execution time by at least 70%.