Xuehai Zhou

Researcher at University of Science and Technology of China

Publications - 263

Citations - 3034

Xuehai Zhou is an academic researcher from University of Science and Technology of China. The author has contributed to research in topics: Speedup & Field-programmable gate array. The author has an hindex of 24, co-authored 244 publications receiving 2447 citations.

Papers

PDF

Open Access

More filters

Proceedings ArticleDOI

PuDianNao: A Polyvalent Machine Learning Accelerator

Liu Daofu, +8 more

TL;DR: An ML accelerator called PuDianNao is presented, which accommodates seven representative ML techniques, including k-means, k-nearest neighbors, naive bayes, support vector machine, linear regression, classification tree, and deep neural network, and can perform up to 1056 GOP/s, and consumes 596 mW only.

...read moreread less

Journal ArticleDOI

DLAU: A Scalable Deep Learning Accelerator Unit on FPGA

Chao Wang, +5 more

- 01 Mar 2017 -

IEEE Transactions on Computer-Aided Desi...

TL;DR: This paper designs deep learning accelerator unit (DLAU), which is a scalable accelerator architecture for large-scale deep learning networks using field-programmable gate array (FPGA) as the hardware prototype and employs three pipelined processing units to improve the throughput.

...read moreread less

Proceedings ArticleDOI

Cambricon-s: addressing irregularity in sparse neural networks through a cooperative software/hardware approach

Zhou Xuda, +9 more

TL;DR: A software-based coarse-grained pruning technique, together with local quantization, significantly reduces the size of indexes and improves the network compression ratio and a hardware accelerator is designed to address the remaining irregularity of sparse synapses and neurons efficiently.

...read moreread less

Journal ArticleDOI

MALOC: A Fully Pipelined FPGA Accelerator for Convolutional Neural Networks With All Layers Mapped on Chip

Lei Gong, +4 more

- 18 Jul 2018 -

IEEE Transactions on Computer-Aided Desi...

TL;DR: A new architecture for FPGA-based CNN accelerator that maps all the layers to their own on-chip units and working concurrently as a pipeline is proposed, which can achieve maximum resource utilization as well as optimal computational efficiency.

...read moreread less

Journal ArticleDOI

WGAN-Based Synthetic Minority Over-Sampling Technique: Improving Semantic Fine-Grained Classification for Lung Nodules in CT Images

Qingfeng Wang, +8 more

- 30 Jan 2019 -

IEEE Access

TL;DR: The experimental results suggest that the WGAN-based oversampling technique can synthesize helpful samples for the minority classes to assist the training of the CNN model and to boost the fine-grained classification performance better than the conventional data augmentation method and the two schemes of the GAN and DCGAN techniques do.

...read moreread less

Collapse