scispace - formally typeset
J

Jiazhen Lin

Researcher at Beihang University

Publications -  5
Citations -  398

Jiazhen Lin is an academic researcher from Beihang University. The author has contributed to research in topics: Computer science & Quantization (signal processing). The author has an hindex of 2, co-authored 2 publications receiving 176 citations.

Papers
More filters
Proceedings ArticleDOI

Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks

TL;DR: Differentiable soft quantization (DSQ) as mentioned in this paper is proposed to bridge the gap between the full-precision and low-bit networks, which can automatically evolve during training to gradually approximate the standard quantization.
Posted Content

Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks

TL;DR: Differentiable Soft Quantization (DSQ) is proposed to bridge the gap between the full-precision and low-bit networks and can help pursue the accurate gradients in backward propagation, and reduce the quantization loss in forward process with an appropriate clipping range.
Proceedings ArticleDOI

Fleche: an efficient GPU embedding cache for personalized recommendations

TL;DR: Fleche, a holistic cache scheme with detailed designs for efficient GPU-resident embedding caching, uses one cache backend for all embedding tables to improve the total cache utilization, and merges small kernel calls into one unitary call to reduce the overhead of kernel maintenance.
Proceedings Article

AlNiCo: SmartNIC-accelerated Contention-aware Request Scheduling for Transaction Processing

TL;DR: AlNiCo is proposed, which leverages SmartNICs to intelligently schedule incoming transaction requests to CPU cores, minimizing inter-transaction contention with low latency and co-designs hardware and software to enable adaptive and adaptive scheduling.