scispace - formally typeset
C

Cong Xu

Researcher at Hewlett-Packard

Publications -  55
Citations -  5895

Cong Xu is an academic researcher from Hewlett-Packard. The author has contributed to research in topics: Resistive random-access memory & Non-volatile memory. The author has an hindex of 24, co-authored 54 publications receiving 4746 citations. Previous affiliations of Cong Xu include Pennsylvania State University & University of California, Santa Barbara.

Papers
More filters
Journal ArticleDOI

PRIME: a novel processing-in-memory architecture for neural network computation in ReRAM-based main memory

TL;DR: This work proposes a novel PIM architecture, called PRIME, to accelerate NN applications in ReRAM based main memory, and distinguishes itself from prior work on NN acceleration, with significant performance improvement and energy saving.
Journal ArticleDOI

NVSim: A Circuit-Level Performance, Energy, and Area Model for Emerging Nonvolatile Memory

TL;DR: NVSim is developed, a circuit-level model for NVM performance, energy, and area estimation, which supports various NVM technologies, including STT-RAM, PCRAM, ReRAM, and legacy NAND Flash and is expected to help boost architecture-level NVM-related studies.
Proceedings Article

TernGrad: ternary gradients to reduce communication in distributed deep learning

TL;DR: This work mathematically proves the convergence of TernGrad under the assumption of a bound on gradients, and proposes layer-wise ternarizing and gradient clipping to improve its convergence.
Posted Content

TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning

TL;DR: TernGrad as discussed by the authors uses ternary gradients to accelerate distributed deep learning in data parallelism, which can reduce the communication cost of synchronizing gradients and parameters by ternarizing and gradient clipping.
Proceedings ArticleDOI

Pinatubo: a processing-in-memory architecture for bulk bitwise operations in emerging non-volatile memories

TL;DR: This work proposes Pinatubo, a Processing In Non-volatile memory ArchiTecture for bUlk Bitwise Operations, which redesigns the read circuitry so that it can compute the bitwise logic of two or more memory rows very efficiently, and support one-step multi-row operations.