scispace - formally typeset
Journal ArticleDOI

Neurocube: a programmable digital neuromorphic architecture with high-density 3D memory

TLDR
The basic architecture of the Neurocube is presented and an analysis of the logic tier synthesized in 28nm and 15nm process technologies are presented and the performance is evaluated through the mapping of a Convolutional Neural Network and estimating the subsequent power and performance for both training and inference.
Abstract
This paper presents a programmable and scalable digital neuromorphic architecture based on 3D high-density memory integrated with logic tier for efficient neural computing. The proposed architecture consists of clusters of processing engines, connected by 2D mesh network as a processing tier, which is integrated in 3D with multiple tiers of DRAM. The PE clusters access multiple memory channels (vaults) in parallel. The operating principle, referred to as the memory centric computing, embeds specialized state-machines within the vault controllers of HMC to drive data into the PE clusters. The paper presents the basic architecture of the Neurocube and an analysis of the logic tier synthesized in 28nm and 15nm process technologies. The performance of the Neurocube is evaluated and illustrated through the mapping of a Convolutional Neural Network and estimating the subsequent power and performance for both training and inference.

read more

Citations
More filters
Proceedings ArticleDOI

Hydra: A near hybrid memory accelerator for CNN inference

TL;DR: Hydra is proposed, a near hybrid memory accelerator integrated close to the DRAM to execute inference, which achieves around 20x performance improvements over the in-DRAM processing-based state-of-the-art works while accelerating the CNN inference.
Journal ArticleDOI

Accelerating Time Series Analysis via Processing using Non-Volatile Memories

TL;DR: In this article , the authors present MATSA , the first MRAM-based Accelerator for Time Series Analysis (MATSA), which exploits magneto-resistive memory crossbars to enable energy-efficient and fast time series computation in memory while overcoming endurance issues of other nonvolatile memory technologies.
Journal ArticleDOI

A 7-nm FinFET 1.2-TB/s/mm2 3D-Stacked SRAM Module With 0.7-pJ/b Inductive Coupling Interface Using Over-SRAM Coil and Manchester-Encoded Synchronous Transceiver

TL;DR: A 0.7 pJ/bit, 8.5 Gb/s/link inductive coupling interchip wireless communication interface for a 3D-stacked static-random access memory (SRAM) has been developed in a 7-nm FinFET process.
Journal ArticleDOI

Enabling PIM-based AES encryption for online video streaming

TL;DR: Wang et al. as discussed by the authors proposed a Processing-In Memory (PIM) architecture AESPIM to offload AES encryption computation to the memory side, which significantly reduces data movement and increased memory bandwidth.
Journal ArticleDOI

IMCI: an efficient fingerprint retrieval approach based on 3D stacked memory

TL;DR: Data deduplication as a type of redundant data elimination technology can effectively reduce the impact of redundancy on storage costs, which consequently alleviates the problems and pressures caused by massive data storage, management, and backup.
References
More filters
Journal ArticleDOI

Gradient-based learning applied to document recognition

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
Journal ArticleDOI

Deep learning in neural networks

TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium, review deep supervised learning, unsupervised learning, reinforcement learning & evolutionary computation, and indirect search for short programs encoding deep and large networks.
Book

Neural Networks And Learning Machines

Simon Haykin
TL;DR: Refocused, revised and renamed to reflect the duality of neural networks and learning machines, this edition recognizes that the subject matter is richer when these topics are studied together.
Journal ArticleDOI

Cellular neural networks: theory

TL;DR: In this article, a class of information processing systems called cellular neural networks (CNNs) are proposed, which consist of a massive aggregate of regularly spaced circuit clones, called cells, which communicate with each other directly through their nearest neighbors.
Book ChapterDOI

GradientBased Learning Applied to Document Recognition

TL;DR: Various methods applied to handwritten character recognition are reviewed and compared and Convolutional Neural Networks, that are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques.
Related Papers (5)