Neurocube: a programmable digital neuromorphic architecture with high-density 3D memory

doi:10.1145/3007787.3001178

Journal Article

- Vol. 44, Iss: 3, pp 380-392

TLDR

The basic architecture of the Neurocube is presented, together with an analysis of the logic tier synthesized in 28nm and 15nm process technologies. Performance is evaluated by mapping a Convolutional Neural Network onto the architecture and estimating the resulting power and performance for both training and inference.

Abstract:

This paper presents a programmable and scalable digital neuromorphic architecture based on 3D high-density memory integrated with a logic tier for efficient neural computing. The proposed architecture consists of clusters of processing engines (PEs), connected by a 2D mesh network as a processing tier, which is integrated in 3D with multiple tiers of DRAM. The PE clusters access multiple memory channels (vaults) in parallel. The operating principle, referred to as memory-centric computing, embeds specialized state machines within the vault controllers of the Hybrid Memory Cube (HMC) to drive data into the PE clusters. The paper presents the basic architecture of the Neurocube and an analysis of the logic tier synthesized in 28nm and 15nm process technologies. The performance of the Neurocube is evaluated and illustrated by mapping a Convolutional Neural Network onto it and estimating the resulting power and performance for both training and inference.
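
The memory-centric principle in the abstract (state machines in the vault controllers push operands to the PEs, rather than the PEs fetching them) can be illustrated with a toy software model. This is only a sketch under stated assumptions, not the paper's design: the names `VaultController`, `ProcessingEngine`, and `run_layer` are hypothetical, each vault is paired with a single PE, and the 2D mesh network, DRAM timing, and real parallelism are ignored.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterator, List, Tuple


@dataclass
class VaultController:
    """Hypothetical stand-in for a vault controller with an embedded state machine."""
    weights: List[List[float]]  # one row of weights per output neuron held in this vault
    inputs: List[float]         # input activations held in this vault

    def stream(self) -> Iterator[Tuple[int, float, float]]:
        """Walk the stored data in a fixed order and push (neuron, weight, input) operands."""
        for neuron, row in enumerate(self.weights):
            for w, x in zip(row, self.inputs):
                yield neuron, w, x


@dataclass
class ProcessingEngine:
    """Hypothetical PE: multiply-accumulates whatever operands arrive."""
    acc: Dict[int, float] = field(default_factory=dict)

    def consume(self, neuron: int, w: float, x: float) -> None:
        self.acc[neuron] = self.acc.get(neuron, 0.0) + w * x


def run_layer(vaults: List[VaultController]) -> List[List[float]]:
    """Drive one PE per vault; a sequential loop stands in for parallel vault access."""
    results = []
    for vault in vaults:
        pe = ProcessingEngine()
        for neuron, w, x in vault.stream():  # the memory side drives the computation
            pe.consume(neuron, w, x)
        results.append([pe.acc[n] for n in sorted(pe.acc)])
    return results


if __name__ == "__main__":
    vaults = [
        VaultController(weights=[[1.0, 2.0], [3.0, 4.0]], inputs=[1.0, 1.0]),
        VaultController(weights=[[0.5, 0.5]], inputs=[2.0, 4.0]),
    ]
    print(run_layer(vaults))
```

The point of the sketch is the direction of control: the PE has no address logic of its own and simply consumes the operand stream that the vault-side state machine produces.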

Citations

Journal Article

Crosstalk Analysis and Countermeasures of High-Bandwidth 3D-Stacked Memory Using Multi-Hop Inductive Coupling Interface

Kota Shiba, +3 more

- 01 Jul 2023 -

IEICE Transactions on Electronics

TL;DR: This article presents an in-depth analysis of crosstalk in a high-bandwidth 3D-stacked memory that uses a multi-hop inductive coupling interface.

Proceedings Article

MetaNMP: Leveraging Cartesian-Like Product to Accelerate HGNNs with Near-Memory Processing

Dan Chen, +6 more

TL;DR: MetaNMP proposes a Cartesian-like product paradigm that generates all metapath instances on the fly for heterogeneous graphs, and then designs a data flow that aggregates vertex features along the direction of the metapath instances dispersed from the starting vertex, exploiting shareable aggregation computations.

Posted Content

NullaNet Tiny: Ultra-low-latency DNN Inference Through Fixed-function Combinational Logic

Mahdi Nazemi, +5 more

- 07 Apr 2021 -

arXiv: Learning

TL;DR: In this paper, the authors present NullaNet Tiny, an across-the-stack design and optimization framework for constructing resource and energy-efficient, ultra-low-latency FPGA-based neural network accelerators.

Journal Article

A survey of architectures of neural network accelerators

怡然陈, +1 more

- 29 Mar 2022 -

Zhongguo kexue

TL;DR: This survey will introduce some architecture designs of typical accelerators, including the computing unit, data flow, the characteristics of the different neural networks to be accelerated, and design considerations on emerging platforms, etc.

Journal Article

Speeding-up neuromorphic computation for neural networks: Structure optimization approach

Heechun Park, +1 more

- 01 Jan 2022 -

Integration

TL;DR: This work proposes a new neuromorphic computing architecture that mixes dendritic-based and axonal-based neuromorphic cores so as to eliminate the inherent non-zero waiting time between neuromorphic cores, computing fully connected neural networks twice as fast as existing architectures.

References

Journal Article

Gradient-based learning applied to document recognition

Yann LeCun, +6 more

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.

Journal Article

Deep learning in neural networks

Jürgen Schmidhuber

- 01 Jan 2015 -

Neural Networks

TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium, and reviews deep supervised learning, unsupervised learning, reinforcement learning & evolutionary computation, and indirect search for short programs encoding deep and large networks.

Book

Neural Networks And Learning Machines

Simon Haykin

TL;DR: Refocused, revised and renamed to reflect the duality of neural networks and learning machines, this edition recognizes that the subject matter is richer when these topics are studied together.

Journal Article

Cellular neural networks: theory

Leon O. Chua, +1 more

- 01 Oct 1988 -

IEEE Transactions on Circuits and System...

TL;DR: In this article, a class of information processing systems called cellular neural networks (CNNs) is proposed. These systems consist of a massive aggregate of regularly spaced circuit clones, called cells, which communicate with each other directly only through their nearest neighbors.

Book Chapter

Gradient-Based Learning Applied to Document Recognition

Simon Haykin, +1 more

TL;DR: Various methods applied to handwritten character recognition are reviewed and compared and Convolutional Neural Networks, that are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques.

Related Papers (5)

DaDianNao: A Machine-Learning Supercomputer

ISAAC: a convolutional neural network accelerator with in-situ analog arithmetic in crossbars

DianNao: a small-footprint high-throughput accelerator for ubiquitous machine-learning

EIE: efficient inference engine on compressed deep neural network

In-Datacenter Performance Analysis of a Tensor Processing Unit