HieIM: highly flexible in-memory computing using STT MRAM

doi:10.5555/3201607.3201699

Open AccessProceedings ArticleDOI

HieIM: highly flexible in-memory computing using STT MRAM

- pp 361-366

TLDR

A Highly Flexible InMemory (HieIM) computing platform using STT MRAM, which can be leveraged to implement Boolean logic functions without sacrificing memory functionality, thus overcoming the ‘operand locality’ problem in contemporary in-memory computing platform designs is proposed.

Abstract:

In this paper we propose a Highly Flexible In-Memory (HieIM) computing platform using STT MRAM, which can be leveraged to implement Boolean logic functions without sacrificing memory functionality. It could pre-process data within memory to further reduce power hungry long distance communication between memory and processing units as in Von-Neumann computing system. HieIM can implement all the Boolean logic functions (AND/NAND, OR/NOR, XOR/XNOR) between any two cells in the same memory array, thus overcoming the 'operand locality' problem in contemporary in-memory computing platform designs. To investigate the performance of HieIM, we test in-memory bulk bit-wise Boolean logic operations using different vector datasets, which shows ∼ 8× energy saving and ∼ 5× speedup compared to recent DRAM based in-memory computing platform. We further implement an in-memory data encryption engine design based on HieIM as another case study. With AES algorithm, it shows 51.5% and 68.9% lower energy consumption compared to CMOS-ASIC and CMOL based implementations, respectively.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

C3SRAM: An In-Memory-Computing SRAM Macro Based on Robust Capacitive Coupling Computing Mechanism

Zhewei Jiang, +3 more

- 18 May 2020 -

IEEE Journal of Solid-state Circuits

TL;DR: The macro is an SRAM module with the circuits embedded in bitcells and peripherals to perform hardware acceleration for neural networks with binarized weights and activations and utilizes analog-mixed-signal capacitive-coupling computing to evaluate the main computations of binary neural networks, binary-multiply-and-accumulate operations.

...read moreread less

Journal ArticleDOI

MRIMA: An MRAM-Based In-Memory Accelerator

Shaahin Angizi, +3 more

- 01 May 2020 -

IEEE Transactions on Computer-Aided Desi...

TL;DR: This paper presents practical case studies to demonstrate MRIMA’s acceleration for binary-weight and low bit-width convolutional neural networks (CNNs) as well as data encryption, and shows ~77% and 21% lower energy consumption compared to CMOS-ASIC and recent domain-wall-based design, respectively.

...read moreread less

Journal ArticleDOI

A Survey of Spintronic Architectures for Processing-in-Memory and Neural Networks

Sumanth Umesh, +1 more

- 01 Aug 2019 -

Journal of Systems Architecture

TL;DR: A survey of spintronic-architectures for PIM and NNs based on main attributes to underscore their similarities and differences will be useful for researchers in the area of artificial intelligence, hardware architecture, chip design and memory system.

...read moreread less

Proceedings ArticleDOI

An MRAM-Based Deep In-Memory Architecture for Deep Neural Networks

Ameya D. Patil, +4 more

TL;DR: An MRAM-based deep in-memory architecture (MRAM-DIMA) to efficiently implement multi-bit matrix vector multiplication for deep neural networks using a standard MRAM bitcell array is presented.

...read moreread less

Journal ArticleDOI

Vesti: Energy-Efficient In-Memory Computing Accelerator for Deep Neural Networks

Shihui Yin, +5 more

- 01 Jan 2020 -

IEEE Transactions on Very Large Scale In...

TL;DR: A new DNN accelerator is designed to support configurable multibit activations and large-scale DNNs seamlessly while substantially improving the chip-level energy-efficiency with favorable accuracy tradeoff compared to conventional digital ASIC.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

The Design of Rijndael: AES - The Advanced Encryption Standard

Joan Daemen, +1 more

TL;DR: The underlying mathematics and the wide trail strategy as the basic design idea are explained in detail and the basics of differential and linear cryptanalysis are reworked.

...read moreread less

Proceedings ArticleDOI

McPAT: an integrated power, area, and timing modeling framework for multicore and manycore architectures

Sheng Li, +5 more

TL;DR: Combining power, area, and timing results of McPAT with performance simulation of PARSEC benchmarks at the 22nm technology node for both common in-order and out-of-order manycore designs shows that when die cost is not taken into account clustering 8 cores together gives the best energy-delay product, whereas when cost is taking into account configuring clusters with 4 cores gives thebest EDA2P and EDAP.

...read moreread less

Journal ArticleDOI

Hitting the memory wall: implications of the obvious

William A. Wulf, +1 more

- 01 Mar 1995 -

ACM Sigarch Computer Architecture News

TL;DR: This work proposes an exact analysis, removing all remaining uncertainty, based on model checking, using abstract-interpretation results to prune down the model for scalability, and notably improves precision upon classical abstract interpretation at reasonable cost.

...read moreread less

Proceedings ArticleDOI

Architecting phase change memory as a scalable dram alternative

Benjamin C. Lee, +3 more

TL;DR: This work proposes, crafted from a fundamental understanding of PCM technology parameters, area-neutral architectural enhancements that address these limitations and make PCM competitive with DRAM.

...read moreread less

Collapse

IEEE Transactions on Computer-Aided Desi...

HieIM: highly flexible in-memory computing using STT MRAM

Citations

C3SRAM: An In-Memory-Computing SRAM Macro Based on Robust Capacitive Coupling Computing Mechanism

MRIMA: An MRAM-Based In-Memory Accelerator

A Survey of Spintronic Architectures for Processing-in-Memory and Neural Networks

An MRAM-Based Deep In-Memory Architecture for Deep Neural Networks

Vesti: Energy-Efficient In-Memory Computing Accelerator for Deep Neural Networks

References

The gem5 simulator

The Design of Rijndael: AES - The Advanced Encryption Standard

McPAT: an integrated power, area, and timing modeling framework for multicore and manycore architectures

Hitting the memory wall: implications of the obvious

Architecting phase change memory as a scalable dram alternative

Related Papers (5)

Pinatubo: a processing-in-memory architecture for bulk bitwise operations in emerging non-volatile memories

In-Memory Processing Paradigm for Bitwise Logic Operations in STT–MRAM

Low power in-memory computing based on dual-mode SOT-MRAM

PIMA-logic: a novel processing-in-memory architecture for highly flexible and energy-efficient logic computation

MRIMA: An MRAM-Based In-Memory Accelerator