Cache revive: architecting volatile STT-RAM caches for enhanced performance in CMPs

doi:10.1145/2228360.2228406

Proceedings ArticleDOI

Cache revive: architecting volatile STT-RAM caches for enhanced performance in CMPs

Adwait Jog, +6 more

- pp 243-252

Chats0

TLDR

This work forms the relationship between retention-time and write-latency, and finds optimal retention- time for architecting an efficient cache hierarchy using STT-RAM to overcome high write latency and energy problems.

Abstract:

High density, low leakage and non-volatility are the attractive features of Spin-Transfer-Torque-RAM (STT-RAM), which has made it a strong competitor against SRAM as a universal memory replacement in multi-core systems. However, STT-RAM suffers from high write latency and energy which has impeded its widespread adoption. To this end, we look at trading-off STT-RAM's non-volatility property (data-retention-time) to overcome these problems. We formulate the relationship between retention-time and write-latency, and find optimal retention-time for architecting an efficient cache hierarchy using STT-RAM. Our results show that, compared to SRAM-based design, our proposal can improve performance and energy consumption by 18% and 60%, respectively.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Evaluating STT-RAM as an energy-efficient main memory alternative

Emre Kultursay, +3 more

TL;DR: It is shown that an optimized, equal capacity STT-RAM main memory can provide performance comparable to DRAM main memory, with an average 60% reduction in main memory energy.

...read moreread less

Journal ArticleDOI

In-Memory Big Data Management and Processing: A Survey

Hao Zhang, +4 more

- 01 Jul 2015 -

IEEE Transactions on Knowledge and Data ...

TL;DR: This survey aims to provide a thorough review of a wide range of in-memory data management and processing proposals and systems, including both data storage systems and data processing frameworks.

...read moreread less

Proceedings ArticleDOI

OWL: cooperative thread array aware scheduling techniques for improving GPGPU performance

Adwait Jog, +7 more

TL;DR: This paper presents a coordinated CTA-aware scheduling policy that utilizes four schemes to minimize the impact of long memory latencies, and indicates that the proposed mechanism can provide 33% average performance improvement compared to the commonly-employed round-robin warp scheduling policy.

...read moreread less

Journal ArticleDOI

Computing in Memory With Spin-Transfer Torque Magnetic RAM

Shubham Jain, +3 more

- 01 Mar 2018 -

IEEE Transactions on Very Large Scale In...

TL;DR: In this article, the spin-transfer torque compute-in-memory (STT-CiM) was proposed for in-memory computing with spin transfer torque magnetic RAM, which allows multiple wordlines within an array to be simultaneously enabled, allowing for directly sensing functions of the values stored in multiple rows using a single access.

...read moreread less

Journal ArticleDOI

Scalable logging through emerging non-volatile memory

Tianzheng Wang, +1 more

TL;DR: This paper evaluates distributed logging with logging-intensive workloads and shows that distributed logging can achieve as much as ~3x speedup over centralized logging in a modern DBMS and that passive group commit only induces minuscule overhead.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Scalable high performance main memory system using phase-change memory technology

Moinuddin K. Qureshi, +2 more

TL;DR: This paper analyzes a PCM-based hybrid main memory system using an architecture level model of PCM and proposes simple organizational and management solutions of the hybrid memory that reduces the write traffic to PCM, boosting its lifetime from 3 years to 9.7 years.

...read moreread less

Journal ArticleDOI

NVSim: A Circuit-Level Performance, Energy, and Area Model for Emerging Nonvolatile Memory

Xiangyu Dong, +3 more

- 01 Jul 2012 -

IEEE Transactions on Computer-Aided Desi...

TL;DR: NVSim is developed, a circuit-level model for NVM performance, energy, and area estimation, which supports various NVM technologies, including STT-RAM, PCRAM, ReRAM, and legacy NAND Flash and is expected to help boost architecture-level NVM-related studies.

...read moreread less

Benchmarking modern multiprocessors

Kai Li, +1 more

TL;DR: A methodology to design effective benchmark suites is developed and its effectiveness is demonstrated by developing and deploying a benchmark suite for evaluating multiprocessors called PARSEC, which has been adopted by many architecture groups in both research and industry.

...read moreread less

Journal ArticleDOI

The M5 Simulator: Modeling Networked Systems

Nathan Binkert, +5 more

- 01 Jul 2006 -

IEEE Micro

TL;DR: The M5 simulator provides features necessary for simulating networked hosts, including full-system capability, a detailed I/O subsystem, and the ability to simulate multiple networked systems deterministically.

...read moreread less

Journal ArticleDOI

New Generation of Predictive Technology Model for Sub-45 nm Early Design Exploration

Wei Zhao, +1 more

- 23 Oct 2006 -

IEEE Transactions on Electron Devices

TL;DR: In this article, a new generation of predictive technology model (PTM) is developed to predict the characteristics of nanoscale CMOS, including process variations and correlations among model parameters.

...read moreread less