scispace - formally typeset
Proceedings ArticleDOI

Cache revive: architecting volatile STT-RAM caches for enhanced performance in CMPs

Reads0
Chats0
TLDR
This work forms the relationship between retention-time and write-latency, and finds optimal retention- time for architecting an efficient cache hierarchy using STT-RAM to overcome high write latency and energy problems.
Abstract
High density, low leakage and non-volatility are the attractive features of Spin-Transfer-Torque-RAM (STT-RAM), which has made it a strong competitor against SRAM as a universal memory replacement in multi-core systems. However, STT-RAM suffers from high write latency and energy which has impeded its widespread adoption. To this end, we look at trading-off STT-RAM's non-volatility property (data-retention-time) to overcome these problems. We formulate the relationship between retention-time and write-latency, and find optimal retention-time for architecting an efficient cache hierarchy using STT-RAM. Our results show that, compared to SRAM-based design, our proposal can improve performance and energy consumption by 18% and 60%, respectively.

read more

Citations
More filters
Proceedings ArticleDOI

Evaluating STT-RAM as an energy-efficient main memory alternative

TL;DR: It is shown that an optimized, equal capacity STT-RAM main memory can provide performance comparable to DRAM main memory, with an average 60% reduction in main memory energy.
Journal ArticleDOI

In-Memory Big Data Management and Processing: A Survey

TL;DR: This survey aims to provide a thorough review of a wide range of in-memory data management and processing proposals and systems, including both data storage systems and data processing frameworks.
Proceedings ArticleDOI

OWL: cooperative thread array aware scheduling techniques for improving GPGPU performance

TL;DR: This paper presents a coordinated CTA-aware scheduling policy that utilizes four schemes to minimize the impact of long memory latencies, and indicates that the proposed mechanism can provide 33% average performance improvement compared to the commonly-employed round-robin warp scheduling policy.
Journal ArticleDOI

Computing in Memory With Spin-Transfer Torque Magnetic RAM

TL;DR: In this article, the spin-transfer torque compute-in-memory (STT-CiM) was proposed for in-memory computing with spin transfer torque magnetic RAM, which allows multiple wordlines within an array to be simultaneously enabled, allowing for directly sensing functions of the values stored in multiple rows using a single access.
Journal ArticleDOI

Scalable logging through emerging non-volatile memory

TL;DR: This paper evaluates distributed logging with logging-intensive workloads and shows that distributed logging can achieve as much as ~3x speedup over centralized logging in a modern DBMS and that passive group commit only induces minuscule overhead.
References
More filters
Proceedings ArticleDOI

Scalable high performance main memory system using phase-change memory technology

TL;DR: This paper analyzes a PCM-based hybrid main memory system using an architecture level model of PCM and proposes simple organizational and management solutions of the hybrid memory that reduces the write traffic to PCM, boosting its lifetime from 3 years to 9.7 years.
Journal ArticleDOI

NVSim: A Circuit-Level Performance, Energy, and Area Model for Emerging Nonvolatile Memory

TL;DR: NVSim is developed, a circuit-level model for NVM performance, energy, and area estimation, which supports various NVM technologies, including STT-RAM, PCRAM, ReRAM, and legacy NAND Flash and is expected to help boost architecture-level NVM-related studies.

Benchmarking modern multiprocessors

TL;DR: A methodology to design effective benchmark suites is developed and its effectiveness is demonstrated by developing and deploying a benchmark suite for evaluating multiprocessors called PARSEC, which has been adopted by many architecture groups in both research and industry.
Journal ArticleDOI

The M5 Simulator: Modeling Networked Systems

TL;DR: The M5 simulator provides features necessary for simulating networked hosts, including full-system capability, a detailed I/O subsystem, and the ability to simulate multiple networked systems deterministically.
Journal ArticleDOI

New Generation of Predictive Technology Model for Sub-45 nm Early Design Exploration

TL;DR: In this article, a new generation of predictive technology model (PTM) is developed to predict the characteristics of nanoscale CMOS, including process variations and correlations among model parameters.
Related Papers (5)