MBZip: Multiblock Data Compression
TLDR
MBZip is a synergistic mechanism that compresses multiple data blocks into one single block (called a zipped block), both at the LLC and DRAM, and improves system performance by 21.9% on average, with a maximum of 90.3%, on a 4-core system.

Abstract
Compression techniques at the last-level cache (LLC) and the DRAM play an important role in improving system performance by increasing their effective capacities. A compressed block in DRAM also reduces the transfer time over the memory bus to the caches, reducing the latency of an LLC miss. Usually, compression is achieved by exploiting data patterns present within a block. But applications can exhibit data locality that spreads across multiple consecutive data blocks. We observe that there is significant opportunity for compressing multiple consecutive data blocks into one single block, both at the LLC and DRAM. Our studies using 21 SPEC CPU applications show that, at the LLC, around 25% (on average) of the cache blocks can be compressed into one single cache block when grouped together in groups of 2 to 8 blocks. In DRAM, more than 30% of the columns residing in a single DRAM page can be compressed into one DRAM column when grouped together in groups of 2 to 6. Motivated by these observations, we propose a mechanism, namely, MBZip, that compresses multiple data blocks into one single block (called a zipped block), both at the LLC and DRAM. At the cache, MBZip includes a simple tag structure to index into these zipped cache blocks, and the indexing does not incur any redirectional delay. At the DRAM, MBZip does not need any changes to the address computation logic and works seamlessly with the conventional/existing logic. MBZip is a synergistic mechanism that coordinates these zipped blocks at the LLC and DRAM. Further, we also explore silent writes at the DRAM and show that certain writes need not access the memory when blocks are zipped. MBZip improves system performance by 21.9% on average, with a maximum of 90.3%, on a 4-core system.
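The core idea above, checking whether a group of consecutive blocks compresses into one block-sized zipped block, can be sketched in a few lines. This is a minimal illustration, not the paper's mechanism: it uses zlib as a stand-in for whatever hardware-friendly compressor MBZip would employ, and `BLOCK_SIZE` and `can_zip` are hypothetical names chosen for the example.

```python
import zlib

BLOCK_SIZE = 64  # bytes per cache block / DRAM column (a typical size; assumption)

def can_zip(blocks):
    """Return True if the concatenation of consecutive blocks compresses
    into a single block of BLOCK_SIZE bytes.

    zlib stands in here for a hardware compressor; MBZip itself would use
    a latency-friendly scheme, so this only illustrates the feasibility test.
    """
    payload = b"".join(blocks)
    return len(zlib.compress(payload)) <= BLOCK_SIZE

# Four consecutive blocks with highly regular contents (each a run of one
# byte value) easily zip into a single 64-byte block.
group = [bytes([i]) * BLOCK_SIZE for i in range(4)]
print(can_zip(group))  # → True
```

Groups with regular, redundant contents (runs, narrow values, shared base values) pass this test, which matches the abstract's observation that roughly 25% of LLC blocks and over 30% of DRAM columns are zippable in groups of 2 to 8.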
Citations
Proceedings ArticleDOI
Safecracker: Leaking Secrets through Compressed Caches
TL;DR: This paper offers the first security analysis of cache compression, a promising technique likely to appear in future processors, and finds that cache compression is insecure because the compressibility of a cache line reveals information about its contents.
Journal ArticleDOI
Optimized Lossless Embedded Compression for Mobile Multimedia Applications
TL;DR: Evaluation metrics are proposed to assess lossless embedded compression (LEC) algorithms under realistic design considerations for mobile multimedia scenarios, and an optimized LEC implementation based on these metrics is introduced for contemporary multimedia applications on mobile devices.
Proceedings ArticleDOI
CABLE: a CAche-based link encoder for bandwidth-starved manycores
TL;DR: This work presents CABLE, a novel CAche-Based Link Encoder that enables point-to-point link compression between coherent caches, re-purposing the data already stored in the caches as a massive and scalable dictionary for data compression.
Journal ArticleDOI
MemSZ: Squeezing Memory Traffic with Lossy Compression
TL;DR: MemSZ introduces a low-latency, parallel design of the Squeeze (SZ) algorithm offering aggressive compression ratios, up to 16:1 in the authors' implementation, and improves execution time, energy, and memory traffic by up to 15%, 9%, and 64%, respectively.
Journal ArticleDOI
Compacted CPU/GPU Data Compression via Modified Virtual Address Translation
Larry Seiler, Daqi Lin, Cem Yuksel, +2 more
TL;DR: A method is presented to reduce the footprint of compressed data by using modified virtual address translation to permit random access to the data; an important property of this method is that compression, decompression, and reallocation are managed automatically by the new hardware without operating system intervention.
References
Journal ArticleDOI
The gem5 simulator
Nathan Binkert, Bradford M. Beckmann, Gabriel Black, Steven K. Reinhardt, Ali G. Saidi, Arkaprava Basu, Joel Hestness, Derek R. Hower, Tushar Krishna, Somayeh Sardashti, Rathijit Sen, Korey Sewell, Muhammad Shoaib, Nilay Vaish, Mark D. Hill, David A. Wood, +15 more
TL;DR: The high level of collaboration on the gem5 project, combined with the previous success of the component parts and a liberal BSD-like license, make gem5 a valuable full-system simulation tool.
Journal ArticleDOI
SPEC CPU2006 benchmark descriptions
TL;DR: On August 24, 2006, the Standard Performance Evaluation Corporation (SPEC) announced CPU2006, which replaces CPU2000; the SPEC CPU benchmarks are widely used in both industry and academia.
Journal ArticleDOI
SPEC CPU2000: measuring CPU performance in the New Millennium
TL;DR: CPU2000 is a new CPU benchmark suite with 19 applications that have never before been in a SPEC CPU suite, spanning high-performance numeric computing, Web servers, and graphical subsystems.
Journal ArticleDOI
Symbiotic jobscheduling for a simultaneous multithreaded processor
Allan Snavely, Dean M. Tullsen, +1 more
TL;DR: It is demonstrated that performance on a hardware multithreaded processor is sensitive to the set of jobs that are coscheduled by the operating system jobscheduler, and that a small sample of the possible schedules is sufficient to identify a good schedule quickly.
Journal ArticleDOI
System-Level Performance Metrics for Multiprogram Workloads
Stijn Eyerman, Lieven Eeckhout, +1 more
TL;DR: The authors develop multiprogram performance metrics in a top-down fashion starting from system-level objectives, proposing two metrics: average normalized turnaround time, a user-oriented metric, and system throughput, a system-oriented metric.