Proceedings ArticleDOI

Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems

TLDR
This paper comprehensively evaluates several representative cache partitioning schemes with different optimization objectives, including performance, fairness, and quality of service (QoS), and provides new insights into dynamic behaviors and interaction effects.
Abstract
Cache partitioning and sharing is critical to the effective utilization of multicore processors. However, almost all existing studies have been evaluated by simulation that often has several limitations, such as excessive simulation time, absence of OS activities and proneness to simulation inaccuracy. To address these issues, we have taken an efficient software approach to supporting both static and dynamic cache partitioning in OS through memory address mapping. We have comprehensively evaluated several representative cache partitioning schemes with different optimization objectives, including performance, fairness, and quality of service (QoS). Our software approach makes it possible to run the SPEC CPU2006 benchmark suite to completion. Besides confirming important conclusions from previous work, we are able to gain several insights from whole-program executions, which are infeasible from simulation. For example, giving up some cache space in one program to help another one may improve the performance of both programs for certain workloads due to reduced contention for memory bandwidth. Our evaluation of previously proposed fairness metrics is also significantly different from a simulation-based study. The contributions of this study are threefold. (1) To the best of our knowledge, this is a highly comprehensive execution- and measurement-based study on multicore cache partitioning. This paper not only confirms important conclusions from simulation-based studies, but also provides new insights into dynamic behaviors and interaction effects. (2) Our approach provides a unique and efficient option for evaluating multicore cache partitioning. The implemented software layer can be used as a tool in multicore performance evaluation and hardware design. (3) The proposed schemes can be further refined for OS kernels to improve performance.
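The OS-level "memory address mapping" approach described in the abstract is commonly realized as page coloring: the OS restricts which physical pages a core may use so that its data maps to a disjoint slice of cache sets. A minimal sketch, assuming hypothetical hardware parameters (a 4 MiB 16-way shared L2, 64 B lines, 4 KiB pages), not the paper's exact implementation:

```python
# Page-coloring sketch: partition a physically indexed shared cache by
# constraining physical page allocation. Parameters are assumptions.
CACHE_SIZE = 4 * 1024 * 1024   # 4 MiB shared cache
ASSOC = 16                     # 16-way set associative
PAGE = 4096                    # 4 KiB pages

# Number of distinct page colors = (cache size / associativity) / page size.
NUM_COLORS = (CACHE_SIZE // ASSOC) // PAGE   # 64 colors with these numbers

def page_color(phys_page_number: int) -> int:
    """Color of a physical page: which slice of cache sets it maps to."""
    return phys_page_number % NUM_COLORS

def pick_page(free_pages, allowed_colors):
    """Allocate a free physical page whose color belongs to the core's
    assigned color set, confining that core's data to its cache slice."""
    for p in free_pages:
        if page_color(p) in allowed_colors:
            return p
    return None  # no page of an allowed color; partitioning is best-effort
```

Dynamic repartitioning then amounts to changing a process's allowed color set at runtime and migrating (re-copying) its pages to pages of the new colors, which is why a software approach can support both static and dynamic schemes.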



Citations
Posted Content

Intel SGX Explained.

TL;DR: In this article, the authors present a detailed and structured account of the publicly available information on SGX, together with a series of intelligent guesses about some important but undocumented aspects of SGX.
Proceedings ArticleDOI

Q-clouds: managing performance interference effects for QoS-aware clouds

TL;DR: Q-Clouds, a QoS-aware control framework that tunes resource allocations to mitigate performance interference effects, is developed; it uses online feedback to build a multi-input multi-output (MIMO) model that captures performance interference interactions, and uses that model to perform closed-loop resource management.
Proceedings ArticleDOI

Bubble-Up: increasing utilization in modern warehouse scale computers via sensible co-locations

TL;DR: Bubble-Up is presented, a characterization methodology that enables accurate prediction of the performance degradation that results from contention for shared resources in the memory subsystem; it can predict the performance interference between co-located applications to within 1% to 2% of the actual performance degradation.
Journal ArticleDOI

Addressing shared resource contention in multicore processors via scheduling

TL;DR: This study is the first to provide a comprehensive analysis of contention-mitigating techniques that use only scheduling, and finds a classification scheme that addresses not only contention for cache space, but contention for other shared resources, such as the memory controller, memory bus and prefetching hardware.
Proceedings Article

Sanctum: Minimal Hardware Extensions for Strong Software Isolation

TL;DR: Sanctum offers the same promise as Intel’s Software Guard Extensions (SGX), namely strong provable isolation of software modules running concurrently and sharing resources, but protects against an important class of additional software attacks that infer private information from a program's memory access patterns.
References
Book

Statistical methods

Proceedings ArticleDOI

Utility-Based Cache Partitioning: A Low-Overhead, High-Performance, Runtime Mechanism to Partition Shared Caches

TL;DR: In this article, the authors propose a low-overhead, runtime mechanism that partitions a shared cache between multiple applications depending on the reduction in cache misses that each application is likely to obtain for a given amount of cache resources.
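The core idea of utility-based partitioning can be sketched as a greedy marginal-utility loop: repeatedly give the next cache way to whichever application would save the most misses with it. This is an illustrative software sketch with hypothetical miss curves, not the hardware mechanism (utility monitors) from the cited paper:

```python
# Greedy way allocation by marginal utility (sketch).
# miss_curves[i][w] = misses of application i when given w ways
# (index 0..total_ways, monotonically non-increasing; made-up numbers).

def partition_ways(miss_curves, total_ways):
    n = len(miss_curves)
    alloc = [1] * n                      # every application gets one way
    for _ in range(total_ways - n):      # hand out the remaining ways
        # Marginal gain for each app: misses saved by one more way.
        gains = [miss_curves[i][alloc[i]] - miss_curves[i][alloc[i] + 1]
                 for i in range(n)]
        best = max(range(n), key=gains.__getitem__)
        alloc[best] += 1
    return alloc
```

With a streaming application (flat miss curve) sharing the cache with a cache-friendly one (steep miss curve), this loop naturally gives most ways to the application that benefits, which is the behavior the utility-based scheme targets.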
Proceedings ArticleDOI

Fair Cache Sharing and Partitioning in a Chip Multiprocessor Architecture

TL;DR: It is found that optimizing fairness usually increases throughput, while maximizing throughput does not necessarily improve fairness; two algorithms that optimize fairness are proposed.
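Fairness metrics in this line of work typically compare each co-runner's slowdown relative to running alone. A minimal sketch of one such metric (the exact formulation is an assumption for illustration, not necessarily the metric used in the cited paper):

```python
# Fairness as the ratio of the smallest to the largest slowdown among
# co-running applications: 1.0 means all apps are slowed down equally.

def fairness(alone_ipc, shared_ipc):
    """alone_ipc[i] / shared_ipc[i] is app i's slowdown under sharing."""
    slowdowns = [a / s for a, s in zip(alone_ipc, shared_ipc)]
    return min(slowdowns) / max(slowdowns)
```

For example, two applications that each run at half their solo IPC are treated as perfectly fair (metric 1.0), even though both are slowed down.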
Journal ArticleDOI

Dynamic Partitioning of Shared Cache Memory

TL;DR: The results show that smart cache management and scheduling are essential to achieving high performance with shared cache memory and can improve total IPC significantly over the standard least recently used (LRU) replacement policy.
Proceedings ArticleDOI

Managing Distributed, Shared L2 Caches through OS-Level Page Allocation

TL;DR: This paper presents and studies a distributed L2 cache management approach through OS-level page allocation for future many-core processors that can provide a differentiated execution environment to running programs by dynamically controlling data placement and cache-sharing degrees.