Proceedings ArticleDOI

Compiling for niceness: mitigating contention for QoS in warehouse scale computers

TLDR
QoS-Compile is presented, the first compilation approach that statically manipulates application contentiousness to enable the co-location of applications with varying QoS requirements, and as a result, can greatly improve machine utilization.
Abstract
As the class of datacenters recently coined as warehouse scale computers (WSCs) continues to leverage commodity multicore processors with increasing core counts, there is a growing need to consolidate various workloads on these machines to fully utilize their computation power. However, it is well known that when multiple applications are co-located on a multicore machine, contention for shared memory resources can cause severe cross-core performance interference. To ensure that the quality of service (QoS) of user-facing applications does not suffer from performance interference, WSC operators resort to disallowing co-location of latency-sensitive applications with other applications. This policy translates to low machine utilization and millions of dollars wasted in WSCs.

This paper presents QoS-Compile, the first compilation approach that statically manipulates application contentiousness to enable the co-location of applications with varying QoS requirements, and as a result, can greatly improve machine utilization. Our technique first pinpoints an application's code regions that tend to cause contention and performance interference. QoS-Compile then transforms those regions to reduce their contentious nature. In essence, to co-locate applications of different QoS priorities, our compilation technique uses pessimizing transformations to throttle down the memory access rate of the contentious regions in low-priority applications, reducing their interference with high-priority applications. Our evaluation using synthetic benchmarks, SPEC benchmarks, and large-scale Google applications shows that QoS-Compile can greatly reduce contention, improve the QoS of applications, and improve machine utilization. Our experiments show that our technique improves applications' QoS performance by 21% and machine utilization by 36% on average.
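The core idea of a pessimizing transformation can be pictured with a small sketch. This is an illustrative example only, not code from the paper: it shows a memory-streaming loop and a hypothetical "niced" variant in which the compiler has padded the loop with cheap non-memory work every few iterations, throttling the rate at which the low-priority kernel issues memory requests (the function names and the `RATE`/`PAD` parameters are invented for illustration).

```c
#include <stddef.h>
#include <stdint.h>

/* Original contentious kernel: streams through a large array,
 * saturating shared cache and memory bandwidth. */
uint64_t sum_stream(const uint64_t *a, size_t n) {
    uint64_t s = 0;
    for (size_t i = 0; i < n; i++)
        s += a[i];
    return s;
}

/* Hypothetical "niced" version: every RATE-th iteration runs a burst
 * of filler work that touches no memory, spacing out the loop's
 * memory accesses and reducing pressure on shared resources. */
#define RATE 8   /* throttle granularity (power of two) */
#define PAD  32  /* amount of non-memory filler work per burst */
uint64_t sum_stream_niced(const uint64_t *a, size_t n) {
    uint64_t s = 0;
    for (size_t i = 0; i < n; i++) {
        s += a[i];
        if ((i & (RATE - 1)) == 0) {
            /* volatile prevents the filler loop from being optimized away */
            for (volatile int k = 0; k < PAD; k++)
                ;
        }
    }
    return s;
}
```

Both versions compute the same result; the transformation trades the low-priority application's own throughput for lower memory-subsystem interference, which is the intended deal when it shares a machine with a latency-sensitive workload.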



Citations
Proceedings ArticleDOI

Bubble-flux: precise online QoS management for increased utilization in warehouse scale computers

TL;DR: Bubble-Flux is presented, an integrated dynamic interference measurement and online QoS management mechanism to provide accurate QoS control and maximize server utilization.
Proceedings ArticleDOI

Whare-map: heterogeneity in "homogeneous" warehouse-scale computers

TL;DR: This paper exposes and quantifies the performance impact of the "homogeneity assumption" for modern production WSCs using industry-strength, large-scale web-service workloads, and proposes "Whare-Map," the WSC Heterogeneity Aware Mapper, which leverages continuous profiling subsystems already in place in production environments.
Proceedings ArticleDOI

SMiTe: Precise QoS Prediction on Real-System SMT Processors to Improve Utilization in Warehouse Scale Computers

TL;DR: This paper demonstrates through a real-system investigation that the fundamental difference between resource sharing behaviors on CMP and SMT architectures calls for a redesign of the way the authors model interference, and proposes SMiTe, a methodology that enables precise performance prediction for SMT co-location on real-system commodity processors.
Proceedings ArticleDOI

Profile-guided automated software diversity

TL;DR: This work investigates the impact of profiling on an expensive diversification technique, NOP insertion, and finds that by differentiating between hot and cold code, even randomization techniques with a high performance overhead become practical.
Journal ArticleDOI

Machine Learning in Compiler Optimization

TL;DR: In the last decade, machine-learning-based compilation has moved from an obscure research niche to a mainstream activity; the main concepts of features, models, training, and deployment are introduced.
References
Journal ArticleDOI

Pin: building customized program analysis tools with dynamic instrumentation

TL;DR: The goals are to provide easy-to-use, portable, transparent, and efficient instrumentation, and to illustrate Pin's versatility, two Pintools in daily use to analyze production software are described.
Book

The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines

TL;DR: The architecture of WSCs is described, the main factors influencing their design, operation, and cost structure, and the characteristics of their software base are described.
Journal ArticleDOI

Web search for a planet: The Google cluster architecture

TL;DR: Google's architecture features clusters of more than 15,000 commodity-class PCs with fault-tolerant software that achieves superior performance at a fraction of the cost of a system built from fewer, but more expensive, high-end servers.
Proceedings ArticleDOI

Utility-Based Cache Partitioning: A Low-Overhead, High-Performance, Runtime Mechanism to Partition Shared Caches

TL;DR: In this article, the authors propose a low-overhead, runtime mechanism that partitions a shared cache between multiple applications depending on the reduction in cache misses that each application is likely to obtain for a given amount of cache resources.
Proceedings ArticleDOI

PowerNap: eliminating server idle power

TL;DR: The PowerNap concept, an energy-conservation approach where the entire system transitions rapidly between a high-performance active state and a near-zero-power idle state in response to instantaneous load, is proposed and the Redundant Array for Inexpensive Load Sharing (RAILS) is introduced.