Book Chapter

Reconstructing hardware transactional memory for workload optimized systems

TL;DR: The authors argue that Hardware Transactional Memory (HTM) can be a suitable implementation choice for workload optimized systems, and that knowledge of the workload is extremely useful for making appropriate design choices in a workload optimized HTM.
Abstract
Workload optimized systems, consisting of a large number of general and special purpose cores and with support for shared memory programming, are slowly becoming prevalent. One of the major impediments to effective parallel programming on these systems is lock-based synchronization. An alternate synchronization solution, called Transactional Memory (TM), is currently being explored. We observe that most of the TM design proposals in the literature are tailored to match the constraints of general purpose computing platforms. Given that workload optimized systems utilize wider hardware design spaces and on-chip parallelism, we argue that Hardware Transactional Memory (HTM) can be a suitable implementation choice for these systems. We re-evaluate the criteria to be satisfied by an HTM and identify possible scope for relaxations in the context of workload optimized systems. Based on the relaxed criteria, we demonstrate the scope for building HTM design variants, such that each variant caters to a specific workload requirement. We carry out suitable experiments to bring out the trade-offs between the design variants. Overall, we show how knowledge of the workload is extremely useful for making appropriate design choices in a workload optimized HTM.
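The optimistic execution that the abstract contrasts with lock-based synchronization is not directly expressible in portable code (real HTMs expose it through ISA extensions), but the read-validate-retry pattern at its core can be sketched in plain Python. The `VersionedCell` class and its internal lock below are illustrative stand-ins for hardware conflict detection and atomic commit, not anything from the paper:

```python
import threading

class VersionedCell:
    """A shared cell guarded by a version counter: readers snapshot,
    writers validate and bump the version. Illustrates the optimistic
    read-validate-retry pattern underlying transactional memory."""

    def __init__(self, value=0):
        self._lock = threading.Lock()  # stand-in for the atomic commit step
        self.version = 0
        self.value = value

    def transact(self, update):
        """Apply `update(old) -> new` transactionally, retrying on conflict."""
        while True:
            # Speculative read phase: snapshot version and value.
            seen_version, seen_value = self.version, self.value
            new_value = update(seen_value)          # compute outside the lock
            with self._lock:                        # commit phase
                if self.version == seen_version:    # validate: no conflict
                    self.value = new_value
                    self.version += 1
                    return new_value
            # Validation failed: another writer committed first; retry.
```

Note that the lock covers only the validate-and-commit step; the speculative computation runs without holding it. HTM generalizes this idea to arbitrary read and write sets tracked by the hardware.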


Citations
Journal Article

Parallel Scientific Computation: A Structured Approach using BSP and MPI

TL;DR: This is the first textbook to provide a comprehensive overview of the technical aspects of building parallel programs using BSP and BSPlib; it is contemporary, well presented, and balanced between concepts and the technical depth required for developing parallel algorithms.
References
Proceedings Article

Heap data management for limited local memory (LLM) multi-core processors

TL;DR: A semi-automatic, scalable scheme for heap data management is proposed that hides this complexity in a library with a more natural programming interface; for embedded applications, where the maximum heap size can be known at compile time, optimizations to the heap management that significantly improve application performance are also proposed.
Proceedings Article

Efficient dynamic heap allocation of scratch-pad memory

TL;DR: This paper presents the Scratch-Pad Memory Allocator, a light-weight memory management algorithm specifically designed for small on-chip memories; it manages small memories efficiently and scales well under load when multiple competing cores access shared memory.
Journal Article

Compiler-directed scratchpad memory management via graph coloring

TL;DR: This article introduces a general-purpose compiler approach, called memory coloring, to assign static data aggregates, such as arrays and structs, in a program to an SPM, and shows that this methodology is capable of managing SPMs efficiently and effectively for large embedded applications.
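The memory-coloring approach summarized above builds on register-allocation-style graph coloring. As an illustration only (the greedy heuristic and names below are assumptions, not the article's algorithm), coloring an interference graph of data aggregates can be sketched as:

```python
def greedy_color(interference):
    """Greedily color an interference graph. Nodes are data aggregates
    (e.g. arrays, structs), an edge means overlapping live ranges, and
    each color corresponds to one reusable scratchpad region.
    `interference` maps node -> set of conflicting nodes."""
    colors = {}
    # Visit highest-degree nodes first, a common greedy ordering.
    for node in sorted(interference, key=lambda n: -len(interference[n])):
        taken = {colors[n] for n in interference[node] if n in colors}
        color = 0
        while color in taken:   # pick the lowest color not used by a neighbor
            color += 1
        colors[node] = color
    return colors
```

Aggregates assigned the same color never live at the same time, so they can share one scratchpad region; the compiler described in the article adds live-range splitting and SPM sizing on top of this basic idea.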
Proceedings Article

Dynamic trace selection using performance monitoring hardware sampling

TL;DR: The profiling system provides a framework for collecting information required for performing run-time optimization, and the results show that the profile and patching techniques are able to capture 58% of execution time across various SPEC2000 integer benchmarks.