scispace - formally typeset
Book ChapterDOI

Reconstructing hardware transactional memory for workload optimized systems

Reads0
Chats0
TLDR
It is argued that Hardware Transactional Memory (HTM) can be a suitable implementation choice for these systems and the knowledge about the workload is extremely useful to make appropriate design choices in the workload optimized HTM.
Abstract
Workload optimized systems consisting of large number of general and special purpose cores, and with a support for shared memory programming, are slowly becoming prevalent. One of the major impediments for effective parallel programming on these systems is lock-based synchronization. An alternate synchronization solution called Transactional Memory (TM) is currently being explored.We observe that most of the TM design proposals in literature are catered to match the constrains of general purpose computing platforms. Given the fact that workload optimized systems utilize wider hardware design spaces and on-chip parallelism, we argue that Hardware Transactional Memory (HTM) can be a suitable implementation choice for these systems. We re-evaluate the criteria to be satisfied by a HTM and identify possible scope for relaxations in the context of workload optimized systems. Based on the relaxed criteria, we demonstrate the scope for building HTM design variants, such that, each variant caters to a specific workload requirement. We carry out suitable experiments to bring about the trade-off between the design variants. Overall, we show how the knowledge about the workload is extremely useful to make appropriate design choices in the workload optimized HTM.

read more

Citations
More filters
Journal ArticleDOI

Parallel Scientific Computation: A Structured Approach using BSP and MPI

TL;DR: This is the first textbook provides a comprehensive overview of the technical aspects of building parallel programs using BSP and BSPlib, and is contemporary, well presented, and balanced between concepts and the technical depth required for developing parallel algorithms.
References
More filters
Book ChapterDOI

Texture Descriptors in MPEG-7

TL;DR: Three descriptors of texture feature of a region are presented, the homogeneous texture descriptor (HTD), the edge histogram descriptor (EHD), and the perceptual browsing descriptor (PBD), which are useful for image retrieval application.
Journal ArticleDOI

Architecture independent parallel binomial tree option price valuations

TL;DR: An architecture independent approach is introduced in describing how computations such as those involved in American or European-style option price valuations can be performed in parallel under the binomial tree model and a latency-tolerant parallel algorithm is described and analyzed.
Journal ArticleDOI

An attention-driven model for grouping similar images with image retrieval applications

TL;DR: It is demonstrated that certain shortcomings of existing content-based image retrieval solutions can be addressed by implementing a biologically motivated, unsupervised way of grouping together images whose salient regions of interest (ROIs) are perceptually similar regardless of the visual contents of other (less relevant) parts of the image.
Journal ArticleDOI

A fast radix-4 division algorithm and its architecture

TL;DR: The architecture presented for the proposed algorithm is faster than previously proposed radix-4 dividers, which require at least four digits of the partial remainder to be observed to determine quotient digits.
Proceedings ArticleDOI

Hardware design and arithmetic algorithms for a variable-precision, interval arithmetic coprocessor

TL;DR: This paper presents the hardware design and arithmetic algorithms for a coprocessor that performs variable-precision, interval arithmetic that can be implemented on a single chip with a cycle time that is comparable to IEEE double- precision floating pointCoprocessors.
Related Papers (5)