scispace - formally typeset
Open AccessProceedings ArticleDOI

Design methodology for a tightly coupled VLIW/reconfigurable matrix architecture: a case study

TLDR
A C-based design flow using an MPEG-2 decoder as a design example shows that the methodology and architecture can deliver a competitive package in terms of design efforts and performance over other programmable architectures.
Abstract
Coarse-grained reconfigurable architectures have seen growing importance recently. Design tools and methodology are essential to their success. Based on our previous work on modulo scheduling algorithms and a novel architecture with tightly coupled VLIW/reconfigurable matrix, we present a C-based design flow using an MPEG-2 decoder as a design example. The application is mapped to the architecture in less than one person-week starting from a software implementation. The kernel and overall speedup over the reference VLIW are 4.84 and 3.05 respectively. The case study shows that our methodology and architecture can deliver a competitive package in terms of design efforts and performance over other programmable architectures.

read more

Citations
More filters
Proceedings ArticleDOI

A unified hardware/software runtime environment for FPGA-based reconfigurable computers using BORPH

TL;DR: This paper presents a hw/sw codesign methodology based on BORPH, an operating system designed for FPGA-based reconfigurable computers (RC's).
Journal ArticleDOI

An overview of reconfigurable hardware in embedded systems

TL;DR: An overview of reconfigurable computing in embedded systems, in terms of benefits it can provide, how it has already been used, design issues, and hurdles that have slowed its adoption are presented.
Journal ArticleDOI

A unified hardware/software runtime environment for FPGA-based reconfigurable computers using BORPH

TL;DR: This paper explores the design and implementation of BORPH, an operating system designed for FPGA-based reconfigurable computers, and a Simulink-based design flow that integrates with BOR PH is employed.
Proceedings ArticleDOI

EPIMap: using epimorphism to map applications on CGRAs

TL;DR: Experimental results on 14 important kernels extracted from well known benchmark programs show that using EPIMap can improve the performance of the kernels on CGRA by more than 2.8X on average, as compared to one of the best existing mapping algorithm, EMS.
Proceedings ArticleDOI

Custom implementation of the coarse-grained reconfigurable ADRES architecture for multimedia purposes

TL;DR: The purpose of this study is to use the coarse-grained architecture for H264/AVC in order to determine at the physical level whether reconfigurable computing, high-performance and low-power can be obtained.
References
More filters
Proceedings ArticleDOI

MediaBench: a tool for evaluating and synthesizing multimedia and communications systems

TL;DR: The MediaBench benchmark suite as discussed by the authors is a benchmark suite that has been designed to fill the gap between the compiler community and embedded applications developers, which has been constructed through a three-step process: intuition and market driven initial selection, experimental measurement, and integration with system synthesis algorithms to establish usefulness.
Journal ArticleDOI

MorphoSys: an integrated reconfigurable system for data-parallel and computation-intensive applications

TL;DR: The MorphoSys architecture is described, including the reconfigurable processor array, the control processor, and data and configuration memories, and the suitability of MorphoSy for the target application domain is illustrated with examples such as video compression, data encryption and target recognition.
Book ChapterDOI

ADRES: An Architecture with Tightly Coupled VLIW Processor and Coarse-Grained Reconfigurable Matrix

TL;DR: A novel architecture with tightly coupled very long instruction word (VLIW) processor and coarse-grained reconfigurable matrix is proposed, which has good performance and is very compiler-friendly.
Journal ArticleDOI

Effective compiler support for predicated execution using the hyperblock

TL;DR: In this paper, a new structure, referred to as the hyperblock, is proposed to combine speculative execution with predicated execution for both compile-time optimization and scheduling of conditional branches.
Book ChapterDOI

RaPiD - Reconfigurable Pipelined Datapath

TL;DR: RaPiD is presented, a new coarse-grained FPGA architecture that is optimized for highly repetitive, computation-intensive tasks that make much more efficient use of silicon than traditional FPGAs and also yield much higher performance for a wide range of applications.
Related Papers (5)