scispace - formally typeset
Journal ArticleDOI

Heterogeneous Computing Utilizing FPGAs

Reads0
Chats0
TLDR
The results show that the LibHSA infrastructure greatly simplifies the effort integrating FPGAs and customized hardware into existing accelerator systems, runtimes and application software.
Abstract
Heterogeneous computing plays an ever-increasing role in power-efficient, high-performance embedded systems for various data processing tasks, such as computer vision. One possibility to accelerate this kind of application is the usage of FPGAs as a co-processor for standard CPUs. Although hardware design is becoming easier by utilizing High-Level-Synthesis tools, the question of interfacing FPGAs and CPUs has yet to be completely solved. The Heterogeneous System Architecture (HSA) Foundation defines and publishes architecture neutral standards for heterogeneous systems and programming models. While compatible CPU, GPU and DSP designs exist, FPGA models have not been defined yet. This paper describes the IP library LibHSA, which greatly simplifies integration of domain specific FPGA acceleration into existing HSA compliant systems. It allows FPGA based accelerators to take immediate advantage of high-level language tool chains. Including user space memory access, low-latency task dispatch and other benefits of the HSA programming model. We will demonstrate LibHSA with a programmable image processor implementation on a Xilinx FPGA. The image processor supports low-level algorithms, e.g. Sobel, Median, Laplace, or Gaussian. Our results show that the LibHSA infrastructure greatly simplifies the effort integrating FPGAs and customized hardware into existing accelerator systems, runtimes and application software.

read more

Citations
More filters
Journal ArticleDOI

A high performance scalable fuzzy based modified Asymmetric Heterogene Multiprocessor System on Chip (AHt-MPSOC) reconfigurable architecture

TL;DR: A fuzzy based MIP and Graph theory based Traffic Estimator are proposed system used to define New asymmetric multiprocessor heterogene framework on microprocessor (AHt-MPSoC) architecture and the bandwidths, energy consumption, wait and transmission range are better accomplished in this suggested technique than the standard technique.
Journal ArticleDOI

A high performance scalable fuzzy based modified Asymmetric Heterogene Multiprocessor System on Chip (AHt-MPSOC) reconfigurable architecture

TL;DR: In this paper , a fuzzy control-based Heterogene Multi-Processor System-on-Chip (Ht-MPSoC) analysis has been proposed, which is also implemented with a multi-task framework.
Proceedings ArticleDOI

Improving Job Launch Rates in the TaPaSCo FPGA Middleware by Hardware/Software-Co-Design

TL;DR: TaPaSCo as discussed by the authors is an extension of the TaPaSco framework which improves the launch rates and latencies of FPGA-accelerated compute jobs, a crucial factor for the performance of the overall system.
Proceedings ArticleDOI

Enabling OpenMP Task Parallelism on Multi-FPGAs

TL;DR: The OpenMP task-based computation offloading model is extended to enable several FPGAs to work as a single Multi-FPGA architecture, and results have shown close to linear speedups as the number of FPG as well as IP-cores per FPGA increases.
References
More filters
Book

Computer Architecture: A Quantitative Approach

TL;DR: This best-selling title, considered for over a decade to be essential reading for every serious student and practitioner of computer design, has been updated throughout to address the most important trends facing computer designers today.
Book

Computer Architecture, Fifth Edition: A Quantitative Approach

TL;DR: The Fifth Edition of Computer Architecture focuses on this dramatic shift in the ways in which software and technology in the "cloud" are accessed by cell phones, tablets, laptops, and other mobile computing devices.
Proceedings ArticleDOI

OpenCL-based design methodology for application-specific processors

TL;DR: The case shows that the use of OpenCL allows producing scalable application-specific processor designs and makes it possible to gradually reach the performance of hand-tailored RTL designs by exploiting the OpenCL extension mechanism to access custom hardware operations of varying complexity.
Proceedings ArticleDOI

Embedded supercomputing in FPGAs with the VectorBlox MXP matrix processor

TL;DR: The VectorBlox MXP Matrix Processor is an FPGA-based soft processor capable of executing data-parallel software algorithms at hardware-like speeds and seamlessly ties into existing Altera and Xilinx development flows, simplifying system creation and deployment.
Proceedings ArticleDOI

A comprehensive performance analysis of HSA and OpenCL 2.0

TL;DR: This paper provides the first comprehensive study of OpenCL 2.0 and HSA 1.0 execution, considering OpenCL 1.2 as the baseline, and finds that by using HSA signals, it can remove 92% of the overhead due to synchronous kernel launches.
Related Papers (5)