Comparing Energy Efficiency of CPU, GPU and FPGA Implementations for Vision Kernels

doi:10.1109/ICESS.2019.8782524

Open AccessProceedings ArticleDOI

Comparing Energy Efficiency of CPU, GPU and FPGA Implementations for Vision Kernels

Murad Qasaimeh, +5 more

- pp 1-8

Chats0

TLDR

A comprehensive benchmark of the run-time performance and energy efficiency of a wide range of vision kernels is conducted and rationales for why a given underlying hardware architecture innately performs well or poorly based on the characteristics of arange of vision kernel categories are discussed.

Abstract:

Developing high performance embedded vision applications requires balancing run-time performance with energy constraints. Given the mix of hardware accelerators that exist for embedded computer vision (e.g. multi-core CPUs, GPUs, and FPGAs), and their associated vendor optimized vision libraries, it becomes a challenge for developers to navigate this fragmented solution space. To aid with determining which embedded platform is most suitable for their application, we conduct a comprehensive benchmark of the run-time performance and energy efficiency of a wide range of vision kernels. We discuss rationales for why a given underlying hardware architecture innately performs well or poorly based on the characteristics of a range of vision kernel categories. Specifically, our study is performed for three commonly used HW accelerators for embedded vision applications: ARM57 CPU, Jetson TX2 GPU and ZCU102 FPGA, using their vendor optimized vision libraries: OpenCV, VisionWorks and xfOpenCV. Our results show that the GPU achieves an energy/frame reduction ratio of 1.1–3.2× compared to the others for simple kernels. While for more complicated kernels and complete vision pipelines, the FPGA outperforms the others with energy/frame reduction ratios of 1.2–22.3×. It is also observed that the FPGA performs increasingly better as a vision application's pipeline complexity grows.

Comparing Energy Efficiency of CPU, GPU and FPGA Implementations for Vision Kernels

Citations

UAV in the advent of the twenties: Where we stand and what is next

Optimizing the Deep Neural Networks by Layer-Wise Refined Pruning and the Acceleration on FPGA

Treehouse: A Case For Carbon-Aware Datacenter Software

ReconROS: Flexible Hardware Acceleration for ROS2 Applications

Field Trial of a Flexible Real-Time Software-Defined GPU-Based Optical Receiver

References

Rodinia: A benchmark suite for heterogeneous computing

Accelerating Compute-Intensive Applications with GPUs and FPGAs

Scaling, power, and the future of CMOS

OpenCV: Open Source Computer Vision Library

A performance and energy comparison of FPGAs, GPUs, and multicores for sliding-window applications

Related Papers (5)

Comparing Energy Efficiency of CPU, GPU and FPGA Implementations for Vision Kernels

Benchmarking vision kernels and neural network inference accelerators on embedded platforms

Computer vision algorithms acceleration using graphic processors NVIDIA CUDA

Comparing performance and energy efficiency of FPGAs and GPUs for high productivity computing

GPU-FPGA Heterogeneous Computing with OpenCL-Enabled Direct Memory Access