GPU Implementation of Image Convolution Using Sparse Model with Efficient Storage Format

doi:10.4018/IJGHPC.2018010104

Citations

Journal Article

Image filtering by convolution

Nora H. Sultan, Jannah Raad Taher, Ghadeer I. Maki

03 Apr 2023-Periodicals of Engineering and Natural Sciences (PEN)

TL;DR: In this article, the authors use convolutional techniques to correct degraded images; image filtering can be used in a variety of ways, including smoothing, sharpening, reducing noise, and detecting borders.

Abstract: Image filtering is a common technique in digital image processing that can be used to make a picture appear aesthetically different. Noise, also known as distracting visual artifacts, can lower the overall quality of a picture, which is why image improvement techniques are required to fix the problem. Filtering can be utilized in a variety of ways, including smoothing, sharpening, reducing noise, and detecting borders, to name a few. In this piece, we use convolutional techniques to correct degraded images. The first step is a point-by-point multiplication of the frequency-domain representation of the input picture I by a black image that has a small white rectangle W in the middle of it. This filter removes the higher harmonics and keeps only the lowest ones. Because the high frequencies in the input picture are filtered out, the spatial-domain image that is produced should look like a blurrier variation of the original picture. Conversely, a larger white rectangle W indicates greater detail preservation, because more high-frequency components of I are preserved.
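
The masking step described in this abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `dft`, `dft2`, `centered_freq`, and `lowpass` are hypothetical, the transform is a naive O(n²) DFT chosen only to keep the sketch dependency-free, and the "white rectangle in the middle" is expressed through signed frequency indices rather than by shifting the spectrum.

```python
import cmath

def dft(signal, inverse=False):
    """Naive O(n^2) 1-D discrete Fourier transform (inverse is scaled by 1/n)."""
    n = len(signal)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(x * cmath.exp(sign * 2j * cmath.pi * k * t / n)
                  for t, x in enumerate(signal))
        out.append(acc / n if inverse else acc)
    return out

def dft2(image, inverse=False):
    """2-D DFT: transform every row, then every column."""
    rows = [dft(row, inverse) for row in image]
    cols = [dft(list(col), inverse) for col in zip(*rows)]
    return [list(row) for row in zip(*cols)]

def centered_freq(index, n):
    """Signed frequency of DFT bin `index`, so that zero frequency maps to 0."""
    return index if index <= n // 2 else index - n

def lowpass(image, half_height, half_width):
    """Keep frequencies inside a centered rectangle W and zero all others."""
    h, w = len(image), len(image[0])
    spectrum = dft2(image)
    for u in range(h):
        for v in range(w):
            inside = (abs(centered_freq(u, h)) <= half_height and
                      abs(centered_freq(v, w)) <= half_width)
            if not inside:
                spectrum[u][v] = 0  # black region of the mask
    # The mask is symmetric, so the result is real up to rounding error.
    return [[value.real for value in row]
            for row in dft2(spectrum, inverse=True)]
```

Calling `lowpass(image, 0, 0)` keeps only the zero-frequency term and returns a constant image equal to the mean, while growing `half_height` and `half_width` retains more high-frequency detail, matching the behaviour the abstract attributes to a larger W.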

1 citation

References

Journal Article

The University of Florida sparse matrix collection

Timothy A. Davis¹, Yifan Hu²

University of Florida¹, AT&T Labs²

07 Dec 2011-ACM Transactions on Mathematical Software

TL;DR: This paper describes the University of Florida Sparse Matrix Collection, a large and actively growing set of sparse matrices that arise in real applications, and proposes a new multilevel coarsening scheme to facilitate graph visualization of the matrices.

Abstract: We describe the University of Florida Sparse Matrix Collection, a large and actively growing set of sparse matrices that arise in real applications. The Collection is widely used by the numerical linear algebra community for the development and performance evaluation of sparse matrix algorithms. It allows for robust and repeatable experiments: robust because performance results with artificially generated matrices can be misleading, and repeatable because matrices are curated and made publicly available in many formats. Its matrices cover a wide spectrum of domains, including those arising from problems with underlying 2D or 3D geometry (such as structural engineering, computational fluid dynamics, model reduction, electromagnetics, semiconductor devices, thermodynamics, materials, acoustics, computer graphics/vision, robotics/kinematics, and other discretizations) and those that typically do not have such geometry (optimization, circuit simulation, economic and financial modeling, theoretical and quantum chemistry, chemical process simulation, mathematics and statistics, power networks, and other networks and graphs). We provide software for accessing and managing the Collection, from MATLAB™, Mathematica™, Fortran, and C, as well as an online search capability. Graph visualization of the matrices is provided, and a new multilevel coarsening scheme is proposed to facilitate this task.

3,456 citations

Journal Article

Image selective smoothing and edge detection by nonlinear diffusion. II

Luis Alvarez, Pierre-Louis Lions, Jean-Michel Morel

01 Jun 1992-SIAM Journal on Numerical Analysis

TL;DR: In this article, a new version of the Perona and Malik theory for edge detection and image restoration is proposed, which keeps all the improvements of the original model and avoids its drawbacks.

Abstract: A new version of the Perona and Malik theory for edge detection and image restoration is proposed. This new version keeps all the improvements of the original model and avoids its drawbacks: it is proved to be stable in presence of noise, with existence and uniqueness results. Numerical experiments on natural images are presented.

2,565 citations

Efficient Sparse Matrix-Vector Multiplication on CUDA

Nathan Bell¹, Michael Garland¹

Nvidia¹

01 Jan 2008

TL;DR: Data structures and algorithms for SpMV that are efficiently implemented on the CUDA platform for the fine-grained parallel architecture of the GPU are discussed, and methods are developed to exploit several common forms of matrix structure while offering alternatives which accommodate greater irregularity.

Abstract: The massive parallelism of graphics processing units (GPUs) offers tremendous performance in many high-performance computing applications. While dense linear algebra readily maps to such platforms, harnessing this potential for sparse matrix computations presents additional challenges. Given its role in iterative methods for solving sparse linear systems and eigenvalue problems, sparse matrix-vector multiplication (SpMV) is of singular importance in sparse linear algebra. In this paper we discuss data structures and algorithms for SpMV that are efficiently implemented on the CUDA platform for the fine-grained parallel architecture of the GPU. Given the memory-bound nature of SpMV, we emphasize memory bandwidth efficiency and compact storage formats. We consider a broad spectrum of sparse matrices, from those that are well-structured and regular to highly irregular matrices with large imbalances in the distribution of nonzeros per matrix row. We develop methods to exploit several common forms of matrix structure while offering alternatives which accommodate greater irregularity. On structured, grid-based matrices we achieve performance of 36 GFLOP/s in single precision and 16 GFLOP/s in double precision on a GeForce GTX 280 GPU. For unstructured finite-element matrices, we observe performance in excess of 15 GFLOP/s and 10 GFLOP/s in single and double precision respectively. These results compare favorably to prior state-of-the-art studies of SpMV methods on conventional multicore processors. Our double precision SpMV performance is generally two and a half times that of a Cell BE with 8 SPEs and more than ten times greater than that of a quad-core Intel Clovertown system.
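
To make the SpMV operation concrete, the following is a minimal sequential sketch in Python, assuming the compressed sparse row (CSR) layout as one example of a compact storage format. The function name `csr_spmv` and the array names `row_ptr`, `col_idx`, and `values` are illustrative choices, not taken from the paper, and the loop only mirrors the work a GPU kernel would distribute across threads.

```python
def csr_spmv(row_ptr, col_idx, values, x):
    """Compute y = A x for a sparse matrix A stored in CSR form.

    row_ptr[i] .. row_ptr[i + 1] - 1 index the nonzeros of row i,
    col_idx[k] is the column of nonzero k, and values[k] is its value.
    """
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for row in range(n_rows):
        # Each row is independent, so a simple GPU kernel could assign
        # one thread per row; rows with many nonzeros then take longer.
        for k in range(row_ptr[row], row_ptr[row + 1]):
            y[row] += values[k] * x[col_idx[k]]
    return y


# Example matrix:
# [[4, 0, 0],
#  [0, 5, 2],
#  [1, 0, 3]]
row_ptr = [0, 1, 3, 5]
col_idx = [0, 1, 2, 0, 2]
values = [4.0, 5.0, 2.0, 1.0, 3.0]
result = csr_spmv(row_ptr, col_idx, values, [1.0, 2.0, 3.0])
```

Only the nonzeros and their column indices are read, which is why memory bandwidth rather than arithmetic tends to limit SpMV, and why uneven row lengths translate directly into uneven work per row.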

795 citations

Journal Article

Medical Image Processing on the GPU: Past, Present and Future

Anders Eklund¹, Paul Dufort², Daniel Forsberg³, Stephen M. LaConte¹,⁴

Virginia Tech¹, University of Toronto², Linköping University³, Wake Forest University⁴

01 Dec 2013-Medical Image Analysis

TL;DR: This review presents the past and present work on GPU accelerated medical image processing, and is meant to serve as an overview and introduction to existing GPU implementations.

360 citations

Proceedings Article

Efficient sparse matrix-vector multiplication on GPUs using the CSR storage format

Joseph L. Greathouse¹, Mayank Daga¹

Advanced Micro Devices¹

16 Nov 2014

TL;DR: This work proposes a novel algorithm, CSR-Adaptive, which keeps the CSR format intact and maps well to GPUs, and achieves an average speedup of 14.7× over existing CSR-based algorithms and 2.3× over clSpMV cocktail, which uses an assortment of matrix formats.

Abstract: The performance of sparse matrix-vector multiplication (SpMV) is important to computational scientists. Compressed sparse row (CSR) is the most frequently used format to store sparse matrices. However, CSR-based SpMV on graphics processing units (GPUs) has poor performance due to irregular memory access patterns, load imbalance, and reduced parallelism. This has led researchers to propose new storage formats. Unfortunately, dynamically transforming CSR into these formats has significant runtime and storage overheads. We propose a novel algorithm, CSR-Adaptive, which keeps the CSR format intact and maps well to GPUs. Our implementation addresses the aforementioned challenges by (i) efficiently accessing DRAM by streaming data into the local scratchpad memory and (ii) dynamically assigning different numbers of rows to each parallel GPU compute unit. CSR-Adaptive achieves an average speedup of 14.7× over existing CSR-based algorithms and 2.3× over clSpMV cocktail, which uses an assortment of matrix formats.
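
The idea in point (ii), giving each compute unit a different number of rows, can be illustrated with a small Python sketch. This is a simplified illustration and not the published CSR-Adaptive algorithm: the function `group_rows` and the parameter `max_nnz` (standing in for how many nonzeros fit in one unit's scratchpad) are hypothetical, and the greedy packing rule is an assumption made for the example.

```python
def group_rows(row_ptr, max_nnz):
    """Greedily pack consecutive CSR rows into blocks of at most max_nnz nonzeros.

    Returns block boundaries: block i covers rows bounds[i] .. bounds[i + 1] - 1.
    A single row with more than max_nnz nonzeros gets a block to itself.
    """
    n_rows = len(row_ptr) - 1
    bounds = [0]
    start = 0  # first row of the block currently being filled
    for row in range(n_rows):
        nnz_if_added = row_ptr[row + 1] - row_ptr[start]
        if nnz_if_added > max_nnz and row > start:
            bounds.append(row)  # close the block just before this row
            start = row
    bounds.append(n_rows)
    return bounds


# Rows with 1, 1, 1, 10, 2 and 2 nonzeros, and a budget of 4 nonzeros per block.
row_ptr = [0, 1, 2, 3, 13, 15, 17]
blocks = group_rows(row_ptr, max_nnz=4)
```

With this input the three short rows share one block, the long row is isolated, and the last two rows share a block, so each block carries a comparable amount of work while the CSR arrays themselves stay unchanged.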

182 citations
