scispace - formally typeset
Search or ask a question
Author

Alan C. Bovik

Bio: Alan C. Bovik is an academic researcher from University of Texas at Austin. The author has contributed to research in topics: Image quality & Video quality. The author has an hindex of 102, co-authored 837 publications receiving 96088 citations. Previous affiliations of Alan C. Bovik include University of Illinois at Urbana–Champaign & University of Sydney.


Papers
More filters
Journal ArticleDOI
TL;DR: In this paper, the authors apply a promising method for the automatic extraction of channel presence called RivaMap, on both synthetic and experimental data sets, to investigate the changes experienced by the system in response to five changes in forcings.
Abstract: River deltas are complex, dynamic systems whose channel networks evolve in response to internal and external forcings. To capture these changes, methods to extract and analyze deltaic morphodynamics automatically using available remotely sensed imagery and experimental observations are needed. Here, we apply a promising method for the automatic extraction of channel presence called RivaMap, on both synthetic and experimental data sets, to investigate the changes experienced by the system in response to five changes in forcings. RivaMap is an automated method to extract nonbinarized channel locations from imagery based on a singularity index that combines the multiscale first and second derivatives of the image intensity to favor the identification of curvilinear features and suppress edges. We quantify how the channelization varies by computing the channelized response variance (CRV), which we define as the variance of each pixel's singularity index response through time. We find that increasing magnitudes of sediment inflow (Qs) and water inflow (Qw) result in corresponding increases in the maximum CRV. We find that increasing the ratio of Qs to Qw results in increased number of channelized areas. We see that adding cohesion to the exposed sediment surface of the experimental delta results in decreased magnitude and decreased number of channelized areas in the CRV. Finally, by observing changes to the CRV over time, we are able to quantify the timescale of internal channel reorganization events as the experimental delta evolves under constant forcings.

9 citations

Journal ArticleDOI
TL;DR: The proposed NNGVF snake expresses the gradient vector flow as a convolution with a neighborhood-extending Laplacian operator augmented by a noise-smoothing mask to provide better segmentation and an enlarged capture range.
Abstract: We propose a novel external force for active contours, which we call neighborhood-extending and noise-smoothing gradient vector flow (NNGVF). The proposed NNGVF snake expresses the gradient vector flow (GVF) as a convolution with a neighborhood-extending Laplacian operator augmented by a noise-smoothing mask. We find that the NNGVF snake provides better segmentation than the GVF snake in terms of noise resistance, weak edge preservation, and an enlarged capture range. The NNGVF snake accomplishes this with a reduced computational cost while maintaining other desirable properties of the GVF snake, such as initialization insensitivity and good convergences at concavities. We demonstrate the advantages of NNGVF on synthetic and real images.

9 citations

Proceedings ArticleDOI
03 Apr 1990
TL;DR: Texture analysis algorithms that decompose images into oriented spatial frequency channels are studied and the effects of textural perturbations interpreted as localized amplitude and phase variations on the segmentation are found to be effectively ameliorated with postdetection smoothing.
Abstract: Texture analysis algorithms that decompose images into oriented spatial frequency channels are studied. Optimality properties for texture segmentation filters are considered using idealized (narrowband) image texture models. The functional uncertainty of the channel filters is shown to define a tradeoff between spectral selectivity and accuracy in boundary localization that is optimized by the 2-D Gabor functions. The idealized texture model is then relaxed to analyze the effects of textural perturbations interpreted as localized amplitude and phase variations on the segmentation. The effects of these perturbations are found to be effectively ameliorated with postdetection smoothing. >

9 citations

Journal ArticleDOI
TL;DR: A simple filter-based model is created that successfully captures the psychophysical data over a wide range of velocities and flicker frequencies and finds that the threshold of silencing occurs when the log frequency of object replacement is roughly one quarter of the log flicker frequency.
Abstract: Motion can impair the perception of other visual changes. Suchow and Alvarez (2011a, Current Biology, 21, 140-143) recently demonstrated a striking 'motion silencing' illusion, in which the salient changes among a group of objects' luminances (or colors, etc) appear to cease in the presence of large, coherent object motion. To understand why the visual system might be insensitive to changes in object luminances ('flicker') in the presence of object motion, we constructed similar stimuli and did a systematic spectral analysis of them. We conducted human psychophysical experiments to examine motion silencing as a function of stimulus velocity, flicker frequency, and spacing; and we created a simple filter-based model as a working hypothesis of motion silencing. From the results, we found that the threshold of silencing occurs when the log frequency of object replacement is roughly one quarter of the log flicker frequency (the mean slope is approximately 0.27). The dependence of silencing on object spacing may be explained as a phenomenon of temporal sampling of the stimuli by the visual system. Our proposed model successfully captures the psychophysical data over a wide range of velocities and flicker frequencies.

9 citations

Journal ArticleDOI
TL;DR: In this article, a debanding filter is proposed to adaptively smooth banded regions while preserving image edges and details, yielding perceptually enhanced gradient rendering with limited bit-depths.
Abstract: Banding artifacts, which manifest as staircase-like color bands on pictures or video frames, is a common distortion caused by compression of low-textured smooth regions. These false contours can be very noticeable even on high-quality videos, especially when displayed on high-definition screens. Yet, relatively little attention has been applied to this problem. Here we consider banding artifact removal as a visual enhancement problem, and accordingly, we solve it by applying a form of content-adaptive smoothing filtering followed by dithered quantization, as a post-processing module. The proposed debanding filter is able to adaptively smooth banded regions while preserving image edges and details, yielding perceptually enhanced gradient rendering with limited bit-depths. Experimental results show that our proposed debanding filter outperforms state-of-the-art false contour removing algorithms both visually and quantitatively.

9 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: In this article, a structural similarity index is proposed for image quality assessment based on the degradation of structural information, which can be applied to both subjective ratings and objective methods on a database of images compressed with JPEG and JPEG2000.
Abstract: Objective methods for assessing perceptual image quality traditionally attempted to quantify the visibility of errors (differences) between a distorted image and a reference image using a variety of known properties of the human visual system. Under the assumption that human visual perception is highly adapted for extracting structural information from a scene, we introduce an alternative complementary framework for quality assessment based on the degradation of structural information. As a specific example of this concept, we develop a structural similarity index and demonstrate its promise through a set of intuitive examples, as well as comparison to both subjective ratings and state-of-the-art objective methods on a database of images compressed with JPEG and JPEG2000. A MATLAB implementation of the proposed algorithm is available online at http://www.cns.nyu.edu//spl sim/lcv/ssim/.

40,609 citations

Book
01 Jan 1998
TL;DR: An introduction to a Transient World and an Approximation Tour of Wavelet Packet and Local Cosine Bases.
Abstract: Introduction to a Transient World. Fourier Kingdom. Discrete Revolution. Time Meets Frequency. Frames. Wavelet Zoom. Wavelet Bases. Wavelet Packet and Local Cosine Bases. An Approximation Tour. Estimations are Approximations. Transform Coding. Appendix A: Mathematical Complements. Appendix B: Software Toolboxes.

17,693 citations

Proceedings ArticleDOI
21 Jul 2017
TL;DR: Conditional adversarial networks are investigated as a general-purpose solution to image-to-image translation problems and it is demonstrated that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks.
Abstract: We investigate conditional adversarial networks as a general-purpose solution to image-to-image translation problems. These networks not only learn the mapping from input image to output image, but also learn a loss function to train this mapping. This makes it possible to apply the same generic approach to problems that traditionally would require very different loss formulations. We demonstrate that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks. Moreover, since the release of the pix2pix software associated with this paper, hundreds of twitter users have posted their own artistic experiments using our system. As a community, we no longer hand-engineer our mapping functions, and this work suggests we can achieve reasonable results without handengineering our loss functions either.

11,958 citations

Posted Content
TL;DR: Conditional Adversarial Network (CA) as discussed by the authors is a general-purpose solution to image-to-image translation problems, which can be used to synthesize photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks.
Abstract: We investigate conditional adversarial networks as a general-purpose solution to image-to-image translation problems. These networks not only learn the mapping from input image to output image, but also learn a loss function to train this mapping. This makes it possible to apply the same generic approach to problems that traditionally would require very different loss formulations. We demonstrate that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks. Indeed, since the release of the pix2pix software associated with this paper, a large number of internet users (many of them artists) have posted their own experiments with our system, further demonstrating its wide applicability and ease of adoption without the need for parameter tweaking. As a community, we no longer hand-engineer our mapping functions, and this work suggests we can achieve reasonable results without hand-engineering our loss functions either.

11,127 citations

Journal ArticleDOI
01 Apr 1988-Nature
TL;DR: In this paper, a sedimentological core and petrographic characterisation of samples from eleven boreholes from the Lower Carboniferous of Bowland Basin (Northwest England) is presented.
Abstract: Deposits of clastic carbonate-dominated (calciclastic) sedimentary slope systems in the rock record have been identified mostly as linearly-consistent carbonate apron deposits, even though most ancient clastic carbonate slope deposits fit the submarine fan systems better. Calciclastic submarine fans are consequently rarely described and are poorly understood. Subsequently, very little is known especially in mud-dominated calciclastic submarine fan systems. Presented in this study are a sedimentological core and petrographic characterisation of samples from eleven boreholes from the Lower Carboniferous of Bowland Basin (Northwest England) that reveals a >250 m thick calciturbidite complex deposited in a calciclastic submarine fan setting. Seven facies are recognised from core and thin section characterisation and are grouped into three carbonate turbidite sequences. They include: 1) Calciturbidites, comprising mostly of highto low-density, wavy-laminated bioclast-rich facies; 2) low-density densite mudstones which are characterised by planar laminated and unlaminated muddominated facies; and 3) Calcidebrites which are muddy or hyper-concentrated debrisflow deposits occurring as poorly-sorted, chaotic, mud-supported floatstones. These

9,929 citations