scispace - formally typeset
Search or ask a question
Author

Mark J. T. Smith

Bio: Mark J. T. Smith is an academic researcher from Georgia Institute of Technology. The author has contributed to research in topics: Filter bank & Filter design. The author has an hindex of 31, co-authored 143 publications receiving 5358 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: A directionally oriented 2-D filter bank with the property that the individual channels may be critically sampled without loss of information is introduced and it is shown that these filter bank outputs may be maximally decimated to achieve a minimum sample representation in a way that permits the original signal to be exactly reconstructed.
Abstract: The authors introduce a directionally oriented 2-D filter bank with the property that the individual channels may be critically sampled without loss of information. The passband regions of the component filters are wedge-shaped and thus provide directional information. It is shown that these filter bank outputs may be maximally decimated to achieve a minimum sample representation in a way that permits the original signal to be exactly reconstructed. The authors discuss the theory for directional decomposition and reconstruction. In addition, implementation issues are addressed where realizations based on both recursive and nonrecursive filters are considered. >

911 citations

Journal ArticleDOI
TL;DR: It is shown that it is possible to design tree-structured analysis/reconstruction systems which meet the sampling rate condition and which result in exact reconstruction of the input signal.
Abstract: In recent years, tree-structured analysis/reconstruction systems have been extensively studied for use in subband coders for speech. In such systems, it is imperative that the individual channel signals be decimated in such a way that the number of samples coded and transmitted do not exceed the number of samples in the original speech signal. Under this constraint, the systems presented in the past have sought to remove the aliasing distortion while minimizing the overall analysis/reconstruction distortion. In this paper, it is shown that it is possible to design tree-structured analysis/reconstruction systems which meet the sampling rate condition and which result in exact reconstruction of the input signal. The conditions for exact reconstruction are developed and presented. Furthermore, it is shown that these conditions are not overly restrictive and high-quality frequency division may be performed in the analysis section. A filter design procedure is presented which allows high-quality filters to be easily designed.

785 citations

Journal ArticleDOI
TL;DR: The investigation focuses on the design of analysis/synthesis systems for image coding and the perceptual impact of these systems at low bit rates, and the theory, design, and implementation of both recursive and nonrecursive filtering systems are discussed.
Abstract: Analysis/synthesis systems designed for low bit rate image coding, their impact on overall system quality, and their computational complexity are discussed The investigation focuses on the design of analysis/synthesis systems for image coding and the perceptual impact of these systems at low bit rates Two objectives are emphasized in developing these systems: confining the total size of the subband images to be equal to the original image size, and designing the filters so that perceptual distortion is not introduced by the analysis/synthesis system Methods based on circular convolution and symmetric extensions are developed and discussed in detail The theory, design, and implementation of both recursive and nonrecursive filtering systems are discussed Methods are introduced which display advantages over conventional quadrature mirror filter based approaches >

388 citations

Journal ArticleDOI
TL;DR: A new night-time lane detection system and its accompanying framework are presented in this paper, which is an improvement over the ALD 1.0 with integration of pixel remapping, outlier removal, and prediction with tracking.
Abstract: A new night-time lane detection system and its accompanying framework are presented in this paper. The accompanying framework consists of an automated ground truth process and systematic storage of captured videos that will be used for training and testing. The proposed Advanced Lane Detector 2.0 (ALD 2.0) is an improvement over the ALD 1.0 or Layered Approach with integration of pixel remapping, outlier removal, and prediction with tracking. Additionally, a novel procedure to generate the ground truth data for lane marker locations is also proposed. The procedure consists of an original process called time slicing, which provides the user with unique visualization of the captured video and enables quick generation of ground truth information. Finally, the setup and implementation of a database hosting lane detection videos and standardized data sets for testing are also described. The ALD 2.0 is evaluated by means of the user-created annotations accompanying the videos. Finally, the planned improvements and remaining work are addressed.

226 citations

Proceedings ArticleDOI
19 Mar 1984
TL;DR: In this paper, it is shown that it is possible to design tree-structured analysis/reconstruction systems which meet the sampling rate condition and which also result in exact reconstruction of the input signal.
Abstract: In recent years, tree-structured analysis/reconstruction systems have been extensively studied for use in subband coders for speech. In such systems, it is important that the individual channel signals be decimated in such a way that the number of samples coded and transmitted does not exceed the number of samples in the original speech signal. Under this constraint, the systems presented in the past have sought to remove the aliasing distortion while minimizing the overall analysis/reconstruction distortion. In this paper, it is shown that it is possible to design tree-structured analysis/reconstruction systems which meet the sampling rate condition and which also result in exact reconstruction of the input signal. This paper develops the conditions for exact reconstruction and presents a general method for designing the corresponding high quality analysis and reconstruction filters.

185 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: In this paper, it is shown that the difference of information between the approximation of a signal at the resolutions 2/sup j+1/ and 2 /sup j/ (where j is an integer) can be extracted by decomposing this signal on a wavelet orthonormal basis of L/sup 2/(R/sup n/), the vector space of measurable, square-integrable n-dimensional functions.
Abstract: Multiresolution representations are effective for analyzing the information content of images. The properties of the operator which approximates a signal at a given resolution were studied. It is shown that the difference of information between the approximation of a signal at the resolutions 2/sup j+1/ and 2/sup j/ (where j is an integer) can be extracted by decomposing this signal on a wavelet orthonormal basis of L/sup 2/(R/sup n/), the vector space of measurable, square-integrable n-dimensional functions. In L/sup 2/(R), a wavelet orthonormal basis is a family of functions which is built by dilating and translating a unique function psi (x). This decomposition defines an orthogonal multiresolution representation called a wavelet representation. It is computed with a pyramidal algorithm based on convolutions with quadrature mirror filters. Wavelet representation lies between the spatial and Fourier domains. For images, the wavelet representation differentiates several spatial orientations. The application of this representation to data compression in image coding, texture discrimination and fractal analysis is discussed. >

20,028 citations

Book
01 Jan 1998
TL;DR: An introduction to a Transient World and an Approximation Tour of Wavelet Packet and Local Cosine Bases.
Abstract: Introduction to a Transient World. Fourier Kingdom. Discrete Revolution. Time Meets Frequency. Frames. Wavelet Zoom. Wavelet Bases. Wavelet Packet and Local Cosine Bases. An Approximation Tour. Estimations are Approximations. Transform Coding. Appendix A: Mathematical Complements. Appendix B: Software Toolboxes.

17,693 citations

Journal ArticleDOI
Ingrid Daubechies1
TL;DR: This work construct orthonormal bases of compactly supported wavelets, with arbitrarily high regularity, by reviewing the concept of multiresolution analysis as well as several algorithms in vision decomposition and reconstruction.
Abstract: We construct orthonormal bases of compactly supported wavelets, with arbitrarily high regularity. The order of regularity increases linearly with the support width. We start by reviewing the concept of multiresolution analysis as well as several algorithms in vision decomposition and reconstruction. The construction then follows from a synthesis of these different approaches.

8,588 citations

Journal ArticleDOI
TL;DR: A "true" two-dimensional transform that can capture the intrinsic geometrical structure that is key in visual information is pursued and it is shown that with parabolic scaling and sufficient directional vanishing moments, contourlets achieve the optimal approximation rate for piecewise smooth functions with discontinuities along twice continuously differentiable curves.
Abstract: The limitations of commonly used separable extensions of one-dimensional transforms, such as the Fourier and wavelet transforms, in capturing the geometry of image edges are well known. In this paper, we pursue a "true" two-dimensional transform that can capture the intrinsic geometrical structure that is key in visual information. The main challenge in exploring geometry in images comes from the discrete nature of the data. Thus, unlike other approaches, such as curvelets, that first develop a transform in the continuous domain and then discretize for sampled data, our approach starts with a discrete-domain construction and then studies its convergence to an expansion in the continuous domain. Specifically, we construct a discrete-domain multiresolution and multidirection expansion using nonseparable filter banks, in much the same way that wavelets were derived from filter banks. This construction results in a flexible multiresolution, local, and directional image expansion using contour segments, and, thus, it is named the contourlet transform. The discrete contourlet transform has a fast iterated filter bank algorithm that requires an order N operations for N-pixel images. Furthermore, we establish a precise link between the developed filter bank and the associated continuous-domain contourlet expansion via a directional multiresolution analysis framework. We show that with parabolic scaling and sufficient directional vanishing moments, contourlets achieve the optimal approximation rate for piecewise smooth functions with discontinuities along twice continuously differentiable curves. Finally, we show some numerical experiments demonstrating the potential of contourlets in several image processing applications.

3,948 citations

Journal ArticleDOI
TL;DR: A scheme for image compression that takes into account psychovisual features both in the space and frequency domains is proposed and it is shown that the wavelet transform is particularly well adapted to progressive transmission.
Abstract: A scheme for image compression that takes into account psychovisual features both in the space and frequency domains is proposed. This method involves two steps. First, a wavelet transform used in order to obtain a set of biorthogonal subclasses of images: the original image is decomposed at different scales using a pyramidal algorithm architecture. The decomposition is along the vertical and horizontal directions and maintains constant the number of pixels required to describe the image. Second, according to Shannon's rate distortion theory, the wavelet coefficients are vector quantized using a multiresolution codebook. To encode the wavelet coefficients, a noise shaping bit allocation procedure which assumes that details at high resolution are less visible to the human eye is proposed. In order to allow the receiver to recognize a picture as quickly as possible at minimum cost, a progressive transmission scheme is presented. It is shown that the wavelet transform is particularly well adapted to progressive transmission. >

3,925 citations