scispace - formally typeset
Search or ask a question

Showing papers on "Spectrogram published in 1984"


Journal ArticleDOI
TL;DR: In this article, the authors demonstrate that the Wigner distribution function of real 2D signals is susceptible to Radon transform solution, where the 2D operation is reduced to a series of 1-D operations on the line-integral projections.
Abstract: The Wigner distribution function (WDF), a simultaneous coordinate and frequency representation of a signal, has properties useful in pattern recognition. Because the WDF is computationally demanding, its use is not usually appropriate in digital processing. Optical schemes have been developed to compute the WDF for one-dimensional (1 -D) signals, often using acousto-optic signal transducers. Some recent work has demonstrated the computation of two-dimensional (2-D) slices of the four-dimensional (4-D) WDF of a 2-D input transparency. In this latter case, the required 2-D Fourier transformation is performed by coherent optics. We demonstrate that computation of the WDF of real 2-D signals is susceptible to Radon transform solution. The 2-D operation is reduced to a series of 1 -D operations on the line-integral projections. The required projection data are produced optically, and the Fourier transformation is performed by efficient 1 -D processors (surface acoustic wave filters) by means of the chirp-transform algorithm. The resultant output gives 1 -D slices through the 4-D WDF nearly in real time, and the computation is not restricted to coherently illuminated transparencies. This approach may be useful in distinguishing patterns with known texture direction. The optical setup is easily modified to produce the cross-Wigner distribution function, a special case of the complex, or windowed, spectrogram.

28 citations


01 Jan 1984
TL;DR: It is demonstrated that computation of the WDF of real 2-D signals is susceptible to Radon transform solution, and the optical setup is easily modified to produce the cross-Wigner distribution function, a special case of the complex, or windowed, spectrogram.
Abstract: The Wigner distribution function (WDF), a simultaneous coordinate and frequency representation of a signal, has properties useful in pattern recognition. Because the WDF is computationally demanding, its use is not usually appropriate in digital processing. Optical schemes have been developed to compute the WDF for one-dimensional (1 -D) signals, often using acousto-optic signal transducers. Some recent work has demonstrated the computation of two-dimensional (2-D) slices of the four-dimensional (4-D) WDF of a 2-D input transparency. In this latter case, the required 2-D Fourier transformation is performed by coherent optics. We demonstrate that computation of the WDF of real 2-D signals is susceptible to Radon transform solution. The 2-D operation is reduced to a series of 1 -D operations on the line-integral projections. The required projection data are produced optically, and the Fourier transformation is performed by efficient 1 -D processors (surface acoustic wave filters) by means of the chirp-transform algorithm. The resultant output gives 1 -D slices through the 4-D WDF nearly in real time, and the computation is not restricted to coherently illuminated transparencies. This approach may be useful in distinguishing patterns with known texture direction. The optical setup is easily modified to produce the cross-Wigner distribution function, a special case of the complex, or windowed, spectrogram.

26 citations


Journal ArticleDOI
Clifford A. Pickover1
TL;DR: A description of a vector graphics facility for coordinated computation and display of such functions is presented, using the spectrogram and 3D power spectrum for calculations for a human bladder oncogene.

16 citations


Journal ArticleDOI
TL;DR: An astigmatic optical system is used to implement the generation of the mixed time-frequency representation of a signal, multiplication of that representation by a 2-D function H (ω, t ), and obtaining a filtered output by an inverse operation.

14 citations


Proceedings ArticleDOI
19 Mar 1984
TL;DR: The method used to retrieve the human expertise and results by an experienced spectrogram reader on 50 phonetically-balanced sentences are described and the impact this approach has on the design is discussed.
Abstract: We report on a long term project on the phonetic decoding of continuous speech, within the framework of an expert system. Spectrogram reading is the field of expertise under study, although it is clear that an automatic phonetic decoder should not be in the form of an expert system. We expect major improvements in the recognition rate when expert's rules will be incorporated into existing systems. We describe here both the method used to retrieve the human expertise and results by an experienced spectrogram reader on 50 phonetically-balanced sentences. We also sketch various ways in which the expert rules can be validated through an "inference engine" and actually used in a system. Finally, we discuss the impact this approach has on the design.

13 citations


Journal ArticleDOI
TL;DR: A color spectrogram is defined which retains all the information of the standard spectrogram, and which encodes information about the shape of each short‐time spectrum into the chromaticity with which that spectrum is illuminated.
Abstract: We define a color spectrogram which retains all the information of the standard spectrogram, and which encodes information about the shape of each short‐time spectrum into the chromaticity with which that spectrum is illuminated. The chromaticity for each spectrum is derived by interpreting each spectrum as an energy distribution in the visible light frequencies.

8 citations


Journal ArticleDOI
TL;DR: A microprocessor-based speech acquisition and processing system which uses waveform analysis techniques to extract measurements from the acoustic signal and operates in "real time" and employs noninvasive data-capturing techniques.
Abstract: Durational measurements of frication, aspiration, prevoicing, and voice onset are often difficult to perform from the spectrogram, and the resolution is limited to about 5 ms. In many instances, a ...

5 citations


Journal ArticleDOI
J. Asmuth1, Jerry D. Gibson
TL;DR: Subjective listening tests and spectrograms show the Kalman algorithm to be a viable alternative to the block-adaptive algorithms.
Abstract: The output speech from a fixed-tap differential pulse code modulation system with adaptive quantization and adaptive noise spectral shaping (NSS) is compared for block-adaptive and sequentially adaptive NSS filters. Block-adaptive systems incorporate a delay which can build up in analog communications systems and cause echoing problems. The buildup of delay can be eliminated by implementing the sequentially adaptive Kalman algorithm in the NSS filter. Simulations are performed for 4-, 8-, and 16-level quantizers with fourth- and ninth-order Kalman adaptation of the NSS filter. A block-adaptive system is implemented as a reference. Subjective listening tests and spectrograms show the Kalman algorithm to be a viable alternative to the block-adaptive algorithms. Signal-to-noise ratios are also given.

3 citations


Proceedings ArticleDOI
01 Mar 1984
TL;DR: A maximum likelihood (ML) classifier for discriminating between nonstationary Gaussian time series can be implemented by correlating the data spectrogram with templates that are constructed from ensemble average reference spectrograms, which implies that peak-oriented models of random processes are suboptimum for ML classification under high SNR conditions.
Abstract: A maximum likelihood (ML) classifier for discriminating between nonstationary Gaussian time series can be implemented by correlating the data spectrogram with templates that are constructed from ensemble average reference spectrograms. The time window used to synthesize the spectrograms must have a duration that is longer than the decorrelation time of the data in the neighborhood of the window. If the data time series exhibits significant nonstationarity within this decorrelation time, Karhunen-Loeve (K-L) basis functions should ideally be used to construct a generalized spectrogram, rather than using a standard spectrogram constructed with the usual sinusoidal basis functions. Utilization of a standard spectrogram imposes forced, pseudo-stationarity by approximating the autocovariance function of the data by the short-time autocorrelation function. This forced stationarity is routinely used to obtain linear prediction coefficients (LPC). When signal to interference ratio (SNR) is large, the templates that are used to classify a data spectrogram are sensitive to differences in the locations of nulls or zeroes in the expected signal spectrograms from different data classes. This null sensitivity seems to imply that peak-oriented models of random processes, e.g. , the all pole representation that is associated with LPC, are suboptimum for ML classification under high SNR conditions. Compensation for time warping is especially necessary if window durations are data dependent. Spectrogram implementation of the ML classifier yields a new similarity index for time warp compensation.

3 citations