scispace - formally typeset
Search or ask a question
Topic

Cepstrum

About: Cepstrum is a research topic. Over the lifetime, 3346 publications have been published within this topic receiving 55742 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: The diagnostic framework combining DRS-CEL and morphological analysis is validated by comparing several methods and related studies, which offers a promising solution for wind-farm applications.
Abstract: Wind turbine blade bearings are often operated in harsh circumstances, which may easily be damaged causing the turbine to lose control and to further result in the reduction of energy production. However, for condition monitoring and fault diagnosis (CMFD) of wind turbine blade bearings, one of the main difficulties is that the rotation speeds of blade bearings are very slow (less than 5 r/min). Over the past few years, acoustic emission (AE) analysis has been used to carry out bearing CMFD. This article presents the results that reflect the potential of the AE analysis for diagnosing a slow-speed wind turbine blade bearing. To undertake this experiment, a 15-year-old naturally damaged industrial and slow-speed blade bearing is used for this study. However, due to very slow rotation speed conditions, the fault signals are very weak and masked by heavy noise disturbances. To denoise the raw AE signals, we propose a novel cepstrum editing method, discrete/random separation-based cepstrum editing liftering (DRS-CEL), to extract weak fault features from raw AE signals, where DRS is used to edit the cepstrum. Thereafter, the morphological envelope analysis is employed to further filter the residual noise leaked from DRS-CEL and demodulate the denoised signal, so the specific bearing fault type can be inferred in the frequency domain. The diagnostic framework combining DRS-CEL and morphological analysis is validated by comparing several methods and related studies, which offers a promising solution for wind-farm applications.

71 citations

Journal ArticleDOI
Hong Kook Kim, Richard Rose1
TL;DR: In this paper, a set of acoustic feature pre-processing techniques that are applied to improving automatic speech recognition (ASR) performance on noisy speech recognition tasks are presented. But the main contribution of this paper is an approach for cepstrum-domain feature compensation in ASR which is motivated by techniques for decomposing speech and noise that were originally developed for noisy speech enhancement.
Abstract: This paper presents a set of acoustic feature pre-processing techniques that are applied to improving automatic speech recognition (ASR) performance on noisy speech recognition tasks. The principal contribution of this paper is an approach for cepstrum-domain feature compensation in ASR which is motivated by techniques for decomposing speech and noise that were originally developed for noisy speech enhancement. This approach is applied in combination with other feature compensation algorithms to compensating ASR features obtained from a mel-filterbank cepstrum coefficient front-end. Performance comparisons are made with respect to the application of the minimum mean squared error log spectral amplitude (MMSE-LSA) estimator based speech enhancement algorithm prior to feature analysis. An experimental study is presented where the feature compensation approaches described in the paper are found to greatly reduce ASR word error rate compared to uncompensated features under environmental and channel mismatched conditions.

70 citations

Proceedings ArticleDOI
19 Apr 1994
TL;DR: Noise-masking is considered, through the addition of a constant offset to the linear spectral estimates, which provides a feature space far more stable to changes in noise statistics, which leads to performance equivalent to that achieved by explicit modelling.
Abstract: This paper examines the effects of additive Gaussian noise on the short-term cepstral analysis of speech. We identify three distinct modifications to the long-term statistics of the cepstrum that cause a gross mismatch after the addition of noise, namely: a mean shift, a change of variance and a distribution distorted from normal, with distinct bimodal characteristics. We assess the importance of each of these, and demonstrate the limitations of simple cepstral mappings. We then consider noise-masking, through the addition of a constant offset to the linear spectral estimates, which provides a feature space far more stable to changes in noise statistics. This leads to performance equivalent to that achieved by explicit modelling. >

70 citations

Journal ArticleDOI
TL;DR: A psychologically-inspired binary cascade classification schema is proposed for speech emotion recognition, and the recently proposed speaker-independent experimental protocol is tested on the Berlin emotional speech database for each gender separately.
Abstract: In this paper, a psychologically-inspired binary cascade classification schema is proposed for speech emotion recognition. Performance is enhanced because commonly confused pairs of emotions are distinguishable from one another. Extracted features are related to statistics of pitch, formants, and energy contours, as well as spectrum, cepstrum, perceptual and temporal features, autocorrelation, MPEG-7 descriptors, Fujisaki's model parameters, voice quality, jitter, and shimmer. Selected features are fed as input to K nearest neighborhood classifier and to support vector machines. Two kernels are tested for the latter: linear and Gaussian radial basis function. The recently proposed speaker-independent experimental protocol is tested on the Berlin emotional speech database for each gender separately. The best emotion recognition accuracy, achieved by support vector machines with linear kernel, equals 87.7%, outperforming state-of-the-art approaches. Statistical analysis is first carried out with respect to the classifiers' error rates and then to evaluate the information expressed by the classifiers' confusion matrices.

70 citations

Journal ArticleDOI
TL;DR: In this paper, the performance of seven different cepstrum-based methods for radial blind deconvolution of medical ultrasound images was compared, and the results showed that the generalized cepstrum method gave the best images closely followed by the complex cepstrate using phase unwrapping or polynomial rooting.
Abstract: This paper compares the performance of seven different cepstrum-based methods for radial blind deconvolution of medical ultrasound images. The first is the generalized cepstrum method. The second is the spectral root cepstrum method. These methods have received little attention so far. The last five methods are all based on the complex cepstrum, but different computational techniques in the spatial and frequency domain are employed. Using in vivo radio frequency data from a clinical scanner, the generalized cepstrum method gave the best images closely followed by the complex cepstrum using phase unwrapping or polynomial rooting. The complex cepstrum method using higher-order statistics was ranked as low as number five. These results are an important guideline for selecting a specific cepstrum-based radial deconvolution method for implementation in ultrasound scanners.

70 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
82% related
Robustness (computer science)
94.7K papers, 1.6M citations
80% related
Feature (computer vision)
128.2K papers, 1.7M citations
79% related
Deep learning
79.8K papers, 2.1M citations
79% related
Support vector machine
73.6K papers, 1.7M citations
78% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202386
2022206
202160
202096
2019135
2018130