scispace - formally typeset
Search or ask a question
Topic

Cepstrum

About: Cepstrum is a research topic. Over the lifetime, 3346 publications have been published within this topic receiving 55742 citations.


Papers
More filters
Book ChapterDOI
19 Sep 2016
TL;DR: This application of machine learning to vehicle subsystem monitoring simplifies traditional engine diagnostics, aiding vehicle owners in the maintenance process and opening up new avenues for pervasive mobile sensing and automotive diagnostics.
Abstract: We address the problem of detecting whether an engine is misfiring by using machine learning techniques on transformed audio data collected from a smartphone. We recorded audio samples in an uncontrolled environment and extracted Fourier, Wavelet and Mel-frequency Cepstrum features from normal and abnormal engines. We then implemented Fisher Score and Relief Score based variable ranking to obtain an informative reduced feature set for training and testing classification algorithms. Using this feature set, we were able to obtain a model accuracy of over 99 % using a linear SVM applied to outsample data. This application of machine learning to vehicle subsystem monitoring simplifies traditional engine diagnostics, aiding vehicle owners in the maintenance process and opening up new avenues for pervasive mobile sensing and automotive diagnostics.

18 citations

Proceedings Article
01 Jan 2003
TL;DR: None of the feature transformations could outperform the baseline when used alone, but improvement in the word error rate was gained when the baseline feature was combined with the feature transformation stream.
Abstract: In this work, linear and nonlinear feature transformations have been experimented in ASR front end. Unsupervised transformations were based on principal component analysis and independent component analysis. Discriminative transformations were based on linear discriminant analysis and multilayer perceptron networks. The acoustic models were trained using a subset of HUB5 training data and they were tested using OGI Numbers corpus. Baseline feature vector consisted of PLP cepstrum and energy with first and second order deltas. None of the feature transformations could outperform the baseline when used alone, but improvement in the word error rate was gained when the baseline feature was combined with the feature transformation stream. Two combination methods were experimented: feature vector concatenation and n-best list combination using ROVER. Best results were obtained using the combination of the baseline PLP cepstrum and the feature transform based on multilayer perceptron network. The word error rate in the number recognition task was reduced from 4.1 to 3.1.

18 citations

Proceedings Article
01 Sep 1998
TL;DR: Preliminary experimental results show that the hybrid speaker verification system performs better than either of the sub-systems in terms of the equal error rate (EER), and improves the performance of the cepstral-based HMM system by 78% on average.
Abstract: In this paper we report on a study of the variability of voice source parameters in the context of speaker characterisation, and we propose a speaker verification system which incorporates these parameters. The motivation for this approach is that, whilst we have conscious control over the action of our vocal tract articulators such as the tongue and jaw, we have only limited voluntary muscle control over the vocal cords. The conjecture is, therefore, that impostors are less likely to be able to mimic vocal cord effects than vocal tract effects. The hybrid speaker verification system that is proposed incorporates two sub-systems to improve the overall performance: (i) a cepstral-based HMM with cohort normalisation and (ii) voice source parameters derived from Multi-cycle Closed-phase Glottal Inverse Filtering (MCGIF). Preliminary experimental results show that the hybrid system performs better than either of the sub-systems in terms of the equal error rate (EER). Specifically, the hybrid system improved the performance of the cepstral-based HMM system by 78% on average, resulting in a mean EER of 0.42% for the specific tests conducted.

18 citations

Journal ArticleDOI
TL;DR: In this article, the problem of 2D phase retrieval from the magnitude of the Fourier spectrum was formulated as one of computing the parameters that uniquely determine the signal and solved by employing the annihilating filter method, particularly for the case when the parameters are distinct.
Abstract: We address the problem of two-dimensional (2-D) phase retrieval from magnitude of the Fourier spectrum. We consider 2-D signals that are characterized by first-order difference equations, which have a parametric representation in the Fourier domain. We show that, under appropriate stability conditions, such signals can be reconstructed uniquely from the Fourier transform magnitude. We formulate the phase retrieval problem as one of computing the parameters that uniquely determine the signal. We show that the problem can be solved by employing the annihilating filter method, particularly for the case when the parameters are distinct. For the more general case of the repeating parameters, the annihilating filter method is not applicable. We circumvent the problem by employing the algebraically coupled matrix pencil (ACMP) method. In the noiseless measurement setup, exact phase retrieval is possible. We also establish a link between the proposed analysis and 2-D cepstrum. In the noisy case, we derive Cramer–Rao lower bounds (CRLBs) on the estimates of the parameters and present Monte Carlo performance analysis as a function of the noise level. Comparisons with state-of-the-art techniques in terms of signal reconstruction accuracy show that the proposed technique outperforms the Fienup and relaxed averaged alternating reflections (RAAR) algorithms in the presence of noise.

18 citations

Journal ArticleDOI
TL;DR: A system approach to acquire a three-dimensional object distribution is presented using a compact and cost efficient camera system with an engineered point spread function and is tested experimentally by estimating the three- dimensional distribution of an extended passively illuminated scene.
Abstract: A system approach to acquire a three-dimensional object distribution is presented using a compact and cost efficient camera system with an engineered point spread function. The corresponding monocular setup incorporates a phase-only computer-generated hologram in combination with a conventional imaging objective in order to optically encode the axial information within a single two-dimensional image. The object’s depth map is calculated using a novel approach based on the power cepstrum of the image. The in-plane RGB image information is restored with an extended depth of focus by applying an adapted Wiener filter. The presented approach is tested experimentally by estimating the three-dimensional distribution of an extended passively illuminated scene.

18 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
82% related
Robustness (computer science)
94.7K papers, 1.6M citations
80% related
Feature (computer vision)
128.2K papers, 1.7M citations
79% related
Deep learning
79.8K papers, 2.1M citations
79% related
Support vector machine
73.6K papers, 1.7M citations
78% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202386
2022206
202160
202096
2019135
2018130