Topic

Cepstrum

About: Cepstrum is a research topic. Over the lifetime, 3346 publications have been published within this topic receiving 55742 citations.

...read moreread less

Papers published on a yearly basis

1 / 2

Papers

PDF

Open Access

More filters

Book Chapter•DOI•

Engine Misfire Detection with Pervasive Mobile Audio

[...]

Joshua E. Siegel¹, Sumeet Kumar¹, Isaac M. Ehrenberg¹, Sanjay E. Sarma¹•Institutions (1)

Massachusetts Institute of Technology¹

19 Sep 2016

TL;DR: This application of machine learning to vehicle subsystem monitoring simplifies traditional engine diagnostics, aiding vehicle owners in the maintenance process and opening up new avenues for pervasive mobile sensing and automotive diagnostics.

...read moreread less

Abstract: We address the problem of detecting whether an engine is misfiring by using machine learning techniques on transformed audio data collected from a smartphone. We recorded audio samples in an uncontrolled environment and extracted Fourier, Wavelet and Mel-frequency Cepstrum features from normal and abnormal engines. We then implemented Fisher Score and Relief Score based variable ranking to obtain an informative reduced feature set for training and testing classification algorithms. Using this feature set, we were able to obtain a model accuracy of over 99 % using a linear SVM applied to outsample data. This application of machine learning to vehicle subsystem monitoring simplifies traditional engine diagnostics, aiding vehicle owners in the maintenance process and opening up new avenues for pervasive mobile sensing and automotive diagnostics.

...read moreread less

18 citations

Proceedings Article•

Feature Transformations and Combinations for Improving ASR Performance

[...]

Panu Somervuo¹, Barry Y. Chen¹, Qifeng Zhu²•Institutions (2)

University of California, Berkeley¹, Helsinki University of Technology²

01 Jan 2003

TL;DR: None of the feature transformations could outperform the baseline when used alone, but improvement in the word error rate was gained when the baseline feature was combined with the feature transformation stream.

...read moreread less

Abstract: In this work, linear and nonlinear feature transformations have been experimented in ASR front end. Unsupervised transformations were based on principal component analysis and independent component analysis. Discriminative transformations were based on linear discriminant analysis and multilayer perceptron networks. The acoustic models were trained using a subset of HUB5 training data and they were tested using OGI Numbers corpus. Baseline feature vector consisted of PLP cepstrum and energy with first and second order deltas. None of the feature transformations could outperform the baseline when used alone, but improvement in the word error rate was gained when the baseline feature was combined with the feature transformation stream. Two combination methods were experimented: feature vector concatenation and n-best list combination using ROVER. Best results were obtained using the combination of the baseline PLP cepstrum and the feature transform based on multilayer perceptron network. The word error rate in the number recognition task was reduced from 4.1 to 3.1.

...read moreread less

18 citations

Proceedings Article•

Voice source parameters for speaker verification

[...]

Andreas Neocleous¹, Patrick A. Naylor¹•Institutions (1)

Imperial College London¹

01 Sep 1998

TL;DR: Preliminary experimental results show that the hybrid speaker verification system performs better than either of the sub-systems in terms of the equal error rate (EER), and improves the performance of the cepstral-based HMM system by 78% on average.

...read moreread less

Abstract: In this paper we report on a study of the variability of voice source parameters in the context of speaker characterisation, and we propose a speaker verification system which incorporates these parameters. The motivation for this approach is that, whilst we have conscious control over the action of our vocal tract articulators such as the tongue and jaw, we have only limited voluntary muscle control over the vocal cords. The conjecture is, therefore, that impostors are less likely to be able to mimic vocal cord effects than vocal tract effects. The hybrid speaker verification system that is proposed incorporates two sub-systems to improve the overall performance: (i) a cepstral-based HMM with cohort normalisation and (ii) voice source parameters derived from Multi-cycle Closed-phase Glottal Inverse Filtering (MCGIF). Preliminary experimental results show that the hybrid system performs better than either of the sub-systems in terms of the equal error rate (EER). Specifically, the hybrid system improved the performance of the cepstral-based HMM system by 78% on average, resulting in a mean EER of 0.42% for the specific tests conducted.

...read moreread less

18 citations

Journal Article•DOI•

Exact Phase Retrieval for a Class of 2-D Parametric Signals

[...]

Basty Ajay Shenoy¹, Chandra Sekhar Seelamantula¹•Institutions (1)

Indian Institute of Science¹

01 Jan 2015-IEEE Transactions on Signal Processing

TL;DR: In this article, the problem of 2D phase retrieval from the magnitude of the Fourier spectrum was formulated as one of computing the parameters that uniquely determine the signal and solved by employing the annihilating filter method, particularly for the case when the parameters are distinct.

...read moreread less

Abstract: We address the problem of two-dimensional (2-D) phase retrieval from magnitude of the Fourier spectrum. We consider 2-D signals that are characterized by first-order difference equations, which have a parametric representation in the Fourier domain. We show that, under appropriate stability conditions, such signals can be reconstructed uniquely from the Fourier transform magnitude. We formulate the phase retrieval problem as one of computing the parameters that uniquely determine the signal. We show that the problem can be solved by employing the annihilating filter method, particularly for the case when the parameters are distinct. For the more general case of the repeating parameters, the annihilating filter method is not applicable. We circumvent the problem by employing the algebraically coupled matrix pencil (ACMP) method. In the noiseless measurement setup, exact phase retrieval is possible. We also establish a link between the proposed analysis and 2-D cepstrum. In the noisy case, we derive Cramer–Rao lower bounds (CRLBs) on the estimates of the parameters and present Monte Carlo performance analysis as a function of the noise level. Comparisons with state-of-the-art techniques in terms of signal reconstruction accuracy show that the proposed technique outperforms the Fienup and relaxed averaged alternating reflections (RAAR) algorithms in the presence of noise.

...read moreread less

18 citations

Journal Article•DOI•

Single shot three-dimensional imaging using an engineered point spread function.

[...]

René Berlich¹, Andreas Bräuer¹, Sjoerd Stallinga²•Institutions (2)

Fraunhofer Society¹, Delft University of Technology²

21 Mar 2016-Optics Express

TL;DR: A system approach to acquire a three-dimensional object distribution is presented using a compact and cost efficient camera system with an engineered point spread function and is tested experimentally by estimating the three- dimensional distribution of an extended passively illuminated scene.

...read moreread less

Abstract: A system approach to acquire a three-dimensional object distribution is presented using a compact and cost efficient camera system with an engineered point spread function. The corresponding monocular setup incorporates a phase-only computer-generated hologram in combination with a conventional imaging objective in order to optically encode the axial information within a single two-dimensional image. The object’s depth map is calculated using a novel approach based on the power cepstrum of the image. The in-plane RGB image information is restored with an extended depth of focus by applying an adapted Wiener filter. The presented approach is tested experimentally by estimating the three-dimensional distribution of an extended passively illuminated scene.

...read moreread less

18 citations

Collapse

Network Information

Performance

Metrics

3,645

Papers

60,375

Citations

No. of papers in the topic in previous years
Year	Papers
2023	86
2022	206
2021	60
2020	96
2019	135
2018	130

Cepstrum

Papers published on a yearly basis

Papers

Trending Questions (9)

Network Information

Related Topics (5)

Performance

Metrics