Open Access, Proceedings Article

Phase reconstruction of spectrograms with linear unwrapping: Application to audio signal restoration

TLDR
In this article, the authors propose a technique for reconstructing the phase of modified spectrograms of audio signals. Analyzing mixtures of sinusoids yields relationships between the phases of successive time frames in the time-frequency domain.
Abstract
This paper introduces a novel technique for reconstructing the phase of modified spectrograms of audio signals. From the analysis of mixtures of sinusoids we obtain relationships between phases of successive time frames in the Time-Frequency (TF) domain. To obtain similar relationships over frequencies, in particular within onset frames, we study an impulse model. Instantaneous frequencies and attack times are estimated locally to encompass the class of non-stationary signals such as vibratos. These techniques ensure both the vertical coherence of partials (over frequencies) and the horizontal coherence (over time). The method is tested on a variety of data and demonstrates better performance than traditional consistency-based approaches. We also introduce an audio restoration framework and observe that our technique outperforms traditional methods.
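
The horizontal (over time) phase relationship mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the authors' exact formulation, so this example assumes the standard sinusoidal-model relation, phase(t) = phase(t-1) + 2*pi*nu*S, where S is the hop size and nu is the normalized frequency of the partial. All names (`stft_bin`, `wrap`, `nu`, `hop`) and parameter values are hypothetical, and the frequency is taken as known rather than estimated locally as in the paper.

```python
import cmath
import math

def stft_bin(signal, start, n_fft, k):
    """Hann-windowed DFT of one frame, evaluated at a single bin k."""
    acc = 0j
    for m in range(n_fft):
        w = 0.5 - 0.5 * math.cos(2 * math.pi * m / n_fft)  # periodic Hann window
        acc += w * signal[start + m] * cmath.exp(-2j * math.pi * k * m / n_fft)
    return acc

def wrap(phi):
    """Wrap an angle to its principal value."""
    return math.atan2(math.sin(phi), math.cos(phi))

# Assumed analysis parameters (illustrative only).
n_fft, hop, n_frames = 256, 64, 5
nu = 0.1                  # normalized frequency in cycles per sample, assumed known
k = round(nu * n_fft)     # nearest frequency bin to the partial

# A stationary real sinusoid, long enough to cover all frames.
signal = [math.cos(2 * math.pi * nu * n + 0.3)
          for n in range(n_fft + (n_frames - 1) * hop)]

# Measured STFT phases at bin k for consecutive frames.
phases = [cmath.phase(stft_bin(signal, t * hop, n_fft, k)) for t in range(n_frames)]

# Linear unwrapping: propagate the first frame's phase forward in time.
predicted = [phases[0]]
for t in range(1, n_frames):
    predicted.append(wrap(predicted[-1] + 2 * math.pi * nu * hop))

# Largest wrapped deviation between measured and propagated phases.
max_error = max(abs(wrap(p - q)) for p, q in zip(phases, predicted))
```

For a stationary sinusoid the propagated phases should closely match the measured ones, so `max_error` stays small. Non-stationary signals such as vibratos break the fixed-frequency assumption, which is why the abstract describes estimating instantaneous frequencies locally.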

