Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure

doi:10.1109/89.943339

Journal ArticleDOI

Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure

Der-Jenq Liu, +1 more

- 01 Sep 2001 -

IEEE Transactions on Speech and Audio Pr...

- Vol. 9, Iss: 6, pp 609-621

Chats0

TLDR

A pitch measure to detect the harmonic characteristics of voiced sounds on the spectrum of a speech signal and a fast adaptive representation (FAR) algorithm, which reduces the computation complexity of the original algorithm by 50%.

Abstract:

In this paper, we propose a new scheme to analyze the spectral structure of speech signals for fundamental frequency estimation. First, we propose a pitch measure to detect the harmonic characteristics of voiced sounds on the spectrum of a speech signal. This measure utilizes the properties that there are distinct impulses located at the positions of fundamental frequency and its harmonics, and the energy of voiced sound is dominated by the energy of these distinct harmonic impulses. The spectrum can be obtained by the fast Fourier transform (FFT) however, it may be destroyed when the speech is interfered with by additive noise. To enhance the robustness of the proposed scheme in noisy environments, we apply the joint time-frequency analysis (JTFA) technique to obtain the adaptive representation of the spectrum of speech signals. The adaptive representation can accurately extract important harmonic structure of noisy speech signals at the expense of high computation cost. To solve this problem, we further propose a fast adaptive representation (FAR) algorithm, which reduces the computation complexity of the original algorithm by 50%. The performance of the proposed fundamental-frequency estimation scheme is evaluated on a large database with or without additive noise. The performance is compared to that of other approaches on the same database. The experimental results show that the proposed scheme performs well on clean speech and is robust in noisy environments.

Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure

Citations

A multipitch tracking algorithm for noisy speech

A spectral/temporal method for robust fundamental frequency tracking

Speech analysis

Robust and accurate fundamental frequency estimation based on dominant harmonic components

A method for fundamental frequency estimation and voicing decision: Application to infant utterances recorded in real acoustical environments

References

Discrete-Time Signal Processing

Matching pursuits with time-frequency dictionaries

Fundamentals of speech recognition

Discrete-Time Processing of Speech Signals

Speech Analysis, Synthesis and Perception

Related Papers (5)

YIN, a fundamental frequency estimator for speech and music

A comparative performance study of several pitch detection algorithms

Pitch Determination of Speech Signals

Cepstrum Pitch Determination

On the use of autocorrelation analysis for pitch detection