scispace - formally typeset
Patent

Fast frequency-domain pitch estimation

Reads0
Chats0
TLDR
In this paper, a first transform of the signal to a frequency domain over a first time interval (42) and a second transform over a second time interval, which contains the first-time interval, is used to estimate the pitch frequency of an audio signal.
Abstract
A method for estimating a pitch frequency of an audio signal includes computing a first transform of the signal to a frequency domain over a first time interval (42), and computing a second transform of the signal of the frequency domain over a second time interval (44), which contains the first time interval. A line spectrum of the signal is found, based on the first and second transforms, the spectrum including spectral lines having respective line amplitudes and line frequencies. A utility function (130) that is periodic in the frequencies of the lines in the spectrum is then computed. This function is indicative (158), for each candidate pitch frequency in a given pitch frequency range, of a compatibility of the spectrum with the candidate pitch frequency. The pitch frequency of the speech signal is estimated responsive to the utility function (176, 178).

read more

Citations
More filters
Patent

System for Suppressing Wind Noise

TL;DR: In this paper, a voice enhancement system includes a noise detector and a noise attenuator, which detects a wind buffet and a continuous noise by modeling the wind buffet to improve the intelligibility of an unvoiced, a fully voiced or a mixed voice segment.
Patent

Method and apparatus for suppressing wind noise

TL;DR: In this paper, a method, apparatus, and computer program to selectively suppress wind noise while preserving narrow-band signals in acoustic data is presented, which overcomes prior art limitations that require more than one microphone and an independent measurement of wind speed.
PatentDOI

Feature-domain concatenative speech synthesis

TL;DR: In this article, the spectral envelopes are integrated over a plurality of window functions in a frequency domain so as to determine elements of feature vectors corresponding to the speech segments, and an output speech signal is reconstructed by concatenating the feature vector corresponding to a sequence of speech segments.
Patent

Speech end-pointer

TL;DR: A rule-based end-pointer as discussed by the authors isolates spoken utterances contained within an audio stream from background noise and non-speech transients, and includes a plurality of rules to determine the beginning and/or end of a spoken utterance based on various speech characteristics.
Patent

System for suppressing rain noise

TL;DR: In this article, a voice enhancement system includes a noise detector and a noise attenuator to improve the intelligibility of an unvoiced, a fully voiced, or a mixed voice segment.
References
More filters
Journal ArticleDOI

Speech analysis/Synthesis based on a sinusoidal representation

TL;DR: A sinusoidal model for the speech waveform is used to develop a new analysis/synthesis technique that is characterized by the amplitudes, frequencies, and phases of the component sine waves, which forms the basis for new approaches to the problems of speech transformations including time-scale and pitch-scale modification, and midrate speech coding.
Journal ArticleDOI

Period Histogram and Product Spectrum: New Methods for Fundamental‐Frequency Measurement

TL;DR: Several methods of fundamental frequency and period measurement, based on these concepts, are described and the results of computer simulations and analog instrumentations indicate that these new methods compare favorably with, and in some cases exceed, the capabilities of cepstrum analysis.
Journal ArticleDOI

Super resolution pitch determination of speech signals

TL;DR: Based on a new similarity model for the voice excitation process, a novel pitch determination procedure is derived that has infinite (super) resolution, better accuracy than the difference limen for F/sub 0/, robustness to noise, reliability, and modest computational complexity.
Patent

Linear predictive speech encoding systems with efficient combination pitch coefficients computation

TL;DR: In this paper, method and system aspects for linear predictive speech encoding are disclosed, which include the definition of an error function, the computation of an optimal vector of continuous pitch coefficients together with an optimal pitch, and the weighted vector quantization of the continuous pitch coefficient.