scispace - formally typeset
Search or ask a question

Showing papers by "Masayuki Nishiguchi published in 2003"


PatentDOI
TL;DR: In this article, a feature extraction unit extracts sound quality featuring quantities, relevant to the sound quality, from the acoustic signals, to quantify sound quality peculiar to a climax, in order to detect a true highlight domain, based on the so calculated score value.
Abstract: A highlight portion is detected to a high accuracy from acoustic signals in say an event, and an index is added to the highlight portion. In an acoustic signal processing apparatus 10 , a candidate domain extraction unit 13 retains a domain, a length of which with short-term amplitudes as calculated by an amplitude calculating unit 11 not being less than an amplitude threshold value is not less than a time threshold value, as a candidate domain. A feature extraction unit 14 extracts sound quality featuring quantities, relevant to the sound quality, from the acoustic signals, to quantify the sound quality peculiar to a climax. A candidate domain evaluating unit 15 calculates a score value, indicating the degree of the climax, using featuring quantities relevant to the amplitude or the sound quality for each candidate domain, in order to detect a true highlight domain, based on the so calculated score value. An index generating unit 16 generates and outputs an index including the start and end positions and the score values of the highlight domain.

17 citations


Patent
20 May 2003
TL;DR: In this paper, the authors propose a signal recording/reproducing apparatus for searching for a reproduction position desired by a user from a recorded video signal and/or audio signal and reproducing the signal.
Abstract: This invention is a signal recording/reproducing apparatus (1) for searching for a reproduction position desired by a user from a recorded video signal and/or audio signal and reproducing the signal. The apparatus has a feature vector generating unit (30). When recording a video signal/audio signal, the feature vector generating unit (30) generates a feature vector characterizing the video signal and/or audio signal and also records this feature vector to a recording unit (10). When the recorded video signal/audio signal is reproduced, a registration unit (40) registers feature vectors near a position designated by the user, as bookmarks. When the user designates one of the bookmarks, a coincidence detecting unit (60) searches for a coincident position on the basis of the feature vector of the bookmark and the recorded feature vector, and a reproducing unit (20) starts reproduction of the video signal and/or audio signal at a reproduction position set by a reproduction point setting unit (70) on the basis of the coincident position.

12 citations


Patent
15 Jan 2003
TL;DR: In this article, a speaker frequency detection unit discriminates the speaker based on a feature of the speech waveform decoded to find the frequency of conversation (frequency of occurrence) of the speaker in a preset time interval.
Abstract: A speaker of encoded speech data recorded in a semiconductor storage device in an IC recorder is to be retrieved easily. An information receiving unit 10 in a speaker retrieval apparatus 1 reads out the encoded speech data recorded in a semiconductor storage device 107 in an IC recorder 100. A speech decoding unit 12 decodes the encoded speech data. A speaker frequency detection unit 13 discriminates the speaker based on a feature of the speech waveform decoded to find the frequency of conversation (frequency of occurrence) of the speaker in a preset time interval. A speaker frequency graph displaying unit 14 displays the speaker frequency on a picture as a two-dimensional graph having time and the frequency as two axes. A speech reproducing unit 16 reads out the portion of the encoded speech data corresponding to a time position or a time range specified by a reproducing position input unit 15 based on this two-dimensional graph from the storage device 11 and decodes the read-out data to output the decoded data to a speech outputting unit 17.

4 citations


Patent
27 Jun 2003
TL;DR: In this article, the amplitude, phase and frequency of a sine wave to be generated are calculated on the basis of feature quantity s 1 delivered to feature quantity detecting means (2), and are sent to initialization means (3 ).
Abstract: Amplitude, phase and frequency of a sine wave to be generated are calculated on the basis of feature quantity s 1 delivered to feature quantity detecting means ( 2 ), and are sent to initialization means ( 3 ). The initialization means ( 3 ) calculates first two points of the sine wave to send the points thus calculated to oscillator (sine wave generating means) ( 4 ) as initial value s 4 . The oscillator ( 4 ) sequentially calculates values of respective sample points of waveform by using recurrence formula in accordance with initial value or values instructed from the initialization means ( 3 ) to thereby generate a sine wave signal. Thus, sine wave generation is performed without performing modulo-addressing.

3 citations


Patent
19 Feb 2003
TL;DR: In this article, the alpha-parameters are converted by an alpha -parameter to LSP converting circuit 13 into linear spectral pair (LSP) parameters and a vector of these LSP parameters is vector-quantized by a quantizer.
Abstract: Foe executing the code excitation linear prediction (CELP) coding, for example, alpha -parameters are taken out from the input speech signal by a linear prediction coding (LPC) analysis circuit 12. The alpha -parameters are then converted by an alpha -parameter to LSP converting circuit 13 into linear spectral pair (LSP) parameters and a vector of these line spectral pair (LSP) parameters is vector-quantized by a quantizer 14. The changeover switch 16 is controlled depending upon the pitch value detected by a pitch detection circuit 22 for selecting and using one of the codebook 15M for male voice and the codebook 15F for female voice for improving quantization characteristics without increasing the transmission bit rate.

2 citations


Patent
27 Jun 2003
TL;DR: In this article, the amplitude, phase and frequency of a sine wave to be generated are calculated on the basis of feature quantity s 1 delivered to feature quantity detecting means (2), and are sent to initialization means (3 ).
Abstract: Amplitude, phase and frequency of a sine wave to be generated are calculated on the basis of feature quantity s 1 delivered to feature quantity detecting means ( 2 ), and are sent to initialization means ( 3 ). The initialization means ( 3 ) calculates first two points of the sine wave to send the points thus calculated to oscillator (sine wave generating means) ( 4 ) as initial value s 4 . The oscillator ( 4 ) sequentially calculates values of respective sample points of waveform by using recurrence formula in accordance with initial value or values instructed from the initialization means ( 3 ) to thereby generate a sine wave signal. Thus, sine wave generation is performed without performing modulo-addressing.