Gerhard Stenzel
Researcher at IBM
Publications - 15
Citations - 623
Gerhard Stenzel is an academic researcher from IBM. The author has contributed to research on the topics Identifier and Audio mining. The author has an h-index of 10 and has co-authored 15 publications receiving 623 citations.
Papers
Patent
Method and system for the automatic generation of multi-lingual synchronized sub-titles for audiovisual data
TL;DR: This patent presents a method and system for computerized synchronization of an audio stream that has a first synchronized textual representation usable for subtitling the audio stream in the original language.
Patent
Method and apparatus for linking representation and realization data
TL;DR: This patent provides a method and apparatus for creating links between a representation and a realization (e.g., text data and corresponding audio data) by combining a time-stamped version of the representation, generated from the realization, with structural information from the representation.
Patent
Method and system for generating a characteristic identifier for digital data and for detecting identical digital data
TL;DR: In this patent, a characteristic identifier is generated for digital data. The identifier is used to detect identical digital data or to determine inexact copies of digital data, and it can be used to establish automated processes for finding potential unauthorized copies of audio data.
Patent
A method and system for the automatic detection of similar or identical segments in audio recordings
TL;DR: In this patent, identical or similar audio recordings, or segments of audio recordings, are identified by digitizing at least a first audio segment and at least a second audio segment of the audio streams.
Patent
Method and system for the automatic segmentation of an audio stream into semantic or syntactic units
TL;DR: In this patent, a digitized speech signal is input to an F0 (fundamental frequency) processor that computes continuous F0 data from the speech signal, and prosodic features are then computed.