Gerhard Stenzel

Researcher at IBM

Publications -  15
Citations -  623

Gerhard Stenzel is an academic researcher at IBM. The author has contributed to research on the topics Identifier and Audio mining. The author has an h-index of 10 and has co-authored 15 publications, which have received 623 citations.

Papers
Patent

Method and system for the automatic generation of multi-lingual synchronized sub-titles for audiovisual data

TL;DR: This patent presents a method and system for computerized synchronization of an audio stream that has a first synchronized textual representation usable for subtitling the audio stream in the original language.
Patent

Method and apparatus for linking representation and realization data

TL;DR: This patent provides a method and apparatus for creating links between a representation and a realization (e.g., text data and corresponding audio data) by combining a time-stamped version of the representation, generated from the realization, with structural information from the representation.
Patent

Method and system for generating a characteristic identifier for digital data and for detecting identical digital data

TL;DR: This patent describes generating a characteristic identifier for digital data. The identifier is used to detect identical digital data or to identify inexact copies of digital data, and it can be used to establish automated processes for finding potentially unauthorized copies of audio data.
Patent

A method and system for the automatic detection of similar or identical segments in audio recordings

TL;DR: This patent describes identifying identical or similar audio recordings, or segments of audio recordings, by digitizing at least a first audio segment and at least a second audio segment of the audio streams.
Patent, DOI

Method and system for the automatic segmentation of an audio stream into semantic or syntactic units

TL;DR: In this patent, a digitized speech signal is input to an F0 (fundamental frequency) processor that computes continuous F0 data from the speech signal, and prosodic features are computed.