scispace - formally typeset
J

Jörg-Hendrik Bach

Researcher at University of Oldenburg

Publications -  15
Citations -  124

Jörg-Hendrik Bach is an academic researcher from University of Oldenburg. The author has contributed to research in topics: Voice activity detection & Spectrogram. The author has an hindex of 6, co-authored 15 publications receiving 107 citations.

Papers
More filters
Journal ArticleDOI

Robust speech detection in real acoustic backgrounds with perceptually motivated features

TL;DR: The best backgrounds in terms of generalisation capabilities are found to be backgrounds in which some component of speech is present, which corroborates the hypothesis that the AMS features provide a decomposition of signals which is by itself very suitable for training very general speech/nonspeech detectors.
Proceedings ArticleDOI

Modulation-based detection of speech in real background noise: Generalization to novel background classes

TL;DR: The results show that generalization to novel background classes with AMS features yields better performance in 84% of investigated situations, corresponding to an SNR benefit of about 10 dB compared to mel-frequency cepstral coefficient (MFCC) features.
Proceedings ArticleDOI

Detection of speech embedded in real acoustic background based on amplitude modulation spectrogram features.

TL;DR: Results show that reliable detection of speech can be performed with less than 10 optimally selected modulation features, the most important ones are located in the modulation frequency range below 10 Hz.
Book ChapterDOI

Object category detection using audio-visual cues

TL;DR: A multimodal approach to object category detection, using audio and visual information, which is modeled on biologically motivated spectral features via a discriminative classifier and a state of the art part based model.
Proceedings Article

Detecting novel objects in acoustic scenes through classifier incongruence.

TL;DR: This study applies a new generic framework for the detection and interpretation of disagreement (“incongruence”) between different classifiers to the problem of detecting novel acoustic objects in an office environment and yields approximately 90% hit rate for novel events.