scispace - formally typeset
Open AccessProceedings Article

Data Collection in Real Acoustical Environments for Sound Scene Understanding and Hands-Free Speech Recognition

Reads0
Chats0
TLDR
EUROSPEECH1999: the 6th European Conference on Speech Communication and Techinology, September 5-9, 1999, Budapest, Hungary.
Abstract
EUROSPEECH1999: the 6th European Conference on Speech Communication and Techinology, September 5-9, 1999, Budapest, Hungary.

read more

Citations
More filters
Journal ArticleDOI

Robust sound event classification using deep neural networks

TL;DR: A sound event classification framework is outlined that compares auditory image front end features with spectrogram image-based frontEnd features, using support vector machine and deep neural network classifiers, and is shown to compare very well with current state-of-the-art classification techniques.
Proceedings ArticleDOI

Robust sound event recognition using convolutional neural networks

TL;DR: This work proposes novel features derived from spectrogram energy triggering, allied with the powerful classification capabilities of a convolutional neural network (CNN), which demonstrates excellent performance under noise-corrupted conditions when compared against state-of-the-art approaches on standard evaluation tasks.
Journal ArticleDOI

AENet: Learning Deep Audio Features for Video Analysis

TL;DR: In this article, the authors proposed a new deep network for audio event recognition, called AENet, which uses a convolutional neural network (CNN) operating on a large temporal input.
Proceedings ArticleDOI

Deep Neural Network based learning and transferring mid-level audio features for acoustic scene classification

TL;DR: DNN based transfer learning is proposed for Acoustic Scene Classification by exploiting VOC DNN's ability of learning beyond its pre-trained environments and its improved performance is verified by comparing it to prominent conventional methods.
Journal ArticleDOI

Continuous robust sound event classification using time-frequency features and deep learning

TL;DR: This paper proposes and evaluates a novel Bayesian-inspired front end for the segmentation and detection of continuous sound recordings prior to classification, and benchmarks several high performing isolated sound classifiers to operate with continuous sound data by incorporating an energy-based event detection front end.
References
More filters
Proceedings Article

Use of different microphone array configurations for hands-free speech recognition in noisy and reverberant environment.

TL;DR: In this work hands-free continuous speech recognition based on microphone arrays is investigated and HMM adaptation was used to realign the recognizer acoustic modeling to the given acoustic condition.
Proceedings Article

Recent speech database projects in Japan.

Related Papers (5)