Data Collection in Real Acoustical Environments for Sound Scene Understanding and Hands-Free Speech Recognition

Open Access Proceedings Article

TLDR

EUROSPEECH 1999: the 6th European Conference on Speech Communication and Technology, September 5-9, 1999, Budapest, Hungary.

Citations

Journal Article

Robust sound event classification using deep neural networks

Ian McLoughlin, +4 more

- 01 Mar 2015 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A sound event classification framework is outlined that compares auditory image front end features with spectrogram image-based front end features, using support vector machine and deep neural network classifiers, and is shown to compare very well with current state-of-the-art classification techniques.

Proceedings Article

Robust sound event recognition using convolutional neural networks

Haomin Zhang, +2 more

TL;DR: This work proposes novel features derived from spectrogram energy triggering, allied with the powerful classification capabilities of a convolutional neural network (CNN), which demonstrates excellent performance under noise-corrupted conditions when compared against state-of-the-art approaches on standard evaluation tasks.

Journal Article

AENet: Learning Deep Audio Features for Video Analysis

Naoya Takahashi, +2 more

- 01 Mar 2018 -

IEEE Transactions on Multimedia

TL;DR: In this article, the authors proposed a new deep network for audio event recognition, called AENet, which uses a convolutional neural network (CNN) operating on a large temporal input.

Proceedings Article

Deep Neural Network based learning and transferring mid-level audio features for acoustic scene classification

Seongkyu Mun, +4 more

TL;DR: DNN-based transfer learning is proposed for acoustic scene classification, exploiting the VOC DNN's ability to learn beyond its pre-trained environments; its improved performance is verified by comparison with prominent conventional methods.

Journal Article

Continuous robust sound event classification using time-frequency features and deep learning

Ian McLoughlin, +6 more

- 11 Sep 2017 -

PLOS ONE

TL;DR: This paper proposes and evaluates a novel Bayesian-inspired front end for the segmentation and detection of continuous sound recordings prior to classification, and benchmarks several high performing isolated sound classifiers to operate with continuous sound data by incorporating an energy-based event detection front end.

References

Proceedings Article

Use of different microphone array configurations for hands-free speech recognition in noisy and reverberant environment.

Diego Giuliani, +3 more

TL;DR: This work investigates hands-free continuous speech recognition based on microphone arrays, using HMM adaptation to realign the recognizer's acoustic modeling to the given acoustic condition.

Proceedings Article