Reduction of Computational Cost Using Two-Stage Deep Neural Network for Training for Denoising and Sound Source Identification

doi:10.1007/978-3-319-42007-3_49

Book Chapter

pp. 562-573

TLDR

This paper addresses reducing the computational cost of training a Deep Neural Network (DNN) for sound identification from highly noise-contaminated sound recorded with a microphone array embedded in an Unmanned Aerial Vehicle (UAV), with the aim of detecting people’s voices quickly and over a wide area in disaster situations.

Abstract:

This paper addresses reducing the computational cost of training a Deep Neural Network (DNN), in particular for sound identification from highly noise-contaminated sound recorded with a microphone array embedded in an Unmanned Aerial Vehicle (UAV), with the aim of detecting people’s voices quickly and over a wide area in disaster situations. End-to-end training is known to achieve high performance because it uses a huge, highly non-linear neural network trained on a large amount of raw input signals without preprocessing. Its computational cost is high, however, owing to the complexity of the network. We therefore propose two-stage DNN training using two separately trained networks: one for denoising of sound sources and one for sound source identification. Because the huge network is divided into two smaller networks, the complexity of each network is expected to decrease, and each can focus on a specific model, either denoising or identification. This results in faster convergence and lower computational cost in DNN training. Preliminary results showed that the proposed two-stage network needed only 71 % of the training time of end-to-end training while maintaining sound source identification accuracy, using noisy acoustic signals recorded with an 8-channel circular microphone array embedded in a UAV.
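
The two-stage idea described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the synthetic data, the layer sizes, the learning rate, the choice to train the identification network on clean features, and all function names (`init_mlp`, `forward`, `train`) are assumptions made purely for illustration. It trains a small denoising network and a small identification network separately with NumPy, then chains them at inference time.

```python
import numpy as np

rng = np.random.default_rng(0)

def init_mlp(sizes):
    """Create (weight, bias) pairs for a small fully connected network."""
    return [(rng.normal(0.0, np.sqrt(2.0 / m), (m, n)), np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def forward(params, x):
    """Return the activations of every layer (ReLU hidden layers, linear output)."""
    acts = [x]
    for i, (W, b) in enumerate(params):
        z = acts[-1] @ W + b
        acts.append(np.maximum(z, 0.0) if i < len(params) - 1 else z)
    return acts

def train(params, x, target, out_grad, epochs=300, lr=0.01):
    """Full-batch gradient descent; out_grad gives dLoss/dOutput per sample."""
    for _ in range(epochs):
        acts = forward(params, x)
        delta = out_grad(acts[-1], target) / len(x)
        for i in reversed(range(len(params))):
            W, b = params[i]
            grad_W, grad_b = acts[i].T @ delta, delta.sum(axis=0)
            if i > 0:  # backpropagate through the ReLU of the previous layer
                delta = (delta @ W.T) * (acts[i] > 0)
            params[i] = (W - lr * grad_W, b - lr * grad_b)
    return params

def mse_grad(out, target):
    return out - target  # gradient of 0.5 * squared error

def softmax_ce_grad(logits, onehot):
    e = np.exp(logits - logits.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True) - onehot  # softmax minus one-hot target

# Hypothetical toy data: 3 "sources", each with an 8-dimensional clean feature prototype.
n_classes, dim, n = 3, 8, 300
labels = rng.integers(0, n_classes, n)
prototypes = rng.normal(0.0, 1.0, (n_classes, dim))
clean = prototypes[labels] + 0.05 * rng.normal(size=(n, dim))
noisy = clean + 0.5 * rng.normal(size=(n, dim))
onehot = np.eye(n_classes)[labels]

# Stage 1 (assumed setup): denoising network trained alone on (noisy -> clean) pairs.
denoiser = init_mlp([dim, 16, dim])
mse_before = np.mean((forward(denoiser, noisy)[-1] - clean) ** 2)
train(denoiser, noisy, clean, mse_grad)
mse_after = np.mean((forward(denoiser, noisy)[-1] - clean) ** 2)

# Stage 2 (assumed setup): identification network trained alone on (clean -> label) pairs.
classifier = init_mlp([dim, 16, n_classes])
train(classifier, clean, onehot, softmax_ce_grad)

# Inference: chain the two separately trained networks.
denoised = forward(denoiser, noisy)[-1]
predicted = forward(classifier, denoised)[-1].argmax(axis=1)
accuracy = np.mean(predicted == labels)
print(f"denoising MSE {mse_before:.3f} -> {mse_after:.3f}, accuracy {accuracy:.2f}")
```

Each stage here optimizes its own simple objective (squared error for denoising, softmax cross-entropy for identification), which mirrors the abstract's argument that two smaller, specialized networks may be easier to train than one large end-to-end network. The sketch does not attempt to reproduce the reported training-time figure.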
