Proceedings ArticleDOI
Voice Extraction from Background Noise using Filter Bank Analysis for Voice Communication Applications
TLDR
In this paper, the authors applied prevalent feature extraction techniques to extract the speech signal with the trade off of complexity, compression ratio, and compression ratio for the application of voice communication.Abstract:
Automatic Speech Recognition plays an evident role in extracting the voice signal in the noisy background. The reduction of noise in the signal is susceptible to the information which is to be transmitted since not all the information is emphasized. This leads to the deterioration in the transmitted information and paved furtherance for automatic speech recognition. Prevalent feature extraction techniques are applied to extract the speech signal with the trade off of complexity, compression ratio. For the application of voice communication, filter bank analysis is applied to extract the voice signals in the noisy environment. This work emphasized on the attributes of the perceptual quality of Loudness, Pitch Intensity, Timing. Band pass filtering provides reliable extraction of the voice signal features in the noisy environment. The power distribution of the extracted signals for the selected audio signal with the length of more than 20 seconds wave file with a sampling rate of 16 Khz along with the background noises has been plotted and its respective spectrogram also been plotted.read more
Citations
More filters
Journal ArticleDOI
G-Cocktail: An Algorithm to Address Cocktail Party Problem of Gujarati Language Using Cat Boost
TL;DR: The proposed algorithm, G- Cocktail, addresses the Cocktail party problem of Indian language, Gujarati by utilizing the power of CatBoost algorithm to classify and identify the voice.
References
More filters
Journal ArticleDOI
On the Improvement of Singing Voice Separation for Monaural Recordings Using the MIR-1K Dataset
TL;DR: This paper has constructed a corpus called MIR-1K (multimedia information retrieval lab, 1000 song clips), where all singing voices and music accompaniments were recorded separately, and enhanced the performance of separating voiced singing via a spectral subtraction method.
Proceedings ArticleDOI
The beat spectrum: a new approach to rhythm analysis
Jonathan Foote,S. Uchihashi +1 more
TL;DR: The beat spectrum is a measure of acoustic self-similarity versus lag time, computed from a representation of spectrally similarity, which has a variety of applications, including music retrieval by similarity and automatically generating music videos.
Journal ArticleDOI
Adaptation of Bayesian Models for Single-Channel Source Separation and its Application to Voice/Music Separation in Popular Songs
TL;DR: A general formalism for source model adaptation which is expressed in the framework of Bayesian models is introduced and results show that an adaptation scheme can improve consistently and significantly the separation performance in comparison with nonadapted models.
Journal ArticleDOI
Separation of Singing Voice From Music Accompaniment for Monaural Recordings
Yipeng Li,DeLiang Wang +1 more
TL;DR: This work proposes a system to separate singing voice from music accompaniment for monaural recordings and quantitative results show that the system performs the separation task successfully.
Journal ArticleDOI
An Automatic Tamil Speech Recognition system by using Bidirectional Recurrent Neural Network with Self-Organizing Map
TL;DR: Bidirectional recurrent neural network (BRNN) with self-organizing map (SOM)-based classification scheme is suggested for Tamil speech recognition and demonstrates that the suggested conspire accomplished preferable outcomes looked at over exist deep neural network–hidden Markov model algorithm regarding signal-to-noise ratio, classification accuracy, and mean square error.
Related Papers (5)
Performance Evaluation of Silence-Feature Normalization Model using Cepstrum Features of Noise Signals
Sang-Yeob Oh,Kyung-Yong Chung +1 more