Proceedings ArticleDOI
Speech recognition using MFCC and DTW
Bhadragiri Jagan Mohan,N. Ramesh Babu +1 more
- pp 1-4
Reads0
Chats0
TLDR
In this paper, an implementation of speech recognition system in MATLAB environment is explained, where two algorithms, Mel-Frequency Cepstral Coefficients (MFCC) and Dynamic Time Wrapping (DTW) are adapted for feature extraction and pattern matching respectively.Abstract:
Speech recognition has wide range of applications in security systems, healthcare, telephony military, and equipment designed for handicapped. Speech is continuous varying signal. So, proper digital processing algorithm has to be selected for automatic speech recognition system. To obtain required information from the speech sample, features have to be extracted from it. For recognition purpose the feature are analyzed to make decisions. In this paper implementation of Speech recognition system in MATLAB environment is explained. Mel-Frequency Cepstral Coefficients (MFCC) and Dynamic Time Wrapping (DTW) are two algorithms adapted for feature extraction and pattern matching respectively. Results are obtained by one time training and continuous testing phases.read more
Citations
More filters
Journal Article
Analysis of Voice Recognition Algorithms using MATLAB
TL;DR: From the simulation results, the Wiener Filter algorithm outperform the other four algorithms in terms of all measure of performance, and power requirement with the moderate complexity of the algorithm and its prospective implementation as a hardware.
Proceedings ArticleDOI
Mimicking voice recognition using MFCC-GMM framework
M V Unnikrishnan,Rajeev Rajan +1 more
TL;DR: Mel frequency cepstral coefficients (MFCC) are effectively utilized for evaluating the quality of text independent mimicked speech and show the promise of MFCC in evaluating voice mimicking performance.
Proceedings ArticleDOI
Speech Recognition in a Multi-speaker Environment by Using Hidden Markov Model and Mel-frequency Approach
Junzo Watada,Hanayuki +1 more
TL;DR: In this research, a Hidden Markov Model approach is proposed as an emotion classifier to carry out testing phases using speech data to recognize and analyze human voice in a multi-speaker environment from the meeting or indirect conversation.
Proceedings ArticleDOI
Gamification of Mobile-based Japanese Language Shadowing
TL;DR: The evaluation results showed that gamification is preferred and have significant Effect Size on student's motivation, when compared to the conventional shadowing method, as well as have positive effects on overall learning experience.
Journal ArticleDOI
FPGA-based implementation of speech recognition for robocar control using MFCC
TL;DR: The achievement of logic design in this research proven with a comparison between the Matlab computation and Xilinx simulation enables to facilitate the researchers to continue its implementation to FPGA hardware.
References
More filters
Journal ArticleDOI
A tutorial on hidden Markov models and selected applications in speech recognition
TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.
Posted Content
Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques
TL;DR: This paper presents the viability of MFCC to extract features and DTW to compare the test patterns and explains why the alignment is important to produce the better performance.
Speaker identification using mel frequency cepstral coefficients
Rashidul Hasan,Saifur Rahman +1 more
TL;DR: This paper presents a security system based on speaker identification based onMel frequency Cepstral Coefficients{MFCCs} have been used for feature extraction and vector quantization technique is used to minimize the amount of data to be handled.
Voice command recognition system based on mfcc and dtw
TL;DR: The feasibility of MFCC to extract features and DTW to compare the test patterns is presented and the non linear sequence alignment known as Dynamic Time Warping introduced by Sakoe Chiba has been used as features matching techniques.
Recognition of Isolated Words using Features based on LPC, MFCC, ZCR and STE, with Neural Network Classifiers
Bishnu Prasad Das,Ranjan Parekh +1 more
TL;DR: An accuracy of 85% is obtained by the combination of features, when the proposed approach is tested using a dataset of 280 speech samples, which is more than those obtained by using the features singly.