Proceedings ArticleDOI

Speech recognition using MFCC and DTW

TLDR
In this paper, an implementation of a speech recognition system in the MATLAB environment is explained, in which two algorithms, Mel-Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW), are adopted for feature extraction and pattern matching, respectively.
Abstract
Speech recognition has a wide range of applications in security systems, healthcare, telephony, the military, and equipment designed for the handicapped. Speech is a continuously varying signal, so a suitable digital signal processing algorithm has to be selected for an automatic speech recognition system. To obtain the required information from a speech sample, features have to be extracted from it; for recognition, these features are then analyzed to make decisions. This paper explains the implementation of a speech recognition system in the MATLAB environment. Mel-Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) are the two algorithms adopted for feature extraction and pattern matching, respectively. Results are obtained from a one-time training phase followed by continuous testing.
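As a rough illustration of the MFCC-plus-DTW pipeline described above, the sketch below extracts MFCC features with the librosa package and matches a test utterance against stored word templates with a plain textbook DTW recursion. It is a minimal Python sketch, not the authors' MATLAB implementation; the file names and word list are hypothetical.

import numpy as np
import librosa

def extract_mfcc(path, n_mfcc=13):
    # Load a speech sample and return its MFCCs as a (frames x coefficients) matrix.
    y, sr = librosa.load(path, sr=None)
    return librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc).T

def dtw_distance(a, b):
    # Accumulated DTW cost between two MFCC sequences.
    n, m = len(a), len(b)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = np.linalg.norm(a[i - 1] - b[j - 1])  # local Euclidean distance
            D[i, j] = cost + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    return D[n, m]

# One-time training: store one MFCC template per word (file names are hypothetical).
templates = {w: extract_mfcc("train/" + w + ".wav") for w in ["yes", "no", "stop"]}
# Testing: the word whose template gives the smallest warped distance is recognized.
test = extract_mfcc("test/utterance.wav")
print(min(templates, key=lambda w: dtw_distance(templates[w], test)))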


Citations
Proceedings ArticleDOI

Comparison of Three Auditory Frequency Scales in Feature Extraction on Myanmar Digits Recognition

TL;DR: This paper investigates two alternative auditory frequency scales, the Bark and Equivalent Rectangular Bandwidth (ERB) scales, which achieved better performance than the Mel scale.
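For reference, the three scales compared in that paper are commonly approximated by the closed-form conversions below (O'Shaughnessy's Mel formula, Zwicker's Bark formula, and the Glasberg-Moore ERB-rate formula); this is a general-purpose sketch, not code from the cited work.

import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def hz_to_bark(f):
    return 13.0 * np.arctan(0.00076 * f) + 3.5 * np.arctan((f / 7500.0) ** 2)

def hz_to_erb_rate(f):
    return 21.4 * np.log10(1.0 + 0.00437 * f)

# Example: how 100 Hz, 1 kHz, and 4 kHz map onto each auditory scale.
f = np.array([100.0, 1000.0, 4000.0])
print(hz_to_mel(f), hz_to_bark(f), hz_to_erb_rate(f))
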
Journal ArticleDOI

Voice-Print Recognition System Using Python And Machine Learning With IBM Watson

TL;DR: This system uses a Support Vector Machine (SVM); the SVC classifier is used to separate the dataset and obtain the required result so that the user can be authenticated to access the machine.
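A minimal sketch of that idea, assuming per-utterance mean MFCC vectors as features and scikit-learn's SVC as the classifier, is given below; the feature choice, file names, and speaker labels are illustrative assumptions rather than the cited system's exact pipeline.

import numpy as np
import librosa
from sklearn.svm import SVC

def utterance_features(path, n_mfcc=20):
    # Summarize an utterance as the mean of its MFCC frames (an assumed feature choice).
    y, sr = librosa.load(path, sr=None)
    return librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc).mean(axis=1)

# Hypothetical enrollment data: a few labelled recordings per speaker.
files = ["alice_01.wav", "alice_02.wav", "bob_01.wav", "bob_02.wav"]
speakers = ["alice", "alice", "bob", "bob"]

X = np.stack([utterance_features(f) for f in files])
clf = SVC(kernel="rbf").fit(X, speakers)

# An incoming recording is attributed to the closest enrolled speaker.
print(clf.predict(utterance_features("incoming.wav").reshape(1, -1))[0])
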
Book ChapterDOI

Evaluating the Effectiveness of Inhaler Use Among COPD Patients via Recording and Processing Cough and Breath Sounds from Smartphones

TL;DR: In this article, a machine learning algorithm operating on Mel-frequency Cepstral Coefficients of patients' cough and breath sounds was proposed to detect the effectiveness of inhaler usage.
Journal ArticleDOI

Speech signal analysis of Alzheimer's diseases in Farsi using auditory model system

TL;DR: Farsi speech signals were analyzed using the auditory model system (AMS) in order to recognize Alzheimer's disease (AD), demonstrating the applicability of the proposed algorithm for non-invasive, low-cost recognition of Alzheimer's using only a few features extracted from the speech signal.
References
Journal ArticleDOI

A tutorial on hidden Markov models and selected applications in speech recognition

TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.
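To make the evaluation problem from the tutorial concrete, the sketch below implements the forward recursion for a small discrete HMM in plain NumPy; the transition, emission, and initial probabilities are toy values, not taken from the paper.

import numpy as np

def forward(obs, A, B, pi):
    # P(observation sequence | model) via the forward algorithm.
    alpha = pi * B[:, obs[0]]            # initialization
    for o in obs[1:]:
        alpha = (alpha @ A) * B[:, o]    # induction: propagate and re-weight
    return alpha.sum()                   # termination

A  = np.array([[0.7, 0.3], [0.4, 0.6]])            # state transitions
B  = np.array([[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]])  # discrete emission probabilities
pi = np.array([0.6, 0.4])                          # initial state distribution
print(forward([0, 1, 2], A, B, pi))
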
Posted Content

Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques

TL;DR: This paper presents the viability of MFCC for feature extraction and DTW for comparing test patterns, and explains why the alignment is important for achieving better performance.

Speaker identification using mel frequency cepstral coefficients

TL;DR: This paper presents a security system based on speaker identification; Mel-frequency cepstral coefficients (MFCCs) are used for feature extraction, and a vector quantization technique is used to minimize the amount of data to be handled.
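As an illustration of that vector-quantization step, the sketch below builds a small MFCC codebook per enrolled speaker with scipy's k-means and scores a test utterance by its average quantization distortion; the codebook size and file names are assumptions made for the example.

import numpy as np
import librosa
from scipy.cluster.vq import kmeans, vq

def mfcc_frames(path, n_mfcc=13):
    y, sr = librosa.load(path, sr=None)
    return librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc).T.astype(float)

def train_codebook(path, size=32):
    # Compress a speaker's MFCC frames into 'size' representative codewords.
    codebook, _ = kmeans(mfcc_frames(path), size)
    return codebook

def avg_distortion(path, codebook):
    # Mean distance from each test frame to its nearest codeword.
    _, dists = vq(mfcc_frames(path), codebook)
    return dists.mean()

# Hypothetical enrollment files; the speaker with the lowest distortion is identified.
codebooks = {s: train_codebook("enroll_" + s + ".wav") for s in ["alice", "bob"]}
print(min(codebooks, key=lambda s: avg_distortion("unknown.wav", codebooks[s])))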

Voice command recognition system based on MFCC and DTW

TL;DR: The feasibility of MFCC for feature extraction and DTW for comparing test patterns is presented; the nonlinear sequence alignment known as Dynamic Time Warping, introduced by Sakoe and Chiba, is used as the feature matching technique.
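The Sakoe-Chiba contribution referenced here is the band constraint on the warping path; the sketch below adds such a band to a plain DTW recursion so that cells far from the length-scaled diagonal are never expanded. It is a generic illustration, not the cited system's code.

import numpy as np

def dtw_sakoe_chiba(a, b, radius=10):
    # DTW cost restricted to a Sakoe-Chiba band of half-width 'radius' frames.
    n, m = len(a), len(b)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        center = int(round(i * m / n))   # band follows the length-scaled diagonal
        for j in range(max(1, center - radius), min(m, center + radius) + 1):
            cost = np.linalg.norm(a[i - 1] - b[j - 1])
            D[i, j] = cost + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    return D[n, m]

# Two random MFCC-like sequences of unequal length, just to exercise the band.
print(dtw_sakoe_chiba(np.random.rand(50, 13), np.random.rand(60, 13), radius=8))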

Recognition of Isolated Words using Features based on LPC, MFCC, ZCR and STE, with Neural Network Classifiers

TL;DR: An accuracy of 85% is obtained with the combination of features when the proposed approach is tested on a dataset of 280 speech samples, which is higher than the accuracies obtained using the features individually.
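Of the features listed, the zero-crossing rate (ZCR) and short-time energy (STE) are simple frame-level measures; a minimal sketch is given below, with the frame length and hop chosen as illustrative values rather than the paper's settings.

import numpy as np

def zcr_ste(y, frame_len=400, hop=160):
    # Frame-level zero-crossing rate and short-time energy.
    zcr, ste = [], []
    for start in range(0, len(y) - frame_len + 1, hop):
        frame = y[start:start + frame_len].astype(float)
        zcr.append(np.mean(np.abs(np.diff(np.sign(frame))) > 0))  # fraction of sign changes
        ste.append(np.sum(frame ** 2))                            # frame energy
    return np.array(zcr), np.array(ste)

# Stand-in for one second of 16 kHz speech.
y = np.random.randn(16000)
zcr, ste = zcr_ste(y)
print(zcr[:3], ste[:3])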