scispace - formally typeset
Journal ArticleDOI

A review on speech processing using machine learning paradigm

TLDR
The performance of several machine learning techniques is validated for speech emotion recognition application on Berlin EmoDB database and the broad application areas and challenges in machine learning for speech processing are given.
Abstract
Speech processing plays a crucial role in many signal processing applications, while the last decade has bought gigantic evolution based on machine learning prototype. Speech processing has a close relationship with computer linguistics, human–machine interaction, natural language processing, and psycholinguistics. This review article majorly discusses the feature extraction techniques and machine learning classifiers employed in speech processing and recognition activities. The performance of several machine learning techniques is validated for speech emotion recognition application on Berlin EmoDB database. Further, it gives the broad application areas and challenges in machine learning for speech processing.

read more

Citations
More filters
Journal ArticleDOI

Speech Emotion Recognition Based on Multiple Acoustic Features and Deep Convolutional Neural Network

TL;DR: In this paper , the acoustic feature set based on Mel frequency cepstral coefficients (MFCC), linear prediction Cepstrals coefficients (LPCC), wavelet packet transform (WPT), zero crossing rate (ZCR), spectrum centroid, spectral roll-off, spectral kurtosis, root mean square (RMS), pitch, jitter, and shimmer to improve the feature distinctiveness.
Proceedings ArticleDOI

Neural Style Transfer: Reliving art through Artificial Intelligence

TL;DR: Deep-learning techniques are introduced, which are vital in accomplishing human characteristics and open up a new world of prospects in generating higher quality creative products.
Journal ArticleDOI

Energy efficient clustering routing protocol using novel admission allotment scheme (AAS) based intra-cluster communication for Wireless Sensor Network

TL;DR: Novel Admission Allotment Scheme (AAS) based intra-cluster communication to minimize overheads on the sensor nodes and packet drop and energy efficient Ant Colony Optimization (ACO) is proposed to route the data from CH to base station (BS).
Journal ArticleDOI

Entropy-Argumentative Concept of Computational Phonetic Analysis of Speech Taking into Account Dialect and Individuality of Phonation

TL;DR: In this paper , a mathematical model and methods for computational phonetic analysis of speech with an analytical description of the phenomenon of phonetic fusion is proposed, which allows for determining reliably the individual phonetic alphabet inherent in a person, taking into account their inherent dialect of speech and individual features of phonation, as well as detecting and correcting errors in the recognition of language units.
References
More filters
Journal ArticleDOI

Robust text-independent speaker identification using Gaussian mixture speaker models

TL;DR: The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are effective for modeling speaker identity and is shown to outperform the other speaker modeling techniques on an identical 16 speaker telephone speech task.
Proceedings ArticleDOI

Listen, attend and spell: A neural network for large vocabulary conversational speech recognition

TL;DR: Listen, Attend and Spell (LAS), a neural speech recognizer that transcribes speech utterances directly to characters without pronunciation models, HMMs or other components of traditional speech recognizers is presented.
Proceedings ArticleDOI

A database of German emotional speech.

TL;DR: A database of emotional speech that was evaluated in a perception test regarding the recognisability of emotions and their naturalness and can be accessed by the public via the internet.
Journal ArticleDOI

Speech recognition by machine: A review

TL;DR: This paper provides a review of recent developments in speech recognition research and the concept of sources of knowledge is introduced and the use of knowledge to generate and verify hypotheses is discussed.
Reference BookDOI

Springer Handbook of Acoustics

TL;DR: The Springer Handbook of Acoustics as discussed by the authors is an unparalleled modern handbook reflecting this richly interdisciplinary nature edited by one of the acknowledged masters in the field, Thomas Rossing.
Related Papers (5)