Journal ArticleDOI

Deep-Sparse-Representation-Based Features for Speech Recognition

TLDR
This paper proposes to use a multilevel decomposition (having multiple layers), also known as the deep sparse representation (DSR), to derive a feature representation for speech recognition, and reveals that the representations obtained at different sparse layers of the proposed DSR model carry complementary information.
Abstract
Features derived using sparse representation (SR)-based approaches have been shown to yield promising results for speech recognition tasks. In most of these approaches, the SR corresponding to the speech signal is estimated using a dictionary, which can be either exemplar based or learned. However, a single-level decomposition may not be suitable for the speech signal, as it contains complex hierarchical information about various hidden attributes. In this paper, we propose to use a multilevel decomposition (having multiple layers), also known as the deep sparse representation (DSR), to derive a feature representation for speech recognition. Instead of having a series of sparse layers, the proposed framework employs a dense layer between two sparse layers, which helps in efficient implementation. Our studies reveal that the representations obtained at different sparse layers of the proposed DSR model carry complementary information. Thus, the final feature representation is derived by concatenating the representations obtained at the sparse layers. This results in a more discriminative representation and improves speech recognition performance. Since the concatenation results in a high-dimensional feature, principal component analysis is used to reduce its dimension. Experimental studies demonstrate that the proposed feature outperforms existing features on various speech recognition tasks.
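The pipeline described above (sparse layer, dense layer, second sparse layer, concatenation, PCA) can be sketched in a few lines. The sketch below is a minimal illustration, assuming learned dictionaries with OMP-based sparse coding and a random tanh map standing in for the dense layer; the dictionary sizes, sparsity levels, and dense mapping are illustrative choices, not the authors' configuration.

import numpy as np
from sklearn.decomposition import DictionaryLearning, PCA

# Toy stand-in for frame-level speech features, one row per frame.
rng = np.random.default_rng(0)
X = rng.standard_normal((500, 40))

# Sparse layer 1: learn a dictionary and sparse-code the input with OMP.
layer1 = DictionaryLearning(n_components=64, transform_algorithm="omp",
                            transform_n_nonzero_coefs=5, random_state=0)
S1 = layer1.fit_transform(X)                    # (500, 64) sparse codes

# Dense layer between the two sparse layers (illustrative linear map + tanh).
W = rng.standard_normal((64, 64))
H = np.tanh(S1 @ W)

# Sparse layer 2: sparse-code the dense-layer output.
layer2 = DictionaryLearning(n_components=64, transform_algorithm="omp",
                            transform_n_nonzero_coefs=5, random_state=0)
S2 = layer2.fit_transform(H)

# Concatenate the sparse-layer representations, then reduce with PCA.
features = PCA(n_components=40).fit_transform(np.hstack([S1, S2]))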


Citations
Journal ArticleDOI

A sparse deep belief network with efficient fuzzy learning framework.

TL;DR: As a novel cross-model, SDBFNN combines the advantages of both the pre-training technique and the fuzzy neural network to improve modeling capability, and achieves better performance than existing methods in learning speed, modeling accuracy, and robustness.
Journal ArticleDOI

Multiview Concept Learning Via Deep Matrix Factorization

TL;DR: This work presents the deep multiview concept learning (DMCL) method, which hierarchically factorizes the multiview data and tries to explicitly model consistent and complementary information and capture semantic structures at the highest abstraction level.
Journal ArticleDOI

Weighted discriminative collaborative competitive representation for robust image classification

TL;DR: The proposed WDCCR designs a discriminative and competitive collaborative representation among all the classes by fully considering the class information, and introduces a constraint on the weighted categorical representation coefficients to further enhance the power of the discriminative and competitive representation.
Proceedings ArticleDOI

Deep Multi-View Concept Learning

TL;DR: DMCL performs nonnegative factorization of the data hierarchically, trying to capture semantic structures and explicitly model consistent and complementary information in multi-view data at the highest abstraction level; a block coordinate descent algorithm is developed for its optimization.
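As a rough illustration of hierarchical nonnegative factorization (a generic deep-NMF sketch, not the DMCL model or its block coordinate descent updates), two stacked NMF layers can factorize the data and then the layer-1 representation:

import numpy as np
from sklearn.decomposition import NMF

rng = np.random.default_rng(0)
X = rng.random((100, 50))                  # nonnegative data, one sample per row

# Layer 1: X ~ H1 @ W1 (sklearn returns H1 from transform, W1 as components_).
nmf1 = NMF(n_components=20, init="nndsvda", max_iter=500, random_state=0)
H1 = nmf1.fit_transform(X)

# Layer 2: factorize the layer-1 representation for a more abstract view.
nmf2 = NMF(n_components=5, init="nndsvda", max_iter=500, random_state=0)
H2 = nmf2.fit_transform(H1)                # highest-abstraction representation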
Journal ArticleDOI

Classification of multiclass motor imagery EEG signal using sparsity approach

TL;DR: A sparse-representation-based classification technique is proposed to classify multi-task motor imagery (MI) electroencephalogram data, and the results substantiate that the proposed sparsity approach performs significantly better than existing classifiers.
References
Journal Article

Visualizing Data using t-SNE

TL;DR: A new technique called t-SNE visualizes high-dimensional data by giving each datapoint a location in a two- or three-dimensional map; it is a variation of Stochastic Neighbor Embedding that is much easier to optimize and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
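scikit-learn ships an implementation of this technique; a minimal usage sketch (the dataset and parameters are illustrative):

from sklearn.datasets import load_digits
from sklearn.manifold import TSNE

# Embed 64-dimensional digit images into a 2-D map for visualization.
X, y = load_digits(return_X_y=True)
emb = TSNE(n_components=2, perplexity=30, init="pca",
           random_state=0).fit_transform(X)
print(emb.shape)                           # (1797, 2)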

Statistical learning theory

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of the learning process, the author covers function estimation from small data pools, the application of these estimates to real-life problems, and much more.
Journal ArticleDOI

Robust Face Recognition via Sparse Representation

TL;DR: This work considers the problem of automatically recognizing human faces from frontal views with varying expression and illumination, as well as occlusion and disguise, and proposes a general classification algorithm for (image-based) object recognition based on a sparse representation computed by ℓ1-minimization.
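The classification rule can be sketched by approximating the ℓ1-minimization with a Lasso solver and assigning the class whose training samples best reconstruct the test sample. The helper below is a hypothetical sketch and omits the paper's occlusion handling:

import numpy as np
from sklearn.linear_model import Lasso

def src_classify(A, labels, y, alpha=0.01):
    # A: (n_features, n_train) matrix, one training sample per column;
    # labels: class label per column; y: test sample of length n_features.
    x = Lasso(alpha=alpha, fit_intercept=False, max_iter=10000).fit(A, y).coef_
    residuals = {}
    for c in np.unique(labels):
        xc = np.where(labels == c, x, 0.0)    # keep only class-c coefficients
        residuals[c] = np.linalg.norm(y - A @ xc)
    return min(residuals, key=residuals.get)  # class with smallest residual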
Journal ArticleDOI

Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups

TL;DR: This article provides an overview of progress and represents the shared views of four research groups that have had recent successes in using DNNs for acoustic modeling in speech recognition.
Journal ArticleDOI

K-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation

TL;DR: A novel algorithm for adapting dictionaries to achieve sparse signal representations: the K-SVD algorithm, an iterative method that alternates between sparse coding of the examples based on the current dictionary and updating the dictionary atoms to better fit the data.
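A compact NumPy sketch of the K-SVD alternation described above, using OMP for the sparse coding stage and a rank-1 SVD update per atom (initialization and stopping criteria are simplified relative to the paper):

import numpy as np
from sklearn.linear_model import orthogonal_mp

def ksvd(Y, n_atoms, sparsity, n_iter=10, seed=0):
    # Y: (n_features, n_signals) data matrix, one signal per column.
    rng = np.random.default_rng(seed)
    D = rng.standard_normal((Y.shape[0], n_atoms))
    D /= np.linalg.norm(D, axis=0)                         # unit-norm atoms
    for _ in range(n_iter):
        X = orthogonal_mp(D, Y, n_nonzero_coefs=sparsity)  # sparse coding step
        for k in range(n_atoms):                           # dictionary update step
            users = np.nonzero(X[k])[0]                    # signals using atom k
            if users.size == 0:
                continue
            X[k, users] = 0.0
            E = Y[:, users] - D @ X[:, users]              # error without atom k
            U, s, Vt = np.linalg.svd(E, full_matrices=False)
            D[:, k] = U[:, 0]                              # rank-1 atom update
            X[k, users] = s[0] * Vt[0]                     # matching coefficients
    return D, X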