Topic

TIMIT

About: TIMIT is a research topic. Over the lifetime, 1401 publications have been published within this topic receiving 59888 citations. The topic is also known as: TIMIT Acoustic-Phonetic Continuous Speech Corpus.


Papers
01 Jan 2011
TL;DR: Investigation of a multitask learning (MTL) approach for joint estimation of articulatory features, with and without phoneme classification as a subtask, shows that an MTL MLP can estimate articulatory features compactly and efficiently by learning the inter-feature dependencies through a common hidden-layer representation, irrespective of the number of subtasks.
Abstract: Speech sounds can be characterized by articulatory features. Articulatory features are typically estimated using a set of multilayer perceptrons (MLPs), i.e., a separate MLP is trained for each articulatory feature. In this report, we investigate a multitask learning (MTL) approach for joint estimation of articulatory features, with and without phoneme classification as a subtask. The effect of the number of subtasks in MTL is studied by selecting two different articulatory feature representations. Our studies show that an MTL MLP can estimate articulatory features compactly and efficiently by learning the inter-feature dependencies through a common hidden-layer representation, irrespective of the number of subtasks. Furthermore, adding phoneme classification as a subtask while estimating articulatory features improves both articulatory feature estimation and phoneme recognition. On the TIMIT phoneme recognition task, articulatory feature posterior probabilities obtained by the MTL MLP achieve a phoneme recognition accuracy of 73.8%, while the phoneme posterior probabilities achieve an accuracy of 74.2%.
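As a rough illustration of the shared-hidden-layer idea, the sketch below (plain NumPy; not the authors' code, and the feature groups, label counts, and dimensions are hypothetical) forwards a batch of acoustic frames through one common hidden layer that feeds a separate softmax head per subtask:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions: 39-dim acoustic frames, one shared hidden layer,
# two articulatory-feature subtasks plus phoneme classification as a subtask.
n_in, n_hidden = 39, 64
task_sizes = {"manner": 6, "place": 7, "phoneme": 40}  # illustrative label counts

# Shared input-to-hidden weights; one output head per subtask.
W_shared = rng.standard_normal((n_in, n_hidden)) * 0.1
heads = {t: rng.standard_normal((n_hidden, k)) * 0.1 for t, k in task_sizes.items()}

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def forward(frames):
    """All subtasks read the same hidden representation, which is what
    lets the network capture inter-feature dependencies compactly."""
    h = np.tanh(frames @ W_shared)            # common hidden layer
    return {t: softmax(h @ W) for t, W in heads.items()}

posteriors = forward(rng.standard_normal((5, n_in)))  # batch of 5 frames
```

One set of hidden weights serves every subtask, so adding a subtask only adds an output head rather than a whole new MLP.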

4 citations

Proceedings ArticleDOI
29 Oct 2000
TL;DR: Simulation results generally indicate improved separation quality, a higher probability of producing distinct source outputs, and robustness in noisy cases.
Abstract: Techniques for blind separation of mixed speech signals (co-channel speech) have been reported in the literature. One computationally simple method for linear mixtures (suitable for real-time separation) employs a gradient search algorithm to maximize the kurtosis of the outputs (hopefully separated speech signals). We report the results of an enhancement to the algorithm which involves a normalization of the correction matrix used in the update of the separation matrix. Simulation results (using the TIMIT speech corpus) generally indicate improved (sometimes significantly) separation quality, a higher probability of producing distinct source outputs, and robustness in noisy cases.
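A toy reconstruction of this kind of update is sketched below. The abstract gives no equations, so the specific kurtosis-gradient form and the Frobenius-norm normalization of the correction matrix are my own assumptions, and synthetic Laplacian sources stand in for speech:

```python
import numpy as np

rng = np.random.default_rng(1)

# Two synthetic super-Gaussian (Laplacian) sources, linearly mixed.
n = 20000
S = rng.laplace(size=(2, n))
A = np.array([[1.0, 0.6], [0.4, 1.0]])
X = A @ S

# Whiten the mixtures: zero mean, identity covariance.
X = X - X.mean(axis=1, keepdims=True)
evals, evecs = np.linalg.eigh(np.cov(X))
X = (evecs / np.sqrt(evals)) @ evecs.T @ X

W = np.eye(2)                 # separation matrix
mu = 0.1                      # step size
for _ in range(200):
    Y = W @ X
    # Correction matrix along the kurtosis gradient: for whitened data,
    # the gradient of sum_i kurt(y_i) points along E[y^3 x^T] - 3 W.
    C = (Y ** 3) @ X.T / n - 3.0 * W
    C /= np.linalg.norm(C)    # normalization of the correction matrix
    W = W + mu * C
    # Keep W orthonormal so the outputs stay decorrelated.
    U, _, Vt = np.linalg.svd(W)
    W = U @ Vt

Y = W @ X                     # hopefully separated signals
```

Normalizing the correction matrix fixes the effective step length, which is one plausible way such an update gains robustness when the raw gradient magnitude varies wildly.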

4 citations

Proceedings ArticleDOI
01 Nov 2009
TL;DR: In this work, the F-ratio is computed as a theoretical measure to validate the experimental results on speaker recognition, which reveal the performance of the proposed algorithm in performing speaker recognition based on the minimum distance between test features and clusters.
Abstract: The main objective of this paper is to explore the effectiveness of perceptual features combined with pitch for text-independent speaker recognition. The proposed combined features are captured and training models are developed by a K-means clustering procedure. The speaker recognition system is evaluated on clean test speech, and the experimental results reveal the performance of the proposed algorithm in performing speaker recognition based on the minimum distance between test features and clusters. This algorithm gives an overall accuracy of 99.675% and 98.75% for the combined features and the perceptual features, respectively, for identifying a speaker among 8 speakers chosen randomly from 8 different dialect regions in the TIMIT database. It also gives an average accuracy of 96.375% and 95.625% for perceptual linear predictive cepstrum combined with pitch and perceptual linear predictive cepstrum, respectively, for 8 speakers chosen randomly from the same dialect region. A noteworthy feature of the speaker identification algorithm is that the testing procedure is evaluated on identical messages for all speakers. In this work, the F-ratio is computed as a theoretical measure to validate the experimental results on speaker recognition.
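A minimal sketch of the recognition rule (plain NumPy; synthetic Gaussian frames stand in for perceptual features combined with pitch, and the speaker count, codebook size, and 13-dim features are illustrative): each speaker gets a K-means codebook, and a test utterance is assigned to the speaker whose codebook lies at minimum average distance from the test frames.

```python
import numpy as np

rng = np.random.default_rng(2)

def kmeans(X, k, iters=50):
    """Plain K-means: returns a k-codeword codebook for one speaker."""
    C = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(iters):
        labels = ((X[:, None] - C[None]) ** 2).sum(-1).argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                C[j] = X[labels == j].mean(axis=0)
    return C

# Synthetic per-speaker feature frames (stand-ins for perceptual
# features plus pitch); well-separated clouds for the demo.
train = {s: rng.normal(loc=3.0 * s, scale=1.0, size=(200, 13)) for s in range(3)}
codebooks = {s: kmeans(F, k=8) for s, F in train.items()}

def identify(frames):
    """Minimum-distance rule: average distance of each test frame to its
    nearest codeword, minimized over speakers."""
    def dist(C):
        return ((frames[:, None] - C[None]) ** 2).sum(-1).min(axis=1).mean()
    return min(codebooks, key=lambda s: dist(codebooks[s]))

pred = identify(rng.normal(loc=3.0, scale=1.0, size=(50, 13)))  # frames near speaker 1
```

The same frame-wise distance accumulation works regardless of message content, which is consistent with the text-independent setting described above.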

4 citations

Proceedings ArticleDOI
01 Dec 2014
TL;DR: The proposed CRF-based phoneme sequence recognition approach is capable of achieving performance similar to standard hybrid HMM/ANN and ANN/CRF systems where the ANN is trained with manual segmentation.
Abstract: State-of-the-art phoneme sequence recognition systems are based on the hybrid hidden Markov model/artificial neural network (HMM/ANN) framework. In this framework, the local classifier, an ANN, is typically trained using the Viterbi expectation-maximization algorithm, which involves two separate steps: phoneme sequence segmentation and training of the ANN. In this paper, we propose a CRF-based phoneme sequence recognition approach that simultaneously infers the phoneme segmentation and classifies the phoneme sequence. More specifically, the phoneme sequence recognition system consists of a local classifier ANN followed by a conditional random field (CRF) whose parameters are trained jointly, using a cost function that discriminates the true phoneme sequence against all competing sequences. In order to train such a system efficiently, we introduce a novel CRF-based segmentation using an acyclic graph. We study the viability of the proposed approach on the TIMIT phoneme recognition task. Our studies show that the proposed approach is capable of achieving performance similar to standard hybrid HMM/ANN and ANN/CRF systems where the ANN is trained with manual segmentation.
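The joint decoding idea (choosing a label sequence whose score combines local classifier outputs with transition terms, with segments emerging as runs of the same label) can be illustrated with a plain Viterbi pass over a linear chain. This is a hypothetical simplification: the paper's acyclic-graph segmentation and joint discriminative training are not reproduced here, and the scores are random stand-ins for log ANN posteriors.

```python
import numpy as np

rng = np.random.default_rng(3)

n_frames, n_phones = 12, 5
emission = rng.standard_normal((n_frames, n_phones))    # stand-in for log ANN posteriors
transition = rng.standard_normal((n_phones, n_phones))  # CRF-style transition scores

def viterbi(emission, transition):
    """Best-scoring label sequence under local + transition scores.
    Runs of the same label implicitly define the segmentation."""
    T, K = emission.shape
    delta = np.zeros((T, K))            # best score ending in each label
    back = np.zeros((T, K), dtype=int)  # backpointers
    delta[0] = emission[0]
    for t in range(1, T):
        scores = delta[t - 1][:, None] + transition   # K x K predecessor scores
        back[t] = scores.argmax(axis=0)
        delta[t] = scores.max(axis=0) + emission[t]
    path = [int(delta[-1].argmax())]
    for t in range(T - 1, 0, -1):
        path.append(int(back[t, path[-1]]))
    return path[::-1]

path = viterbi(emission, transition)
```

Because the transition terms couple adjacent frames, the decoded path can differ from the frame-wise argmax while scoring at least as well as it, which is the benefit of sequence-level inference.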

4 citations

Journal ArticleDOI
TL;DR: This talk presents phonetic models that capture both the dynamic characteristics and the statistical dependencies of acoustic attributes in a segment‐based framework that compares favorably with other studies using the TIMIT corpus.
Abstract: This talk presents phonetic models that capture both the dynamic characteristics and the statistical dependencies of acoustic attributes in a segment‐based framework. The approach is based on the creation of a track, Tα, for each phonetic unit α. The track serves as a model of the dynamic trajectories of the acoustic attributes over the segment. The statistical framework for scoring incorporates the auto‐ and cross‐correlation properties of the track error over time, within a segment. On a vowel classification task [W. Goldenthal and J. Glass, ‘‘Modeling Spectra Dynamics for Vowel Classification,’’ Proc. Eurospeech 93, pp. 289–292, Berlin, Germany (1993)], this methodology achieved classification performance of 68.9%. This result compares favorably with other studies using the TIMIT corpus. This talk extends this result by presenting context‐independent and context‐dependent experiments for all the phones. Context‐independent classification performance of 76.8% is demonstrated. The key to implementing the...
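A loose sketch of the track idea: average the time-normalized training trajectories of each unit into a track, then classify a test segment by its track error. This simplifies the paper's model in two labeled ways: the auto- and cross-correlation error model is replaced with plain mean squared error, and the data (two made-up "phones" with rising and falling trajectories) are synthetic.

```python
import numpy as np

rng = np.random.default_rng(4)

n_attr, track_len = 14, 10   # attribute dimension, normalized track length (illustrative)

def resample(seg, length):
    """Linearly time-normalize a variable-length segment to `length` frames."""
    idx = np.linspace(0, len(seg) - 1, length)
    lo = np.floor(idx).astype(int)
    hi = np.minimum(lo + 1, len(seg) - 1)
    w = (idx - lo)[:, None]
    return seg[lo] * (1 - w) + seg[hi] * w

def make_track(segments):
    """Track T_alpha: the average time-normalized trajectory of a unit."""
    return np.mean([resample(s, track_len) for s in segments], axis=0)

def track_error(segment, track):
    """Simplified score: mean squared track error. (The paper instead models
    the auto- and cross-correlation of this error over time.)"""
    return ((resample(segment, track_len) - track) ** 2).mean()

def synth(base, n_seg):
    """Synthetic segments: a base trajectory plus noise, random lengths."""
    segs = []
    for _ in range(n_seg):
        L = int(rng.integers(8, 16))
        t = np.linspace(0.0, 1.0, L)[:, None]
        segs.append(np.tile(base(t), (1, n_attr)) + 0.1 * rng.standard_normal((L, n_attr)))
    return segs

rise, fall = (lambda t: t), (lambda t: 1.0 - t)   # two made-up "phones"
tracks = {"aa": make_track(synth(rise, 20)), "iy": make_track(synth(fall, 20))}

test_seg = synth(rise, 1)[0]
pred = min(tracks, key=lambda a: track_error(test_seg, tracks[a]))
```

Time-normalizing every segment to a common length is what lets one fixed-size track model segments of varying duration.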

4 citations


Network Information
Related Topics (5)
- Recurrent neural network: 29.2K papers, 890K citations (76% related)
- Feature (machine learning): 33.9K papers, 798.7K citations (75% related)
- Feature vector: 48.8K papers, 954.4K citations (74% related)
- Natural language: 31.1K papers, 806.8K citations (73% related)
- Deep learning: 79.8K papers, 2.1M citations (72% related)
Performance
Metrics
No. of papers in the topic in previous years:

Year    Papers
2023    24
2022    62
2021    67
2020    86
2019    77
2018    95