Topic

TIMIT

About: TIMIT is a research topic. Over the lifetime, 1401 publications have been published within this topic receiving 59888 citations. The topic is also known as: TIMIT Acoustic-Phonetic Continuous Speech Corpus.

...read moreread less

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Flexible Feature Spaces Based on Generalized Heteroscedastic Linear Discriminant Analysis

[...]

A. Duminuco¹, Chaojun Liu¹, David Kryze¹, Luca Rigazio¹•Institutions (1)

Panasonic¹

14 May 2006

TL;DR: A generalized feature projection scheme which allows each feature dimension to be classified in a set of 1 to M classes, where M is the total number of classes, which allows for a better trade-off of number of parameters versus model complexity.

...read moreread less

Abstract: This paper presents a generalized feature projection scheme which allows each feature dimension to be classified in a set of 1 to M classes, where M is the total number of classes. Our method is an extension of the classical full-space null-space approach where each dimension can only be classified in either M classes or 1 class. We believe that this more general formulation allows for a better trade-off of number of parameters versus model complexity, which in turn should provide better classification. We first tested GLDA on TIMIT and obtained an improvement up to 1% in phone classification rate over the best HLDA classifier. Preliminary results on Wall Street Journal 20K also show an improvement over the best HLDA system of about 0.2% absolute.

...read moreread less

1 citations

Journal Article•DOI•

Physiologically Motivated Feature Extraction for Robust Automatic Speech Recognition

[...]

Ibrahim Missaoui, Zied Lachiri

01 Jan 2016-International Journal of Advanced Computer Science and Applications

TL;DR: The evaluation results demonstrate that the proposed feature extraction method outperforms the classic methods such as Perceptual Linear Prediction, Linear Predictive Coding, Linear Prediction Cepstral coefficients and Mel Frequency CepStral Coefficients.

...read moreread less

Abstract: In this paper, a new method is presented to extract robust speech features in the presence of the external noise. The proposed method based on two-dimensional Gabor filters takes in account the spectro-temporal modulation frequencies and also limits the redundancy on the feature level. The performance of the proposed feature extraction method was evaluated on isolated speech words which are extracted from TIMIT corpus and corrupted by background noise. The evaluation results demonstrate that the proposed feature extraction method outperforms the classic methods such as Perceptual Linear Prediction, Linear Predictive Coding, Linear Prediction Cepstral coefficients and Mel Frequency Cepstral Coefficients.

...read moreread less

1 citations

Book Chapter•DOI•

Feature selection using ant colony optimization for text-independent speaker verification system

[...]

Javad Sohafi-Bonab¹, Mehdi Hosseinzadeh Aghdam²•Institutions (2)

Islamic Azad University¹, Payame Noor University²

22 Oct 2010

TL;DR: This paper presents another method that is based on ant colony optimization (ACO) that is compared to the performance of genetic algorithm on the task of feature selection in TIMIT corpora and indicates that with the optimized feature set, theperformance of the ASV system is improved.

...read moreread less

Abstract: With the growing trend toward remote security verification procedures for telephone banking, biometric security measures and similar applications, automatic speaker verification (ASV) has received a lot of attention in recent years. The complexity of ASV system and its verification time depends on the number of feature vectors, their dimensionality, the complexity of the speaker models and the number of speakers. In this paper, we concentrate on optimizing dimensionality of feature space by selecting relevant features. It presents another method that is based on ant colony optimization (ACO). The performance of the proposed algorithm is compared to the performance of genetic algorithm on the task of feature selection in TIMIT corpora. The results of experiments indicate that with the optimized feature set, the performance of the ASV system is improved.

...read moreread less

1 citations

Proceedings Article•DOI•

Correlation Between Speaker Gender and Perceptual Quality of Mobile Speech Signal

[...]

Abdulaleem Z. Al-Othmani¹, Azizah Abdul Manaf, Akram M. Zeki², Qusay Al-Maatouk³, Abdulaziz Aborujilah¹, Maen Al-Rashdan³ - Show less +2 more•Institutions (3)

University of Kuala Lumpur¹, International Islamic University Malaysia², Ritsumeikan Asia Pacific University³

20 Feb 2020

TL;DR: The results clearly show that there is a significant difference in perceptual quality score between female and male speech signals which demonstrate another reliability issue of PESQ as a perceptual quality of speech signal in mobile communications.

...read moreread less

Abstract: Perceptual evaluation of speech signals is a very crucial measure for quality of service in mobile speech communication. Several subjective and objective quality measures are being utilized to evaluate the perceptual quality of speech signals. Perceptual Evaluation of Speech Quality (PESQ) has been found to reliably predict the quality of processed speech signals with a higher correlation with the perceived quality. However, some studied have shown some issues with PESQ measure in specific environments or speech signals. This paper investigates the effect of speaker gender on PESQ measure of the perceptual quality of GSM Full Rate (GSM-FR) encoded speech signals. A Matlab experiment is carried out to encode 350 speech files from TIMIT corpus using GSM-FR vocoder and calculate the PESQ scores. The results clearly show that there is a significant difference in perceptual quality score between female and male speech signals which demonstrate another reliability issue of PESQ as a perceptual quality of speech signal in mobile communications.

...read moreread less

1 citations

Proceedings Article•DOI•

A Modified Speaking Rate Estimation Based on Frame-Level LSTM

[...]

Yanhong Xiao¹, Shixuan Du¹, Xiang Xie¹, Jing Wang¹, Qingran Zhan¹ - Show less +1 more•Institutions (1)

Beijing Institute of Technology¹

01 Aug 2018

TL;DR: Instead of taking the whole utterance as a sequence, the frame-level LSTM exploits the sequence information in each segment and brings a more precise segmented speaking rate estimation.

...read moreread less

Abstract: Speaking rate has various applications in many domains such as speech recognition, speaker verification, emotion recognition, etc. It conveys long-term information in speech and changes over time which can be seen as a kind of time sequence. This paper proposes a frame-level LSTM speaking rate estimation method. Instead of taking the whole utterance as a sequence, the frame-level LSTM exploits the sequence information in each segment and brings a more precise segmented speaking rate estimation. We also evaluate the influence of fixed-length segmentation and voice activity detection(vad) segmentation on speaking rate estimation. Results show that the proposed frame-level LSTM method yields a high correlation between the estimated speaking rate and the ground truth. It achieves a relative improvement of 13.0% compared to the state of the art statistical learning method and 16.3% over the support vector regression(SVR) evaluated on the same TIMIT corpus.

...read moreread less

1 citations

Collapse

Network Information

Performance

Metrics

1,488

Papers

68,688

Citations

No. of papers in the topic in previous years
Year	Papers
2023	24
2022	62
2021	67
2020	86
2019	77
2018	95

TIMIT

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics