Topic
TIMIT
About: TIMIT is a research topic. Over its lifetime, 1401 publications have been published within this topic, receiving 59888 citations. The topic is also known as: TIMIT Acoustic-Phonetic Continuous Speech Corpus.
Papers
20 Nov 1995
TL;DR: A relational database management system has been developed to house the speech data; it provides much more usability, flexibility and expandability than file-based speech corpora such as TIMIT.
Abstract: A collection of digits and words, spoken with a New Zealand English accent, has been systematically and formally collected. This collection, along with the beginning and end points of the realised phonemes within the words, comprises the Otago Speech Corpora. A relational database management system has been developed to house the speech data. This system provides much more usability, flexibility and expandability than file-based speech corpora such as TIMIT.
31 citations
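The abstract above does not specify the database schema, but a minimal relational layout for a phoneme-annotated corpus can be sketched with Python's built-in sqlite3 module. The table and column names here are assumptions for illustration, not the Otago system's actual design:

```python
import sqlite3

# Hypothetical minimal schema: one table of utterances, one table of
# phoneme segments with begin/end sample points (names are assumptions).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE utterance (
    id INTEGER PRIMARY KEY,
    speaker TEXT,
    text TEXT,
    sample_rate INTEGER
);
CREATE TABLE phoneme (
    id INTEGER PRIMARY KEY,
    utterance_id INTEGER REFERENCES utterance(id),
    label TEXT,
    start_sample INTEGER,   -- realised phoneme begin point
    end_sample INTEGER      -- realised phoneme end point
);
""")
conn.execute("INSERT INTO utterance VALUES (1, 'nz01', 'seven', 22050)")
conn.executemany(
    "INSERT INTO phoneme VALUES (?, ?, ?, ?, ?)",
    [(1, 1, 's', 0, 3200), (2, 1, 'eh', 3200, 6100),
     (3, 1, 'v', 6100, 8000), (4, 1, 'ax', 8000, 9500),
     (5, 1, 'n', 9500, 12000)],
)
# Unlike a file-based corpus such as TIMIT, segments can be selected
# declaratively rather than by parsing per-file annotation files:
rows = conn.execute(
    "SELECT label, start_sample, end_sample FROM phoneme "
    "WHERE utterance_id = 1 ORDER BY start_sample"
).fetchall()
print(rows[0])  # → ('s', 0, 3200)
```

Queries of this kind illustrate the flexibility argument: adding new annotation tiers is a schema change, not a file-format change.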
TL;DR: A novel discriminative objective function for estimating hidden Markov model (HMM) parameters, based on the calculation of overall risk; it minimises the risk of misclassification on the training database and thus maximises recognition accuracy.
31 citations
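An overall-risk objective of this kind is commonly written as an expected loss over the training set; the notation below is a generic sketch, not the paper's own formulation:

```latex
% Sketch of an overall-risk criterion (symbols assumed, not from the paper):
% O_n -- n-th training observation sequence, c_n -- its true class,
% W_j -- candidate class, \ell(j, c_n) -- loss for deciding W_j when c_n is true.
R(\lambda) \;=\; \sum_{n=1}^{N} \sum_{j} \ell(j, c_n)\, P_\lambda(W_j \mid O_n)
```

Minimising $R(\lambda)$ over the HMM parameters $\lambda$ pushes posterior mass away from misclassifications on the training database, which is the sense in which the objective is discriminative.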
TL;DR: An energy-constrained signal subspace (ECSS) method is proposed for speech enhancement and automatic speech recognition under additive noise conditions; the ECSS method was found to achieve very high word recognition accuracy (WRA) on the digits set under low-SNR conditions.
30 citations
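The ECSS details are not given in the TL;DR, but the general signal-subspace idea it builds on can be sketched: eigendecompose the frame covariance of the noisy signal and keep only the directions whose energy exceeds an assumed noise floor. The toy signal, frame length, and threshold below are all assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "speech": a sinusoid buried in white noise.
n = 2000
clean = np.sin(2 * np.pi * 0.05 * np.arange(n))
noisy = clean + 0.5 * rng.standard_normal(n)

dim = 20  # analysis frame length (one sinusoid period, by construction)
frames = np.lib.stride_tricks.sliding_window_view(noisy, dim)[::dim]
cov = frames.T @ frames / len(frames)

# Eigendecomposition of the frame covariance; the noise-floor estimate
# plays the role of the energy constraint.
w, v = np.linalg.eigh(cov)
noise_floor = 0.5 ** 2          # assumed known noise variance
keep = w > noise_floor          # retain signal-dominated directions only
proj = v[:, keep] @ v[:, keep].T

enhanced = (frames @ proj).reshape(-1)
err_noisy = np.mean((frames.reshape(-1) - clean[: len(enhanced)]) ** 2)
err_enh = np.mean((enhanced - clean[: len(enhanced)]) ** 2)
print(err_enh < err_noisy)  # projecting out noise directions reduces error
```

Discarding the low-energy eigen-directions removes most of the noise while preserving the signal-dominated subspace, which is why subspace methods help recognisers at low SNR.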
05 Mar 2017
TL;DR: A recently developed deep learning model, the recurrent convolutional neural network (RCNN), is proposed for speech processing; it inherits some merits of recurrent neural networks (RNNs) and convolutional neural networks (CNNs) and is competitive with previous methods in terms of accuracy and efficiency.
Abstract: Different neural networks have exhibited excellent performance on various speech processing tasks, and they usually have specific advantages and disadvantages. We propose to use a recently developed deep learning model, the recurrent convolutional neural network (RCNN), for speech processing; it inherits some merits of recurrent neural networks (RNN) and convolutional neural networks (CNN). The core module can be viewed as a convolutional layer embedded with an RNN, which enables the model to capture both temporal and frequency dependence in the spectrogram of the speech in an efficient way. The model is tested on the TIMIT speech corpus for phoneme recognition and on IEMOCAP for emotion recognition. Experimental results show that the model is competitive with previous methods in terms of accuracy and efficiency.
30 citations
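The "convolutional layer embedded with an RNN" idea can be sketched in NumPy: convolve over the frequency axis of each spectrogram frame, then update a hidden state recurrently across time. The shapes, pooling, and tanh nonlinearity below are assumptions for illustration, not the paper's exact architecture:

```python
import numpy as np

rng = np.random.default_rng(0)

# time steps, frequency bins, kernel width, hidden units (all assumed)
T, F, K, H = 12, 40, 5, 8
spec = rng.standard_normal((T, F))  # fake log-spectrogram

kernels = rng.standard_normal((H, K)) * 0.1   # one 1-D frequency kernel per unit
U = rng.standard_normal((H, H)) * 0.1         # recurrent weights

h = np.zeros(H)
outputs = []
for t in range(T):
    # Frequency dependence: convolve each kernel with the current frame,
    # then max-pool over the frequency axis.
    conv = np.array([np.convolve(spec[t], k, mode="valid").max() for k in kernels])
    # Temporal dependence: recurrent update across frames.
    h = np.tanh(conv + U @ h)
    outputs.append(h)

outputs = np.stack(outputs)
print(outputs.shape)  # (12, 8): one hidden vector per spectrogram frame
```

The single loop shows why the module is efficient: one convolution per frame captures frequency structure, while the recurrence carries temporal context without a separate RNN stack.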
TL;DR: The proposed model is a convolutional neural network that operates directly on the raw waveform; it is optimized to identify spectral changes in the signal using the Noise-Contrastive Estimation principle and reaches state-of-the-art performance on both data sets.
Abstract: We propose a self-supervised representation learning model for the task of unsupervised phoneme boundary detection. The model is a convolutional neural network that operates directly on the raw waveform. It is optimized to identify spectral changes in the signal using the Noise-Contrastive Estimation principle. At test time, a peak detection algorithm is applied over the model outputs to produce the final boundaries. As such, the proposed model is trained in a fully unsupervised manner, with no manual annotations in the form of target boundaries or phonetic transcriptions. We compare the proposed approach to several unsupervised baselines using both the TIMIT and Buckeye corpora. Results suggest that our approach surpasses the baseline models and reaches state-of-the-art performance on both data sets. Furthermore, we experimented with expanding the training set with additional examples from the Librispeech corpus. We evaluated the resulting model on distributions and languages that were not seen during the training phase (English, Hebrew and German) and showed that utilizing additional untranscribed data is beneficial for model performance.
30 citations
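The test-time peak detection step described above can be sketched as a simple local-maximum pass over frame-wise boundary scores. The threshold and neighbourhood rule here are assumptions; the paper's exact algorithm is not given in the abstract:

```python
import numpy as np

def pick_peaks(scores, threshold=0.5):
    """Return indices that are local maxima above `threshold`
    (a minimal sketch of test-time peak picking, not the paper's method)."""
    scores = np.asarray(scores, dtype=float)
    peaks = []
    for i in range(1, len(scores) - 1):
        if (scores[i] > threshold
                and scores[i] >= scores[i - 1]
                and scores[i] > scores[i + 1]):
            peaks.append(i)
    return peaks

# Frame-wise "boundary scores" as a model might emit them:
scores = [0.1, 0.2, 0.9, 0.3, 0.1, 0.6, 0.8, 0.2, 0.1]
print(pick_peaks(scores))  # → [2, 6]
```

Each returned index would be mapped back to a time position to yield a predicted phoneme boundary, with no transcriptions needed at any stage.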