A new fast algorithm for automatic segmentation of continuous speech.

Open Access, Proceedings Article

Abstract:

This paper presents a new method for automatic segmentation of continuous speech into phone-like units. The method is based on a very fast presegmentation algorithm that uses a new statistical model of speech, and on a search in a multilevel structure, called a Dendrogram, to decrease the insertion rate. At each step, the performance of the algorithms was tested on a large set of TIMIT sentences. According to these tests, the final segmentation algorithm detects nearly 97% of segments with an average boundary position error of less than 7 ms and an average insertion rate of less than 12.6%. The paper describes the algorithms for determining the acoustic segments and reports performance results.
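The three figures quoted in the abstract (detection rate, boundary position error, insertion rate) can be illustrated with a small scoring sketch. This is not the authors' evaluation code. The function name `score_boundaries`, the 20 ms matching tolerance, the greedy one-to-one matching, and the choice to normalize insertions by the number of reference boundaries are all assumptions made for illustration, since the abstract does not state how the figures were computed.

```python
from bisect import bisect_left


def score_boundaries(reference, hypothesis, tolerance=0.020):
    """Score hypothesized boundaries against reference boundaries (seconds).

    Assumptions (not from the paper): a reference boundary counts as detected
    if an unused hypothesized boundary lies within `tolerance` of it, and
    every unmatched hypothesized boundary counts as an insertion.
    """
    hyp = sorted(hypothesis)
    used = [False] * len(hyp)
    errors = []

    for ref in sorted(reference):
        best = None
        # Scan only hypotheses inside the window [ref - tol, ref + tol].
        i = bisect_left(hyp, ref - tolerance)
        while i < len(hyp) and hyp[i] <= ref + tolerance:
            if not used[i] and (
                best is None or abs(hyp[i] - ref) < abs(hyp[best] - ref)
            ):
                best = i
            i += 1
        if best is not None:
            used[best] = True
            errors.append(abs(hyp[best] - ref))

    detected = len(errors)
    insertions = len(hyp) - detected
    n_ref = len(reference)
    return {
        "detection_rate": detected / n_ref if n_ref else 0.0,
        "insertion_rate": insertions / n_ref if n_ref else 0.0,
        "mean_error": sum(errors) / detected if detected else 0.0,
    }


if __name__ == "__main__":
    # Made-up boundary times for illustration only, not TIMIT data.
    ref = [0.10, 0.25, 0.40, 0.60]
    hyp = [0.105, 0.18, 0.26, 0.41, 0.75]
    print(score_boundaries(ref, hyp))
```

With a scoring convention like this, a lower insertion rate means fewer spurious boundaries, which is the quantity the Dendrogram search is described as reducing.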

Citations

Journal Article

Automatic phonetic segmentation

Doroteo Torre Toledano, +2 more

- 01 Nov 2003 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: The most frequently used approach, based on a modified Hidden Markov Model (HMM) phonetic recognizer, is analyzed; a general framework for the local refinement of boundaries is proposed; and the performance of several pattern classification approaches is compared within this framework.

Automatic time alignment of phonemes using acoustic-phonetic information

Ronald A. Cole, +1 more

TL;DR: The hypothesis is that the integration of acoustic-phonetic information into a state-of-the-art automatic phonetic alignment system will significantly improve its accuracy and robustness.

Journal Article

Mapping between acoustic and articulatory gestures

Gopal Ananthakrishnan, +1 more

- 01 Apr 2011 -

Speech Communication

TL;DR: A definition for articulatory as well as acoustic gestures is proposed, along with a method to segment the measured articulatory trajectories and acoustic waveforms into gestures, and a method based on the error in estimated critical points is suggested.

Proceedings Article

On hierarchical clustering for speech phonetic segmentation

Ciro Gracia, +1 more

TL;DR: This paper extends the framework with an unsupervised segmentation algorithm based on a divisive clustering technique and compares both approaches: agglomerative nesting (Bottom-up) against divisive analysis (Top-down).

Journal Article

A phonetic labeling method for MAT database processing

Hsiao‐Chuan Wang, +3 more

- 01 Sep 1999 -

Journal of The Chinese Institute of Engi...

TL;DR: A semi‐automatic phonetic labeling method for processing in the MAT (Mandarin across Taiwan) speech database can achieve segmentation accuracy around 90% for an allowed tolerance of 16 ms.

References

Book

Graph theory with applications

J. A. Bondy

TL;DR: A textbook on graph theory and its applications. The entry also points to a review in the Journal of the Operational Research Society, Vol. 28, Issue 1, pp. 237-238.

Journal Article

Graph theory with applications (revised edition), by J. A. Bondy and U.S.R. Murty. Pp x, 264. £5·95 paperback. 1977. SBN 0 333 22694 1 (Macmillan)

E. Keith Lloyd

- 01 Mar 1978 -

The Mathematical Gazette

Book

Discrete-Time Processing of Speech Signals

J. R. Deller, +2 more

TL;DR: The preface to the IEEE Edition explains the background to speech production, coding, and quality assessment and introduces the Hidden Markov Model, the Artificial Neural Network, and Speech Enhancement.

Book

Voice and Speech Processing

Thomas W. Parsons

Proceedings Article

Multi-level acoustic segmentation of continuous speech

James Glass, +1 more

TL;DR: The authors have developed a procedure that describes the acoustic structure of the signal, and the algorithms for determining the acoustic segments and the multi-level structure, and possible use for automatic speech recognition is discussed.
