DISTBIC: a speaker-based segmentation for audio data indexing

doi:10.1016/S0167-6393(00)00027-3

Journal Article

P. Delacourt, +1 more

- 01 Sep 2000 -

Speech Communication

- Vol. 32, Iss: 1, pp 111-126

TL;DR: This paper proposes a new segmentation method, called DISTBIC, which combines two different segmentation techniques and is efficient at detecting speaker turns even when they are close to one another (i.e., separated by a few seconds).

About:

This article was published in Speech Communication on 2000-09-01. It has received 299 citations to date. The article focuses on the topics: Speaker diarisation & Speaker recognition.

Citations

Journal Article

A tutorial on text-independent speaker verification

Frédéric Bimbot, +9 more

- 01 Jan 2004 -

EURASIP Journal on Advances in Signal Pr...

TL;DR: The introduction proposes a modular scheme of the training and test phases of a speaker verification system, and the speech parameterization most commonly used in speaker verification, namely cepstral analysis, is then detailed.

Journal Article

Speaker Diarization: A Review of Recent Research

Xavier Anguera Miro, +5 more

- 01 Feb 2012 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: An analysis of speaker diarization performance as reported through the NIST Rich Transcription evaluations on meeting data is presented, and important areas for future research are identified.

Book

MPEG-7 Audio and Beyond: Audio Content Indexing and Retrieval

Hyoung-Gook Kim, +2 more

TL;DR: A comparison of MPEG-7 Audio Spectrum Projection vs. MFCC Features and Results for Distinguishing Between Speech, Music and Environmental Sound shows that the former is superior to the latter in terms of sound classification.

Journal Article

Multistage speaker diarization of broadcast news

Claude Barras, +3 more

- 01 Sep 2006 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: This paper describes recent advances in speaker diarization with a multistage segmentation and clustering system that incorporates a speaker identification step and builds upon the baseline audio partitioner used in the LIMSI broadcast news transcription system.

Journal Article

Robust speaker change detection

Jitendra Ajmera, +2 more

- 26 Jul 2004 -

IEEE Signal Processing Letters

TL;DR: The authors present a criterion that can be used to identify speaker changes in an audio stream without requiring tuning; it consists of calculating the log likelihood ratio (LLR) of two models with the same number of parameters.

References

Proceedings Article

SWITCHBOARD: telephone speech corpus for research and development

J.J. Godfrey, +2 more

TL;DR: SWITCHBOARD is a large multispeaker corpus of conversational speech and text that should be of interest to researchers in speaker authentication and large vocabulary speech recognition.

Book

Stochastic Complexity In Statistical Inquiry

Jorma Rissanen

IEEE International Conference on Acoustics Speech and Signal Processing

S. Chen, +3 more

Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion

S. Chen

TL;DR: The segmentation algorithm can successfully detect acoustic changes; the clustering algorithm can produce clusters with high purity, leading to improvements in accuracy through unsupervised adaptation as much as the ideal clustering by the true speaker identities.

Automatic Segmentation, Classification and Clustering of Broadcast News Audio

M. A. Siegler

TL;DR: This work describes the problems faced in adapting a system built to recognize one utterance at a time to a task that requires recognition of an entire half hour show, and shows that a priori knowledge of acoustic conditions and speakers in the broadcast data is not required for segmentation.

Digital Signal Processing

Related Papers (5)

Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion

Improved speaker segmentation and segments clustering using the Bayesian information criterion

Automatic Segmentation, Classification and Clustering of Broadcast News Audio

Segregation of speakers for speech recognition and speaker identification

Speaker Verification Using Adapted Gaussian Mixture Models