scispace - formally typeset
Search or ask a question
Topic

Speaker recognition

About: Speaker recognition is a research topic. Over the lifetime, 14990 publications have been published within this topic receiving 310061 citations.


Papers
More filters
Proceedings Article
01 Jan 2003
TL;DR: An emotion recognition method using the facial images and speech signals and the LDA (linear discriminant analysis) results obtained better performance than previous ones.
Abstract: In this paper, we propose an emotion recognition method using the facial images and speech signals Six basic emotions including happiness, sadness, anger, surprise, fear and dislike are investigated Facial expression recognition is performed by using the multi-resolution analysis based on the discrete wavelet Here, we obtain the feature vectors through the LDA (linear discriminant analysis) On the other hand, the emotion recognition from the speech signal method has a structure of performing the recognition algorithm independently for each wavelet subband and the final recognition are obtained from the multi-decision making scheme After merging the facial and speech emotion recognition results, we obtained better performance than previous ones

68 citations

Proceedings ArticleDOI
P.Z. Patrick1, G. Aversano1, Raphaël Blouet1, M. Charbit1, Gérard Chollet1 
18 Mar 2005
TL;DR: It is demonstrated that an automatic speaker recognition system could be seriously threatened by a transformation of this kind, using a speaker verification system to calculate the likelihood that the forged voice belongs to the genuine client.
Abstract: The article deals with a technique of voice forgery using the ALISP (automatic language independent speech processing) approach. Such a technique allows the voice of an arbitrary person (the impostor) to be transformed, forging the identity of another person (the client). Our goal is to demonstrate that an automatic speaker recognition system could be seriously threatened by a transformation of this kind. For this purpose, we use a speaker verification system to calculate the likelihood that the forged voice belongs to the genuine client. Experiments on NIST 2004 evaluation data show that the equal error rate for the verification task is significantly increased by our voice transformation.

68 citations

Proceedings ArticleDOI
05 Mar 2017
TL;DR: The final results on speaker diarization system indicate that the use of speaker change detection based on CNN is beneficial with relative improvement of diarized error rate by 28 %.
Abstract: The aim of this paper is to propose a speaker change detection technique based on Convolutional Neural Network (CNN) and evaluate its contribution to the performance of a speaker diarization system for telephone conversations. For the comparison we used an i-vector based speaker diarization system. The baseline speaker change detection uses Generalized Likelihood Ratio (GLR) metric. Experiments were conducted on the English part of the CallHome corpus. Our proposed CNN speaker change detection outperformed the GLR approach, reducing the Equal Error Rate relatively by 46 %. The final results on speaker diarization system indicate that the use of speaker change detection based on CNN is beneficial with relative improvement of diarization error rate by 28 %.

68 citations


Network Information
Related Topics (5)
Feature vector
48.8K papers, 954.4K citations
83% related
Recurrent neural network
29.2K papers, 890K citations
82% related
Feature extraction
111.8K papers, 2.1M citations
81% related
Signal processing
73.4K papers, 983.5K citations
81% related
Decoding methods
65.7K papers, 900K citations
79% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023165
2022468
2021283
2020475
2019484
2018420