Performance comparison of speaker recognition systems in presence of duration variability
Citations
93 citations
21 citations
Cites background from "Performance comparison of speaker r..."
...MFCC [26], [27] feature captures spectral and phonetic information related to speech signal....
[...]
15 citations
13 citations
13 citations
References
4,673 citations
"Performance comparison of speaker r..." refers background or methods in this paper
...During the last two decades in speaker recognition research, most of the notable developments in classifier-level are based on the GMM concept [10], [11], [12]....
[...]
...In GMM-UBM, prior to enrollment phase, a single speaker independent universal background model (UBM) is created by using a large development data [10], [14]....
[...]
...For this reason, GMM-UBM systems are still popular and widely used, particularly when suitable amount development data is inadequate [10], [5], [14]....
[...]
3,526 citations
"Performance comparison of speaker r..." refers background or methods in this paper
...Though i-vector based speaker recognition systems are shown to give best recognition accuracy in latest NIST SREs [18], [19], [21], they require huge computational resources as well as massive amount of development data for estimating its parameters and hyper-parameters....
[...]
...Inspired by the earlier use of JFA, Dehak et al. proposed total-variability based approach for reducing the dimensionality of GMM-supervector [18]....
[...]
...The i-vector represents the GMM supervector by a single variability space which reduces high dimensional GMM supervector into lower dimensional total variability space [18]....
[...]
...proposed total-variability based approach for reducing the dimensionality of GMM-supervector [18]....
[...]
3,134 citations
"Performance comparison of speaker r..." refers methods in this paper
...For classification, various modeling techniques such as vector quantization (VQ) [7], dynamic time warping (DTW) [8], Gaussian mixture model (GMM) [9] were used....
[...]
1,686 citations
"Performance comparison of speaker r..." refers background or methods in this paper
...Its potential applications include telephone banking system, system access control, providing forensic evidence, call centers and many more [1], [2]....
[...]
...A TI speaker recognition system includes three fundamental modules [1], [2]: a feature extraction unit, which represents the speech signal in a compact manner, a modeling block to characterize those features using statistical approaches, and lastly, a classification scheme to classify the unknown utterance....
[...]
...SV system can be broadly categorized as text-dependent (TD) [4] and text-independent (TI) modes depending on the speech content in training and test phase [1], [2]....
[...]
...Speech signal conveys information regarding the physiological aspects of a speaker because it is affected by the unique shape and size of vocal tract, mouth, nasal cavity, etc [1], [2]....
[...]
1,433 citations
"Performance comparison of speaker r..." refers background or methods in this paper
...Its potential applications include telephone banking system, system access control, providing forensic evidence, call centers and many more [1], [2]....
[...]
...A TI speaker recognition system includes three fundamental modules [1], [2]: a feature extraction unit, which represents the speech signal in a compact manner, a modeling block to characterize those features using statistical approaches, and lastly, a classification scheme to classify the unknown utterance....
[...]
...SV system can be broadly categorized as text-dependent (TD) [4] and text-independent (TI) modes depending on the speech content in training and test phase [1], [2]....
[...]
...Speech signal conveys information regarding the physiological aspects of a speaker because it is affected by the unique shape and size of vocal tract, mouth, nasal cavity, etc [1], [2]....
[...]