S
Srinivasan Umesh
Researcher at Indian Institute of Technology Madras
Publications - 118
Citations - 1244
Srinivasan Umesh is an academic researcher from Indian Institute of Technology Madras. The author has contributed to research in topics: Normalization (statistics) & Speaker recognition. The author has an hindex of 18, co-authored 109 publications receiving 1072 citations. Previous affiliations of Srinivasan Umesh include University of Rhode Island & RWTH Aachen University.
Papers
More filters
Proceedings ArticleDOI
Fitting the Mel scale
TL;DR: It is shown that there are many qualitatively different equations, each with few parameters, that fit the experimentally obtained Mel scale and that F/sub M/=f/(af+b) where F/ sub M/ and f are the Mel and physical frequency respectively, fits better than a line in the linear region or a logarithm in the "log" region.
Journal ArticleDOI
Estimation of parameters of exponentially damped sinusoids using fast maximum likelihood estimation with application to NMR spectroscopy data
Srinivasan Umesh,Donald W. Tufts +1 more
TL;DR: The proposed fast maximum likelihood algorithm is an iterative method that decomposes the original data into its constituent signal components and estimates the parameters of the individual components efficiently using the shape of the compressed likelihood function (CLF) in the parameter space.
Journal ArticleDOI
Scale transform in speech analysis
TL;DR: It is shown that the F-ratio tests indicate better separability of vowels by using scale-transform based features than mel- transform based features.
Proceedings ArticleDOI
Using VTLN for broadcast news transcription.
TL;DR: A new, simple, linear approximation to VTLN allows the Jacobian to be exactly computed and can be highly efficient in terms of warp factor estimation and application of the warp factors.
Proceedings ArticleDOI
Improved cepstral mean and variance normalization using Bayesian framework
TL;DR: This work proposes to use posterior estimates of mean and variance in CMVN, instead of the maximum likelihood estimates, and has shown to preserve discriminable information without increase in computational cost, making it particularly relevant for Interactive Voice Response (IVR)-based applications.