scispace - formally typeset
S

Srinivasan Umesh

Researcher at Indian Institute of Technology Madras

Publications -  118
Citations -  1244

Srinivasan Umesh is an academic researcher from Indian Institute of Technology Madras. The author has contributed to research in topics: Normalization (statistics) & Speaker recognition. The author has an hindex of 18, co-authored 109 publications receiving 1072 citations. Previous affiliations of Srinivasan Umesh include University of Rhode Island & RWTH Aachen University.

Papers
More filters
Proceedings ArticleDOI

Fitting the Mel scale

TL;DR: It is shown that there are many qualitatively different equations, each with few parameters, that fit the experimentally obtained Mel scale and that F/sub M/=f/(af+b) where F/ sub M/ and f are the Mel and physical frequency respectively, fits better than a line in the linear region or a logarithm in the "log" region.
Journal ArticleDOI

Estimation of parameters of exponentially damped sinusoids using fast maximum likelihood estimation with application to NMR spectroscopy data

TL;DR: The proposed fast maximum likelihood algorithm is an iterative method that decomposes the original data into its constituent signal components and estimates the parameters of the individual components efficiently using the shape of the compressed likelihood function (CLF) in the parameter space.
Journal ArticleDOI

Scale transform in speech analysis

TL;DR: It is shown that the F-ratio tests indicate better separability of vowels by using scale-transform based features than mel- transform based features.
Proceedings ArticleDOI

Using VTLN for broadcast news transcription.

TL;DR: A new, simple, linear approximation to VTLN allows the Jacobian to be exactly computed and can be highly efficient in terms of warp factor estimation and application of the warp factors.
Proceedings ArticleDOI

Improved cepstral mean and variance normalization using Bayesian framework

TL;DR: This work proposes to use posterior estimates of mean and variance in CMVN, instead of the maximum likelihood estimates, and has shown to preserve discriminable information without increase in computational cost, making it particularly relevant for Interactive Voice Response (IVR)-based applications.