scispace - formally typeset
Search or ask a question
Author

Goutam Saha

Other affiliations: Indian Institutes of Technology
Bio: Goutam Saha is an academic researcher from Indian Institute of Technology Kharagpur. The author has contributed to research in topics: Speaker recognition & Phonocardiogram. The author has an hindex of 24, co-authored 73 publications receiving 1996 citations. Previous affiliations of Goutam Saha include Indian Institutes of Technology.


Papers
More filters
Journal ArticleDOI
21 Apr 2015
TL;DR: A framework which selects good quality heart sound subseences which are artifact-free and reused the features involved in this processing in segmentation to assist interpretation of heart sound by physicians in objective analysis through record- ing in a computer is developed.
Abstract: Purpose: Digital recording of heart sounds commonly known as Phonocardiogram (PCG) signal, is a convenient primary diagnostic tool for analyzing condition of heart. Phono- cardiogram aids physicians to visualize the acoustic energies that results from mechanical aspect of cardiac activity. PCG signal cycle segmentation is an essential processing step to- wards heart sound signal analysis. Sound artifacts due to inappropriate placement of stetho- scope, body movement, cough etc. makes segmentation difficult. Artifact-free segmented heart sound cycles are convenient for physicians to interpret and it is also useful for computerized automated classification of abnormality. Methods: We have developed a framework which selects good quality heart sound subse- quences which are artifact-free and reused the features involved in this processing in segmenta- tion. In this work, we have used information contained in frequency subbands by decomposing the signal using Discrete Wavelet Packet Transform (DWPT). The algorithm identifies the parts of the signal where artifacts are prominent and it also detects major events in heart sound cycles. Results: The algorithm shows good results when tested on normal and five commonly occur - ring pathological heart sound signals. An average accuracy of 93.71% is registered for artifact- free subsequence selection process. The cycle segmentation algorithm gives an accuracy of 98.36%, 98.18% and 93.97% respectively for three databases used in the experiment. Conclusions: The work provides a solution for artifact-free segmentation of heart sound cy- cles to assist interpretation of heart sound by physicians in objective analysis through record- ing in a computer. It is also useful for development of an automated decision support system on heart sound abnormality.

6 citations

Book ChapterDOI
05 Dec 2017
TL;DR: The ASV performance with the proposed approach considerably outperformed conventional i-vector based system on publicly available speech corpora, NIST SRE 2010, especially in short duration, as required in real-world applications.
Abstract: A prime challenge in automatic speaker verification (ASV) is to improve performance with short speech segments. The variability and uncertainty of intermediate model parameters associated with state-of-the-art i-vector based ASV system, extensively increases in short duration. To compensate increased variability, we propose an adaptive approach for estimation of model parameters. The pre-estimated universal background model (UBM) parameters are used for adaptation. The speaker models i.e., i-vectors are generated with the proposed adapted parameters. The ASV performance with the proposed approach considerably outperformed conventional i-vector based system on publicly available speech corpora, NIST SRE 2010, especially in short duration, as required in real-world applications.

5 citations

Journal ArticleDOI
TL;DR: A new method that uses the inter-scale dependency between the coefficients and their parents by a Circularly Symmetric Probability Density Function related to the family of Spherically Invariant Random Processes (SIRPs) in Log Gabor Wavelet (LGW) domain and corresponding joint shrinkage estimators are derived by Maximum a Posteriori (MAP) estimation theory is introduced.

5 citations

Journal ArticleDOI
TL;DR: The proposed MKL, finds optimal kernel combination by maximizing the similarity with ideal kernel and minimizing the redundancy with other basis kernels, shows the potential of development of high accuracy CAD detection system by using easy to acquire, non-invasive PCG signal.
Abstract: Conventional machine learning has paved the way for a simple, affordable, non-invasive approach for Coronary artery disease (CAD) detection using phonocardiogram (PCG). It leaves a scope to explore improvement of performance metrics by fusion of learned representations from deep learning. In this study, we propose a novel, multiple kernel learning (MKL) for their fusion using deep embeddings transferred from pre-trained convolutional neural network (CNN). The proposed MKL, finds optimal kernel combination by maximizing the similarity with ideal kernel and minimizing the redundancy with other basis kernels. Experiments are performed on 960 PCG epochs collected from 40 CAD and 40 normal subjects. The transferred embeddings attain maximum subject-level accuracy of 89.25% with kappa of 0.7850. Later, their fusion with handcrafted features using the proposed MKL gives an accuracy of 91.19% and kappa 0.8238. The study shows the potential of development of high accuracy CAD detection system by using easy to acquire, non-invasive PCG signal.

5 citations


Cited by
More filters
Christopher M. Bishop1
01 Jan 2006
TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.
Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

10,141 citations

Journal ArticleDOI
01 May 1981
TL;DR: This chapter discusses Detecting Influential Observations and Outliers, a method for assessing Collinearity, and its applications in medicine and science.
Abstract: 1. Introduction and Overview. 2. Detecting Influential Observations and Outliers. 3. Detecting and Assessing Collinearity. 4. Applications and Remedies. 5. Research Issues and Directions for Extensions. Bibliography. Author Index. Subject Index.

4,948 citations

Journal ArticleDOI
TL;DR: In this paper, the performance of wavelet decomposition-based de-noising and wavelet filter based denoising methods are compared based on signals from mechanical defects, and the comparison result reveals that wavelet filters are more suitable and reliable to detect a weak signature of mechanical impulse-like defect signals, whereas the wavelet transform has a better performance on smooth signal detection.

1,104 citations

Journal ArticleDOI

1,008 citations

Journal ArticleDOI
TL;DR: Various procedures used in the analysis of circadian rhythms at the populational, organismal, cellular and molecular levels are reviewed.
Abstract: This article reviews various procedures used in the analysis of circadian rhythms at the populational, organismal, cellular and molecular levels. The procedures range from visual inspection of time plots and actograms to several mathematical methods of time series analysis. Computational steps are described in some detail, and additional bibliographic resources and computer programs are listed.

583 citations