scispace - formally typeset
Search or ask a question
Proceedings ArticleDOI

Automatic Multimodal Heart Disease Classification using Phonocardiogram Signal

TL;DR: In this paper, the authors proposed an integrated automatic multimodal heart disease classification (AMHDC) system using the Phonocardiogram (PCG) signal using pre-processing techniques such as Data Normalization and Data Augmentation.
Abstract: Heart diseases are considered the leading cause of death globally. Early diagnosis of disease may help give appropriate prescribing medicines, which may help control and reduce conditions. The current clinical diagnosis methods such as Electrocardiograms, computed tomography, echocardiogram, Magnetic Resonance Imaging, etc. provide valuable information for diagnosis and treatment. However, these techniques are time-intensive, operator-dependent, and expensive. In this paper, we propose a low-cost solution real-time solution to diagnose heart diseases. We suggest an integrated automatic multimodal heart disease classification (AMHDC) system using the Phonocardiogram (PCG) signal. For this purpose, first, we have developed an advanced fusion method using pre-processing techniques such as Data Normalization and Data Augmentation. Secondly, we have extracted the spectrograms from heart sound and used them as features and images for signal and image processing. Finally, we created a real-time integrated Convolutional Neural Network (CNN) model for high-performance heart disease classification. The results show our model outperformed the state-of-art research, whose accuracy is 89.7%., while our model reported accuracy of 93% for audio and 96% for image-based approach.
Citations
More filters
Proceedings ArticleDOI
17 Dec 2022
TL;DR: In this paper , an integrated solution combines signal filtering, variable construction (feature engineering) and multidimensional data summarization, for a tighter and more effective integration of PCA and K-means clustering.
Abstract: We discuss our progress towards solving a challenging biomedical problem: identifying similar patterns among multiple physiological nerve signals hidden in high throughput data, collected from micro electrical sensors implanted in several animal organs. The problem is difficult because patterns come as spikes within millisecond time-windows, data sets have high dimensionality and there is background electrical noise. A previous analytic system discovers patterns combining PCA dimensionality reduction and K-means clustering, which is slow and misses important patterns hidden by noise. Moreover, it requires reading the data set several times and it requires multiple languages and tools. With such limitations in mind, we present an improved, integrated system that effectively allows the discovery of more accurate patterns, with automated algorithm parameter tuning, by learning model parameters incrementally exploiting summarization. Our integrated solution combines signal filtering, variable construction (feature engineering) and multidimensional data summarization, for a tighter and more effective integration of PCA and K-means clustering. We present preliminary experiments on signals collected from key nerves in a rat. We show our method discovers more patterns in larger time-windows, with better noise filtering, taking less time. In the future, we plan to link signal patterns to specific physiological functions, paving the way for innovative medical treatment via nerve stimulation.
References
More filters
Book
01 Jan 1995
TL;DR: In this article, the authors present a general approach and the Kernel Method for reduced interference in the representation of signal signals, which is based on the Wigner distribution and the characteristic function operator.
Abstract: 1. The Time and Frequency Description of Signals. 2. Instantaneous Frequency and the Complex Signal. 3. The Uncertainty Principle. 4. Densities and Characteristic Functions. 5. The Need for Time-Frequency Analysis. 6. Time-Frequency Distributions: Fundamental Ideas. 7. The Short-Time Fourier Transform. 8. The Wigner Distribution. 9. General Approach and the Kernel Method. 10. Characteristic Function Operator Method. 11. Kernel Design for Reduced Interference. 12. Some Distributions. 13. Further Developments. 14. Positive Distributions Satisfying the Marginals. 15. The Representation of Signals. 16. Density of a Single Variable. 17. Joint Representations for Arbitrary Variables. 18. Scale. 19. Joint Scale Representations. Bibliography. Index.

2,951 citations

Proceedings ArticleDOI
01 Jan 2015
TL;DR: A brief overview of the librosa library's functionality is provided, along with explanations of the design goals, software development practices, and notational conventions.
Abstract: This document describes version 0.4.0 of librosa: a Python pack- age for audio and music signal processing. At a high level, librosa provides implementations of a variety of common functions used throughout the field of music information retrieval. In this document, a brief overview of the library's functionality is provided, along with explanations of the design goals, software development practices, and notational conventions.

1,793 citations

Proceedings ArticleDOI
03 Nov 2014
TL;DR: A taxonomy of urban sounds and a new dataset, UrbanSound, containing 27 hours of audio with 18.5 hours of annotated sound event occurrences across 10 sound classes are presented.
Abstract: Automatic urban sound classification is a growing area of research with applications in multimedia retrieval and urban informatics. In this paper we identify two main barriers to research in this area - the lack of a common taxonomy and the scarceness of large, real-world, annotated data. To address these issues we present a taxonomy of urban sounds and a new dataset, UrbanSound, containing 27 hours of audio with 18.5 hours of annotated sound event occurrences across 10 sound classes. The challenges presented by the new dataset are studied through a series of experiments using a baseline classification system.

954 citations

Proceedings ArticleDOI
12 Nov 2015
TL;DR: The model outperforms baseline implementations relying on mel-frequency cepstral coefficients and achieves results comparable to other state-of-the-art approaches.
Abstract: This paper evaluates the potential of convolutional neural networks in classifying short audio clips of environmental sounds. A deep model consisting of 2 convolutional layers with max-pooling and 2 fully connected layers is trained on a low level representation of audio data (segmented spectrograms) with deltas. The accuracy of the network is evaluated on 3 public datasets of environmental and urban recordings. The model outperforms baseline implementations relying on mel-frequency cepstral coefficients and achieves results comparable to other state-of-the-art approaches.

742 citations

Journal ArticleDOI
TL;DR: A convolutional neural network algorithm is implemented for the automated detection of a normal and MI ECG beats (with noise and without noise) and can accurately detect the unknown ECG signals even with noise.

645 citations