scispace - formally typeset
Book ChapterDOI

Design of a Silent Speech Interface using Facial Gesture Recognition and Electromyography

Reads0
Chats0
TLDR
The SSI discussed in this paper was built using lip region segmentation from facial images via Chan-Vese algorithm, simple feature extraction from these images, matching features to words using a Multi-Class Support Vector Machine Classifier and a text -to-speech module producing synthesized speech implemented using MATLAB.
Abstract
A Silent Speech Interface (SSI) aims to substitute natural speech production using various electronic techniques to aid the vocally challenged or impaired. Such interfaces have been experimentally developed using several technologies such as imaging of the lip, tongue and throat regions; mapping of articulatory information, direct mapping of electroencephalogram details, Brain-Computer Interfaces and lip movement tracking mechanisms. Several unimodal as well as multi-modal systems are in the experimental stages. The SSI discussed in this paper was built using lip region segmentation from facial images via Chan-Vese algorithm, simple feature extraction from these images, matching features to words using a Multi-Class Support Vector Machine Classifier and a text -to-speech module producing synthesized speech implemented using MATLAB, with a recognition accuracy of 97.5% for four words – ‘cat’, ‘dog’, ‘eat’ and ‘mum’. A multimodal system with facial electromyographic input for validation purposes has been explored.

read more

Citations
More filters
Proceedings ArticleDOI

EEG Based Brain State Classification Technique Using Support Vector Machine -A Design Approach

TL;DR: In this paper, the EEG signals are decomposed into smaller segments of signal by Time Frequency Approach (T-F) like fast Fourier transform and short time Fourier Transform (STFT).
Book ChapterDOI

Design of EEG Based Classification of Brain States Using STFT by Deep Neural Network

TL;DR: In this article, the EEG signal is used as an input source that is preprocessed and decomposed into smaller segments of the signal by Time-frequency approaches (T-F) like fast Fourier transform and short time Fourier Transform.
References
More filters
Journal ArticleDOI

Active contours without edges

TL;DR: A new model for active contours to detect objects in a given image, based on techniques of curve evolution, Mumford-Shah (1989) functional for segmentation and level sets is proposed, which can detect objects whose boundaries are not necessarily defined by the gradient.
Journal Article

Supervised Machine Learning: A Review of Classification Techniques

TL;DR: The goal of supervised learning is to build a concise model of the distribution of class labels in terms of predictor features, and the resulting classifier is then used to assign class labels to the testing instances where the values of the predictor features are known, but the value of the class label is unknown.
Patent

Method of recognizing speech using a lip image

TL;DR: In this paper, a speech recognition method is implemented with an image pickup apparatus such as a TV camera which picks up a lip image during speech and with a small computer which has a small memory capacity and is connected to the TV camera.
Proceedings ArticleDOI

Prospects for a Silent Speech Interface using Ultrasound Imaging

TL;DR: The feasibility of a silent speech interface using ultrasound imaging and lip profile video is investigated by examining the quality of line spectral frequencies (LSF) derived from the image sequences, finding that LSF's recovered from vocalized passages are compatible with the synthesis of intelligible speech.
Related Papers (5)