scispace - formally typeset
Journal ArticleDOI

Tackling Speaking Mode Varieties in EMG-Based Speech Recognition

Reads0
Chats0
TLDR
This paper introduces multimode systems that allow seamless switching between audible and silent speech, investigates speaking mode variations, investigate different measures which quantify speaking mode differences, and presents the spectral mapping algorithm, which improves the word error rate on silent speech by up to 14.3% relative.
Abstract
An electromyographic (EMG) silent speech recognizer is a system that recognizes speech by capturing the electric potentials of the human articulatory muscles, thus enabling the user to communicate silently. After having established a baseline EMG-based continuous speech recognizer, in this paper, we investigate speaking mode variations, i.e., discrepancies between audible and silent speech that deteriorate recognition accuracy. We introduce multimode systems that allow seamless switching between audible and silent speech, investigate different measures which quantify speaking mode differences, and present the spectral mapping algorithm, which improves the word error rate (WER) on silent speech by up to 14.3% relative. Our best average silent speech WER is 34.7%, and our best WER on audibly spoken speech is 16.8%.

read more

Citations
More filters
Proceedings ArticleDOI

Lipreading with long short-term memory

TL;DR: Lipreading, i.e. speech recognition from visual-only recordings of a speaker's face, can be achieved with a processing pipeline based solely on neural networks, yielding significantly better accuracy than conventional methods.
Journal ArticleDOI

Biosignal-Based Spoken Communication: A Survey

TL;DR: An overview of the various modalities, research approaches, and objectives for biosignal-based spoken communication is given.
Journal ArticleDOI

Direct Speech Reconstruction From Articulatory Sensor Data by Machine Learning

TL;DR: This work promises to lead to a technology that truly will give people whose larynx has been removed their voices back, with the best results obtained for a silent-speech system without a restricted vocabulary and with an unobtrusive device that delivers audio in close to real time.
Journal ArticleDOI

Silent Speech Interfaces for Speech Restoration: A Review

TL;DR: A number of challenges remain to be addressed in future research before SSIs can be promoted to real-world applications, and future SSIs will improve the lives of persons with severe speech impairments by restoring their communication capabilities.
Journal ArticleDOI

A silent speech system based on permanent magnet articulography and direct synthesis

TL;DR: Results show that it is possible to reconstruct speech from articulator movements captured by an unobtrusive technique without an intermediate recognition step, and the SSI is capable of producing speech of sufficient intelligibility and naturalness that the speaker is clearly identifiable.
References
More filters
Journal ArticleDOI

The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms

TL;DR: In this article, the use of the fast Fourier transform in power spectrum analysis is described, and the method involves sectioning the record and averaging modified periodograms of the sections.
BookDOI

Electromyography. Physiology, engineering and non invasive applications

TL;DR: This work focuses on the development of models for Surface EMG Signal Generation based on the principles of Structure--Based SEMG models, which were developed in the context of motor control and Muscle Contraction.
Journal ArticleDOI

Neural modeling and imaging of the cortical interactions underlying syllable production.

TL;DR: The model is a neural network whose components correspond to regions of the cerebral cortex and cerebellum, including premotor, motor, auditory, and somatosensory cortical areas, and its ability to account for compensation to lip and jaw perturbations during speech is verified.
Journal ArticleDOI

Electromechanical delay in human skeletal muscle under concentric and eccentric contractions.

TL;DR: It is suggested that the time required to stretch the series elastic component (SEC) represents the major portion of the measured delay and that during eccentric muscle activity the SEC is in a more favorable condition for rapid force development.
Journal ArticleDOI

Physiology and Mathematics of Myoelectric Signals

TL;DR: The myoelectric signal is the electrical manifestation of the neuromuscular activation associated with a contracting muscle and the lack of a proper description of the ME signal is probably the greatest single factor which has hampered the development of electromyography (EMG) into a precise discipline.
Related Papers (5)