Towards a Practical Silent Speech Interface Based on Vocal Tract Imaging

Bruce Denby, +9 more

- pp 89-94

Abstract:

The paper describes advances in the development of an ultrasound silent speech interface for use in silent communications applications or as a speaking aid for persons who have undergone a laryngectomy. It reports some first steps towards making such a device lightweight, portable, interactive, and practical to use. Simple experimental tests of an interactive silent speech interface for everyday applications are described. Possible future improvements, including extension to continuous speech and real-time operation, are discussed.

Citations

Journal Article

Updating the Silent Speech Challenge benchmark with deep learning

Yan Ji, +5 more

- 01 Apr 2018 -

Speech Communication

TL;DR: In this paper, the authors present new results in which a 2010 benchmark study, called the Silent Speech Challenge, is updated with a Deep Learning strategy, using the same input features and decoding strategy as in the original Challenge article.

Journal Article

Convolutional neural network-based automatic classification of midsagittal tongue gestural targets using B-mode ultrasound images

Kele Xu, +3 more

- 09 Jun 2017 -

Journal of the Acoustical Society of Ame...

TL;DR: Speaker-dependent and speaker-independent tongue gestural target classification experiments are conducted, and the CNN-based method achieves state-of-the-art performance even though no pre-training of the CNN was carried out.

Journal Article

EchoWhisper: Exploring an Acoustic-based Silent Speech Interface for Smartphone Users

Yang Gao, +4 more

TL;DR: EchoWhisper is proposed as a novel, user-friendly, smartphone-based silent speech interface that takes advantage of the micro-Doppler effect of the acoustic wave resulting from mouth and tongue movements, assessing the acoustic features of beamformed reflected echoes captured by the smartphone's dual microphones.

Proceedings Article

DNN-based Acoustic-to-Articulatory Inversion using Ultrasound Tongue Imaging

Dagoberto Porras, +2 more

TL;DR: This paper implements several different Deep Neural Networks to estimate the articulatory information from the acoustic signal, and shows that CW-SSIM is the most useful error measure in the UTI context.

Journal Article

Non-Invasive Silent Phoneme Recognition Using Microwave Signals

Peter Birkholz, +3 more

- 01 Dec 2018 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: Electromagnetic transmission and reflection measurements of the vocal tract have great potential for future silent-speech interfaces, and are suggested to be a viable alternative to established methods.

References

The HTK book

Steve Young, +4 more

TL;DR: The Fundamentals of HTK: General Principles of HMMs, Recognition and Viterbi Decoding, and Continuous Speech Recognition.

Proceedings Article

Julius --- An Open Source Real-Time Large Vocabulary Recognition Engine

Akinobu Lee, +2 more

TL;DR: EUROSPEECH2001: the 7th European Conference on Speech Communication and Technology, September 3-7, 2001, Aalborg, Denmark.

Journal Article

Silent speech interfaces

Bruce Denby, +5 more

- 01 Apr 2010 -

Speech Communication

TL;DR: The article first outlines the emergence of the silent speech interface from the fields of speech production, automatic speech processing, speech pathology research, and telecommunications privacy issues, and then follows with a presentation of demonstrator systems based on seven different types of technologies.

Journal Article

Development of a silent speech interface driven by ultrasound and optical images of the tongue and lips

Thomas Hueber, +5 more

- 01 Apr 2010 -

Speech Communication

TL;DR: A segmental vocoder driven by ultrasound and optical images of the tongue and lips for a "silent speech interface" application, usable either by a laryngectomized patient or for silent communication.

Acquisition of Ultrasound, Video and Acoustic Speech Data for a Silent-Speech Interface Application

Thomas Hueber, +3 more

TL;DR: Synchronous acquisition of high-speed multimodal speech data, composed of ultrasound and optical images of the vocal tract together with the acoustic speech signal, for a silent speech interface is addressed.