scispace - formally typeset
S

Szu-Chen Stan Jou

Researcher at Carnegie Mellon University

Publications -  10
Citations -  397

Szu-Chen Stan Jou is an academic researcher from Carnegie Mellon University. The author has contributed to research in topics: Word error rate & Feature extraction. The author has an hindex of 7, co-authored 10 publications receiving 362 citations.

Papers
More filters
Proceedings Article

Towards Continuous Speech Recognition Using Surface Electromyography

TL;DR: This paper demonstrates how to train the phoneme-based acoustic models with carefully designed electromyographic feature extraction methods by decomposing the signal into different feature space and successfully keep the useful information while reducing the noise.
Proceedings ArticleDOI

Using word latice information for a tighter coupling in speech translation systems.

TL;DR: First experiments towards a tighter coupling between Automatic Speech Recognition (ASR) and Statistical Machine Translation (SMT) to improve the overall performance of the speech translation system are presented.
Proceedings ArticleDOI

Adaptation for Soft Whisper Recognition Using a Throat Microphone

TL;DR: Various adaptation methods applied to recognizing soft whisper recorded with a throat microphone include: maximum likelihood linear regression, feature-space adaptation, and re-training with downsampling, sigmoidal low-pass filter, or linear multivariate regression.
Proceedings ArticleDOI

Whispering Speaker Identification

TL;DR: A study of automatically identifying whispering speakers by comparing performances between normal and whispered speech mode in clean and noisy environment under matched and mismatched training conditions, and the impact of feature warping and throat microphone on noise reduction.
Proceedings ArticleDOI

Continuous Electromyographic Speech Recognition with a Multi-Stream Decoding Architecture

TL;DR: It is shown that articulatory feature (AF) classifiers can also benefit from the E4 feature, which improves the F-score of the AF classifiers from 0.492 to 0.686, and is less correlated across EMG channels and thus channel combination gains larger improvement in F- score.