scispace - formally typeset
A

Alejandro Acero

Researcher at Microsoft

Publications -  282
Citations -  9437

Alejandro Acero is an academic researcher from Microsoft. The author has contributed to research in topics: Speech processing & Noise. The author has an hindex of 53, co-authored 282 publications receiving 9339 citations. Previous affiliations of Alejandro Acero include Carnegie Mellon University.

Papers
More filters
Proceedings ArticleDOI

Environmental robustness in automatic speech recognition

TL;DR: Initial efforts to make Sphinx, a continuous-speech speaker-independent recognition system, robust to changes in the environment are reported, and two novel methods based on additive corrections in the cepstral domain are proposed.
Patent

Method and system of runtime acoustic unit selection for speech synthesis

TL;DR: In this article, a concatenative speech synthesis system and method which produces a more natural sounding speech is presented. But this system requires multiple instances of each acoustic unit which can be used to generate a speech waveform representing an linguistic expression and is limited to a robust representation of the highest probability instances.
Proceedings ArticleDOI

Robust speech recognition by normalization of the acoustic space

TL;DR: Several algorithms are presented that increase the robustness of SPHINX, the CMU (Carnegie Mellon University) continuous-speech speaker-independent recognition systems, by normalizing the acoustic space via minimization of the overall VQ distortion.
Patent

Method and apparatus for multi-sensory speech enhancement

TL;DR: In this paper, a method and system using an alternative sensor signal received from a sensor other than an air conduction microphone to estimate a clean speech value is presented, which uses either the alternative sensor signals alone, or in conjunction with the air-conduction microphone signal.
PatentDOI

Combined speech and alternate input modality to a mobile device

TL;DR: In this article, both speech and alternate modality inputs are used in inputting information spoken into a mobile device to perform sequential commitment of words in a speech recognition result, which can be used to perform Sequential Commitment of Words in a Speech Recognition result.