scispace - formally typeset
M

Michiel Bacchiani

Researcher at Google

Publications -  91
Citations -  4876

Michiel Bacchiani is an academic researcher from Google. The author has contributed to research in topics: Acoustic model & Word error rate. The author has an hindex of 34, co-authored 86 publications receiving 3970 citations. Previous affiliations of Michiel Bacchiani include IBM & Boston University.

Papers
More filters
Proceedings ArticleDOI

State-of-the-Art Speech Recognition with Sequence-to-Sequence Models

TL;DR: In this article, the authors explore a variety of structural and optimization improvements to the Listen, Attend, and Spell (LAS) encoder-decoder architecture, which significantly improves performance.
Proceedings ArticleDOI

Generation of large-scale simulated utterances in virtual rooms to train deep-neural networks for far-field speech recognition in Google Home

TL;DR: The structure and application of an acoustic room simulator to generate large-scale simulated data for training deep neural networks for far-field speech recognition and performance is evaluated using a factored complex Fast Fourier Transform (CFFT) acoustic model introduced in earlier work.
Journal ArticleDOI

Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition

TL;DR: This paper introduces a neural network architecture, which performs multichannel filtering in the first layer of the network, and shows that this network learns to be robust to varying target speaker direction of arrival, performing as well as a model that is given oracle knowledge of the true target Speaker direction.
Posted Content

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

TL;DR: This document outlines the underlying design of Lingvo and serves as an introduction to the various pieces of the framework, while also offering examples of advanced features that showcase the capabilities of the Framework.
Proceedings ArticleDOI

Acoustic Modeling for Google Home

TL;DR: The technical and system building advances made to the Google Home multichannel speech recognition system, which was launched in November 2016, result in a reduction of WER of 8-28% relative to the current production system.