Yanzhang He

Researcher at Google

Publications - 54
Citations - 2161

Yanzhang He is an academic researcher from Google. The author has contributed to research in topics: Computer science & Language model. The author has an h-index of 14 and has co-authored 48 publications receiving 1248 citations. Previous affiliations of Yanzhang He include Ohio State University.

Papers
Proceedings Article

Streaming End-to-end Speech Recognition for Mobile Devices

TL;DR: This work describes efforts at building an E2E speech recognizer using a recurrent neural network transducer and finds that the proposed approach can outperform a conventional CTC-based model in terms of both latency and accuracy.
Posted Content

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

TL;DR: This document outlines the underlying design of Lingvo and serves as an introduction to the various pieces of the framework, while also offering examples of advanced features that showcase the capabilities of the framework.
Proceedings Article

Two-Pass End-to-End Speech Recognition

TL;DR: In this paper, two-pass automatic speech recognition (ASR) models are used to perform streaming on-device ASR to generate a text representation of an utterance captured in audio data.
Proceedings Article

Towards Fast and Accurate Streaming End-To-End ASR

TL;DR: This work proposes to reduce the E2E model’s latency by extending the RNN-T endpointer (RNN-T EP) model with additional early and late penalties, and achieves an 8.0% relative word error rate (WER) reduction and a 130 ms 90th-percentile latency reduction over [2] on a Voice Search test set.