scispace - formally typeset
D

Dong Yu

Researcher at Tencent

Publications -  389
Citations -  45733

Dong Yu is an academic researcher from Tencent. The author has contributed to research in topics: Artificial neural network & Word error rate. The author has an hindex of 72, co-authored 339 publications receiving 39098 citations. Previous affiliations of Dong Yu include Peking University & Microsoft.

Papers
More filters

Chapter 8: Deep Neural Network Sequence-Discriminative Training

Dong Yu, +1 more
TL;DR: This chapter introduces the sequence-discriminative training techniques that match better to the problem, and describes the popular maximum mutual information (MMI), boosted MMI, minimum phone error (MPE), and minimum Bayes risk (MBR) training criteria.
Proceedings Article

Towards High-Accuracy Low-Cost Noisy Robust Speech Recognition Exploiting Structured Model

TL;DR: This paper presents the recent study on using this structured model of physical distortion for robust automatic speech recognition, and shows that online updating all the noise and channel distortion parameters is critical to the success of the proposed JAC algorithms.
Proceedings Article

Commute UX: Telephone Dialog System for Location-based Services.

TL;DR: The strategies employed by the system were evaluated through user studies and a system employing the best strategies was deployed and evaluated through an analysis of 700 calls over a two month period.
Posted Content

Synthesising Expressiveness in Peking Opera via Duration Informed Attention Network

TL;DR: The proposed musical note based system produces comparable singing voice in Peking opera with expressiveness in various aspects and gives extra flexibility in data collection for Peking Opera singing synthesis.
Journal ArticleDOI

A Speech-Centric Perspective for Human-Computer Interface: A Case Study

Li Deng, +1 more
TL;DR: This paper presents a case study of a prototype system, called MapPointS, which is a speech-centric multimodal map-query application for North America, which provides rich functionalities that allow users to obtain map-related information through speech, text, and pointing devices.