scispace - formally typeset
D

Dong Yu

Researcher at Tencent

Publications -  389
Citations -  45733

Dong Yu is an academic researcher from Tencent. The author has contributed to research in topics: Artificial neural network & Word error rate. The author has an hindex of 72, co-authored 339 publications receiving 39098 citations. Previous affiliations of Dong Yu include Peking University & Microsoft.

Papers
More filters
Patent

Computer-implemented deep tensor neural network

TL;DR: In this article, a deep tensor neural network (DTNN) is described, wherein the DTNN is suitable for employment in a computer-implemented recognition/classification system.
Patent

Exploiting heterogeneous data in deep neural network-based speech recognition systems

TL;DR: In this article, a deep neural network (DNN) is trained using heterogeneous data, including narrowband signals and wideband signals, and subsequent to being trained, receives an input signal that can be either a wideband signal or a narrowband signal.
Proceedings ArticleDOI

Discriminative pronounciation learning using phonetic decoder and minimum-classification-error criterion

TL;DR: The new discriminative pronunciation learning technique overcomes the limitation of the traditional ways of introducing alternative pronunciations that often enlarge confusability across different lexical items and is used to improve the pronunciation-modeling component of a speech recognition system designed for mobile voice search.
Proceedings ArticleDOI

End-To-End Accent Conversion Without Using Native Utterances

TL;DR: This paper presents an end-to-end framework, which is able to conduct AC from non-native-accented utterances without using any native-acented utterances during online conversion, and can convert Hindi-acented English speech into native American English speech with high naturalness.
Proceedings ArticleDOI

Speaker-Aware Target Speaker Enhancement by Jointly Learning with Speaker Embedding Extraction

TL;DR: It is demonstrated, on large simulated noisy and far-field evaluation sets of overlapped speech signals, that the proposed approach significantly improves the speech enhancement performance compared to the baseline speaker-aware speech enhancement models.