D
Dong Yu
Researcher at Tencent
Publications - 389
Citations - 45733
Dong Yu is an academic researcher from Tencent. The author has contributed to research in topics: Artificial neural network & Word error rate. The author has an hindex of 72, co-authored 339 publications receiving 39098 citations. Previous affiliations of Dong Yu include Peking University & Microsoft.
Papers
More filters
Journal ArticleDOI
On the localness modeling for the self-attention based end-to-end speech synthesis.
TL;DR: Experimental results indicate that the two proposed localness enhanced methods can both improve the performance of the self-attention model, especially when applied to the encoder part, and the query-specific window of Gaussian bias approach is more robust compared with the fixed relative edges.
Journal ArticleDOI
Speech Recognition Using Long-Span Temporal Patterns in a Deep Network Model
TL;DR: It is shown that word recognition accuracy can be significantly enhanced by arranging DNNs in a hierarchical structure to model long-term energy trajectories and evaluated on the 5000-word Wall Street Journal task.
Proceedings ArticleDOI
Speech recognition with prediction-adaptation-correction recurrent neural networks
TL;DR: It is found that incorporating the prediction objective and including the recurrent loop are both important to boost the performance of the PAC-RNN.
Posted Content
Audio-visual Recognition of Overlapped speech for the LRS2 dataset
Jianwei Yu,Shi-Xiong Zhang,Jian Wu,Shahram Ghorbani,Bo Wu,Shiyin Kang,Shansong Liu,Xunying Liu,Helen Meng,Dong Yu +9 more
TL;DR: In this article, a lattice-free MMI (LF-MMI) discriminative criterion was used to improve the performance of audio-visual speech recognition in the LRS2 dataset.
Journal ArticleDOI
Neural Network Based Multi-Factor Aware Joint Training for Robust Speech Recognition
Yanmin Qian,Tian Tan,Dong Yu +2 more
TL;DR: The proposed neural network-based multifactor aware joint training can be easily combined with the conventional factor-aware training which uses the explicit factors, such as i-vector, noise energy, and T60 value to obtain additional improvement.