scispace - formally typeset
D

Dong Yu

Researcher at Tencent

Publications -  389
Citations -  45733

Dong Yu is an academic researcher from Tencent. The author has contributed to research in topics: Artificial neural network & Word error rate. The author has an hindex of 72, co-authored 339 publications receiving 39098 citations. Previous affiliations of Dong Yu include Peking University & Microsoft.

Papers
More filters
Proceedings ArticleDOI

Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT.

TL;DR: Experimental results demonstrate the effectiveness of the proposed end-to-end framework for polyphone disambiguation and the semantic features extracted by BERT can greatly enhance the performance.
Proceedings ArticleDOI

Pitchnet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network

TL;DR: The proposed Pitch-Net added an adversarially trained pitch regression network to enforce the encoder network to learn pitch invariant phoneme representation, and a separate module to feed pitch extracted from the source audio to the decoder network.
Proceedings Article

Unscented Transform with Online Distortion Estimation for HMM Adaptation

TL;DR: The new JAC-UT method differentiates itself from other UT-based approaches in that it combines the online noise and channel distortion estimation and model parameter adaptation in a unified UT framework.
PatentDOI

Speaker adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation

TL;DR: In this paper, a computer-implemented method is provided for training a hidden trajectory model, of a speech recognition system, which generates Vocal Tract Resonance (VTR) targets.
Proceedings ArticleDOI

Boundary Discriminative Large Margin Cosine Loss for Text-independent Speaker Verification

TL;DR: This work proposes an enhanced LMCL named boundary discriminative LMCL (BD-LMCL) to emphasize the discriminating information inherited in the speaker boundaries, where those samples close to the boundaries are dynamically selected using top-k zero-one loss.