Vijayaditya Peddinti

Researcher at Johns Hopkins University

Publications - 25

Citations - 5069

Vijayaditya Peddinti is an academic researcher from Johns Hopkins University. The author has contributed to research in topics: Time delay neural network & Word error rate. The author has an hindex of 18, co-authored 25 publications receiving 3782 citations. Previous affiliations of Vijayaditya Peddinti include Google & IBM.

Papers

PDF

Open Access

More filters

Proceedings ArticleDOI

Audio augmentation for speech recognition.

Tom Ko, +3 more

TL;DR: This paper investigates audio-level speech augmentation methods which directly process the raw signal, and presents results on 4 different LVCSR tasks with training data ranging from 100 hours to 1000 hours, to examine the effectiveness of audio augmentation in a variety of data scenarios.

...read moreread less

Proceedings ArticleDOI

A time delay neural network architecture for efficient modeling of long temporal contexts.

Vijayaditya Peddinti, +2 more

TL;DR: This paper proposes a time delay neural network architecture which models long term temporal dependencies with training times comparable to standard feed-forward DNNs and uses sub-sampling to reduce computation during training.

...read moreread less

Proceedings ArticleDOI

Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI.

Daniel Povey, +7 more

TL;DR: A method to perform sequencediscriminative training of neural network acoustic models without the need for frame-level cross-entropy pre-training is described, using the lattice-free version of the maximum mutual information (MMI) criterion: LF-MMI.

...read moreread less

Proceedings ArticleDOI

A study on data augmentation of reverberant speech for robust speech recognition

Tom Ko, +4 more

TL;DR: It is found that the performance gap between using simulated and real RIRs can be eliminated when point-source noises are added, and the trained acoustic models not only perform well in the distant- talking scenario but also provide better results in the close-talking scenario.

...read moreread less

Posted Content

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

Jonathan Shen, +90 more

- 21 Feb 2019 -

arXiv: Learning

TL;DR: This document outlines the underlying design of Lingvo and serves as an introduction to the various pieces of the framework, while also offering examples of advanced features that showcase the capabilities of the Framework.

...read moreread less