Shibo Wang

Publications - 9
Citations - 2007

Shibo Wang is an academic researcher. The author has contributed to research in topics: Computer science and Language model. The author has an h-index of 7 and has co-authored 7 publications receiving 446 citations.

Papers
Posted Content

Conformer: Convolution-augmented Transformer for Speech Recognition

TL;DR: This work proposes a convolution-augmented Transformer for speech recognition, named Conformer, which significantly outperforms previous Transformer- and CNN-based models and achieves state-of-the-art accuracies.
Proceedings Article

Conformer: Convolution-augmented Transformer for Speech Recognition

TL;DR: Conformer, as described in this paper, combines convolutional neural networks and Transformers to model both local and global dependencies of an audio sequence in a parameter-efficient way, achieving state-of-the-art accuracies.
Posted Content

Scale MLPerf-0.6 models on Google TPU-v3 Pods.

TL;DR: This work discusses the optimizations and techniques, including choice of optimizer, spatial partitioning, and weight update sharding, that are necessary to scale to 1024 TPU chips. It also identifies properties of models that make scaling them challenging, such as limited data parallelism and unscaled weights.
Posted Content

Automatic Cross-Replica Sharding of Weight Update in Data-Parallel Training

TL;DR: This paper presents an approach to automatically shard the weight update computation across replicas with efficient communication primitives and data formatting, using static analysis and transformations on the training computation graph. The approach achieves substantial speedups on typical image and language models on Cloud TPUs and requires no change to model code.