Minjia Zhang

Researcher at Microsoft

Publications - 69

Citations - 1559

Minjia Zhang is an academic researcher from Microsoft. The author has contributed to research in topics: Computer science & Deep learning. The author has an hindex of 12, co-authored 45 publications receiving 737 citations. Previous affiliations of Minjia Zhang include Huazhong University of Science and Technology & Ohio State University.

Papers

PDF

Open Access

More filters

Journal ArticleDOI

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao, +386 more

- 09 Nov 2022 -

arXiv.org

TL;DR: BLOOM as discussed by the authors is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total).

...read moreread less

Proceedings ArticleDOI

Memcached Design on High Performance RDMA Capable Interconnects

Jithin Jose, +10 more

TL;DR: The design extends the existing open-source Memcached software and makes it RDMA capable and a detailed performance comparison of the Memcaches design is provided compared to unmodifiedmemcached using Sockets over RDMA and 10Gigabit Ethernet network with hardware-accelerated TCP/IP.

...read moreread less

Posted Content

ZeRO-Offload: Democratizing Billion-Scale Model Training

Jie Ren, +7 more

- 18 Jan 2021 -

arXiv: Distributed, Parallel, and Cluste...

TL;DR: ZeRO-Offload democratizes large-scale model training making it accessible to even data scientists with access to just a single GPU, and combines compute and memory efficiency with ease-of-use.

...read moreread less

Proceedings Article

Learning Intrinsic Sparse Structures within Long Short-Term Memory

Wei Wen, +8 more

TL;DR: This work aims to learn structurally-sparse Long Short-Term Memory by reducing the sizes of basic structures within LSTM units, including input updates, gates, hidden states, cell states and outputs, by proposing Intrinsic Sparse Structures (ISS) in LSTMs.

...read moreread less

Proceedings Article

DeepCPU: serving RNN-based deep learning models 10x faster

Minjia Zhang, +3 more

TL;DR: This work characterizes RNN performance and identifies low data reuse as a root cause, and develops novel techniques and an efficient search strategy to squeeze more data reuse out of this intrinsically challenging workload.

...read moreread less

Collapse