scispace - formally typeset
S

Shiqing Fan

Researcher at Alibaba Group

Publications -  7
Citations -  115

Shiqing Fan is an academic researcher from Alibaba Group. The author has contributed to research in topics: Computer science & Architecture. The author has an hindex of 2, co-authored 3 publications receiving 24 citations.

Papers
More filters
Posted Content

DAPPLE: A Pipelined Data Parallel Approach for Training Large Models

TL;DR: DAPPLE, a synchronous training framework which combines data parallelism and pipeline parallelism for large DNN models, is proposed, which features a novel parallelization strategy planner to solve the partition and placement problems, and explores the optimal hybrid strategies of data and pipeline Parallelism.
Proceedings ArticleDOI

DAPPLE: a pipelined data parallel approach for training large models

TL;DR: DAPPLE as mentioned in this paper is a synchronous training framework which combines data parallelism and pipeline parallelism for large DNN models, and it features a novel parallelization strategy planner to solve the partition and placement problems.
Posted Content

Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads.

TL;DR: Auto-MAP is proposed, a framework for exploring distributed execution plans for DNN workloads, which can automatically discovering fast parallelization strategies through reinforcement learning on IR level of deep learning models and can find the optimal solution in two hours, while achieving better throughput on several NLP and convolution models.
Proceedings ArticleDOI

Efficient Pipeline Planning for Expedited Distributed DNN Training

TL;DR: Efficient, near-optimal algorithms for expediting synchronous pipeline-parallel training of modern large DNNs over arbitrary inter-GPU connectivity are designed and can accelerate training up to 157% compared with state-of-the-art designs.