scispace - formally typeset
P

Ping Li

Researcher at Baidu

Publications -  144
Citations -  1410

Ping Li is an academic researcher from Baidu. The author has contributed to research in topics: Computer science & Artificial neural network. The author has an hindex of 14, co-authored 119 publications receiving 795 citations. Previous affiliations of Ping Li include Rutgers University.

Papers
More filters
Proceedings Article

Gradient Hard Thresholding Pursuit for Sparsity-Constrained Optimization

TL;DR: This paper generalizes HTP from compressive sensing to a generic problem setup of sparsity-constrained convex optimization and proves that the proposed algorithm enjoys the strong guarantees analogous to HTP in terms of rate of convergence and parameter estimation accuracy.
Posted Content

Distributed Hierarchical GPU Parameter Server for Massive Scale Deep Learning Ads Systems

TL;DR: In this paper, a distributed GPU hierarchical parameter server for massive scale deep learning ad systems is proposed, which utilizes GPU High Bandwidth Memory, CPU main memory and SSD as 3-layer hierarchical storage.
Posted Content

A Tight Bound of Hard Thresholding

TL;DR: A novel stochastic algorithm is presented which performs hard thresholding in each iteration, hence ensuring such parsimonious solutions and proves the {\em global linear convergence} for a number of prevalent statistical models under mild assumptions, even though the problem turns out to be non-convex.
Proceedings ArticleDOI

End-to-end Deep Reinforcement Learning Based Coreference Resolution.

TL;DR: This paper introduces an end-to-end reinforcement learning based coreference resolution model to directly optimize coreference evaluation metrics and introduces maximum entropy regularization for adequate exploration to prevent the model from prematurely converging to a bad local optimum.
Proceedings Article

Online low-rank subspace clustering by basis dictionary pursuit

TL;DR: In this article, a novel online implementation of low-rank representation (LRR) was proposed, which reduces the memory cost from O(n2) to O(pd, with p being the ambient dimension and d being some estimated rank (d ≤ p ≪ n).