Junyu Zhang

Researcher at Princeton University

Publications - 15

Citations - 127

Junyu Zhang is an academic researcher from Princeton University. The author has contributed to research in topics: Reinforcement learning & Markov decision process. The author has an hindex of 3, co-authored 15 publications receiving 53 citations. Previous affiliations of Junyu Zhang include National University of Singapore & University of Minnesota.

Papers

PDF

Open Access

More filters

Proceedings Article

Variational Policy Gradient Method for Reinforcement Learning with General Utilities

Junyu Zhang, +4 more

TL;DR: A new Variational Policy Gradient Theorem for RL with general utilities is derived, which establishes that the parametrized policy gradient may be obtained as the solution of a stochastic saddle point problem involving the Fenchel dual of the utility function.

...read moreread less

Posted Content

Variational Policy Gradient Method for Reinforcement Learning with General Utilities

Junyu Zhang, +4 more

- 04 Jul 2020 -

arXiv: Learning

TL;DR: In this paper, a variational Monte Carlo gradient estimation algorithm is proposed to compute the policy gradient based on sample paths, and the algorithm converges globally to the optimal policy for the general objective, though the optimization problem is nonconvex.

...read moreread less

Posted Content

Cautious Reinforcement Learning via Distributional Risk in the Dual Domain

Junyu Zhang, +3 more

- 27 Feb 2020 -

arXiv: Machine Learning

TL;DR: This work proposes a new definition of risk, which is called caution, as a penalty function added to the dual of the linear programming (LP) formulation of tabular RL, and proposes a block-coordinate augmentation of the aforementioned approach, which improves the reliability of reward accumulation without additional computation as compared to risk-neutral LP solvers.

...read moreread less

Posted Content

Cubic Regularized Newton Method for Saddle Point Models: a Global and Local Convergence Analysis

Kevin Huang, +2 more

- 22 Aug 2020 -

arXiv: Optimization and Control

TL;DR: In this article, a cubic regularized Newton (CRN) method was proposed for solving convex-concave saddle point problems, where at each iteration, a saddle point subproblem is constructed and solved, which provides a search direction for the iterate.

...read moreread less

Journal ArticleDOI

On lower iteration complexity bounds for the convex concave saddle point problems

Junyu Zhang, +3 more

- 07 Jun 2021 -

Mathematical Programming

TL;DR: In this paper, a lower bound for the complexity of finding the saddle point of a strongly convex and strongly concave saddle point problem with gradient Lipschitz constants was derived.

...read moreread less