scispace - formally typeset
T

Thinh T. Doan

Researcher at Virginia Tech

Publications -  58
Citations -  799

Thinh T. Doan is an academic researcher from Virginia Tech. The author has contributed to research in topics: Rate of convergence & Optimization problem. The author has an hindex of 14, co-authored 57 publications receiving 555 citations. Previous affiliations of Thinh T. Doan include University of Illinois at Urbana–Champaign & Georgia Institute of Technology.

Papers
More filters
Posted Content

Performance of Q-learning with Linear Function Approximation: Stability and Finite-Time Analysis

TL;DR: This paper provides a finite-time bound and the convergence rate on the performance of Q-learning with linear function approximation under an assumption on the behavior policy and exploits the geometric mixing of the underlying Markov chain.
Journal ArticleDOI

Fast Convergence Rates of Distributed Subgradient Methods With Adaptive Quantization

TL;DR: This article introduces a novel quantization method, which it is shown that if the objective functions are convex or strongly convex, then using adaptive quantization does not affect the rate of convergence of the distributed subgradient methods when the communications are quantized, except for a constant that depends on the resolution of the quantizer.
Posted Content

Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning

TL;DR: This paper studies a nonlinear Stochastic Approximation (SA) algorithm under Markovian noise, and derives its finite-sample convergence bounds, and shows the finite- sample bounds of the popular Q-learning with linear function approximation algorithm for solving the RL problem.
Journal ArticleDOI

Distributed resource allocation on dynamic networks in quadratic time

TL;DR: In this article, the authors consider the problem of allocating a fixed amount of resource among nodes in a network when each node suffers a cost which is a convex function of the amount of resources allocated to it.