scispace - formally typeset
V

Vivek S. Borkar

Researcher at Indian Institute of Technology Bombay

Publications -  394
Citations -  13959

Vivek S. Borkar is an academic researcher from Indian Institute of Technology Bombay. The author has contributed to research in topics: Markov chain & Stochastic approximation. The author has an hindex of 48, co-authored 370 publications receiving 12622 citations. Previous affiliations of Vivek S. Borkar include University of California, Berkeley & Massachusetts Institute of Technology.

Papers
More filters
Journal ArticleDOI

A unified framework for hybrid control: model and optimal control theory

TL;DR: This work introduces a mathematical model of hybrid systems as interacting collections of dynamical systems, evolving on continuous-variable state spaces and subject to continuous controls and discrete transitions, and develops a theory for synthesizing hybrid controllers for hybrid plants in all optimal control framework.
Book

Stochastic Approximation: A Dynamical Systems Viewpoint

TL;DR: In this article, the authors present a convergence analysis for lock-in probability, stability criteria, and synchronous schemes with different timescales, and a limit theorem for fluctuations.
Proceedings Article

Manufacturing consent

TL;DR: An algorithm for this optimization problem, as well as a greedy scheme with some performance guarantees for a variant of the problem that seeks to minimize a simpler objective are proposed.
Journal ArticleDOI

Discrete-time controlled Markov processes with average cost criterion: a survey

TL;DR: A survey of the average cost control problem for discrete-time Markov processes can be found in this paper, where the authors have attempted to put together a comprehensive account of the considerable research on this problem over the past three decades.
Journal ArticleDOI

The O.D. E. Method for Convergence of Stochastic Approximation and Reinforcement Learning

TL;DR: It is shown here that Stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE, which implies convergence of the algorithm.