Performance optimization of continuous-time Markov control processes based on performance potentials

doi:10.1080/0020772031000115641

Journal ArticleDOI

Performance optimization of continuous-time Markov control processes based on performance potentials

Tang Hao, +2 more

- 01 Jan 2003 -

International Journal of Systems Science

- Vol. 34, Iss: 1, pp 63-71

TLDR

Average-cost optimization problems for a class of continuous-time Markov control processes with a compact action set with average-cost optimality equation derived and the existence of its solution established are studied.

Abstract:

Average-cost optimization problems for a class of continuous-time Markov control processes with a compact action set have been studied The definition of a generalized average-cost Poisson equation, which can be viewed as an extension to the standard one is first given Markov performance potentials are defined as its unique solution Based on the formula of performance potentials, an average-cost optimality equation is derived and the existence of its solution is established Then, policy iteration and value iteration algorithms are proposed and their convergence discussed A numerical example for controlled closed queuing networks illustrates the application of the proposed value iteration algorithm

Citations

PDF

Open Access

More filters

Journal ArticleDOI

50 Years of international journal of systems science: a review of the past and trends for the future

Xinxin Wang, +4 more

- 11 Jun 2021 -

International Journal of Systems Science

TL;DR: The fundamental characteristics of the documents were identified from the bibliometric indicators, and features of keywords were revealed over the half century, and the future research trends in the four clusters were predicted as evidenced in the historical data.

...read moreread less

Journal ArticleDOI

Error bounds of optimization algorithms for semi-Markov decision processes

Tang Hao, +2 more

- 01 Sep 2007 -

International Journal of Systems Science

TL;DR: This work introduces an α-uniformized Markov chain (UMC) for a semi-Markov decision process (SMDP) via A α and a uniformized parameter, and derives the error bounds for a potential-based policy-iteration algorithm and a value-iterations algorithm, respectively, when there exist various calculation errors.

...read moreread less

Journal ArticleDOI

Modeling and Optimization of M/G/1-Type Queueing Networks: An Efficient Sensitivity Analysis Approach

Liang Tang, +3 more

- 09 Aug 2010 -

Mathematical Problems in Engineering

TL;DR: An online policy gradient optimization algorithm based on a single sample path is provided to avoid suffering from a “curse of dimensionality” and the asymptotic convergence properties of this algorithm are proved.

...read moreread less

Journal ArticleDOI

Modeling and optimization control of a demand-driven, conveyor-serviced production station

Hao Tang, +4 more

- 16 Jun 2015 -

European Journal of Operational Research

TL;DR: Simulation results are presented to show that by the established model and proposed optimization methods the system can achieve an optimal or suboptimal look-ahead control policy once the capacities of both the buffer and the bank are designed appropriately.

...read moreread less

Journal ArticleDOI

The optimal robust control policy for uncertain semi-Markov control processes

Tang Hao, +2 more

- 20 Oct 2005 -

International Journal of Systems Science

TL;DR: A potential-based policy iteration method is proposed in this work to search for the optimal robust control policy for SMCPs with uncertain infinitesimal transition rates that are represented as compact sets.

...read moreread less

References

PDF

Open Access

More filters

Book

Dynamic Programming and Optimal Control

Dimitri P. Bertsekas

TL;DR: The leading and most up-to-date textbook on the far-ranging algorithmic methododogy of Dynamic Programming, which can be used for optimal control, Markovian decision problems, planning and sequential decision making under uncertainty, and discrete/combinatorial optimization.

...read moreread less

Journal ArticleDOI

The Relations Among Potentials, Perturbation Analysis,and Markov Decision Processes

Xi-Ren Cao

- 01 Mar 1998 -

Discrete Event Dynamic Systems

TL;DR: This paper provides an introductory discussion for an important concept, the performance potentials of Markov processes, and its relations with perturbation analysis (PA), average-cost Markov decision processes, Poisson equations, α-potentials, the fundamental matrix, and the group inverse of the transition matrix.

...read moreread less

ReportDOI

A Lyapunov Bound for Solutions of Poisson's Equation

Peter W. Glynn

TL;DR: In this article, a Lyapunov function criterion was developed to bound the solution g to Poisson's equation for a positive recurrent Harris chain with invariant measure pi, and then applied to obtain sufficient conditions that guarantee that the solution be an element of L sub p (pi) when p = 2.

...read moreread less

Journal ArticleDOI

Technical Communique: A unified approach to Markov decision problems and performance sensitivity analysis

Xi-Ren Cao

- 01 May 2000 -

Automatica

TL;DR: A simple approach is proposed that provides a unified formulation for the performance sensitivity analysis of Markov chains and Markov decision problems with both infinite horizon average-cost and discounted performance criteria.

...read moreread less

Journal ArticleDOI

Single sample path-based optimization of Markov chains

Xi-Ren Cao

- 01 Mar 1999 -

Journal of Optimization Theory and Appli...

TL;DR: A fast algorithm is proposed, which updates the policy whenever the system reaches a particular set of states and it is proved that the algorithm converges to the true optimal policy with probability one under some conditions.

...read moreread less