
Showing papers on "Markov decision process published in 1968"



Journal ArticleDOI
TL;DR: In this article, two methods for computing optimal decision sequences and their cost functions in Markov renewal programs are presented: policy iteration and a linear programming formulation. A third solution technique applies to certain, but not all, of these programs, and new techniques for solving a broad class of shortest-route problems are obtained as a byproduct.
Abstract: Two methods are presented for computing optimal decision sequences and their cost functions. The first method, called 'policy iteration,' is an adaptation of an iterative scheme that is widely used for sequential decision problems. The second method is to specify a linear programming problem whose solution determines an optimal policy and its cost function. A third solution technique is shown to apply to certain, but not all, of these Markov renewal programs. As a byproduct of the development, new techniques are provided for solving a broad class of shortest-route problems. (Author)
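As a rough illustration of the policy-iteration scheme (shown here in the simpler discounted-MDP setting rather than the paper's Markov renewal programs; all transition data are invented), a minimal sketch in Python:

```python
import numpy as np

# Minimal policy iteration for a discounted MDP -- a sketch of the general
# iterative scheme; the paper's version targets Markov renewal programs.
# States, actions, rewards, and the discount factor are hypothetical.
n_states, n_actions, beta = 3, 2, 0.9
rng = np.random.default_rng(0)
P = rng.dirichlet(np.ones(n_states), size=(n_states, n_actions))  # P[s, a] is a distribution over next states
R = rng.uniform(0, 1, size=(n_states, n_actions))                 # expected one-step returns

policy = np.zeros(n_states, dtype=int)
while True:
    # Policy evaluation: solve (I - beta * P_pi) v = r_pi exactly.
    P_pi = P[np.arange(n_states), policy]
    r_pi = R[np.arange(n_states), policy]
    v = np.linalg.solve(np.eye(n_states) - beta * P_pi, r_pi)
    # Policy improvement: act greedily with respect to v.
    Q = R + beta * P @ v
    new_policy = Q.argmax(axis=1)
    if np.array_equal(new_policy, policy):
        break  # a policy stable under improvement is optimal
    policy = new_policy
print("optimal policy:", policy, "values:", v)
```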

153 citations



Journal ArticleDOI
TL;DR: In this article, the authors consider the problem of finding a policy that maximizes the expected return over a given planning horizon, where the return rate depends on both the policy and the sample path of the process.
Abstract: The system we consider may be in one of n states at any point in time and its probability law is a Markov process which depends on the policy (control) chosen. The return to the system over a given planning horizon is the integral (over that horizon) of a return rate which depends on both the policy and the sample path of the process. Our objective is to find a policy which maximizes the expected return over the given planning horizon. A necessary and sufficient condition for optimality is obtained, and a constructive proof is given that there is a piecewise constant policy which is optimal. A bound on the number of switches (points where the piecewise constant policy jumps) is obtained for the case where there are two states.
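Stated symbolically (the notation below is mine, not the authors'), the problem is:

```latex
% Objective: x(t) is the sample path of the n-state process under policy u
% over the planning horizon [0, T], and r is the policy- and path-dependent
% return rate. Notation assumed for illustration.
\max_{u} \; \mathbb{E}\!\left[ \int_{0}^{T} r\bigl(x(t), u(t)\bigr)\, dt \right]
```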

107 citations



Journal ArticleDOI
TL;DR: In this paper, a semi-Markovian decision process is defined, the concept of returns associated with this process is introduced, and the average return per unit time that the system earns in the steady state is obtained.
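For reference, the standard renewal-reward expression for the steady-state average return of a semi-Markov process (a sketch of the kind of quantity the paper derives; the notation is assumed, not the paper's):

```latex
% pi is the stationary distribution of the embedded chain, r_i the expected
% return accrued per visit to state i, and tau_i the expected holding time
% in state i.
g \;=\; \frac{\sum_{i} \pi_i \, r_i}{\sum_{i} \pi_i \, \tau_i}
```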

38 citations


Journal ArticleDOI
TL;DR: In this article, the authors consider decision processes on a denumerable state space and show that, under the assumption that always choosing the same decision at each state yields an ergodic (i.e. positive recurrent) Markov chain, all possible decision procedures are, in an appropriate sense, uniformly ergodic.
Abstract: This paper considers decision processes on a denumerable state space. At each state a finite number of decisions is allowed. The main assumption is that if one always chooses the same decision at each state the resulting Markov chain is ergodic (i.e. positive recurrent). Under this assumption it is shown that all possible decision procedures are (in an appropriate sense) uniformly ergodic.

9 citations


Journal ArticleDOI
TL;DR: The linear programming solution to Markov chain models is presented and compared with the dynamic programming solution; the elements of the simplex tableau are shown to contain information relevant to understanding the programmed system.
Abstract: Some essential elements of the Markov chain theory are reviewed, along with programming of economic models which incorporate Markovian matrices and whose objective function is the maximization of the present value of an infinite stream of income. The linear programming solution to these models is presented and compared to the dynamic programming solution. Several properties of the solution are analyzed and it is shown that the elements of the simplex tableau contain information relevant to the understanding of the programmed system. It is also shown that the model can be extended to cover, among other elements, multiprocess enterprises and the realistic cases of programming in the face of probable deterioration of the productive capacity of the system or its total destruction.
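A minimal sketch of the textbook linear-programming formulation for a discounted Markovian decision model of this kind (all data are hypothetical, and scipy's linprog stands in for the simplex computation the paper discusses):

```python
import numpy as np
from scipy.optimize import linprog

# LP formulation of a discounted Markov decision model. Variables: v[s], the
# present value of state s. Constraints:
#   v[s] >= r[s, a] + beta * sum_s' P[s, a, s'] * v[s']   for every (s, a);
# minimizing sum_s v[s] makes each constraint tight at an optimal action.
n_states, n_actions, beta = 3, 2, 0.9
rng = np.random.default_rng(1)
P = rng.dirichlet(np.ones(n_states), size=(n_states, n_actions))
R = rng.uniform(0, 1, size=(n_states, n_actions))

# Rewrite each constraint as A_ub @ v <= b_ub: (beta*P[s,a] - e_s) @ v <= -r[s,a].
A_ub, b_ub = [], []
for s in range(n_states):
    for a in range(n_actions):
        row = beta * P[s, a]
        row[s] -= 1.0
        A_ub.append(row)
        b_ub.append(-R[s, a])
res = linprog(c=np.ones(n_states), A_ub=np.array(A_ub), b_ub=np.array(b_ub),
              bounds=[(None, None)] * n_states)  # values may be any sign
v = res.x
# An optimal action at s is one whose constraint is (numerically) tight.
policy = np.argmax(R + beta * P @ v, axis=1)
print("present values:", v, "policy:", policy)
```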

8 citations



Journal ArticleDOI
TL;DR: It is shown that a two-stage stochastic program with recourse whose right-hand sides are random has optimal decision rules that are continuous and piecewise linear, but this result does not extend to programs with three or more stages.
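For context, the generic two-stage recourse model reads (notation assumed, not necessarily the paper's):

```latex
% First-stage decision x, recourse decision y, random right-hand side xi.
\min_{x \ge 0} \; c^{\top} x + \mathbb{E}_{\xi}\!\left[ Q(x, \xi) \right],
\qquad
Q(x, \xi) \;=\; \min_{y \ge 0} \left\{\, q^{\top} y \;:\; W y = \xi - T x \,\right\}
```

With only the right-hand side random, the recourse value Q(x, ξ) is piecewise linear and convex in ξ by standard parametric-LP arguments, which is consistent with the continuity and piecewise linearity of the optimal decision rules asserted above.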

8 citations


Journal ArticleDOI
TL;DR: A multistage decision problem is optimized using a new formulation of stochastic dynamic programming: one stage is modeled as a Markov decision process with an infinite number of substages, and it is shown how this process may be compressed and handled as a single stage in the larger problem.
Abstract: A multistage decision problem is optimized using a new formulation of stochastic dynamic programming. The problem optimized in this paper concerns a semiconductor production process where the transitions at each work station are stochastic. The mathematical model employs at one stage a Markov decision process with an infinite number of substages and shows how this process may be compressed and handled as one stage in the larger problem.
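A minimal sketch of the compression idea, under assumptions of mine (a discounted inner process solved by successive approximation, whose fixed-point values then serve as the one-stage return of the outer recursion; the paper's model of the semiconductor process may differ):

```python
import numpy as np

# "Compressing" an infinite-substage Markov decision process into one stage
# of a larger dynamic program: value-iterate the inner MDP to its fixed point,
# then let the outer recursion treat those values as a one-stage return.
def solve_inner_mdp(P, R, beta=0.95, tol=1e-10):
    """P[s, a, s'] are substage transition probabilities, R[s, a] substage returns."""
    v = np.zeros(P.shape[0])
    while True:
        v_new = (R + beta * P @ v).max(axis=1)  # Bellman operator
        if np.max(np.abs(v_new - v)) < tol:
            return v_new  # compressed one-stage value of each state
        v = v_new

# Hypothetical work-station model: 4 states, 3 controls per state.
rng = np.random.default_rng(2)
P = rng.dirichlet(np.ones(4), size=(4, 3))
R = rng.uniform(0, 1, size=(4, 3))
print("compressed stage values:", solve_inner_mdp(P, R))
# The outer problem then sees the whole inner process as a single stage:
# outer_value[s] = max_a ( outer_R[s, a]
#                          + sum_s' outer_P[s, a, s'] * inner_value[s'] + ... )
```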



Journal ArticleDOI
D. Detchmendy, R. Kalaba
TL;DR: A procedure for reducing the decision dimensionality in a dynamic programming calculation is presented for multi-stage decision processes in which the dimension of the decision vector is greater than the dimension of the state vector.
Abstract: A procedure for reducing the decision dimensionality in a dynamic programming calculation is presented for multi-stage decision processes for which the dimension of the decision vector is greater than the dimension of the state vector. This procedure facilitates the numerical and analytical investigations of this class of optimization problems.