A survey of multi-objective sequential decision-making

doi:10.1613/JAIR.3987

Open AccessJournal ArticleDOI

A survey of multi-objective sequential decision-making

Diederik M. Roijers, +3 more

- 01 Oct 2013 -

Journal of Artificial Intelligence Resea...

- Vol. 48, Iss: 1, pp 67-113

TLDR

This article surveys algorithms designed for sequential decision-making problems with multiple objectives and proposes a taxonomy that classifies multi-objective methods according to the applicable scenario, the nature of the scalarization function, and the type of policies considered.

Abstract:

Sequential decision-making problems with multiple objectives arise naturally in practice and pose unique challenges for research in decision-theoretic planning and learning, which has largely focused on single-objective settings. This article surveys algorithms designed for sequential decision-making problems with multiple objectives. Though there is a growing body of literature on this subject, little of it makes explicit under what circumstances special methods are needed to solve multi-objective problems. Therefore, we identify three distinct scenarios in which converting such a problem to a single-objective one is impossible, infeasible, or undesirable. Furthermore, we propose a taxonomy that classifies multi-objective methods according to the applicable scenario, the nature of the scalarization function (which projects multi-objective values to scalar ones), and the type of policies considered. We show how these factors determine the nature of an optimal solution, which can be a single policy, a convex hull, or a Pareto front. Using this taxonomy, we survey the literature on multi-objective methods for planning and learning. Finally, we discuss key applications of such methods and outline opportunities for future work.

Citations

PDF

Open Access

More filters

Book ChapterDOI

Multiple criteria decision making

János Fodor, +1 more

TL;DR: In this Chapter, a decision maker (or a group of experts) trying to establish or examine fair procedures to combine opinions about alternatives related to different points of view is imagined.

...read moreread less

Posted Content

Deep Reinforcement Learning: An Overview

Yuxi Li

- 25 Jan 2017 -

arXiv: Learning

TL;DR: This work discusses core RL elements, including value function, in particular, Deep Q-Network (DQN), policy, reward, model, planning, and exploration, and important mechanisms for RL, including attention and memory, unsupervised learning, transfer learning, multi-agent RL, hierarchical RL, and learning to learn.

...read moreread less

Journal ArticleDOI

Deep Reinforcement Learning for Autonomous Driving: A Survey

B Ravi Kiran, +6 more

- 09 Feb 2021 -

IEEE Transactions on Intelligent Transpo...

TL;DR: This review summarises deep reinforcement learning algorithms, provides a taxonomy of automated driving tasks where (D)RL methods have been employed, highlights the key challenges algorithmically as well as in terms of deployment of real world autonomous driving agents, the role of simulators in training agents, and finally methods to evaluate, test and robustifying existing solutions in RL and imitation learning.

...read moreread less

Posted Content

Multi-Task Learning as Multi-Objective Optimization

Ozan Sener, +1 more

- 10 Oct 2018 -

arXiv: Learning

TL;DR: This paper cast multi-task learning as a multi-objective optimization problem, with the overall objective of finding a Pareto optimal solution, and propose an upper bound for the multiobjective loss and show that it can be optimized efficiently.

...read moreread less

Posted Content

Challenges of Real-World Reinforcement Learning

Gabriel Dulac-Arnold, +2 more

- 29 Apr 2019 -

arXiv: Learning

TL;DR: A set of nine unique challenges that must be addressed to productionize RL to real world problems are presented and an example domain that has been modified to present these challenges as a testbed for practical RL research is presented.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

A theory of goal-oriented MDPs with dead ends

Andrey Kolobov, +1 more

TL;DR: In this paper, value iteration-based and heuristic search algorithms for solving stochastic shortest path (SSP) MDPs with dead-end states are presented, and a preliminary empirical study comparing the performance of these algorithms on different MDP classes is conducted.

...read moreread less

Journal ArticleDOI

An evolutionary algorithm with advanced goal and priority specification for multi-objective optimization

Kay Chen Tan, +3 more

- 01 Jan 2003 -

Journal of Artificial Intelligence Resea...

TL;DR: An evolutionary algorithm with a new goal-sequence domination scheme for better decision support in multi-objective optimization and a dynamic sharing scheme that is simple and adaptively estimated according to the on-line population distribution without needing any a priori parameter setting is presented.

...read moreread less

Journal ArticleDOI

Multiobjective reinforcement learning for traffic signal control using vehicular ad hoc network

Duan Houli, +2 more

TL;DR: A new multiobjective control algorithm based on reinforcement learning for urban traffic signal control, named multi-RL, in which the optimization objective is selected adaptively to real-time traffic states.

...read moreread less

Journal ArticleDOI

Neural networks and reinforcement learning in control of water systems

B. Bhattacharya, +2 more

- 01 Nov 2003 -

Journal of Water Resources Planning and ...

TL;DR: Aquarius DSS is chosen as a reference model for building a controller using machine learning techniques such as artificial neural networks and reinforcement learning, where RL is used to decrease the error of the ANN-based component.

...read moreread less

Proceedings ArticleDOI

Computing Optimal Stationary Policies for Multi-Objective Markov Decision Processes

Marco A. Wiering, +1 more

TL;DR: It is proved that the CON-MODP algorithm converges to the Pareto optimal set of value functions and policies for deterministic infinite horizon discounted multi-objective Markov decision processes.

...read moreread less

Collapse

A survey of multi-objective sequential decision-making

Citations

Multiple criteria decision making

Deep Reinforcement Learning: An Overview

Deep Reinforcement Learning for Autonomous Driving: A Survey

Multi-Task Learning as Multi-Objective Optimization

Challenges of Real-World Reinforcement Learning

References

A theory of goal-oriented MDPs with dead ends

An evolutionary algorithm with advanced goal and priority specification for multi-objective optimization

Multiobjective reinforcement learning for traffic signal control using vehicular ad hoc network

Neural networks and reinforcement learning in control of water systems

Computing Optimal Stationary Policies for Multi-Objective Markov Decision Processes

Related Papers (5)

Reinforcement Learning: An Introduction

Human-level control through deep reinforcement learning

Markov Decision Processes: Discrete Stochastic Dynamic Programming

Multiobjective Reinforcement Learning: A Comprehensive Overview

Mastering the game of Go with deep neural networks and tree search