scispace - formally typeset
J

Jean Harb

Researcher at McGill University

Publications -  12
Citations -  3957

Jean Harb is an academic researcher from McGill University. The author has contributed to research in topics: Reinforcement learning & Flexibility (engineering). The author has an hindex of 10, co-authored 11 publications receiving 2606 citations.

Papers
More filters
Posted Content

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

TL;DR: An adaptation of actor-critic methods that considers action policies of other agents and is able to successfully learn policies that require complex multi-agent coordination is presented.
Proceedings Article

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

TL;DR: In this article, an actor-critic method was used to learn multi-agent coordination policies in cooperative and competitive multi-player RL games, where agent populations are able to discover various physical and informational coordination strategies.
Posted Content

The Option-Critic Architecture

TL;DR: This paper propose a new option-critic architecture capable of learning both the internal policies and the termination conditions of options, in tandem with the policy over options, without the need to provide any additional rewards or subgoals.
Proceedings Article

The Option-Critic Architecture

TL;DR: This article propose a new option-critic architecture capable of learning both the internal policies and the termination conditions of options, in tandem with the policy over options, without the need to provide any additional rewards or subgoals.
Proceedings Article

When Waiting Is Not an Option: Learning Options With a Deliberation Cost

TL;DR: This work forms the answer to what "good" options should be in the bounded rationality framework through the notion of deliberation cost and derives practical gradient-based learning algorithms to implement this objective.