Tobias Pohlen

Researcher at RWTH Aachen University

Publications - 7

Citations - 3490

Tobias Pohlen is an academic researcher at RWTH Aachen University who has contributed to research on reinforcement learning and segmentation. The author has an h-index of 6 and has co-authored 7 publications receiving 1881 citations.

Papers

Journal Article

Grandmaster level in StarCraft II using multi-agent reinforcement learning.

Oriol Vinyals, +41 more

- 30 Oct 2019 -

Nature

TL;DR: This paper evaluates AlphaStar, an agent that uses a multi-agent reinforcement learning algorithm and has reached Grandmaster level in the real-time strategy game StarCraft II, ranking among the top 0.2% of human players.

Proceedings Article

Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes

Tobias Pohlen, +3 more

TL;DR: In this paper, a ResNet-like architecture is proposed to combine multi-scale context with pixel-level accuracy by using two processing streams within the network: one stream carries information at the full image resolution and the other stream undergoes a sequence of pooling operations to obtain robust features for recognition.

Posted Content

Observe and Look Further: Achieving Consistent Performance on Atari

Tobias Pohlen, +12 more

- 29 May 2018 -

arXiv: Learning

TL;DR: This paper proposes an algorithm that addresses three key challenges that any algorithm needs to master in order to perform well on all games: processing diverse reward distributions, reasoning over long time horizons, and exploring efficiently.

Posted Content

Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes

Tobias Pohlen, +3 more

- 24 Nov 2016 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work proposes a novel ResNet-like architecture that exhibits strong localization and recognition performance, and combines multi-scale context with pixel-level accuracy by using two processing streams within the network.

Proceedings Article

Reward learning from human preferences and demonstrations in Atari

Borja Ibarz, +5 more

TL;DR: In this article, the authors combine two approaches, learning from expert demonstrations and learning from trajectory preferences, to train a deep neural network that models the reward function, and use its predicted reward to train a DQN-based deep RL agent on 9 Atari games.
