
Showing papers by "Thomas Unterthiner" published in 2019


Proceedings Article
01 Jan 2019
TL;DR: In this paper, RUDDER, a reinforcement learning approach for delayed rewards in finite Markov decision processes (MDPs), is proposed; it aims at making the expected future rewards zero, which simplifies Q-value estimation to computing the mean of the immediate reward.
Abstract: We propose RUDDER, a novel reinforcement learning approach for delayed rewards in finite Markov decision processes (MDPs). In MDPs the Q-values are equal to the expected immediate reward plus the expected future rewards. The latter are related to bias problems in temporal difference (TD) learning and to high variance problems in Monte Carlo (MC) learning. Both problems are even more severe when rewards are delayed. RUDDER aims at making the expected future rewards zero, which simplifies Q-value estimation to computing the mean of the immediate reward. We propose the following two new concepts to push the expected future rewards toward zero. (i) Reward redistribution that leads to return-equivalent decision processes with the same optimal policies and, when optimal, zero expected future rewards. (ii) Return decomposition via contribution analysis which transforms the reinforcement learning task into a regression task at which deep learning excels. On artificial tasks with delayed rewards, RUDDER is significantly faster than MC and exponentially faster than Monte Carlo Tree Search (MCTS), TD(λ), and reward shaping approaches. At Atari games, RUDDER on top of a Proximal Policy Optimization (PPO) baseline improves the scores, which is most prominent at games with delayed rewards.
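The TL;DR compresses the identity the abstract builds on. Written out, with notation chosen here for illustration (the paper works with finite-horizon MDPs and its own symbols), the decomposition and RUDDER's goal look roughly like this:

% Q-value as expected immediate reward plus expected future rewards
% (illustrative notation, not copied from the paper):
\[
  q^{\pi}(s_t, a_t)
  \;=\;
  \underbrace{\mathbb{E}_{\pi}\!\left[ R_{t+1} \mid s_t, a_t \right]}_{\text{expected immediate reward}}
  \;+\;
  \underbrace{\mathbb{E}_{\pi}\!\Big[ \sum_{k \ge t+2} R_{k} \;\Big|\; s_t, a_t \Big]}_{\text{expected future rewards}}
\]
% Reward redistribution replaces R by a return-equivalent reward \tilde{R}
% for which the second term is (ideally) zero, so that
% q^{\pi}(s_t, a_t) \approx \mathbb{E}_{\pi}[ \tilde{R}_{t+1} \mid s_t, a_t ],
% i.e. Q-value estimation reduces to estimating the mean of the redistributed
% immediate reward.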

111 citations


Book ChapterDOI
07 Mar 2019
TL;DR: In this paper, the authors show how single neurons can be interpreted as classifiers which determine the presence or absence of pharmacophore- or toxicophore-like structures, thereby generating new insights and relevant knowledge for chemistry, pharmacology and biochemistry.
Abstract: Without any means of interpretation, neural networks that predict molecular properties and bioactivities are merely black boxes. We will unravel these black boxes and will demonstrate approaches to understand the learned representations which are hidden inside these models. We show how single neurons can be interpreted as classifiers which determine the presence or absence of pharmacophore- or toxicophore-like structures, thereby generating new insights and relevant knowledge for chemistry, pharmacology and biochemistry. We further discuss how these novel pharmacophores/toxicophores can be determined from the network by identifying the most relevant components of a compound for the prediction of the network. Additionally, we propose a method which can be used to extract new pharmacophores from a model and will show that these extracted structures are consistent with literature findings. We envision that having access to such interpretable knowledge is a crucial aid in the development and design of new pharmaceutically active molecules, and helps to investigate and understand failures and successes of current methods.
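As a rough illustration of the "single neurons as classifiers" idea, the sketch below scores how well each hidden unit's activation separates compounds that contain a given substructure from those that do not; the activation matrix, the substructure labels, and the helper name are placeholders chosen here, not part of the chapter.

# Hedged sketch: rank hidden neurons as detectors for a known substructure
# (e.g. a pharmacophore/toxicophore identified by a substructure search).
# `activations` (n_compounds x n_neurons) and `has_substructure` (n_compounds,)
# are assumed to come from a trained model and a substructure match; random
# stand-ins are used below.
import numpy as np
from sklearn.metrics import roc_auc_score

def neuron_substructure_auc(activations: np.ndarray,
                            has_substructure: np.ndarray) -> np.ndarray:
    """AUC of each neuron's activation used as a classifier for the substructure."""
    return np.array([
        roc_auc_score(has_substructure, activations[:, j])
        for j in range(activations.shape[1])
    ])

# Toy usage with random placeholders.
rng = np.random.default_rng(0)
acts = rng.normal(size=(200, 32))         # hidden activations per compound
labels = rng.integers(0, 2, size=200)     # 1 if the compound contains the substructure
aucs = neuron_substructure_auc(acts, labels)
print("best candidate neuron:", int(aucs.argmax()), "AUC:", float(aucs.max()))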

58 citations


Posted Content
TL;DR: It is shown how single neurons can be interpreted as classifiers which determine the presence or absence of pharmacophore- or toxicophore-like structures, thereby generating new insights and relevant knowledge for chemistry, pharmacology and biochemistry.
Abstract: Without any means of interpretation, neural networks that predict molecular properties and bioactivities are merely black boxes. We will unravel these black boxes and will demonstrate approaches to understand the learned representations which are hidden inside these models. We show how single neurons can be interpreted as classifiers which determine the presence or absence of pharmacophore- or toxicophore-like structures, thereby generating new insights and relevant knowledge for chemistry, pharmacology and biochemistry. We further discuss how these novel pharmacophores/toxicophores can be determined from the network by identifying the most relevant components of a compound for the prediction of the network. Additionally, we propose a method which can be used to extract new pharmacophores from a model and will show that these extracted structures are consistent with literature findings. We envision that having access to such interpretable knowledge is a crucial aid in the development and design of new pharmaceutically active molecules, and helps to investigate and understand failures and successes of current methods.

51 citations


Book ChapterDOI
01 Jan 2019
TL;DR: This work suggests the use of a tiered approach, whose main component is a semantic segmentation model, over an end-to-end approach for an autonomous driving system.
Abstract: Deep neural networks are an increasingly important technique for autonomous driving, especially as a visual perception component. Deployment in a real environment necessitates the explainability and inspectability of the algorithms controlling the vehicle. Such insightful explanations are relevant not only for legal issues and insurance matters but also for engineers and developers in order to achieve provable functional quality guarantees. This applies to all scenarios where the results of deep networks control potentially life threatening machines. We suggest the use of a tiered approach, whose main component is a semantic segmentation model, over an end-to-end approach for an autonomous driving system. In order for a system to provide meaningful explanations for its decisions it is necessary to give an explanation about the semantics that it attributes to the complex sensory inputs that it perceives. In the context of high-dimensional visual input this attribution is done as a pixel-wise classification process that assigns an object class to every pixel in the image. This process is called semantic segmentation.
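The pixel-wise classification step described above reduces to a per-pixel argmax over class scores. The sketch below illustrates only that step; the class list and the random score tensor standing in for a segmentation network's output are placeholders, not the chapter's actual model.

# Hedged sketch of pixel-wise classification: a segmentation network outputs
# one score per class for every pixel, and the predicted label map is the
# per-pixel argmax over those scores.
import numpy as np

CLASSES = ["road", "car", "pedestrian", "sky"]   # illustrative class list

def segment(scores: np.ndarray) -> np.ndarray:
    """scores: (H, W, num_classes) per-pixel class scores -> (H, W) label map."""
    return scores.argmax(axis=-1)

H, W = 4, 6
scores = np.random.rand(H, W, len(CLASSES))      # stand-in for network output
label_map = segment(scores)
print(label_map)                                  # each entry indexes CLASSES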

38 citations