scispace - formally typeset
J

Jürgen Schmidhuber

Researcher at Dalle Molle Institute for Artificial Intelligence Research

Publications -  568
Citations -  161189

Jürgen Schmidhuber is an academic researcher from Dalle Molle Institute for Artificial Intelligence Research. The author has contributed to research in topics: Artificial neural network & Reinforcement learning. The author has an hindex of 99, co-authored 539 publications receiving 122453 citations. Previous affiliations of Jürgen Schmidhuber include Information Technology University & University of Lugano.

Papers
More filters
Journal ArticleDOI

Long short-term memory

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
Journal ArticleDOI

Deep learning in neural networks

TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium, review deep supervised learning, unsupervised learning, reinforcement learning & evolutionary computation, and indirect search for short programs encoding deep and large networks.
Proceedings ArticleDOI

Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks

TL;DR: This paper presents a novel method for training RNNs to label unsegmented sequences directly, thereby solving both problems of sequence learning and post-processing.
Journal ArticleDOI

LSTM: A Search Space Odyssey

TL;DR: This paper presents the first large-scale analysis of eight LSTM variants on three representative tasks: speech recognition, handwriting recognition, and polyphonic music modeling, and observes that the studied hyperparameters are virtually independent and derive guidelines for their efficient adjustment.
Proceedings ArticleDOI

Multi-column deep neural networks for image classification

TL;DR: In this paper, a biologically plausible, wide and deep artificial neural network architectures was proposed to match human performance on tasks such as the recognition of handwritten digits or traffic signs, achieving near-human performance.