scispace - formally typeset
Journal

arXiv: Learning 

About: arXiv: Learning is an academic journal. It publishes mainly in the areas of artificial neural networks and reinforcement learning. Over its lifetime, the journal has published 45,050 publications, which have received 837,151 citations.

Papers

Posted Content
02 Jan 2012-arXiv: Learning
Abstract: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems. This package focuses on bringing machine learning to non-specialists using a general-purpose high-level language. Emphasis is put on ease of use, performance, documentation, and API consistency. It has minimal dependencies and is distributed under the simplified BSD license, encouraging its use in both academic and commercial settings. Source code, binaries, and documentation can be downloaded from this http URL.

28,898 citations


Posted Content
Diederik P. Kingma, Jimmy Ba
22 Dec 2014-arXiv: Learning
Abstract: We introduce Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments. The method is straightforward to implement, is computationally efficient, has low memory requirements, is invariant to diagonal rescaling of the gradients, and is well suited for problems that are large in terms of data and/or parameters. The method is also appropriate for non-stationary objectives and problems with very noisy and/or sparse gradients. The hyper-parameters have intuitive interpretations and typically require little tuning. Some connections to related algorithms, by which Adam was inspired, are discussed. We also analyze the theoretical convergence properties of the algorithm and provide a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework. Empirical results demonstrate that Adam works well in practice and compares favorably to other stochastic optimization methods. Finally, we discuss AdaMax, a variant of Adam based on the infinity norm.

23,369 citations
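
The "adaptive estimates of lower-order moments" mentioned in the Adam abstract can be illustrated with a minimal sketch. The update rule and the default values of beta1, beta2, and eps follow the commonly cited form from the paper rather than the abstract itself. The function name `adam_minimize`, the one-dimensional toy objective, and the learning rate of 0.01 are assumptions chosen for illustration.

```python
import math


def adam_minimize(grad, x0, steps=2000, lr=0.01, beta1=0.9, beta2=0.999, eps=1e-8):
    """Minimize a one-variable function given its gradient, using Adam-style updates.

    `grad` is a callable returning the gradient at x (hypothetical helper
    signature, chosen for this sketch).
    """
    x = x0
    m = 0.0  # running estimate of the first moment (mean of gradients)
    v = 0.0  # running estimate of the second moment (mean of squared gradients)
    for t in range(1, steps + 1):
        g = grad(x)
        m = beta1 * m + (1 - beta1) * g
        v = beta2 * v + (1 - beta2) * g * g
        # Bias correction: both estimates start at zero and are biased toward it.
        m_hat = m / (1 - beta1 ** t)
        v_hat = v / (1 - beta2 ** t)
        x -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return x


# Toy objective (assumed for illustration): f(x) = (x - 3)^2, gradient 2 * (x - 3).
x_min = adam_minimize(lambda x: 2 * (x - 3), x0=0.0)
one_step = adam_minimize(lambda x: 2 * (x - 3), x0=0.0, steps=1)
```

Because the step is the ratio of the two moment estimates, rescaling the gradient by a constant leaves the step size nearly unchanged, which is one way to read the abstract's "invariant to diagonal rescaling" claim. In this sketch the first step has magnitude close to `lr` regardless of how large the gradient is.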


Posted Content
Sergey Ioffe, Christian Szegedy
11 Feb 2015-arXiv: Learning
Abstract: Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates and careful parameter initialization, and makes it notoriously hard to train models with saturating nonlinearities. We refer to this phenomenon as internal covariate shift, and address the problem by normalizing layer inputs. Our method draws its strength from making normalization a part of the model architecture and performing the normalization for each training mini-batch. Batch Normalization allows us to use much higher learning rates and be less careful about initialization. It also acts as a regularizer, in some cases eliminating the need for Dropout. Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin. Using an ensemble of batch-normalized networks, we improve upon the best published result on ImageNet classification: reaching 4.9% top-5 validation error (and 4.8% test error), exceeding the accuracy of human raters.

17,151 citations
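
The per-mini-batch normalization described in the Batch Normalization abstract can be sketched as follows. This covers only the training-time forward pass, written in plain Python for readability. The learnable scale and shift (`gamma`, `beta`) come from the paper rather than the abstract, and the function name `batch_norm` and the sample data are assumptions for illustration. At inference time the method uses accumulated statistics instead of batch statistics, which this sketch omits.

```python
import math


def batch_norm(batch, gamma, beta, eps=1e-5):
    """Normalize each feature over the mini-batch, then scale and shift.

    `batch` is a list of rows (one row per example); `gamma` and `beta`
    hold one scale and one shift value per feature.
    """
    n = len(batch)
    dims = len(batch[0])
    # Per-feature mean and (biased) variance computed over the mini-batch.
    means = [sum(row[j] for row in batch) / n for j in range(dims)]
    variances = [
        sum((row[j] - means[j]) ** 2 for row in batch) / n for j in range(dims)
    ]
    return [
        [
            gamma[j] * (row[j] - means[j]) / math.sqrt(variances[j] + eps) + beta[j]
            for j in range(dims)
        ]
        for row in batch
    ]


# Two features on very different scales (made-up values).
batch = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]
normalized = batch_norm(batch, gamma=[1.0, 1.0], beta=[0.0, 0.0])
```

With `gamma` set to 1 and `beta` set to 0, each output feature has approximately zero mean and unit variance across the batch, so the next layer sees inputs on a similar scale whatever the earlier layers produce.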


Posted Content
Thomas Kipf, Max Welling
09 Sep 2016-arXiv: Learning
TL;DR: A scalable approach for semi-supervised learning on graph-structured data, based on an efficient variant of convolutional neural networks that operates directly on graphs and outperforms related methods by a significant margin.

Abstract: We present a scalable approach for semi-supervised learning on graph-structured data that is based on an efficient variant of convolutional neural networks which operate directly on graphs. We motivate the choice of our convolutional architecture via a localized first-order approximation of spectral graph convolutions. Our model scales linearly in the number of graph edges and learns hidden layer representations that encode both local graph structure and features of nodes. In a number of experiments on citation networks and on a knowledge graph dataset we demonstrate that our approach outperforms related methods by a significant margin.

8,285 citations
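
A single layer of the graph convolution described in the abstract above can be sketched in plain Python. The propagation rule used here, ReLU(Â H W) with Â = D^(-1/2) (A + I) D^(-1/2), is the commonly cited form from the paper and is not stated in the abstract. The helper names, the three-node graph, and the weight values are assumptions for illustration. This dense version does not reflect the linear scaling in the number of edges that the abstract reports, which relies on sparse operations.

```python
import math


def normalize_adjacency(adj):
    """Add self-loops, then apply symmetric degree normalization."""
    n = len(adj)
    a_tilde = [
        [adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)
    ]
    deg = [sum(row) for row in a_tilde]
    return [
        [a_tilde[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
        for i in range(n)
    ]


def matmul(a, b):
    """Dense matrix product of two list-of-lists matrices."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def gcn_layer(a_hat, features, weights):
    """One layer: mix each node's features with its neighbours', project, apply ReLU."""
    propagated = matmul(a_hat, features)
    transformed = matmul(propagated, weights)
    return [[max(0.0, value) for value in row] for row in transformed]


# Path graph 0 - 1 - 2 with two input features per node (made-up values).
adj = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
weights = [[1.0], [-1.0]]  # hypothetical weights; learned in practice

a_hat = normalize_adjacency(adj)
hidden = gcn_layer(a_hat, features, weights)
```

Each row of `hidden` depends on both the node's own features and those of its direct neighbours, which corresponds to the abstract's statement that the hidden representations encode local graph structure as well as node features.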


Posted Content
19 Nov 2015-arXiv: Learning
TL;DR: This work introduces a class of CNNs called deep convolutional generative adversarial networks (DCGANs), that have certain architectural constraints, and demonstrates that they are a strong candidate for unsupervised learning.

Abstract: In recent years, supervised learning with convolutional networks (CNNs) has seen huge adoption in computer vision applications. Comparatively, unsupervised learning with CNNs has received less attention. In this work we hope to help bridge the gap between the success of CNNs for supervised learning and unsupervised learning. We introduce a class of CNNs called deep convolutional generative adversarial networks (DCGANs), that have certain architectural constraints, and demonstrate that they are a strong candidate for unsupervised learning. Training on various image datasets, we show convincing evidence that our deep convolutional adversarial pair learns a hierarchy of representations from object parts to scenes in both the generator and discriminator. Additionally, we use the learned features for novel tasks - demonstrating their applicability as general image representations.

6,739 citations


Network Information
Related Journals (5)
arXiv: Machine Learning (12.4K papers, 260.6K citations; 96% related)
Journal of Machine Learning Research (3.1K papers, 519.3K citations; 92% related)
arXiv: Artificial Intelligence (13.6K papers, 186.5K citations; 90% related)
arXiv: Computer Vision and Pattern Recognition (50K papers, 1.1M citations; 90% related)
arXiv: Neural and Evolutionary Computing (4.4K papers, 97.7K citations; 89% related)

Performance
Metrics
No. of papers from the Journal in previous years
Year    Papers
2022    3
2021    12,564
2020    10,847
2019    8,886
2018    4,727
2017    2,309

Top Attributes

Journal's top 5 most impactful authors

Sergey Levine (213 papers, 17.3K citations)
Yoshua Bengio (177 papers, 31.9K citations)
Pieter Abbeel (114 papers, 13.5K citations)
Ruslan Salakhutdinov (81 papers, 13.3K citations)
Michael I. Jordan (62 papers, 8.1K citations)