Journal ArticleDOI

Overtraining, Regularization, and Searching for Minimum in Neural Networks

Jonas Sjöberg, +1 more
01 Jul 1992 - Vol. 25, Iss. 14, pp. 73-78
TLDR
Neural network models for dynamical systems are often characterized by the fact that they use a fairly large number of parameters, which makes overtraining a concern; the article shows that regularization and stopping the parameter search before the minimum is reached give closely related benefits.
About
This article was published in IFAC Proceedings Volumes on 1992-07-01 and has received 123 citations to date. The article focuses on the topics: Time delay neural network & Feedforward neural network.
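The paper's central point, that breaking off the parameter search before the loss minimum acts much like explicit regularization, is easy to demonstrate on a linear least-squares problem. The sketch below is a minimal illustration, assuming a synthetic ill-conditioned data set and the standard heuristic correspondence lambda ~ 1/(iterations * step size); none of the constants come from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic ill-conditioned regression problem (hypothetical data).
n, p = 50, 20
X = rng.normal(size=(n, p)) @ np.diag(np.logspace(0, -2, p))
y = X @ rng.normal(size=p) + 0.1 * rng.normal(size=n)

iters = 200
step = 0.5 / np.linalg.norm(X, 2) ** 2   # safe gradient step (< 2/sigma_max^2)
lam = 1.0 / (iters * step)               # heuristic early-stopping <-> ridge match

# Explicit regularization: ridge solution (X'X + lam*I) w = X'y.
w_ridge = np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

# Implicit regularization: stop gradient descent well before the minimum.
w = np.zeros(p)
for _ in range(iters):
    w -= step * X.T @ (X @ w - y)

w_ls = np.linalg.lstsq(X, y, rcond=None)[0]   # fully converged least squares
print("||w_stop - w_ridge|| =", np.linalg.norm(w - w_ridge))
print("||w_stop - w_ls||    =", np.linalg.norm(w - w_ls))
```

The early-stopped iterate lands far closer to the ridge solution than to the converged least-squares minimizer, which is the overtraining/regularization connection in miniature.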


Citations
Journal ArticleDOI

Nonlinear black-box modeling in system identification: a unified overview

TL;DR: The common features of the different approaches, the choices that have to be made, and the considerations relevant to a successful system-identification application of these techniques are described from a user's perspective.
Journal ArticleDOI

The effects of adding noise during backpropagation training on a generalization performance

TL;DR: It is shown that input noise and weight noise encourage the neural-network output to be a smooth function of the input or of its weights, respectively, whereas in the weak-noise limit noise added to the output of the network only changes the objective function by a constant and therefore cannot improve generalization.
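The input-noise idea can be sketched with a linear-in-parameters model, where the smoothing effect is easiest to see. Below, fresh Gaussian noise perturbs the inputs at every epoch of a gradient fit; the data, polynomial degree, and noise level are invented for the demo.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical 1-D regression data.
n = 100
x = rng.uniform(-1, 1, size=(n, 1))
y = np.sin(3 * x).ravel() + 0.1 * rng.normal(size=n)

def features(x, d=9):
    """Polynomial features, degree high enough to overfit."""
    return np.hstack([x ** k for k in range(d + 1)])

w = np.zeros(10)
step, sigma = 0.05, 0.1   # learning rate and input-noise level (assumptions)

for epoch in range(2000):
    Xn = features(x + sigma * rng.normal(size=x.shape))  # fresh input noise
    w -= step * Xn.T @ (Xn @ w - y) / n

# In the weak-noise limit this behaves like training with a smoothness
# (Tikhonov-style) penalty on the fitted function.
print("fitted weights:", np.round(w, 3))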
Journal ArticleDOI

The statistical mechanics of learning a rule

TL;DR: A summary is presented of the statistical-mechanical theory of learning a rule with a neural network, a rapidly advancing area closely related to other inverse problems frequently encountered by physicists.
Journal ArticleDOI

On the interpretation and identification of dynamic Takagi-Sugeno fuzzy models

TL;DR: A close relationship exists between dynamic Takagi-Sugeno fuzzy models and dynamic linearization when using affine local model structures, which suggests that a solution to the multiobjective identification problem exists; it is also shown, however, that the affine local model structure is a highly sensitive parametrization when applied in transient operating regimes.
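For readers unfamiliar with the model class: a Takagi-Sugeno output is a validity-weighted blend of affine local models, y(x) = sum_i w_i(x) * (a_i * x + b_i) with normalized weights. The sketch below evaluates such a blend for two hypothetical scalar local models; every constant is invented for illustration.

```python
import numpy as np

def ts_output(x, centers, widths, a, b):
    """Takagi-Sugeno output: normalized Gaussian validity weights
    blending affine local models y_i = a_i * x + b_i."""
    w = np.exp(-0.5 * ((x - centers) / widths) ** 2)
    w /= w.sum()
    return float(w @ (a * x + b))

# Two hypothetical local models, valid around x = -1 and x = +1.
centers = np.array([-1.0, 1.0])
widths  = np.array([0.7, 0.7])
a = np.array([0.5, -0.3])   # local slopes
b = np.array([0.2, 1.0])    # local offsets

for x in (-1.5, 0.0, 1.5):
    print(x, "->", round(ts_output(x, centers, widths, a, b), 3))
```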
Journal ArticleDOI

Original Contribution: Improving model selection by nonconvergent methods

TL;DR: This paper shows the general superiority of the "extended" nonconvergent methods compared to classical penalty-term methods, simple stopped training, and methods that only vary the number of hidden units.
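A "nonconvergent" (stopped-training) selection rule keeps the weights seen at the best validation error rather than the converged minimizer. A minimal sketch, assuming a linear model, a random train/validation split, and an arbitrary patience of 50 epochs:

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical train/validation split for a linear model.
Xtr, Xval = rng.normal(size=(40, 15)), rng.normal(size=(40, 15))
w_true = rng.normal(size=15)
ytr  = Xtr  @ w_true + 0.5 * rng.normal(size=40)
yval = Xval @ w_true + 0.5 * rng.normal(size=40)

w = np.zeros(15)
step = 0.5 / np.linalg.norm(Xtr, 2) ** 2
best_w, best_err, patience = w.copy(), np.inf, 0

for epoch in range(5000):
    w -= step * Xtr.T @ (Xtr @ w - ytr)          # one training step
    val_err = np.mean((Xval @ w - yval) ** 2)
    if val_err < best_err:                        # keep the best-so-far weights
        best_w, best_err, patience = w.copy(), val_err, 0
    else:
        patience += 1
        if patience >= 50:                        # stop before convergence
            break

print(f"stopped at epoch {epoch}, best validation MSE {best_err:.3f}")
```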
References
Journal ArticleDOI

Approximation by superpositions of a sigmoidal function

TL;DR: It is demonstrated that finite linear combinations of compositions of a fixed univariate function and a set of affine functionals can uniformly approximate any continuous function of n real variables with support in the unit hypercube.
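A small numerical illustration of the theorem's objects: fix the affine functionals (randomly, here), fit only the outer linear combination, and a modest number of sigmoids already approximates a smooth target well on [0, 1]. The node count and parameter ranges are arbitrary demo choices.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Target: a continuous function on [0, 1].
x = np.linspace(0, 1, 200)
f = np.sin(2 * np.pi * x)

# Fix the affine functionals w*x + b at random, then solve only for the
# outer linear combination by least squares.
rng = np.random.default_rng(3)
N = 30
W = rng.uniform(-20, 20, N)
b = rng.uniform(-10, 10, N)
Phi = sigmoid(np.outer(x, W) + b)        # (200, N) design matrix
c, *_ = np.linalg.lstsq(Phi, f, rcond=None)

print(f"sup-norm error with {N} sigmoids: {np.max(np.abs(Phi @ c - f)):.4f}")
```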
Book

Numerical Methods for Unconstrained Optimization and Nonlinear Equations (Classics in Applied Mathematics, 16)

TL;DR: In this book, Dennis and Schnabel present a modular system of algorithms for unconstrained minimization and nonlinear equations, built up from Newton's method for solving one equation in one unknown.
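The method named in the summary, Newton's iteration for one equation in one unknown, fits in a few lines; the test equation below is a stand-in chosen for the example.

```python
def newton(f, fprime, x0, tol=1e-12, max_iter=50):
    """Newton's method for a single nonlinear equation f(x) = 0."""
    x = x0
    for _ in range(max_iter):
        fx = f(x)
        if abs(fx) < tol:
            break
        x -= fx / fprime(x)   # Newton step: x <- x - f(x)/f'(x)
    return x

# Example: sqrt(2) as the positive root of x^2 - 2 = 0.
print(newton(lambda x: x * x - 2, lambda x: 2 * x, x0=1.0))  # ~1.41421356
```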
Journal ArticleDOI

Generalized Cross-Validation as a Method for Choosing a Good Ridge Parameter

TL;DR: The generalized cross-validation (GCV) method as discussed by the authors is a generalized version of Allen's PRESS, which can be used in subset selection and singular value truncation, and even to choose from among mixtures of these methods.
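For ridge regression the GCV criterion has the closed form V(lam) = n * ||(I - A)y||^2 / [tr(I - A)]^2, where A = X (X'X + lam*I)^{-1} X' is the influence matrix. The sketch below scans a grid of lam values on synthetic data (an assumption for the demo) and returns the minimizer.

```python
import numpy as np

rng = np.random.default_rng(4)
n, p = 60, 25
X = rng.normal(size=(n, p))
y = X @ rng.normal(size=p) + rng.normal(size=n)

def gcv_score(lam):
    """GCV score V(lam) = n * ||(I - A)y||^2 / tr(I - A)^2 for ridge."""
    A = X @ np.linalg.solve(X.T @ X + lam * np.eye(p), X.T)  # influence matrix
    resid = y - A @ y
    return n * (resid @ resid) / np.trace(np.eye(n) - A) ** 2

lams = np.logspace(-4, 2, 50)
best = min(lams, key=gcv_score)
print(f"GCV-selected ridge parameter: {best:.4g}")
```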
Journal ArticleDOI

Regularization algorithms for learning that are equivalent to multilayer networks.

TL;DR: A theory is reported that shows the equivalence between regularization and a class of three-layer networks called regularization networks or hyper basis functions.
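The simplest regularization network places one Gaussian basis function at each training point and solves a regularized linear system for the output weights, c = (G + lam*I)^{-1} y. The width and regularization constant below are arbitrary demo values.

```python
import numpy as np

rng = np.random.default_rng(5)
x = np.sort(rng.uniform(-3, 3, 25))
y = np.tanh(x) + 0.05 * rng.normal(size=x.size)

# Gaussian basis centered at the training points (classic RBF form).
width, lam = 0.8, 1e-3
G = np.exp(-((x[:, None] - x[None, :]) ** 2) / (2 * width ** 2))
c = np.linalg.solve(G + lam * np.eye(x.size), y)   # regularized output weights

xq = np.linspace(-3, 3, 7)                          # query points
Gq = np.exp(-((xq[:, None] - x[None, :]) ** 2) / (2 * width ** 2))
print(np.round(Gq @ c, 3))                          # network outputs
```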
Proceedings Article

The Effective Number of Parameters: An Analysis of Generalization and Regularization in Nonlinear Learning Systems

TL;DR: In this paper, the generalization performance of nonlinear learning systems, such as multilayer perceptrons and radial basis functions, is analyzed via the second-order relationship between the expected test-set and training-set errors.
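For linear smoothers the effective number of parameters is the trace of the influence matrix, p_eff(lam) = tr[X (X'X + lam*I)^{-1} X'], and analyses in this spirit relate expected test and training errors through it (roughly, test error ~ training error + 2 * sigma^2 * p_eff / n). The sketch computes p_eff on synthetic data; all values are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(6)
n, p = 80, 30
X = rng.normal(size=(n, p))

def p_eff(lam):
    """Effective number of parameters: trace of the influence matrix."""
    A = X @ np.linalg.solve(X.T @ X + lam * np.eye(p), X.T)
    return np.trace(A)

# Stronger regularization shrinks p_eff below the nominal parameter count p.
for lam in (0.0, 1.0, 10.0, 100.0):
    print(f"lambda = {lam:6.1f}  ->  p_eff = {p_eff(lam):6.2f}")
```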