
Showing papers on "Unsupervised learning" published in 1990


Book ChapterDOI
E.R. Davies1
01 Jan 1990
TL;DR: This chapter introduces the subject of statistical pattern recognition (SPR) by considering how features are defined and emphasizes that the nearest neighbor algorithm achieves error rates comparable with those of an ideal Bayes’ classifier.
Abstract: This chapter introduces the subject of statistical pattern recognition (SPR). It starts by considering how features are defined and emphasizes that the nearest neighbor algorithm achieves error rates comparable with those of an ideal Bayes’ classifier. The concepts of an optimal number of features, representativeness of the training data, and the need to avoid overfitting to the training data are stressed. The chapter shows that methods such as the support vector machine and artificial neural networks are subject to these same training limitations, although each has its advantages. For neural networks, the multilayer perceptron architecture and back-propagation algorithm are described. The chapter distinguishes between supervised and unsupervised learning, demonstrating the advantages of the latter and showing how methods such as clustering and principal components analysis fit into the SPR framework. The chapter also defines the receiver operating characteristic, which allows an optimum balance between false positives and false negatives to be achieved.
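The nearest neighbor rule the chapter builds on fits in a few lines. The following is a generic 1-NN sketch with invented toy data, not code from the chapter:

```python
# Classify a query point by the label of its closest training point (1-NN).

def nearest_neighbor(train, query):
    """train: list of (feature_vector, label) pairs."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    _, label = min(train, key=lambda pair: sq_dist(pair[0], query))
    return label

# Toy two-class data (hypothetical):
train = [((0.0, 0.0), "A"), ((0.1, 0.2), "A"),
         ((1.0, 1.0), "B"), ((0.9, 1.1), "B")]
print(nearest_neighbor(train, (0.2, 0.1)))  # nearest training point is class "A"
```

Despite its simplicity, this is the rule whose error rate the chapter compares with the ideal Bayes classifier.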

1,189 citations


Proceedings ArticleDOI
01 Jul 1990
TL;DR: In this article, the authors present an algorithm for improving the accuracy of algorithms for learning binary concepts by combining a large number of hypotheses, each of which is generated by training the given learning algorithm on a different set of examples.
Abstract: We present an algorithm for improving the accuracy of algorithms for learning binary concepts. The improvement is achieved by combining a large number of hypotheses, each of which is generated by training the given learning algorithm on a different set of examples. Our algorithm is based on ideas presented by Schapire and represents an improvement over his results. The analysis of our algorithm provides general upper bounds on the resources required for learning in Valiant's polynomial PAC learning framework, which are the best general upper bounds known today. We show that the number of hypotheses that are combined by our algorithm is the smallest number possible. Other outcomes of our analysis are results regarding the representational power of threshold circuits, the relation between learnability and compression, and a method for parallelizing PAC learning algorithms. We provide extensions of our algorithms to cases in which the concepts are not binary and to the case where the accuracy of the learning algorithm depends on the distribution of the instances.
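The combination step can be pictured as a majority vote over many hypotheses. The sketch below shows only that final vote; the paper's actual contribution lies in how the training distributions for the individual hypotheses are chosen, which this toy version omits. The three "weak" hypotheses are invented for illustration:

```python
# Combine binary hypotheses (functions x -> 0/1) by unweighted majority vote.

def majority_vote(hypotheses, x):
    votes = sum(h(x) for h in hypotheses)
    return 1 if votes * 2 > len(hypotheses) else 0

# Three hypothetical weak hypotheses for the concept "x >= 5":
hs = [lambda x: 1 if x > 3 else 0,
      lambda x: 1 if x > 5 else 0,
      lambda x: 1 if x > 4 else 0]
print(majority_vote(hs, 6))  # all three vote 1, so the combination outputs 1
print(majority_vote(hs, 4))  # only one votes 1, so the combination outputs 0
```

The vote can be correct even when individual hypotheses err, which is the intuition behind combining many weakly accurate hypotheses.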

865 citations


Proceedings Article
01 Jan 1990

557 citations


Journal ArticleDOI
TL;DR: The heart of these algorithms is the pocket algorithm, a modification of perceptron learning that makes perceptron learning well-behaved with nonseparable training data, even if the data are noisy and contradictory.
Abstract: A key task for connectionist research is the development and analysis of learning algorithms. An examination is made of several supervised learning algorithms for single-cell and network models. The heart of these algorithms is the pocket algorithm, a modification of perceptron learning that makes perceptron learning well-behaved with nonseparable training data, even if the data are noisy and contradictory. Features of these algorithms include speed, i.e. algorithms fast enough to handle large sets of training data; network scaling properties, i.e. network methods scale up almost as well as single-cell models when the number of inputs is increased; analytic tractability, i.e. upper bounds on classification error are derivable; online learning, i.e. some variants can learn continually, without referring to previous data; and winner-take-all groups or choice groups, i.e. algorithms can be adapted to select one out of a number of possible classifications. These learning algorithms are suitable for applications in machine learning, pattern recognition, and connectionist expert systems.
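The pocket idea can be sketched briefly: run ordinary perceptron updates, but keep ("pocket") the weight vector that has survived the longest run of consecutive correct classifications. Gallant's version presents training examples at random; the sketch below cycles through the data so the result is deterministic, and the data are an invented toy problem:

```python
# Pocket algorithm sketch: perceptron updates plus a "pocketed" best weight vector.

def pocket(data, iters=100):
    """data: list of (inputs, target) pairs with target in {-1, +1}.
    Returns the pocketed weight vector (bias folded in as the last weight)."""
    n = len(data[0][0])
    w = [0.0] * (n + 1)
    pocket_w, best_run, run = list(w), 0, 0
    for i in range(iters):
        x, t = data[i % len(data)]
        xa = list(x) + [1.0]                       # append constant bias input
        s = sum(wi * xi for wi, xi in zip(w, xa))
        if (1 if s >= 0 else -1) == t:
            run += 1
            if run > best_run:                     # new longest correct run
                best_run, pocket_w = run, list(w)
        else:
            run = 0
            w = [wi + t * xi for wi, xi in zip(w, xa)]   # perceptron step
    return pocket_w

def predict(w, x):
    s = sum(wi * xi for wi, xi in zip(w, list(x) + [1.0]))
    return 1 if s >= 0 else -1

# Learn the AND function (toy, separable data):
data = [((0, 0), -1), ((0, 1), -1), ((1, 0), -1), ((1, 1), 1)]
w = pocket(data)
print([predict(w, x) for x, _ in data])  # [-1, -1, -1, 1], matching the targets
```

On nonseparable data plain perceptron weights cycle forever, but the pocketed weights still record the best solution seen so far, which is the paper's point.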

529 citations


Journal ArticleDOI
TL;DR: A stochastic reinforcement learning algorithm for learning functions with continuous outputs using a connectionist network that learns to perform an underconstrained positioning task using a simulated 3 degree-of-freedom robot arm.

306 citations


Journal ArticleDOI
TL;DR: A new hybrid unsupervised-learning law, called the differential competitive law, which uses the signal velocity as a local unsupervised reinforcement mechanism, is introduced, and its coding and stability behavior in feedforward and feedback networks is studied.
Abstract: A new hybrid learning law, the differential competitive law, which uses the neuronal signal velocity as a local unsupervised reinforcement mechanism, is introduced, and its coding and stability behavior in feedforward and feedback networks is examined. This analysis is facilitated by the recent Gluck-Parker pulse-coding interpretation of signal functions in differential Hebbian learning systems. The second-order behavior of RABAM (random adaptive bidirectional associative memory) Brownian-diffusion systems is summarized by the RABAM noise suppression theorem: the mean-squared activation and synaptic velocities decrease exponentially quickly to their lower bounds, the instantaneous noise variances driving the system. This result is extended to the RABAM annealing model, which provides a unified framework from which to analyze Geman-Hwang combinatorial optimization dynamical systems and continuous Boltzmann machine learning.
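The differential competitive idea is that a synaptic vector moves toward the input in proportion to the *change* in its neuron's competition signal (its velocity), not the signal itself. The toy below uses a winner-take-all 0/1 signal and invented numbers; it is an illustration of the law's form, not the paper's model:

```python
# One step of a toy differential competitive update.

def dcl_step(weights, prev_signal, x, rate=0.5):
    """weights: list of weight vectors; prev_signal: previous 0/1 win
    indicator per neuron. Returns (new_weights, new_signal)."""
    def dot(a, b):
        return sum(ai * bi for ai, bi in zip(a, b))
    winner = max(range(len(weights)), key=lambda j: dot(weights[j], x))
    signal = [1.0 if j == winner else 0.0 for j in range(len(weights))]
    new_w = []
    for j, wj in enumerate(weights):
        vel = signal[j] - prev_signal[j]           # discrete signal velocity
        new_w.append([wi + rate * vel * (xi - wi) for wi, xi in zip(wj, x)])
    return new_w, signal

w = [[1.0, 0.0], [0.0, 1.0]]
sig = [0.0, 0.0]
w, sig = dcl_step(w, sig, [0.9, 0.1])   # neuron 0 wins; its weights move toward x
print(w[0])                              # nudged toward [0.9, 0.1]
```

Note that a neuron that keeps winning has zero signal velocity on later steps, so it stops learning until the competition outcome changes; this is the "reinforcement only on change" character of the law.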

176 citations


Journal ArticleDOI
TL;DR: It is shown that the optical neural network is capable of performing both unsupervised learning and pattern recognition operations simultaneously by setting two matching scores in the learning algorithm, and that a slower learning rate yields a more topologically organized memory matrix.
Abstract: A key requirement for neural computing is the ability to adapt to a changeable environment and to recognize unknown objects. This paper deals with an adaptive optical neural network using Kohonen's self-organizing feature map algorithm for unsupervised learning. A compact optical neural network of 64 neurons using liquid crystal televisions is used for this study. To test the performance of the self-organizing neural network, experimental demonstrations and computer simulations are provided. Effects due to unsupervised learning parameters are analyzed. We show that the optical neural network is capable of performing both unsupervised learning and pattern recognition operations simultaneously by setting two matching scores in the learning algorithm. By using a slower learning rate, the construction of the memory matrix becomes more topologically organized. Moreover, the introduction of forbidden regions in the memory space enables the neural network to learn new patterns without erasing the old ones.
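The self-organizing feature map at the heart of this system can be sketched in software. The following minimal 1-D Kohonen map uses toy data and parameter values chosen for illustration (the paper's optical implementation and its two matching scores are not modeled); it shows the role of the learning rate the abstract discusses:

```python
# Minimal 1-D Kohonen self-organizing feature map on scalar inputs.
import math
import random

def train_som(data, n_neurons=8, epochs=60, rate=0.3, radius=1.5, seed=1):
    rng = random.Random(seed)
    weights = [rng.random() for _ in range(n_neurons)]
    for _ in range(epochs):
        for x in data:
            # Best-matching unit: the neuron whose weight is closest to x.
            bmu = min(range(n_neurons), key=lambda j: abs(weights[j] - x))
            for j in range(n_neurons):
                # Gaussian neighborhood pulls the BMU and its neighbors toward x.
                h = math.exp(-((j - bmu) ** 2) / (2 * radius ** 2))
                weights[j] += rate * h * (x - weights[j])
    return weights

data = [0.05, 0.1, 0.5, 0.55, 0.9, 0.95]
w = train_som(data)
# Neighboring neurons tend to carry neighboring values (topological order).
print(w)
```

Lowering `rate` slows adaptation but, as the abstract notes for the optical network, tends to leave the final map more smoothly organized.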

141 citations



Proceedings ArticleDOI
17 Jun 1990
TL;DR: An online learning algorithm for reinforcement learning with continually running recurrent networks in nonstationary reactive environments is described and the possibility of using the system for planning future action sequences is investigated and this approach is compared to approaches based on temporal difference methods.
Abstract: An online learning algorithm for reinforcement learning with continually running recurrent networks in nonstationary reactive environments is described. Various kinds of reinforcement are considered as special types of input to an agent living in the environment. The agent's only goal is to maximize the amount of reinforcement received over time. Supervised learning techniques for recurrent networks serve to construct a differentiable model of the environmental dynamics which includes a model of future reinforcement. This model is used for learning goal-directed behavior in an online fashion. The possibility of using the system for planning future action sequences is investigated and this approach is compared to approaches based on temporal difference methods. A connection to metalearning (learning how to learn) is noted.

98 citations


Book ChapterDOI
01 Jun 1990
TL;DR: An improvement of Wilson's classifier system BOOLE is proposed that shows how the learning rates of genetics-based machine learning systems can be greatly improved; the modified system is compared to a neural net using back-propagation on a difficult Boolean learning task, the multiplexer function.
Abstract: Genetics-based machine learning systems are considered by a majority of machine learners to be slow-learning systems. In this paper, we propose an improvement of Wilson's classifier system BOOLE that shows how the learning rates of genetics-based machine learning systems can be greatly improved. The modification consists of a change to the reinforcement component. We then compare the respective performances of this modified BOOLE, called NEWBOOLE, and a neural net using back propagation on a difficult Boolean learning task, the multiplexer function. The results of this comparison show that NEWBOOLE obtains significantly faster learning rates.

76 citations


Proceedings Article
01 Oct 1990
TL;DR: It is shown by simulation that relaxation networks of the kind the authors are implementing in VLSI are capable of learning large problems just like back-propagation networks.
Abstract: Feedback connections are required so that the teacher signal on the output neurons can modify weights during supervised learning. Relaxation methods are needed for learning static patterns with full-time feedback connections. Feedback network learning techniques have not achieved wide popularity because of the still greater computational efficiency of back-propagation. We show by simulation that relaxation networks of the kind we are implementing in VLSI are capable of learning large problems just like back-propagation networks. A microchip incorporates deterministic mean-field theory learning as well as stochastic Boltzmann learning. A multiple-chip electronic system implementing these networks will make high-speed parallel learning in them feasible in the future.

Proceedings Article
01 Oct 1990
TL;DR: Using an unsupervised learning procedure, a network is trained on an ensemble of images of the same two-dimensional object at different positions, orientations and sizes, and can reject instances of other shapes by using the fact that the predictions made by its two halves disagree.
Abstract: Using an unsupervised learning procedure, a network is trained on an ensemble of images of the same two-dimensional object at different positions, orientations and sizes. Each half of the network "sees" one fragment of the object, and tries to produce as output a set of 4 parameters that have high mutual information with the 4 parameters output by the other half of the network. Given the ensemble of training patterns, the 4 parameters on which the two halves of the network can agree are the position, orientation, and size of the whole object, or some recoding of them. After training, the network can reject instances of other shapes by using the fact that the predictions made by its two halves disagree. If two competing networks are trained on an unlabelled mixture of images of two objects, they cluster the training cases on the basis of the objects' shapes, independently of the position, orientation, and size.

Proceedings Article
29 Jul 1990
TL;DR: Six myths in the machine learning community that address issues of bias, learning as search, computational learning theory, Occam's razor, "universal" learning algorithms, and interactive learning are proposed.
Abstract: This paper is a discussion of machine learning theory on empirically learning classification rules. The paper proposes six myths in the machine learning community that address issues of bias, learning as search, computational learning theory, Occam's razor, "universal" learning algorithms, and interactive learning. Some of the problems raised are also addressed from a Bayesian perspective. The paper concludes by suggesting questions that machine learning researchers should be addressing both theoretically and experimentally.

Proceedings Article
29 Jul 1990
TL;DR: The learning part of a system developed to provide expert systems capability augmented with learning is described; the learning scheme is a hybrid connectionist-symbolic one, and current results include learning the well-known Iris data set.
Abstract: This paper describes the learning part of a system which has been developed to provide expert systems capability augmented with learning. The learning scheme is a hybrid connectionist-symbolic one. A network representation is used. Learning may be done incrementally and requires only one pass through the data set to be learned. Attribute-value pairs are supported as a variable implementation. Variables are represented by groups of connected cells in the network. The learning algorithm is described and an example given. Current results are discussed, which include learning the well-known Iris data set. The results show that the system has promise.

Proceedings ArticleDOI
17 Jun 1990
TL;DR: A discussion is presented of three techniques which offer significant improvement in training time, including an acceleration process for neurons which produce the same output class for the inputs provided by the training sample.
Abstract: A discussion is presented of three techniques which offer significant improvement in training time. In the first, training is restricted to those samples for which the network fails to predict correctly. The training process is extended to the entire training data set as the performance of the network improves. In the second technique, an acceleration process is used for neurons which produce the same output class for the inputs provided by the training sample. In the third technique, the learning rate is optimized, on the fly, to get the optimal improvement for each training pass. A derivation is presented for an optimal matching of momentum and learning rate.

Journal ArticleDOI
A. Carlson1
TL;DR: Local, unsupervised learning rules for the threshold and the transition width are proposed, and a network using these rules sorts the input patterns into classes, which it identifies by a binary code, with the coarser structure coded by the earlier neurons in the hierarchy.
Abstract: The Hebbian rule (Hebb 1949), coupled with an appropriate mechanism to limit the growth of synaptic weights, allows a neuron to learn to respond to the first principal component of the distribution of its input signals (Oja 1982). Rubner and Schulten (1990) have recently suggested the use of an "anti-Hebbian" rule in a network with hierarchical lateral connections. When applied to neurons with linear response functions, this model allows additional neurons to learn to respond to additional principal components (Rubner and Tavan 1989). Here we apply the model to neurons with non-linear response functions characterized by a threshold and a transition width. We propose local, unsupervised learning rules for the threshold and the transition width, and illustrate the operation of these rules with some simple examples. A network using these rules sorts the input patterns into classes, which it identifies by a binary code, with the coarser structure coded by the earlier neurons in the hierarchy.
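Oja's rule, which the abstract takes as its starting point, is Hebbian learning with a decay term that bounds the weights, so a linear neuron converges toward the first principal component of its inputs. The sketch below shows Oja's rule only (not the paper's threshold and transition-width rules), on invented data stretched along the direction (1, 1):

```python
# Oja's rule: Hebbian growth (rate*y*x) minus a decay term (rate*y^2*w).
import random

def oja(data, rate=0.05, epochs=200, seed=3):
    rng = random.Random(seed)
    w = [rng.random(), rng.random()]
    for _ in range(epochs):
        for x in data:
            y = sum(wi * xi for wi, xi in zip(w, x))     # linear neuron output
            w = [wi + rate * y * (xi - y * wi) for wi, xi in zip(w, x)]
    return w

# Toy samples along the direction (1, 1):
data = [(1.0, 1.0), (-1.0, -1.0), (0.5, 0.5), (-0.5, -0.5)]
w = oja(data)
norm = sum(wi * wi for wi in w) ** 0.5
print([wi / norm for wi in w])   # approximately the unit vector (0.707, 0.707)
```

The decay term keeps the weight vector near unit length, which is what allows the fixed point to be the principal eigenvector rather than growing without bound.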

Book ChapterDOI
01 Jan 1990
TL;DR: GAL is a new algorithm that is able to quantize vectors as members of categories in an incremental fashion; the network grows if and when necessary during learning.
Abstract: Learning by changing connection weights only is time-consuming and does not always work. Freedom to modify the network structure is also needed. Grow-and-Learn (GAL) is a new algorithm that is able to quantize vectors as members of categories in an incremental fashion. When a new vector is encountered, it is tested as in nearest neighbor search, and if it is not already quantized correctly, units and links are added to accommodate this additional requirement. Thus the network, when learning, grows if and when necessary. As the structure of the resulting network in such a learning phase depends on the order in which the vectors are encountered, a second phase is added to eliminate old, no-longer-necessary associations. In this phase, the network is closed to the environment and the input patterns are generated by the network itself, during which the relevance of units is computed and those that are not vital are removed. Simulation results when applied to character recognition are promising. Physiological plausibility and how the idea may be extended to unsupervised learning are discussed.
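The growing phase described above can be sketched as incremental nearest-neighbor storage: classify each incoming vector against the stored units, and add a new unit only when the vector would be misclassified. The function name and data below are illustrative assumptions, not the paper's code, and the pruning (second) phase is omitted:

```python
# Sketch of GAL's growing phase: store a unit only when needed.

def gal_train(stream):
    """stream: iterable of (vector, label) pairs. Returns stored units."""
    def sq_dist(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))
    units = []                                   # the growing set of units
    for x, label in stream:
        if units:
            _, nearest = min(units, key=lambda u: sq_dist(u[0], x))
            if nearest == label:
                continue                         # already quantized correctly
        units.append((x, label))                 # grow: add a unit for x
    return units

stream = [((0.0, 0.0), "A"), ((0.1, 0.1), "A"),   # second "A" is redundant
          ((1.0, 1.0), "B"), ((0.9, 1.0), "B")]
units = gal_train(stream)
print(len(units))  # 2: one unit per region suffices
```

As the abstract notes, the stored set depends on presentation order, which is why GAL adds a separate pruning phase afterward.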

Journal ArticleDOI
TL;DR: A learning scheme based on Hebb's rule, which allows the system's neuronal cells to specialize on different patterns during learning, is introduced, appropriately modified, and applied to the competitive network under study.

Journal ArticleDOI
TL;DR: A new artificial neural model for unsupervised learning iterates the weights in such a way as to move the decision boundary to a place of low pattern density; it is extended to the multiclass case by applying this procedure in a hierarchical manner.

Proceedings ArticleDOI
03 Apr 1990
TL;DR: A new approach for the learning process of multilayer perceptron neural networks using a recursive-least-squares (RLS) type algorithm is proposed, and an analog of the back-propagation strategy used in the conventional learning algorithms is developed.
Abstract: A new approach for the learning process of multilayer perceptron neural networks using a recursive-least-squares (RLS) type algorithm is proposed. The weights in the network are updated recursively upon the arrival of a new training sample. To determine the desired target in the hidden layers, an analog of the back-propagation strategy used in the conventional learning algorithms is developed. This permits the application of the learning procedure to all the other lower layers. Simulation results on the 4-bit parity-checker problem are provided.

Book ChapterDOI
01 Jun 1990
TL;DR: In control tasks, such as pole balancing, it is found that a program that learns to balance the pole quickly produces a control strategy that is so specific as to make it impossible to transfer expertise from one related task to another.
Abstract: The most frequently used measure of performance for reinforcement learning algorithms is learning rate. That is, how many learning trials are required before the program is able to perform its task adequately. In this paper, we argue that this is not necessarily the best measure of performance and, in some cases, can even be misleading. In control tasks, such as pole balancing, we have found that a program that learns to balance the pole quickly produces a control strategy that is so specific as to make it impossible to transfer expertise from one related task to another. We examine the reasons for this and suggest ways of obtaining general control strategies. We also make the conjecture that, as a broad principle, there is a trade-off between rapid learning rate and the ability to generalise. We also introduce methods for analysing the results of reinforcement learning algorithms to produce readable control rules.

Book ChapterDOI
01 Jan 1990
TL;DR: The goal of the research is to understand the power and appropriateness of instance-based representations and their associated acquisition methods and to mitigate the effects of non-convex concepts.
Abstract: The goal of our research is to understand the power and appropriateness of instance-based representations and their associated acquisition methods. This paper concerns two methods for reducing storage requirements for instance-based learning algorithms. The first method, termed instance-saving, represents concept descriptions by selecting and storing a representative subset of the given training instances. We provide an analysis for instance-saving techniques and specify one general class of concepts that instance-saving algorithms can learn. The second method, termed instance-averaging, represents concept descriptions by averaging together some training instances while simply saving others. We describe why analyses for instance-averaging algorithms are difficult to produce. Our empirical results indicate that storage requirements for these two methods are roughly equivalent. We outline the assumptions of instance-averaging algorithms and describe how their violation might degrade performance. To mitigate the effects of non-convex concepts, a dynamic distance-thresholding technique is introduced and applied in both the averaging and non-averaging learning algorithms. Thresholding increases storage requirements but also increases the quality of the resulting concept descriptions.
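The instance-averaging idea can be sketched concretely: when a training instance's nearest stored neighbor has the same class, merge the instance into that neighbor by averaging instead of storing it separately. The procedure and data below are simplified assumptions for illustration (the paper's algorithms also involve classification checks and distance thresholding, omitted here):

```python
# Instance-averaging sketch: merge same-class instances into a running mean.

def train_averaging(stream):
    """stream: (vector, label) pairs. Returns [(vector, label, count)]."""
    def sq_dist(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))
    stored = []
    for x, label in stream:
        if stored:
            i = min(range(len(stored)), key=lambda k: sq_dist(stored[k][0], x))
            v, lab, n = stored[i]
            if lab == label:
                # Fold the new instance into the stored one's running average.
                stored[i] = ([(vi * n + xi) / (n + 1) for vi, xi in zip(v, x)],
                             lab, n + 1)
                continue
        stored.append((list(x), label, 1))       # instance-saving fallback
    return stored

stream = [((0.0, 0.0), "A"), ((0.2, 0.0), "A"),
          ((1.0, 1.0), "B"), ((1.0, 0.8), "B")]
stored = train_averaging(stream)
print([(v, lab) for v, lab, _ in stored])  # two averaged prototypes
```

Averaging keeps storage at one prototype per cluster, but (as the abstract warns) the averaged point can drift outside a non-convex concept, which is what the paper's distance-thresholding technique addresses.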

Proceedings Article
01 Jan 1990
TL;DR: This paper proves the general inability of simple learning programs to learn complex concepts from few input data, independently of the epistemological problems of inductive inference.
Abstract: Machine learning is widely regarded as a tool for overcoming the bottleneck in knowledge acquisition. Especially in knowledge-intensive domains, there is hope for using machine learning techniques successfully. This paper proves the general inability of simple learning programs to learn complex concepts from few input data. This holds independently of the epistemological problems of inductive inference. These results are obtained by the use of algorithmic information theory.

Book ChapterDOI
01 Jun 1990
TL;DR: This work applies an approach to modeling the average case behavior of learning algorithms to a purely empirical learning algorithm, and to an algorithm that combines empirical and explanation-based learning.
Abstract: We present an approach to modeling the average case behavior of learning algorithms. Our motivation is to predict the expected accuracy of learning algorithms as a function of the number of training examples. We apply this framework to a purely empirical learning algorithm (the one-sided algorithm for pure conjunctive concepts) and to an algorithm that combines empirical and explanation-based learning. We evaluate the average-case models by comparing the accuracy predicted by the models to the actual accuracy obtained by running the learning algorithms.
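The one-sided algorithm for pure conjunctive concepts, in its standard form, starts from the most specific conjunction and deletes any literal that disagrees with a positive example; only positive examples are used, hence "one-sided". The data below are invented:

```python
# One-sided (specific-to-general) learning of a pure conjunctive concept.

def one_sided(positives):
    """positives: list of boolean feature tuples. Returns a dict mapping the
    kept feature indices to their required values."""
    hypothesis = dict(enumerate(positives[0]))    # most specific start
    for x in positives[1:]:
        for i in list(hypothesis):
            if hypothesis[i] != x[i]:
                del hypothesis[i]                 # drop the contradicted literal
    return hypothesis

def matches(hypothesis, x):
    return all(x[i] == v for i, v in hypothesis.items())

# Hypothetical target concept: feature 0 true AND feature 2 false.
positives = [(1, 0, 0), (1, 1, 0)]
h = one_sided(positives)
print(h)                      # {0: 1, 2: 0}
print(matches(h, (1, 1, 1)))  # False: feature 2 must be 0
```

Because the hypothesis only ever generalizes, its expected accuracy after n examples can be analyzed in closed form, which is what makes this algorithm a natural test case for the paper's average-case framework.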

Proceedings ArticleDOI
01 Sep 1990
TL;DR: It is shown that the optical neural network is capable of performing both unsupervised learning and pattern recognition operations simultaneously by setting two matching scores in the learning algorithm, and that introducing forbidden regions in the memory space lets it learn new patterns without erasing old ones.
Abstract: A key requirement for neural computing is the ability to adapt to a changeable environment and to recognize unknown objects. This paper deals with an adaptive optical neural network using Kohonen's self-organizing feature map algorithm for unsupervised learning. A compact optical neural network of 64 neurons using liquid crystal televisions is used for this study. To test the performance of the self-organizing neural network, experimental demonstrations with computer simulations are provided. Effects due to unsupervised learning parameters are analyzed. We have shown that the optical neural network is capable of performing both unsupervised learning and pattern recognition operations simultaneously by setting two matching scores in the learning algorithm. By using a slower learning rate, the construction of the memory matrix becomes topologically more organized. Moreover, introducing forbidden regions in the memory space enables the neural network to learn new patterns without erasing the old ones.

Proceedings Article
29 Jul 1990
TL;DR: A representation language is presented that supports a hybrid analytical and similarity-based classification scheme and can be seen as providing an inductive bias to the learning procedure, thereby shortening the required training phase, and reducing the brittleness of the induced generalizations.
Abstract: This paper is concerned with knowledge representation issues in machine learning. In particular, it presents a representation language that supports a hybrid analytical and similarity-based classification scheme. Analytical classification is produced using a KL-ONE-like term-subsumption strategy, while similarity-based classification is driven by generalizations induced from a training set by an unsupervised learning procedure. This approach can be seen as providing an inductive bias to the learning procedure, thereby shortening the required training phase, and reducing the brittleness of the induced generalizations.

Proceedings ArticleDOI
J. Zhang1
06 Nov 1990
TL;DR: The method for combining inductive learning and exemplar-based learning has been implemented in the flexible concept learning system and experiments showed that the combined method has comparable performance to that of AQ16 and ASSISTANT in three natural domains.
Abstract: A learning approach that combines inductive learning with exemplar-based learning is described. In the method, a concept is represented by two parts: a generalized abstract description and a set of exemplars (exceptions). Generalized descriptions represent the principles of concepts, whereas exemplars represent the exceptional or rare cases. The method is an alternative for solving the problem of small disjuncts and for representing concepts with imprecise and irregular boundaries. The method for combining inductive learning and exemplar-based learning has been implemented in the flexible concept learning system. Experiments showed that the combined method has comparable performance to that of AQ16 and ASSISTANT in three natural domains.

01 Jul 1990
TL;DR: The theory of input-output mapping from a set of examples is extended by introducing ways of dealing with two aspects of learning: learning in the presence of unreliable examples and learning from positive and negative examples.
Abstract: Learning an input-output mapping from a set of examples can be regarded as synthesizing an approximation of a multi-dimensional function. From this point of view, this form of learning is closely related to regularization theory. In this note, we extend the theory by introducing ways of dealing with two aspects of learning: learning in the presence of unreliable examples and learning from positive and negative examples. The first extension corresponds to dealing with outliers among the sparse data. The second one corresponds to exploiting information about points or regions in the range of the function that are forbidden.

Proceedings ArticleDOI
17 Jun 1990
TL;DR: The authors show how the network itself can infer the grammar and show that nonstochastic nets can perform signature verification with high reliability, raising the possibility of signature verification on a robust smart card.
Abstract: A syntactic neural network is equivalent to a parser for a certain type of grammar, in this case strictly hierarchical context-free. This allows an efficient method for pattern description and has the added advantage of being a generative model. The authors show how the network itself can infer the grammar. Syntactic neural nets can model stochastic or nonstochastic grammars. The stochastic nets are properly probabilistic and are powerful discriminators; the nonstochastic nets are less powerful, but have straightforward silicon implementations with existing technology. Learning in syntactic nets may proceed supervised or unsupervised. In each case, the algorithm is the same; the difference lies in the data presented to the net. In prior publications, the authors applied syntactic neural nets to character recognition and cursive script recognition. The authors presently show that nonstochastic nets can perform signature verification with high reliability. This raises the possibility of signature verification on a robust smart card.