Showing papers on "Active learning (machine learning) published in 2005"

PDF

Open Access

Book•

[...]

Carl Edward Rasmussen¹, Christopher Williams•Institutions (1)

23 Nov 2005

TL;DR: The treatment is comprehensive and self-contained, targeted at researchers and students in machine learning and applied statistics, and deals with the supervised learning problem for both regression and classification.

...read moreread less

Abstract: A comprehensive and self-contained introduction to Gaussian processes, which provide a principled, practical, probabilistic approach to learning in kernel machines. Gaussian processes (GPs) provide a principled, practical, probabilistic approach to learning in kernel machines. GPs have received increased attention in the machine-learning community over the past decade, and this book provides a long-needed systematic and unified treatment of theoretical and practical aspects of GPs in machine learning. The treatment is comprehensive and self-contained, targeted at researchers and students in machine learning and applied statistics. The book deals with the supervised-learning problem for both regression and classification, and includes detailed algorithms. A wide variety of covariance (kernel) functions are presented and their properties discussed. Model selection is discussed both from a Bayesian and a classical perspective. Many connections to other well-known techniques from machine learning and statistics are discussed, including support-vector machines, neural networks, splines, regularization networks, relevance vector machines and others. Theoretical issues including learning curves and the PAC-Bayesian framework are treated, and several approximation methods for learning with large datasets are discussed. The book contains illustrative examples and exercises, and code and datasets are available on the Web. Appendixes provide mathematical background and a discussion of Gaussian Markov processes.

...read moreread less

11,357 citations

Book•

Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning)

[...]

Carl Edward Rasmussen, Christopher Williams

01 Dec 2005

TL;DR: The treatment is comprehensive and self-contained, targeted at researchers and students in machine learning and applied statistics, and includes detailed algorithms for supervised-learning problem for both regression and classification.

...read moreread less

Abstract: Gaussian processes (GPs) provide a principled, practical, probabilistic approach to learning in kernel machines. GPs have received increased attention in the machine-learning community over the past decade, and this book provides a long-needed systematic and unified treatment of theoretical and practical aspects of GPs in machine learning. The treatment is comprehensive and self-contained, targeted at researchers and students in machine learning and applied statistics. The book deals with the supervised-learning problem for both regression and classification, and includes detailed algorithms. A wide variety of covariance (kernel) functions are presented and their properties discussed. Model selection is discussed both from a Bayesian and a classical perspective. Many connections to other well-known techniques from machine learning and statistics are discussed, including support-vector machines, neural networks, splines, regularization networks, relevance vector machines and others. Theoretical issues including learning curves and the PAC-Bayesian framework are treated, and several approximation methods for learning with large datasets are discussed. The book contains illustrative examples and exercises, and code and datasets are available on the Web. Appendixes provide mathematical background and a discussion of Gaussian Markov processes.

...read moreread less

2,732 citations

Journal Article•DOI•

A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data

[...]

Rie Kubota Ando, Tong Zhang

01 Dec 2005-Journal of Machine Learning Research

TL;DR: This paper presents a general framework in which the structural learning problem can be formulated and analyzed theoretically, and relate it to learning with unlabeled data, and algorithms for structural learning will be proposed, and computational issues will be investigated.

...read moreread less

Abstract: One of the most important issues in machine learning is whether one can improve the performance of a supervised learning algorithm by including unlabeled data. Methods that use both labeled and unlabeled data are generally referred to as semi-supervised learning. Although a number of such methods are proposed, at the current stage, we still don't have a complete understanding of their effectiveness. This paper investigates a closely related problem, which leads to a novel approach to semi-supervised learning. Specifically we consider learning predictive structures on hypothesis spaces (that is, what kind of classifiers have good predictive power) from multiple learning tasks. We present a general framework in which the structural learning problem can be formulated and analyzed theoretically, and relate it to learning with unlabeled data. Under this framework, algorithms for structural learning will be proposed, and computational issues will be investigated. Experiments will be given to demonstrate the effectiveness of the proposed algorithms in the semi-supervised learning setting.

...read moreread less

1,484 citations

Proceedings Article•DOI•

Opposition-Based Learning: A New Scheme for Machine Intelligence

[...]

Hamid R. Tizhoosh¹•Institutions (1)

University of Waterloo¹

28 Nov 2005

TL;DR: Opposition-based learning as a new scheme for machine intelligence is introduced and possibilities for extensions of existing learning algorithms are discussed.

...read moreread less

Abstract: Opposition-based learning as a new scheme for machine intelligence is introduced. Estimates and counter-estimates, weights and opposite weights, and actions versus counter-actions are the foundation of this new approach. Examples are provided. Possibilities for extensions of existing learning algorithms are discussed. Preliminary results are provided

...read moreread less

1,464 citations

Journal Article•DOI•

Cooperative Multi-Agent Learning: The State of the Art

[...]

Liviu Panait¹, Sean Luke¹•Institutions (1)

George Mason University¹

01 Nov 2005-Autonomous Agents and Multi-Agent Systems

TL;DR: This survey attempts to draw from multi-agent learning work in a spectrum of areas, including RL, evolutionary computation, game theory, complex systems, agent modeling, and robotics, and finds that this broad view leads to a division of the work into two categories.

...read moreread less

Abstract: Cooperative multi-agent systems (MAS) are ones in which several agents attempt, through their interaction, to jointly solve tasks or to maximize utility. Due to the interactions among the agents, multi-agent problem complexity can rise rapidly with the number of agents or their behavioral sophistication. The challenge this presents to the task of programming solutions to MAS problems has spawned increasing interest in machine learning techniques to automate the search and optimization process. We provide a broad survey of the cooperative multi-agent learning literature. Previous surveys of this area have largely focused on issues common to specific subareas (for example, reinforcement learning, RL or robotics). In this survey we attempt to draw from multi-agent learning work in a spectrum of areas, including RL, evolutionary computation, game theory, complex systems, agent modeling, and robotics. We find that this broad view leads to a division of the work into two categories, each with its own special issues: applying a single learner to discover joint solutions to multi-agent problems (team learning), or using multiple simultaneous learners, often one per agent (concurrent learning). Additionally, we discuss direct and indirect communication in connection with learning, plus open issues in task decomposition, scalability, and adaptive dynamics. We conclude with a presentation of multi-agent learning problem domains, and a list of multi-agent learning resources.

...read moreread less

1,283 citations

Journal Article•DOI•

Rapid and brief communication: Evolutionary extreme learning machine

[...]

Qin-Yu Zhu¹, A. K. Qin¹, Ponnuthurai Nagaratnam Suganthan¹, Guang-Bin Huang¹•Institutions (1)

Nanyang Technological University¹

01 Oct 2005-Pattern Recognition

TL;DR: A hybrid learning algorithm is proposed which uses the differential evolutionary algorithm to select the input weights and Moore-Penrose (MP) generalized inverse to analytically determine the output weights.

...read moreread less

734 citations

Journal Article•

Fast Kernel Classifiers with Online and Active Learning

[...]

Antoine Bordes¹, Seyda Ertekin², Jason Weston³, Léon Bottou⁴•Institutions (4)

École Normale Supérieure¹, Massachusetts Institute of Technology², Google³, Microsoft⁴

01 Dec 2005-Journal of Machine Learning Research

TL;DR: This contribution presents an online SVM algorithm based on the premise that active example selection can yield faster training, higher accuracies, and simpler models, using only a fraction of the training example labels.

...read moreread less

Abstract: Very high dimensional learning systems become theoretically possible when training examples are abundant. The computing cost then becomes the limiting factor. Any efficient learning algorithm should at least take a brief look at each example. But should all examples be given equal attention?This contribution proposes an empirical answer. We first present an online SVM algorithm based on this premise. LASVM yields competitive misclassification rates after a single pass over the training examples, outspeeding state-of-the-art SVM solvers. Then we show how active example selection can yield faster training, higher accuracies, and simpler models, using only a fraction of the training example labels.

...read moreread less

700 citations

Report•DOI•

Reducing labeling effort for structured prediction tasks

[...]

Aron Culotta¹, Andrew McCallum¹•Institutions (1)

University of Massachusetts Amherst¹

09 Jul 2005

TL;DR: A new active learning paradigm is proposed which reduces not only how many instances the annotator must label, but also how difficult each instance is to annotate, which can vary widely in structured prediction tasks.

...read moreread less

Abstract: A common obstacle preventing the rapid deployment of supervised machine learning algorithms is the lack of labeled training data. This is particularly expensive to obtain for structured prediction tasks, where each training instance may have multiple, interacting labels, all of which must be correctly annotated for the instance to be of use to the learner. Traditional active learning addresses this problem by optimizing the order in which the examples are labeled to increase learning efficiency. However, this approach does not consider the difficulty of labeling each example, which can vary widely in structured prediction tasks. For example, the labeling predicted by a partially trained system may be easier to correct for some instances than for others. We propose a new active learning paradigm which reduces not only how many instances the annotator must label, but also how difficult each instance is to annotate. The system also leverages information from partially correct predictions to efficiently solicit annotations from the user. We validate this active learning framework in an interactive information extraction system, reducing the total number of annotation actions by 22%.

...read moreread less

387 citations

Proceedings Article•

Coarse sample complexity bounds for active learning

[...]

Sanjoy Dasgupta¹•Institutions (1)

University of California, San Diego¹

05 Dec 2005

TL;DR: The sample complexity of active learning problems is characterized in terms of a parameter which takes into account the distribution over the input space, the specific target hypothesis, and the desired accuracy.

...read moreread less

Abstract: We characterize the sample complexity of active learning problems in terms of a parameter which takes into account the distribution over the input space, the specific target hypothesis, and the desired accuracy.

...read moreread less

329 citations

Journal Article•DOI•

Letters: Fully complex extreme learning machine

[...]

Ming-Bin Li¹, Guang-Bin Huang¹, Paramasivan Saratchandran¹, Narasimhan Sundararajan¹•Institutions (1)

Nanyang Technological University¹

01 Oct 2005-Neurocomputing

TL;DR: The simulation results show that the ELM equalizer significantly outperforms other neural network equalizers such as the complex minimal resource allocation network (CMRAN), complex radial basis function (CRBF) network and complex backpropagation (CBP) equalizers.

...read moreread less

316 citations

Proceedings Article•DOI•

Using GPUs for machine learning algorithms

[...]

David W. Steinkraus¹, I. Buck², Patrice Y. Simard¹•Institutions (2)

Microsoft¹, Stanford University²

31 Aug 2005

TL;DR: This work proposes a generic 2-layer fully connected neural network GPU implementation which yields over 3/spl times/ speedup for both training and testing with respect to a 3 GHz P4 CPU.

...read moreread less

Abstract: Using dedicated hardware to do machine learning typically ends up in disaster because of cost, obsolescence, and poor software. The popularization of graphic processing units (GPUs), which are now available on every PC, provides an attractive alternative. We propose a generic 2-layer fully connected neural network GPU implementation which yields over 3/spl times/ speedup for both training and testing with respect to a 3 GHz P4 CPU.

...read moreread less

Proceedings Article•DOI•

A High-Performance Semi-Supervised Learning Method for Text Chunking

[...]

Rie Ando¹, Tong Zhang¹•Institutions (1)

IBM¹

25 Jun 2005

TL;DR: A novel semi-supervised method that employs a learning paradigm which is to find "what good classifiers are like" by learning from thousands of automatically generated auxiliary classification problems on unlabeled data, which produces performance higher than the previous best results.

...read moreread less

Abstract: In machine learning, whether one can build a more accurate classifier by using unlabeled data (semi-supervised learning) is an important issue. Although a number of semi-supervised methods have been proposed, their effectiveness on NLP tasks is not always clear. This paper presents a novel semi-supervised method that employs a learning paradigm which we call structural learning. The idea is to find "what good classifiers are like" by learning from thousands of automatically generated auxiliary classification problems on unlabeled data. By doing so, the common predictive structure shared by the multiple classification problems can be discovered, which can then be used to improve performance on the target problem. The method produces performance higher than the previous best results on CoNLL'00 syntactic chunking and CoNLL'03 named entity chunking (English and German).

...read moreread less

Journal Article•DOI•

Active Learning to Recognize Multiple Types of Plankton

[...]

Tong Luo, Kurt Kramer, Dmitry B. Goldgof, Lawrence O. Hall, Scott Samson, Andrew Remsen, Thomas L. Hopkins - Show less +3 more

01 Dec 2005-Journal of Machine Learning Research

TL;DR: The experimental results from a plankton recognition system indicate that the active learning approach to multiple class support vector machines often requires significantly less labeled images to maintain the same accuracy level as random sampling.

...read moreread less

Abstract: This paper presents an active learning method which reduces the labeling effort of domain experts in multi-class classification problems. Active learning is applied in conjunction with support vector machines to recognize underwater zooplankton from higher-resolution, new generation SIPPER II images. Most previous work on active learning with support vector machines only deals with two class problems. In this paper, we propose an active learning approach "breaking ties" for multi-class support vector machines using the one-vs-one approach with a probability approximation. Experimental results indicate that our approach often requires significantly less labeled images to reach a given accuracy than the approach of labeling the least certain test example and random sampling. It can also be applied in batch mode resulting in an accuracy comparable to labeling one image at a time and retraining.

...read moreread less

Proceedings Article•

On-Line Sequential Extreme Learning Machine

[...]

Guang-Bin Huang¹, Nanying Liang, Hai-Jun Rong, Paramasivan Saratchandran, Narasimhan Sundararajan - Show less +1 more•Institutions (1)

Nanyang Technological University¹

01 Jan 2005

TL;DR: Experimental results on some real benchmark regression problems show that the proposed Online Sequential Extreme Learning Machine (OS-ELM) produces better generalization performance at very fast learning speed.

...read moreread less

Abstract: The primitive Extreme Learning Machine (ELM) [1, 2, 3] with additive neurons and RBF kernels was implemented in batch mode. In this paper, its sequential modification based on recursive least-squares (RLS) algorithm, which referred as Online Sequential Extreme Learning Machine (OS-ELM), is introduced. Based on OS-ELM, Online Sequential Fuzzy Extreme Learning Machine (Fuzzy-ELM) is also introduced to implement zero order TSK model and first order TSK model. The performance of OS-ELM and Fuzzy-ELM are evaluated and compared with other popular sequential learning algorithms, and experimental results on some real benchmark regression problems show that the proposedOnlineSequentialExtreme Learning Machine (OS-ELM) produces better generalization performance at very fast learning speed.

...read moreread less

Journal Article•DOI•

Active learning: theory and applications to automatic speech recognition

[...]

Giuseppe Riccardi¹, Dilek Hakkani-Tur¹•Institutions (1)

AT&T Labs¹

20 Jun 2005-IEEE Transactions on Speech and Audio Processing

TL;DR: This paper describes how to estimate the confidence score for each utterance through an on-line algorithm using the lattice output of a speech recognizer and shows that the amount of labeled data needed for a given word accuracy can be reduced by more than 60% with respect to random sampling.

...read moreread less

Abstract: We are interested in the problem of adaptive learning in the context of automatic speech recognition (ASR). In this paper, we propose an active learning algorithm for ASR. Automatic speech recognition systems are trained using human supervision to provide transcriptions of speech utterances. The goal of Active Learning is to minimize the human supervision for training acoustic and language models and to maximize the performance given the transcribed and untranscribed data. Active learning aims at reducing the number of training examples to be labeled by automatically processing the unlabeled examples, and then selecting the most informative ones with respect to a given cost function for a human to label. In this paper we describe how to estimate the confidence score for each utterance through an on-line algorithm using the lattice output of a speech recognizer. The utterance scores are filtered through the informativeness function and an optimal subset of training samples is selected. The active learning algorithm has been applied to both batch and on-line learning scheme and we have experimented with different selective sampling algorithms. Our experiments show that by using active learning the amount of labeled data needed for a given word accuracy can be reduced by more than 60% with respect to random sampling.

...read moreread less

Proceedings Article•

Faster Rates in Regression via Active Learning

[...]

Rebecca Willett¹, Robert Nowak¹, Rui Castro²•Institutions (2)

University of Wisconsin-Madison¹, Rice University²

05 Dec 2005

TL;DR: In this article, the authors present a rigorous statistical analysis characterizing regimes in which active learning significantly outperforms classical passive learning, and explore fundamental performance limits of active and passive learning in two illustrative nonparametric function classes.

...read moreread less

Abstract: This paper presents a rigorous statistical analysis characterizing regimes in which active learning significantly outperforms classical passive learning. Active learning algorithms are able to make queries or select sample locations in an online fashion, depending on the results of the previous queries. In some regimes, this extra flexibility leads to significantly faster rates of error decay than those possible in classical passive learning settings. The nature of these regimes is explored by studying fundamental performance limits of active and passive learning in two illustrative nonparametric function classes. In addition to examining the theoretical potential of active learning, this paper describes a practical algorithm capable of exploiting the extra flexibility of the active setting and provably improving upon the classical passive techniques. Our active learning theory and methods show promise in a number of applications, including field estimation using wireless sensor networks and fault line detection.

...read moreread less

Journal Article•DOI•

On-line learning for very large data sets

[...]

Léon Bottou¹, Yann L. Cun²•Institutions (2)

Princeton University¹, Courant Institute of Mathematical Sciences²

01 Mar 2005-Applied Stochastic Models in Business and Industry

TL;DR: This paper reconsiders the convergence speed in terms of how fast a learning algorithm optimizes the testing error and shows the superiority of the well designed stochastic learning algorithm.

...read moreread less

Abstract: The design of very large learning systems presents many unsolved challenges. Consider, for instance, a system that ‘watches’ television for a few weeks and learns to enumerate the objects present in these images. Most current learning algorithms do not scale well enough to handle such massive quantities of data. Experience suggests that the stochastic learning algorithms are best suited to such tasks. This is at first surprising because stochastic learning algorithms optimize the training error rather slowly. Our paper reconsiders the convergence speed in terms of how fast a learning algorithm optimizes the testing error. This reformulation shows the superiority of the well designed stochastic learning algorithm. Copyright © 2005 John Wiley & Sons, Ltd.

...read moreread less

Book Chapter•DOI•

Symbolic compositional verification by learning assumptions

[...]

Rajeev Alur¹, P. Madhusudan², Wonhong Nam¹•Institutions (2)

University of Pennsylvania¹, University of Illinois at Urbana–Champaign²

06 Jul 2005

TL;DR: This paper presents a symbolic implementation of the L* algorithm for active learning of regular languages, and incorporates it in the model checker NuSMV, demonstrating significant savings in the computational requirements of symbolic model checking.

...read moreread less

Abstract: The verification problem for a system consisting of components can be decomposed into simpler subproblems for the components using assume-guarantee reasoning However, such compositional reasoning requires user guidance to identify appropriate assumptions for components In this paper, we propose an automated solution for discovering assumptions based on the L* algorithm for active learning of regular languages We present a symbolic implementation of the learning algorithm, and incorporate it in the model checker NuSMV Our experiments demonstrate significant savings in the computational requirements of symbolic model checking

...read moreread less

Proceedings Article•DOI•

Balancing exploration and exploitation: a new algorithm for active machine learning

[...]

T.T. Osugi, Deng Kim, Stephen D. Scott

27 Nov 2005

TL;DR: This work proposes a new active learning algorithm that balances exploration with refining of the decision boundary by dynamically adjusting the probability to explore at each step, and demonstrates improved performance on data sets that require extensive exploration while remaining competitive on data set that do not.

...read moreread less

Abstract: Active machine learning algorithms are used when large numbers of unlabeled examples are available and getting labels for them is costly (e.g. requiring consulting a human expert). Many conventional active learning algorithms focus on refining the decision boundary, at the expense of exploring new regions that the current hypothesis misclassifies. We propose a new active learning algorithm that balances such exploration with refining of the decision boundary by dynamically adjusting the probability to explore at each step. Our experimental results demonstrate improved performance on data sets that require extensive exploration while remaining competitive on data sets that do not. Our algorithm also shows significant tolerance of noise.

...read moreread less

Patent•

Interactive learning-based document annotation

[...]

Boris Chidlovskii¹, Thierry Jacquin¹•Institutions (1)

Xerox¹

23 Dec 2005

TL;DR: In this article, an active learning component trains an annotation model and proposes annotations to documents based on the annotation model, and a request handler conveys annotation requests from the graphical user interface to the active learning components.

...read moreread less

Abstract: A document annotation system includes a graphical user interface used by an annotator to annotate documents. An active learning component trains an annotation model and proposes annotations to documents based on the annotation model. A request handler conveys annotation requests from the graphical user interface to the active learning component, conveys proposed annotations from the active learning component to the graphical user interface, and selectably conveys evaluation requests from the graphical user interface to a domain expert. During annotation, at least some low probability proposed annotations are presented to the annotator by the graphical user interface. The presented low probability proposed annotations enhance training of the annotation model by the active learning component.

...read moreread less

Proceedings Article•DOI•

SVM selective sampling for ranking with application to data retrieval

[...]

Hwanjo Yu¹•Institutions (1)

University of Iowa¹

21 Aug 2005

TL;DR: The proposed sampling technique effectively learns an accurate SVM ranking function with fewer partial orders, and is applied to the data retrieval application, which enables fuzzy search on relational databases by interacting with users for learning their preferences.

...read moreread less

Abstract: Learning ranking (or preference) functions has been a major issue in the machine learning community and has produced many applications in information retrieval. SVMs (Support Vector Machines) - a classification and regression methodology - have also shown excellent performance in learning ranking functions. They effectively learn ranking functions of high generalization based on the "large-margin" principle and also systematically support nonlinear ranking by the "kernel trick". In this paper, we propose an SVM selective sampling technique for learning ranking functions. SVM selective sampling (or active learning with SVM) has been studied in the context of classification. Such techniques reduce the labeling effort in learning classification functions by selecting only the most informative samples to be labeled. However, they are not extendable to learning ranking functions, as the labeled data in ranking is relative ordering, or partial orders of data. Our proposed sampling technique effectively learns an accurate SVM ranking function with fewer partial orders. We apply our sampling technique to the data retrieval application, which enables fuzzy search on relational databases by interacting with users for learning their preferences. Experimental results show a significant reduction of the labeling effort in inducing accurate ranking functions.

...read moreread less

Proceedings Article•DOI•

Preference elicitation for interface optimization

[...]

Krzysztof Z. Gajos¹, Daniel S. Weld¹•Institutions (1)

University of Washington¹

23 Oct 2005

TL;DR: ARNAULD is described, a general interactive tool for eliciting user preferences concerning concrete outcomes and using this feedback to automatically learn a factored cost function.

...read moreread less

Abstract: Decision-theoretic optimization is becoming a popular tool in the user interface community, but creating accurate cost (or utility) functions has become a bottleneck --- in most cases the numerous parameters of these functions are chosen manually, which is a tedious and error-prone process. This paper describes ARNAULD, a general interactive tool for eliciting user preferences concerning concrete outcomes and using this feedback to automatically learn a factored cost function. We empirically evaluate our machine learning algorithm and two automatic query generation approaches and report on an informal user study.

...read moreread less

Proceedings Article•

Maximum Margin Semi-Supervised Learning for Structured Variables

[...]

Yasemin Altun, David McAllester, Mikhail Belkin¹•Institutions (1)

University of Chicago¹

05 Dec 2005

TL;DR: This paper presents a discriminative approach that utilizes the intrinsic geometry of input patterns revealed by unlabeled data points and derives a maximum-margin formulation of semi-supervised learning for structured variables.

...read moreread less

Abstract: Many real-world classification problems involve the prediction of multiple inter-dependent variables forming some structural dependency. Recent progress in machine learning has mainly focused on supervised classification of such structured variables. In this paper, we investigate structured classification in a semi-supervised setting. We present a discriminative approach that utilizes the intrinsic geometry of input patterns revealed by unlabeled data points and we derive a maximum-margin formulation of semi-supervised learning for structured variables. Unlike transductive algorithms, our formulation naturally extends to new test points.

...read moreread less

Proceedings Article•

Query by Committee Made Real

[...]

Ran Gilad-Bachrach¹, Amir Navot², Naftali Tishby²•Institutions (2)

Hebrew University of Jerusalem¹, Interdisciplinary Center for Neural Computation²

05 Dec 2005

TL;DR: A new algorithm, KQBC, is introduced, capable of actively learning large scale problems by using selective sampling, which overcomes the costly sampling step of the well known Query By Committee (QBC) algorithm by projecting onto a low dimensional space.

...read moreread less

Abstract: Training a learning algorithm is a costly task. A major goal of active learning is to reduce this cost. In this paper we introduce a new algorithm, KQBC, which is capable of actively learning large scale problems by using selective sampling. The algorithm overcomes the costly sampling step of the well known Query By Committee (QBC) algorithm by projecting onto a low dimensional space. KQBC also enables the use of kernels, providing a simple way of extending QBC to the non-linear scenario. Sampling the low dimension space is done using the hit and run random walk. We demonstrate the success of this novel algorithm by applying it to both artificial and a real world problems.

...read moreread less

Proceedings Article•DOI•

Online learning over graphs

[...]

Mark Herbster¹, Massimiliano Pontil¹, Lisa Wainer¹•Institutions (1)

University College London¹

07 Aug 2005

TL;DR: Classic online learning techniques similar to the perceptron algorithm are applied to the problem of learning a function defined on a graph in terms of structural properties of the graph, such as the algebraic connectivity or the diameter.

...read moreread less

Abstract: We apply classic online learning techniques similar to the perceptron algorithm to the problem of learning a function defined on a graph. The benefit of our approach includes simple algorithms and performance guarantees that we naturally interpret in terms of structural properties of the graph, such as the algebraic connectivity or the diameter of the graph. We also discuss how these methods can be modified to allow active learning on a graph. We present preliminary experiments with encouraging results.

...read moreread less

Journal Article•

Computational Representation of Collaborative Learning Flow Patterns Using IMS Learning Design

[...]

Davinia Hernández-Leo, Juan I. Asensio-Pérez¹, Yannis Dimitriadis¹•Institutions (1)

University of Valladolid¹

01 Jan 2005-Educational Technology & Society

TL;DR: An extension to IMS-LD is proposed that enables to specify several characteristics of the use of tools that mediate collaboration and in order to obtain a Unit of Learning based on a CLFP, a three stage process is proposed.

...read moreread less

Abstract: The identification and integration of reusable and customizable CSCL (Computer Supported Collaborative Learning) may benefit from the capture of best practices in collaborative learning structuring. The authors have proposed CLFPs (Collaborative Learning Flow Patterns) as a way of collecting these best practices. To facilitate the process of CLFPs by software systems, the paper proposes to specify these patterns using IMS Learning Design (IMS-LD). Thus, teachers without technical knowledge can particularize and integrate CSCL tools. Nevertheless, the support of IMS-LD for describing collaborative learning activities has some deficiencies: the collaborative tools that can be defined in these activities are limited. Thus, this paper proposes and discusses an extension to IMS-LD that enables to specify several characteristics of the use of tools that mediate collaboration. In order to obtain a Unit of Learning based on a CLFP, a three stage process is also proposed. A CLFP-based Unit of Learning example is used to illustrate the process and the need of the proposed extension.

...read moreread less

Journal Article•DOI•

Obstacle avoidance of a mobile robot using hybrid learning approach

[...]

Meng Joo Er¹, Chang Deng¹•Institutions (1)

Nanyang Technological University¹

31 May 2005-IEEE Transactions on Industrial Electronics

TL;DR: Through the hybrid learning approach, an efficient and compact neuro-fuzzy system is generated for obstacle avoidance of a mobile robot in the real world.

...read moreread less

Abstract: in this paper, a hybrid learning approach for obstacle avoidance of a mobile robot is presented. the key features of the approach are, firstly, innate hardwired behaviors which are used to bootstrap learning in the mobile robot system. a neuro-fuzzy controller is developed from a pre-wired or innate controller based on supervised learning in a simulation environment. the fuzzy inference system has been constructed based on the generalized dynamic fuzzy neural networks learning algorithm of Wu and Er, whereby structure and parameters identification are carried out automatically and simultaneously. Secondly, the neuro-fuzzy controller is capable of re-adapting in a new environment. After carrying out the learning phase on a simulated robot, the controller is implemented on a real robot. A reinforcement learning method based on the fuzzy actor-critic learning algorithm is employed so that the system can re-adapt to a new environment without human intervention. In this phase, the structure of the fuzzy inference system and the parameters of the antecedent parts of fuzzy rules are frozen, and reinforcement learning is applied to further tune the parameters in the consequent parts of the fuzzy rules. Through the hybrid learning approach, an efficient and compact neuro-fuzzy system is generated for obstacle avoidance of a mobile robot in the real world.

...read moreread less

Journal Article•DOI•

An incremental approach to genetic-algorithms-based classification

[...]

Sheng-Uei Guan¹, Fangming Zhu¹•Institutions (1)

National University of Singapore¹

01 Apr 2005

TL;DR: Four new approaches with different initialization schemes for incremental learning within one or more classifier agents in a multiagent environment are proposed and it is shown that the proposed approaches can be successfully used for incrementallearning and improve classification rates as compared to the retraining GA.

...read moreread less

Abstract: Incremental learning has been widely addressed in the machine learning literature to cope with learning tasks where the learning environment is ever changing or training samples become available over time. However, most research work explores incremental learning with statistical algorithms or neural networks, rather than evolutionary algorithms. The work in this paper employs genetic algorithms (GAs) as basic learning algorithms for incremental learning within one or more classifier agents in a multiagent environment. Four new approaches with different initialization schemes are proposed. They keep the old solutions and use an "integration" operation to integrate them with new elements to accommodate new attributes, while biased mutation and crossover operations are adopted to further evolve a reinforced solution. The simulation results on benchmark classification data sets show that the proposed approaches can deal with the arrival of new input attributes and integrate them with the original input space. It is also shown that the proposed approaches can be successfully used for incremental learning and improve classification rates as compared to the retraining GA. Possible applications for continuous incremental training and feature selection are also discussed.

...read moreread less

Proceedings Article•DOI•

A semi-supervised active learning framework for image retrieval

[...]

Steven C. H. Hoi¹, Michael R. Lyu¹•Institutions (1)

The Chinese University of Hong Kong¹

20 Jun 2005

TL;DR: A novel semi- supervised active learning framework comprising a fusion of semi-supervised learning and support vector machines is proposed and promising experimental results show that the proposed scheme significantly outperforms the previous approaches.

...read moreread less

Abstract: Although recent studies have shown that unlabeled data are beneficial to boosting the image retrieval performance, very few approaches for image retrieval can learn with labeled and unlabeled data effectively. This paper proposes a novel semi-supervised active learning framework comprising a fusion of semi-supervised learning and support vector machines. We provide theoretical analysis of the active learning framework and present a simple yet effective active learning algorithm for image retrieval. Experiments are conducted on real-world color images to compare with traditional methods. The promising experimental results show that our proposed scheme significantly outperforms the previous approaches.

...read moreread less

Journal Article•

Algorithmic Stability and Meta-Learning

[...]

Andreas Maurer

01 Dec 2005-Journal of Machine Learning Research

TL;DR: A general method is given to prove generalisation error bounds for meta-algorithms searching spaces of uniformly stable algorithms and an application to regularized least squares regression is presented.

...read moreread less

Abstract: A mechnism of transfer learning is analysed, where samples drawn from different learning tasks of an environment are used to improve the learners performance on a new task. We give a general method to prove generalisation error bounds for such meta-algorithms. The method can be applied to the bias learning model of J. Baxter and to derive novel generalisation bounds for meta-algorithms searching spaces of uniformly stable algorithms. We also present an application to regularized least squares regression.

...read moreread less

Collapse