Open Access

Kernel based learning methods for pattern and feature analysis

Zhili Wu
TLDR
The Rival Penalized Competitive Learning (RPCL) is transformed from data space to feature space for automatic clustering and spectral analysis of kernel matrices is used to address the seeds initialization problem associated with RPCL.
Abstract
Kernel-based learning methods (kernel methods) have significantly influenced recent developments in machine learning research. This thesis is on devising and improving kernel methods, and on applying them to pattern and feature analysis. Part of our research focuses on improving Support Vector Machines (SVMs). In solving SVMs, we find that some proposed cache policies for sequential minimal optimization (SMO) result in low efficiency. A better strategy is to cache gradients for all frequently checked vectors. Moreover, we propose a strategy that utilizes the nearest neighboring vectors to speed up the convergence of SMO. We also suggest the use of Hadamard codes for multiclass label prediction by SVMs. We prove that Hadamard codes are optimal in correcting the wrong labels predicted by base classifiers. Furthermore, we design a new summation of exponential (SoE) kernel for solving regression tasks with missing values. We show that SoE kernels satisfy the admissibility conditions for kernels and are insensitive to missing values. This thesis also deals with unsupervised and semi-supervised kernel methods. Specifically, we transform the Rival Penalized Competitive Learning (RPCL) from data space to feature space for automatic clustering. In addition, we use spectral analysis of kernel matrices to address the seed initialization problem associated with RPCL. We also improve SVM-based feature selection in a semi-supervised manner by utilizing both labeled and unlabeled data. The new feature selection method exhibits good performance on feature selection benchmark problems.
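The Hadamard-code scheme mentioned in the abstract can be sketched as follows. This is a minimal illustration, not code from the thesis: each class is assigned a row of a Sylvester Hadamard matrix as its codeword, each bit of the codeword is notionally predicted by a base binary classifier, and the final label is recovered by nearest-codeword (minimum Hamming distance) decoding, which tolerates some wrong base predictions. The matrix size, the row selection, and the `decode` helper are all assumptions made for illustration.

```python
import numpy as np

def hadamard(n):
    # Sylvester construction; n must be a power of two
    H = np.array([[1]])
    while H.shape[0] < n:
        H = np.block([[H, H], [H, -H]])
    return H

# Encode 4 classes with rows of an 8x8 Hadamard matrix.
# Column 0 is all ones and carries no information, so it is dropped,
# leaving codewords of length 7 with pairwise Hamming distance 4.
H = hadamard(8)
codebook = H[1:5, 1:]  # hypothetical choice of 4 codewords

def decode(bits, codebook):
    # Nearest-codeword decoding by Hamming distance:
    # count positions where the predicted bits disagree with each codeword
    dists = np.sum(codebook != bits, axis=1)
    return int(np.argmin(dists))

# Simulate one base classifier predicting the wrong bit.
true_class = 2
word = codebook[true_class].copy()
word[0] *= -1  # flip a single bit
assert decode(word, codebook) == true_class
```

With minimum distance 4 between codewords, decoding provably corrects any single wrong base-classifier output, which is the error-correcting property the abstract alludes to.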



Citations
Proceedings ArticleDOI

Optimization of SVM MultiClass by Particle Swarm (PSO-SVM)

TL;DR: The results obtained show that the PSO-SVM approach yields better classification in terms of accuracy, even though execution time increases, making it possible to optimize the performance of the SVM classifier (Separating with Vast Margin).
Book ChapterDOI

Semi-supervised SVM-based Feature Selection for Cancer Classification using Microarray Gene Expression Data

TL;DR: This paper proposes a semi-supervised SVM-based feature selection method, S³VM-FS, which simultaneously exploits knowledge from unlabelled and labelled data; it achieves higher accuracy yet requires shorter processing time compared with the well-known supervised method.
Journal Article

Adaptive dimension reduction for clustering high dimensional data

TL;DR: Clustering analysis performed on highly overlapped Gaussians, DNA gene expression profiles, and Internet newsgroups demonstrates the effectiveness of the proposed algorithm, which applies repeated dimension reductions so that K-means or EM are performed only in very low dimensions.
Book ChapterDOI

Improving support vector machine using a stochastic local search for classification in datamining

TL;DR: The computational experiments show that the proposed SVM+SLS provides competitive results and finds high-quality solutions to the classification problem in data mining.
Journal Article

Kernel matrix completion by semidefinite programming

TL;DR: It is argued that semidefinite programming provides an interesting convex optimisation framework for machine learning in general and for kernel-machines in particular.
References
Journal ArticleDOI

LIBSVM: A library for support vector machines

TL;DR: Issues such as solving SVM optimization problems, theoretical convergence, multiclass classification, probability estimates, and parameter selection are discussed in detail.

Statistical learning theory

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of the learning process, the author covers function estimation from small data pools, the application of these estimates to real-life problems, and much more.
Journal ArticleDOI

A Tutorial on Support Vector Machines for Pattern Recognition

TL;DR: Several arguments supporting the observed high accuracy of SVMs are reviewed, and numerous examples and proofs of most of the key theorems are given.
Journal ArticleDOI

Nonlinear dimensionality reduction by locally linear embedding.

TL;DR: Locally linear embedding (LLE) is introduced, an unsupervised learning algorithm that computes low-dimensional, neighborhood-preserving embeddings of high-dimensional inputs that learns the global structure of nonlinear manifolds.