Showing papers by "Nello Cristianini published in 2000"

PDF

Open Access

Book•

An Introduction to Support Vector Machines and Other Kernel-based Learning Methods

[...]

Nello Cristianini¹, John Shawe-Taylor²•Institutions (2)

University of Bristol¹, Royal Holloway, University of London²

01 Jan 2000

TL;DR: This is the first comprehensive introduction to Support Vector Machines (SVMs), a new generation learning system based on recent advances in statistical learning theory, and will guide practitioners to updated literature, new applications, and on-line software.

...read moreread less

Abstract: From the publisher: This is the first comprehensive introduction to Support Vector Machines (SVMs), a new generation learning system based on recent advances in statistical learning theory. SVMs deliver state-of-the-art performance in real-world applications such as text categorisation, hand-written character recognition, image classification, biosequences analysis, etc., and are now established as one of the standard tools for machine learning and data mining. Students will find the book both stimulating and accessible, while practitioners will be guided smoothly through the material required for a good grasp of the theory and its applications. The concepts are introduced gradually in accessible and self-contained stages, while the presentation is rigorous and thorough. Pointers to relevant literature and web sites containing software ensure that it forms an ideal starting point for further study. Equally, the book and its associated web site will guide practitioners to updated literature, new applications, and on-line software.

...read moreread less

13,736 citations

Book•

An Introduction to Support Vector Machines

[...]

Nello Cristianini, John Shawe-Taylor

01 Mar 2000

TL;DR: This book is the first comprehensive introduction to Support Vector Machines, a new generation learning system based on recent advances in statistical learning theory, and introduces Bayesian analysis of learning and relates SVMs to Gaussian Processes and other kernel based learning methods.

...read moreread less

Abstract: This book is the first comprehensive introduction to Support Vector Machines (SVMs), a new generation learning system based on recent advances in statistical learning theory. The book also introduces Bayesian analysis of learning and relates SVMs to Gaussian Processes and other kernel based learning methods. SVMs deliver state-of-the-art performance in real-world applications such as text categorisation, hand-written character recognition, image classification, biosequences analysis, etc. Their first introduction in the early 1990s lead to a recent explosion of applications and deepening theoretical analysis, that has now established Support Vector Machines along with neural networks as one of the standard tools for machine learning and data mining. Students will find the book both stimulating and accessible, while practitioners will be guided smoothly through the material required for a good grasp of the theory and application of these techniques. The concepts are introduced gradually in accessible and self-contained stages, though in each stage the presentation is rigorous and thorough. Pointers to relevant literature and web sites containing software ensure that it forms an ideal starting point for further study. Equally the book will equip the practitioner to apply the techniques and an associated web site will provide pointers to updated literature, new applications, and on-line software.

...read moreread less

4,327 citations

Journal Article•DOI•

Support vector machine classification and validation of cancer tissue samples using microarray expression data

[...]

Terrence S. Furey¹, Nello Cristianini², Nigel Duffy¹, David W. Bednarski³, Michèl Schummer³, David Haussler¹ - Show less +2 more•Institutions (3)

University of California, Santa Cruz¹, University of Bristol², University of Washington³

01 Oct 2000-Bioinformatics

TL;DR: A new method to analyse tissue samples using support vector machines for mis-labeled or questionable tissue results and shows that other machine learning methods also perform comparably to the SVM on many of those datasets.

...read moreread less

Abstract: Motivation: DNA microarray experiments generating thousands of gene expression measurements, are being used to gather information from tissue and cell samples regarding gene expression differences that will be useful in diagnosing disease. We have developed a new method to analyse this kind of data using support vector machines (SVMs). This analysis consists of both classification of the tissue samples, and an exploration of the data for mis-labeled or questionable tissue results. Results: We demonstrate the method in detail on samples consisting of ovarian cancer tissues, normal ovarian tissues, and other normal tissues. The dataset consists of expression experiment results for 97 802 cDNAs for each tissue. As a result of computational analysis, a tissue sample is discovered and confirmed to be wrongly labeled. Upon correction of this mistake and the removal of an outlier, perfect classification of tissues is achieved, but not with high confidence. We identify and analyse a subset of genes from the ovarian dataset whose expression is highly differentiated between the types of tissues. To show robustness of the SVM method, two previously published datasets from other types of tissues or cells are analysed. The results are comparable to those previously obtained. We show that other machine learning methods also perform comparably to the SVM on many of those datasets. Availability: The SVM software is available at http:// www. cs.columbia.edu/ ∼bgrundy/ svm.

...read moreread less

2,464 citations

Journal Article•DOI•

Knowledge-based analysis of microarray gene expression data by using support vector machines

[...]

Michael S. Brown¹, William Noble Grundy², David Lin¹, Nello Cristianini³, Charles W. Sugnet¹, Terrence S. Furey¹, Manuel Ares¹, David Haussler¹ - Show less +4 more•Institutions (3)

University of California, Santa Cruz¹, Columbia University², University of Bristol³

04 Jan 2000-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: In this paper, a method of functionally classifying genes by using gene expression data from DNA microarray hybridization experiments is introduced based on the theory of support vector machines (SVMs).

...read moreread less

Abstract: We introduce a method of functionally classifying genes by using gene expression data from DNA microarray hybridization experiments. The method is based on the theory of support vector machines (SVMs). SVMs are considered a supervised computer learning method because they exploit prior knowledge of gene function to identify unknown genes of similar function from expression data. SVMs avoid several problems associated with unsupervised clustering methods, such as hierarchical clustering and self-organizing maps. SVMs have many mathematical features that make them attractive for gene expression analysis, including their flexibility in choosing a similarity function, sparseness of solution when dealing with large data sets, the ability to handle large feature spaces, and the ability to identify outliers. We test several SVMs that use different similarity metrics, as well as some other supervised learning methods, and find that the SVMs best identify sets of genes with a common function using expression data. Finally, we use SVMs to predict functional roles for uncharacterized yeast ORFs based on their expression data.

...read moreread less

2,395 citations

Proceedings Article•

Text Classification using String Kernels

[...]

Huma Lodhi¹, John Shawe-Taylor¹, Nello Cristianini¹, Chris Watkins¹•Institutions (1)

Royal Holloway, University of London¹

01 Jan 2000

TL;DR: In this article, an inner product in the feature space consisting of all subsequences of length k was introduced for comparing two text documents, where a subsequence is any ordered sequence of k characters occurring in the text though not necessarily contiguously.

...read moreread less

Abstract: We introduce a novel kernel for comparing two text documents. The kernel is an inner product in the feature space consisting of all subsequences of length k. A subsequence is any ordered sequence of k characters occurring in the text though not necessarily contiguously. The subsequences are weighted by an exponentially decaying factor of their full length in the text, hence emphasising those occurrences which are close to contiguous. A direct computation of this feature vector would involve a prohibitive amount of computation even for modest values of k, since the dimension of the feature space grows exponentially with k. The paper describes how despite this fact the inner product can be efficiently evaluated by a dynamic programming technique. A preliminary experimental comparison of the performance of the kernel compared with a standard word feature space kernel [6] is made showing encouraging results.

...read moreread less

1,464 citations

Proceedings Article•

Query Learning with Large Margin Classifiers

[...]

Colin Campbell, Nello Cristianini, Alexander J. Smola

29 Jun 2000

TL;DR: This paper proposes an algorithm for the training of support vector machines using instance selection, a theoretical justification for the strategy and experimental results on real and artificial data demonstrating its effectiveness.

...read moreread less

Abstract: The active selection of instances can significantly improve the generalisation performance of a learning machine. Large margin classifiers such as support vector machines classify data using the most informative instances (the support vectors). This makes them natural candidates for instance selection strategies. In this paper we propose an algorithm for the training of support vector machines using instance selection. We give a theoretical justification for the strategy and experimental results on real and artificial data demonstrating its effectiveness. The technique is most efficient when the data set can be learnt using few support vectors.

...read moreread less

418 citations

Knowledge-based analysis of microarray gene expression

[...]

M.P.S. Brwon, William Noble Grundy, David Lin, Nello Cristianini, Charles W. Sugnet, Terrence S. Furey, Ares, David Haussler - Show less +4 more

04 Jan 2000

135 citations

Journal Article•DOI•

Enlarging the Margins in Perceptron Decision Trees

[...]

Kristin P. Bennett¹, Nello Cristianini², John Shawe-Taylor², Donghui Wu¹•Institutions (2)

Rensselaer Polytechnic Institute¹, Royal Holloway, University of London²

01 Dec 2000-Machine Learning

TL;DR: It is proved that other quantities can be as relevant to reduce their flexibility and combat overfitting to provide an upper bound on the generalization error which depends both on the size of the tree and on the margin of the decision nodes.

...read moreread less

Abstract: Capacity control in perceptron decision trees is typically performed by controlling their size. We prove that other quantities can be as relevant to reduce their flexibility and combat overfitting. In particular, we provide an upper bound on the generalization error which depends both on the size of the tree and on the margin of the decision nodes. So enlarging the margin in perceptron decision trees will reduce the upper bound on generalization error. Based on this analysis, we introduce three new algorithms, which can induce large margin perceptron decision trees. To assess the effect of the large margin bias, OC1 (Journal of Artificial Intelligence Research, 1994, 2, 1–32.) of Murthy, Kasif and Salzberg, a well-known system for inducing perceptron decision trees, is used as the baseline algorithm. An extensive experimental study on real world data showed that all three new algorithms perform better or at least not significantly worse than OC1 on almost every dataset with only one exception. OC1 performed worse than the best margin-based method on every dataset.

...read moreread less

102 citations

Latent Semantic Kernels

[...]

Nello Cristianini¹, John Shawe-Taylor¹, Huma Lodhi¹•Institutions (1)

Royal Holloway, University of London¹

01 Jan 2000

TL;DR: This paper describes how the LSI approach can be implemented in a kernel-defined feature space and provides experimental results demonstrating that the approach can significantly improve performance, and that it does not impair it.

...read moreread less

Abstract: Kernel methods like support vector machines have successfully been used for text categorization. A standard choice of kernel function has been the inner product between the vector-space representation of two documents, in analogy with classical information retrieval (IR) approaches. Latent semantic indexing (LSI) has been successfully used for IR purposes as a technique for capturing semantic relations between terms and inserting them into the similarity measure between two documents. One of its main drawbacks, in IR, is its computational cost. In this paper we describe how the LSI approach can be implemented in a kernel-defined feature space. We provide experimental results demonstrating that the approach can significantly improve performance, and that it does not impair it.

...read moreread less

50 citations

Book Chapter•

Margin Distribution and Soft Margin

[...]

John Shawe-Taylor, Nello Cristianini

01 Jan 2000

36 citations

Proc.17th Int Conference on Machine Learning

[...]

I C G Campbell, Nello Cristianini, Alexander J. Smola

01 Jan 2000