
Showing papers by "Klaus-Robert Müller published in 2004"


Journal ArticleDOI
TL;DR: The BCI Competition 2003 was organized to evaluate the current state of the art in signal processing and classification methods for brain-computer interfaces; the paper describes the six competition data sets and the results and function of the most successful algorithms.
Abstract: Interest in developing a new method of man-to-machine communication, a brain-computer interface (BCI), has grown steadily over the past few decades. BCIs create a new communication channel between the brain and an output device by bypassing conventional motor output pathways of nerves and muscles. These systems use signals recorded from the scalp, the surface of the cortex, or from inside the brain to enable users to control a variety of applications including simple word-processing software and orthotics. BCI technology could therefore provide a new communication and control option for individuals who cannot otherwise express their wishes to the outside world. Signal processing and classification methods are essential tools in the development of improved BCI technology. We organized the BCI Competition 2003 to evaluate the current state of the art of these tools. Four laboratories well versed in EEG-based BCI research provided six data sets in a documented format. We made these data sets (i.e., labeled training sets and unlabeled test sets) and their descriptions available on the Internet. The goal in the competition was to maximize the performance measure for the test labels. Researchers worldwide tested their algorithms and competed for the best classification results. This paper describes the six data sets and the results and function of the most successful algorithms.

667 citations


Journal ArticleDOI
TL;DR: It is shown that a suitably arranged interaction between these concepts can significantly boost BCI performance; information-theoretic predictions are derived and their relevance demonstrated on experimental data.
Abstract: Noninvasive electroencephalogram (EEG) recordings provide for easy and safe access to human neocortical processes which can be exploited for a brain-computer interface (BCI). At present, however, the use of BCIs is severely limited by low bit-transfer rates. We systematically analyze and develop two recent concepts, both capable of enhancing the information gain from multichannel scalp EEG recordings: 1) the combination of classifiers, each specifically tailored for different physiological phenomena, e.g., slow cortical potential shifts, such as the premovement Bereitschaftspotential or differences in spatio-spectral distributions of brain activity (i.e., focal event-related desynchronizations) and 2) behavioral paradigms inducing the subjects to generate one out of several brain states (multiclass approach) which all bear a distinctive spatio-temporal signature well discriminable in the standard scalp EEG. We derive information-theoretic predictions and demonstrate their relevance in experimental data. We will show that a suitably arranged interaction between these concepts can significantly boost BCI performance.
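
As a concrete illustration of the kind of information-theoretic prediction discussed above, the widely used Wolpaw bitrate formula estimates the information transferred per trial from the number of classes and the classification accuracy. The minimal Python sketch below is an editorial illustration, not code from the paper.

import math

def bits_per_trial(n_classes: int, accuracy: float) -> float:
    """Wolpaw-style information transfer rate in bits per trial."""
    p = accuracy
    if p <= 1.0 / n_classes:
        return 0.0
    if p >= 1.0:
        return math.log2(n_classes)
    return (math.log2(n_classes)
            + p * math.log2(p)
            + (1.0 - p) * math.log2((1.0 - p) / (n_classes - 1)))

# e.g. a 3-class paradigm at 80% accuracy vs. a binary paradigm at 90% accuracy
print(bits_per_trial(3, 0.80), bits_per_trial(2, 0.90))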

614 citations


Journal Article
TL;DR: A new efficient algorithm is presented for joint diagonalization of several matrices; it is based on the Frobenius-norm formulation of the joint diagonalization problem and addresses diagonalization with a general, non-orthogonal transformation.
Abstract: A new efficient algorithm is presented for joint diagonalization of several matrices. The algorithm is based on the Frobenius-norm formulation of the joint diagonalization problem, and addresses diagonalization with a general, non-orthogonal transformation. The iterative scheme of the algorithm is based on a multiplicative update which ensures the invertibility of the diagonalizer. The algorithm's efficiency stems from the special approximation of the cost function resulting in a sparse, block-diagonal Hessian to be used in the computation of the quasi-Newton update step. Extensive numerical simulations illustrate the performance of the algorithm and provide a comparison to other leading diagonalization methods. The results of such comparison demonstrate that the proposed algorithm is a viable alternative to existing state-of-the-art joint diagonalization algorithms. The practical use of our algorithm is shown for blind source separation problems.
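
For orientation, the Frobenius-norm cost that such non-orthogonal joint diagonalization methods minimize, the sum of squared off-diagonal entries of V C_k V^T over all target matrices C_k, can be written down in a few lines of Python. This is a hedged sketch of the objective only, not the paper's multiplicative quasi-Newton algorithm.

import numpy as np

def offdiag_cost(V, Cs):
    """Frobenius-norm joint-diagonalization cost: sum_k ||off(V C_k V^T)||_F^2."""
    cost = 0.0
    for C in Cs:
        M = V @ C @ V.T
        cost += np.sum(M ** 2) - np.sum(np.diag(M) ** 2)
    return cost

# Toy check: matrices C_k = A D_k A^T share the mixing matrix A, so V = A^{-1}
# drives the off-diagonal cost to (numerically) zero.
rng = np.random.default_rng(0)
A = rng.standard_normal((4, 4))
Cs = [A @ np.diag(rng.standard_normal(4)) @ A.T for _ in range(5)]
print(offdiag_cost(np.eye(4), Cs), offdiag_cost(np.linalg.inv(A), Cs))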

266 citations


Journal Article
TL;DR: It is shown by a simple, exploratory analysis that the negative eigenvalues can code for relevant structure in the data, thus leading to the discovery of new features, which were lost by conventional data analysis techniques.
Abstract: Pairwise proximity data, given as a similarity or dissimilarity matrix, can violate metricity. This occurs either due to noise, fallible estimates, or due to intrinsic non-metric features such as those arising from human judgments. So far the problem of non-metric pairwise data has been tackled by essentially omitting the negative eigenvalues or shifting the spectrum of the associated (pseudo-)covariance matrix for a subsequent embedding. However, little attention has been paid to the negative part of the spectrum itself. In particular, no answer was given as to whether the directions associated with the negative eigenvalues code any variance beyond noise. We show by a simple, exploratory analysis that the negative eigenvalues can code for relevant structure in the data, thus leading to the discovery of new features that are lost by conventional data analysis techniques. The information hidden in the negative part of the spectrum is illustrated and discussed for three data sets, namely USPS handwritten digits, a text-mining data set, and data from cognitive psychology.
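
To make the role of the negative eigenvalues concrete, the sketch below double-centers a squared dissimilarity matrix as in classical scaling and returns the full eigenvalue spectrum, whose negative part the paper examines. The toy dissimilarity matrix is an invented illustration, not one of the paper's data sets.

import numpy as np

def centered_spectrum(D):
    """Eigenvalues of the doubly centered (pseudo-)Gram matrix -1/2 J D^2 J
    obtained from a symmetric dissimilarity matrix D (classical scaling)."""
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n
    G = -0.5 * J @ (D ** 2) @ J
    return np.sort(np.linalg.eigvalsh(G))[::-1]

# A non-metric toy dissimilarity (it violates the triangle inequality) yields a
# negative eigenvalue; the paper argues such directions can carry real structure.
D = np.array([[0.0, 1.0, 1.0],
              [1.0, 0.0, 3.0],
              [1.0, 3.0, 0.0]])
print(centered_spectrum(D))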

102 citations


Journal ArticleDOI
TL;DR: This contribution proposes a novel formulation of a one-class Support Vector Machine (SVM) specially designed for typical IDS data features, to encompass the data with a hypersphere anchored at the center of mass of the data in feature space.
Abstract: Practical application of data mining and machine learning techniques to intrusion detection is often hindered by the difficulty of producing clean data for training. To address this problem, a geometric framework for unsupervised anomaly detection has recently been proposed. In this framework, the data are mapped into a feature space, and anomalies are detected as the entries in sparsely populated regions. In this contribution we propose a novel formulation of a one-class Support Vector Machine (SVM) specially designed for typical IDS data features. The key idea of our "quarter-sphere" algorithm is to encompass the data with a hypersphere anchored at the center of mass of the data in feature space. The proposed method and its behavior under varying percentages of attacks in the data are evaluated on the KDDCup 1999 dataset.
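
The following Python sketch illustrates only the anchoring idea described above: scoring points by their feature-space distance to the center of mass of the data, computed with a kernel. It is a simplified stand-in, not the paper's quarter-sphere formulation (which additionally exploits the one-sided nature of typical IDS features); the RBF kernel and the gamma value are assumptions of this sketch.

import numpy as np

def rbf(X, Y, gamma=0.1):
    """RBF kernel matrix between the rows of X and the rows of Y."""
    d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def center_of_mass_scores(X_train, X_test, gamma=0.1):
    """Anomaly score = squared feature-space distance to the center of mass:
    ||phi(x) - mu||^2 = k(x,x) - (2/n) sum_i k(x,x_i) + (1/n^2) sum_ij k(x_i,x_j)."""
    K_tt = rbf(X_test, X_train, gamma)
    K_train = rbf(X_train, X_train, gamma)
    k_xx = np.ones(len(X_test))          # k(x, x) = 1 for the RBF kernel
    return k_xx - 2.0 * K_tt.mean(axis=1) + K_train.mean()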

88 citations



Book ChapterDOI
22 Sep 2004
TL;DR: In this paper, a non-unitary approximate joint diagonalization (AJD) algorithm is proposed, which is based on a natural gradient-type multiplicative update of the diagonalizing matrix.
Abstract: We present a new algorithm for non-unitary approximate joint diagonalization (AJD), based on a “natural gradient”-type multiplicative update of the diagonalizing matrix, complemented by step-size optimization at each iteration. The advantages of the new algorithm over existing non-unitary AJD algorithms are in the ability to accommodate non-positive-definite matrices (compared to Pham’s algorithm), in the low computational load per iteration (compared to Yeredor’s AC-DC algorithm), and in the theoretically guaranteed convergence to a true (possibly local) minimum (compared to Ziehe et al.’s FFDiag algorithm).
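
To illustrate the multiplicative structure with step-size selection, the sketch below performs one update V <- (I + mu W) V with a plain gradient direction and a brute-force line search. It is a crude stand-in for illustration only, not the authors' natural-gradient update (which, among other things, guarantees invertibility of the diagonalizer); the step-size grid is an assumption.

import numpy as np

def offdiag(M):
    return M - np.diag(np.diag(M))

def ajd_step(V, Cs, step_sizes=np.linspace(0.0, 1.0, 21)):
    """One multiplicative update V <- (I + mu*W) V, with W a descent direction of
    sum_k ||off(V C_k V^T)||_F^2 and mu chosen by a brute-force line search.
    The matrices in Cs are assumed symmetric."""
    n = V.shape[0]
    G = np.zeros((n, n))
    for C in Cs:
        M = V @ C @ V.T
        G += 4.0 * offdiag(M) @ M        # gradient w.r.t. the multiplicative factor at W = 0
    W = -offdiag(G)                      # restrict the update to off-diagonal directions
    W /= np.linalg.norm(W) + 1e-12
    I = np.eye(n)
    def cost(mu):
        Vn = (I + mu * W) @ V
        return sum(np.sum(offdiag(Vn @ C @ Vn.T) ** 2) for C in Cs)
    mu = min(step_sizes, key=cost)       # the grid includes mu = 0, so the cost never increases
    return (I + mu * W) @ V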

49 citations


Journal ArticleDOI
TL;DR: The concept of BSS is reviewed and its usefulness in the context of event-related MEG measurements is demonstrated; an additional grouping of the BSS components reveals interesting structure that could ultimately be used for gaining a better physiological model of the data.
Abstract: Recently, blind source separation (BSS) methods have been highly successful when applied to biomedical data. This paper reviews the concept of BSS and demonstrates its usefulness in the context of event-related MEG measurements. In a first experiment we apply BSS to artifact identification of raw MEG data and discuss how the quality of the resulting independent component projections can be evaluated. The second part of our study considers averaged data of event-related magnetic fields. Here, it is particularly important to monitor and thus avoid possible overfitting due to limited sample size. A stability assessment of the BSS decomposition allows us to solve this task, and an additional grouping of the BSS components reveals interesting structure that could ultimately be used for gaining a better physiological model of the data.
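
As a rough illustration of the artifact-identification step (not the stability or grouping analysis), one can decompose the multichannel recording with an off-the-shelf ICA implementation and flag components whose time courses correlate with a reference artifact signal. scikit-learn's FastICA, the reference channel, and the threshold below are assumptions of this sketch, not necessarily the BSS method or criteria used in the paper.

import numpy as np
from sklearn.decomposition import FastICA

def flag_artifact_components(data, ref, n_components=20, corr_threshold=0.4):
    """data: channels x samples MEG array; ref: (samples,) reference artifact
    trace such as a simultaneously recorded ECG or EOG channel.
    Returns indices of ICA components correlating with the reference, and the model."""
    ica = FastICA(n_components=n_components, random_state=0)
    sources = ica.fit_transform(data.T).T          # components x samples
    corr = [abs(np.corrcoef(s, ref)[0, 1]) for s in sources]
    flagged = [i for i, c in enumerate(corr) if c > corr_threshold]
    return flagged, ica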

48 citations


Proceedings ArticleDOI
23 Aug 2004
TL;DR: A simple selection criterion for hyper-parameters in one-class classifiers (OCCs) is proposed, which makes use of the particular structure of the one-class problem to define the most complex classifier that can still reliably be trained on the data.
Abstract: Model selection in unsupervised learning is a hard problem. In this paper, a simple selection criterion for hyper-parameters in one-class classifiers (OCCs) is proposed. It makes use of the particular structure of the one-class problem. The main idea is that the complexity of the classifier is increased until the classifier becomes inconsistent on the target class. This defines the most complex classifier that can still reliably be trained on the data. Experiments indicate the usefulness of the approach.
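
A minimal sketch of the selection idea, using scikit-learn's OneClassSVM with an RBF kernel as the one-class classifier; the classifier family, the use of gamma as the complexity parameter, and the slack value are assumptions of this sketch rather than specifics from the paper.

import numpy as np
from sklearn.svm import OneClassSVM

def select_gamma(X_train, X_val, gammas, nu=0.1, slack=0.05):
    """Increase complexity (gamma) until the classifier becomes inconsistent on the
    target class, i.e. rejects clearly more held-out target data than the requested
    error rate nu; return the most complex gamma that was still consistent."""
    chosen = None
    for gamma in sorted(gammas):                   # small gamma = simple, large gamma = complex
        clf = OneClassSVM(kernel="rbf", nu=nu, gamma=gamma).fit(X_train)
        val_reject = np.mean(clf.predict(X_val) == -1)
        if val_reject <= nu + slack:
            chosen = gamma                         # still consistent on the target class
        else:
            break                                  # inconsistent: stop increasing complexity
    return chosen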

44 citations


Proceedings ArticleDOI
01 Jan 2004
TL;DR: Two directions in which brain-computer interfacing can be enhanced by exploiting the lateralized readiness potential are presented: for establishing a rapid response BCI system that can predict the laterality of upcoming finger movements before EMG onset even in time critical contexts, and to improve information transfer rates in the common BCI approach relying on imagined limb movements.
Abstract: To enhance human interaction with machines, research interest is growing to develop a 'brain-computer interface', which allows communication of a human with a machine only by use of brain signals. So far, the applicability of such an interface is strongly limited by low bit-transfer rates, slow response times and long training sessions for the subject. The Berlin Brain-Computer Interface (BBCI) project is guided by the idea to train a computer by advanced machine learning techniques both to improve classification performance and to reduce the need for subject training. In this paper we present two directions in which brain-computer interfacing can be enhanced by exploiting the lateralized readiness potential: (1) for establishing a rapid response BCI system that can predict the laterality of upcoming finger movements before EMG onset even in time critical contexts, and (2) to improve information transfer rates in the common BCI approach relying on imagined limb movements.

44 citations


Posted Content
TL;DR: In this article, the authors presented two new tools for the identification of faking interviewers in surveys, one based on Benford's Law, and the other exploiting the empirical observation that fakers most often produce answers with less variability than could be expected from the whole survey.
Abstract: This paper presents two new tools for the identification of faking interviewers in surveys. One method is based on Benford's Law, and the other exploits the empirical observation that fakers most often produce answers with less variability than could be expected from the whole survey. We focus on fabricated data, which were taken out of the survey before the data were disseminated in the German Socio-Economic Panel (SOEP). For two samples, the resulting rankings of the interviewers with respect to their cheating behavior are given. For both methods all of the evident fakers are identified.
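
The Benford-based screening can be sketched in a few lines: compare the distribution of first significant digits of an interviewer's nonzero numeric answers with Benford's law and rank interviewers by the resulting chi-square statistic. This covers only the Benford half of the paper's toolbox, and the exact test statistic and data handling used by the authors may differ.

import numpy as np

BENFORD = np.array([np.log10(1 + 1 / d) for d in range(1, 10)])

def first_digit(x):
    """First significant digit of a nonzero number."""
    x = abs(x)
    while x >= 10:
        x /= 10.0
    while x < 1:
        x *= 10.0
    return int(x)

def benford_chi2(values):
    """Chi-square statistic comparing observed first-digit frequencies with
    Benford's law; larger values are more suspicious."""
    digits = [first_digit(v) for v in values if v != 0]
    observed = np.array([digits.count(d) for d in range(1, 10)], dtype=float)
    expected = len(digits) * BENFORD
    return float(np.sum((observed - expected) ** 2 / expected))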

Journal ArticleDOI
TL;DR: It is underlined that the Fisher kernel should be viewed not as a heuristic but as a powerful statistical tool with well-controlled statistical properties.
Abstract: This letter analyzes the Fisher kernel from a statistical point of view. The Fisher kernel is a particularly interesting method for constructing a model of the posterior probability that makes intelligent use of unlabeled data (i.e., of the underlying data density). It is important to analyze and ultimately understand the statistical properties of the Fisher kernel. To this end, we first establish sufficient conditions that the constructed posterior model is realizable (i.e., it contains the true distribution). Realizability immediately leads to consistency results. Subsequently, we focus on an asymptotic analysis of the generalization error, which elucidates the learning curves of the Fisher kernel and how unlabeled data contribute to learning. We also point out that the squared or log loss is theoretically preferable to other losses such as the exponential loss when a linear classifier is used together with the Fisher kernel, because both the squared and the log loss yield consistent estimators. Therefore, this letter underlines that the Fisher kernel should be viewed not as a heuristic but as a powerful statistical tool with well-controlled statistical properties.
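
To make the object under study concrete, here is a toy Fisher kernel for a single univariate Gaussian fitted to unlabeled data: the Fisher score is the gradient of the log-likelihood with respect to the parameters, and the kernel is the score inner product weighted by the (here empirically estimated) Fisher information. The Gaussian model and empirical estimation are assumptions of this sketch; the letter's analysis applies to general probabilistic models.

import numpy as np

def fisher_scores(x, mu, sigma):
    """U(x) = grad_{mu, sigma} log N(x | mu, sigma^2), one row per sample."""
    x = np.asarray(x, dtype=float)
    d_mu = (x - mu) / sigma ** 2
    d_sigma = ((x - mu) ** 2 - sigma ** 2) / sigma ** 3
    return np.stack([d_mu, d_sigma], axis=1)

def fisher_kernel(x, y, mu, sigma):
    """K(x, y) = U(x)^T F^{-1} U(y), with the Fisher information F estimated
    from the scores of x (in practice F is often simply replaced by the identity)."""
    Ux, Uy = fisher_scores(x, mu, sigma), fisher_scores(y, mu, sigma)
    F = Ux.T @ Ux / len(Ux)
    return Ux @ np.linalg.solve(F, Uy.T)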

Journal ArticleDOI
TL;DR: This article derives an unbiased estimator of the expected squared error between SIC and the expected generalization error, and proposes determining the degree of regularization of SIC such that this estimator of the expected squared error is minimized.
Abstract: A well-known result by Stein (1956) shows that in particular situations, biased estimators can yield better parameter estimates than their generally preferred unbiased counterparts. This letter follows the same spirit, as we will stabilize the unbiased generalization error estimates by regularization and finally obtain more robust model selection criteria for learning. We trade a small bias against a larger variance reduction, which has the beneficial effect of being more precise on a single training set. We focus on the subspace information criterion (SIC), which is an unbiased estimator of the expected generalization error measured by the reproducing kernel Hilbert space norm. SIC can be applied to kernel regression, and it was shown in earlier experiments that a small regularization of SIC has a stabilizing effect. However, it remained open how to appropriately determine the degree of regularization in SIC. In this article, we derive an unbiased estimator of the expected squared error between SIC and the expected generalization error, and propose determining the degree of regularization of SIC such that this estimator of the expected squared error is minimized. Computer simulations with artificial and real data sets illustrate that the proposed method works effectively for improving the precision of SIC, especially in the high-noise-level cases. We furthermore compare the proposed method to the original SIC, cross-validation, and an empirical Bayesian method in ridge parameter selection, with good results.

Patent
17 Aug 2004
TL;DR: A method is presented for automatic online detection and classification of anomalous objects in a data stream: a geometric representation of normality (e.g., a hypersurface enclosing a finite number of normal objects) is constructed from the incoming objects, adapted online as further objects arrive, and used to classify received objects as normal or anomalous.
Abstract: The invention concerns a method for automatic online detection and classification of anomalous objects in a data stream, especially one comprising datasets and/or signals, characterized by a) the detection of at least one incoming data stream (1000) containing normal and anomalous objects, b) automatic construction (2100) of a geometric representation of normality (2200) from the incoming objects of the data stream (1000) at a time t1, subject to at least one predefined optimality condition, especially the construction of a hypersurface enclosing a finite number of normal objects, c) online adaptation of the geometric representation of normality (2200) with respect to at least one object received at a time t2 >= t1, the adaptation being subject to at least one predefined optimality condition, d) online determination of a normality classification (2300) for objects received at t2 with respect to the geometric representation of normality (2200), and e) automatic classification of normal and anomalous objects based on the generated normality classification (2300), together with generation of a data set describing the anomalous data for further processing, especially a visual representation.
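
As an illustration of the online workflow the claim describes (detect, build a normality model, adapt it, classify), the sketch below maintains only a running center of mass and a running mean distance. It is a deliberately crude stand-in, not the patented hypersurface-based method; the class name and threshold factor are invented for illustration.

import numpy as np

class OnlineCentroidDetector:
    """Toy online normality model: running center of mass plus a distance-based
    anomaly rule. Stands in for an adaptive geometric representation of normality
    (e.g. an enclosing hypersurface), which it does not implement."""

    def __init__(self, threshold_factor=3.0):
        self.n = 0
        self.center = None
        self.mean_dist = 0.0
        self.factor = threshold_factor

    def update_and_classify(self, x):
        x = np.asarray(x, dtype=float)
        if self.center is None:                      # the first object initializes the model
            self.n, self.center = 1, x.copy()
            return "normal"
        dist = np.linalg.norm(x - self.center)
        anomalous = self.mean_dist > 0 and dist > self.factor * self.mean_dist
        self.n += 1                                  # online adaptation of the model
        self.center += (x - self.center) / self.n
        self.mean_dist += (dist - self.mean_dist) / (self.n - 1)
        return "anomalous" if anomalous else "normal"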

Journal ArticleDOI
TL;DR: This work presents a new method that constructively injects noise to assess the reliability and the grouping structure of empirical ICA component estimates and demonstrates that the approach is useful for exploratory data analysis of real-world data.

Book ChapterDOI
22 Sep 2004
TL;DR: This work shows how a simple outlier index can be used directly to solve the ICA problem for super-Gaussian source signals; the resulting method is outlier-robust by construction and can be used for standard ICA as well as for over-complete ICA.
Abstract: Most ICA algorithms are sensitive to outliers. Instead of robustifying existing algorithms by outlier rejection techniques, we show how a simple outlier index can be used directly to solve the ICA problem for super-Gaussian source signals. This ICA method is outlier-robust by construction and can be used for standard ICA as well as for over-complete ICA (i.e. more source signals than observed signals (mixtures)).





Proceedings Article
01 Jan 2004
TL;DR: This paper proposes regularizing unbiased generalization error estimators for stabilization, trading a small bias in a model selection criterion against a larger variance reduction, which has the beneficial effect of being more precise on a single training set.
Abstract: A well-known result by Stein shows that regularized estimators with small bias often yield better estimates than unbiased estimators. In this paper, we adapt this spirit to model selection, and propose regularizing unbiased generalization error estimators for stabilization. We trade a small bias in a model selection criterion against a larger variance reduction which has the beneficial effect of being more precise on a single training set.