
Showing papers on "Artificial neural network published in 2004"


Journal ArticleDOI
TL;DR: Improvements to SignalP, the currently most popular method for prediction of classically secreted proteins, which consists of two different predictors based on neural network and hidden Markov model algorithms; both components have been updated.

6,492 citations


Proceedings ArticleDOI
25 Jul 2004
TL;DR: A new learning algorithm called extreme learning machine (ELM) for single-hidden layer feedforward neural networks (SLFNs) which randomly chooses the input weights and analytically determines the output weights of SLFNs is proposed.
Abstract: It is clear that the learning speed of feedforward neural networks is in general far slower than required, and this has been a major bottleneck in their applications for past decades. Two key reasons behind this may be: 1) the slow gradient-based learning algorithms are extensively used to train neural networks, and 2) all the parameters of the networks are tuned iteratively by using such learning algorithms. Unlike these traditional implementations, this paper proposes a new learning algorithm called extreme learning machine (ELM) for single-hidden layer feedforward neural networks (SLFNs) which randomly chooses the input weights and analytically determines the output weights of SLFNs. In theory, this algorithm tends to provide the best generalization performance at extremely fast learning speed. The experimental results based on real-world benchmarking function approximation and classification problems, including large complex applications, show that the new algorithm can produce the best generalization performance in some cases and can learn much faster than traditional popular learning algorithms for feedforward neural networks.
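For orientation, a minimal NumPy sketch of the ELM recipe described above (random hidden-layer weights, output weights solved by least squares); the function names, sigmoid activation, and seed are illustrative assumptions, not from the paper:

    import numpy as np

    def elm_train(X, T, n_hidden, seed=0):
        """Fit a single-hidden-layer feedforward net the ELM way:
        hidden weights are random, output weights are solved analytically."""
        rng = np.random.default_rng(seed)
        W = rng.normal(size=(X.shape[1], n_hidden))   # random input weights
        b = rng.normal(size=n_hidden)                 # random hidden biases
        H = 1.0 / (1.0 + np.exp(-(X @ W + b)))        # hidden-layer activations
        beta = np.linalg.pinv(H) @ T                  # least-squares output weights
        return W, b, beta

    def elm_predict(X, W, b, beta):
        H = 1.0 / (1.0 + np.exp(-(X @ W + b)))
        return H @ beta

Because no iterative tuning is involved, training cost is essentially one matrix pseudo-inverse, which is where the speed advantage over gradient-based training comes from.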

3,643 citations


Journal ArticleDOI
TL;DR: This paper addresses the problem of the classification of hyperspectral remote sensing images by support vector machines by understanding and assessing the potentialities of SVM classifiers in hyperdimensional feature spaces and concludes that SVMs are a valid and effective alternative to conventional pattern recognition approaches.
Abstract: This paper addresses the problem of the classification of hyperspectral remote sensing images by support vector machines (SVMs). First, we propose a theoretical discussion and experimental analysis aimed at understanding and assessing the potentialities of SVM classifiers in hyperdimensional feature spaces. Then, we assess the effectiveness of SVMs with respect to conventional feature-reduction-based approaches and their performances in hypersubspaces of various dimensionalities. To sustain such an analysis, the performances of SVMs are compared with those of two other nonparametric classifiers (i.e., radial basis function neural networks and the K-nearest neighbor classifier). Finally, we study the potentially critical issue of applying binary SVMs to multiclass problems in hyperspectral data. In particular, four different multiclass strategies are analyzed and compared: the one-against-all, the one-against-one, and two hierarchical tree-based strategies. Different performance indicators have been used to support our experimental studies in a detailed and accurate way, i.e., the classification accuracy, the computational time, the stability to parameter setting, and the complexity of the multiclass architecture. The results obtained on a real Airborne Visible/Infrared Imaging Spectroradiometer hyperspectral dataset allow us to conclude that, whatever the multiclass strategy adopted, SVMs are a valid and effective alternative to conventional pattern recognition approaches (feature-reduction procedures combined with a classification method) for the classification of hyperspectral remote sensing data.
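As a hedged illustration of the one-against-all and one-against-one strategies compared above, a small scikit-learn sketch; the Iris data and the kernel parameters are placeholders, not the AVIRIS dataset or settings used in the paper:

    from sklearn.datasets import load_iris
    from sklearn.model_selection import train_test_split
    from sklearn.multiclass import OneVsRestClassifier, OneVsOneClassifier
    from sklearn.svm import SVC

    X, y = load_iris(return_X_y=True)
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

    # One-against-all: one binary SVM per class.
    ova = OneVsRestClassifier(SVC(kernel="rbf", C=10, gamma="scale")).fit(X_tr, y_tr)
    # One-against-one: one binary SVM per pair of classes.
    ovo = OneVsOneClassifier(SVC(kernel="rbf", C=10, gamma="scale")).fit(X_tr, y_tr)

    print("OvA accuracy:", ova.score(X_te, y_te))
    print("OvO accuracy:", ovo.score(X_te, y_te))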

3,607 citations


Book
28 Jun 2004
TL;DR: A tutorial on random matrices that provides an overview of the theory and brings together in one source the most significant results recently obtained.
Abstract: Random matrix theory has found many applications in physics, statistics and engineering since its inception. Although early developments were motivated by practical experimental problems, random matrices are now used in fields as diverse as the Riemann hypothesis, stochastic differential equations, condensed matter physics, statistical physics, chaotic systems, numerical linear algebra, neural networks, multivariate statistics, information theory, signal processing and small-world networks. This article provides a tutorial on random matrices which provides an overview of the theory and brings together in one source the most significant results recently obtained. Furthermore, the application of random matrix theory to the fundamental limits of wireless communication channels is described in depth.

2,308 citations


Book
30 Aug 2004
TL;DR: artificial neural networks, artificial neural networks, مرکز فناوری اطلاعات و اطلاع رسانی کشاورزی (Agricultural Information Technology and Dissemination Center).
Abstract: artificial neural networks, artificial neural networks, مرکز فناوری اطلاعات و اطلاع رسانی کشاورزی (Agricultural Information Technology and Dissemination Center).

2,254 citations


Journal ArticleDOI
TL;DR: This work describes a new analytical prescription for setting the value of the insensitive-zone parameter ε as a function of training sample size, and compares the generalization performance of SVM regression under sparse sample settings with regression using 'least-modulus' loss (ε = 0) and standard squared loss.

1,781 citations


Proceedings ArticleDOI
Andrew Y. Ng
04 Jul 2004
TL;DR: A lower bound is given showing that any rotationally invariant algorithm---including logistic regression with L2 regularization, SVMs, and neural networks trained by backpropagation---has a worst-case sample complexity that grows at least linearly in the number of irrelevant features.
Abstract: We consider supervised learning in the presence of very many irrelevant features, and study two different regularization methods for preventing overfitting. Focusing on logistic regression, we show that using L1 regularization of the parameters, the sample complexity (i.e., the number of training examples required to learn "well") grows only logarithmically in the number of irrelevant features. This logarithmic rate matches the best known bounds for feature selection, and indicates that L1-regularized logistic regression can be effective even if there are exponentially more irrelevant features than training examples. We also give a lower bound showing that any rotationally invariant algorithm---including logistic regression with L2 regularization, SVMs, and neural networks trained by backpropagation---has a worst-case sample complexity that grows at least linearly in the number of irrelevant features.
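A hedged scikit-learn sketch of the phenomenon the abstract describes: with few examples and many irrelevant features, L1-regularized logistic regression tends to hold up better than L2. The synthetic data, feature counts, and regularization strength are illustrative only, not the paper's experiments:

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    rng = np.random.default_rng(0)
    n_train, n_test, n_relevant, n_irrelevant = 100, 2000, 5, 500

    def make_data(n):
        X = rng.normal(size=(n, n_relevant + n_irrelevant))
        w = np.zeros(n_relevant + n_irrelevant)
        w[:n_relevant] = 1.0                      # only the first few features matter
        y = (X @ w + 0.1 * rng.normal(size=n) > 0).astype(int)
        return X, y

    X_tr, y_tr = make_data(n_train)
    X_te, y_te = make_data(n_test)

    l1 = LogisticRegression(penalty="l1", solver="liblinear", C=1.0).fit(X_tr, y_tr)
    l2 = LogisticRegression(penalty="l2", solver="liblinear", C=1.0).fit(X_tr, y_tr)
    print("L1 test accuracy:", l1.score(X_te, y_te))
    print("L2 test accuracy:", l2.score(X_te, y_te))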

1,742 citations


Book ChapterDOI
29 Sep 2004
TL;DR: A method for detecting changes in the probability distribution of examples by monitoring the online error rate of the learning algorithm; the detection method is independent of the learning algorithm used.
Abstract: Most of the work in machine learning assumes that examples are generated at random according to some stationary probability distribution. In this work we study the problem of learning when the distribution that generates the examples changes over time. We present a method for detecting changes in the probability distribution of examples. The idea behind the drift detection method is to control the online error rate of the algorithm. The training examples are presented in sequence. When a new training example is available, it is classified using the current model. Statistical theory guarantees that while the distribution is stationary, the error will decrease. When the distribution changes, the error will increase. The method controls the trace of the online error of the algorithm. For the current context we define a warning level and a drift level. A new context is declared if, in a sequence of examples, the error increases, reaching the warning level at example k_w and the drift level at example k_d. This is an indication of a change in the distribution of the examples. The algorithm learns a new model using only the examples since k_w. The method was tested with a set of eight artificial datasets and a real-world dataset. We used three learning algorithms: a perceptron, a neural network and a decision tree. The experimental results show good performance in detecting drift and in learning the new concept. We also observe that the method is independent of the learning algorithm.
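A minimal sketch of this kind of error-rate monitoring. The two- and three-standard-deviation thresholds for the warning and drift levels, and the 30-example warm-up, are assumptions commonly used with this method rather than values quoted from the paper:

    import math

    class DriftDetector:
        """Track the online error rate p and its std s; signal warning/drift
        when p + s rises sufficiently above the best (p_min + s_min) seen so far."""
        def __init__(self):
            self.n, self.errors = 0, 0
            self.p_s_min = float("inf")

        def update(self, misclassified):
            self.n += 1
            self.errors += int(misclassified)
            p = self.errors / self.n
            s = math.sqrt(p * (1 - p) / self.n)
            if self.n < 30:                    # too few examples for reliable statistics
                return "stable"
            if p + s < self.p_s_min:           # new best operating point
                self.p_min, self.s_min, self.p_s_min = p, s, p + s
            if p + s >= self.p_min + 3 * self.s_min:
                return "drift"                 # relearn from the examples buffered since the warning
            if p + s >= self.p_min + 2 * self.s_min:
                return "warning"               # start buffering examples from here
            return "stable"

The detector only sees whether each prediction was right or wrong, which is why it is independent of the underlying learning algorithm.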

1,256 citations


Journal ArticleDOI
01 Sep 2004
TL;DR: A relatively new machine learning technique, support vector machines (SVM), is introduced to the problem in an attempt to provide a model with better explanatory power, and the relative importance of the input financial variables is obtained from the neural network models.
Abstract: Corporate credit rating analysis has attracted considerable research interest in the literature. Recent studies have shown that Artificial Intelligence (AI) methods achieved better performance than traditional statistical methods. This article introduces a relatively new machine learning technique, support vector machines (SVM), to the problem in an attempt to provide a model with better explanatory power. We used a backpropagation neural network (BNN) as a benchmark and obtained prediction accuracy around 80% for both BNN and SVM methods for the United States and Taiwan markets. However, only a slight improvement of SVM was observed. Another direction of the research is to improve the interpretability of the AI-based models. We applied recent research results in neural network model interpretation and obtained the relative importance of the input financial variables from the neural network models. Based on these results, we conducted a market comparative analysis on the differences of determining factors in the United States and Taiwan markets.

962 citations


Journal ArticleDOI
01 Apr 2004
TL;DR: An evolutionary approach that automates the design of recurrent neural/fuzzy networks using a new evolutionary learning algorithm based on a hybrid of genetic algorithm (GA) and particle swarm optimization (PSO), and thus called HGAPSO.
Abstract: An evolutionary recurrent network which automates the design of recurrent neural/fuzzy networks using a new evolutionary learning algorithm is proposed in this paper. This new evolutionary learning algorithm is based on a hybrid of genetic algorithm (GA) and particle swarm optimization (PSO), and is thus called HGAPSO. In HGAPSO, individuals in a new generation are created, not only by crossover and mutation operation as in GA, but also by PSO. The concept of elite strategy is adopted in HGAPSO, where the upper-half of the best-performing individuals in a population are regarded as elites. However, instead of being reproduced directly to the next generation, these elites are first enhanced. The group constituted by the elites is regarded as a swarm, and each elite corresponds to a particle within it. In this regard, the elites are enhanced by PSO, an operation which mimics the maturing phenomenon in nature. These enhanced elites constitute half of the population in the new generation, whereas the other half is generated by performing crossover and mutation operation on these enhanced elites. HGAPSO is applied to recurrent neural/fuzzy network design as follows. For recurrent neural network, a fully connected recurrent neural network is designed and applied to a temporal sequence production problem. For recurrent fuzzy network design, a Takagi-Sugeno-Kang-type recurrent fuzzy network is designed and applied to dynamic plant control. The performance of HGAPSO is compared to both GA and PSO in these recurrent networks design problems, demonstrating its superiority.
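A schematic sketch of one HGAPSO generation as described above: the better half of the population is treated as a swarm of elites and moved by a PSO step, and crossover plus mutation on those enhanced elites produce the other half. The operators, coefficients, and the simplified velocity rule are illustrative assumptions, not the paper's exact settings:

    import numpy as np

    rng = np.random.default_rng(0)

    def hgapso_generation(pop, vel, fitness, w=0.7, c=1.5, sigma=0.1):
        """One HGAPSO generation (minimisation), assuming an even population size."""
        n, d = pop.shape
        order = np.argsort([fitness(x) for x in pop])
        elites, elite_vel = pop[order[: n // 2]].copy(), vel[order[: n // 2]].copy()

        gbest = elites[0]                                       # best-so-far individual
        r = rng.random(elites.shape)
        elite_vel = w * elite_vel + c * r * (gbest - elites)    # (full PSO also uses personal bests)
        elites = elites + elite_vel                             # PSO-enhanced elites

        children = np.empty_like(elites)
        for i in range(len(children)):
            a = elites[rng.integers(len(elites))]
            b = elites[rng.integers(len(elites))]
            alpha = rng.random(d)
            children[i] = alpha * a + (1 - alpha) * b           # arithmetic crossover
        children += sigma * rng.normal(size=children.shape)     # Gaussian mutation

        new_pop = np.vstack([elites, children])
        new_vel = np.vstack([elite_vel, np.zeros_like(elite_vel)])
        return new_pop, new_vel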

961 citations


Journal ArticleDOI
01 Feb 2004
TL;DR: An approach to the online learning of Takagi-Sugeno (TS) type models is proposed, based on a novel learning algorithm that recursively updates TS model structure and parameters by combining supervised and unsupervised learning.
Abstract: An approach to the online learning of Takagi-Sugeno (TS) type models is proposed in the paper. It is based on a novel learning algorithm that recursively updates TS model structure and parameters by combining supervised and unsupervised learning. The rule-base and parameters of the TS model continually evolve by adding new rules with more summarization power and by modifying existing rules and parameters. In this way, the rule-base structure is inherited and updated when new data become available. By applying this learning concept to the TS model we arrive at a new type of adaptive model called the Evolving Takagi-Sugeno model (ETS). The adaptive nature of these evolving TS models, in combination with the highly transparent and compact form of fuzzy rules, makes them a promising candidate for online modeling and control of complex processes, competitive to neural networks. The approach has been tested on data from an air-conditioning installation serving a real building. The results illustrate the viability and efficiency of the approach. The proposed concept, however, has significantly wider implications in a number of fields, including adaptive nonlinear control, fault detection and diagnostics, performance analysis, forecasting, knowledge extraction, robotics, and behavior modeling.

Journal ArticleDOI
TL;DR: The proposed LPR technique consists of two main modules: a license plate locating module and a license number identification module, the former characterized by fuzzy disciplines attempts to extract license plates from an input image, while the latter conceptualized in terms of neural subjects aims to identify the number present in alicense plate.
Abstract: Automatic license plate recognition (LPR) plays an important role in numerous applications and a number of techniques have been proposed. However, most of them worked under restricted conditions, such as fixed illumination, limited vehicle speed, designated routes, and stationary backgrounds. In this study, as few constraints as possible on the working environment are considered. The proposed LPR technique consists of two main modules: a license plate locating module and a license number identification module. The former, characterized by fuzzy disciplines, attempts to extract license plates from an input image, while the latter, conceptualized in terms of neural subjects, aims to identify the number present in a license plate. Experiments have been conducted for the respective modules. In the experiment on locating license plates, 1088 images taken from various scenes and under different conditions were employed. Of these, the license plates in 23 images failed to be located; the license plate location success rate is 97.9%. In the experiment on identifying license numbers, the 1065 images from which license plates had been successfully located were used. Of these, the numbers of the license plates in 47 images failed to be identified; the identification success rate is 95.6%. Combining the above two rates, the overall success rate for our LPR algorithm is 93.7%.

Journal ArticleDOI
TL;DR: A robust comparison of different methodologies for assessing variable importance in neural networks that can be generalized to other data and from which valid recommendations can be made for future studies is provided.

Journal ArticleDOI
TL;DR: This work combined confocal imaging and dual-site focal synaptic stimulation of identified thin dendrites in rat neocortical pyramidal neurons and found that nearby inputs on the same branch summed sigmoidally, whereas widely separated inputs or inputs to different branches summed linearly.
Abstract: The thin basal and oblique dendrites of cortical pyramidal neurons receive most of the synaptic inputs from other cells, but their integrative properties remain uncertain. Previous studies have most often reported global linear or sublinear summation. An alternative view, supported by biophysical modeling studies, holds that thin dendrites provide a layer of independent computational 'subunits' that sigmoidally modulate their inputs prior to global summation. To distinguish these possibilities, we combined confocal imaging and dual-site focal synaptic stimulation of identified thin dendrites in rat neocortical pyramidal neurons. We found that nearby inputs on the same branch summed sigmoidally, whereas widely separated inputs or inputs to different branches summed linearly. This strong spatial compartmentalization effect is incompatible with a global summation rule and provides the first experimental support for a two-layer 'neural network' model of pyramidal neuron thin-branch integration. Our findings could have important implications for the computing and memory-related functions of cortical tissue.
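A toy numerical sketch of the two-layer 'neural network' interpretation these data support: inputs on the same thin branch pass through a sigmoidal subunit before the branch outputs are summed linearly toward the soma. All parameter values here are illustrative assumptions, not fits to the recordings:

    import numpy as np

    def branch_sigmoid(x, threshold=4.0, slope=1.0):
        # sigmoidal subunit nonlinearity applied within each thin branch (illustrative values)
        return 1.0 / (1.0 + np.exp(-slope * (x - threshold)))

    def somatic_response(inputs_per_branch):
        """inputs_per_branch: one array of synaptic drives per thin branch.
        Each branch combines its own inputs through a sigmoid; branch outputs then sum linearly."""
        return sum(branch_sigmoid(np.sum(branch)) for branch in inputs_per_branch)

    # Two inputs on the same branch combine supralinearly near the branch threshold...
    print(somatic_response([np.array([2.5, 2.5])]))
    # ...whereas the same inputs on different branches add up (sub)linearly at the soma.
    print(somatic_response([np.array([2.5]), np.array([2.5])]))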

BookDOI
01 Jan 2004
TL;DR: This chapter discusses reinforcement learning in large, high-dimensional state spaces, model-based adaptive critic designs, and applications of approximate dynamic programming in power systems control.
Abstract: Foreword. 1. ADP: goals, opportunities and principles. Part I: Overview. 2. Reinforcement learning and its relationship to supervised learning. 3. Model-based adaptive critic designs. 4. Guidance in the use of adaptive critics for control. 5. Direct neural dynamic programming. 6. The linear programming approach to approximate dynamic programming. 7. Reinforcement learning in large, high-dimensional state spaces. 8. Hierarchical decision making. Part II: Technical advances. 9. Improved temporal difference methods with linear function approximation. 10. Approximate dynamic programming for high-dimensional resource allocation problems. 11. Hierarchical approaches to concurrency, multiagency, and partial observability. 12. Learning and optimization - from a system theoretic perspective. 13. Robust reinforcement learning using integral-quadratic constraints. 14. Supervised actor-critic reinforcement learning. 15. BPTT and DAC - a common framework for comparison. Part III: Applications. 16. Near-optimal control via reinforcement learning. 17. Multiobjective control problems by reinforcement learning. 18. Adaptive critic based neural network for control-constrained agile missile. 19. Applications of approximate dynamic programming in power systems control. 20. Robust reinforcement learning for heating, ventilation, and air conditioning control of buildings. 21. Helicopter flight control using direct neural dynamic programming. 22. Toward dynamic stochastic optimal power flow. 23. Control, optimization, security, and self-healing of benchmark power systems.

Journal ArticleDOI
TL;DR: A thorough analysis of nuclear export signals and a prediction server is presented, which shows that the most important properties of NESs are accessibility and flexibility allowing relevant proteins to interact with the signal.
Abstract: We present a thorough analysis of nuclear export signals and a prediction server, which we have made publicly available. The machine learning prediction method is a significant improvement over the generally used consensus patterns. Nuclear export signals (NESs) are extremely important regulators of the subcellular location of proteins. This regulation has an impact on transcription and other nuclear processes, which are fundamental to the viability of the cell. NESs are studied in relation to cancer, the cell cycle, cell differentiation and other important aspects of molecular biology. Our conclusion from this analysis is that the most important properties of NESs are accessibility and flexibility, allowing relevant proteins to interact with the signal. Furthermore, we show that it is not only the known hydrophobic residues that are important in defining a nuclear export signal. We employ both neural networks and hidden Markov models in the prediction algorithm and verify the method on the most recently discovered NESs. The NES predictor (NetNES) is made available for general use at http://www.cbs.dtu.dk/.

Journal ArticleDOI
TL;DR: It is shown that only near the critical boundary can recurrent networks of threshold gates perform complex computations on time series, which strongly supports conjectures that dynamical systems that are capable of doing complex computational tasks should operate near the edge of chaos.
Abstract: Depending on the connectivity, recurrent networks of simple computational units can show very different types of dynamics, ranging from totally ordered to chaotic. We analyze how the type of dynamics (ordered or chaotic) exhibited by randomly connected networks of threshold gates driven by a time-varying input signal depends on the parameters describing the distribution of the connectivity matrix. In particular, we calculate the critical boundary in parameter space where the transition from ordered to chaotic dynamics takes place. Employing a recently developed framework for analyzing real-time computations, we show that only near the critical boundary can such networks perform complex computations on time series. Hence, this result strongly supports conjectures that dynamical systems that are capable of doing complex computational tasks should operate near the edge of chaos, that is, the transition from ordered to chaotic dynamics.
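A small sketch of the kind of randomly connected threshold-gate network analyzed above, driven by a time-varying input; the connectivity statistics, gate definition, and sizes here are illustrative assumptions rather than the paper's parameterization:

    import numpy as np

    rng = np.random.default_rng(0)
    N, T = 200, 100
    mean_w, std_w = 0.0, 1.0 / np.sqrt(N)       # connectivity statistics (illustrative)

    W = rng.normal(mean_w, std_w, size=(N, N))  # random recurrent connectivity matrix
    w_in = rng.normal(size=N)                   # input weights
    u = rng.choice([-1.0, 1.0], size=T)         # time-varying binary input signal

    x = np.ones(N)                              # threshold-gate states in {-1, +1}
    trajectory = []
    for t in range(T):
        x = np.where(W @ x + w_in * u[t] >= 0, 1.0, -1.0)   # each gate thresholds its weighted input
        trajectory.append(x.copy())

    # Sweeping mean_w and std_w moves the network between ordered and chaotic regimes;
    # the paper's point is that nontrivial real-time computations on u work best near that boundary.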

Journal ArticleDOI
TL;DR: This paper introduces support vector machines (SVM), the latest neural network algorithm, to wind speed prediction and compares their performance with that of multilayer perceptron (MLP) neural networks.

Journal ArticleDOI
01 Feb 2004
TL;DR: The aim of this work is to design classifiers that accept the model-based representation of unseen images and produce an estimate of the age of the person in the corresponding face image; the results indicate that machines can estimate the age of a person almost as reliably as humans.
Abstract: We describe a quantitative evaluation of the performance of different classifiers in the task of automatic age estimation. In this context, we generate a statistical model of facial appearance, which is subsequently used as the basis for obtaining a compact parametric description of face images. The aim of our work is to design classifiers that accept the model-based representation of unseen images and produce an estimate of the age of the person in the corresponding face image. For this application, we have tested different classifiers: a classifier based on the use of quadratic functions for modeling the relationship between face model parameters and age, a shortest distance classifier, and artificial neural network based classifiers. We also describe variations to the basic method where we use age-specific and/or appearance specific age estimation methods. In this context, we use age estimation classifiers for each age group and/or classifiers for different clusters of subjects within our training set. In those cases, part of the classification procedure is devoted to choosing the most appropriate classifier for the subject/age range in question, so that more accurate age estimates can be obtained. We also present comparative results concerning the performance of humans and computers in the task of age estimation. Our results indicate that machines can estimate the age of a person almost as reliably as humans.

Journal ArticleDOI
TL;DR: Results showed that the ANFIS forecasted flow series preserves the statistical properties of the original flow series, and a comparative analysis suggests that the proposed modeling approach outperforms ANNs and other traditional time series models in terms of computational speed, forecast errors, efficiency, peak flow estimation etc.

Journal ArticleDOI
TL;DR: This work investigates the shape of the likelihood function for this type of model, gives a simple condition on the nonlinearity ensuring that no non-global local maxima exist in the likelihood—leading to efficient algorithms for the computation of the maximum likelihood estimator—and discusses the implications for the form of the allowed nonlinearities.
Abstract: Recent work has examined the estimation of models of stimulus-driven neural activity in which some linear filtering process is followed by a nonlinear, probabilistic spiking stage. We analyze the estimation of one such model for which this nonlinear step is implemented by a known parametric function; the assumption that this function is known speeds the estimation process considerably. We investigate the shape of the likelihood function for this type of model, give a simple condition on the nonlinearity ensuring that no non-global local maxima exist in the likelihood—leading, in turn, to efficient algorithms for the computation of the maximum likelihood estimator—and discuss the implications for the form of the allowed nonlinearities. Finally, we note some interesting connections between the likelihood-based estimators and the classical spike-triggered average estimator, discuss some useful extensions of the basic model structure, and provide two novel applications to physiological data.
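For orientation, a hedged sketch of the kind of likelihood involved, assuming a Poisson (point-process) spiking model with linear filter k, stimulus x(t), spike times t_i, and a known nonlinearity f; the notation is ours, not the paper's:

    \[
      L(\mathbf{k}) \;=\; \sum_{i} \log f\!\big(\mathbf{k}\cdot\mathbf{x}(t_i)\big)
      \;-\; \int_{0}^{T} f\!\big(\mathbf{k}\cdot\mathbf{x}(t)\big)\,dt
    \]

A sufficient condition of the kind referred to above is that f be convex and log-concave in its argument; the log-likelihood then has no non-global local maxima in k, so simple ascent methods recover the maximum likelihood estimate.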

Journal ArticleDOI
TL;DR: The proposed feature selection scheme has been shown to provide more accurate defect classification with fewer feature inputs than using all features initially considered relevant, confirming its utility as an effective tool for machine health assessment.
Abstract: The sensitivity of various features that are characteristic of a machine defect may vary considerably under different operating conditions. Hence it is critical to devise a systematic feature selection scheme that provides guidance on choosing the most representative features for defect classification. This paper presents a feature selection scheme based on the principal component analysis (PCA) method. The effectiveness of the scheme was verified experimentally on a bearing test bed, using both supervised and unsupervised defect classification approaches. The objective of the study was to identify the severity level of bearing defects, where no a priori knowledge of the defect conditions was available. The proposed scheme has been shown to provide more accurate defect classification with fewer feature inputs than using all features initially considered relevant. The result confirms its utility as an effective tool for machine health assessment.
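The paper's exact selection rule is not spelled out here; as a hedged illustration, a common PCA-based scheme keeps the original features that load most heavily on the leading principal components. The feature matrix, component count, and cutoff are placeholders:

    import numpy as np
    from sklearn.decomposition import PCA
    from sklearn.preprocessing import StandardScaler

    def pca_feature_ranking(X, n_components=3):
        """Rank original features by the magnitude of their loadings
        on the leading principal components of the (standardized) data."""
        Xs = StandardScaler().fit_transform(X)
        pca = PCA(n_components=n_components).fit(Xs)
        # weight each component's loadings by its explained-variance ratio
        scores = np.abs(pca.components_.T) @ pca.explained_variance_ratio_
        return np.argsort(scores)[::-1]          # feature indices, most representative first

    # X = matrix of candidate vibration features (rows: samples, columns: features)
    # top = pca_feature_ranking(X)[:5]           # keep the five most representative features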

Journal ArticleDOI
TL;DR: Neural-network-based models for predicting bearing failures are developed: accelerated bearing tests are performed in which vibration information is collected from a number of bearings run until failure, and this information is used to train neural network models to predict bearing operating times.
Abstract: Maintenance of mechanical and rotational equipment often includes bearing inspection and/or replacement. Thus, it is important to identify current as well as future conditions of bearings to avoid unexpected failure. Most published research in this area is focused on diagnosing bearing faults. In contrast, this paper develops neural-network-based models for predicting bearing failures. An experimental setup is developed to perform accelerated bearing tests where vibration information is collected from a number of bearings that are run until failure. This information is then used to train neural network models to predict bearing operating times. Vibration data from a set of validation bearings are then applied to these network models. The resulting predictions are then used to estimate the bearing failure time. These predictions are then compared with the actual lives of the validation bearings and errors are computed to evaluate the effectiveness of each model. For the best model, we find that 64% of predictions are within 10% of actual bearing life, while 92% of predictions are within 20% of the actual life.

Journal ArticleDOI
TL;DR: A study is presented to compare the performance of gear fault detection using artificial neural networks (ANNs) and support vector machines (SVMs); for most of the cases considered, the classification accuracy of SVM is better than that of ANN without GA.

Journal ArticleDOI
TL;DR: Autonomous, learning machines, based on neural networks, genetic algorithms and agent architectures, create a new situation, where the manufacturer/operator of the machine is in principle not capable of predicting the future machine behaviour any more, and thus cannot be held morally responsible or liable for it.
Abstract: Traditionally, the manufacturer/operator of a machine is held (morally and legally) responsible for the consequences of its operation. Autonomous, learning machines, based on neural networks, genetic algorithms and agent architectures, create a new situation, where the manufacturer/operator of the machine is in principle no longer capable of predicting the future machine behaviour, and thus cannot be held morally responsible or liable for it. Society must decide between not using this kind of machine any more (which is not a realistic option), or facing a responsibility gap, which cannot be bridged by traditional concepts of responsibility ascription.

Proceedings ArticleDOI
17 May 2004
TL;DR: A novel approach to the combination of acoustic features and language information for a most robust automatic recognition of a speaker's emotion by applying belief network based spotting for emotional key-phrases is introduced.
Abstract: In this paper we introduce a novel approach to the combination of acoustic features and language information for a most robust automatic recognition of a speaker's emotion. Seven discrete emotional states are classified throughout the work. Firstly, a model for the recognition of emotion by acoustic features is presented. The derived features of the signal, pitch, energy, and spectral contours are ranked by their quantitative contribution to the estimation of an emotion. Several different classification methods, including linear classifiers, Gaussian mixture models, neural nets, and support vector machines, are compared by their performance within this task. Secondly, an approach to emotion recognition from the spoken content is introduced, applying belief-network-based spotting for emotional key phrases. Finally, the two information sources are integrated in a soft decision fusion by using a neural net. The gain is evaluated and compared to other approaches. Two emotional speech corpora used for training and evaluation are described in detail, and the results achieved by applying the proposed novel approach to speaker emotion recognition are presented and discussed.

Journal ArticleDOI
Tom Verguts, Wim Fias
TL;DR: This article addresses the representation of numerical information conveyed by nonsymbolic and symbolic stimuli and presents a concrete proposal on the linkage between higher order numerical cognition and more primitive numerical abilities and generates specific predictions on the neural substrate of number processing.
Abstract: This article addresses the representation of numerical information conveyed by nonsymbolic and symbolic stimuli. In a first simulation study, we show how number-selective neurons develop when an initially uncommitted neural network is given nonsymbolic stimuli as input (e.g., collections of dots) under unsupervised learning. The resultant network is able to account for the distance and size effects, two ubiquitous effects in numerical cognition. Furthermore, the properties of the network units conform in detail to the characteristics of recently discovered number-selective neurons. In a second study, we simulate symbol learning by presenting symbolic and nonsymbolic input simultaneously. The same number-selective neurons learn to represent the numerical meaning of symbols. In doing so, they show properties reminiscent of the originally available number-selective neurons, but at the same time, the representational efficiency of the neurons is increased when presented with symbolic input. This finding presents a concrete proposal on the linkage between higher order numerical cognition and more primitive numerical abilities and generates specific predictions on the neural substrate of number processing.

Journal ArticleDOI
TL;DR: In this paper, the authors implemented the matrix multiplication of a neural network to enhance the time performance of a text detection system using an ATI RADEON 9700 PRO board, which produced a 20-fold performance enhancement.

Journal ArticleDOI
TL;DR: The CBR estimating model performed better than the MRA and neural network models with respect to long-term use, available information from results, and time versus accuracy trade-offs.

Book ChapterDOI
TL;DR: It is shown that the multi-swarm optimizer significantly outperforms single population PSO on this problem, and that multi-quantum swarms are superior to multi-charged swarms and SOS.
Abstract: Many real-world problems are dynamic, requiring an optimization algorithm which is able to continuously track a changing optimum over time. In this paper, we present new variants of Particle Swarm Optimization (PSO) specifically designed to work well in dynamic environments. The main idea is to extend the single population PSO and Charged Particle Swarm Optimization (CPSO) methods by constructing interacting multi-swarms. In addition, a new algorithmic variant, which broadens the implicit atomic analogy of CPSO to a quantum model, is introduced. The multi-swarm algorithms are tested on a multi-modal dynamic function – the moving peaks benchmark – and results are compared to the single population approach of PSO and CPSO, and to results obtained by a state-of-the-art evolutionary algorithm, namely self-organizing scouts (SOS). We show that our multi-swarm optimizer significantly outperforms single population PSO on this problem, and that multi-quantum swarms are superior to multi-charged swarms and SOS.
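A hedged sketch of the 'quantum' particle idea introduced above: instead of the standard velocity update, a quantum particle is repositioned uniformly at random inside a ball around its swarm's attractor. The sampling scheme, radius, and function names are illustrative assumptions, not the paper's exact formulation:

    import numpy as np

    rng = np.random.default_rng(0)

    def quantum_positions(swarm_best, n_particles, r_cloud):
        """Place each 'quantum' particle uniformly at random inside a ball of radius
        r_cloud centred on the swarm's attractor (its global best position)."""
        d = swarm_best.shape[0]
        directions = rng.normal(size=(n_particles, d))
        directions /= np.linalg.norm(directions, axis=1, keepdims=True)
        radii = r_cloud * rng.random(n_particles) ** (1.0 / d)   # uniform density in the ball
        return swarm_best + directions * radii[:, None]

    # e.g. respread a swarm's quantum particles around its current best after the peaks move:
    # new_pos = quantum_positions(best_position, n_particles=5, r_cloud=0.5)

Keeping part of each swarm spread out in this way maintains diversity around good regions, which is what lets the multi-swarm track optima that move over time.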