Showing papers on "Support vector machine published in 2016"

PDF

Open Access

Journal Article•DOI•

Struck: Structured Output Tracking with Kernels

[...]

Sam Hare, Stuart Golodetz¹, Amir Saffari, Vibhav Vineet², Ming-Ming Cheng³, Stephen Hicks¹, Philip H. S. Torr¹ - Show less +3 more•Institutions (3)

University of Oxford¹, Stanford University², Nankai University³

01 Oct 2016-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: A framework for adaptive visual object tracking based on structured output prediction that is able to outperform state-of-the-art trackers on various benchmark videos and can easily incorporate additional features and kernels into the framework, which results in increased tracking performance.

...read moreread less

Abstract: Adaptive tracking-by-detection methods are widely used in computer vision for tracking arbitrary objects. Current approaches treat the tracking problem as a classification task and use online learning techniques to update the object model. However, for these updates to happen one needs to convert the estimated object position into a set of labelled training examples, and it is not clear how best to perform this intermediate step. Furthermore, the objective for the classifier (label prediction) is not explicitly coupled to the objective for the tracker (estimation of object position). In this paper, we present a framework for adaptive visual object tracking based on structured output prediction. By explicitly allowing the output space to express the needs of the tracker, we avoid the need for an intermediate classification step. Our method uses a kernelised structured output support vector machine (SVM), which is learned online to provide adaptive tracking. To allow our tracker to run at high frame rates, we (a) introduce a budgeting mechanism that prevents the unbounded growth in the number of support vectors that would otherwise occur during tracking, and (b) show how to implement tracking on the GPU. Experimentally, we show that our algorithm is able to outperform state-of-the-art trackers on various benchmark videos. Additionally, we show that we can easily incorporate additional features and kernels into our framework, which results in increased tracking performance.

...read moreread less

1,507 citations

Posted Content•

Transferability in Machine Learning: from Phenomena to Black-Box Attacks using Adversarial Samples

[...]

Nicolas Papernot, Patrick McDaniel, Ian Goodfellow

24 May 2016-arXiv: Cryptography and Security

TL;DR: New transferability attacks between previously unexplored (substitute, victim) pairs of machine learning model classes, most notably SVMs and decision trees are introduced.

...read moreread less

Abstract: Many machine learning models are vulnerable to adversarial examples: inputs that are specially crafted to cause a machine learning model to produce an incorrect output Adversarial examples that affect one model often affect another model, even if the two models have different architectures or were trained on different training sets, so long as both models were trained to perform the same task An attacker may therefore train their own substitute model, craft adversarial examples against the substitute, and transfer them to a victim model, with very little information about the victim Recent work has further developed a technique that uses the victim model as an oracle to label a synthetic training set for the substitute, so the attacker need not even collect a training set to mount the attack We extend these recent techniques using reservoir sampling to greatly enhance the efficiency of the training procedure for the substitute model We introduce new transferability attacks between previously unexplored (substitute, victim) pairs of machine learning model classes, most notably SVMs and decision trees We demonstrate our attacks on two commercial machine learning classification systems from Amazon (9619% misclassification rate) and Google (8894%) using only 800 queries of the victim model, thereby showing that existing machine learning approaches are in general vulnerable to systematic black-box attacks regardless of their structure

...read moreread less

1,481 citations

Book Chapter•DOI•

Support Vector Machine

[...]

Shan Suthaharan¹•Institutions (1)

University of North Carolina at Greensboro¹

01 Jan 2016

TL;DR: This chapter simplifies the Lagrangian support vector machine approach using process diagrams and data flow diagrams to help readers understand theory and implement it successfully.

...read moreread less

Abstract: Support Vector Machine is one of the classical machine learning techniques that can still help solve big data classification problems. Especially, it can help the multidomain applications in a big data environment. However, the support vector machine is mathematically complex and computationally expensive. The main objective of this chapter is to simplify this approach using process diagrams and data flow diagrams to help readers understand theory and implement it successfully. To achieve this objective, the chapter is divided into three parts: (1) modeling of a linear support vector machine; (2) modeling of a nonlinear support vector machine; and (3) Lagrangian support vector machine algorithm and its implementations. The Lagrangian support vector machine with simple examples is also implemented using the R programming platform on Hadoop and non-Hadoop systems.

...read moreread less

938 citations

Journal Article•DOI•

High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning

[...]

Sarah M. Erfani¹, Sutharshan Rajasegarar¹, Shanika Karunasekera¹, Christopher Leckie¹•Institutions (1)

University of Melbourne¹

01 Oct 2016-Pattern Recognition

TL;DR: A hybrid model where an unsupervised DBN is trained to extract generic underlying features, and a one-class SVM is trained from the features learned by the DBN, which delivers a comparable accuracy with a deep autoencoder and is scalable and computationally efficient.

...read moreread less

876 citations

Journal Article•DOI•

Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree

[...]

Dieu Tien Bui¹, Dieu Tien Bui², Tran Anh Tuan³, Harald Klempe², Biswajeet Pradhan⁴, Inge Revhaug⁵ - Show less +2 more•Institutions (5)

Hanoi University of Mining and Geology¹, Telemark University College², Vietnam Academy of Science and Technology³, Universiti Putra Malaysia⁴, Norwegian University of Life Sciences⁵

01 Apr 2016-Landslides

TL;DR: This study introduces a framework for training and validation of shallow landslide susceptibility models by using the latest statistical methods and demonstrates the benefit of selecting the optimal machine learning techniques with proper conditioning selection method in shallow landslide susceptible mapping.

...read moreread less

Abstract: Preparation of landslide susceptibility maps is considered as the first important step in landslide risk assessments, but these maps are accepted as an end product that can be used for land use planning. The main objective of this study is to explore some new state-of-the-art sophisticated machine learning techniques and introduce a framework for training and validation of shallow landslide susceptibility models by using the latest statistical methods. The Son La hydropower basin (Vietnam) was selected as a case study. First, a landslide inventory map was constructed using the historical landslide locations from two national projects in Vietnam. A total of 12 landslide conditioning factors were then constructed from various data sources. Landslide locations were randomly split into a ratio of 70:30 for training and validating the models. To choose the best subset of conditioning factors, predictive ability of the factors were assessed using the Information Gain Ratio with 10-fold cross-validation technique. Factors with null predictive ability were removed to optimize the models. Subsequently, five landslide models were built using support vector machines (SVM), multi-layer perceptron neural networks (MLP Neural Nets), radial basis function neural networks (RBF Neural Nets), kernel logistic regression (KLR), and logistic model trees (LMT). The resulting models were validated and compared using the receive operating characteristic (ROC), Kappa index, and several statistical evaluation measures. Additionally, Friedman and Wilcoxon signed-rank tests were applied to confirm significant statistical differences among the five machine learning models employed in this study. Overall, the MLP Neural Nets model has the highest prediction capability (90.2 %), followed by the SVM model (88.7 %) and the KLR model (87.9 %), the RBF Neural Nets model (87.1 %), and the LMT model (86.1 %). Results revealed that both the KLR and the LMT models showed promising methods for shallow landslide susceptibility mapping. The result from this study demonstrates the benefit of selecting the optimal machine learning techniques with proper conditioning selection method in shallow landslide susceptibility mapping.

...read moreread less

861 citations

Journal Article•DOI•

Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks

[...]

Alexey Dosovitskiy¹, Philipp Fischer¹, Jost Tobias Springenberg¹, Martin Riedmiller¹, Thomas Brox¹ - Show less +1 more•Institutions (1)

University of Freiburg¹

01 Sep 2016-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: In this article, a set of surrogate classes are formed by applying a variety of transformations to a randomly sampled image patch, and the resulting feature representation is not class specific, but provides robustness to the transformations that have been applied during training.

...read moreread less

Abstract: Deep convolutional networks have proven to be very successful in learning task specific features that allow for unprecedented performance on various computer vision tasks. Training of such networks follows mostly the supervised learning paradigm, where sufficiently many input-output pairs are required for training. Acquisition of large training sets is one of the key challenges, when approaching a new task. In this paper, we aim for generic feature learning and present an approach for training a convolutional network using only unlabeled data. To this end, we train the network to discriminate between a set of surrogate classes. Each surrogate class is formed by applying a variety of transformations to a randomly sampled ‘seed’ image patch. In contrast to supervised network training, the resulting feature representation is not class specific. It rather provides robustness to the transformations that have been applied during training. This generic feature representation allows for classification results that outperform the state of the art for unsupervised learning on several popular datasets (STL-10, CIFAR-10, Caltech-101, Caltech-256). While features learned with our approach cannot compete with class specific features from supervised training on a classification task, we show that they are advantageous on geometric matching problems, where they also outperform the SIFT descriptor.

...read moreread less

702 citations

Journal Article•DOI•

Machine learning in geosciences and remote sensing

[...]

David J. Lary¹, Amir H. Alavi², Amir H. Gandomi², Annette L. Walker³•Institutions (3)

University of Texas at Dallas¹, Michigan State University², United States Naval Research Laboratory³

01 Jan 2016-Geoscience frontiers

TL;DR: The role of ML as an effective approach for solving problems in geosciences and remote sensing will be highlighted and unique features of some of the ML techniques will be outlined with a specific attention to genetic programming paradigm.

...read moreread less

Abstract: Learning incorporates a broad range of complex procedures. Machine learning (ML) is a subdivision of artificial intelligence based on the biological learning process. The ML approach deals with the design of algorithms to learn from machine readable data. ML covers main domains such as data mining, difficult-to-program applications, and software applications. It is a collection of a variety of algorithms (e.g. neural networks, support vector machines, self-organizing map, decision trees, random forests, case-based reasoning, genetic programming, etc.) that can provide multivariate, nonlinear, nonparametric regression or classification. The modeling capabilities of the ML-based methods have resulted in their extensive applications in science and engineering. Herein, the role of ML as an effective approach for solving problems in geosciences and remote sensing will be highlighted. The unique features of some of the ML techniques will be outlined with a specific attention to genetic programming paradigm. Furthermore, nonparametric regression and classification illustrative examples are presented to demonstrate the efficiency of ML for tackling the geosciences and remote sensing problems.

...read moreread less

701 citations

Journal Article•DOI•

DeepTox: Toxicity Prediction using Deep Learning

[...]

Andreas Mayr¹, Günter Klambauer¹, Thomas Unterthiner¹, Sepp Hochreiter¹•Institutions (1)

Johannes Kepler University of Linz¹

02 Feb 2016-Frontiers in Environmental Science

TL;DR: DeepTox had the highest performance of all computational methods winning the grand challenge, the nuclear receptor panel, the stress response panel, and six single assays (teams ``Bioinf@JKU'').

...read moreread less

Abstract: The Tox21 Data Challenge has been the largest effort of the scientific community to compare computational methods for toxicity prediction. This challenge comprised 12,000 environmental chemicals and drugs which were measured for 12 different toxic effects by specifically designed assays. We participated in this challenge to assess the performance of Deep Learning in computational toxicity prediction. Deep Learning has already revolutionized image processing, speech recognition, and language understanding but has not yet been applied to computational toxicity. Deep Learning is founded on novel algorithms and architectures for artificial neural networks together with the recent availability of very fast computers and massive datasets. It discovers multiple levels of distributed representations of the input, with higher levels representing more abstract concepts. We hypothesized that the construction of a hierarchy of chemical features gives Deep Learning the edge over other toxicity prediction methods. Furthermore, Deep Learning naturally enables multi-task learning, that is, learning of all toxic effects in one neural network and thereby learning of highly informative chemical features. In order to utilize Deep Learning for toxicity prediction, we have developed the DeepTox pipeline. First, DeepTox normalizes the chemical representations of the compounds. Then it computes a large number of chemical descriptors that are used as input to machine learning methods. In its next step, DeepTox trains models, evaluates them, and combines the best of them to ensembles. Finally, DeepTox predicts the toxicity of new compounds. In the Tox21 Data Challenge, DeepTox had the highest performance of all computational methods winning the grand challenge, the nuclear receptor panel, the stress response panel, and six single assays (teams ``Bioinf@JKU''). We found that Deep Learning excelled in toxicity prediction and outperformed many other computational approaches like naive Bayes, support vector machines, and random forests.

...read moreread less

622 citations

Journal Article•DOI•

Hierarchical adaptive deep convolution neural network and its application to bearing fault diagnosis

[...]

Xiaojie Guo¹, Liang Chen¹, Changqing Shen¹•Institutions (1)

Soochow University (Suzhou)¹

01 Nov 2016-Measurement

TL;DR: A novel hierarchical learning rate adaptive deep convolution neural network based on an improved algorithm that is well suited to the fault-diagnosis model and superior to other existing methods is proposed.

...read moreread less

592 citations

Proceedings Article•DOI•

Ordinal Regression with Multiple Output CNN for Age Estimation

[...]

Zhenxing Niu¹, Mo Zhou¹, Le Wang², Xinbo Gao¹, Gang Hua³ - Show less +1 more•Institutions (3)

Xidian University¹, Xi'an Jiaotong University², Microsoft³

01 Jun 2016

TL;DR: This paper proposes an End-to-End learning approach to address ordinal regression problems using deep Convolutional Neural Network, which could simultaneously conduct feature learning and regression modeling, and achieves the state-of-the-art performance on both the MORPH and AFAD datasets.

...read moreread less

Abstract: To address the non-stationary property of aging patterns, age estimation can be cast as an ordinal regression problem. However, the processes of extracting features and learning a regression model are often separated and optimized independently in previous work. In this paper, we propose an End-to-End learning approach to address ordinal regression problems using deep Convolutional Neural Network, which could simultaneously conduct feature learning and regression modeling. In particular, an ordinal regression problem is transformed into a series of binary classification sub-problems. And we propose a multiple output CNN learning algorithm to collectively solve these classification sub-problems, so that the correlation between these tasks could be explored. In addition, we publish an Asian Face Age Dataset (AFAD) containing more than 160K facial images with precise age ground-truths, which is the largest public age dataset to date. To the best of our knowledge, this is the first work to address ordinal regression problems by using CNN, and achieves the state-of-the-art performance on both the MORPH and AFAD datasets.

...read moreread less

562 citations

Proceedings Article•DOI•

Understanding Data Augmentation for Classification: When to Warp?

[...]

Sebastien Wong¹, Adam Gatt², Victor Stamatescu³, Mark D. McDonnell⁴•Institutions (4)

Defence Science and Technology Organisation¹, Australian Department of Defence², University of Adelaide³, University of South Australia⁴

28 Sep 2016

TL;DR: In this article, the authors investigate the benefit of augmenting data with synthetically created samples when training a machine learning classifier, and they find that if plausible transforms for the data are known then augmentation in data-space provides a greater benefit for improving performance and reducing overfitting.

...read moreread less

Abstract: In this paper we investigate the benefit of augmenting data with synthetically created samples when training a machine learning classifier. Two approaches for creating additional training samples are data warping, which generates additional samples through transformations applied in the data-space, and synthetic over-sampling, which creates additional samples in feature-space. We experimentally evaluate the benefits of data augmentation for a convolutional backpropagation-trained neural network, a convolutional support vector machine and a convolutional extreme learning machine classifier, using the standard MNIST handwritten digit dataset. We found that while it is possible to perform generic augmentation in feature-space, if plausible transforms for the data are known then augmentation in data-space provides a greater benefit for improving performance and reducing overfitting.

...read moreread less

Journal Article•DOI•

Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis

[...]

Hiba Asri¹, Hajar Mousannif¹, Hassan Al Moatassime¹, Thomas Noel²•Institutions (2)

Cadi Ayyad University¹, University of Strasbourg²

01 Jan 2016-Procedia Computer Science

TL;DR: A performance comparison between different machine learning algorithms: Support Vector Machine (SVM), Decision Tree (C4.5), Naive Bayes (NB) and k Nearest Neighbors (k-NN) on the Wisconsin Breast Cancer datasets is conducted and Experimental results show that SVM gives the highest accuracy with lowest error rate.

...read moreread less

Proceedings Article•

Oblivious multi-party machine learning on trusted processors

[...]

Olga Ohrimenko¹, Felix Schuster¹, Cédric Fournet¹, Aastha Mehta¹, Sebastian Nowozin¹, Kapil Vaswani¹, Manuel Costa¹ - Show less +3 more•Institutions (1)

Microsoft¹

10 Aug 2016

TL;DR: This work proposes data-oblivious machine learning algorithms for support vector machines, matrix factorization, neural networks, decision trees, and k-means clustering and shows that their efficient implementation based on Intel Skylake processors scales up to large, realistic datasets, with overheads several orders of magnitude lower than with previous approaches.

...read moreread less

Abstract: Privacy-preserving multi-party machine learning allows multiple organizations to perform collaborative data analytics while guaranteeing the privacy of their individual datasets. Using trusted SGX-processors for this task yields high performance, but requires a careful selection, adaptation, and implementation of machine-learning algorithms to provably prevent the exploitation of any side channels induced by data-dependent access patterns. We propose data-oblivious machine learning algorithms for support vector machines, matrix factorization, neural networks, decision trees, and k-means clustering. We show that our efficient implementation based on Intel Skylake processors scales up to large, realistic datasets, with overheads several orders of magnitude lower than with previous approaches based on advanced cryptographic multi-party computation schemes.

...read moreread less

Journal Article•DOI•

Evolving support vector machines using fruit fly optimization for medical data classification

[...]

Liming Shen¹, Huiling Chen², Zhe Yu¹, Wenchang Kang¹, Bingyu Zhang¹, Huaizhong Li¹, Bo Yang², Dayou Liu² - Show less +4 more•Institutions (2)

Wenzhou University¹, Jilin University²

15 Mar 2016-Knowledge Based Systems

TL;DR: The empirical results demonstrate that the proposed FOA-SVM method can obtain much more appropriate model parameters as well as significantly reduce the computational time, which generates a high classification accuracy.

...read moreread less

Abstract: In this paper, a new support vector machines (SVM) parameter tuning scheme that uses the fruit fly optimization algorithm (FOA) is proposed. Termed as FOA-SVM, the scheme is successfully applied to medical diagnosis. In the proposed FOA-SVM, the FOA technique effectively and efficiently addresses the parameter set in SVM. Additionally, the effectiveness and efficiency of FOA-SVM is rigorously evaluated against four well-known medical datasets, including the Wisconsin breast cancer dataset, the Pima Indians diabetes dataset, the Parkinson dataset, and the thyroid disease dataset, in terms of classification accuracy, sensitivity, specificity, AUC (the area under the receiver operating characteristic (ROC) curve) criterion, and processing time. Four competitive counterparts are employed for comparison purposes, including the particle swarm optimization algorithm-based SVM (PSO-SVM), genetic algorithm-based SVM (GA-SVM), bacterial forging optimization-based SVM (BFO-SVM), and grid search technique-based SVM (Grid-SVM). The empirical results demonstrate that the proposed FOA-SVM method can obtain much more appropriate model parameters as well as significantly reduce the computational time, which generates a high classification accuracy. Promisingly, the proposed method can be regarded as a useful clinical tool for medical decision making.

...read moreread less

Journal Article•DOI•

Ensemble of keyword extraction methods and classifiers in text classification

[...]

Aytuğ Onan¹, Serdar Korukoglu², Hasan Bulut²•Institutions (2)

Celal Bayar University¹, Ege University²

15 Sep 2016-Expert Systems With Applications

TL;DR: The empirical analysis indicates that the utilization of keyword-based representation of text documents in conjunction with ensemble learning can enhance the predictive performance and scalability ofText classification schemes, which is of practical importance in the application fields of text classification.

...read moreread less

Abstract: Text classification is a domain with high dimensional feature space.Extracting the keywords as the features can be extremely useful in text classification.An empirical analysis of five statistical keyword extraction methods.A comprehensive analysis of classifier and keyword extraction ensembles.For ACM collection, a classification accuracy of 93.80% with Bagging ensemble of Random Forest. Automatic keyword extraction is an important research direction in text mining, natural language processing and information retrieval. Keyword extraction enables us to represent text documents in a condensed way. The compact representation of documents can be helpful in several applications, such as automatic indexing, automatic summarization, automatic classification, clustering and filtering. For instance, text classification is a domain with high dimensional feature space challenge. Hence, extracting the most important/relevant words about the content of the document and using these keywords as the features can be extremely useful. In this regard, this study examines the predictive performance of five statistical keyword extraction methods (most frequent measure based keyword extraction, term frequency-inverse sentence frequency based keyword extraction, co-occurrence statistical information based keyword extraction, eccentricity-based keyword extraction and TextRank algorithm) on classification algorithms and ensemble methods for scientific text document classification (categorization). In the study, a comprehensive study of comparing base learning algorithms (Naive Bayes, support vector machines, logistic regression and Random Forest) with five widely utilized ensemble methods (AdaBoost, Bagging, Dagging, Random Subspace and Majority Voting) is conducted. To the best of our knowledge, this is the first empirical analysis, which evaluates the effectiveness of statistical keyword extraction methods in conjunction with ensemble learning algorithms. The classification schemes are compared in terms of classification accuracy, F-measure and area under curve values. To validate the empirical analysis, two-way ANOVA test is employed. The experimental analysis indicates that Bagging ensemble of Random Forest with the most-frequent based keyword extraction method yields promising results for text classification. For ACM document collection, the highest average predictive performance (93.80%) is obtained with the utilization of the most frequent based keyword extraction method with Bagging ensemble of Random Forest algorithm. In general, Bagging and Random Subspace ensembles of Random Forest yield promising results. The empirical analysis indicates that the utilization of keyword-based representation of text documents in conjunction with ensemble learning can enhance the predictive performance and scalability of text classification schemes, which is of practical importance in the application fields of text classification.

...read moreread less

Journal Article•DOI•

Classification of sentiment reviews using n-gram machine learning approach

[...]

Abinash Tripathy¹, Ankit Agrawal¹, Santanu Kumar Rath¹•Institutions (1)

National Institute of Technology, Rourkela¹

15 Sep 2016-Expert Systems With Applications

TL;DR: Four different machine learning algorithms such as Naive Bayes (NB), Maximum Entropy (ME), Stochastic Gradient Descent (SGD), and Support Vector Machine (SVM) have been considered for classification of human sentiments.

...read moreread less

Abstract: A large number of sentiment reviews, blogs and comments present online.These reviews must be classified to obtain a meaningful information.Four different supervised machine learning algorithm used for classification.Unigram, Bigram, Trigram models and their combinations used for classification.The classification is done on IMDb movie review dataset. With the ever increasing social networking and online marketing sites, the reviews and blogs obtained from those, act as an important source for further analysis and improved decision making. These reviews are mostly unstructured by nature and thus, need processing like classification or clustering to provide a meaningful information for future uses. These reviews and blogs may be classified into different polarity groups such as positive, negative, and neutral in order to extract information from the input dataset. Supervised machine learning methods help to classify these reviews. In this paper, four different machine learning algorithms such as Naive Bayes (NB), Maximum Entropy (ME), Stochastic Gradient Descent (SGD), and Support Vector Machine (SVM) have been considered for classification of human sentiments. The accuracy of different methods are critically examined in order to access their performance on the basis of parameters such as precision, recall, f-measure, and accuracy.

...read moreread less

Journal Article•DOI•

Digital mammographic tumor classification using transfer learning from deep convolutional neural networks

[...]

Benjamin Q. Huynh¹, Hui Li¹, Maryellen L. Giger¹•Institutions (1)

University of Chicago¹

01 Jul 2016-Journal of medical imaging

TL;DR: It is concluded that transfer learning can improve current CADx methods while also providing standalone classifiers without large datasets, facilitating machine-learning methods in radiomics and precision medicine.

...read moreread less

Abstract: Convolutional neural networks (CNNs) show potential for computer-aided diagnosis (CADx) by learning features directly from the image data instead of using analytically extracted features. However, CNNs are difficult to train from scratch for medical images due to small sample sizes and variations in tumor presentations. Instead, transfer learning can be used to extract tumor information from medical images via CNNs originally pretrained for nonmedical tasks, alleviating the need for large datasets. Our database includes 219 breast lesions (607 full-field digital mammographic images). We compared support vector machine classifiers based on the CNN-extracted image features and our prior computer-extracted tumor features in the task of distinguishing between benign and malignant breast lesions. Five-fold cross validation (by lesion) was conducted with the area under the receiver operating characteristic (ROC) curve as the performance metric. Results show that classifiers based on CNN-extracted features (with transfer learning) perform comparably to those using analytically extracted features [area under the ROC curve [Formula: see text]]. Further, the performance of ensemble classifiers based on both types was significantly better than that of either classifier type alone ([Formula: see text] versus 0.81, [Formula: see text]). We conclude that transfer learning can improve current CADx methods while also providing standalone classifiers without large datasets, facilitating machine-learning methods in radiomics and precision medicine.

...read moreread less

Journal Article•DOI•

A meta-analysis of remote sensing research on supervised pixel-based land-cover image classification processes: General guidelines for practitioners and future research

[...]

Reza Khatami¹, Giorgos Mountrakis¹, Stephen V. Stehman¹•Institutions (1)

State University of New York College of Environmental Science and Forestry¹

01 May 2016-Remote Sensing of Environment

TL;DR: A statistical meta-analysis of the past 15 years of research on supervised per-pixel image classification revealed that inclusion of texture information yielded the greatest improvement in overall accuracy of land-cover classification with an average increase of 12.1%.

...read moreread less

Journal Article•DOI•

Building an Intrusion Detection System Using a Filter-Based Feature Selection Algorithm

[...]

Mohammed A. Ambusaidi¹, Xiangjian He¹, Priyadarsi Nanda¹, Zhiyuan Tan²•Institutions (2)

University of Technology, Sydney¹, University of Twente²

01 Oct 2016-IEEE Transactions on Computers

TL;DR: The evaluation results show that the feature selection algorithm contributes more critical features for LSSVM-IDS to achieve better accuracy and lower computational cost compared with the state-of-the-art methods.

...read moreread less

Abstract: Redundant and irrelevant features in data have caused a long-term problem in network traffic classification. These features not only slow down the process of classification but also prevent a classifier from making accurate decisions, especially when coping with big data. In this paper, we propose a mutual information based algorithm that analytically selects the optimal feature for classification. This mutual information based feature selection algorithm can handle linearly and nonlinearly dependent data features. Its effectiveness is evaluated in the cases of network intrusion detection. An Intrusion Detection System (IDS), named Least Square Support Vector Machine based IDS (LSSVM-IDS), is built using the features selected by our proposed feature selection algorithm. The performance of LSSVM-IDS is evaluated using three intrusion detection evaluation datasets, namely KDD Cup 99, NSL-KDD and Kyoto 2006+ dataset. The evaluation results show that our feature selection algorithm contributes more critical features for LSSVM-IDS to achieve better accuracy and lower computational cost compared with the state-of-the-art methods.

...read moreread less

Journal Article•DOI•

Deep Learning in Label-free Cell Classification

[...]

Claire Lifan Chen¹, Ata Mahjoubfar², Ata Mahjoubfar¹, Li Chia Tai², Ian K. Blaby¹, Allen Huang¹, Kayvan Niazi², Kayvan Niazi¹, Bahram Jalali - Show less +5 more•Institutions (2)

University of California, Los Angeles¹, California NanoSystems Institute²

15 Mar 2016-Scientific Reports

TL;DR: This work integrates feature extraction and deep learning with high-throughput quantitative imaging enabled by photonic time stretch, achieving record high accuracy in label-free cell classification.

...read moreread less

Abstract: Label-free cell analysis is essential to personalized genomics, cancer diagnostics, and drug development as it avoids adverse effects of staining reagents on cellular viability and cell signaling. However, currently available label-free cell assays mostly rely only on a single feature and lack sufficient differentiation. Also, the sample size analyzed by these assays is limited due to their low throughput. Here, we integrate feature extraction and deep learning with high-throughput quantitative imaging enabled by photonic time stretch, achieving record high accuracy in label-free cell classification. Our system captures quantitative optical phase and intensity images and extracts multiple biophysical features of individual cells. These biophysical measurements form a hyperdimensional feature space in which supervised learning is performed for cell classification. We compare various learning algorithms including artificial neural network, support vector machine, logistic regression, and a novel deep learning pipeline, which adopts global optimization of receiver operating characteristics. As a validation of the enhanced sensitivity and specificity of our system, we show classification of white blood T-cells against colon cancer cells, as well as lipid accumulating algal strains for biofuel production. This system opens up a new path to data-driven phenotypic diagnosis and better understanding of the heterogeneous gene expressions in cells.

...read moreread less

Book Chapter•DOI•

Deep Learning for Classification of Malware System Call Sequences

[...]

Bojan Kolosnjaji¹, Apostolis Zarras¹, George D. Webster¹, Claudia Eckert¹•Institutions (1)

Technische Universität München¹

05 Dec 2016

TL;DR: The increase in number and variety of malware samples amplifies the need for improvement in automatic detection and classification of the malware variants, and neural network methodology has been grown to the state that can surpass limitations of previous machine learning methods.

...read moreread less

Abstract: The increase in number and variety of malware samples amplifies the need for improvement in automatic detection and classification of the malware variants. Machine learning is a natural choice to cope with this increase, because it addresses the need of discovering underlying patterns in large-scale datasets. Nowadays, neural network methodology has been grown to the state that can surpass limitations of previous machine learning methods, such as Hidden Markov Models and Support Vector Machines. As a consequence, neural networks can now offer superior classification accuracy in many domains, such as computer vision or natural language processing. This improvement comes from the possibility of constructing neural networks with a higher number of potentially diverse layers and is known as Deep Learning.

...read moreread less

Journal Article•DOI•

Assessing the robustness of Random Forests to map land cover with high resolution satellite image time series over large areas

[...]

Charlotte Pelletier, Silvia Valero, Jordi Inglada, Nicolas Champion¹, Gérard Dedieu - Show less +1 more•Institutions (1)

University of Paris¹

15 Dec 2016-Remote Sensing of Environment

TL;DR: This work aims at demonstrating the ability of state-of-the-art classifiers, such as Random Forests (RF) or Support Vector Machines (SVM), to classify HR-SITS, and selecting the best feature set used as input data in order to establish the classifier accuracy over large areas.

...read moreread less

Proceedings Article•DOI•

A deep learning approach to detection of splicing and copy-move forgeries in images

[...]

Yuan Rao¹, Jiangqun Ni¹•Institutions (1)

Sun Yat-sen University¹

01 Dec 2016

TL;DR: A new image forgery detection method based on deep learning technique, which utilizes a convolutional neural network to automatically learn hierarchical representations from the input RGB color images to outperforms some state-of-the-art methods.

...read moreread less

Abstract: In this paper, we present a new image forgery detection method based on deep learning technique, which utilizes a convolutional neural network (CNN) to automatically learn hierarchical representations from the input RGB color images. The proposed CNN is specifically designed for image splicing and copy-move detection applications. Rather than a random strategy, the weights at the first layer of our network are initialized with the basic high-pass filter set used in calculation of residual maps in spatial rich model (SRM), which serves as a regularizer to efficiently suppress the effect of image contents and capture the subtle artifacts introduced by the tampering operations. The pre-trained CNN is used as patch descriptor to extract dense features from the test images, and a feature fusion technique is then explored to obtain the final discriminative features for SVM classification. The experimental results on several public datasets show that the proposed CNN based model outperforms some state-of-the-art methods.

...read moreread less

Journal Article•DOI•

Blind Image Quality Assessment Based on High Order Statistics Aggregation

[...]

Jingtao Xu¹, Peng Ye², Qiaohong Li³, Haiqing Du¹, Yong Liu¹, David Doermann⁴ - Show less +2 more•Institutions (4)

Beijing University of Posts and Telecommunications¹, Airbnb², Nanyang Technological University³, University of Maryland, College Park⁴

28 Jun 2016-IEEE Transactions on Image Processing

TL;DR: A novel general purpose BIQA method based on high order statistics aggregation (HOSA), requiring only a small codebook, which has been extensively evaluated on ten image databases with both simulated and realistic image distortions, and shows highly competitive performance to the state-of-the-art BIZA methods.

...read moreread less

Abstract: Blind image quality assessment (BIQA) research aims to develop a perceptual model to evaluate the quality of distorted images automatically and accurately without access to the non-distorted reference images. The state-of-the-art general purpose BIQA methods can be classified into two categories according to the types of features used. The first includes handcrafted features which rely on the statistical regularities of natural images. These, however, are not suitable for images containing text and artificial graphics. The second includes learning-based features which invariably require large codebook or supervised codebook updating procedures to obtain satisfactory performance. These are time-consuming and not applicable in practice. In this paper, we propose a novel general purpose BIQA method based on high order statistics aggregation (HOSA), requiring only a small codebook. HOSA consists of three steps. First, local normalized image patches are extracted as local features through a regular grid, and a codebook containing 100 codewords is constructed by K-means clustering. In addition to the mean of each cluster, the diagonal covariance and coskewness (i.e., dimension-wise variance and skewness) of clusters are also calculated. Second, each local feature is softly assigned to several nearest clusters and the differences of high order statistics (mean, variance and skewness) between local features and corresponding clusters are softly aggregated to build the global quality aware image representation. Finally, support vector regression is adopted to learn the mapping between perceptual features and subjective opinion scores. The proposed method has been extensively evaluated on ten image databases with both simulated and realistic image distortions, and shows highly competitive performance to the state-of-the-art BIQA methods.

...read moreread less

Journal Article•DOI•

Improving BCI-based emotion recognition by combining EEG feature selection and kernel classifiers

[...]

John Atkinson¹, Daniel Campos¹•Institutions (1)

University of Concepción¹

01 Apr 2016-Expert Systems With Applications

TL;DR: A novel feature-based emotion recognition model is proposed for EEG-based Brain-Computer Interfaces which combines statistical-based feature selection methods and SVM emotion classifiers and incorporates additional features which are relevant for signal pre-processing and recognition classification tasks.

...read moreread less

Abstract: A feature-based emotion recognition model is proposed for EEG-based BCI.The approach combines statistical-based feature selection methods and SVM emotion classifiers.The model is based on Valence/Arousal dimensions for emotion classification.Our combined approach outperformed other recognition methods. Current emotion recognition computational techniques have been successful on associating the emotional changes with the EEG signals, and so they can be identified and classified from EEG signals if appropriate stimuli are applied. However, automatic recognition is usually restricted to a small number of emotions classes mainly due to signal's features and noise, EEG constraints and subject-dependent issues. In order to address these issues, in this paper a novel feature-based emotion recognition model is proposed for EEG-based Brain-Computer Interfaces. Unlike other approaches, our method explores a wider set of emotion types and incorporates additional features which are relevant for signal pre-processing and recognition classification tasks, based on a dimensional model of emotions: Valence and Arousal. It aims to improve the accuracy of the emotion classification task by combining mutual information based feature selection methods and kernel classifiers. Experiments using our approach for emotion classification which combines efficient feature selection methods and efficient kernel-based classifiers on standard EEG datasets show the promise of the approach when compared with state-of-the-art computational methods.

...read moreread less

Journal Article•DOI•

A comparative study of different machine learning methods for landslide susceptibility assessment

[...]

Binh Thai Pham, Biswajeet Pradhan¹, Dieu Tien Bui², Indra Prakash³, M. B. Dholakia⁴ - Show less +1 more•Institutions (4)

Universiti Putra Malaysia¹, Telemark University College², Government of Gujarat³, Gujarat Technological University⁴

01 Oct 2016-Environmental Modelling and Software

TL;DR: Analysis and comparison of the results show that all five landslide models performed well for landslide susceptibility assessment, but it has been observed that the SVM model has the best performance in comparison to other landslide models.

...read moreread less

Abstract: Landslide susceptibility assessment of Uttarakhand area of India has been done by applying five machine learning methods namely Support Vector Machines (SVM), Logistic Regression (LR), Fisher's Linear Discriminant Analysis (FLDA), Bayesian Network (BN), and Naive Bayes (NB). Performance of these methods has been evaluated using the ROC curve and statistical index based methods. Analysis and comparison of the results show that all five landslide models performed well for landslide susceptibility assessment (AUCź=ź0.910-0.950). However, it has been observed that the SVM model (AUCź=ź0.950) has the best performance in comparison to other landslide models, followed by the LR model (AUCź=ź0.922), the FLDA model (AUCź=ź0.921), the BN model (AUCź=ź0.915), and the NB model (AUCź=ź0.910), respectively. Machine learning methods namely SVM, LR, FLDA, BN, and NB have been evaluated and compared for landslide susceptibility assessment.Results indicate that all these five models can be applied efficiently for landslide assessment and prediction.Analysis of comparative results reaffirmed that the SVM model is one of the best methods.

...read moreread less

Journal Article•DOI•

Decision Tree and SVM-Based Data Analytics for Theft Detection in Smart Grid

[...]

Anish Jindal¹, Amit Dua¹, Kuljeet Kaur¹, Mukesh Singh¹, Neeraj Kumar¹, Sukumar Mishra² - Show less +2 more•Institutions (2)

Thapar University¹, Indian Institute of Technology Delhi²

16 Mar 2016-IEEE Transactions on Industrial Informatics

TL;DR: This paper proposes a comprehensive top-down scheme capable enough to precisely detect and locate real-time electricity theft at every level in power transmission and distribution (T&D).

...read moreread less

Abstract: Nontechnical losses, particularly due to electrical theft, have been a major concern in power system industries for a long time. Large-scale consumption of electricity in a fraudulent manner may imbalance the demand–supply gap to a great extent. Thus, there arises the need to develop a scheme that can detect these thefts precisely in the complex power networks. So, keeping focus on these points, this paper proposes a comprehensive top-down scheme based on decision tree (DT) and support vector machine (SVM). Unlike existing schemes, the proposed scheme is capable enough to precisely detect and locate real-time electricity theft at every level in power transmission and distribution (T&D). The proposed scheme is based on the combination of DT and SVM classifiers for rigorous analysis of gathered electricity consumption data. In other words, the proposed scheme can be viewed as a two-level data processing and analysis approach, since the data processed by DT are fed as an input to the SVM classifier. Furthermore, the obtained results indicate that the proposed scheme reduces false positives to a great extent and is practical enough to be implemented in real-time scenarios.

...read moreread less

Journal Article•DOI•

A novel SVM-kNN-PSO ensemble method for intrusion detection system

[...]

Abdulla Amin Aburomman¹, Mamun Bin Ibne Reaz¹•Institutions (1)

National University of Malaysia¹

01 Jan 2016

TL;DR: A novel ensemble construction method that uses PSO generated weights to create ensemble of classifiers with better accuracy for intrusion detection and results suggest that the new approach can generate ensembles that outperform WMA in terms of classification accuracy.

...read moreread less

Abstract: Graphical abstractThe objective of this paper is to develop ensemble based classifiers that will improve the accuracy of Intrusion Detection. For this purpose, we trained and tested 12 experts and then combined them into an ensemble. We used the PSO algorithm to weight the opinion of each expert. Because the quality of the behavioral parameters inserted by the user into PSO strongly affects its effectiveness, we have used the LUS method as a meta-optimizer for finding high-quality parameters. We then used the improved PSO to create new weights for each expert. For comparison, we also developed an ensemble classifier with weights generated using WMA 12. Fig. 1 depicts the entire process. For simplicity, the system framework was divided into the following seven stages:1.Kdd99 data pre-processing.2.Data classification with six different SVM experts.3.Data classification with six different k-NN experts.4.Data classification with ensemble classifier based on PSO.5.Data classification with ensemble classifier based on LUS improvement of PSO.6.Data classification with ensemble classifier based on WMA.7.Comparison of results for each approach.Display Omitted HighlightsIDS implemented using ensemble of a six SVM and a six k-NN classifier.Ensembles are created with weight generated by PSO and meta-PSO algorithms.These two ensembles outperform third ensemble system that is created with WMA. In machine learning, a combination of classifiers, known as an ensemble classifier, often outperforms individual ones. While many ensemble approaches exist, it remains, however, a difficult task to find a suitable ensemble configuration for a particular dataset. This paper proposes a novel ensemble construction method that uses PSO generated weights to create ensemble of classifiers with better accuracy for intrusion detection. Local unimodal sampling (LUS) method is used as a meta-optimizer to find better behavioral parameters for PSO. For our empirical study, we took five random subsets from the well-known KDD99 dataset. Ensemble classifiers are created using the new approaches as well as the weighted majority algorithm (WMA) approach. Our experimental results suggest that the new approach can generate ensembles that outperform WMA in terms of classification accuracy.

...read moreread less

Journal Article•DOI•

An overview and comparison of machine-learning techniques for classification purposes in digital soil mapping

[...]

Brandon Heung¹, Hung Chak Ho¹, Jin Zhang¹, Anders Knudby², Chuck E. Bulmer, Margaret G. Schmidt¹ - Show less +2 more•Institutions (2)

Simon Fraser University¹, University of Ottawa²

01 Mar 2016-Geoderma

TL;DR: This study provides a comprehensive comparison of machine-learning techniques for classification purposes in soil science and may assist in model selection for digital soil mapping and geomorphic modeling studies in the future.

...read moreread less

Journal Article•DOI•

Comparing and experimenting machine learning techniques for code smell detection

[...]

Francesca Arcelli Fontana¹, Mika V. Mäntylä², Marco Zanoni¹, Alessandro Marino¹•Institutions (2)

University of Milano-Bicocca¹, University of Oulu²

01 Jun 2016-Empirical Software Engineering

TL;DR: The largest experiment of applying machine learning algorithms to code smells to the best of the authors' knowledge concludes that the application of machine learning to the detection of these code smells can provide high accuracy (>96 %), and only a hundred training examples are needed to reach at least 95 % accuracy.

...read moreread less

Abstract: Several code smell detection tools have been developed providing different results, because smells can be subjectively interpreted, and hence detected, in different ways. In this paper, we perform the largest experiment of applying machine learning algorithms to code smells to the best of our knowledge. We experiment 16 different machine-learning algorithms on four code smells (Data Class, Large Class, Feature Envy, Long Method) and 74 software systems, with 1986 manually validated code smell samples. We found that all algorithms achieved high performances in the cross-validation data set, yet the highest performances were obtained by J48 and Random Forest, while the worst performance were achieved by support vector machines. However, the lower prevalence of code smells, i.e., imbalanced data, in the entire data set caused varying performances that need to be addressed in the future studies. We conclude that the application of machine learning to the detection of these code smells can provide high accuracy (>96 %), and only a hundred training examples are needed to reach at least 95 % accuracy.

...read moreread less

Collapse