scispace - formally typeset
Open AccessBook ChapterDOI

Granular support vector machine based method for prediction of solubility of proteins on overexpression in escherichia coli

TLDR
The results indicate that a difficult imbalanced classification problem can be successfully solved by employing a granular support vector Machines for prediction of soluble proteins on over expression in Escherichia coli.
Abstract
We employed a granular support vector Machines(GSVM) for prediction of soluble proteins on over expression in Escherichia coli. Granular computing splits the feature space into a set of subspaces (or information granules) such as classes, subsets, clusters and intervals [14]. By the principle of divide and conquer it decomposes a bigger complex problem into smaller and computationally simpler problems. Each of the granules is then solved independently and all the results are aggregated to form the final solution. For the purpose of granulation association rules was employed. The results indicate that a difficult imbalanced classification problem can be successfully solved by employing GSVM.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

A review of machine learning methods to predict the solubility of overexpressed recombinant proteins in Escherichia coli.

TL;DR: This paper presents an extensive review of the existing models to predict protein solubility in Escherichia coli recombinant protein overexpression system and concludes that some of the models present acceptable prediction performances and convenient user interfaces can be considered as valuable tools to predict recombinant Protein Solubility results before performing real laboratory experiments, thus saving labour, time and cost.
Journal ArticleDOI

Granular support vector machine: a review

TL;DR: Granular support vector machine (GSVM) is a novel machine learning model based on granular computing and statistical learning theory, and it can solve the low efficiency learning problem that exists in the traditional SVM and obtain satisfactory generalization performance, as well.
Proceedings ArticleDOI

Prediction of protein solubility in E. coli

TL;DR: This work presents a framework that creates models of solubility from sequence information from the primary protein sequences of the genes to be synthesized, and provides the biologist with a comprehensive comparison between different learning algorithms, and insightful feature analysis.
Journal ArticleDOI

Machine learning in computational biology to accelerate high-throughput protein expression.

TL;DR: This work identifies protein properties that hinder the HPA high‐throughput antibody production pipeline based on a subset of key properties (aromaticity, hydropathy and isoelectric point) and guides the selection of protein fragments based on these characteristics to optimize high-throughput experimentation.
Journal ArticleDOI

Granular Computing Concept based long-term prediction of Gas Tank Levels in Steel Industry

TL;DR: In this article, a regression model based on the Granular Computing (GrC) is proposed to provide a long-term prediction for the LDG tank levels, in which the data segments are entirely considered for the prediction horizon extension rather than the generic data point-oriented modeling.
References
More filters
Book

The Nature of Statistical Learning Theory

TL;DR: Setting of the learning problem consistency of learning processes bounds on the rate of convergence ofLearning processes controlling the generalization ability of learning process constructing learning algorithms what is important in learning theory?
Book

Data Mining: Practical Machine Learning Tools and Techniques

TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.
Journal ArticleDOI

A Tutorial on Support Vector Machines for Pattern Recognition

TL;DR: There are several arguments which support the observed high accuracy of SVMs, which are reviewed and numerous examples and proofs of most of the key theorems are given.
Proceedings ArticleDOI

Mining association rules between sets of items in large databases

TL;DR: An efficient algorithm is presented that generates all significant association rules between items in the database of customer transactions and incorporates buffer management and novel estimation and pruning techniques.
Proceedings Article

Fast algorithms for mining association rules

TL;DR: Two new algorithms for solving thii problem that are fundamentally different from the known algorithms are presented and empirical evaluation shows that these algorithms outperform theknown algorithms by factors ranging from three for small problems to more than an order of magnitude for large problems.
Related Papers (5)