scispace - formally typeset
Open AccessJournal Article

Study of Informative Gene Selection for Tissue Classification Based on Tumor Gene Expression Profiles

Reads0
Chats0
TLDR
The authors analyze the Multi-Class tumor gene expression profile dataset, which contains 218 tumor samples spanning 14 common tumor types, as well as 90 normal tissue samples, to find a small subset of genes for distinguishing tumor from normal tissues.
Abstract
Informative gene selection is of great importance in the analysis of microarray expression data because of its huge dimensionality and relatively small samples, and also provides a systemic and promising way to reveal the gene expression patterns of tumors with large scale gene expression profiles. In this paper, the authors analyze the Multi-Class tumor gene expression profile dataset, which contains 218 tumor samples spanning 14 common tumor types, as well as 90 normal tissue samples, to find a small subset of genes for distinguishing tumor from normal tissues. First, a Relief-based feature selection algorithm is applied to create candidate feature subsets and the one with the best classification performance is selected as the informative gene subset for classification. Then, a sensitivity analysis method based on the classifier of support vector machine with RBF kernel is employed to eliminate the redundant genes. As a result, 52 informative genes are selected as markers for making distinctions between different tumor tissues and their normal counterparts, and their expressions are analyzed to explore the tumor gene expression patterns. At the end of this paper, several methods for informative gene selection are also analyzed and compared to validate the feasibility and effectiveness of the method employed in this work.

read more

Citations
More filters
Proceedings ArticleDOI

Fuzzy support vector machine based on non-equilibrium data

TL;DR: Experiments show that the new FSVM can effectively reduce the misclassification rates produced by the class with fewer samples in dealing with non-equilibrium data classification problem, and the proposed FSVM may make the mis classification rates upon two classes approximately equal.
Journal ArticleDOI

An improved relief feature selection algorithm based on Monte-Carlo tree search

TL;DR: An MCTS-based feature selection approach is proposed to deal with the feature selection problem of high dimensional data, where the Relief algorithm is used as the evaluation function of the MCTs approach.
Book ChapterDOI

Coordinating Discernibility and Independence Scores of Variables in a 2D Space for Efficient and Accurate Feature Selection

TL;DR: A novel definition for the discernibility and independence scores of a feature is presented, and a two dimensional space is constructed with the feature’s independence as y-axis and discernibility as x-axis to rank features’ importance.
Journal ArticleDOI

Colon cancer data analysis by chameleon algorithm.

TL;DR: The clustering analysis to colon cancer data and the comparisons to the other related studies demonstrate that the proposed algorithm is effective in detecting the differential genes of colon cancers.
Proceedings ArticleDOI

Research about feature genes selection for cancer type identification based on gene expression profiles

TL;DR: The SVM and improved Relief algorithm show excellent performance of selecting feature genes to identify and classify cancer types.
Related Papers (5)