scispace - formally typeset
Author

Sushmita Paul

Bio: Sushmita Paul is an academic researcher from Indian Institute of Technology, Jodhpur. The author has contributed to research in topic(s): Cluster analysis & Rough set. The author has an hindex of 12, co-authored 57 publication(s) receiving 575 citation(s). Previous affiliations of Sushmita Paul include University of Erlangen-Nuremberg & Indian Statistical Institute.

Papers published on a yearly basis

Papers
More filters
Journal ArticleDOI

[...]

TL;DR: A new feature selection algorithm is presented based on rough set theory that selects a set of genes from microarray data by maximizing the relevance and significance of the selected genes.
Abstract: Among the large amount of genes presented in microarray gene expression data, only a small fraction of them is effective for performing a certain diagnostic test. In this regard, a new feature selection algorithm is presented based on rough set theory. It selects a set of genes from microarray data by maximizing the relevance and significance of the selected genes. A theoretical analysis is presented to justify the use of both relevance and significance criteria for selecting a reduced gene set with high predictive accuracy. The importance of rough set theory for computing both relevance and significance of the genes is also established. The performance of the proposed algorithm, along with a comparison with other related methods, is studied using the predictive accuracy of K-nearest neighbor rule and support vector machine on five cancer and two arthritis microarray data sets. Among seven data sets, the proposed algorithm attains 100% predictive accuracy for three cancer and two arthritis data sets, while the rough set based two existing algorithms attain this accuracy only for one cancer data set.

120 citations

Journal ArticleDOI

[...]

TL;DR: An efficient method is proposed to select initial prototypes of different gene clusters, which enables the proposed c-means algorithm to converge to an optimum or near optimum solutions and helps to discover coexpressed gene clusters.
Abstract: Gene expression data clustering is one of the important tasks of functional genomics as it provides a powerful tool for studying functional relationships of genes in a biological process. Identifying coexpressed groups of genes represents the basic challenge in gene clustering problem. In this regard, a gene clustering algorithm, termed as robust rough-fuzzy $(c)$-means, is proposed judiciously integrating the merits of rough sets and fuzzy sets. While the concept of lower and upper approximations of rough sets deals with uncertainty, vagueness, and incompleteness in cluster definition, the integration of probabilistic and possibilistic memberships of fuzzy sets enables efficient handling of overlapping partitions in noisy environment. The concept of possibilistic lower bound and probabilistic boundary of a cluster, introduced in robust rough-fuzzy $(c)$-means, enables efficient selection of gene clusters. An efficient method is proposed to select initial prototypes of different gene clusters, which enables the proposed $(c)$-means algorithm to converge to an optimum or near optimum solutions and helps to discover coexpressed gene clusters. The effectiveness of the algorithm, along with a comparison with other algorithms, is demonstrated both qualitatively and quantitatively on 14 yeast microarray data sets.

89 citations

Journal ArticleDOI

[...]

TL;DR: In vivo and in vitro gene profiling, together with chromatin immunoprecipitation analysis of macrophages, revealed direct activation of the proinflammatory factor cyclooxygenase-2 and indirect inhibition of the anti-inflammatory factor arginase-1 by c-Jun.
Abstract: Activation of proinflammatory macrophages is associated with the inflammatory state of rheumatoid arthritis. Their polarization and activation are controlled by transcription factors such as NF-κB and the AP-1 transcription factor member c-Fos. Surprisingly, little is known about the role of the AP-1 transcription factor c-Jun in macrophage activation. In this study, we show that mRNA and protein levels of c-Jun are increased in macrophages following pro- or anti-inflammatory stimulations. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment cluster analyses of microarray data using wild-type and c-Jun-deleted macrophages highlight the central function of c-Jun in macrophages, in particular for immune responses, IL production, and hypoxia pathways. Mice deficient for c-Jun in macrophages show an amelioration of inflammation and bone destruction in the serum-induced arthritis model. In vivo and in vitro gene profiling, together with chromatin immunoprecipitation analysis of macrophages, revealed direct activation of the proinflammatory factor cyclooxygenase-2 and indirect inhibition of the anti-inflammatory factor arginase-1 by c-Jun. Thus, c-Jun regulates the activation state of macrophages and promotes arthritis via differentially regulating cyclooxygenase-2 and arginase-1 levels.

33 citations

Journal ArticleDOI

[...]

01 Nov 2010
TL;DR: A new feature selection algorithm is presented, based on rough set theory, to select a set of effective molecular descriptors from a given QSAR dataset by maximizing both relevance and significance of the descriptors.
Abstract: Quantitative structure activity relationship (QSAR) is one of the important disciplines of computer-aided drug design that deals with the predictive modeling of properties of a molecule. In general, each QSAR dataset is small in size with large number of features or descriptors. Among the large amount of descriptors presented in the QSAR dataset, only a small fraction of them is effective for performing the predictive modeling task. In this paper, a new feature selection algorithm is presented, based on rough set theory, to select a set of effective molecular descriptors from a given QSAR dataset. The proposed algorithm selects the set of molecular descriptors by maximizing both relevance and significance of the descriptors. An important finding is that the proposed feature selection algorithm is shown to be effective in selecting relevant and significant molecular descriptors from the QSAR dataset for predictive modeling. The performance of the proposed algorithm is studied using R2 statistic of support vector regression method. The effectiveness of the proposed algorithm, along with a comparison with existing algorithms, is demonstrated on three QSAR datasets.

27 citations

Journal ArticleDOI

[...]

TL;DR: A new gene selection algorithm is presented, termed as RelSim, to identify disease genes, that integrates judiciously the information of gene expression profiles and protein-protein interaction networks to compute the functional similarity among genes.
Abstract: One of the important problems in functional genomics is how to select the disease genes. In this regard, the paper presents a new gene selection algorithm, termed as RelSim, to identify disease genes. It integrates judiciously the information of gene expression profiles and protein-protein interaction networks. A new similarity measure is introduced to compute the functional similarity between two genes. It is based on the information of protein-protein interaction networks. The new similarity measure offers an efficient way to calculate the functional similarity between two genes. The proposed algorithm selects a set of genes as disease genes, considering both microarray and protein-protein interaction data, by maximizing the relevance and functional similarity of the selected genes. While gene expression profiles are used to identify differentially expressed genes, the protein-protein interaction networks help to compute the functional similarity among genes. The performance of the proposed algorithm, along with a comparison with other related methods, is demonstrated on several colon cancer data sets.

24 citations


Cited by
More filters

[...]

01 Jan 2002

8,563 citations

[...]

19 Nov 2012

1,252 citations

Book

[...]

16 Nov 1998

766 citations

[...]

01 Jan 2009
TL;DR: In this article, a review outlines the current understanding of miRNA target recognition in animals and discusses the widespread impact of miRNAs on both the expression and evolution of protein-coding genes.
Abstract: MicroRNAs (miRNAs) are endogenous ∼23 nt RNAs that play important gene-regulatory roles in animals and plants by pairing to the mRNAs of protein-coding genes to direct their posttranscriptional repression. This review outlines the current understanding of miRNA target recognition in animals and discusses the widespread impact of miRNAs on both the expression and evolution of protein-coding genes.

646 citations

Journal ArticleDOI

[...]

626 citations