scispace - formally typeset
Search or ask a question
Book ChapterDOI

City Block Distance for Identification of Co-expressed MicroRNAs

19 Dec 2013-pp 387-396
TL;DR: The proposed method judiciously integrates the merits of robust rough-fuzzy c-means algorithm and normalized range-normalized city block distance to discover co-expressed miRNA clusters and helps to handle minute differences between two miRNA expression profiles.
Abstract: The microRNAs or miRNAs are short, endogenous RNAs having ability to regulate gene expression at the post-transcriptional level. Various studies have revealed that a large proportion of miRNAs are co-expressed. Expression profiling of miRNAs generates a huge volume of data. Complicated networks of miRNA-mRNA interaction increase the challenges of comprehending and interpreting the resulting mass of data. In this regard, this paper presents the application of city block distance in order to extract meaningful information from miRNA expression data. The proposed method judiciously integrates the merits of robust rough-fuzzy c-means algorithm and normalized range-normalized city block distance to discover co-expressed miRNA clusters. The city block distance is used to calculate the membership functions of fuzzy sets, and thereby helps to handle minute differences between two miRNA expression profiles. The effectiveness of the proposed approach, along with a comparison with other related methods, is demonstrated on several miRNA expression data sets using different cluster validity indices and gene ontology.
Citations
More filters
Proceedings ArticleDOI
22 Dec 2014
TL;DR: The proposed method integrates the ability of point symmetry based distance and existing Multi-objective optimization based clustering technique-AMOSA to identify co-regulated or co-expressed miRNA clusters to help extraction of relevant information from expression data of miRNA.
Abstract: The micro RNAs or miRNAs are short non-coding RNAs, which are capable in regulating gene expression in post-transcriptional level. A huge volume of data is generated by expression profiling of miRNAs. From various studies it has been proved that a large proportion of miRNAs tend to form clusters on chromosome. So, in this article we are proposing a multi-objective optimization based clustering algorithm for extraction of relevant information from expression data of miRNA. The proposed method integrates the ability of point symmetry based distance and existing Multi-objective optimization based clustering technique-AMOSA to identify co-regulated or co-expressed miRNA clusters. The superiority of our proposed approach by comparing it with other state-of-the-art clustering methods, is demonstrated on two publicly available miRNA expression data sets using Davies-Bouldin index - an external cluster validity index.

4 citations


Cites background or methods or result from "City Block Distance for Identificat..."

  • ...In this section, we have reported the results of [16] for six upper mentioned algorithms with their best DB index values (either for Euclidean distance or NRNCBD) for two miRNA microarray data sets, GSE16473 and GSE29495 and compared their results with our proposed clustering algorithm’s outcome....

    [...]

  • ...In [16] authors have incorporated range-normalized city block distance(NRNCBD) instead of Euclidean distance in robust rough Fuzzy c-means(rRFCM) [30] clustering algorithm....

    [...]

  • ...In [16] authors have shown the superioty of NRNCBD over Euclidean and Pearson distance version of different clustering algorithms like fuzzy c-means (FCM)[26], hard c-means (HCM)[7], rough-fuzzy c-means (RFCM)[27] and Robust rough-fuzzy c-means(rRFCM) [17]....

    [...]

  • ...Recently in [16] a clustering algorithm, combining the concepts of robust rough-fuzzy cmeans algorithm [17] and Normalized range-normalized cityblock distance(NRNCBD) is proposed to discover co-regulated miRNAs from datasets of miRNA expression data....

    [...]

  • ...As we can see in the table that for different clustering algorithms its best performance with respect DB index from [16] are reported....

    [...]

Journal ArticleDOI
12 Mar 2018
TL;DR: This paper has proposed a point symmetry-based clustering algorithm which has been used to identify clusters of tissue samples from some real life cancer datasets and is also multi-objective-optimisation (MOO) based, i.e., optimises more than one objectives simultaneously.
Abstract: Clustering or unsupervised classification techniques can be used to solve different types of classification problems of different domains. Symmetry is an important property for any real life object. Therefore, symmetry-based distance measurements play some important roles in identifying some patterns or clusters of real life datasets. In this paper, inspired by the symmetric property, we have proposed a point symmetry-based clustering algorithm which has been used to identify clusters of tissue samples from some real life cancer datasets. Our proposed algorithm is also multi-objective-optimisation (MOO) based, i.e., optimises more than one objectives simultaneously. We have also shown the superiority of our proposed algorithm with respect to some state-of-the-art clustering algorithms.

3 citations


Cites background or methods from "City Block Distance for Identificat..."

  • ..., 2010; Paul and Maji, 2013) for measuring similarity between two given data points, Euclidean distance or city block distance (Paul and Maji, 2013) is in general used....

    [...]

  • ...In most of the existing literatures (Acharya et al., 2014; Mukhopadhyay et al., 2010; Paul and Maji, 2013) for measuring similarity between two given data points, Euclidean distance or city block distance (Paul and Maji, 2013) is in general used....

    [...]

  • ...For example, microRNA datasets used in Paul and Maji (2013) or real life gene expression datasets used in Saha et al. (2013) are some unlabeled datasets....

    [...]

Journal ArticleDOI
TL;DR: In this article , three kinds of brain tumors (a meningioma, a glioma and a pituitary tumor) were detected using machine learning algorithms, such as Support Vector Machines (SVM), K-Nearest Neighbor (KNN), and GRNN.
Abstract: The brain is the organ that controls the activities of all parts of the body. The tumor is familiar as an irregular outgrowth of tissue. Brain tumors are an abnormal lump of tissue in which cells grow up and redouble uncontrollably. It is categorized into different types based on their nature, origin, growth rate, and stage of progress. Detection of the tumor by traditional methods is time-consuming and does not widen to diagnose a large amount of data and is less accurate. So, the automatic diagnosis of the tumors in the brain by magnetic resonance imaging (MRI) plays a very important role in computer-aided diagnosis. This paper concentrates on the diagnosis of three kinds of brain tumors (a meningioma, a glioma, and a pituitary tumor). Machine learning algorithms: KNN, SVM, and GRNN are suggested to increase accuracy and reduce diagnostic time by using a publicly available dataset, features that are extracted of images, data pre-processing methods, and the principal component analysis (PCA). This paper aims to minimize the training time of the suggested algorithms. The dimensionality reducing technique is applied to the dataset and diagnosis using machine learning algorithms, such as Support Vector Machines (SVM), K-Nearest Neighbor (KNN), and Generalized Regression Neural Networks (GRNN). The accuracies of the algorithms used in diagnosing tumors are 97%, 96.24%, and 94.7% for KNN, SVM, and GRNN, respectively. The KNN is therefore regarded as the algorithm of choice.
References
More filters
Book
01 Aug 1996
TL;DR: A separation theorem for convex fuzzy sets is proved without requiring that the fuzzy sets be disjoint.
Abstract: A fuzzy set is a class of objects with a continuum of grades of membership. Such a set is characterized by a membership (characteristic) function which assigns to each object a grade of membership ranging between zero and one. The notions of inclusion, union, intersection, complement, relation, convexity, etc., are extended to such sets, and various properties of these notions in the context of fuzzy sets are established. In particular, a separation theorem for convex fuzzy sets is proved without requiring that the fuzzy sets be disjoint.

52,705 citations

Book
31 Jul 1981
TL;DR: Books, as a source that may involve the facts, opinion, literature, religion, and many others are the great friends to join with, becomes what you need to get.
Abstract: New updated! The latest book from a very famous author finally comes out. Book of pattern recognition with fuzzy objective function algorithms, as an amazing reference becomes what you need to get. What's for is this book? Are you still thinking for what the book is? Well, this is what you probably will get. You should have made proper choices for your better life. Book, as a source that may involve the facts, opinion, literature, religion, and many others are the great friends to join with.

15,662 citations

Journal ArticleDOI
TL;DR: A new graphical display is proposed for partitioning techniques, where each cluster is represented by a so-called silhouette, which is based on the comparison of its tightness and separation, and provides an evaluation of clustering validity.

14,144 citations

Journal ArticleDOI
09 Jun 2005-Nature
TL;DR: A new, bead-based flow cytometric miRNA expression profiling method is used to present a systematic expression analysis of 217 mammalian miRNAs from 334 samples, including multiple human cancers, and finds the miRNA profiles are surprisingly informative, reflecting the developmental lineage and differentiation state of the tumours.
Abstract: Recent work has revealed the existence of a class of small non-coding RNA species, known as microRNAs (miRNAs), which have critical functions across various biological processes. Here we use a new, bead-based flow cytometric miRNA expression profiling method to present a systematic expression analysis of 217 mammalian miRNAs from 334 samples, including multiple human cancers. The miRNA profiles are surprisingly informative, reflecting the developmental lineage and differentiation state of the tumours. We observe a general downregulation of miRNAs in tumours compared with normal tissues. Furthermore, we were able to successfully classify poorly differentiated tumours using miRNA expression profiles, whereas messenger RNA profiles were highly inaccurate when applied to the same samples. These findings highlight the potential of miRNA profiling in cancer diagnosis.

9,470 citations

Book
31 Oct 1991
TL;DR: Theoretical Foundations.
Abstract: I. Theoretical Foundations.- 1. Knowledge.- 1.1. Introduction.- 1.2. Knowledge and Classification.- 1.3. Knowledge Base.- 1.4. Equivalence, Generalization and Specialization of Knowledge.- Summary.- Exercises.- References.- 2. Imprecise Categories, Approximations and Rough Sets.- 2.1. Introduction.- 2.2. Rough Sets.- 2.3. Approximations of Set.- 2.4. Properties of Approximations.- 2.5. Approximations and Membership Relation.- 2.6. Numerical Characterization of Imprecision.- 2.7. Topological Characterization of Imprecision.- 2.8. Approximation of Classifications.- 2.9. Rough Equality of Sets.- 2.10. Rough Inclusion of Sets.- Summary.- Exercises.- References.- 3. Reduction of Knowledge.- 3.1. Introduction.- 3.2. Reduct and Core of Knowledge.- 3.3. Relative Reduct and Relative Core of Knowledge.- 3.4. Reduction of Categories.- 3.5. Relative Reduct and Core of Categories.- Summary.- Exercises.- References.- 4. Dependencies in Knowledge Base.- 4.1. Introduction.- 4.2. Dependency of Knowledge.- 4.3. Partial Dependency of Knowledge.- Summary.- Exercises.- References.- 5. Knowledge Representation.- 5.1. Introduction.- 5.2. Examples.- 5.3. Formal Definition.- 5.4. Significance of Attributes.- 5.5. Discernibility Matrix.- Summary.- Exercises.- References.- 6. Decision Tables.- 6.1. Introduction.- 6.2. Formal Definition and Some Properties.- 6.3. Simplification of Decision Tables.- Summary.- Exercises.- References.- 7. Reasoning about Knowledge.- 7.1. Introduction.- 7.2. Language of Decision Logic.- 7.3. Semantics of Decision Logic Language.- 7.4. Deduction in Decision Logic.- 7.5. Normal Forms.- 7.6. Decision Rules and Decision Algorithms.- 7.7. Truth and Indiscernibility.- 7.8. Dependency of Attributes.- 7.9. Reduction of Consistent Algorithms.- 7.10. Reduction of Inconsistent Algorithms.- 7.11. Reduction of Decision Rules.- 7.12. Minimization of Decision Algorithms.- Summary.- Exercises.- References.- II. Applications.- 8. Decision Making.- 8.1. Introduction.- 8.2. Optician's Decisions Table.- 8.3. Simplification of Decision Table.- 8.4. Decision Algorithm.- 8.5. The Case of Incomplete Information.- Summary.- Exercises.- References.- 9. Data Analysis.- 9.1. Introduction.- 9.2. Decision Table as Protocol of Observations.- 9.3. Derivation of Control Algorithms from Observation.- 9.4. Another Approach.- 9.5. The Case of Inconsistent Data.- Summary.- Exercises.- References.- 10. Dissimilarity Analysis.- 10.1. Introduction.- 10.2. The Middle East Situation.- 10.3. Beauty Contest.- 10.4. Pattern Recognition.- 10.5. Buying a Car.- Summary.- Exercises.- References.- 11. Switching Circuits.- 11.1. Introduction.- 11.2. Minimization of Partially Defined Switching Functions.- 11.3. Multiple-Output Switching Functions.- Summary.- Exercises.- References.- 12. Machine Learning.- 12.1. Introduction.- 12.2. Learning From Examples.- 12.3. The Case of an Imperfect Teacher.- 12.4. Inductive Learning.- Summary.- Exercises.- References.

7,826 citations