City Block Distance for Identification of Co-expressed MicroRNAs

doi:10.1007/978-3-319-03756-1_35

Home
/
Papers
/
City Block Distance for Identification of Co-expressed MicroRNAs

Book Chapter•DOI•

City Block Distance for Identification of Co-expressed MicroRNAs

Sushmita Paul¹, Pradipta Maji¹•Institutions (1)

Indian Statistical Institute¹

19 Dec 2013-pp 387-396

TL;DR: The proposed method judiciously integrates the merits of robust rough-fuzzy c-means algorithm and normalized range-normalized city block distance to discover co-expressed miRNA clusters and helps to handle minute differences between two miRNA expression profiles.

read less

Abstract: The microRNAs or miRNAs are short, endogenous RNAs having ability to regulate gene expression at the post-transcriptional level. Various studies have revealed that a large proportion of miRNAs are co-expressed. Expression profiling of miRNAs generates a huge volume of data. Complicated networks of miRNA-mRNA interaction increase the challenges of comprehending and interpreting the resulting mass of data. In this regard, this paper presents the application of city block distance in order to extract meaningful information from miRNA expression data. The proposed method judiciously integrates the merits of robust rough-fuzzy c-means algorithm and normalized range-normalized city block distance to discover co-expressed miRNA clusters. The city block distance is used to calculate the membership functions of fuzzy sets, and thereby helps to handle minute differences between two miRNA expression profiles. The effectiveness of the proposed approach, along with a comparison with other related methods, is demonstrated on several miRNA expression data sets using different cluster validity indices and gene ontology.

...read moreread less

Citations

PDF

Open Access

More filters

Proceedings Article•DOI•

Identifying Co-expressed miRNAs using Multiobjective Optimization

[...]

Sudipta Acharya¹, Sriparna Saha¹•Institutions (1)

Indian Institute of Technology Patna¹

22 Dec 2014

TL;DR: The proposed method integrates the ability of point symmetry based distance and existing Multi-objective optimization based clustering technique-AMOSA to identify co-regulated or co-expressed miRNA clusters to help extraction of relevant information from expression data of miRNA.

...read moreread less

Abstract: The micro RNAs or miRNAs are short non-coding RNAs, which are capable in regulating gene expression in post-transcriptional level. A huge volume of data is generated by expression profiling of miRNAs. From various studies it has been proved that a large proportion of miRNAs tend to form clusters on chromosome. So, in this article we are proposing a multi-objective optimization based clustering algorithm for extraction of relevant information from expression data of miRNA. The proposed method integrates the ability of point symmetry based distance and existing Multi-objective optimization based clustering technique-AMOSA to identify co-regulated or co-expressed miRNA clusters. The superiority of our proposed approach by comparing it with other state-of-the-art clustering methods, is demonstrated on two publicly available miRNA expression data sets using Davies-Bouldin index - an external cluster validity index.

...read moreread less

4 citations

Cites background or methods or result from "City Block Distance for Identificat..."

...In this section, we have reported the results of [16] for six upper mentioned algorithms with their best DB index values (either for Euclidean distance or NRNCBD) for two miRNA microarray data sets, GSE16473 and GSE29495 and compared their results with our proposed clustering algorithm’s outcome....
[...]
...In [16] authors have incorporated range-normalized city block distance(NRNCBD) instead of Euclidean distance in robust rough Fuzzy c-means(rRFCM) [30] clustering algorithm....
[...]
...In [16] authors have shown the superioty of NRNCBD over Euclidean and Pearson distance version of different clustering algorithms like fuzzy c-means (FCM)[26], hard c-means (HCM)[7], rough-fuzzy c-means (RFCM)[27] and Robust rough-fuzzy c-means(rRFCM) [17]....
[...]
...Recently in [16] a clustering algorithm, combining the concepts of robust rough-fuzzy cmeans algorithm [17] and Normalized range-normalized cityblock distance(NRNCBD) is proposed to discover co-regulated miRNAs from datasets of miRNA expression data....
[...]
...As we can see in the table that for different clustering algorithms its best performance with respect DB index from [16] are reported....
[...]

Journal Article•DOI•

Cancer tissue sample classification using point symmetry-based clustering algorithm

[...]

Sudipta Acharya¹, Sriparna Saha¹•Institutions (1)

Indian Institute of Technology Patna¹

12 Mar 2018

TL;DR: This paper has proposed a point symmetry-based clustering algorithm which has been used to identify clusters of tissue samples from some real life cancer datasets and is also multi-objective-optimisation (MOO) based, i.e., optimises more than one objectives simultaneously.

...read moreread less

Abstract: Clustering or unsupervised classification techniques can be used to solve different types of classification problems of different domains. Symmetry is an important property for any real life object. Therefore, symmetry-based distance measurements play some important roles in identifying some patterns or clusters of real life datasets. In this paper, inspired by the symmetric property, we have proposed a point symmetry-based clustering algorithm which has been used to identify clusters of tissue samples from some real life cancer datasets. Our proposed algorithm is also multi-objective-optimisation (MOO) based, i.e., optimises more than one objectives simultaneously. We have also shown the superiority of our proposed algorithm with respect to some state-of-the-art clustering algorithms.

...read moreread less

3 citations

Cites background or methods from "City Block Distance for Identificat..."

..., 2010; Paul and Maji, 2013) for measuring similarity between two given data points, Euclidean distance or city block distance (Paul and Maji, 2013) is in general used....
[...]
...In most of the existing literatures (Acharya et al., 2014; Mukhopadhyay et al., 2010; Paul and Maji, 2013) for measuring similarity between two given data points, Euclidean distance or city block distance (Paul and Maji, 2013) is in general used....
[...]
...For example, microRNA datasets used in Paul and Maji (2013) or real life gene expression datasets used in Saha et al. (2013) are some unlabeled datasets....
[...]

Journal Article•DOI•

Detection and Classification of Brain Tumor Using Machine Learning Algorithms

[...]

Fatma M. Refaat, Mona M. Gouda, Mohamed Omar Ahmed Omar

20 Dec 2022-Biomedical and Pharmacology Journal

TL;DR: In this article , three kinds of brain tumors (a meningioma, a glioma and a pituitary tumor) were detected using machine learning algorithms, such as Support Vector Machines (SVM), K-Nearest Neighbor (KNN), and GRNN.

...read moreread less

Abstract: The brain is the organ that controls the activities of all parts of the body. The tumor is familiar as an irregular outgrowth of tissue. Brain tumors are an abnormal lump of tissue in which cells grow up and redouble uncontrollably. It is categorized into different types based on their nature, origin, growth rate, and stage of progress. Detection of the tumor by traditional methods is time-consuming and does not widen to diagnose a large amount of data and is less accurate. So, the automatic diagnosis of the tumors in the brain by magnetic resonance imaging (MRI) plays a very important role in computer-aided diagnosis. This paper concentrates on the diagnosis of three kinds of brain tumors (a meningioma, a glioma, and a pituitary tumor). Machine learning algorithms: KNN, SVM, and GRNN are suggested to increase accuracy and reduce diagnostic time by using a publicly available dataset, features that are extracted of images, data pre-processing methods, and the principal component analysis (PCA). This paper aims to minimize the training time of the suggested algorithms. The dimensionality reducing technique is applied to the dataset and diagnosis using machine learning algorithms, such as Support Vector Machines (SVM), K-Nearest Neighbor (KNN), and Generalized Regression Neural Networks (GRNN). The accuracies of the algorithms used in diagnosing tumors are 97%, 96.24%, and 94.7% for KNN, SVM, and GRNN, respectively. The KNN is therefore regarded as the algorithm of choice.

...read moreread less

References

PDF

Open Access

More filters

Book•

Fuzzy sets

[...]

Lotfi A. Zadeh

01 Aug 1996

TL;DR: A separation theorem for convex fuzzy sets is proved without requiring that the fuzzy sets be disjoint.

...read moreread less

Abstract: A fuzzy set is a class of objects with a continuum of grades of membership. Such a set is characterized by a membership (characteristic) function which assigns to each object a grade of membership ranging between zero and one. The notions of inclusion, union, intersection, complement, relation, convexity, etc., are extended to such sets, and various properties of these notions in the context of fuzzy sets are established. In particular, a separation theorem for convex fuzzy sets is proved without requiring that the fuzzy sets be disjoint.

...read moreread less

52,705 citations

Book•

Pattern Recognition with Fuzzy Objective Function Algorithms

[...]

James C. Bezdek

31 Jul 1981

TL;DR: Books, as a source that may involve the facts, opinion, literature, religion, and many others are the great friends to join with, becomes what you need to get.

...read moreread less

Abstract: New updated! The latest book from a very famous author finally comes out. Book of pattern recognition with fuzzy objective function algorithms, as an amazing reference becomes what you need to get. What's for is this book? Are you still thinking for what the book is? Well, this is what you probably will get. You should have made proper choices for your better life. Book, as a source that may involve the facts, opinion, literature, religion, and many others are the great friends to join with.

...read moreread less

15,662 citations

Journal Article•DOI•

Silhouettes: a graphical aid to the interpretation and validation of cluster analysis

[...]

Peter J. Rousseeuw¹•Institutions (1)

University of Fribourg¹

01 Nov 1987-Journal of Computational and Applied Mathematics

TL;DR: A new graphical display is proposed for partitioning techniques, where each cluster is represented by a so-called silhouette, which is based on the comparison of its tightness and separation, and provides an evaluation of clustering validity.

...read moreread less

14,144 citations

Journal Article•DOI•

MicroRNA expression profiles classify human cancers

[...]

Jun Lu¹, Gad Getz¹, Eric A. Miska², Eric A. Miska³, Ezequiel Alvarez-Saavedra³, Justin Lamb¹, David Peck¹, Alejandro Sweet-Cordero⁴, Alejandro Sweet-Cordero³, Benjamin L. Ebert⁴, Benjamin L. Ebert¹, Raymond H. Mak¹, Raymond H. Mak⁴, Adolfo A. Ferrando⁴, James R. Downing⁵, Tyler Jacks³, H. Robert Horvitz⁶, H. Robert Horvitz³, Todd R. Golub⁶, Todd R. Golub¹, Todd R. Golub⁴ - Show less +17 more•Institutions (6)

Broad Institute¹, Wellcome Trust/Cancer Research UK Gurdon Institute², Massachusetts Institute of Technology³, Harvard University⁴, St. Jude Children's Research Hospital⁵, Howard Hughes Medical Institute⁶

09 Jun 2005-Nature

TL;DR: A new, bead-based flow cytometric miRNA expression profiling method is used to present a systematic expression analysis of 217 mammalian miRNAs from 334 samples, including multiple human cancers, and finds the miRNA profiles are surprisingly informative, reflecting the developmental lineage and differentiation state of the tumours.

...read moreread less

Abstract: Recent work has revealed the existence of a class of small non-coding RNA species, known as microRNAs (miRNAs), which have critical functions across various biological processes. Here we use a new, bead-based flow cytometric miRNA expression profiling method to present a systematic expression analysis of 217 mammalian miRNAs from 334 samples, including multiple human cancers. The miRNA profiles are surprisingly informative, reflecting the developmental lineage and differentiation state of the tumours. We observe a general downregulation of miRNAs in tumours compared with normal tissues. Furthermore, we were able to successfully classify poorly differentiated tumours using miRNA expression profiles, whereas messenger RNA profiles were highly inaccurate when applied to the same samples. These findings highlight the potential of miRNA profiling in cancer diagnosis.

...read moreread less

9,470 citations

Book•

Rough Sets: Theoretical Aspects of Reasoning about Data

[...]

Zdzisław Pawlak

31 Oct 1991

TL;DR: Theoretical Foundations.

...read moreread less

Abstract: I. Theoretical Foundations.- 1. Knowledge.- 1.1. Introduction.- 1.2. Knowledge and Classification.- 1.3. Knowledge Base.- 1.4. Equivalence, Generalization and Specialization of Knowledge.- Summary.- Exercises.- References.- 2. Imprecise Categories, Approximations and Rough Sets.- 2.1. Introduction.- 2.2. Rough Sets.- 2.3. Approximations of Set.- 2.4. Properties of Approximations.- 2.5. Approximations and Membership Relation.- 2.6. Numerical Characterization of Imprecision.- 2.7. Topological Characterization of Imprecision.- 2.8. Approximation of Classifications.- 2.9. Rough Equality of Sets.- 2.10. Rough Inclusion of Sets.- Summary.- Exercises.- References.- 3. Reduction of Knowledge.- 3.1. Introduction.- 3.2. Reduct and Core of Knowledge.- 3.3. Relative Reduct and Relative Core of Knowledge.- 3.4. Reduction of Categories.- 3.5. Relative Reduct and Core of Categories.- Summary.- Exercises.- References.- 4. Dependencies in Knowledge Base.- 4.1. Introduction.- 4.2. Dependency of Knowledge.- 4.3. Partial Dependency of Knowledge.- Summary.- Exercises.- References.- 5. Knowledge Representation.- 5.1. Introduction.- 5.2. Examples.- 5.3. Formal Definition.- 5.4. Significance of Attributes.- 5.5. Discernibility Matrix.- Summary.- Exercises.- References.- 6. Decision Tables.- 6.1. Introduction.- 6.2. Formal Definition and Some Properties.- 6.3. Simplification of Decision Tables.- Summary.- Exercises.- References.- 7. Reasoning about Knowledge.- 7.1. Introduction.- 7.2. Language of Decision Logic.- 7.3. Semantics of Decision Logic Language.- 7.4. Deduction in Decision Logic.- 7.5. Normal Forms.- 7.6. Decision Rules and Decision Algorithms.- 7.7. Truth and Indiscernibility.- 7.8. Dependency of Attributes.- 7.9. Reduction of Consistent Algorithms.- 7.10. Reduction of Inconsistent Algorithms.- 7.11. Reduction of Decision Rules.- 7.12. Minimization of Decision Algorithms.- Summary.- Exercises.- References.- II. Applications.- 8. Decision Making.- 8.1. Introduction.- 8.2. Optician's Decisions Table.- 8.3. Simplification of Decision Table.- 8.4. Decision Algorithm.- 8.5. The Case of Incomplete Information.- Summary.- Exercises.- References.- 9. Data Analysis.- 9.1. Introduction.- 9.2. Decision Table as Protocol of Observations.- 9.3. Derivation of Control Algorithms from Observation.- 9.4. Another Approach.- 9.5. The Case of Inconsistent Data.- Summary.- Exercises.- References.- 10. Dissimilarity Analysis.- 10.1. Introduction.- 10.2. The Middle East Situation.- 10.3. Beauty Contest.- 10.4. Pattern Recognition.- 10.5. Buying a Car.- Summary.- Exercises.- References.- 11. Switching Circuits.- 11.1. Introduction.- 11.2. Minimization of Partially Defined Switching Functions.- 11.3. Multiple-Output Switching Functions.- Summary.- Exercises.- References.- 12. Machine Learning.- 12.1. Introduction.- 12.2. Learning From Examples.- 12.3. The Case of an Imperfect Teacher.- 12.4. Inductive Learning.- Summary.- Exercises.- References.

...read moreread less

7,826 citations