Author

R. Jothi

Bio: R. Jothi is an academic researcher from Pandit Deendayal Petroleum University. The author has contributed to research in topics: Cluster analysis & Minimum spanning tree. The author has an h-index of 4 and has co-authored 11 publications receiving 98 citations. Previous affiliations of R. Jothi include Indian Institute of Information Technology, Design and Manufacturing, Jabalpur & VIT University.

Papers
Journal ArticleDOI
TL;DR: A deterministic initialization algorithm for K-means (DK-means) is proposed by exploring a set of probable centers through a constrained bi-partitioning approach; it achieves improved results in terms of faster, more stable convergence and better cluster quality compared to other algorithms.
Abstract: Clustering has been widely applied in interpreting the underlying patterns in microarray gene expression profiles, and many clustering algorithms have been devised for this purpose. K-means is one of the most popular algorithms for gene data clustering due to its simplicity and computational efficiency. However, the K-means algorithm is highly sensitive to the choice of initial cluster centers; it easily gets trapped in a local optimum if the initial centers are chosen randomly. This paper proposes a deterministic initialization algorithm for K-means (DK-means) that explores a set of probable centers through a constrained bi-partitioning approach. The proposed algorithm is compared with classical K-means with random initialization, with improved K-means variants such as the K-means++ and MinMax algorithms, and with three deterministic initialization methods. Experimental analysis on gene expression datasets demonstrates that DK-means achieves faster, more stable convergence and better cluster quality than the other algorithms.
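
The initialization sensitivity described above is easy to reproduce. Here is a minimal sketch, using scikit-learn's built-in initializers rather than the paper's DK-means (which is not publicly packaged), that contrasts a single random seeding with k-means++ seeding; a deterministic initializer would make repeated runs identical by construction.

```python
# Minimal sketch: how seeding affects K-means convergence and quality.
# Uses scikit-learn's built-in initializers, not the paper's DK-means.
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=500, centers=5, random_state=0)

for init in ("random", "k-means++"):
    # n_init=1 exposes the run-to-run variability of a single seeding;
    # production code would use a larger n_init or a deterministic start.
    km = KMeans(n_clusters=5, init=init, n_init=1).fit(X)
    print(init, "iterations:", km.n_iter_, "inertia:", round(km.inertia_, 2))
```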

45 citations

Journal ArticleDOI
TL;DR: This paper proposes an algorithm, MST-based clustering on a partition-based nearest neighbor graph, that reduces the computational overhead by using a centroid-based nearest neighbor rule, and proves that both the size of and the time to construct the graph (LNG) are O(n^(3/2)), an O(√n) factor improvement over the traditional algorithms.

35 citations

Journal ArticleDOI
TL;DR: A novel clustering algorithm using eigenanalysis on a Minimum Spanning Tree based neighborhood graph (E-MST), built from a similarity graph obtained from k′ rounds of MST (the k′-MST neighborhood graph), achieves improved clustering results.
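
One common way to build such a multi-round MST neighborhood graph is to compute an MST, remove its edges, and repeat, taking the union of the rounds. The sketch below follows that construction; the paper's exact E-MST procedure and its eigenanalysis step may differ.

```python
# Hedged sketch of a k'-round MST neighborhood graph: union of k' MSTs,
# each computed after removing the edges chosen in earlier rounds.
import numpy as np
from scipy.sparse.csgraph import minimum_spanning_tree
from scipy.spatial.distance import pdist, squareform

def k_mst_graph(X, k=2):
    d = squareform(pdist(X))           # dense pairwise distances
    union = np.zeros_like(d)
    for _ in range(k):
        mst = minimum_spanning_tree(d).toarray()
        union += mst + mst.T           # accumulate this round's tree edges
        d[mst > 0] = 0                 # zero entries = absent edges in scipy
        d[mst.T > 0] = 0
    return union                       # weighted adjacency of the k'-MST graph
```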

20 citations

Book ChapterDOI
13 May 2015
TL;DR: The proposed algorithms make use of a centroid-based nearest neighbor rule to generate a partition-based Local Neighborhood Graph (LNG), and it is proved that both the size of and the computational time to construct the graph (LNG) are O(n^(3/2)), an O(√n) factor improvement over the traditional algorithms.
Abstract: Minimum spanning tree (MST) based clustering algorithms have been employed successfully to detect clusters of heterogeneous nature. Given a dataset of n random points, most of the MST-based clustering algorithms first generate a complete graph G of the dataset and then construct the MST from G. The first step of the algorithm is the major bottleneck, taking O(n^2) time. This paper proposes two algorithms, MST-based clustering on K-means Graph and MST-based clustering on Bi-means Graph, for reducing the computational overhead. The proposed algorithms make use of a centroid-based nearest neighbor rule to generate a partition-based Local Neighborhood Graph (LNG). We prove that both the size of and the computational time to construct the graph (LNG) are O(n^(3/2)), which is an O(√n) factor improvement over the traditional algorithms. The approximate MST is constructed from the LNG in O(n^(3/2) lg n) time, which is asymptotically faster than O(n^2). The advantage of the proposed algorithms is that they do not require any parameter setting, which is a major issue in many of the nearest neighbor finding algorithms. Experimental results demonstrate that the computational time is reduced significantly while maintaining the quality of the clusters obtained from the MST.
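
As a rough illustration of the scheme the abstract outlines: partition the data, keep only local edges, and run an exact MST on the resulting sparse graph. The sketch below uses a plain K-means partition with one bridge per neighboring partition; the paper's actual LNG construction and its guarantees are more involved.

```python
# Simplified sketch of an approximate MST via a partition-based local
# neighborhood graph. Not the paper's exact LNG construction.
import numpy as np
from scipy.sparse import lil_matrix
from scipy.sparse.csgraph import minimum_spanning_tree
from scipy.spatial.distance import cdist
from sklearn.cluster import KMeans

def approximate_mst(X):
    n = len(X)
    k = max(2, int(np.sqrt(n)))                 # ~sqrt(n) partitions
    km = KMeans(n_clusters=k, n_init=3).fit(X)
    parts = [np.where(km.labels_ == c)[0] for c in range(k)]
    g = lil_matrix((n, n))
    for idx in parts:                           # complete subgraph per partition
        d = cdist(X[idx], X[idx])
        for a in range(len(idx)):
            for b in range(a + 1, len(idx)):
                g[idx[a], idx[b]] = d[a, b]
    for c in range(k):                          # one bridge to the nearest partition
        order = np.argsort(
            np.linalg.norm(km.cluster_centers_ - km.cluster_centers_[c], axis=1))
        nb = order[1]
        d = cdist(X[parts[c]], X[parts[nb]])
        a, b = np.unravel_index(d.argmin(), d.shape)
        g[parts[c][a], parts[nb][b]] = d[a, b]
    # Returns a spanning forest if the bridges leave the graph disconnected.
    return minimum_spanning_tree(g.tocsr())
```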

12 citations

Proceedings ArticleDOI
14 Jun 2018
TL;DR: A comparative study on software fault prediction using the K-means clustering algorithm and its variants; results indicate that proper initial seed selection enables the K-means algorithm to effectively group the faulty modules.
Abstract: Software fault prediction is an important task in the software development process, enabling software practitioners to easily detect and rectify errors in modules or classes. Various fault prediction techniques have been studied in the past, and unsupervised learning methods such as clustering have drawn much attention in recent years. K-means is a well-known clustering algorithm that has been applied to various exploratory analyses, including software fault prediction. This paper provides a comparative study on software fault prediction using the K-means clustering algorithm and its variants. We use five software fault prediction datasets taken from the PROMISE repository to evaluate the prediction accuracy of the clustering algorithms. Experimental results indicate that proper initial seed selection enables the K-means algorithm to effectively group the faulty modules.
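
A typical unsupervised setup behind such studies: cluster module metrics into two groups and flag the cluster with larger metric values as fault-prone. The sketch below uses synthetic data standing in for PROMISE metrics (LOC, complexity, and the like); the labeling heuristic is a common convention, not necessarily the paper's.

```python
# Hedged sketch of unsupervised fault prediction with K-means on
# synthetic module metrics (stand-in for PROMISE datasets).
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
clean = rng.normal(0.0, 1.0, size=(180, 4))    # most modules: low metric values
faulty = rng.normal(3.0, 1.0, size=(20, 4))    # a few modules: high metric values
X = StandardScaler().fit_transform(np.vstack([clean, faulty]))

km = KMeans(n_clusters=2, init="k-means++", n_init=10, random_state=0).fit(X)
# Convention: the cluster whose centroid has larger mean metric values
# is treated as the fault-prone group.
fault_cluster = int(np.argmax(km.cluster_centers_.mean(axis=1)))
print("modules flagged fault-prone:", int((km.labels_ == fault_cluster).sum()))
```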

5 citations


Cited by
Journal ArticleDOI
TL;DR: A comparative analysis based on the modified Dunn Index and the silhouette validity ratio shows that the proposed initialization algorithm performs better than the other initialization algorithms.
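
The silhouette ratio mentioned here is straightforward to compute; a minimal sketch comparing two scikit-learn initializations follows (the modified Dunn Index is specific to the paper and not reproduced).

```python
# Minimal sketch: scoring K-means initializations with the silhouette index.
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

X, _ = make_blobs(n_samples=400, centers=4, random_state=1)
for init in ("random", "k-means++"):
    labels = KMeans(n_clusters=4, init=init, n_init=1, random_state=1).fit_predict(X)
    print(init, "silhouette:", round(silhouette_score(X, labels), 3))
```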

37 citations

Journal ArticleDOI
TL;DR: Two different versions of a new internal index for clustering validation use graphs that capture the structural characteristics of each cluster; the index shows a superior capacity to deal with datasets that present different configurations of variances, densities, geometries and levels of noise.
Abstract: This paper presents two different versions of a new internal index for clustering validation using graphs. These graphs capture the structural characteristics of each cluster. In this way, the new index overcomes the limitations of traditional indices based on statistical measurements, and it is effective on clusters of different shapes and sizes. These graphs are generated through an iterative process based on principal component analysis, which partitions the clusters into a configurable number of "sub-clusters". Then, a minimum spanning tree based on the centroids of each of these sub-clusters is built and used to estimate both the quality of the clusters and the distances between them. In particular, the quality of a cluster is defined in this paper as the level of "cohesion" among its sub-clusters. The difference between the two versions of the proposed index is how this level of "cohesion" is measured. Finally, a comparison of the performance of these two versions of the proposed index with a selected group of well-known internal indices is carried out. In these tests, the two versions of the index show a superior capacity to deal with datasets that present different configurations of variances, densities, geometries and levels of noise.
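
The building blocks the abstract describes can be sketched directly: split a cluster into sub-clusters along its first principal component, then measure cohesion as the total weight of an MST over the sub-cluster centroids. This is a rough reading of the construction, not the paper's index; the splitting policy and cohesion formula here are illustrative assumptions.

```python
# Rough sketch: PCA-based sub-cluster splitting plus an MST over the
# sub-cluster centroids as a cohesion proxy. Illustrative, not the paper's index.
import numpy as np
from scipy.sparse.csgraph import minimum_spanning_tree
from scipy.spatial.distance import pdist, squareform
from sklearn.decomposition import PCA

def pca_split(points, n_subclusters=4):
    """Repeatedly bisect the largest piece at the median of its first PC."""
    pieces = [points]
    while len(pieces) < n_subclusters:
        i = max(range(len(pieces)), key=lambda j: len(pieces[j]))
        big = pieces.pop(i)
        if len(big) < 2:               # nothing left worth splitting
            pieces.append(big)
            break
        proj = PCA(n_components=1).fit_transform(big).ravel()
        med = np.median(proj)
        pieces += [big[proj <= med], big[proj > med]]
    return [p for p in pieces if len(p) > 0]

def cohesion(points):
    centroids = np.array([p.mean(axis=0) for p in pca_split(points)])
    mst = minimum_spanning_tree(squareform(pdist(centroids)))
    return mst.sum()                   # lower total edge weight = tighter cluster
```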

36 citations

Proceedings ArticleDOI
01 Dec 2016
TL;DR: This paper develops two variants of the bag of visual words (BOW and HOG-BOW), examines the use of gray and color information as well as different spatial pooling approaches, and modifies existing deep CNN architectures, AlexNet and GoogleNet.
Abstract: Most research in image classification has focused on applications such as face, object, scene and character recognition. This paper presents a comparative study between deep convolutional neural networks (CNNs) and bag of visual words (BOW) variants for recognizing animals. We developed two variants of the bag of visual words (BOW and HOG-BOW) and examined the use of gray and color information as well as different spatial pooling approaches. We combined the final feature vectors extracted from these BOW variants with a regularized L2 support vector machine (L2-SVM) to distinguish between classes within our datasets. We modified existing deep CNN architectures, AlexNet and GoogleNet, by reducing the number of neurons in each fully connected layer and in the last inception layer, for both scratch and pre-trained versions. Finally, we compared the existing CNN methods, our modified CNN architectures and the proposed BOW variants on our novel wild-animal dataset (Wild-Anim). The results show that the CNN methods significantly outperform the BOW techniques.
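
The CNN modification described, shrinking the fully connected layers of a pre-trained network before fine-tuning, looks roughly like the following torchvision sketch; the layer widths and class count are illustrative, not the paper's exact settings.

```python
# Hedged sketch: load a pre-trained AlexNet and replace its 4096-unit
# fully connected layers with narrower ones before fine-tuning.
import torch.nn as nn
from torchvision import models

num_classes = 5                        # e.g. five wild-animal categories
net = models.alexnet(weights="IMAGENET1K_V1")
net.classifier = nn.Sequential(        # AlexNet's conv features output 256*6*6
    nn.Dropout(),
    nn.Linear(256 * 6 * 6, 1024),      # 1024 instead of the original 4096
    nn.ReLU(inplace=True),
    nn.Dropout(),
    nn.Linear(1024, 1024),
    nn.ReLU(inplace=True),
    nn.Linear(1024, num_classes),
)
```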

36 citations
