Home
/
Authors
/
Vipin Kumar

Author

Vipin Kumar

Other affiliations: University of Tokyo, Krishna Institute of Engineering and Technology, Nanyang Technological University ...read more

Bio: Vipin Kumar is an academic researcher from Oak Ridge National Laboratory. The author has contributed to research in topics: Band gap & Thin film. The author has an hindex of 32, co-authored 295 publications receiving 12801 citations. Previous affiliations of Vipin Kumar include University of Tokyo & Krishna Institute of Engineering and Technology.

Topics: Band gap, Thin film, Lightning strike, Polyaniline, Crystallite ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2001
1999
1998
1997
1994
1990

Papers

PDF

Open Access

More filters

Book•

Introduction to Data Mining

[...]

Pang-Ning Tan, Michael Steinbach, Vipin Kumar

01 Jan 2013

TL;DR: This book discusses data mining through the lens of cluster analysis, which examines the relationships between data, clusters, and algorithms, and some of the techniques used to solve these problems.

...read moreread less

Abstract: 1 Introduction 1.1 What is Data Mining? 1.2 Motivating Challenges 1.3 The Origins of Data Mining 1.4 Data Mining Tasks 1.5 Scope and Organization of the Book 1.6 Bibliographic Notes 1.7 Exercises 2 Data 2.1 Types of Data 2.2 Data Quality 2.3 Data Preprocessing 2.4 Measures of Similarity and Dissimilarity 2.5 Bibliographic Notes 2.6 Exercises 3 Exploring Data 3.1 The Iris Data Set 3.2 Summary Statistics 3.3 Visualization 3.4 OLAP and Multidimensional Data Analysis 3.5 Bibliographic Notes 3.6 Exercises 4 Classification: Basic Concepts, Decision Trees, and Model Evaluation 4.1 Preliminaries 4.2 General Approach to Solving a Classification Problem 4.3 Decision Tree Induction 4.4 Model Overfitting 4.5 Evaluating the Performance of a Classifier 4.6 Methods for Comparing Classifiers 4.7 Bibliographic Notes 4.8 Exercises 5 Classification: Alternative Techniques 5.1 Rule-Based Classifier 5.2 Nearest-Neighbor Classifiers 5.3 Bayesian Classifiers 5.4 Artificial Neural Network (ANN) 5.5 Support Vector Machine (SVM) 5.6 Ensemble Methods 5.7 Class Imbalance Problem 5.8 Multiclass Problem 5.9 Bibliographic Notes 5.10 Exercises 6 Association Analysis: Basic Concepts and Algorithms 6.1 Problem Definition 6.2 Frequent Itemset Generation 6.3 Rule Generation 6.4 Compact Representation of Frequent Itemsets 6.5 Alternative Methods for Generating Frequent Itemsets 6.6 FP-Growth Algorithm 6.7 Evaluation of Association Patterns 6.8 Effect of Skewed Support Distribution 6.9 Bibliographic Notes 6.10 Exercises 7 Association Analysis: Advanced Concepts 7.1 Handling Categorical Attributes 7.2 Handling Continuous Attributes 7.3 Handling a Concept Hierarchy 7.4 Sequential Patterns 7.5 Subgraph Patterns 7.6 Infrequent Patterns 7.7 Bibliographic Notes 7.8 Exercises 8 Cluster Analysis: Basic Concepts and Algorithms 8.1 Overview 8.2 K-means 8.3 Agglomerative Hierarchical Clustering 8.4 DBSCAN 8.5 Cluster Evaluation 8.6 Bibliographic Notes 8.7 Exercises 9 Cluster Analysis: Additional Issues and Algorithms 9.1 Characteristics of Data, Clusters, and Clustering Algorithms 9.2 Prototype-Based Clustering 9.3 Density-Based Clustering 9.4 Graph-Based Clustering 9.5 Scalable Clustering Algorithms 9.6 Which Clustering Algorithm? 9.7 Bibliographic Notes 9.8 Exercises 10 Anomaly Detection 10.1 Preliminaries 10.2 Statistical Approaches 10.3 Proximity-Based Outlier Detection 10.4 Density-Based Outlier Detection 10.5 Clustering-Based Techniques 10.6 Bibliographic Notes 10.7 Exercises Appendix A Linear Algebra Appendix B Dimensionality Reduction Appendix C Probability and Statistics Appendix D Regression Appendix E Optimization Author Index Subject Index

...read moreread less

7,356 citations

Book•

Introduction to Data Mining, (First Edition)

[...]

Pang-Ning Tan, Michael Steinbach, Vipin Kumar

01 May 2005

1,598 citations

Book Chapter•DOI•

The Challenges of Clustering High Dimensional Data

[...]

Michael Steinbach, Levent Ertoz, Vipin Kumar

01 Jan 2004

TL;DR: This chapter provides a short introduction to cluster analysis, and presents a brief overview of several recent techniques, including a more detailed description of recent work of recent which uses a concept-based clustering approach.

...read moreread less

Abstract: Cluster analysis divides data into groups (clusters) for the purposes of summarization or improved understanding. For example, cluster analysis has been used to group related documents for browsing, to find genes and proteins that have similar functionality, or as a means of data compression. While clustering has a long history and a large number of clustering techniques have been developed in statistics, pattern recognition, data mining, and other fields, significant challenges still remain. In this chapter we provide a short introduction to cluster analysis, and then focus on the challenge of clustering high dimensional data. We present a brief overview of several recent techniques, including a more detailed description of recent work of our own which uses a concept-based clustering approach.

...read moreread less

449 citations

Journal Article•DOI•

Highly Transparent, Stretchable, and Self-Healing Ionic-Skin Triboelectric Nanogenerators for Energy Harvesting and Touch Applications.

[...]

Kaushik Parida¹, Vipin Kumar¹, Wang Jiangxin¹, Venkateswarlu Bhavanasi¹, Ramaraju Bendi¹, Pooi See Lee¹ - Show less +2 more•Institutions (1)

Nanyang Technological University¹

01 Oct 2017-Advanced Materials

TL;DR: The IS-TENG represents the first prototype of a highly deformable and transparent power source that is able to autonomously self-heal quickly and repeatedly at room temperature, and thus can be used as a power supply for digital watches, touch sensors, artificial intelligence, and biointegrated electronics.

...read moreread less

Abstract: Recently developed triboelectric nanogenerators (TENGs) act as a promising power source for self-powered electronic devices. However, the majority of TENGs are fabricated using metallic electrodes and cannot achieve high stretchability and transparency, simultaneously. Here, slime-based ionic conductors are used as transparent current-collecting layers of TENG, thus significantly enhancing their energy generation, stretchability, transparency, and instilling self-healing characteristics. This is the first demonstration of using an ionic conductor as the current collector in a mechanical energy harvester. The resulting ionic-skin TENG (IS-TENG) has a transparency of 92% transmittance, and its energy-harvesting performance is 12 times higher than that of the silver-based electronic current collectors. In addition, they are capable of enduring a uniaxial strain up to 700%, giving the highest performance compared to all other transparent and stretchable mechanical-energy harvesters. Additionally, this is the first demonstration of an autonomously self-healing TENG that can recover its performance even after 300 times of complete bifurcation. The IS-TENG represents the first prototype of a highly deformable and transparent power source that is able to autonomously self-heal quickly and repeatedly at room temperature, and thus can be used as a power supply for digital watches, touch sensors, artificial intelligence, and biointegrated electronics.

...read moreread less

298 citations

Journal Article•DOI•

Enhanced Piezoelectric Energy Harvesting Performance of Flexible PVDF-TrFE Bilayer Films with Graphene Oxide.

[...]

Venkateswarlu Bhavanasi¹, Vipin Kumar¹, Kaushik Parida¹, Jiangxin Wang¹, Pooi See Lee¹ - Show less +1 more•Institutions (1)

Nanyang Technological University¹

13 Jan 2016-ACS Applied Materials & Interfaces

TL;DR: This report reports, for the first time, the enhanced piezoelectric energy harvesting performance of the bilayer films of poled poly(vinylidene fluoride-trifluoroethylene) [PVDF-TrFE] and graphene oxide (GO).

...read moreread less

Abstract: Ferroelectric materials have attracted interest in recent years due to their application in energy harvesting owing to its piezoelectric nature. Ferroelectric polymers are flexible and can sustain larger strains compared to inorganic counterparts, making them attractive for harvesting energy from mechanical vibrations. Herein, we report, for the first time, the enhanced piezoelectric energy harvesting performance of the bilayer films of poled poly(vinylidene fluoride-trifluoroethylene) [PVDF-TrFE] and graphene oxide (GO). The bilayer film exhibits superior energy harvesting performance with a voltage output of 4 V and power output of 4.41 μWcm–2 compared to poled PVDF-TrFE films alone (voltage output of 1.9 V and power output of 1.77 μWcm–2). The enhanced voltage and power output in the presence of GO film is due to the combined effect of electrostatic contribution from graphene oxide, residual tensile stress, enhanced Young’s modulus of the bilayer films, and the presence of space charge at the interface...

...read moreread less

265 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Anomaly detection: A survey

[...]

Varun Chandola¹, Arindam Banerjee¹, Vipin Kumar¹•Institutions (1)

University of Minnesota¹

30 Jul 2009-ACM Computing Surveys

TL;DR: This survey tries to provide a structured and comprehensive overview of the research on anomaly detection by grouping existing techniques into different categories based on the underlying approach adopted by each technique.

...read moreread less

Abstract: Anomaly detection is an important problem that has been researched within diverse research areas and application domains. Many anomaly detection techniques have been specifically developed for certain application domains, while others are more generic. This survey tries to provide a structured and comprehensive overview of the research on anomaly detection. We have grouped existing techniques into different categories based on the underlying approach adopted by each technique. For each category we have identified key assumptions, which are used by the techniques to differentiate between normal and anomalous behavior. When applying a given technique to a particular domain, these assumptions can be used as guidelines to assess the effectiveness of the technique in that domain. For each category, we provide a basic anomaly detection technique, and then show how the different existing techniques in that category are variants of the basic technique. This template provides an easier and more succinct understanding of the techniques belonging to each category. Further, for each category, we identify the advantages and disadvantages of the techniques in that category. We also provide a discussion on the computational complexity of the techniques since it is an important issue in real application domains. We hope that this survey will provide a better understanding of the different directions in which research has been done on this topic, and how techniques developed in one area can be applied in domains for which they were not intended to begin with.

...read moreread less

9,627 citations

Journal Article•DOI•

Soil Chemical Analysis

[...]

C. I. Rich

01 May 1958-Agronomy Journal

7,335 citations

Journal Article•DOI•

Data clustering: 50 years beyond K-means

[...]

Anil K. Jain¹•Institutions (1)

Michigan State University¹

01 Jun 2010

TL;DR: A brief overview of clustering is provided, well known clustering methods are summarized, the major challenges and key issues in designing clustering algorithms are discussed, and some of the emerging and useful research directions are pointed out.

...read moreread less

Abstract: Organizing data into sensible groupings is one of the most fundamental modes of understanding and learning. As an example, a common scheme of scientific classification puts organisms into a system of ranked taxa: domain, kingdom, phylum, class, etc. Cluster analysis is the formal study of methods and algorithms for grouping, or clustering, objects according to measured or perceived intrinsic characteristics or similarity. Cluster analysis does not use category labels that tag objects with prior identifiers, i.e., class labels. The absence of category information distinguishes data clustering (unsupervised learning) from classification or discriminant analysis (supervised learning). The aim of clustering is to find structure in data and is therefore exploratory in nature. Clustering has a long and rich history in a variety of scientific fields. One of the most popular and simple clustering algorithms, K-means, was first published in 1955. In spite of the fact that K-means was proposed over 50 years ago and thousands of clustering algorithms have been published since then, K-means is still widely used. This speaks to the difficulty in designing a general purpose clustering algorithm and the ill-posed problem of clustering. We provide a brief overview of clustering, summarize well known clustering methods, discuss the major challenges and key issues in designing clustering algorithms, and point out some of the emerging and useful research directions, including semi-supervised clustering, ensemble clustering, simultaneous feature selection during data clustering, and large scale data clustering.

...read moreread less

6,601 citations

Journal Article•DOI•

Top 10 algorithms in data mining

[...]

Xindong Wu¹, Vipin Kumar², J. Ross Quinlan, Joydeep Ghosh³, Qiang Yang⁴, Hiroshi Motoda⁵, Geoffrey J. McLachlan⁶, Angus S. K. Ng⁷, Bing Liu⁸, Philip S. Yu⁹, Zhi-Hua Zhou¹⁰, Michael Steinbach², David J. Hand¹¹, Dan Steinberg¹² - Show less +10 more•Institutions (12)

University of Vermont¹, University of Minnesota², University of Texas at Austin³, Hong Kong University of Science and Technology⁴, Osaka University⁵, University of Queensland⁶, Griffith University⁷, University of Illinois at Chicago⁸, IBM⁹, Nanjing University¹⁰, Imperial College London¹¹, University of Salford¹²

19 Dec 2007-Knowledge and Information Systems

TL;DR: This paper presents the top 10 data mining algorithms identified by the IEEE International Conference on Data Mining (ICDM) in December 2006: C4.5, k-Means, SVM, Apriori, EM, PageRank, AdaBoost, kNN, Naive Bayes, and CART.

...read moreread less

Abstract: This paper presents the top 10 data mining algorithms identified by the IEEE International Conference on Data Mining (ICDM) in December 2006: C4.5, k-Means, SVM, Apriori, EM, PageRank, AdaBoost, kNN, Naive Bayes, and CART. These top 10 algorithms are among the most influential data mining algorithms in the research community. With each algorithm, we provide a description of the algorithm, discuss the impact of the algorithm, and review current and further research on the algorithm. These 10 algorithms cover classification, clustering, statistical learning, association analysis, and link mining, which are all among the most important topics in data mining research and development.

...read moreread less

4,944 citations

Book Chapter•DOI•

Data Clustering: 50 Years Beyond K-means

[...]

Anil K. Jain¹•Institutions (1)

Michigan State University¹

15 Sep 2008

TL;DR: Cluster analysis as mentioned in this paper is the formal study of algorithms and methods for grouping objects according to measured or perceived intrinsic characteristics, which is one of the most fundamental modes of understanding and learning.

...read moreread less

Abstract: The practice of classifying objects according to perceived similarities is the basis for much of science. Organizing data into sensible groupings is one of the most fundamental modes of understanding and learning. As an example, a common scheme of scientific classification puts organisms in to taxonomic ranks: domain, kingdom, phylum, class, etc.). Cluster analysis is the formal study of algorithms and methods for grouping objects according to measured or perceived intrinsic characteristics. Cluster analysis does not use category labels that tag objects with prior identifiers, i.e., class labels. The absence of category information distinguishes cluster analysis (unsupervised learning) from discriminant analysis (supervised learning). The objective of cluster analysis is to simply find a convenient and valid organization of the data, not to establish rules for separating future data into categories.

...read moreread less

4,255 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse