Author

Kalyan Veeramachaneni

Other affiliations: ALFA, Syracuse University
Bio: Kalyan Veeramachaneni is an academic researcher from Massachusetts Institute of Technology. The author has contributed to research in topics: Genetic programming & Particle swarm optimization. The author has an h-index of 29 and has co-authored 132 publications receiving 3,724 citations. Previous affiliations of Kalyan Veeramachaneni include ALFA & Syracuse University.


Papers
Proceedings ArticleDOI
24 Aug 2014
TL;DR: The efficacy and generality of OpenTuner are demonstrated by building autotuners for 7 distinct projects and 16 total benchmarks, showing speedups of up to 2.8x over these projects' prior techniques with little programmer effort.
Abstract: Program autotuning has been shown to achieve better or more portable performance in a number of domains. However, autotuners themselves are rarely portable between projects, for a number of reasons: using a domain-informed search space representation is critical to achieving good results; search spaces can be intractably large and require advanced machine learning techniques; and the landscape of search spaces can vary greatly between different problems, sometimes requiring domain-specific search techniques to explore efficiently. This paper introduces OpenTuner, a new open source framework for building domain-specific multi-objective program autotuners. OpenTuner supports fully-customizable configuration representations, an extensible technique representation to allow for domain-specific techniques, and an easy-to-use interface for communicating with the program to be autotuned. A key capability inside OpenTuner is the use of ensembles of disparate search techniques simultaneously; techniques that perform well will dynamically be allocated a larger proportion of tests. We demonstrate the efficacy and generality of OpenTuner by building autotuners for 7 distinct projects and 16 total benchmarks, showing speedups over prior techniques of these projects of up to 2.8x with little programmer effort.
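OpenTuner is open source, and defining a new autotuner amounts to implementing a measurement interface. The sketch below follows the project's documented tutorial pattern (ConfigurationManipulator, IntegerParameter, call_program, Result); the benchmark file matmul.cpp and the single tuned flag are hypothetical, and the exact API should be checked against the installed version.

#!/usr/bin/env python
# Minimal OpenTuner interface in the spirit of the project's GCC-flags
# tutorial: tune the -O level for a hypothetical benchmark.
import opentuner
from opentuner import ConfigurationManipulator, IntegerParameter
from opentuner import MeasurementInterface, Result


class FlagsTuner(MeasurementInterface):
    def manipulator(self):
        # Search space: one integer parameter (the optimization level).
        m = ConfigurationManipulator()
        m.add_parameter(IntegerParameter('opt_level', 0, 3))
        return m

    def run(self, desired_result, input, limit):
        # Compile and time one configuration; OpenTuner's ensemble of
        # search techniques decides which configuration to try next.
        cfg = desired_result.configuration.data
        self.call_program('g++ matmul.cpp -O%d -o ./tmp.bin' % cfg['opt_level'])
        run_result = self.call_program('./tmp.bin')
        return Result(time=run_result['time'])


if __name__ == '__main__':
    FlagsTuner.main(opentuner.default_argparser().parse_args())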

489 citations

Proceedings ArticleDOI
24 Apr 2003
TL;DR: A modification of the particle swarm optimization algorithm (PSO) intended to combat premature convergence: particles move towards nearby particles of higher fitness, and the resulting FDR-PSO algorithm is shown to perform significantly better than the original PSO and some of its variants on many different benchmark optimization problems.
Abstract: This paper presents a modification of the particle swarm optimization algorithm (PSO) intended to combat the problem of premature convergence observed in many applications of PSO. The proposed new algorithm moves particles towards nearby particles of higher fitness, instead of attracting each particle towards just the best position discovered so far by any particle. This is accomplished by using the ratio of the relative fitness and the distance of other particles to determine the direction in which each component of the particle position needs to be changed. The resulting algorithm (FDR-PSO) is shown to perform significantly better than the original PSO algorithm and some of its variants, on many different benchmark optimization problems. Empirical examination of the evolution of the particles demonstrates that the convergence of the algorithm does not occur at an early phase of particle evolution, unlike PSO. Avoiding premature convergence allows FDR-PSO to continue searching for global optima in difficult multimodal optimization problems.
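The per-dimension neighbor selection is the core of the method. Below is a minimal NumPy sketch of one FDR-PSO update for a minimization problem; the variable names and coefficient values w, c1, c2, c3 are illustrative rather than the paper's exact settings.

import numpy as np

def fdr_pso_step(X, V, pbest, pbest_cost, gbest, cost,
                 w=0.729, c1=1.0, c2=1.0, c3=2.0, rng=None):
    # X, V: (n, d) positions and velocities; pbest: (n, d) personal bests;
    # pbest_cost: (n,) costs of the personal bests; gbest: (d,) global best;
    # cost: callable mapping an (n, d) array to (n,) costs. Minimization.
    rng = rng or np.random.default_rng()
    n, d = X.shape
    fX = cost(X)
    nbest = np.empty_like(X)
    for i in range(n):
        gain = fX[i] - pbest_cost             # fitness improvement, (n,)
        dist = np.abs(pbest - X[i]) + 1e-12   # per-dimension distance, (n, d)
        fdr = gain[:, None] / dist            # fitness-distance ratio
        fdr[i] = -np.inf                      # exclude the particle itself
        j = np.argmax(fdr, axis=0)            # best neighbour per dimension
        nbest[i] = pbest[j, np.arange(d)]
    r1, r2, r3 = rng.random((3, n, d))
    V = (w * V + c1 * r1 * (pbest - X)
               + c2 * r2 * (gbest - X)
               + c3 * r3 * (nbest - X))
    return X + V, V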

461 citations

Posted Content
TL;DR: TGAN uses a conditional generative adversarial network to address the challenges of modeling the probability distribution of rows in tabular data; it outperforms Bayesian methods on most of the real datasets, whereas other deep learning methods do not.
Abstract: Modeling the probability distribution of rows in tabular data and generating realistic synthetic data is a non-trivial task. Tabular data usually contains a mix of discrete and continuous columns. Continuous columns may have multiple modes, whereas discrete columns are sometimes imbalanced, making modeling difficult. Existing statistical and deep neural network models fail to properly model this type of data. We design TGAN, which uses a conditional generative adversarial network to address these challenges. To aid in a fair and thorough comparison, we design a benchmark with 7 simulated and 8 real datasets and several Bayesian network baselines. TGAN outperforms Bayesian methods on most of the real datasets whereas other deep learning methods could not.
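The model is available in the group's open-source ctgan package; a minimal usage sketch is below, assuming a pandas DataFrame with mixed column types (the file path and column names are hypothetical, and the class name should be checked against the installed version).

import pandas as pd
from ctgan import CTGAN

# A tabular dataset with mixed continuous and discrete columns.
df = pd.read_csv('adult.csv')                             # hypothetical path
discrete_columns = ['workclass', 'education', 'income']   # assumed columns

model = CTGAN(epochs=300)         # conditional GAN over tabular rows
model.fit(df, discrete_columns)   # discrete columns drive the conditioning
synthetic = model.sample(1000)    # draw 1000 synthetic rows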

354 citations

Proceedings ArticleDOI
07 Dec 2015
TL;DR: This paper proposes and develops the Deep Feature Synthesis algorithm for automatically generating features for relational datasets, implements a generalizable machine learning pipeline, and tunes it using a novel Gaussian Copula process based approach.
Abstract: In this paper, we develop the Data Science Machine, which is able to derive predictive models from raw data automatically. To achieve this automation, we first propose and develop the Deep Feature Synthesis algorithm for automatically generating features for relational datasets. The algorithm follows relationships in the data to a base field, and then sequentially applies mathematical functions along that path to create the final feature. Second, we implement a generalizable machine learning pipeline and tune it using a novel Gaussian Copula process based approach. We entered the Data Science Machine in 3 data science competitions that featured 906 other data science teams. Our approach beats 615 teams in these data science competitions. In 2 of the 3 competitions we beat a majority of competitors, and in the third, we achieved 94% of the best competitor's score. In the best case, with an ongoing competition, we beat 85.6% of the teams and achieved 95.7% of the top submission's score.
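Deep Feature Synthesis is implemented in the open-source featuretools library. The sketch below builds two toy related tables and runs DFS over them, assuming the library's 1.x-style EntitySet API; the table and column names are illustrative.

import pandas as pd
import featuretools as ft

# Two related tables: customers and their transactions (toy data).
customers = pd.DataFrame({'customer_id': [1, 2],
                          'join_date': pd.to_datetime(['2020-01-01',
                                                       '2020-06-01'])})
transactions = pd.DataFrame({'transaction_id': [10, 11, 12],
                             'customer_id': [1, 1, 2],
                             'amount': [25.0, 40.0, 10.0]})

es = ft.EntitySet(id='retail')
es = es.add_dataframe(dataframe_name='customers', dataframe=customers,
                      index='customer_id', time_index='join_date')
es = es.add_dataframe(dataframe_name='transactions', dataframe=transactions,
                      index='transaction_id')
es = es.add_relationship('customers', 'customer_id',
                         'transactions', 'customer_id')

# DFS follows the relationship to the base table and stacks primitives
# (SUM, MEAN, COUNT, ...) along that path to create features.
feature_matrix, feature_defs = ft.dfs(entityset=es,
                                      target_dataframe_name='customers',
                                      agg_primitives=['sum', 'mean', 'count'],
                                      max_depth=2)
print(feature_matrix.head())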

297 citations

Proceedings ArticleDOI
01 Oct 2016
TL;DR: The Synthetic Data Vault (SDV) is presented, a system that builds generative models of relational databases and samples from them to create synthetic data, hence the name SDV.
Abstract: The goal of this paper is to build a system that automatically creates synthetic data to enable data science endeavors. To achieve this, we present the Synthetic Data Vault (SDV), a system that builds generative models of relational databases. We are able to sample from the model and create synthetic data, hence the name SDV. When implementing the SDV, we also developed an algorithm that computes statistics at the intersection of related database tables. We then used a state-of-the-art multivariate modeling approach to model this data. The SDV iterates through all possible relations, ultimately creating a model for the entire database. Once this model is computed, the same relational information allows the SDV to synthesize data by sampling from any part of the database. After building the SDV, we used it to generate synthetic data for five different publicly available datasets. We then published these datasets, and asked data scientists to develop predictive models for them as part of a crowdsourced experiment. By analyzing the outcomes, we show that synthetic data can successfully replace original data for data science. Our analysis indicates that there is no significant difference in the work produced by data scientists who used synthetic data as opposed to real data. We conclude that the SDV is a viable solution for synthetic data generation.
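The SDV lives on as an open-source library of the same name. The paper's relational modeling corresponds to the library's multi-table synthesizers; the sketch below shows the single-table Gaussian copula building block, assuming the sdv 1.x API (the file path is hypothetical).

import pandas as pd
from sdv.metadata import SingleTableMetadata
from sdv.single_table import GaussianCopulaSynthesizer

df = pd.read_csv('census.csv')        # hypothetical single-table dataset

metadata = SingleTableMetadata()
metadata.detect_from_dataframe(df)    # infer column types from the data

synth = GaussianCopulaSynthesizer(metadata)   # multivariate copula model
synth.fit(df)                                 # learn the joint distribution
synthetic = synth.sample(num_rows=len(df))    # synthetic stand-in table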

273 citations


Cited by
Journal ArticleDOI
TL;DR: The comprehensive learning particle swarm optimizer (CLPSO) is presented, which uses a novel learning strategy whereby all other particles' historical best information is used to update a particle's velocity.
Abstract: This paper presents a variant of particle swarm optimizers (PSOs) that we call the comprehensive learning particle swarm optimizer (CLPSO), which uses a novel learning strategy whereby all other particles' historical best information is used to update a particle's velocity. This strategy enables the diversity of the swarm to be preserved to discourage premature convergence. Experiments were conducted (using codes available from http://www.ntu.edu.sg/home/epnsugan) on multimodal test functions such as Rosenbrock, Griewank, Rastrigin, Ackley, and Schwefel and composition functions both with and without coordinate rotation. The results demonstrate good performance of the CLPSO in solving multimodal problems when compared with eight other recent variants of the PSO.
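A minimal NumPy sketch of the comprehensive learning update is below: each dimension of a particle's velocity learns from an exemplar chosen by a binary tournament over personal bests. Per-particle learning probabilities and the paper's refreshing gap are omitted for brevity, and the coefficient values are illustrative.

import numpy as np

def clpso_exemplar(pbest, pbest_cost, i, Pc, rng):
    # Pick, per dimension, whose personal best particle i learns from.
    n, d = pbest.shape
    exemplar = np.full(d, i)                      # default: own pbest
    for dim in np.where(rng.random(d) < Pc)[0]:
        a, b = rng.integers(0, n, size=2)         # binary tournament
        exemplar[dim] = a if pbest_cost[a] < pbest_cost[b] else b
    return pbest[exemplar, np.arange(d)]

def clpso_step(X, V, pbest, pbest_cost, w=0.729, c=1.494, Pc=0.3, rng=None):
    # One CLPSO velocity/position update (minimization).
    rng = rng or np.random.default_rng()
    n, d = X.shape
    for i in range(n):
        ex = clpso_exemplar(pbest, pbest_cost, i, Pc, rng)
        V[i] = w * V[i] + c * rng.random(d) * (ex - X[i])
    return X + V, V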

3,217 citations

Journal ArticleDOI
TL;DR: This work develops a novel framework that discovers the governing equations underlying a dynamical system directly from data measurements, leveraging advances in sparsity techniques and machine learning and using sparse regression to determine the fewest terms in the governing equations required to accurately represent the data.
Abstract: Extracting governing equations from data is a central challenge in many diverse areas of science and engineering. Data are abundant whereas models often remain elusive, as in climate science, neuroscience, ecology, finance, and epidemiology, to name only a few examples. In this work, we combine sparsity-promoting techniques and machine learning with nonlinear dynamical systems to discover governing equations from noisy measurement data. The only assumption about the structure of the model is that there are only a few important terms that govern the dynamics, so that the equations are sparse in the space of possible functions; this assumption holds for many physical systems in an appropriate basis. In particular, we use sparse regression to determine the fewest terms in the dynamic governing equations required to accurately represent the data. This results in parsimonious models that balance accuracy with model complexity to avoid overfitting. We demonstrate the algorithm on a wide range of problems, from simple canonical systems, including linear and nonlinear oscillators and the chaotic Lorenz system, to the fluid vortex shedding behind an obstacle. The fluid example illustrates the ability of this method to discover the underlying dynamics of a system that took experts in the community nearly 30 years to resolve. We also show that this method generalizes to parameterized systems and systems that are time-varying or have external forcing.
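The sparse-regression core of the method is sequentially thresholded least squares. Below is a minimal NumPy sketch with a toy demonstration that recovers dx/dt = -2x from a sampled trajectory; the candidate library and threshold are illustrative.

import numpy as np

def stlsq(Theta, dXdt, threshold=0.1, n_iter=10):
    # Sequentially thresholded least squares: find a sparse Xi such that
    # Theta @ Xi approximates dX/dt, pruning small coefficients each pass.
    Xi = np.linalg.lstsq(Theta, dXdt, rcond=None)[0]
    for _ in range(n_iter):
        small = np.abs(Xi) < threshold
        Xi[small] = 0.0
        for k in range(dXdt.shape[1]):            # refit each state equation
            big = ~small[:, k]
            if big.any():
                Xi[big, k] = np.linalg.lstsq(Theta[:, big], dXdt[:, k],
                                             rcond=None)[0]
    return Xi

# Toy demo: recover dx/dt = -2x from a sampled trajectory x(t) = exp(-2t).
t = np.linspace(0.0, 2.0, 200)
x = np.exp(-2.0 * t)
dxdt = np.gradient(x, t)[:, None]
Theta = np.column_stack([np.ones_like(t), x, x ** 2])   # library [1, x, x^2]
print(stlsq(Theta, dxdt, threshold=0.5))                # approx [0, -2, 0]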

2,784 citations

Book
24 Feb 2006
TL;DR: A book-length treatment of particle swarm optimization, covering first formulations, benchmarks, parameter settings, adaptations, the TRIBES co-operation scheme, constraints, applications, parallelism, combinatorial problems, and the dynamics of a swarm.
Abstract: Foreword. Introduction. Part 1: Particle Swarm Optimization. Chapter 1. What is a difficult problem? Chapter 2. On a table corner. Chapter 3. First formulations. Chapter 4. Benchmark set. Chapter 5. Mistrusting chance. Chapter 6. First results. Chapter 7. Swarm: memory and influence graphs. Chapter 8. Distributions of proximity. Chapter 9. Optimal parameter settings. Chapter 10. Adaptations. Chapter 11. TRIBES or co-operation of tribes. Chapter 12. On the constraints. Chapter 13. Problems and applications. Chapter 14. Conclusion. Part 2: Outlines. Chapter 15. On parallelism. Chapter 16. Combinatorial problems. Chapter 17. Dynamics of a swarm. Chapter 18. Techniques and alternatives. Further Information. Bibliography. Index.

1,293 citations

Journal ArticleDOI
TL;DR: A thorough survey to fully understand Few-shot Learning (FSL), which categorizes FSL methods from three perspectives: data, which uses prior knowledge to augment the supervised experience; model, which uses prior knowledge to reduce the size of the hypothesis space; and algorithm, which uses prior knowledge to alter the search for the best hypothesis in the given hypothesis space.
Abstract: Machine learning has been highly successful in data-intensive applications but is often hampered when the data set is small. Recently, Few-shot Learning (FSL) is proposed to tackle this problem. Using prior knowledge, FSL can rapidly generalize to new tasks containing only a few samples with supervised information. In this article, we conduct a thorough survey to fully understand FSL. Starting from a formal definition of FSL, we distinguish FSL from several relevant machine learning problems. We then point out that the core issue in FSL is that the empirical risk minimizer is unreliable. Based on how prior knowledge can be used to handle this core issue, we categorize FSL methods from three perspectives: (i) data, which uses prior knowledge to augment the supervised experience; (ii) model, which uses prior knowledge to reduce the size of the hypothesis space; and (iii) algorithm, which uses prior knowledge to alter the search for the best hypothesis in the given hypothesis space. With this taxonomy, we review and discuss the pros and cons of each category. Promising directions, in the aspects of the FSL problem setups, techniques, applications, and theories, are also proposed to provide insights for future research.

1,129 citations