Author

Mrinal Kalakrishnan

Other affiliations: Google, University of Southern California, Max Planck Society

Bio: Mrinal Kalakrishnan is an academic researcher from Stanford University. The author has contributed to research on topics including reinforcement learning and robots. The author has an h-index of 32 and has co-authored 63 publications receiving 4994 citations. Previous affiliations of Mrinal Kalakrishnan include Google and the University of Southern California.

Papers published on a yearly basis (chart; years shown: 2008 to 2021, and 2023)

Papers

Proceedings Article•

QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation

Dmitry Kalashnikov¹, Alex Irpan¹, Peter Pastor², Julian Ibarz¹, Alexander Herzog³, Eric Jang¹, Deirdre Quillen¹, Ethan Holly¹, Mrinal Kalakrishnan², Vincent Vanhoucke¹, Sergey Levine⁴

Google¹, Stanford University², Max Planck Society³, University of California, Berkeley⁴

27 Jun 2018

TL;DR: QT-Opt is a scalable self-supervised vision-based reinforcement learning framework that can leverage over 580k real-world grasp attempts to train a deep neural network Q-function with over 1.2M parameters.

Abstract: In this paper, we study the problem of learning vision-based dynamic manipulation skills using a scalable reinforcement learning approach. We study this problem in the context of grasping, a longstanding challenge in robotic manipulation. In contrast to static learning behaviors that choose a grasp point and then execute the desired grasp, our method enables closed-loop vision-based control, whereby the robot continuously updates its grasp strategy based on the most recent observations to optimize long-horizon grasp success. To that end, we introduce QT-Opt, a scalable self-supervised vision-based reinforcement learning framework that can leverage over 580k real-world grasp attempts to train a deep neural network Q-function with over 1.2M parameters to perform closed-loop, real-world grasping that generalizes to 96% grasp success on unseen objects. Aside from attaining a very high success rate, our method exhibits behaviors that are quite distinct from more standard grasping systems: using only RGB vision-based perception from an over-the-shoulder camera, our method automatically learns regrasping strategies, probes objects to find the most effective grasps, learns to reposition objects and perform other non-prehensile pre-grasp manipulations, and responds dynamically to disturbances and perturbations.

884 citations

Proceedings Article•DOI•

STOMP: Stochastic trajectory optimization for motion planning

Mrinal Kalakrishnan¹, Sachin Chitta², Evangelos A. Theodorou¹, Peter Pastor¹, Stefan Schaal¹

University of Southern California¹, Willow Garage²

09 May 2011

TL;DR: It is experimentally shown that the stochastic nature of STOMP allows it to overcome local minima that gradient-based methods like CHOMP can get stuck in.

Abstract: We present a new approach to motion planning using a stochastic trajectory optimization framework. The approach relies on generating noisy trajectories to explore the space around an initial (possibly infeasible) trajectory, which are then combined to produce an updated trajectory with lower cost. A cost function based on a combination of obstacle and smoothness cost is optimized in each iteration. No gradient information is required for the particular optimization algorithm that we use and so general costs for which derivatives may not be available (e.g. costs corresponding to constraints and motor torques) can be included in the cost function. We demonstrate the approach both in simulation and on a mobile manipulation system for unconstrained and constrained tasks. We experimentally show that the stochastic nature of STOMP allows it to overcome local minima that gradient-based methods like CHOMP can get stuck in.

817 citations
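
Illustrative note: the STOMP abstract above describes sampling noisy trajectories around an initial (possibly infeasible) one and combining them into a lower-cost update without using gradients. The following is a minimal sketch of that idea only, not the authors' implementation. Everything specific here is an assumption made for illustration: the toy 2D scene (a disc obstacle at (0.5, 0)), the cost weights, the blurred Gaussian noise, the per-trajectory exponential weighting, and the rule that an update is kept only if it lowers the cost. The published algorithm differs in detail.

```python
import math
import random

def cost(ys, radius=0.3):
    """Toy cost (assumed): penetration into a disc at (0.5, 0) plus a smoothness term."""
    n = len(ys)
    obstacle = sum(max(0.0, radius - math.hypot(i / (n - 1) - 0.5, y))
                   for i, y in enumerate(ys))
    smooth = sum((ys[i - 1] - 2 * ys[i] + ys[i + 1]) ** 2 for i in range(1, n - 1))
    return obstacle + 5.0 * smooth

def smooth_noise(n, sigma, rng):
    """Gaussian noise on interior waypoints, blurred so perturbations stay smooth."""
    eps = [0.0] + [rng.gauss(0.0, sigma) for _ in range(n - 2)] + [0.0]
    for _ in range(3):
        eps = [0.0] + [(eps[i - 1] + eps[i] + eps[i + 1]) / 3
                       for i in range(1, n - 1)] + [0.0]
    return eps

def stomp_like_step(ys, rng, samples=20, sigma=0.2, sharpness=10.0):
    """One iteration: sample noisy trajectories, weight them by cost, combine the noise."""
    noises = [smooth_noise(len(ys), sigma, rng) for _ in range(samples)]
    costs = [cost([y + e for y, e in zip(ys, eps)]) for eps in noises]
    lo, hi = min(costs), max(costs)
    span = (hi - lo) or 1.0
    # Lower-cost samples get exponentially larger weights; no gradients are used.
    weights = [math.exp(-sharpness * (c - lo) / span) for c in costs]
    total = sum(weights)
    update = [sum(w * eps[i] for w, eps in zip(weights, noises)) / total
              for i in range(len(ys))]
    candidate = [y + u for y, u in zip(ys, update)]
    # Simplification (assumed): keep the update only if it lowers the cost.
    return candidate if cost(candidate) < cost(ys) else ys

def optimize(n=21, iterations=50, seed=0):
    """Start from a straight line through the obstacle and iterate."""
    rng = random.Random(seed)
    ys = [0.0] * n  # heights of waypoints evenly spaced along x in [0, 1]
    for _ in range(iterations):
        ys = stomp_like_step(ys, rng)
    return ys
```

Calling `optimize()` returns the waypoint heights after 50 iterations; comparing `cost(optimize())` with `cost([0.0] * 21)` shows whether the sampled updates reduced the toy cost. The start and end waypoints stay fixed because the noise is zero there.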

Proceedings Article•DOI•

Using Simulation and Domain Adaptation to Improve Efficiency of Deep Robotic Grasping

Konstantinos Bousmalis¹, Alex Irpan¹, Paul Wohlhart¹, Yunfei Bai¹, Matthew Kelcey¹, Mrinal Kalakrishnan², Laura Downs¹, Julian Ibarz¹, Peter Pastor², Kurt Konolige¹, Sergey Levine¹, Vincent Vanhoucke¹

Google¹, Stanford University²

21 May 2018

TL;DR: In this paper, the authors study how randomized simulated environments and domain adaptation methods can be extended to train a grasping system to grasp novel objects from raw monocular RGB images. They evaluate a range of approaches, including a novel extension of pixel-level domain adaptation termed GraspGAN, with a total of more than 25,000 physical test grasps.

Abstract: Instrumenting and collecting annotated visual grasping datasets to train modern machine learning algorithms can be extremely time-consuming and expensive. An appealing alternative is to use off-the-shelf simulators to render synthetic data for which ground-truth annotations are generated automatically. Unfortunately, models trained purely on simulated data often fail to generalize to the real world. We study how randomized simulated environments and domain adaptation methods can be extended to train a grasping system to grasp novel objects from raw monocular RGB images. We extensively evaluate our approaches with a total of more than 25,000 physical test grasps, studying a range of simulation conditions and domain adaptation methods, including a novel extension of pixel-level domain adaptation that we term the GraspGAN. We show that, by using synthetic data and domain adaptation, we are able to reduce the number of real-world samples needed to achieve a given level of performance by up to 50 times, using only randomly generated simulated objects. We also show that by using only unlabeled real-world data and our GraspGAN methodology, we obtain real-world grasping performance without any real-world labels that is similar to that achieved with 939,777 labeled real-world samples.

459 citations

Proceedings Article•DOI•

Sim-To-Real via Sim-To-Sim: Data-Efficient Robotic Grasping via Randomized-To-Canonical Adaptation Networks

Stephen James¹, Paul Wohlhart², Mrinal Kalakrishnan, Dmitry Kalashnikov², Alex Irpan², Julian Ibarz², Sergey Levine², Raia Hadsell, Konstantinos Bousmalis

Imperial College London¹, Google²

01 Jun 2019

TL;DR: This paper presents Randomized-to-Canonical Adaptation Networks (RCANs), a novel approach to crossing the visual reality gap that uses no real-world data and learns to translate randomized rendered images into their equivalent non-randomized, canonical versions.

Abstract: Real world data, especially in the domain of robotics, is notoriously costly to collect. One way to circumvent this can be to leverage the power of simulation to produce large amounts of labelled data. However, training models on simulated images does not readily transfer to real-world ones. Using domain adaptation methods to cross this "reality gap" requires a large amount of unlabelled real-world data, whilst domain randomization alone can waste modeling power. In this paper, we present Randomized-to-Canonical Adaptation Networks (RCANs), a novel approach to crossing the visual reality gap that uses no real-world data. Our method learns to translate randomized rendered images into their equivalent non-randomized, canonical versions. This in turn allows for real images to also be translated into canonical sim images. We demonstrate the effectiveness of this sim-to-real approach by training a vision-based closed-loop grasping reinforcement learning agent in simulation, and then transferring it to the real world to attain 70% zero-shot grasp success on unseen objects, a result that almost doubles the success of learning the same task directly on domain randomization alone. Additionally, by joint finetuning in the real-world with only 5,000 real-world grasps, our method achieves 91%, attaining comparable performance to a state-of-the-art system trained with 580,000 real-world grasps, resulting in a reduction of real-world data by more than 99%.

299 citations

Journal Article•DOI•

Learning, planning, and control for quadruped locomotion over challenging terrain

Mrinal Kalakrishnan¹, Jonas Buchli¹, Peter Pastor¹, Michael Mistry², Stefan Schaal¹

University of Southern California¹, Disney Research²

01 Feb 2011-The International Journal of Robotics Research

TL;DR: This paper presents a control architecture for quadruped locomotion that includes a floating-base inverse dynamics controller allowing robust, compliant locomotion over unperceived obstacles. The generalization ability of the controller is demonstrated by results from testing performed by an independent external test team on terrain never shown to the authors.

Abstract: We present a control architecture for fast quadruped locomotion over rough terrain. We approach the problem by decomposing it into many sub-systems, in which we apply state-of-the-art learning, planning, optimization, and control techniques to achieve robust, fast locomotion. Unique features of our control strategy include: (1) a system that learns optimal foothold choices from expert demonstration using terrain templates, (2) a body trajectory optimizer based on the Zero-Moment Point (ZMP) stability criterion, and (3) a floating-base inverse dynamics controller that, in conjunction with force control, allows for robust, compliant locomotion over unperceived obstacles. We evaluate the performance of our controller by testing it on the LittleDog quadruped robot, over a wide variety of rough terrains of varying difficulty levels. The terrain that the robot was tested on includes rocks, logs, steps, barriers, and gaps, with obstacle sizes up to the leg length of the robot. We demonstrate the generalization ability of this controller by presenting results from testing performed by an independent external test team on terrain that has never been shown to us.

290 citations

Cited by

Journal Article•

Study report on “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”

杉山拓海

12 Sep 2017-Computers & Graphics

3,940 citations

On robust estimation of the location parameter

Frederick R. Forst

01 Jan 1980

3,652 citations

Journal Article•DOI•

Reinforcement learning in robotics: A survey

Jens Kober¹, J. Andrew Bagnell², Jan Peters³

Bielefeld University¹, Carnegie Mellon University², Max Planck Society³

01 Sep 2013-The International Journal of Robotics Research

TL;DR: This article surveys work in reinforcement learning for behavior generation in robots, highlighting both key challenges in robot reinforcement learning and notable successes, with the aim of strengthening the links between the two research communities.

Abstract: Reinforcement learning offers to robotics a framework and set of tools for the design of sophisticated and hard-to-engineer behaviors. Conversely, the challenges of robotic problems provide both inspiration, impact, and validation for developments in reinforcement learning. The relationship between disciplines has sufficient promise to be likened to that between physics and mathematics. In this article, we attempt to strengthen the links between the two research communities by providing a survey of work in reinforcement learning for behavior generation in robots. We highlight both key challenges in robot reinforcement learning as well as notable successes. We discuss how contributions tamed the complexity of the domain and study the role of algorithms, representations, and prior knowledge in achieving these successes. As a result, a particular focus of our paper lies on the choice between model-based and model-free as well as between value-function-based and policy-search methods. By analyzing a simple problem in some detail we demonstrate how reinforcement learning approaches may be profitably applied, and we note throughout open questions and the tremendous potential for future research.

2,391 citations

Journal Article•

End-to-end training of deep visuomotor policies

Sergey Levine¹, Chelsea Finn¹, Trevor Darrell¹, Pieter Abbeel¹

University of California, Berkeley¹

01 Jan 2016-Journal of Machine Learning Research

TL;DR: In this article, a guided policy search method is used to map raw image observations directly to torques at the robot's motors, with supervision provided by a simple trajectory-centric reinforcement learning method.

Abstract: Policy search methods can allow robots to learn control policies for a wide range of tasks, but practical applications of policy search often require hand-engineered components for perception, state estimation, and low-level control. In this paper, we aim to answer the following question: does training the perception and control systems jointly end-to-end provide better performance than training each component separately? To this end, we develop a method that can be used to learn policies that map raw image observations directly to torques at the robot's motors. The policies are represented by deep convolutional neural networks (CNNs) with 92,000 parameters, and are trained using a guided policy search method, which transforms policy search into supervised learning, with supervision provided by a simple trajectory-centric reinforcement learning method. We evaluate our method on a range of real-world manipulation tasks that require close coordination between vision and control, such as screwing a cap onto a bottle, and present simulated comparisons to a range of prior policy search methods.

1,934 citations

Genetic Dissection of Complex Traits

Eric S. Lander¹, Nicholas J. Schork²

Massachusetts Institute of Technology¹, Case Western Reserve University²

29 Jan 2015

TL;DR: The current state of the genetic dissection of complex traits is summarized in this paper, which describes the methods, limitations, and recent applications to biological problems, including linkage analysis, allele-sharing methods, association studies, and polygenic analysis of experimental crosses.

Abstract: Medical genetics was revolutionized during the 1980s by the application of genetic mapping to locate the genes responsible for simple Mendelian diseases. Most diseases and traits, however, do not follow simple inheritance patterns. Geneticists have thus begun taking up the even greater challenge of the genetic dissection of complex traits. Four major approaches have been developed: linkage analysis, allele-sharing methods, association studies, and polygenic analysis of experimental crosses. This article synthesizes the current state of the genetic dissection of complex traits—describing the methods, limitations, and recent applications to biological problems.

1,805 citations
