Author

Mykel J. Kochenderfer

Bio: Mykel J. Kochenderfer is an academic researcher from Stanford University. The author has contributed to research in topics including Markov decision processes and computer science. The author has an h-index of 41 and has co-authored 388 publications receiving 8,215 citations. Previous affiliations of Mykel J. Kochenderfer include the Massachusetts Institute of Technology and the University of Edinburgh.


Papers
Book ChapterDOI
24 Jul 2017
TL;DR: In this paper, the authors presented a scalable and efficient technique for verifying properties of deep neural networks (or providing counter-examples) based on the simplex method, extended to handle the non-convex Rectified Linear Unit (ReLU) activation function.
Abstract: Deep neural networks have emerged as a widely used and effective means for tackling complex, real-world problems. However, a major obstacle in applying them to safety-critical systems is the great difficulty in providing formal guarantees about their behavior. We present a novel, scalable, and efficient technique for verifying properties of deep neural networks (or providing counter-examples). The technique is based on the simplex method, extended to handle the non-convex Rectified Linear Unit (ReLU) activation function, which is a crucial ingredient in many modern neural networks. The verification procedure tackles neural networks as a whole, without making any simplifying assumptions. We evaluated our technique on a prototype deep neural network implementation of the next-generation airborne collision avoidance system for unmanned aircraft (ACAS Xu). Results show that our technique can successfully prove properties of networks that are an order of magnitude larger than the largest networks verified using existing methods.

1,332 citations
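
The verification approach described in the abstract above reduces questions about a ReLU network's behavior to linear reasoning. The following is a minimal sketch of that underlying idea only, not the authors' Reluplex algorithm: it exhaustively case-splits each ReLU into its two linear phases and solves one small linear program per phase pattern with SciPy, whereas Reluplex performs the case analysis lazily inside an extended simplex procedure. The toy network, weights, and output bound below are hypothetical.

```python
# Minimal sketch (not the authors' Reluplex implementation): check that a tiny
# 1-input ReLU network never exceeds an output bound by enumerating the linear
# "phases" of each ReLU and solving one LP per phase pattern with SciPy.
from itertools import product
import numpy as np
from scipy.optimize import linprog

# Hypothetical toy network: y = v . relu(W*x + b), scalar input x in [0, 1].
W = np.array([1.0, -2.0])   # hidden pre-activations z_i = W_i * x + b_i
b = np.array([-0.25, 1.0])
v = np.array([1.5, 0.5])    # output y = v . h
BOUND = 2.0                 # property to check: y <= BOUND for all x in [0, 1]

worst = -np.inf
for phases in product([0, 1], repeat=len(W)):        # 0 = inactive, 1 = active
    # With phases fixed, h_i = phase_i * (W_i*x + b_i), so y is linear in x.
    slope = sum(p * vi * wi for p, vi, wi in zip(phases, v, W))
    const = sum(p * vi * bi for p, vi, bi in zip(phases, v, b))
    # Phase constraints: active => W_i*x + b_i >= 0, inactive => W_i*x + b_i <= 0.
    A_ub, b_ub = [], []
    for p, wi, bi in zip(phases, W, b):
        if p:   # W_i*x + b_i >= 0  <=>  -W_i*x <= b_i
            A_ub.append([-wi]); b_ub.append(bi)
        else:   # W_i*x + b_i <= 0  <=>   W_i*x <= -b_i
            A_ub.append([wi]);  b_ub.append(-bi)
    # Maximize y over the feasible inputs for this phase pattern (minimize -y).
    res = linprog(c=[-slope], A_ub=A_ub, b_ub=b_ub, bounds=[(0.0, 1.0)])
    if res.success:
        worst = max(worst, slope * res.x[0] + const)

if worst <= BOUND:
    print(f"property y <= {BOUND} holds (worst case y = {worst:.3f})")
else:
    print(f"counterexample: y can reach {worst:.3f} > {BOUND}")
```

Exhaustive case splitting is exponential in the number of ReLUs, which is exactly why the paper's lazy treatment of the case split inside the simplex procedure matters for realistically sized networks.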

Book ChapterDOI
08 May 2017
TL;DR: It is shown that policy gradient methods tend to outperform both temporal-difference and actor-critic methods and that curriculum learning is vital to scaling reinforcement learning algorithms in complex multi-agent domains.
Abstract: This work considers the problem of learning cooperative policies in complex, partially observable domains without explicit communication. We extend three classes of single-agent deep reinforcement learning algorithms based on policy gradient, temporal-difference error, and actor-critic methods to cooperative multi-agent systems. To effectively scale these algorithms beyond a trivial number of agents, we combine them with a multi-agent variant of curriculum learning. The algorithms are benchmarked on a suite of cooperative control tasks, including tasks with discrete and continuous actions, as well as tasks with dozens of cooperating agents. We report the performance of the algorithms using different neural architectures, training procedures, and reward structures. We show that policy gradient methods tend to outperform both temporal-difference and actor-critic methods and that curriculum learning is vital to scaling reinforcement learning algorithms in complex multi-agent domains.

697 citations
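
As a rough illustration of the policy-gradient idea that the abstract above scales up with deep networks and curriculum learning, here is a minimal sketch, not code from the paper: two agents with independent softmax policies receive a shared reward on a made-up coordination task, and a REINFORCE-style update with a running baseline nudges both policies toward cooperating. The task, reward structure, and hyperparameters are all hypothetical.

```python
# Minimal sketch (not the paper's benchmark suite): REINFORCE-style policy
# gradient for a toy cooperative task in which two agents receive a shared
# reward of 1 only when they pick the same action.
import numpy as np

rng = np.random.default_rng(0)
n_agents, n_actions, lr = 2, 2, 0.1
logits = np.zeros((n_agents, n_actions))   # one softmax policy per agent
baseline = 0.0                             # running average reward as baseline

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

for episode in range(2000):
    probs = np.array([softmax(row) for row in logits])
    actions = np.array([rng.choice(n_actions, p=p) for p in probs])
    reward = 1.0 if len(set(actions)) == 1 else 0.0    # cooperate by matching
    baseline += 0.01 * (reward - baseline)
    advantage = reward - baseline
    for i in range(n_agents):
        # gradient of log pi(a_i) w.r.t. the logits is one_hot(a_i) - probs_i
        grad = -probs[i]
        grad[actions[i]] += 1.0
        logits[i] += lr * advantage * grad

print("final policies:\n", np.array([softmax(row) for row in logits]).round(3))
```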

Posted Content
TL;DR: Results show that the novel, scalable, and efficient technique presented can successfully prove properties of networks that are an order of magnitude larger than the largest networks verified using existing methods.
Abstract: Deep neural networks have emerged as a widely used and effective means for tackling complex, real-world problems. However, a major obstacle in applying them to safety-critical systems is the great difficulty in providing formal guarantees about their behavior. We present a novel, scalable, and efficient technique for verifying properties of deep neural networks (or providing counter-examples). The technique is based on the simplex method, extended to handle the non-convex Rectified Linear Unit (ReLU) activation function, which is a crucial ingredient in many modern neural networks. The verification procedure tackles neural networks as a whole, without making any simplifying assumptions. We evaluated our technique on a prototype deep neural network implementation of the next-generation airborne collision avoidance system for unmanned aircraft (ACAS Xu). Results show that our technique can successfully prove properties of networks that are an order of magnitude larger than the largest networks verified using existing methods.

421 citations

Book
17 Jul 2015
TL;DR: This book provides an introduction to the challenges of decision making under uncertainty from a computational perspective and presents both the theory behind decision making models and algorithms and a collection of example applications that range from speech recognition to aircraft collision avoidance.
Abstract: Many important problems involve decision making under uncertainty -- that is, choosing actions based on often imperfect observations, with unknown outcomes. Designers of automated decision support systems must take into account the various sources of uncertainty while balancing the multiple objectives of the system. This book provides an introduction to the challenges of decision making under uncertainty from a computational perspective. It presents both the theory behind decision making models and algorithms and a collection of example applications that range from speech recognition to aircraft collision avoidance. Focusing on two methods for designing decision agents, planning and reinforcement learning, the book covers probabilistic models, introducing Bayesian networks as a graphical model that captures probabilistic relationships between variables; utility theory as a framework for understanding optimal decision making under uncertainty; Markov decision processes as a method for modeling sequential problems; model uncertainty; state uncertainty; and cooperative decision making involving multiple interacting agents. A series of applications shows how the theoretical concepts can be applied to systems for attribute-based person search, speech applications, collision avoidance, and unmanned aircraft persistent surveillance. Decision Making Under Uncertainty unifies research from different communities using consistent notation, and is accessible to students and researchers across engineering disciplines who have some prior exposure to probability theory and calculus. It can be used as a text for advanced undergraduate and graduate students in fields including computer science, aerospace and electrical engineering, and management science. It will also be a valuable professional reference for researchers in a variety of disciplines.

412 citations
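
The book's core formalism is the Markov decision process. As a small, self-contained illustration, and not code from the book, the sketch below runs value iteration on a made-up three-state MDP: the Bellman backup V(s) <- max_a sum_s' T(s'|s,a) [R(s') + gamma V(s')] is applied until the values converge, and a greedy policy is read off from the resulting action values.

```python
# Minimal sketch of one concept covered by the book (not code from it):
# value iteration on a small, hypothetical MDP.
import numpy as np

n_states, n_actions, gamma = 3, 2, 0.95
# T[a, s, s'] = probability of moving from s to s' under action a (made up).
T = np.array([
    [[0.9, 0.1, 0.0],   # action 0
     [0.1, 0.8, 0.1],
     [0.0, 0.1, 0.9]],
    [[0.2, 0.8, 0.0],   # action 1
     [0.0, 0.2, 0.8],
     [0.0, 0.0, 1.0]],
])
R = np.array([0.0, 0.0, 1.0])      # reward for landing in each state

V = np.zeros(n_states)
for _ in range(1000):
    Q = T @ (R + gamma * V)        # Q[a, s] = expected return of a in s
    V_new = Q.max(axis=0)          # Bellman optimality backup
    if np.max(np.abs(V_new - V)) < 1e-8:
        break
    V = V_new

policy = Q.argmax(axis=0)
print("optimal values:", V.round(3))
print("greedy policy :", policy)
```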

Book ChapterDOI
15 Jul 2019
TL;DR: Marabou is an SMT-based tool that can answer queries about a network’s properties by transforming these queries into constraint satisfaction problems, and it performs high-level reasoning on the network that can curtail the search space and improve performance.
Abstract: Deep neural networks are revolutionizing the way complex systems are designed. Consequently, there is a pressing need for tools and techniques for network analysis and certification. To help in addressing that need, we present Marabou, a framework for verifying deep neural networks. Marabou is an SMT-based tool that can answer queries about a network’s properties by transforming these queries into constraint satisfaction problems. It can accommodate networks with different activation functions and topologies, and it performs high-level reasoning on the network that can curtail the search space and improve performance. It also supports parallel execution to further enhance scalability. Marabou accepts multiple input formats, including protocol buffer files generated by the popular TensorFlow framework for neural networks. We describe the system architecture and main components, evaluate the technique and discuss ongoing work.

375 citations
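
Marabou's own input formats and Python bindings are not shown here; the sketch below only illustrates what "transforming a property query into a constraint satisfaction problem" means, using the off-the-shelf Z3 SMT solver (the z3-solver package) on a hypothetical two-neuron ReLU network. The property "y <= 2 for all x in [0, 1]" is checked by asserting its negation: an unsat answer means no counterexample exists, so the property holds.

```python
# Illustrative sketch only: not Marabou's API or solving engine, just a generic
# SMT encoding of the same kind of query using Z3. Weights and the property
# are made up for illustration.
from z3 import Real, Solver, If, sat

x = Real("x")                               # single network input in [0, 1]
h1 = If(x - 0.25 >= 0, x - 0.25, 0)         # ReLU(x - 0.25)
h2 = If(-2 * x + 1 >= 0, -2 * x + 1, 0)     # ReLU(-2x + 1)
y = 1.5 * h1 + 0.5 * h2                     # network output

s = Solver()
s.add(x >= 0, x <= 1)
s.add(y > 2)           # negation of the property "y <= 2 on [0, 1]"

if s.check() == sat:
    print("property violated, counterexample:", s.model())
else:
    print("unsat: no input in [0, 1] drives y above 2, so the property holds")
```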


Cited by
Christopher M. Bishop
01 Jan 2006
TL;DR: This text covers probability distributions, linear models for regression and classification, neural networks, kernel methods, graphical models, mixture models and EM, approximate inference, sampling methods, sequential data, and the combination of models in the context of machine learning.
Abstract: Probability Distributions; Linear Models for Regression; Linear Models for Classification; Neural Networks; Kernel Methods; Sparse Kernel Machines; Graphical Models; Mixture Models and EM; Approximate Inference; Sampling Methods; Continuous Latent Variables; Sequential Data; Combining Models.

10,141 citations

Journal ArticleDOI
01 Apr 1988-Nature
TL;DR: In this paper, a sedimentological core and petrographic characterisation of samples from eleven boreholes from the Lower Carboniferous of Bowland Basin (Northwest England) is presented.
Abstract: Deposits of clastic carbonate-dominated (calciclastic) sedimentary slope systems in the rock record have been identified mostly as linearly-consistent carbonate apron deposits, even though most ancient clastic carbonate slope deposits fit submarine fan systems better. Calciclastic submarine fans are consequently rarely described and are poorly understood, and very little is known in particular about mud-dominated calciclastic submarine fan systems. Presented in this study are a sedimentological core and petrographic characterisation of samples from eleven boreholes from the Lower Carboniferous of the Bowland Basin (Northwest England) that reveals a >250 m thick calciturbidite complex deposited in a calciclastic submarine fan setting. Seven facies are recognised from core and thin section characterisation and are grouped into three carbonate turbidite sequences. They include: 1) calciturbidites, comprising mostly high- to low-density, wavy-laminated bioclast-rich facies; 2) low-density densite mudstones, which are characterised by planar laminated and unlaminated mud-dominated facies; and 3) calcidebrites, which are muddy or hyper-concentrated debris-flow deposits occurring as poorly-sorted, chaotic, mud-supported floatstones. These

9,929 citations