Author

Richard E. Korf

Other affiliations: Columbia University, Carnegie Mellon University

Bio: Richard E. Korf is an academic researcher at the University of California, Los Angeles. The author has contributed to research on search algorithms and beam search, has an h-index of 49, and has co-authored 133 publications that have received 9,123 citations.

Papers published on a yearly basis

Years listed (chart not reproduced): 1977, 1980-1983, 1985-2005, 2007-2016, 2018-2019, 2021-2022.

Papers

Journal Article

Depth-first iterative-deepening: an optimal admissible tree search

Richard E. Korf¹

Columbia University¹

01 Sep 1985-Artificial Intelligence

TL;DR: This heuristic depth-first iterative-deepening algorithm is the only known algorithm that is capable of finding optimal solutions to randomly generated instances of the Fifteen Puzzle within practical resource limits.

1,698 citations
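
The heuristic depth-first iterative-deepening idea summarized in this entry can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes a generic `ida_star` helper (a hypothetical name), and it uses the smaller 8-puzzle with a Manhattan-distance heuristic instead of the Fifteen Puzzle so that the example runs instantly. Each pass is a depth-first search cut off where cost so far plus heuristic exceeds a bound, and the bound then rises to the smallest value that was cut off.

```python
from math import inf

GOAL = (1, 2, 3, 4, 5, 6, 7, 8, 0)  # 0 marks the blank (illustrative choice)

def manhattan(state):
    """Sum of row and column distances of each tile from its goal cell."""
    total = 0
    for index, tile in enumerate(state):
        if tile:
            goal_index = tile - 1
            total += abs(index // 3 - goal_index // 3)
            total += abs(index % 3 - goal_index % 3)
    return total

def neighbors(state):
    """Yield (next_state, step_cost) for every legal slide of the blank."""
    blank = state.index(0)
    row, col = divmod(blank, 3)
    for d_row, d_col in ((-1, 0), (1, 0), (0, -1), (0, 1)):
        n_row, n_col = row + d_row, col + d_col
        if 0 <= n_row < 3 and 0 <= n_col < 3:
            target = n_row * 3 + n_col
            cells = list(state)
            cells[blank], cells[target] = cells[target], cells[blank]
            yield tuple(cells), 1

def ida_star(start, is_goal, successors, h):
    """Return a cheapest path as a list of states, or None if the space is exhausted."""
    path = [start]

    def search(g, bound):
        node = path[-1]
        f = g + h(node)
        if f > bound:
            return False, f            # cut off; report the exceeded f value
        if is_goal(node):
            return True, f
        smallest = inf
        for child, cost in successors(node):
            if child in path:          # skip cycles along the current path
                continue
            path.append(child)
            found, value = search(g + cost, bound)
            if found:
                return True, value
            smallest = min(smallest, value)
            path.pop()
        return False, smallest

    bound = h(start)
    while True:
        found, value = search(0, bound)
        if found:
            return list(path)
        if value == inf:
            return None
        bound = value                  # next pass uses the smallest cut-off f

solution = ida_star((1, 2, 3, 0, 4, 6, 7, 5, 8), lambda s: s == GOAL, neighbors, manhattan)
```

Only the current path is stored, so memory grows with solution depth, not with the number of states generated. On an unsolvable puzzle instance this sketch would keep raising the bound for an impractically long time, so callers should check solvability first.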

Journal Article

Real-time heuristic search

Richard E. Korf¹

University of California, Los Angeles¹

03 Mar 1990-Artificial Intelligence

TL;DR: This paper presents a variation of minimax lookahead search, together with an analog of alpha-beta pruning that significantly improves the efficiency of the algorithm, and a new algorithm, called Real-Time-A*, for interleaving planning and execution; it proves that the algorithm makes locally optimal decisions and is guaranteed to find a solution.

989 citations

Journal Article

Planning as search: a quantitative approach

Richard E. Korf¹

University of California, Los Angeles¹

01 Sep 1987-Artificial Intelligence

TL;DR: Planning is presented as problem-solving search using subgoals, macro-operators, and abstraction as knowledge sources; an analysis of abstraction concludes that abstraction hierarchies can reduce exponential problems to linear complexity.

424 citations

Journal Article

Macro-operators: a weak method for learning

Richard E. Korf¹

Columbia University¹

01 Apr 1985-Artificial Intelligence

TL;DR: The macro technique is a new kind of weak method, a method for learning as opposed to problem solving, and introduces a new type of problem structure called operator decomposability.

327 citations

Journal Article

Linear-space best-first search

Richard E. Korf¹

University of California, Los Angeles¹

01 Jul 1993-Artificial Intelligence

TL;DR: This work presents a linear-space best-first search algorithm (RBFS) that always explores new nodes in best-first order, regardless of the cost function, and expands fewer nodes than iterative deepening with a nondecreasing cost function.

326 citations

Cited by

Book

Reinforcement Learning: An Introduction

Richard S. Sutton¹, Andrew G. Barto

Massachusetts Institute of Technology¹

01 Jan 1988

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

Abstract: Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability. The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.

37,989 citations

Journal Article

Phd by thesis

Richard Lathe¹

French Institute of Health and Medical Research¹

01 Apr 1988-Nature

TL;DR: In this paper, a sedimentological core and petrographic characterisation of samples from eleven boreholes from the Lower Carboniferous of Bowland Basin (Northwest England) is presented.

Abstract: Deposits of clastic carbonate-dominated (calciclastic) sedimentary slope systems in the rock record have been identified mostly as linearly-consistent carbonate apron deposits, even though most ancient clastic carbonate slope deposits fit the submarine fan systems better. Calciclastic submarine fans are consequently rarely described and are poorly understood. Subsequently, very little is known, especially about mud-dominated calciclastic submarine fan systems. Presented in this study are a sedimentological core and petrographic characterisation of samples from eleven boreholes from the Lower Carboniferous of Bowland Basin (Northwest England) that reveals a >250 m thick calciturbidite complex deposited in a calciclastic submarine fan setting. Seven facies are recognised from core and thin section characterisation and are grouped into three carbonate turbidite sequences. They include: 1) Calciturbidites, comprising mostly high- to low-density, wavy-laminated bioclast-rich facies; 2) low-density densite mudstones which are characterised by planar laminated and unlaminated mud-dominated facies; and 3) Calcidebrites which are muddy or hyper-concentrated debris-flow deposits occurring as poorly-sorted, chaotic, mud-supported floatstones. These

9,929 citations

Journal Article

Learning to Predict by the Methods of Temporal Differences

Richard S. Sutton

01 Aug 1988-Machine Learning

TL;DR: This article introduces a class of incremental learning procedures specialized for prediction – that is, for using past experience with an incompletely known system to predict its future behavior – and proves their convergence and optimality for special cases and relation to supervised-learning methods.

Abstract: This article introduces a class of incremental learning procedures specialized for prediction – that is, for using past experience with an incompletely known system to predict its future behavior. Whereas conventional prediction-learning methods assign credit by means of the difference between predicted and actual outcomes, the new methods assign credit by means of the difference between temporally successive predictions. Although such temporal-difference methods have been used in Samuel's checker player, Holland's bucket brigade, and the author's Adaptive Heuristic Critic, they have remained poorly understood. Here we prove their convergence and optimality for special cases and relate them to supervised-learning methods. For most real-world prediction problems, temporal-difference methods require less memory and less peak computation than conventional methods and they produce more accurate predictions. We argue that most problems to which supervised learning is currently applied are really prediction problems of the sort to which temporal-difference methods can be applied to advantage.

4,803 citations
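
The core idea in this entry, assigning credit by the difference between temporally successive predictions, can be illustrated with a short sketch. It is a simplified tabular one-step variant, not the general procedure analysed in the article. The function names, the five-state random walk, the step size `alpha`, and the initial values of 0.5 are all assumptions chosen for illustration.

```python
import random

def td0_update(values, state, reward, next_state, alpha=0.1, gamma=1.0):
    """Shift values[state] toward reward + gamma * values[next_state].

    States missing from `values` are treated as terminal with value 0.
    Returns the temporal-difference error used for the update.
    """
    next_value = values.get(next_state, 0.0)
    td_error = reward + gamma * next_value - values[state]
    values[state] += alpha * td_error
    return td_error

def random_walk_episode(values, n_states=5, alpha=0.1, rng=random):
    """Run one episode of a symmetric random walk over states 0..n_states-1.

    Stepping off the right end yields reward 1; stepping off the left end
    yields reward 0. Both ends are terminal.
    """
    state = n_states // 2
    while 0 <= state < n_states:
        next_state = state + rng.choice((-1, 1))
        reward = 1.0 if next_state == n_states else 0.0
        td0_update(values, state, reward, next_state, alpha)
        state = next_state

# Hypothetical usage: learn predictions of the final outcome for each state.
values = {s: 0.5 for s in range(5)}
rng = random.Random(0)
for _ in range(1000):
    random_walk_episode(values, rng=rng)
```

Each update uses only the current prediction and the next one, so no episode history has to be stored. That is the memory and peak-computation advantage the abstract refers to.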

Journal Article

Learning Bayesian Networks: The Combination of Knowledge and Statistical Data

David Heckerman¹, Dan Geiger¹, David Maxwell Chickering¹

Microsoft¹

15 Sep 1995-Machine Learning

TL;DR: In this article, a Bayesian approach for learning Bayesian networks from a combination of prior knowledge and statistical data is presented, which is derived from a set of assumptions made previously as well as the assumption of likelihood equivalence, which says that data should not help to discriminate network structures that represent the same assertions of conditional independence.

Abstract: We describe a Bayesian approach for learning Bayesian networks from a combination of prior knowledge and statistical data. First and foremost, we develop a methodology for assessing informative priors needed for learning. Our approach is derived from a set of assumptions made previously as well as the assumption of likelihood equivalence, which says that data should not help to discriminate network structures that represent the same assertions of conditional independence. We show that likelihood equivalence when combined with previously made assumptions implies that the user's priors for network parameters can be encoded in a single Bayesian network for the next case to be seen (a prior network) and a single measure of confidence for that network. Second, using these priors, we show how to compute the relative posterior probabilities of network structures given data. Third, we describe search methods for identifying network structures with high posterior probabilities. We describe polynomial algorithms for finding the highest-scoring network structures in the special case where every node has at most k = 1 parent. For the general case (k > 1), which is NP-hard, we review heuristic search algorithms including local search, iterative local search, and simulated annealing. Finally, we describe a methodology for evaluating Bayesian-network learning algorithms, and apply this approach to a comparison of various approaches.

4,124 citations

Journal Article

Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning

Richard S. Sutton¹, Doina Precup², Satinder Singh¹

AT&T Labs¹, University of Massachusetts Amherst²

01 Aug 1999-Artificial Intelligence

TL;DR: It is shown that options enable temporally abstract knowledge and action to be included in the reinforcement learning framework in a natural and general way and may be used interchangeably with primitive actions in planning methods such as dynamic programming and in learning methods such as Q-learning.

3,233 citations
