Improving Heuristics On-the-fly for Effective Search in Plan Space

doi:10.1007/978-3-319-24489-1_27

Citations

PDF

Open Access

More filters

Posted Content•

Learning and Tuning Meta-heuristics in Plan Space Planning

[...]

Shashank Shekhar, Deepak Khemani

27 Jan 2016-arXiv: Artificial Intelligence

TL;DR: An online error minimization approach in POCL framework is discussed to minimize the step-error associated with the offline learned models thus enhancing their informativeness and scale up the performance of the planner over standard benchmarks, specially for larger problems.

...read moreread less

Abstract: In recent years, the planning community has observed that techniques for learning heuristic functions have yielded improvements in performance. One approach is to use offline learning to learn predictive models from existing heuristics in a domain dependent manner. These learned models are deployed as new heuristic functions. The learned models can in turn be tuned online using a domain independent error correction approach to further enhance their informativeness. The online tuning approach is domain independent but instance specific, and contributes to improved performance for individual instances as planning proceeds. Consequently it is more effective in larger problems. In this paper, we mention two approaches applicable in Partial Order Causal Link (POCL) Planning that is also known as Plan Space Planning. First, we endeavor to enhance the performance of a POCL planner by giving an algorithm for supervised learning. Second, we then discuss an online error minimization approach in POCL framework to minimize the step-error associated with the offline learned models thus enhancing their informativeness. Our evaluation shows that the learning approaches scale up the performance of the planner over standard benchmarks, specially for larger problems.

...read moreread less

Cites methods from "Improving Heuristics On-the-fly for..."

...The online heuristic adjustment approach is inspired from a recent work presented as technical communication (Shekhar and Khemani 2015) tested in a few planning domains....
[...]

References

PDF

Open Access

More filters

Proceedings Article•

Learning from multiple heuristics

[...]

Mehdi Samadi¹, Ariel Felner², Jonathan Schaeffer¹•Institutions (2)

University of Alberta¹, Ben-Gurion University of the Negev²

13 Jul 2008

TL;DR: A new technique is introduced for combining multiple heuristics in a single heuristic application, inspired by the evaluation functions used in two-player games, and can lead to a large reduction in the search effort at a small cost in the quality of the solution obtained.

...read moreread less

Abstract: Heuristic functions for single-agent search applications estimate the cost of the optimal solution. When multiple heuristics exist, taking their maximum is an effective way to combine them. A new technique is introduced for combining multiple heuristic values. Inspired by the evaluation functions used in two-player games, the different heuristics in a single-agent application are treated as features of the problem domain. An ANN is used to combine these features into a single heuristic value. This idea has been implemented for the sliding-tile puzzle and the 4-peg Towers of Hanoi, two classic single-agent search domains. Experimental results show that this technique can lead to a large reduction in the search effort at a small cost in the quality of the solution obtained.

...read moreread less

55 citations

Proceedings Article•

Extracting Effective and Admissible State Space Heuristics from the Planning Graph

[...]

XuanLong Nguyen¹, Subbarao Kambhampati•Institutions (1)

Arizona State University¹

30 Jul 2000

TL;DR: The planning graph structure that Graphplan builds in polynomial time, provides a rich substrate for deriving more effective heuristics for state space planners, and the mutex information in the planning graph captures exactly this interaction information.

...read moreread less

Abstract: Graphplan and heuristic state space planners such as HSP-R and UNPOP are currently two of the most effective approaches for solving classical planning problems. These approaches have hither-to been seen as largely orthogonal. In this paper, we show that the planning graph structure that Graphplan builds in polynomial time, provides a rich substrate for deriving more effective heuristics for state space planners. Specifically, we show that the heuristics used by planners such as HSP-R and UNPOP do badly in several domains due to their failure to consider the interactions between subgoals, and that the mutex information in the planning graph captures exactly this interaction information. We develop several families of heuristics, some aimed at search speed and others at optimality of solutions. Our empirical studies show that our heuristics significantly out-perform the existing state space heuristics.

...read moreread less

45 citations

Proceedings Article•

On the role of ground actions in refinement planning

[...]

Håkan L. S. Younes¹, Reid Simmons¹•Institutions (1)

Carnegie Mellon University¹

23 Apr 2002

TL;DR: This paper addresses the role of ground actions in refinement planning, and presents empirical results indicating that their role is twofold: first, planning with ground actions represents a bias towards early commitment of parameter bindings, and second, ground actions help enforce joint parameter domain constraints.

...read moreread less

Abstract: Less than a decade ago, the focus in refinement planning was on partial order planners using lifted actions. Today, the currently most successful refinement planners are all state space planners using ground actions—i.e. actions where all parameters have been substituted by objects. In this paper, we address the role of ground actions in refinement planning, and present empirical results indicating that their role is twofold. First, planning with ground actions represents a bias towards early commitment of parameter bindings. Second, ground actions help enforce joint parameter domain constraints. By implementing these two techniques in a least commitment planner such as UCPOP, together with using an informed heuristic function to guide the search for solutions, we show that we often need to generate far fewer plans than when planning with ground action, while the number of explored plans remains about the same. In some cases a vast reduction can also be achieved in the number of explored plans.

...read moreread less

38 citations

Proceedings Article•DOI•

Accelerating partial order planners by improving plan and goal choices

[...]

Lenhart K. Schubert¹, A. Gerevini¹•Institutions (1)

University of Rochester¹

01 Apr 1995

TL;DR: Improvements to plan refinement strategies for well-founded partial order planning are described that promise to bring this style of planning closer to practicality and propose an A* heuristic that counts only steps and open conditions, while ignoring "unsafe conditions".

...read moreread less

Abstract: Describes some simple domain-independent improvements to plan refinement strategies for well-founded partial order planning that promise to bring this style of planning closer to practicality. One suggestion concerns the strategy for selecting plans for refinement among the current (incomplete) candidate plans. We propose an A* heuristic that counts only steps and open conditions, while ignoring "unsafe conditions" (threats). A second suggestion concerns the strategy for selecting open conditions (goals) to be established next in a selected incomplete plan. We propose a variant of a strategy suggested by Peot and Smith (1993) and studied by Joslin and Pollack (1994); the variant gives top priority to unmatchable open conditions (enabling the elimination of the plan), second-highest priority to goals that can only be achieved uniquely and otherwise uses LIFO (last-in, first-out) prioritization. The preference for uniquely achievable goals is a "zero-commitment" strategy in the sense that the corresponding plan refinements are a matter of deductive certainty, involving no guesswork. In experiments based on modifications of UCPOP (Unsafe Conditions Partial Order Planner), we have obtained improvements by factors ranging from 5 to more than 600 for a variety of problems that are nontrivial for the unmodified version. Crucially, the hardest problems give the greatest improvements.

...read moreread less

27 citations

Journal Article•DOI•

Hybrid Planning and Scheduling

[...]

Bernd Schattenberg

01 Feb 2016-Künstliche Intelligenz

TL;DR: ‘‘Hybrid planning’’ refers to a distinct paradigm, which integrates building plans based on the individual action’s causal structure with obtaining plans from iteratively implementing abstract actions by pre-defined partial solutions.

...read moreread less

Abstract: Planning and scheduling (P&S) constitute fundamental cognitive capabilities for systems to reason about plans and their causal structure. They are essential for producing a goal-oriented system behavior and supporting a user’s decision making. While planning is the method of creating courses of action that achieve goals or perform tasks, scheduling assigns consistent allocations of time and resources to activities. ‘‘Hybrid planning’’ refers to a distinct paradigm, which integrates building plans based on the individual action’s causal structure with obtaining plans from iteratively implementing abstract actions by pre-defined partial solutions. Combined, hierarchical model aspects represent regular solutions provided by domain experts, while causality-based techniques complete underspecified procedures and address exceptional cases. Applications of the technology range from planning out disaster relief missions [4] to assessing cyber-security measures [7].

...read moreread less

22 citations

Improving Heuristics On-the-fly for Effective Search in Plan Space

Citations

Cites methods from "Improving Heuristics On-the-fly for..."

References

Related Papers (5)