Design and Analysis of State-Feedback Optimal Strategies for the Differential Game of Active Defense

doi:10.1109/TAC.2018.2828088

Citations


Journal Article

Protect Your Sky: A Survey of Counter Unmanned Aerial Vehicle Systems

Honggu Kang¹, Jingon Joung², Jin Young Kim³, Joonhyuk Kang¹, Yong Soo Cho²

KAIST¹, Chung-Ang University², Korea University³

11 Sep 2020 - IEEE Access

TL;DR: A broad understanding gained from the survey overall will assist with the design of a holistic CUS and inspire cross-domain research across physical layer designs in wireless communications, CUS network designs, control theory, mechanics, and computer science, to enhance counter UAV techniques further.

Abstract: Recognizing the various and broad range of applications of unmanned aerial vehicles (UAVs) and unmanned aircraft systems (UAS) for personal, public and military applications, recent unintentional malfunctions of uncontrollable UAVs or intentional attacks on them divert our attention and motivate us to devise a protection system, referred to as a counter UAV system (CUS). The CUS, also known as a counter-drone system, protects personal, commercial, public, and military facilities and areas from uncontrollable and belligerent UAVs by neutralizing or destroying them. This paper provides a comprehensive survey of the CUS to describe the key technologies of the CUS and provide sufficient information with which to comprehend this system. The first part starts with an introduction of general UAVs and the concept of the CUS. In the second part, we provide an extensive survey of the CUS through a top-down approach: i) the platform of CUS including ground and sky platforms and related networks; ii) the architecture of the CUS consisting of sensing systems, command-and-control (C2) systems, and mitigation systems; and iii) the devices and functions with the sensors for detection-and-identification and localization-and-tracking actions and mitigators for neutralization. The last part is devoted to a survey of the CUS market with relevant challenges and future visions. From the CUS market survey, potential readers can identify the major players in a CUS industry and obtain information with which to develop the CUS industry. A broad understanding gained from the survey overall will assist with the design of a holistic CUS and inspire cross-domain research across physical layer designs in wireless communications, CUS network designs, control theory, mechanics, and computer science, to enhance counter UAV techniques further.

63 citations

Proceedings Article

Bridging Hamilton-Jacobi Safety Analysis and Reinforcement Learning

Jaime F. Fisac¹, Neil F. Lugovoy¹, Vicenc Rubies-Royo¹, Shromona Ghosh¹, Claire J. Tomlin¹

University of California, Berkeley¹

20 May 2019

TL;DR: This work takes a time-discounted modification of the problem of maximizing the minimum payoff over time, which is central to safety analysis, and shows how the contraction mapping induced by the modified dynamic programming equation can render reinforcement learning techniques amenable to quantitative safety analysis as tools to approximate the safe set and optimal safety policy.

Abstract: Safety analysis is a necessary component in the design and deployment of autonomous robotic systems. Techniques from robust optimal control theory, such as Hamilton-Jacobi reachability analysis, allow a rigorous formalization of safety as guaranteed constraint satisfaction. Unfortunately, the computational complexity of these tools for general dynamical systems scales poorly with state dimension, making existing tools impractical beyond small problems. Modern reinforcement learning methods have shown promising ability to find approximate yet proficient solutions to optimal control problems in complex and high-dimensional systems; however, their application has in practice been restricted to problems with an additive payoff over time, unsuitable for reasoning about safety. In recent work, we introduced a time-discounted modification of the problem of maximizing the minimum payoff over time, central to safety analysis, through a modified dynamic programming equation that induces a contraction mapping. Here, we show how a similar contraction mapping can render reinforcement learning techniques amenable to quantitative safety analysis as tools to approximate the safe set and optimal safety policy. This opens a new avenue of research connecting control-theoretic safety analysis and the reinforcement learning domain. We validate the correctness of our formulation by comparing safety results computed through Q-learning to analytic and numerical solutions, and demonstrate its scalability by learning safe sets and control policies for simulated systems of up to 18 state dimensions using value learning and policy gradient techniques.

62 citations

Cites background from "Design and Analysis of State-Feedba..."

...While analytic solutions exist in rare instances [6, 7], and efficient decompositions are occasionally possible [8], computing safety-ensuring controllers is intractable for many systems of interest....

Journal Article

Multiple Pursuer Multiple Evader Differential Games

Eloy Garcia¹, David W. Casbeer¹, Alexander Von Moll¹, Meir Pachter²

Wright-Patterson Air Force Base¹, Air Force Institute of Technology²

01 May 2021 - IEEE Transactions on Automatic Control

TL;DR: This article studies an N-pursuer versus M-evader team conflict and provides a foundation for formally analyzing complex, high-dimensional conflicts between teams by means of differential game theory; the players' optimal strategies require the codesign of cooperative optimal assignments and optimal guidance laws.

Abstract: In this article, an $N$-pursuer versus $M$-evader team conflict is studied. This article extends classical differential game theory to simultaneously address weapon assignments and multiplayer pursuit-evasion scenarios. Saddle-point strategies that provide guaranteed performance for each team regardless of the actual strategies implemented by the opponent are devised. The players’ optimal strategies require the codesign of cooperative optimal assignments and optimal guidance laws. A representative measure of performance is employed and the Value function of the attendant game is obtained. It is shown that the Value function is continuously differentiable and that it satisfies the Hamilton–Jacobi–Isaacs equation; the curse of dimensionality is overcome and the optimal strategies are obtained. The cases of $N=M$ and $N>M$ are considered. In the latter case, cooperative guidance strategies are also developed in order for the pursuers to exploit their numerical advantage. This article provides a foundation to formally analyze complex and high-dimensional conflicts between teams of $N$ pursuers and $M$ evaders by means of differential game theory.

48 citations

Journal Article

Feedback Strategies for a Reach-Avoid Game With a Single Evader and Multiple Pursuers

Jhanani Selvakumar¹, Efstathios Bakolas¹

University of Texas at Austin¹

15 Jan 2021 - IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: This work considers the problem of steering a single evader to a target location, while avoiding capture by multiple pursuers, and proposes a feasible control strategy for the evader, against a group of pursuers that adopts a semi-cooperative strategy.

Abstract: We address a planar multiagent pursuit–evasion game with a terminal constraint (reach-avoid game). Specifically, we consider the problem of steering a single evader to a target location, while avoiding capture by multiple pursuers. We propose a feasible control strategy for the evader, against a group of pursuers that adopts a semi-cooperative strategy. First, we characterize a partition of the game’s state space that allows us to determine the existence of a solution to the game based on the initial conditions of the players. Next, based on the time-derivative of an appropriately defined risk metric, we develop a nonlinear state feedback strategy for the evader which provides a feasible solution to the game. This control strategy involves switching between different control laws in different parts of the state space. We demonstrate the efficacy of our proposed feedback control in terms of the evader’s performance, through numerical simulations. We also show that for the special case of the reach-avoid game with only one pursuer, the proposed control law is successful in guiding the evader to the target location from almost all initial conditions, and ensures that the evader will remain uncaptured.

34 citations

Cites background from "Design and Analysis of State-Feedba..."

...This class of games is a preferred tool to model situations where a team of agents must reach a target location while avoiding obstacles or defending a target from an offensive team of agents [16]–[20]....

Proceedings Article

An Introduction to Pursuit-evasion Differential Games.

Isaac E. Weintraub, Meir Pachter, Eloy Garcia

01 Jul 2020

TL;DR: In this article, the authors present an organized introduction of pursuit-evasion differential games with an overview of recent advances in the area and present two representative pursuit evasion differential games: the two-cutters and fugitive ship differential game and the active target defense differential game.

Abstract: Pursuit and evasion conflicts represent challenging problems with important applications in aerospace and robotics. In pursuit-evasion problems, synthesis of intelligent actions must consider the adversary's potential strategies. Differential game theory provides an adequate framework to analyze possible outcomes of the conflict without assuming particular behaviors by the opponent. This article presents an organized introduction of pursuit-evasion differential games with an overview of recent advances in the area. First, a summary of the seminal work is outlined, highlighting important contributions. Next, more recent results are described by employing a classification based on the number of players: one-pursuer-one-evader, N-pursuers-one-evader, one-pursuer-M-evaders, and N-pursuer-M-evader games. In each scenario, a brief summary of the literature is presented. Finally, two representative pursuit-evasion differential games are studied in detail: the two-cutters and fugitive ship differential game and the active target defense differential game. These problems provide two important applications and, more importantly, they give great insight into the realization of cooperation between friendly agents in order to form a team and defeat the adversary.

34 citations
