Open Access, Posted Content

On a Formal Model of Safe and Scalable Self-driving Cars

TLDR
The paper proposes a white-box, interpretable, mathematical model for safety assurance, which the authors call Responsibility-Sensitive Safety (RSS), and describes a design of a system that adheres to these safety assurance requirements and is scalable to millions of cars.
Abstract
In recent years, car makers and tech companies have been racing towards self-driving cars. It seems that the main parameter in this race is who will have the first car on the road. The goal of this paper is to add to the equation two additional crucial parameters. The first is standardization of safety assurance: what are the minimal requirements that every self-driving car must satisfy, and how can we verify these requirements? The second parameter is scalability: engineering solutions that lead to unleashed costs will not scale to millions of cars, which will push interest in this field into a niche academic corner and drive the entire field into a "winter of autonomous driving". In the first part of the paper we propose a white-box, interpretable, mathematical model for safety assurance, which we call Responsibility-Sensitive Safety (RSS). In the second part we describe a design of a system that adheres to our safety assurance requirements and is scalable to millions of cars.


Citations
Journal Article

Survey on Scenario-Based Safety Assessment of Automated Vehicles

TL;DR: A novel taxonomy for the scenario-based approach to safety assessment is developed, and all literature sources are classified, so that the existing methods can be compared with each other and, as one conclusion, the alternative concept of formal verification can be combined with the scenario-based approach.
Proceedings Article

Simulation-based Adversarial Test Generation for Autonomous Vehicles with Machine Learning Components

TL;DR: This work presents a testing framework that is compatible with test case generation and automatic falsification methods, which are used to evaluate cyber-physical systems and can be used to increase the reliability of autonomous driving systems.
Journal Article

Modeling Vehicle Interactions via Modified LSTM Models for Trajectory Prediction

TL;DR: A spatio-temporal LSTM-based trajectory prediction model (ST-LSTM) which includes two modifications: it embeds spatial interactions into LSTM models to implicitly measure the interactions between neighboring vehicles, and it introduces shortcut connections between the inputs and the outputs of two consecutive LSTM layers to handle gradient vanishment.
Proceedings Article

Probabilistic Prediction of Vehicle Semantic Intention and Motion

TL;DR: A Semantic-based Intention and Motion Prediction (SIMP) method, which can be adapted to any driving scenario by using semantically defined vehicle behaviors, and which utilizes a probabilistic framework based on a deep neural network to estimate the intentions, final locations, and corresponding time information for surrounding vehicles.
Posted Content

Scalable agent alignment via reward modeling: a research direction.

TL;DR: This work outlines a high-level research direction to solve the agent alignment problem centered around reward modeling: learning a reward function from interaction with the user and optimizing the learned reward function with reinforcement learning.
References
Journal Article

Human-level control through deep reinforcement learning

TL;DR: This work bridges the divide between high-dimensional sensory inputs and actions, resulting in the first artificial agent that is capable of learning to excel at a diverse array of challenging tasks.
Proceedings Article

A theory of the learnable

TL;DR: This paper regards learning as the phenomenon of knowledge acquisition in the absence of explicit programming, and gives a precise methodology for studying this phenomenon from a computational viewpoint.
Journal Article

Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning

TL;DR: It is shown that options enable temporally abstract knowledge and action to be included in the reinforcement learning framework in a natural and general way and may be used interchangeably with primitive actions in planning methods such as dynamic programming and in learning methods such as Q-learning.
Posted Content

Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving

TL;DR: This paper applies deep reinforcement learning to the problem of forming long term driving strategies and shows how policy gradient iterations can be used without Markovian assumptions, and decomposes the problem into a composition of a Policy for Desires and trajectory planning with hard constraints.