Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks.

Open AccessPosted Content

Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks.

- 29 Mar 2018 -

arXiv: Computer Vision and Pattern Recog...

TLDR

In this paper, a recurrent sequence-to-sequence model observes motion histories and predicts future behavior, using a novel pooling mechanism to aggregate information across people, and predicts socially plausible future by training adversarially against a recurrent discriminator, and encourage diverse predictions with a novel variety loss.

Abstract:

Understanding human motion behavior is critical for autonomous moving platforms (like self-driving cars and social robots) if they are to navigate human-centric environments. This is challenging because human motion is inherently multimodal: given a history of human motion paths, there are many socially plausible ways that people could move in the future. We tackle this problem by combining tools from sequence prediction and generative adversarial networks: a recurrent sequence-to-sequence model observes motion histories and predicts future behavior, using a novel pooling mechanism to aggregate information across people. We predict socially plausible futures by training adversarially against a recurrent discriminator, and encourage diverse predictions with a novel variety loss. Through experiments on several datasets we demonstrate that our approach outperforms prior work in terms of accuracy, variety, collision avoidance, and computational complexity.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Argoverse: 3D Tracking and Forecasting With Rich Maps

Ming-Fang Chang, +10 more

TL;DR: Argoverse includes sensor data collected by a fleet of autonomous vehicles in Pittsburgh and Miami as well as 3D tracking annotations, 300k extracted interesting vehicle trajectories, and rich semantic maps, which contain rich geometric and semantic metadata which are not currently available in any public dataset.

...read moreread less

Proceedings ArticleDOI

From Recognition to Cognition: Visual Commonsense Reasoning

Rowan Zellers, +3 more

TL;DR: To move towards cognition-level understanding, a new reasoning engine is presented, Recognition to Cognition Networks (R2C), that models the necessary layered inferences for grounding, contextualization, and reasoning.

...read moreread less

Journal ArticleDOI

Human motion trajectory prediction: a survey:

Andrey Rudenko, +6 more

- 07 Jun 2020 -

The International Journal of Robotics Re...

TL;DR: In this article, the ability of intelligent autonomous systems to perceive, understand, and anticipate human behavior becomes increasingly important in a growing number of intelligent systems in human environments, and the ability to do so is discussed.

...read moreread less

Proceedings ArticleDOI

STGAT: Modeling Spatial-Temporal Interactions for Human Trajectory Prediction

Huang Yingfan, +4 more

TL;DR: This work proposes a Spatial-Temporal Graph Attention network (STGAT), based on a sequence-to-sequence architecture to predict future trajectories of pedestrians, which achieves superior performance on two publicly available crowd datasets and produces more "socially" plausible trajectories for pedestrians.

...read moreread less

Posted Content

Human Action Recognition and Prediction: A Survey.

Yu Kong, +1 more

- 28 Jun 2018 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: The complete state-of-the-art techniques in the action recognition and prediction are surveyed, including existing models, popular algorithms, technical difficulties, popular action databases, evaluation protocols, and promising future directions are provided.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Posted Content

A Point Set Generation Network for 3D Object Reconstruction from a Single Image

Haoqiang Fan, +2 more

- 02 Dec 2016 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: In this article, the authors address the problem of 3D reconstruction from a single image, generating a straight-forward form of output -point cloud coordinates. But the groundtruth shape for an input image may be ambiguous, and they design architecture, loss function and learning paradigm that are novel and effective.

...read moreread less

Posted Content

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

Charles R. Qi, +3 more

- 02 Dec 2016 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: PointNet as discussed by the authors proposes a novel type of neural network that directly consumes point clouds and well respects the permutation invariance of points in the input, which provides a unified architecture for applications ranging from object classification, part segmentation, to scene semantic parsing.

...read moreread less

Journal ArticleDOI

Gaussian Process Dynamical Models for Human Motion

Jack M. Wang, +2 more

TL;DR: This work marginalize out the model parameters in closed form by using Gaussian process priors for both the dynamical and the observation mappings, which results in a nonparametric model for dynamical systems that accounts for uncertainty in the model.

...read moreread less

Journal ArticleDOI

Continuum crowds

Adrien Treuille, +2 more

TL;DR: In this model, a dynamic potential field simultaneously integrates global navigation with moving obstacles such as other people, efficiently solving for the motion of large crowds without the need for explicit collision avoidance.

...read moreread less

Posted Content

A Recurrent Latent Variable Model for Sequential Data

Junyoung Chung, +5 more

- 07 Jun 2015 -

arXiv: Learning

TL;DR: In this article, the authors explore the use of latent random variables into the dynamic hidden state of a recurrent neural network (RNN) by combining elements of the variational autoencoder.

...read moreread less