Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks.

Open AccessPosted Content

Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks.

- 29 Mar 2018 -

arXiv: Computer Vision and Pattern Recog...

TLDR

In this paper, a recurrent sequence-to-sequence model observes motion histories and predicts future behavior, using a novel pooling mechanism to aggregate information across people, and predicts socially plausible future by training adversarially against a recurrent discriminator, and encourage diverse predictions with a novel variety loss.

Abstract:

Understanding human motion behavior is critical for autonomous moving platforms (like self-driving cars and social robots) if they are to navigate human-centric environments. This is challenging because human motion is inherently multimodal: given a history of human motion paths, there are many socially plausible ways that people could move in the future. We tackle this problem by combining tools from sequence prediction and generative adversarial networks: a recurrent sequence-to-sequence model observes motion histories and predicts future behavior, using a novel pooling mechanism to aggregate information across people. We predict socially plausible futures by training adversarially against a recurrent discriminator, and encourage diverse predictions with a novel variety loss. Through experiments on several datasets we demonstrate that our approach outperforms prior work in terms of accuracy, variety, collision avoidance, and computational complexity.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Argoverse: 3D Tracking and Forecasting With Rich Maps

Ming-Fang Chang, +10 more

TL;DR: Argoverse includes sensor data collected by a fleet of autonomous vehicles in Pittsburgh and Miami as well as 3D tracking annotations, 300k extracted interesting vehicle trajectories, and rich semantic maps, which contain rich geometric and semantic metadata which are not currently available in any public dataset.

...read moreread less

Proceedings ArticleDOI

From Recognition to Cognition: Visual Commonsense Reasoning

Rowan Zellers, +3 more

TL;DR: To move towards cognition-level understanding, a new reasoning engine is presented, Recognition to Cognition Networks (R2C), that models the necessary layered inferences for grounding, contextualization, and reasoning.

...read moreread less

Journal ArticleDOI

Human motion trajectory prediction: a survey:

Andrey Rudenko, +6 more

- 07 Jun 2020 -

The International Journal of Robotics Re...

TL;DR: In this article, the ability of intelligent autonomous systems to perceive, understand, and anticipate human behavior becomes increasingly important in a growing number of intelligent systems in human environments, and the ability to do so is discussed.

...read moreread less

Proceedings ArticleDOI

STGAT: Modeling Spatial-Temporal Interactions for Human Trajectory Prediction

Huang Yingfan, +4 more

TL;DR: This work proposes a Spatial-Temporal Graph Attention network (STGAT), based on a sequence-to-sequence architecture to predict future trajectories of pedestrians, which achieves superior performance on two publicly available crowd datasets and produces more "socially" plausible trajectories for pedestrians.

...read moreread less

Posted Content

Human Action Recognition and Prediction: A Survey.

Yu Kong, +1 more

- 28 Jun 2018 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: The complete state-of-the-art techniques in the action recognition and prediction are surveyed, including existing models, popular algorithms, technical difficulties, popular action databases, evaluation protocols, and promising future directions are provided.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks

Han Zhang, +2 more

TL;DR: This paper proposes Stacked Generative Adversarial Networks (StackGAN) to generate 256 photo-realistic images conditioned on text descriptions and introduces a novel Conditioning Augmentation technique that encourages smoothness in the latent conditioning manifold.

...read moreread less

Proceedings Article

Conditional image synthesis with auxiliary classifier GANs

Augustus Odena, +2 more

TL;DR: A variant of GANs employing label conditioning that results in 128 x 128 resolution image samples exhibiting global coherence is constructed and it is demonstrated that high resolution samples provide class information not present in low resolution samples.

...read moreread less

Proceedings Article

Unsupervised Learning of Video Representations using LSTMs

Nitish Srivastava, +2 more

TL;DR: In this paper, an encoder LSTM is used to map an input video sequence into a fixed length representation, which is then decoded using single or multiple decoder Long Short Term Memory (LSTM) networks to perform different tasks.

...read moreread less

Proceedings Article

Towards End-To-End Speech Recognition with Recurrent Neural Networks

Alex Graves, +1 more

TL;DR: A speech recognition system that directly transcribes audio data with text, without requiring an intermediate phonetic representation is presented, based on a combination of the deep bidirectional LSTM recurrent neural network architecture and the Connectionist Temporal Classification objective function.

...read moreread less

Proceedings ArticleDOI

Hierarchical recurrent neural network for skeleton based action recognition

Yong Du, +2 more

TL;DR: This paper proposes an end-to-end hierarchical RNN for skeleton based action recognition, and demonstrates that the model achieves the state-of-the-art performance with high computational efficiency.

...read moreread less