Are They Going to Cross? A Benchmark Dataset and Baseline for Pedestrian Crosswalk Behavior

doi:10.1109/ICCVW.2017.33

Proceedings ArticleDOI

Are They Going to Cross? A Benchmark Dataset and Baseline for Pedestrian Crosswalk Behavior

Amir Rasouli, +2 more

- pp 206-213

Chats0

TLDR

A novel dataset is introduced which in addition to providing the bounding box information for pedestrian detection, also includes the behavioral and contextual annotations for the scenes, which allows combining visual and semantic information for better understanding of pedestrians' intentions in various traffic scenarios.

Abstract:

Designing autonomous vehicles suitable for urban environments remains an unresolved problem. One of the major dilemmas faced by autonomous cars is how to understand the intention of other road users and communicate with them. The existing datasets do not provide the necessary means for such higher level analysis of traffic scenes. With this in mind, we introduce a novel dataset which in addition to providing the bounding box information for pedestrian detection, also includes the behavioral and contextual annotations for the scenes. This allows combining visual and semantic information for better understanding of pedestrians' intentions in various traffic scenarios. We establish baseline approaches for analyzing the data and show that combining visual and contextual information can improve prediction of pedestrian intention at the point of crossing by at least 20%.

Citations

PDF

Open Access

More filters

Posted Content

Network Generalization Prediction for Safety Critical Tasks in Novel Operating Domains.

Molly O'Brien, +3 more

- 17 Aug 2021 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: In this paper, the authors identify the context subspace for a pretrained Faster RCNN network performing pedestrian detection on the Berkeley Deep Drive (BDD) dataset and demonstrate that the Context Subspace from the BDD dataset is informative for completely unseen datasets, such as JAAD and Cityscapes.

...read moreread less

Journal ArticleDOI

Pedestrian Crossing Intention Forecasting at Unsignalized Intersections Using Naturalistic Trajectories

Esteban Moreno, +9 more

- 01 Mar 2023 -

Sensors

TL;DR: In this paper , a model that predicts pedestrian crossing behavior at different locations around an urban intersection is proposed, which not only provides a classification label (e.g., crossing, not-crossing), but also a quantitative confidence level (i.e., probability).

...read moreread less

Journal ArticleDOI

ST CrossingPose: A Spatial-Temporal Graph Convolutional Network for Skeleton-Based Pedestrian Crossing Intention Prediction

- 01 Nov 2022 -

IEEE Transactions on Intelligent Transpo...

TL;DR: Wang et al. as mentioned in this paper proposed a spatial-temporal graph convolutional network (ST CrossingPose) to predict pedestrian crossing intention based on skeleton data, which can learn both spatial and temporal patterns from skeleton data.

...read moreread less

Journal ArticleDOI

U19-Net: a deep learning approach for obstacle detection in self-driving cars

Albert Aarón Cervera-Uribe

- 07 Apr 2022 -

Soft Computing

TL;DR: U19-Net as mentioned in this paper proposes an encoder-decoder deep model that explores the deep layers of a VGG19 model as encoder following a symmetrical approach with an U-Net decoder designed for pixel-wise classifications.

...read moreread less

Book ChapterDOI

Road Scene Risk Perception for Intelligent Vehicles Using End-to-End Affordance Learning and Visual Reasoning

Jayani Withanawasam, +4 more

TL;DR: A new perspective on scene risk perception is proposed by modeling end-to-end road scene affordance using a weakly supervised classifier and results show that the proposed model is able to correctly classify three different levels of risk.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

Proceedings ArticleDOI

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

...read moreread less

Journal ArticleDOI

Vision meets robotics: The KITTI dataset

Andreas Geiger, +3 more

- 01 Sep 2013 -

The International Journal of Robotics Re...

TL;DR: A novel dataset captured from a VW station wagon for use in mobile robotics and autonomous driving research, using a variety of sensor modalities such as high-resolution color and grayscale stereo cameras and a high-precision GPS/IMU inertial navigation system.

...read moreread less

Book ChapterDOI

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

Kaiming He, +3 more

TL;DR: This work equips the networks with another pooling strategy, “spatial pyramid pooling”, to eliminate the above requirement, and develops a new network structure, called SPP-net, which can generate a fixed-length representation regardless of image size/scale.

...read moreread less

Collapse

IEEE Transactions on Intelligent Transpo...

Will the Pedestrian Cross? A Study on Pedestrian Path Prediction

Christoph G. Keller, +1 more

- 01 Apr 2014 -

IEEE Transactions on Intelligent Transpo...

Are They Going to Cross? A Benchmark Dataset and Baseline for Pedestrian Crosswalk Behavior

Citations

Network Generalization Prediction for Safety Critical Tasks in Novel Operating Domains.

Pedestrian Crossing Intention Forecasting at Unsignalized Intersections Using Naturalistic Trajectories

ST CrossingPose: A Spatial-Temporal Graph Convolutional Network for Skeleton-Based Pedestrian Crossing Intention Prediction

U19-Net: a deep learning approach for obstacle detection in self-driving cars

Road Scene Risk Perception for Intelligent Vehicles Using End-to-End Affordance Learning and Visual Reasoning

References

ImageNet Classification with Deep Convolutional Neural Networks

ImageNet: A large-scale hierarchical image database

Histograms of oriented gradients for human detection

Vision meets robotics: The KITTI dataset

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

Related Papers (5)

Social LSTM: Human Trajectory Prediction in Crowded Spaces

Microsoft COCO: Common Objects in Context

Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields

Autonomous Vehicles That Interact With Pedestrians: A Survey of Theory and Practice

Will the Pedestrian Cross? A Study on Pedestrian Path Prediction