Are They Going to Cross? A Benchmark Dataset and Baseline for Pedestrian Crosswalk Behavior
Citations
391 citations
295 citations
Cites background or methods from "Are They Going to Cross? A Benchmar..."
...of 80% [142], 62% [14] for the probability of crossing, and...
[...]
...[14] use various contextual information such as characteristics of the road, the presence of traffic sig-...
[...]
236 citations
185 citations
Cites background from "Are They Going to Cross? A Benchmar..."
...A recently proposed dataset, JAAD [27], contains a large number of pedestrian samples with temporal correspondence, a subset of which are annotated with behavior information....
[...]
...The performance of all models is generally poorer on the JAAD dataset which can be partially attributed to the smaller number of samples, scales and shorter tracks all of which reduce the diversity of the dataset....
[...]
...Action (or behavior) prediction algorithms may take different forms such as generating future frames [20, 19, 24, 6], predicting the type of action [15, 21, 7], measuring confidence in the occurrence of an event [27, 37, 10], and forecasting the motion of objects [25, 40, 43, 1, 17, 5, 8]....
[...]
...Table 1 summarizes the properties of PIE and JAAD datasets....
[...]
...JAAD has bounding box annotations for all pedestrians, which makes it suitable for detection and tracking applications....
[...]
133 citations
Additional excerpts
...Another available dataset for crosswalk behavior classification [76] pro-...
[...]
References
73,978 citations
49,639 citations
"Are They Going to Cross? A Benchmar..." refers methods in this paper
...For this purpose we use pre-trained AlexNet on two large image datasets, ImageNet [5] and places, and both datasets combined [44]....
[...]
...In each case we train a randomly initalized AlexNet end-to-end on cropped images of pedestrians from our dataset (with minor occlusions up to 25% allowed) and then try transfer learning by fine-tuning an AlexNet pre-trained on ImageNet [27]....
[...]
31,952 citations
7,153 citations
"Are They Going to Cross? A Benchmar..." refers background or methods in this paper
...5k x Caltech[8] 347k 250k x x x KITTI [13] 12k 80k x x MPD [16] 86....
[...]
...Compared to existing large-scale datasets such as KITTI [13] and Caltech pedestrian dataset [8], in addition to ground truth for all pedestrians in the scene and occlusion information, our dataset contains behavioral tags describing actions of pedestrians intending to cross....
[...]
...There are a number of large-scale datasets publicly available that can be potentially used for pedestrian behavior understanding [13, 8, 10]....
[...]
...Few exceptions, such as KITTI [13], also provide optical flow and stereo information for mapping and localization....
[...]
3,945 citations
"Are They Going to Cross? A Benchmar..." refers methods in this paper
...Fine-tuning the FCN and SPP models is similar....
[...]
...The first is the Spatial Pyramid Pooling (SPP) [15] technique which allows the maxpooling of the features from the last convolutional layer (conv5) at different scales....
[...]
...Overall, the performance of the SPP models was even inferior comparing to those of single scale models (with exception of stop sign detection)....
[...]
...Such a multi-scale detection performance, however, was not achieved using the SPP models....
[...]
...It should also be noted that in the SPP models the fc6 layers were learned from scratch due to the change in the dimensionality of their inputs....
[...]