Benjamin Caine

Posted Content

Scalability in Perception for Autonomous Driving: Waymo Open Dataset

- 10 Dec 2019 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work introduces a new large scale, high quality, diverse dataset, consisting of well synchronized and calibrated high quality LiDAR and camera data captured across a range of urban and suburban geographies, and studies the effects of dataset size and generalization across geographies on 3D detection methods.

...read moreread less

Proceedings ArticleDOI

Scalability in Perception for Autonomous Driving: Waymo Open Dataset

Pei Sun, +22 more

TL;DR: In this paper, a large scale, high quality, and diverse dataset for self-driving data is presented, consisting of LiDAR and camera data captured across a range of urban and suburban geographies.

...read moreread less

Posted Content

StarNet: Targeted Computation for Object Detection in Point Clouds.

Jiquan Ngiam, +12 more

- 29 Aug 2019 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents an object detection system called StarNet designed specifically to take advantage of the sparse and 3D nature of point cloud data, and shows how this design leads to competitive or superior performance on the large Waymo Open Dataset and the KITTI detection dataset, as compared to convolutional baselines.

...read moreread less

Proceedings ArticleDOI

DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection

Yingwei Li, +12 more

TL;DR: This paper proposes two novel techniques: InverseAug that inverses geometric-related augmentations, e.g., rotation, to enable accurate geometric alignment between lidar points and image pixels, and LearnableAlign that leverages cross-attention to dynamically capture the correlations between image and lidar features during fusion.

...read moreread less

Posted Content

Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset

Scott Ettinger, +17 more

- 20 Apr 2021 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: In this article, the authors introduce a large-scale interactive motion dataset with over 100,000 scenes, each 20 seconds long at 10 Hz, collected by mining for interesting interactions between vehicles, pedestrians, and cyclists across six cities within the United States.

...read moreread less

Papers

Scalability in Perception for Autonomous Driving: Waymo Open Dataset

Scalability in Perception for Autonomous Driving: Waymo Open Dataset

StarNet: Targeted Computation for Object Detection in Point Clouds.

DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection

Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset