Open Access Journal Article

Vanets Meet Autonomous Vehicles: Multimodal Surrounding Recognition Using Manifold Alignment

TL;DR
The vision to create a beneficial link between the two worlds of vehicular ad-hoc networks and autonomous vehicles is presented by designing a multimodal scheme for object detection, recognition, and mapping based on the fusion of stereo camera frames, Velodyne LIDAR point cloud scans, and vehicle-to-vehicle (V2V) basic safety message (BSM) exchanges using VANET protocols.
Abstract
In the past two years, calls for developing synergistic links between the two worlds of vehicular ad-hoc networks (VANETs) and autonomous vehicles have risen significantly, with the aim of achieving further on-road safety and benefits for end users. In this paper, we present our vision to create such a beneficial link by designing a multimodal scheme for object detection, recognition, and mapping based on the fusion of stereo camera frames, Velodyne LIDAR point cloud scans, and vehicle-to-vehicle (V2V) basic safety message (BSM) exchanges using VANET protocols. Exploiting the high similarity in the underlying manifold properties of the three data sets, and their high neighborhood correlation, the proposed scheme employs semi-supervised manifold alignment to merge the key features of each modality: rich texture descriptions of objects from 2-D images, depth and distance between objects from the 3-D point cloud, and awareness of self-declared vehicles from the 3-D information in BSMs, including vehicles not seen by the camera and LIDAR. The scheme is applied to create joint pixel-to-point-cloud and pixel-to-V2V correspondences of objects in frames from the KITTI Vision Benchmark Suite, achieving camera-LIDAR and camera-V2V mapping of the recognized objects. We present alignment accuracy results over two different driving sequences, show the additional knowledge of objects acquired from the various input modalities, and study the effect of the number of neighbors employed in the alignment process on alignment accuracy. With a proper choice of parameters, testing over the two entire driving sequences exhibits 100% alignment accuracy in the majority of cases, average alignment accuracies of 74%–92% for vehicles and 50%–72% for pedestrians, and up to 150% additional recognition of objects in the test vehicle's surroundings.
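To make the alignment step concrete, below is a minimal NumPy/SciPy sketch of semi-supervised manifold alignment in its common joint-graph-Laplacian form: build a k-nearest-neighbor graph within each modality, tie the two graphs together at the labeled correspondences, and embed both data sets with the low-order eigenvectors of the combined Laplacian. This is an illustrative reconstruction, not the authors' code; the feature sets, the parameters k, d, and mu, and the toy two-view data are all assumptions.

```python
import numpy as np
from scipy.linalg import eigh
from scipy.spatial.distance import cdist

def knn_graph(X, k):
    """Symmetric k-NN adjacency with heat-kernel weights."""
    D = cdist(X, X)
    sigma = np.median(D) + 1e-12
    W = np.zeros_like(D)
    nbrs = np.argsort(D, axis=1)[:, 1:k + 1]   # column 0 is the point itself
    for i, js in enumerate(nbrs):
        W[i, js] = np.exp(-D[i, js] ** 2 / (2 * sigma ** 2))
    return np.maximum(W, W.T)                  # symmetrize

def manifold_align(X, Y, pairs, k=8, d=2, mu=1.0):
    """Embed feature sets X (n, p) and Y (m, q) into a shared d-dim space.
    `pairs` lists known (i, j) correspondences, e.g. pixel-to-point labels;
    `mu` weights how strongly labeled pairs are pulled together (assumed)."""
    n, m = len(X), len(Y)
    W = np.zeros((n + m, n + m))
    W[:n, :n] = knn_graph(X, k)                # within-modality structure
    W[n:, n:] = knn_graph(Y, k)
    for i, j in pairs:                         # cross-modality supervision
        W[i, n + j] = W[n + j, i] = mu
    L = np.diag(W.sum(axis=1)) - W             # joint graph Laplacian
    _, vecs = eigh(L)                          # eigenvalues in ascending order
    F = vecs[:, 1:d + 1]                       # skip the trivial constant vector
    return F[:n], F[n:]

# Toy check: two noisy 2-D views of one curve, 5 labeled pairs out of 60.
rng = np.random.default_rng(0)
t = np.sort(rng.uniform(0, 1, 60))
X = np.c_[t, np.sin(2 * np.pi * t)] + 0.01 * rng.normal(size=(60, 2))
Y = np.c_[np.cos(2 * np.pi * t), t ** 2] + 0.01 * rng.normal(size=(60, 2))
FX, FY = manifold_align(X, Y, pairs=[(i, i) for i in range(0, 60, 12)])
hits = sum(int(np.argmin(np.linalg.norm(FY - fx, axis=1)) == i)
           for i, fx in enumerate(FX))
print(f"nearest-neighbor alignment accuracy: {hits / len(FX):.0%}")
```

The k in knn_graph is the neighbor count whose effect on alignment accuracy the paper studies; unmatched objects are aligned by nearest-neighbor search in the shared space, which mirrors how per-class accuracies such as those reported above could be computed.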


Citations
Journal Article

A Point Cloud-Based Robust Road Curb Detection and Tracking Method

TL;DR: A robust 3D-LiDAR-based method for road curb detection and tracking in a structured environment is proposed; it can deal with straight and curved roads without being influenced by surrounding obstacles.
Journal Article

Ground Surface Filtering of 3D Point Clouds Based on Hybrid Regression Technique

TL;DR: The results show that the proposed method performs well in most real scenarios, even in cases of ground undulation, occlusion, and sparse point clouds.
Journal Article

Parallel Vehicular Networks: A CPSS-Based Approach via Multimodal Big Data in IoV

TL;DR: The proposed PVN offers a competitive solution for achieving smooth, safe, and efficient cooperation among connected vehicles in future ITSs and is expected to deliver descriptive, predictive, and prescriptive intelligence for VNs.
Journal Article

A Progressive Review: Emerging Technologies for ADAS Driven Solutions

TL;DR: In this paper, the state of the art of ADAS and its levels of autonomy are reviewed, and a detailed description of vision intelligence and computational intelligence for ADAS is provided.
Journal Article

Benchmarking Particle Filter Algorithms for Efficient Velodyne-Based Vehicle Localization

TL;DR: This work shows that both standard SIR sampling and rejection-based optimal sampling are suitable for efficient (10 to 20 ms) real-time pose tracking, without feature detection, using raw point clouds from a 3D LiDAR.
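As background for this last entry, here is a minimal sketch of the standard SIR update that the benchmark takes as its baseline: predict particles with a noisy odometry increment, reweight them against the current scan, and resample when the effective sample size collapses. The motion-noise scales, the ESS threshold, and the Gaussian stand-in for the LIDAR scan likelihood are illustrative assumptions, not the benchmarked implementation.

```python
import numpy as np

rng = np.random.default_rng(1)

def sir_update(particles, weights, odom, scan_loglik):
    """One SIR (sampling-importance-resampling) step for (x, y, yaw) poses."""
    particles = particles + odom + rng.normal(
        scale=[0.05, 0.05, 0.01], size=particles.shape)  # assumed motion noise
    ll = scan_loglik(particles)                # per-particle log-likelihood
    weights = weights * np.exp(ll - ll.max())  # shift for numerical stability
    weights /= weights.sum()
    ess = 1.0 / np.sum(weights ** 2)           # effective sample size
    if ess < 0.5 * len(weights):               # resample on ESS collapse
        idx = rng.choice(len(weights), size=len(weights), p=weights)
        particles = particles[idx]
        weights = np.full(len(weights), 1.0 / len(weights))
    return particles, weights

# Toy usage: a Gaussian stand-in for scoring particles against a scan;
# a real system would match each particle pose to the raw point cloud.
true_pose = np.array([1.0, 2.0, 0.3])
loglik = lambda P: -0.5 * np.sum(((P - true_pose) / 0.2) ** 2, axis=1)
P = rng.normal(size=(500, 3))
w = np.full(500, 1.0 / 500)
for _ in range(30):
    P, w = sir_update(P, w, odom=np.zeros(3), scan_loglik=loglik)
print("posterior mean pose:", np.average(P, axis=0, weights=w))
```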
References
Proceedings Article

You Only Look Once: Unified, Real-Time Object Detection

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.
Journal Article

Vision meets robotics: The KITTI dataset

TL;DR: A novel dataset captured from a VW station wagon for use in mobile robotics and autonomous driving research, using a variety of sensor modalities such as high-resolution color and grayscale stereo cameras and a high-precision GPS/IMU inertial navigation system.
Proceedings Article

Object scene flow for autonomous vehicles

TL;DR: A novel model and dataset for 3D scene flow estimation with an application to autonomous driving by representing each element in the scene by its rigid motion parameters and each superpixel by a 3D plane as well as an index to the corresponding object.
Book Chapter

A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection

TL;DR: A unified deep neural network, denoted the multi-scale CNN (MS-CNN), is proposed for fast multi-scale object detection, which is learned end-to-end by optimizing a multi-task loss.
Proceedings Article

Decomposing a scene into geometric and semantically consistent regions

TL;DR: A region-based model which combines appearance and scene geometry to automatically decompose a scene into semantically meaningful regions and which achieves state-of-the-art performance on the tasks of both multi-class image segmentation and geometric reasoning.