Multimodal Fusion Object Detection System for Autonomous Vehicles
Author(s) -
Michael J. Person,
Matthew Jensen,
Anthony O. Smith,
Héctor Gutiérrez
Publication year - 2019
Publication title -
journal of dynamic systems measurement and control
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.528
H-Index - 89
eISSN - 1528-9028
pISSN - 0022-0434
DOI - 10.1115/1.4043222
Subject(s) - computer science , lidar , object detection , artificial intelligence , computer vision , convolutional neural network , point cloud , sensor fusion , pattern recognition (psychology) , remote sensing , geography
In order for autonomous vehicles to safely navigate the road ways, accurate object detection must take place before safe path planning can occur. Currently, general purpose object detection convolutional neural network (CNN) models have the highest detection accuracies of any method. However, there is a gap in the proposed detection frameworks. Specifically, those that provide high detection accuracy necessary for deployment but do not perform inference in realtime, and those that perform inference in realtime but detection accuracy is low. We propose multimodel fusion detection system (MFDS), a sensor fusion system that combines the speed of a fast image detection CNN model along with the accuracy of light detection and range (LiDAR) point cloud data through a decision tree approach. The primary objective is to bridge the tradeoff between performance and accuracy. The motivation for MFDS is to reduce the computational complexity associated with using a CNN model to extract features from an image. To improve efficiency, MFDS extracts complimentary features from the LiDAR point cloud in order to obtain comparable detection accuracy. MFDS is novel by not only using the image detections to aid three-dimensional (3D) LiDAR detection but also using the LiDAR data to jointly bolster the image detections and provide 3D detections. MFDS achieves 3.7% higher accuracy than the base CNN detection model and is able to operate at 10 Hz. Additionally, the memory requirement for MFDS is small enough to fit on the Nvidia Tx1 when deployed on an embedded device.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom