Multi-modality fusion : registering photographs, videos, and LIDAR range scans | Zendy

Brittany Morago | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Multi-modality fusion : registering photographs, videos, and LIDAR range scans

Author(s) -

Brittany Morago

Publication year - 2016

Publication title -

mospace institutional repository (university of missouri)

Language(s) - English

Resource type - Dissertations/theses

DOI - 10.32469/10355/56548

Subject(s) - computer science , artificial intelligence , computer vision , fuse (electrical) , lidar , modality (human–computer interaction) , object (grammar) , ambiguity , variety (cybernetics) , pattern recognition (psychology) , geography , remote sensing , engineering , electrical engineering , programming language

2D images and 3D LIDAR range scans provide very different but complementing information about a single subject and, when registered, can be used for a variety of exciting applications. Video sets can be fused with a 3D model and played in a single multi-dimensional environment. Imagery with temporal changes can be visualized simultaneously, unveiling changes in architecture, foliage, and human activity. Depth information for 2D photos and videos can be computed. Real-world measurements can be provided to users through simple interactions with traditional photographs. However, fusing multi-modality data is a very challenging task given the repetition and ambiguity that often occur in man-made scenes as well as the variety of properties different renderings of the same subject can possess. Image sets collected over a period of time during which the lighting conditions and scene content may have changed, different artistic renderings, varying sensor types, focal lengths, and exposure values can all contribute to visual variations in data sets. This dissertation addresses these obstacles using the common theme of incorporating contextual information to visualize regional properties that intuitively exist in each imagery source. We combine hard features that quantify the strong, stable edges that are often present in imagery along object boundaries and depth changes with soft features that capture distinctive texture information that can be unique to specific areas. We show that our detector and descriptor techniques can provide more accurate keypoint match sets between highly varying imagery than many traditional and state-of-the-art techniques, allowing us to fuse and align photographs, videos, and range scans containing both man-made and natural content.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research