Chapter 3

Multimodal Semantic Segmentation: Fusion of RGB and Depth Data in Convolutional Neural Networks

Zoltan Koppanyi^⁎; Dorota Iwaszczuk^†; Bing Zha^⁎; Can Jozef Saul^‡; Charles K. Toth^⁎; Alper Yilmaz^⁎ ^⁎Department of Civil, Envir. & Geod Eng., The Ohio State University, Columbus, OH, United States^†Photogrammetry & Remote Sensing, Technical University of Munich, Munich, Germany^‡Robert College, Istanbul, Turkey

Abstract

Semantic segmentation has been an active field in computer vision and photogrammetry communities for over a decade. Pixel-level semantic labeling of images is generally achieved by assigning labels to pixels using machine learning techniques. Among others, the encoder–decoder convolutional neural networks have become the baseline ...

Get Multimodal Scene Understanding now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Multimodal Scene Understanding by Michael Ying Yang, Bodo Rosenhahn, Vittorio Murino

Multimodal Semantic Segmentation: Fusion of RGB and Depth Data in Convolutional Neural Networks

Abstract

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly