Chapter 3

Multimodal Semantic Segmentation: Fusion of RGB and Depth Data in Convolutional Neural Networks

Zoltan Koppanyi; Dorota Iwaszczuk; Bing Zha; Can Jozef Saul; Charles K. Toth; Alper Yilmaz    Department of Civil, Envir. & Geod Eng., The Ohio State University, Columbus, OH, United StatesPhotogrammetry & Remote Sensing, Technical University of Munich, Munich, GermanyRobert College, Istanbul, Turkey

Abstract

Semantic segmentation has been an active field in computer vision and photogrammetry communities for over a decade. Pixel-level semantic labeling of images is generally achieved by assigning labels to pixels using machine learning techniques. Among others, the encoder–decoder convolutional neural networks have become the baseline ...

Get Multimodal Scene Understanding now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.