Chapter 1

Introduction to Multimodal Scene Understanding

Michael Ying Yang; Bodo Rosenhahn; Vittorio Murino
University of Twente, Enschede, The Netherlands; Leibniz University Hannover, Hannover, Germany; Istituto Italiano di Tecnologia, Genova, Italy


A fundamental goal of computer vision is to discover the semantic information within a given scene, a task commonly referred to as scene understanding. The overall goal is to find a mapping from sensor data to semantic information, which is extremely challenging, partly because of ambiguities in the appearance of the data. However, most of the scene understanding tasks tackled so far involve visual modalities only. In this book, we aim at providing an overview ...
