CHAPTER 5 Deep Learning Algorithms and Architectures for Multimodal Data Analysis

Anwar Ali Sathio1,2, Muhammad Malook Rind2, and Abdullah Lakhan3

1Department of CS&IT, Benazir Bhutto Shaheed University, Karachi, Sindh, Pakistan

2Department of Computer Science, Sindh Madressatul Islam University, Karachi, Sindh, Pakistan

3Department of Cyber Security, Dawood University of Engineering and Technology, Karachi, Sindh, Pakistan

DOI: 10.1201/9781032646268-5

5.1 Introduction to Multimodal Data Analysis

Multimodal data refers to data represented in multiple modes, such as text, speech, images, and videos. Each method provides a different perspective or piece of information about the data and, when analyzed together, can provide a more comprehensive ...

Get Deep Learning for Multimedia Processing Applications now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.