Index

Accuracy,

classification, 361, 364, 397

detection, 80, 104, 125

localization, 4, 68, 87, 236, 253

positioning, 237, 251

Adversarial,

training, 17

Adversarial variational Bayes (AVB), 13, 17

Adversarially learned inference (ALI), 13, 19

Aerial image scenes, 62

Alarm detection, 344

Amazon Mechanical Turk (AMT), 151

Ambient light sensors, 249

Anchor point relation (APR), 225

Annotated datasets, 280

Application programming interface (API), 246

baseline, 388

CNNs, 15

FuseNet, 57

fusion, 103, 109, 125, 214, 215

model, 27

multimodal, 31

network, 43–45, 48, 49, 81, 386

optimal, 109

performance, 109

TRPN, 112

Artificial intelligence (AI),

Get Multimodal Scene Understanding now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Multimodal Scene Understanding by Michael Ying Yang, Bodo Rosenhahn, Vittorio Murino