Index

A

Accuracy, 
classification, 361, 364, 397
detection, 80, 104, 125
localization, 4, 68, 87, 236, 253
positioning, 237, 251
Adversarial, 
training, 17
Adversarial variational Bayes (AVB), 13, 17
Adversarially learned inference (ALI), 13, 19
Aerial image scenes, 62
Alarm detection, 344
Amazon Mechanical Turk (AMT), 151
Ambient light sensors, 249
Anchor point relation (APR), 225
Annotated datasets, 280
Application programming interface (API), 246
Architecture, 11, 20, 22, 25, 26, 29, 105, 106, 109, 110, 112, 117, 121, 209, 214–216, 220, 238, 241, 244, 249, 387, 388, 391, 392
baseline, 388
CNNs, 15
FuseNet, 57
fusion, 103, 109, 125, 214, 215
model, 27
multimodal, 31
network, 43–45, 48, 49, 81, 386
optimal, 109
performance, 109
TRPN, 112
Artificial intelligence (AI), 

Get Multimodal Scene Understanding now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.