Index

A

Accuracy
    classification, 361, 364, 397
    detection, 80, 104, 125
    localization, 4, 68, 87, 236, 253
    positioning, 237, 251
Adversarial
    training, 17
Adversarial variational Bayes (AVB), 13, 17
Adversarially learned inference (ALI), 13, 19
Aerial image scenes, 62
Alarm detection, 344
Amazon Mechanical Turk (AMT), 151
Ambient light sensors, 249
Anchor point relation (APR), 225
Annotated datasets, 280
Application programming interface (API), 246
Architecture, 11, 20, 22, 25, 26, 29, 105, 106, 109, 110, 112, 117, 121, 209, 214–216, 220, 238, 241, 244, 249, 387, 388, 391, 392
    baseline, 388
    CNNs, 15
    FuseNet, 57
    fusion, 103, 109, 125, 214, 215
    model, 27
    multimodal, 31
    network, 43–45, 48, 49, 81, 386
    optimal, 109
    performance, 109
    TRPN, 112
Artificial intelligence (AI), 
