November 2017
Intermediate to advanced
274 pages
6h 16m
English
In addition to comparing CNN architectures with each other, the authors also report the accuracy of a feature-based approach. A standard bag-of-words pipeline was used to extract several types of features at all frames of the videos, followed by discretizing them using k-means vector quantization and accumulating words into histograms with spatial pyramid encoding and soft quantization.
Read now
Unlock full access