Book description
Computer Vision Metrics: Survey, Taxonomy, and Analysis provides a technical tour through computer vision, with a survey of nearly 100 types of local, regional, and global feature descriptors, blending history of the field with state-of-the-art analysis of contemporary methods, rather than just another how-to book with source code shortcuts and performance analysis. Observations are provided to develop intuition behind the methods and mathematics, interesting questions are raised for future research rather than providing all the answers, and a Vision Taxonomy is suggested to draw a conceptual map of the field. Extensive illustrations are included, with over 540 references to the literature in the comprehensive bibliography to dig deeper.
Computer Vision Metrics explores the key questions behind the design and mathematics of computer vision metrics and feature descriptors, providing a comprehensive survey and taxonomy of what methods are used, with analysis and observations about why the methods work. Several 3D depth sensing methods are surveyed including MVS, stereo, and structured light.
This work focuses on a slice through the field from the view of feature description metrics, or how to describe, compute, and design the macro-features and micro-features that make up larger objects in images. The focus is on the pixel-side of the vision pipeline, with a light introduction to the back-end training, classification, machine learning, and matching stages.
Computer Vision Metrics is written for engineers, scientists, and academic researchers in areas including video analytics, scene understanding, machine vision, face recognition, gesture recognition, pattern recognition, general object analysis, media processing, and computational photography.
What you'll learn
Current status, brief history, and future directions for computer vision metrics
Taxonomy of local binary, gradient & other spectra, shape features, and basis spaces
Overview of 2D image sensing, 3D depth sensing, and image preprocessing
Vision pipeline optimization methods for computer vision applications
Characterization of ten OpenCV detectors using synthetic feature alphabets
Who this book is for
Engineers, scientists, and academic researchers in areas including media processing, computational photography, video analytics, scene understanding, machine vision, face recognition, gesture recognition, pattern recognition and general object analysis.
Table of contents
- Title Page
- About ApressOpen
- Dedication
- Contents at a Glance
- Contents
- About the Author
- Acknowledgments
- Introduction
- CHAPTER 1: Image Capture and Representation
- CHAPTER 2: Image Pre-Processing
- CHAPTER 3: Global and Regional Features
- CHAPTER 4: Local Feature Design Concepts, Classification, and Learning
- CHAPTER 5: Taxonomy of Feature Description Attributes
- CHAPTER 6: Interest Point Detector and Feature Descriptor Survey
- CHAPTER 7: Ground Truth Data, Content, Metrics, and Analysis
- CHAPTER 8: Vision Pipelines and Optimizations
-
APPENDIX A: Synthetic Feature Analysis
- Background Goals and Expectations
- Test Methodology and Results
- Summary of Synthetic Alphabet Ground Truth Images
- Test 1: Synthetic Interest Point Alphabet Detection
- Test 2: Synthetic Corner Point Alphabet Detection
- Test 3: Synthetic Alphabets Overlaid on Real Images
- Test 4: Rotational Invariance for Each Alphabet
- Analysis of Results and Non-Repeatability Anomalies
- APPENDIX B: Survey of Ground Truth Datasets
- APPENDIX C: Imaging and Computer Vision Resources
- APPENDIX D: Extended SDM Metrics
- Bibliography
- Index
Product information
- Title: Computer Vision Metrics: Survey, Taxonomy, and Analysis
- Author(s):
- Release date: June 2014
- Publisher(s): Apress
- ISBN: 9781430259299
You might also like
book
Emerging Trends in Image Processing, Computer Vision and Pattern Recognition
Emerging Trends in Image Processing, Computer Vision, and Pattern Recognition discusses the latest in trends in …
book
Monitoring Taxonomy
Choosing a monitoring tool can be a tedious exercise. Perhaps you need to inspect sFlow traffic. …
book
Multimodal Scene Understanding
Multimodal Scene Understanding: Algorithms, Applications and Deep Learning presents recent advances in multi-modal computing, with a …
article
Three Ways to Sell Value in B2B Markets
As customers face pressure to reduce costs while maintaining profitability, value-based selling (VBS) has become critical …