Chapter 12. Projection and 3D Vision
In this chapter we'll move into three-dimensional vision, first with projections and then with multicamera stereo depth perception. To do this, we'll have to carry along some of the concepts from Chapter 11. We'll need the camera intrinsics matrix M, the distortion coefficients, the rotation matrix R, the translation vector T, and especially the homography matrix H.
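As a quick reminder of what these quantities do, here is a minimal sketch of the pinhole projection x ~ M(RX + T) in Python with NumPy. The numeric values of M, R, and T below are made up purely for illustration (they are not from any calibration in the text), and lens distortion is omitted:

```python
import numpy as np

def project(X, M, R, T):
    """Project a 3D world point X to pixel coordinates via x ~ M (R X + T)."""
    Xc = R @ X + T          # world coordinates -> camera coordinates
    x = M @ Xc              # apply the camera intrinsics
    return x[:2] / x[2]     # perspective divide by the third coordinate

# Illustrative intrinsics: fx = fy = 500, principal point (320, 240)
M = np.array([[500.0,   0.0, 320.0],
              [  0.0, 500.0, 240.0],
              [  0.0,   0.0,   1.0]])
R = np.eye(3)                    # identity rotation (camera axes = world axes)
T = np.array([0.0, 0.0, 10.0])   # point ends up 10 units in front of the camera

X = np.array([1.0, 2.0, 0.0])    # a sample world point
print(project(X, M, R, T))       # -> [370. 340.]
```

The same computation, with distortion included, is what `cvProjectPoints2()` performs in OpenCV.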
We'll start by discussing projection into the 3D world using a calibrated camera and reviewing affine and projective transforms (which we first encountered in Chapter 6); then we'll move on to an example of how to get a bird's-eye view of a ground plane. We'll also discuss POSIT, an algorithm that allows us to find the 3D pose (position and rotation) of a known 3D object in an image.
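The bird's-eye-view trick rests on the fact that a 3×3 homography H maps points on one plane (the image of the ground) to points on another (the overhead view), up to a perspective divide. A minimal sketch of applying such a mapping to a single point, using an invented H (in practice H would come from calibration against a known ground-plane pattern, as the chapter will show):

```python
import numpy as np

def warp_point(H, p):
    """Map a 2D point through a 3x3 homography in homogeneous coordinates."""
    q = H @ np.array([p[0], p[1], 1.0])  # lift to homogeneous, apply H
    return q[:2] / q[2]                  # perspective divide

# Illustrative homography: a small translation plus a perspective term.
# (Made-up numbers; a real H is estimated from point correspondences.)
H = np.array([[1.0, 0.00,  5.0],
              [0.0, 1.00, 10.0],
              [0.0, 0.01,  1.0]])

print(warp_point(H, (100.0, 50.0)))  # -> [70. 40.]
```

Warping every pixel of an image this way (which OpenCV does with `cvWarpPerspective()`) produces the bird's-eye view.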
We will then move into the three-dimensional geometry of multiple images. In general, there is no reliable way to do calibration or to extract 3D information without multiple images. The most obvious case in which we use multiple images to reconstruct a three-dimensional scene is stereo vision. In stereo vision, features in two (or more) images taken at the same time from separate cameras are matched across those images, and the disparities between corresponding features are analyzed to yield depth information. Another case is structure from motion, in which we may have only a single camera but multiple images taken at different times and from different places. In the former case we ...