book

Learning OpenCV 3

by Adrian Kaehler, Gary Bradski

December 2016

Beginner to intermediate

1024 pages

29h 50m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Purpose of This BookWho This Book Is ForWhat This Book Is NotAbout the Programs in This BookPrerequisitesHow This Book Is Best UsedConventions Used in This BookUsing Code ExamplesO’Reilly Online LearningWe’d Like to Hear from YouAcknowledgmentsThanks for Help on OpenCVThanks for Help on This BookAdrian Adds...Gary Adds...
What Is OpenCV?Who Uses OpenCV?What Is Computer Vision?The Origin of OpenCVOpenCV Block DiagramSpeeding Up OpenCV with IPPWho Owns OpenCV?Downloading and Installing OpenCVInstallationGetting the Latest OpenCV via GitMore OpenCV DocumentationSupplied DocumentationOnline Documentation and the WikiOpenCV Contribution RepositoryDownloading and Building Contributed ModulesPortabilitySummaryExercises
Include FilesResourcesFirst Program—Display a PictureSecond Program—VideoMoving AroundA Simple TransformationA Not-So-Simple TransformationInput from a CameraWriting to an AVI FileSummaryExercises
The BasicsOpenCV Data TypesOverview of the Basic TypesBasic Types: Getting Down to DetailsHelper ObjectsUtility FunctionsThe Template StructuresSummaryExercises
Dynamic and Variable StorageThe cv::Mat Class: N-Dimensional Dense ArraysCreating an ArrayAccessing Array Elements IndividuallyThe N-ary Array Iterator: NAryMatIteratorAccessing Array Elements by BlockMatrix Expressions: Algebra and cv::MatSaturation CastingMore Things an Array Can DoThe cv::SparseMat Class: Sparse ArraysAccessing Sparse Array ElementsFunctions Unique to Sparse ArraysTemplate Structures for Large Array TypesSummaryExercises
More Things You Can Do with Arrayscv::abs()cv::absdiff()cv::add()cv::addWeighted()cv::bitwise_and()cv::bitwise_not()cv::bitwise_or()cv::bitwise_xor()cv::calcCovarMatrix()cv::cartToPolar()cv::checkRange()cv::compare()cv::completeSymm()cv::convertScaleAbs()cv::countNonZero()cv::cvarrToMat()cv::dct()cv::dft()cv::cvtColor()cv::determinant()cv::divide()cv::eigen()cv::exp()cv::extractImageCOI()cv::flip()cv::gemm()cv::getConvertElem() and cv::getConvertScaleElem()cv::idct()cv::idft()cv::inRange()cv::insertImageCOI()cv::invert()cv::log()cv::LUT()cv::magnitude()cv::Mahalanobis()cv::max()cv::mean()cv::meanStdDev()cv::merge()cv::min()cv::minMaxIdx()cv::minMaxLoc()cv::mixChannels()cv::mulSpectrums()cv::multiply()cv::mulTransposed()cv::norm()cv::normalize()cv::perspectiveTransform()cv::phase()cv::polarToCart()cv::pow()cv::randu()cv::randn()cv::randShuffle()cv::reduce()cv::repeat()cv::scaleAdd()cv::setIdentity()cv::solve()cv::solveCubic()cv::solvePoly()cv::sort()cv::sortIdx()cv::split()cv::sqrt()cv::subtract()cv::sum()cv::trace()cv::transform()cv::transpose()SummaryExercises
Drawing ThingsLine Art and Filled PolygonsFonts and TextSummaryExercises
Objects That “Do Stuff”Principal Component Analysis (cv::PCA)Singular Value Decomposition (cv::SVD)Random Number Generator (cv::RNG)SummaryExercises
HighGUI: Portable Graphics ToolkitWorking with Image FilesLoading and Saving ImagesA Note About CodecsCompression and DecompressionWorking with VideoReading Video with the cv::VideoCapture ObjectWriting Video with the cv::VideoWriter ObjectData PersistenceWriting to a cv::FileStorageReading from a cv::FileStoragecv::FileNodeSummaryExercises
Working with WindowsHighGUI Native Graphical User InterfaceWorking with the Qt BackendIntegrating OpenCV with Full GUI ToolkitsSummaryExercises

OverviewBefore We BeginFilters, Kernels, and ConvolutionBorder Extrapolation and Boundary ConditionsThreshold OperationsOtsu’s AlgorithmAdaptive ThresholdSmoothingSimple Blur and the Box FilterMedian FilterGaussian FilterBilateral FilterDerivatives and GradientsThe Sobel DerivativeScharr FilterThe LaplacianImage MorphologyDilation and ErosionThe General Morphology FunctionOpening and ClosingMorphological GradientTop Hat and Black HatMaking Your Own KernelConvolution with an Arbitrary Linear FilterApplying a General Filter with cv::filter2D()Applying a General Separable Filter with cv::sepFilter2DKernel BuildersSummaryExercises
OverviewStretch, Shrink, Warp, and RotateUniform ResizeImage PyramidsNonuniform MappingsAffine TransformationPerspective TransformationGeneral RemappingsPolar MappingsLogPolarArbitrary MappingsImage RepairInpaintingDenoisingHistogram Equalizationcv::equalizeHist(): Contrast equalizationSummaryExercises
OverviewDiscrete Fourier Transformcv::dft(): The Discrete Fourier Transformcv::idft(): The Inverse Discrete Fourier Transformcv::mulSpectrums(): Spectrum MultiplicationConvolution Using Discrete Fourier Transformscv::dct(): The Discrete Cosine Transformcv::idct(): The Inverse Discrete Cosine TransformIntegral Imagescv::integral() for Standard Summation Integralcv::integral() for Squared Summation Integralcv::integral() for Tilted Summation IntegralThe Canny Edge Detectorcv::Canny()Hough TransformsHough Line TransformHough Circle TransformDistance Transformationcv::distanceTransform() for Unlabeled Distance Transformcv::distanceTransform() for Labeled Distance TransformSegmentationFlood FillWatershed AlgorithmGrabcutsMean-Shift SegmentationSummaryExercises
Histogram Representation in OpenCVcv::calcHist(): Creating a Histogram from DataBasic Manipulations with HistogramsHistogram NormalizationHistogram ThresholdFinding the Most Populated BinComparing Two HistogramsHistogram Usage ExamplesSome More Sophisticated Histograms MethodsEarth Mover’s DistanceBack ProjectionTemplate MatchingSquare Difference Matching Method (cv::TM_SQDIFF)Normalized Square Difference Matching Method (cv::TM_SQDIFF_NORMED)Correlation Matching Methods (cv::TM_CCORR)Normalized Cross-Correlation Matching Method (cv::TM_CCORR_NORMED)Correlation Coefficient Matching Methods (cv::TM_CCOEFF)Normalized Correlation Coefficient Matching Method (cv::TM_CCOEFF_NORMED)SummaryExercises
Contour FindingContour HierarchiesDrawing ContoursA Contour ExampleAnother Contour ExampleFast Connected Component AnalysisMore to Do with ContoursPolygon ApproximationsGeometry and Summary CharacteristicsGeometrical TestsMatching Contours and ImagesMomentsMore About MomentsMatching and Hu MomentsUsing Shape Context to Compare ShapesSummaryExercises
Overview of Background SubtractionWeaknesses of Background SubtractionScene ModelingA Slice of PixelsFrame DifferencingAveraging Background MethodAccumulating Means, Variances, and CovariancesA More Advanced Background Subtraction MethodStructuresLearning the BackgroundLearning with Moving Foreground ObjectsBackground Differencing: Finding Foreground ObjectsUsing the Codebook Background ModelA Few More Thoughts on Codebook ModelsConnected Components for Foreground CleanupA Quick TestComparing Two Background MethodsOpenCV Background Subtraction EncapsulationThe cv::BackgroundSubtractor Base ClassKaewTraKuPong and Bowden MethodZivkovic MethodSummaryExercises
Keypoints and the Basics of TrackingCorner FindingIntroduction to Optical FlowLucas-Kanade Method for Sparse Optical FlowGeneralized Keypoints and DescriptorsOptical Flow, Tracking, and RecognitionHow OpenCV Handles Keypoints and Descriptors, the General CaseCore Keypoint Detection MethodsKeypoint FilteringMatching MethodsDisplaying ResultsSummaryExercises
Concepts in TrackingDense Optical FlowThe Farnebäck Polynomial Expansion AlgorithmThe Dual TV-L1 AlgorithmThe Simple Flow AlgorithmMean-Shift and Camshift TrackingMean-ShiftCamshiftMotion TemplatesEstimatorsThe Kalman FilterA Brief Note on the Extended Kalman FilterSummaryExercises
Camera ModelThe Basics of Projective GeometryRodrigues TransformLens DistortionsCalibrationRotation Matrix and Translation VectorCalibration BoardsHomographyCamera CalibrationUndistortionUndistortion MapsConverting Undistortion Maps Between Representations with cv::convertMaps()Computing Undistortion Maps with cv::initUndistortRectifyMap()Undistorting an Image with cv::remap()Undistortion with cv::undistort()Sparse Undistortion with cv::undistortPoints()Putting Calibration All TogetherSummaryExercises
ProjectionsAffine and Perspective TransformationsBird’s-Eye-View Transform ExampleThree-Dimensional Pose EstimationPose Estimation from a Single CameraStereo ImagingTriangulationEpipolar GeometryThe Essential and Fundamental MatricesComputing Epipolar LinesStereo CalibrationStereo RectificationStereo CorrespondenceStereo Calibration, Rectification, and Correspondence Code ExampleDepth Maps from Three-Dimensional ReprojectionStructure from MotionFitting Lines in Two and Three DimensionsSummaryExercises
What Is Machine Learning?Training and Test SetsSupervised and Unsupervised LearningGenerative and Discriminative ModelsOpenCV ML AlgorithmsUsing Machine Learning in VisionVariable ImportanceDiagnosing Machine Learning ProblemsLegacy Routines in the ML LibraryK-MeansMahalanobis DistanceSummaryExercises
Common Routines in the ML LibraryTraining and the cv::ml::TrainData StructurePredictionMachine Learning Algorithms Using cv::StatModelNaïve/Normal Bayes ClassifierBinary Decision TreesBoostingRandom TreesExpectation MaximizationK-Nearest NeighborsMultilayer PerceptronSupport Vector MachineSummaryExercises
Tree-Based Object Detection TechniquesCascade ClassifiersSupervised Learning and Boosting TheoryLearning New ObjectsObject Detection Using Support Vector MachinesLatent SVM for Object DetectionThe Bag of Words Algorithm and Semantic CategorizationSummaryExercises
Past and PresentOpenCV 3.xHow Well Did Our Predictions Go Last Time?Future FunctionsCurrent GSoC WorkCommunity ContributionsOpenCV.orgSome AI SpeculationAfterword
Delaunay Triangulation, Voronoi TesselationCreating a Delaunay or Voronoi SubdivisionNavigating Delaunay SubdivisionsUsage ExamplesExercises
An Overview of the opencv_contrib ModulesContents of opencv_contrib
Calibration Patterns Used by OpenCV

Content preview from Learning OpenCV 3

Chapter 20. The Basics of Machine Learning in OpenCV

In this chapter, we’ll begin a discussion of the machinery that is used to turn vision into perception—in other words, the machinery that turns the visual inputs into meaningful visual semantics.

In the previous chapters we have discussed how to turn 2D or 2D+3D sensor information into features, clusters, or geometric information. In the next three chapters, we’ll use the results of these techniques to turn features, segmentations, and their geometry into recognition of scenes or objects; it is this step that turns raw information into a percept: what the machine is seeing and where it is relative to the camera.

In this chapter we will cover the basics of machine learning, focusing mainly on what it is. We will look at some simple machine learning capabilities of the library that form a good starting point for understanding the basic ideas in machine learning as a whole. In the next chapter, we will get into more detail about how modern machine learning methods are implemented in the library.¹

What Is Machine Learning?

The goal of machine learning (ML)² is to turn data into information. After learning from a collection of data, we want a machine to be able to answer questions about the data: What other data is most similar to this data? Is there a car in the image? What ad will the user respond to? There is often a cost component, so this question could become: “Of our most profitable products, which one will the user most ...