This chapter is an introduction to handling and processing images. With extensive examples, it explains the central Python packages you will need for working with images. This chapter introduces the basic tools for reading images, converting and scaling images, computing derivatives, plotting or saving results, and so on. We will use these throughout the remainder of the book.

The *Python Imaging Library* (*PIL*) provides general image handling and lots of useful basic image operations like resizing, cropping, rotating, color conversion and much more. PIL is free and available from *http://www.pythonware.com/products/pil/*.

With PIL, you can read images from most formats and write to the most common ones. The most important module is the `Image` module. To read an image, use:

```
from PIL import Image

pil_im = Image.open('empire.jpg')
```

The return value, *pil_im*, is a PIL image object.

Color conversions are done using the `convert()` method. To read an image and convert it to grayscale, just add `convert('L')` like this:

pil_im = Image.open('empire.jpg').convert('L')

Here are some examples taken from the PIL documentation, available at *http://www.pythonware.com/library/pil/handbook/index.htm*. Output from the examples is shown in Figure 1-1.

Using the `save()` method, PIL can save images in most image file formats. Here’s an example that takes all image files in a list of filenames (*filelist*) and converts the images to JPEG files:

```
from PIL import Image
import os

for infile in filelist:
    outfile = os.path.splitext(infile)[0] + ".jpg"
    if infile != outfile:
        try:
            Image.open(infile).save(outfile)
        except IOError:
            print "cannot convert", infile
```

The PIL function `open()` creates a PIL image object and the `save()` method saves the image to a file with the given filename. The new filename will be the same as the original with the file ending “.jpg” instead. PIL is smart enough to determine the image format from the file extension. There is a simple check that the file is not already a JPEG file, and a message is printed to the console if the conversion fails.
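The renaming logic can be sketched in isolation. A minimal example of how `os.path.splitext` builds the new filename (the filenames here are made up for illustration):

```python
import os

def jpeg_name(infile):
    # strip the original extension and append ".jpg"
    return os.path.splitext(infile)[0] + ".jpg"

print(jpeg_name('empire.png'))       # empire.jpg
print(jpeg_name('photos/img.tiff'))  # photos/img.jpg
```

Note that a file already ending in “.jpg” maps to itself, which is why the `infile != outfile` check above skips it.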

Throughout this book we are going to need lists of images to process. Here’s how you could create a list of filenames of all images in a folder. Create a file called *imtools.py* to store some of these generally useful routines and add the following function:

```
import os

def get_imlist(path):
    """ Returns a list of filenames for
        all jpg images in a directory. """

    return [os.path.join(path,f) for f in os.listdir(path) if f.endswith('.jpg')]
```

Now, back to PIL.

Using PIL to create thumbnails is very simple. The `thumbnail()` method takes a tuple specifying the new size and converts the image to a thumbnail image with size that fits within the tuple. To create a thumbnail with longest side 128 pixels, use the method like this:

pil_im.thumbnail((128,128))

Cropping a region from an image is done using the `crop()` method:

```
box = (100,100,400,400)
region = pil_im.crop(box)
```

The region is defined by a 4-tuple, where coordinates are (left, upper, right, lower). PIL uses a coordinate system with (0, 0) in the upper left corner. The extracted region can, for example, be rotated and then put back using the `paste()` method like this:

```
region = region.transpose(Image.ROTATE_180)
pil_im.paste(region,box)
```

To resize an image, call `resize()` with a tuple giving the new size:

out = pil_im.resize((128,128))

To rotate an image, use counterclockwise angles and `rotate()` like this:

out = pil_im.rotate(45)

Some examples are shown in Figure 1-1. The leftmost image is the original, followed by a grayscale version, a rotated crop pasted in, and a thumbnail image.

When working with mathematics and plotting graphs or drawing points, lines, and curves on images, `Matplotlib` is a good graphics library with much more powerful features than the plotting available in PIL. `Matplotlib` produces high-quality figures like many of the illustrations used in this book. `Matplotlib`’s `PyLab` interface is the set of functions that allows the user to create plots. `Matplotlib` is open source and available freely from *http://matplotlib.sourceforge.net/*, where detailed documentation and tutorials are available. Here are some examples showing most of the functions we will need in this book.

Although it is possible to create nice bar plots, pie charts, scatter plots, etc., only a few commands are needed for most computer vision purposes. Most importantly, we want to be able to show things like interest points, correspondences, and detected objects using points and lines. Here is an example of plotting an image with a few points and a line:

```
from PIL import Image
from pylab import *

# read image to array
im = array(Image.open('empire.jpg'))

# plot the image
imshow(im)

# some points
x = [100,100,400,400]
y = [200,500,200,500]

# plot the points with red star-markers
plot(x,y,'r*')

# line plot connecting the first two points
plot(x[:2],y[:2])

# add title and show the plot
title('Plotting: "empire.jpg"')
show()
```

This plots the image, then four points with red star markers at the x and y coordinates given by the *x* and *y* lists, and finally draws a line (blue by default) between the first two points in these lists. Figure 1-2 shows the result. The `show()` command starts the figure GUI and raises the figure windows. This GUI loop blocks your script, which pauses until the last figure window is closed. You should call `show()` only once per script, usually at the end. Note that `PyLab` uses a coordinate origin at the top left corner, as is common for images. The axes are useful for debugging, but if you want a prettier plot, add:

axis('off')

This will give a plot like the one on the right in Figure 1-2 instead.

There are many options for formatting color and styles when plotting. The most useful are the short commands shown in Table 1-1, Table 1-2 and Table 1-3. Use them like this:

```
plot(x,y)         # default blue solid line
plot(x,y,'r*')    # red star-markers
plot(x,y,'go-')   # green line with circle-markers
plot(x,y,'ks:')   # black dotted line with square-markers
```

Let’s look at two examples of special plots: image contours and image histograms. Visualizing image iso-contours (or iso-contours of other 2D functions) can be very useful. This needs grayscale images, because the contours need to be taken on a single value for every coordinate [*x*, *y*]. Here’s how to do it:

Figure 1-2. Examples of plotting with `Matplotlib`. An image with points and a line, with and without showing the axes.

Table 1-1. Basic color formatting commands for plotting with `PyLab`.

Command | Color
---|---
`'b'` | blue
`'g'` | green
`'r'` | red
`'c'` | cyan
`'m'` | magenta
`'y'` | yellow
`'k'` | black
`'w'` | white

Table 1-2. Basic line style formatting commands for plotting with `PyLab`.

Command | Line style
---|---
`'-'` | solid
`'--'` | dashed
`':'` | dotted

Table 1-3. Basic plot marker formatting commands for plotting with `PyLab`.

Command | Marker
---|---
`'.'` | point
`'o'` | circle
`'s'` | square
`'*'` | star
`'+'` | plus
`'x'` | x

```
from PIL import Image
from pylab import *

# read image to array
im = array(Image.open('empire.jpg').convert('L'))

# create a new figure
figure()
# don't use colors
gray()
# show contours with origin upper left corner
contour(im, origin='image')
axis('equal')
axis('off')
```

As before, the PIL method `convert()` does conversion to grayscale.

An image histogram is a plot showing the distribution of pixel values. A number of bins is specified for the span of values, and each bin gets a count of how many pixels have values in the bin’s range. The visualization of the (graylevel) image histogram is done using the `hist()` function:

```
figure()
hist(im.flatten(),128)
show()
```

The second argument specifies the number of bins to use. Note that the image needs to be flattened first, because `hist()` takes a one-dimensional array as input. The method `flatten()` converts any array to a one-dimensional array with values taken row-wise. Figure 1-3 shows the contour and histogram plot.

Sometimes users need to interact with an application, for example by marking points in an image, or you need to annotate some training data. `PyLab` comes with a simple function, `ginput()`, that lets you do just that. Here’s a short example:

```
from PIL import Image
from pylab import *

im = array(Image.open('empire.jpg'))
imshow(im)
print 'Please click 3 points'
x = ginput(3)
print 'you clicked:',x
show()
```

This plots an image and waits for the user to click three times in the image region of the figure window. The coordinates [*x*, *y*] of the clicks are saved in a list *x*.

`NumPy` (*http://www.scipy.org/NumPy/*) is a package popularly used for scientific computing with Python. `NumPy` contains a number of useful concepts such as array objects (for representing vectors, matrices, images and much more) and linear algebra functions. The `NumPy` array object will be used in almost all examples throughout this book.^{[2]} The array object lets you do important operations such as matrix multiplication, transposition, solving equation systems, vector multiplication, and normalization, which are needed to do things like aligning images, warping images, modeling variations, classifying images, grouping images, and so on.

`NumPy` is freely available from *http://www.scipy.org/Download* and the online documentation (*http://docs.scipy.org/doc/numpy/*) contains answers to most questions. For more details on `NumPy`, the freely available book [24] is a good reference.

When we loaded images in the previous examples, we converted them to `NumPy` array objects with the `array()` call but didn’t mention what that means. Arrays in `NumPy` are multi-dimensional and can represent vectors, matrices, and images. An array is much like a list (or list of lists) but is restricted to having all elements of the same type. Unless specified on creation, the type will automatically be set depending on the data.

The following example illustrates this for images:

```
im = array(Image.open('empire.jpg'))
print im.shape, im.dtype

im = array(Image.open('empire.jpg').convert('L'),'f')
print im.shape, im.dtype
```

The printout in your console will look like this:

```
(800, 569, 3) uint8
(800, 569) float32
```

The first tuple on each line is the shape of the image array (rows, columns, color channels), and the following string is the data type of the array elements. Images are usually encoded with unsigned 8-bit integers (uint8), so loading this image and converting to an array gives the type “uint8” in the first case. The second case does grayscale conversion and creates the array with the extra argument “f”. This is a short command for setting the type to floating point. For more data type options, see [24]. Note that the grayscale image has only two values in the shape tuple; obviously it has no color information.
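The same dtype behavior can be seen without an image file; a minimal sketch using a small synthetic array:

```python
from numpy import array, uint8

# a tiny 2x2 "image" of unsigned 8-bit values
im = array([[0, 128], [64, 255]], uint8)
print(im.shape, im.dtype)    # (2, 2) uint8

# the extra 'f' argument forces floating point
imf = array(im, 'f')
print(imf.shape, imf.dtype)  # (2, 2) float32
```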

Elements in the array are accessed with indexes. The value at coordinates *i*, *j* and color channel *k* are accessed like this:

value = im[i,j,k]

Multiple elements can be accessed using array slicing. *Slicing* returns a view into the array specified by intervals. Here are some examples for a grayscale image:

```
im[i,:] = im[j,:]     # set the values of row i with values from row j
im[:,i] = 100         # set all values in column i to 100
im[:100,:50].sum()    # the sum of the values of the first 100 rows and 50 columns
im[50:100,50:100]     # rows 50-100, columns 50-100 (100th not included)
im[i].mean()          # average of row i
im[:,-1]              # last column
im[-2,:] (or im[-2])  # second to last row
```

Note the example with only one index. If you only use one index, it is interpreted as the row index. Note also the last examples. Negative indices count from the last element backward. We will frequently use slicing to access pixel values, and it is an important concept to understand.
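These slicing operations are easy to verify on a small synthetic array; a minimal sketch (no image file needed):

```python
from numpy import arange

im = arange(16).reshape(4,4)  # a small synthetic "image"

im[0,:] = im[1,:]             # copy row 1 into row 0
im[:,2] = 100                 # set all of column 2 to 100
block_sum = im[:2,:2].sum()   # sum of the top-left 2x2 block
last_col = im[:,-1]           # last column
row = im[-2]                  # one index only: the second to last row
```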

There are many operations and ways to use arrays. We will introduce them as they are needed throughout this book. See the online documentation or the book [24] for more explanations.

After reading images to `NumPy` arrays, we can perform any mathematical operation we like on them. A simple example of this is to transform the graylevels of an image. Take any function *f* that maps the interval 0 . . . 255 (or, if you like, 0 . . . 1) to itself (meaning that the output has the same range as the input). Here are some examples:

```
from PIL import Image
from numpy import *

im = array(Image.open('empire.jpg').convert('L'))

im2 = 255 - im                    # invert image
im3 = (100.0/255) * im + 100      # clamp to interval 100...200
im4 = 255.0 * (im/255.0)**2       # squared
```

The first example inverts the graylevels of the image, the second one clamps the intensities to the interval 100 . . . 200, and the third applies a quadratic function, which lowers the values of the darker pixels. Figure 1-4 shows the functions and Figure 1-5 the resulting images. You can check the minimum and maximum values of each image using:

print int(im.min()), int(im.max())

Figure 1-4. Example of graylevel transforms. Three example functions together with the identity transform showed as a dashed line.

Figure 1-5. Graylevel transforms. Applying the functions in Figure 1-4: Inverting the image with f(x) = 255 – x (left), clamping the image with f(x) = (100/255)x + 100 (middle), quadratic transformation with f(x) = 255(x/255)^{2} (right).

If you try that for each of the examples above, you should get the following output:

```
2 255
0 253
100 200
0 255
```
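You can verify these ranges without an image file; a sketch of the same transforms on a tiny synthetic array:

```python
from numpy import array

im = array([[0, 50], [128, 255]], 'uint8')

im2 = 255 - im                  # invert
im3 = (100.0/255) * im + 100    # clamp to interval 100...200
im4 = 255.0 * (im/255.0)**2     # squared

print(int(im2.min()), int(im2.max()))  # 0 255
print(int(im3.min()), int(im3.max()))  # 100 200
```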

The reverse of the `array()` transformation can be done using the PIL function `fromarray()` as:

pil_im = Image.fromarray(im)

If you did some operation to change the type from “uint8” to another data type, such as *im3* or *im4* in the example above, you need to convert back before creating the PIL image:

pil_im = Image.fromarray(uint8(im))

If you are not absolutely sure of the type of the input, you should do this as it is the safe choice. Note that `NumPy` will always change the array type to the “lowest” type that can represent the data. Multiplication or division with floating point numbers will change an integer type array to float.
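A quick demonstration of this automatic type change:

```python
from numpy import array

a = array([1, 2, 3])   # an integer array
b = a / 2.0            # dividing by a float...

print(a.dtype.kind)    # 'i' (integer)
print(b.dtype.kind)    # 'f' (floating point)
```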

`NumPy` arrays will be our main tool for working with images and data. There is no simple way to resize arrays, which you will want to do for images. We can use the PIL image object conversion shown earlier to make a simple image resizing function. Add the following to *imtools.py*:

```
def imresize(im,sz):
    """ Resize an image array using PIL. """

    pil_im = Image.fromarray(uint8(im))
    return array(pil_im.resize(sz))
```

This function will come in handy later.

A very useful example of a graylevel transform is *histogram equalization*. This transform flattens the graylevel histogram of an image so that all intensities are as equally common as possible. This is often a good way to normalize image intensity before further processing and also a way to increase image contrast.

The transform function is, in this case, a *cumulative distribution function* (cdf) of the pixel values in the image (normalized to map the range of pixel values to the desired range).

Here’s how to do it. Add this function to the file *imtools.py*:

```
def histeq(im,nbr_bins=256):
    """ Histogram equalization of a grayscale image. """

    # get image histogram
    imhist,bins = histogram(im.flatten(),nbr_bins,normed=True)
    cdf = imhist.cumsum()      # cumulative distribution function
    cdf = 255 * cdf / cdf[-1]  # normalize

    # use linear interpolation of cdf to find new pixel values
    im2 = interp(im.flatten(),bins[:-1],cdf)

    return im2.reshape(im.shape), cdf
```

The function takes a grayscale image and the number of bins to use in the histogram as input, and returns an image with equalized histogram together with the cumulative distribution function used to do the mapping of pixel values. Note the use of the last element (index -1) of the cdf to normalize it between 0 ... 1. Try this on an image like this:

```
from PIL import Image
from numpy import *
import imtools

im = array(Image.open('AquaTermi_lowcontrast.jpg').convert('L'))
im2,cdf = imtools.histeq(im)
```

Figure 1-6 and Figure 1-7 show examples of histogram equalization. The top row shows the graylevel histogram before and after equalization together with the cdf mapping. As you can see, the contrast increases and the details of the dark regions now appear clearly.
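If you are on a recent NumPy, note that `histogram()` no longer accepts the `normed` argument; `density=True` is the modern equivalent. Here is a self-contained sketch of the same equalization on synthetic data (the low-contrast “image” is made up, and the function mirrors `histeq()` with only that one change):

```python
from numpy import histogram, interp, random

def histeq(im, nbr_bins=256):
    """ Histogram equalization; density=True replaces
        the removed normed=True argument. """
    imhist, bins = histogram(im.flatten(), nbr_bins, density=True)
    cdf = imhist.cumsum()            # cumulative distribution function
    cdf = 255 * cdf / cdf[-1]        # normalize to 0...255
    im2 = interp(im.flatten(), bins[:-1], cdf)
    return im2.reshape(im.shape), cdf

random.seed(1)
im = 100 + 20 * random.rand(50, 50)  # synthetic low-contrast image
im2, cdf = histeq(im)
```

After equalization, the value range of *im2* is much wider than the original 100 . . . 120 span, which is exactly the contrast increase described above.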

Averaging images is a simple way of reducing image noise and is also often used for artistic effects. Computing an average image from a list of images is not difficult. Assuming the images all have the same size, we can compute the average of all those images by simply summing them up and dividing with the number of images. Add the following function to *imtools.py*:

```
def compute_average(imlist):
    """ Compute the average of a list of images. """

    # open first image and make into array of type float
    averageim = array(Image.open(imlist[0]), 'f')

    for imname in imlist[1:]:
        try:
            averageim += array(Image.open(imname))
        except:
            print imname + '...skipped'
    averageim /= len(imlist)

    # return average as uint8
    return array(averageim, 'uint8')
```

This includes some basic exception handling to skip images that can’t be opened. There is another way to compute average images using the `mean()` function. This requires all images to be stacked into an array and will use lots of memory if there are many images. We will use this function in the next section.

*Principal Component Analysis* (*PCA*) is a useful technique for dimensionality reduction and is optimal in the sense that it represents the variability of the training data with as few dimensions as possible. Even a tiny 100 × 100 pixel grayscale image has 10,000 dimensions, and can be considered a point in a 10,000-dimensional space. A megapixel image has dimensions in the millions. With such high dimensionality, it is no surprise that dimensionality reduction comes in handy in many computer vision applications. The projection matrix resulting from PCA can be seen as a change of coordinates to a coordinate system where the coordinates are in descending order of importance.

To apply PCA on image data, the images need to be converted to a one-dimensional vector representation using, for example, `NumPy`’s `flatten()` method.

The flattened images are collected in a single matrix by stacking them, one row for each image. The rows are then centered relative to the mean image before the computation of the dominant directions. To find the principal components, singular value decomposition (SVD) is usually used, but if the dimensionality is high, there is a useful trick that can be used instead since the SVD computation will be very slow in that case. Here is what it looks like in code:

```
from PIL import Image
from numpy import *

def pca(X):
    """ Principal Component Analysis
        input: X, matrix with training data stored as flattened arrays in rows
        return: projection matrix (with important dimensions first), variance
        and mean."""

    # get dimensions
    num_data,dim = X.shape

    # center data
    mean_X = X.mean(axis=0)
    X = X - mean_X

    if dim>num_data:
        # PCA - compact trick used
        M = dot(X,X.T)         # covariance matrix
        e,EV = linalg.eigh(M)  # eigenvalues and eigenvectors
        tmp = dot(X.T,EV).T    # this is the compact trick
        V = tmp[::-1]          # reverse since last eigenvectors are the ones we want
        S = sqrt(e)[::-1]      # reverse since eigenvalues are in increasing order
        for i in range(V.shape[1]):
            V[:,i] /= S
    else:
        # PCA - SVD used
        U,S,V = linalg.svd(X)
        V = V[:num_data]       # only makes sense to return the first num_data

    # return the projection matrix, the variance and the mean
    return V,S,mean_X
```

This function first centers the data by subtracting the mean in each dimension. Then the eigenvectors corresponding to the largest eigenvalues of the covariance matrix are computed, either using a compact trick or using SVD. Here we used the function `range()`, which takes an integer *n* and returns a list of integers 0 . . . (*n* – 1). Feel free to use the alternative `arange()`, which gives an array, or `xrange()`, which gives a generator (and might give speed improvements). We will stick with `range()` throughout the book.

We switch from SVD to a trick with computing eigenvectors of the (smaller) covariance matrix *XX*^{T} if the number of data points is less than the dimension of the vectors. There are also ways of only computing the eigenvectors corresponding to the *k* largest eigenvalues (*k* being the number of desired dimensions), making it even faster. We leave this to the interested reader to explore, since it is really outside the scope of this book. The rows of the matrix *V* are orthogonal and contain the coordinate directions in order of descending variance of the training data.
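The claim that the compact trick recovers the same directions as the SVD can be checked numerically; a small sketch on random data (note that here each eigenvector row is normalized individually, and `abs()` guards against tiny negative eigenvalues from round-off):

```python
from numpy import dot, sqrt, abs, random, linalg, allclose

random.seed(0)
num_data, dim = 5, 20
X = random.rand(num_data, dim)
X = X - X.mean(axis=0)          # center the rows

# compact trick: eigenvectors of the small num_data x num_data matrix
M = dot(X, X.T)
e, EV = linalg.eigh(M)          # eigenvalues in increasing order
tmp = dot(X.T, EV).T            # map up to the dim-dimensional space
V_trick = tmp[::-1]             # largest eigenvalue first
S_trick = sqrt(abs(e))[::-1]
for i in range(num_data):
    if S_trick[i] > 1e-10:
        V_trick[i] /= S_trick[i]

# direct SVD for comparison
U, S, V = linalg.svd(X)

# the leading direction agrees up to sign, and the variances match
assert allclose(abs(dot(V_trick[0], V[0])), 1.0)
assert allclose(S_trick[:4], S[:4])
```

Since the rows were centered, the rank is at most `num_data - 1`, so only the first four singular values are meaningful here.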

Let’s try this on an example of font images. The file *fontimages.zip* contains small thumbnail images of the character “a” printed in different fonts and then scanned. The 2,359 fonts are from a collection of freely available fonts.^{[3]} Assuming that the filenames of these images are stored in a list, *imlist*, along with the previous code, in a file *pca.py*, the principal components can be computed and shown like this:

```
from PIL import Image
from numpy import *
from pylab import *
import pca

im = array(Image.open(imlist[0]))  # open one image to get size
m,n = im.shape[0:2]                # get the size of the images
imnbr = len(imlist)                # get the number of images

# create matrix to store all flattened images
immatrix = array([array(Image.open(im)).flatten() for im in imlist],'f')

# perform PCA
V,S,immean = pca.pca(immatrix)

# show some images (mean and 7 first modes)
figure()
gray()
subplot(2,4,1)
imshow(immean.reshape(m,n))
for i in range(7):
    subplot(2,4,i+2)
    imshow(V[i].reshape(m,n))
show()
```

Figure 1-8. The mean image (top left) and the first seven modes; that is, the directions with most variation.

Note that the images need to be converted back from the one-dimensional representation using `reshape()`. Running the example should give eight images in one figure window like the ones in Figure 1-8. Here we used the `PyLab` function `subplot()` to place multiple plots in one window.

If you want to save some results or data for later use, the `pickle` module, which comes with Python, is very useful. Pickle can take almost any Python object and convert it to a string representation. This process is called *pickling*. Reconstructing the object from the string representation is conversely called *unpickling*. This string representation can then be easily stored or transmitted.
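A round trip through pickle can be sketched without touching the file system by using `dumps()` and `loads()`, which work on the serialized representation in memory:

```python
import pickle

# stand-ins for real results such as a mean image and modes
data = {'mean': [0.1, 0.2], 'modes': [[1, 0], [0, 1]]}

s = pickle.dumps(data)       # object -> bytes ("pickling")
restored = pickle.loads(s)   # bytes -> object ("unpickling")

print(restored == data)      # True
```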

Let’s illustrate this with an example. Suppose we want to save the image mean and principal components of the font images in the previous section. This is done like this:

import pickle

*# save mean and principal components*
f = open('font_pca_modes.pkl', 'wb')
pickle.dump(immean,f)
pickle.dump(V,f)
f.close()

As you can see, several objects can be pickled to the same file. There are several different protocols available for the *.pkl* files, and if unsure, it is best to read and write binary files. To load the data in some other Python session, just use the `load()` method like this:

*# load mean and principal components*
f = open('font_pca_modes.pkl', 'rb')
immean = pickle.load(f)
V = pickle.load(f)
f.close()

Note that the order of the objects should be the same! There is also an optimized version written in C called `cPickle` that is fully compatible with the standard `pickle` module. More details can be found on the pickle module documentation page *http://docs.python.org/library/pickle.html*.

For the remainder of this book, we will use the `with` statement to handle file reading and writing. This is a construct, introduced in Python 2.5, that automatically handles opening and closing of files (even if errors occur while the files are open). Here is what the saving and loading above looks like using `with`:

*# open file and save*
with open('font_pca_modes.pkl', 'wb') as f:
pickle.dump(immean,f)
pickle.dump(V,f)

and:

*# open file and load*
with open('font_pca_modes.pkl', 'rb') as f:
immean = pickle.load(f)
V = pickle.load(f)

This might look strange the first time you see it, but it is a very useful construct. If you don’t like it, just use the `open` and `close` functions as above.

As an alternative to using pickle, `NumPy` also has simple functions for reading and writing text files that can be useful if your data does not contain complicated structures, for example a list of points clicked in an image. To save an array *x* to file, use:

savetxt('test.txt',x,'%i')

The last parameter indicates that integer format should be used. Similarly, reading is done like this:

x = loadtxt('test.txt')

You can find out more from the online documentation *http://docs.scipy.org/doc/numpy/reference/generated/numpy.loadtxt.html*.
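A round trip with `savetxt()`/`loadtxt()` can be sketched with a temporary file (the filename and data here are made up for illustration):

```python
import os, tempfile
from numpy import array, savetxt, loadtxt, allclose

x = array([[100, 200], [300, 400]])

fname = os.path.join(tempfile.gettempdir(), 'test_points.txt')
savetxt(fname, x, '%i')   # write with integer formatting
y = loadtxt(fname)        # read back (as floats by default)
os.remove(fname)
```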

Finally, `NumPy` has dedicated functions for saving and loading arrays. Look for `save()` and `load()` in the online documentation for the details.

`SciPy` (*http://scipy.org/*) is an open-source package for mathematics that builds on `NumPy` and provides efficient routines for a number of operations, including numerical integration, optimization, statistics, signal processing, and most importantly for us, image processing. As the following will show, there are many useful modules in `SciPy`. `SciPy` is free and available at *http://scipy.org/Download*.

A classic and very useful example of image convolution is *Gaussian blurring* of images. In essence, the (grayscale) image *I* is convolved with a Gaussian kernel to create a blurred version

$$I_\sigma = I * G_\sigma,$$

where * indicates convolution and $G_\sigma$ is a Gaussian 2D-kernel with standard deviation $\sigma$ defined as

$$G_\sigma = \frac{1}{2\pi\sigma^2} e^{-(x^2+y^2)/(2\sigma^2)}.$$

Gaussian blurring is used to define an image scale to work in, for interpolation, for computing interest points, and in many more applications.
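To make the kernel concrete, here is a small numpy-only sketch that samples a 1D Gaussian and builds the 2D kernel as an outer product; this separability is what makes fast filtering possible. (The truncation radius of 3σ is a common convention, not SciPy's code.)

```python
from numpy import arange, exp, outer, allclose

def gauss_kernel_1d(sigma, radius=None):
    """ Sampled 1D Gaussian, renormalized to sum to 1. """
    if radius is None:
        radius = int(3 * sigma)   # common truncation choice
    x = arange(-radius, radius + 1)
    g = exp(-x**2 / (2.0 * sigma**2))
    return g / g.sum()

g = gauss_kernel_1d(2.0)
G = outer(g, g)   # the 2D kernel is separable

print(G.shape)    # (13, 13)
```

Because both factors sum to 1, the 2D kernel does too, so blurring does not change the overall image brightness.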

`SciPy` comes with a module for filtering called `scipy.ndimage.filters` that can be used to compute these convolutions using a fast 1D separation. All you need to do is this:

```
from PIL import Image
from numpy import *
from scipy.ndimage import filters

im = array(Image.open('empire.jpg').convert('L'))
im2 = filters.gaussian_filter(im,5)
```

Here the last parameter of `gaussian_filter()` is the standard deviation.

Figure 1-9 shows examples of an image blurred with increasing *σ*. Larger values give less detail. To blur color images, simply apply Gaussian blurring to each color channel:

```
im = array(Image.open('empire.jpg'))
im2 = zeros(im.shape)
for i in range(3):
    im2[:,:,i] = filters.gaussian_filter(im[:,:,i],5)
im2 = uint8(im2)
```

Here the last conversion to “uint8” is not always needed but forces the pixel values to be in 8-bit representation. We could also have used `im2 = array(im2,'uint8')` for the conversion.

Figure 1-9. An example of Gaussian blurring using the `scipy.ndimage.filters` module: (a) original image in grayscale; (b) Gaussian filter with σ = 2; (c) with σ = 5; (d) with σ = 10.

For more information on using this module and the different parameter choices, check out the `SciPy` documentation of `scipy.ndimage` at *http://docs.scipy.org/doc/scipy/reference/ndimage.html*.

How the image intensity changes over the image is important information and is used for many applications, as we will see throughout this book. The intensity change is described with the x and y derivatives $I_x$ and $I_y$ of the graylevel image *I* (for color images, derivatives are usually taken for each color channel).

The *image gradient* is the vector $\nabla I = [I_x, I_y]^T$. The gradient has two important properties, the *gradient magnitude*

$$|\nabla I| = \sqrt{I_x^2 + I_y^2},$$

which describes how strong the image intensity change is, and the *gradient angle*

$$\alpha = \mathrm{arctan2}(I_y, I_x),$$

which indicates the direction of largest intensity change at each point (pixel) in the image. The `NumPy` function `arctan2()` returns the signed angle in radians, in the interval $-\pi \ldots \pi$.
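A quick numeric check of the magnitude and angle formulas (note the argument order of `arctan2()`: the y derivative comes first):

```python
from numpy import arctan2, sqrt, pi, allclose

Ix, Iy = 1.0, 1.0                # equal change in x and y
magnitude = sqrt(Ix**2 + Iy**2)
alpha = arctan2(Iy, Ix)

assert allclose(magnitude, sqrt(2))
assert allclose(alpha, pi/4)     # 45 degrees

# the angle is signed and lives in -pi ... pi
assert allclose(arctan2(-1.0, -1.0), -3*pi/4)
```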

Computing the image derivatives can be done using discrete approximations. These are most easily implemented as convolutions

$$I_x = I * D_x \quad \text{and} \quad I_y = I * D_y.$$

Two common choices for $D_x$ and $D_y$ are the *Prewitt filters*

$$D_x = \begin{bmatrix} -1 & 0 & 1 \\ -1 & 0 & 1 \\ -1 & 0 & 1 \end{bmatrix} \quad \text{and} \quad D_y = \begin{bmatrix} -1 & -1 & -1 \\ 0 & 0 & 0 \\ 1 & 1 & 1 \end{bmatrix},$$

and *Sobel filters*

$$D_x = \begin{bmatrix} -1 & 0 & 1 \\ -2 & 0 & 2 \\ -1 & 0 & 1 \end{bmatrix} \quad \text{and} \quad D_y = \begin{bmatrix} -1 & -2 & -1 \\ 0 & 0 & 0 \\ 1 & 2 & 1 \end{bmatrix}.$$

These derivative filters are easy to implement using the standard convolution available in the `scipy.ndimage.filters` module. For example:

```
from PIL import Image
from numpy import *
from scipy.ndimage import filters

im = array(Image.open('empire.jpg').convert('L'))

# Sobel derivative filters
imx = zeros(im.shape)
filters.sobel(im,1,imx)

imy = zeros(im.shape)
filters.sobel(im,0,imy)

magnitude = sqrt(imx**2+imy**2)
```

This computes x and y derivatives and gradient magnitude using the *Sobel filter*. The second argument selects the x or y derivative, and the third stores the output. Figure 1-10 shows an image with derivatives computed using the Sobel filter. In the two derivative images, positive derivatives are shown with bright pixels and negative derivatives are dark. Gray areas have values close to zero.
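To see what the Sobel filter computes, here is a numpy-only sketch that cross-correlates the Sobel $D_x$ with a synthetic ramp image (illustration only; in practice use `filters.sobel`):

```python
from numpy import arange, tile, zeros, array

# ramp image: intensity increases by 1 per pixel in the x direction
im = tile(arange(8, dtype=float), (8, 1))

Dx = array([[-1., 0., 1.],
            [-2., 0., 2.],
            [-1., 0., 1.]])

# plain cross-correlation over the interior pixels
imx = zeros(im.shape)
for i in range(1, im.shape[0] - 1):
    for j in range(1, im.shape[1] - 1):
        imx[i, j] = (Dx * im[i-1:i+2, j-1:j+2]).sum()

# every interior response is 8: the kernel weights (1+2+1) on each
# side, times the pixel distance of 2, times the slope of 1
print(imx[3, 3])  # 8.0
```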

Using this approach has the drawback that derivatives are taken on the scale determined by the image resolution. To be more robust to image noise and to compute derivatives at any scale, *Gaussian derivative filters* can be used:

$$I_x = I * G_{\sigma x} \quad \text{and} \quad I_y = I * G_{\sigma y},$$

where $G_{\sigma x}$ and $G_{\sigma y}$ are the x and y derivatives of $G_\sigma$, a Gaussian function with standard deviation $\sigma$.

The `filters.gaussian_filter()` function we used for blurring earlier can also take extra arguments to compute Gaussian derivatives instead. To try this on an image, simply do:

```
sigma = 5  # standard deviation

imx = zeros(im.shape)
filters.gaussian_filter(im, (sigma,sigma), (0,1), imx)

imy = zeros(im.shape)
filters.gaussian_filter(im, (sigma,sigma), (1,0), imy)
```

The third argument specifies which order of derivatives to use in each direction using the standard deviation determined by the second argument. See the documentation for the details. Figure 1-11 shows the derivatives and gradient magnitude for different scales. Compare this to the blurring at the same scales in Figure 1-9.

*Morphology* (or *mathematical morphology*) is a framework and a collection of image processing methods for measuring and analyzing basic shapes. Morphology is usually applied to binary images but can be used with grayscale also. A *binary image* is an image in which each pixel takes only two values, usually 0 and 1. Binary images are often the result of thresholding an image, for example with the intention of counting objects or measuring their size. A good summary of morphology and how it works is in *http://en.wikipedia.org/wiki/Mathematical_morphology*.

Morphological operations are included in the `scipy.ndimage` module `morphology`. Counting and measurement functions for binary images are in the `scipy.ndimage` module `measurements`. Let’s look at a simple example of how to use them.

Consider the binary image in Figure 1-12.^{[4]} Counting the objects in that image can be done using:

```
from PIL import Image
from numpy import *
from scipy.ndimage import measurements,morphology

# load image and threshold to make sure it is binary
im = array(Image.open('houses.png').convert('L'))
im = 1*(im<128)

labels, nbr_objects = measurements.label(im)
print "Number of objects:", nbr_objects
```

This loads the image and makes sure it is binary by thresholding. Multiplying by 1 converts the boolean array to a binary one. Then the function `label()` finds the individual objects and assigns integer labels to pixels according to which object they belong to. Figure 1-12 shows the *labels* array. The graylevel values indicate object index. As you can see, there are small connections between some of the objects. Using an operation called *binary opening*, we can remove them:

*# morphology - opening to separate objects better*
im_open = morphology.binary_opening(im,ones((9,5)),iterations=2)
labels_open, nbr_objects_open = measurements.label(im_open)
print "Number of objects:", nbr_objects_open

The second argument of `binary_opening()` specifies the *structuring element*, an array that indicates what neighbors to use when centered around a pixel. In this case, we used 9 pixels (4 above, the pixel itself, and 4 below) in the y direction and 5 in the x direction. You can specify any array as structuring element; the non-zero elements will determine the neighbors. The parameter *iterations* determines how many times to apply the operation. Try this and see how the number of objects changes. The image after opening and the corresponding label image are shown in Figure 1-12. As you might expect, there is a function named `binary_closing()` that does the reverse. We leave that and the other functions in `morphology` and `measurements` to the exercises. You can learn more about them from the `scipy.ndimage` documentation, *http://docs.scipy.org/doc/scipy/reference/ndimage.html*.
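To make the effect of opening concrete, here is a small self-contained sketch (the image and the 3 × 3 structuring element are made up for illustration, and the functions are accessed directly from `scipy.ndimage` rather than through the `measurements` and `morphology` submodules) showing how a thin connection between two objects changes the count reported by `label()`:

```python
import numpy as np
from scipy import ndimage

# synthetic binary image: two 10 x 10 squares joined by a thin 2-pixel bridge
im = np.zeros((40, 40), dtype=int)
im[5:15, 5:15] = 1
im[5:15, 25:35] = 1
im[9:11, 15:25] = 1   # the bridge connects the two squares

labels, nbr = ndimage.label(im)   # bridge present: everything is 1 object

# opening with a 3 x 3 structuring element erodes away the thin bridge
# but restores the bulk of the squares in the dilation step
im_open = ndimage.binary_opening(im, np.ones((3, 3)), iterations=1)
labels_open, nbr_open = ndimage.label(im_open)   # now 2 objects
```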

`SciPy` comes with some useful modules for input and output. Two of them are `io` and `misc`.

If you have some data, or find some interesting data set online, stored in Matlab’s *.mat* file format, it is possible to read this using the `scipy.io` module. This is how to do it:

```
data = scipy.io.loadmat('test.mat')
```

The object *data* now contains a dictionary with keys corresponding to the variable names saved in the original *.mat* file. The variables are in array format. Saving to *.mat* files is equally simple. Just create a dictionary with all variables you want to save and use `savemat()`:

```
data = {}
data['x'] = x
scipy.io.savemat('test.mat',data)
```

This saves the array *x* so that it has the name “x” when read into Matlab. More information on `scipy.io` can be found in the online documentation, *http://docs.scipy.org/doc/scipy/reference/io.html*.
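As a quick sanity check, here is a round trip through a *.mat* file (the filename and variable name are made up). One point worth knowing: Matlab has no one-dimensional arrays, so a saved NumPy vector comes back as a 1 × *n* row matrix:

```python
import numpy as np
from scipy import io

x = np.arange(10)
io.savemat('test.mat', {'x': x})   # saved under the Matlab name "x"

data = io.loadmat('test.mat')      # dictionary keyed by variable name
# the 1D vector is returned as a 2D row matrix of shape (1, 10)
row = data['x']
```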

Since we are manipulating images and doing computations using array objects, it is useful to be able to save them directly as image files.^{[5]} Many images in this book are created just like this.

The `imsave()` function is available through the `scipy.misc` module. To save an array *im* to file, just do the following:

```
from scipy.misc import imsave
imsave('test.jpg',im)
```

The `scipy.misc` module also contains the famous “Lena” test image:

```
lena = scipy.misc.lena()
```

This will give you a 512 × 512 grayscale array version of the image.

We conclude this chapter with a very useful example, de-noising of images. Image *de-noising* is the process of removing image noise while at the same time trying to preserve details and structures. We will use the *Rudin-Osher-Fatemi de-noising model* (*ROF*) originally introduced in [28]. Removing noise from images is important for many applications, from making your holiday photos look better to improving the quality of satellite images. The ROF model has the interesting property that it finds a smoother version of the image while preserving edges and structures.

The underlying mathematics of the ROF model and the solution techniques are quite advanced and outside the scope of this book. We’ll give a brief, simplified introduction before showing how to implement a ROF solver based on an algorithm by Chambolle [5].

The *total variation* (*TV*) of a (grayscale) image *I* is defined as the sum of the gradient norm. In a continuous representation, this is

$$J(I) = \int |\nabla I| \, d\mathbf{x} .$$

In a discrete setting, the total variation becomes

$$J(I) = \sum_{\mathbf{x}} |\nabla I| ,$$

where the sum is taken over all image coordinates **x** = [*x*, *y*].

In the Chambolle version of ROF, the goal is to find a de-noised image *U* that minimizes

$$\min_U \; \|I - U\|^2 + 2\lambda J(U) ,$$

where the norm ||*I* – *U*|| measures the difference between *U* and the original image *I*, and λ weights the TV-regularizing term. What this means is, in essence, that the model looks for images that are “flat” but allows “jumps” at edges between regions.
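The discrete total variation is easy to compute directly. A minimal sketch (my own helper function, not part of the book's code), using cyclic forward differences for the gradient:

```python
import numpy as np

def total_variation(I):
    # discrete TV: sum over all pixels of the gradient magnitude,
    # approximated with cyclic forward differences via roll()
    gx = np.roll(I, -1, axis=1) - I
    gy = np.roll(I, -1, axis=0) - I
    return np.sum(np.sqrt(gx**2 + gy**2))

flat = np.ones((10, 10))
noisy = flat + 0.1 * np.random.standard_normal((10, 10))

tv_flat = total_variation(flat)     # a constant image has zero TV
tv_noisy = total_variation(noisy)   # noise strictly increases TV
```

A constant image has zero total variation while added noise increases it, which is why minimizing the ROF objective flattens homogeneous regions while the data term keeps *U* close to *I*.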

Following the recipe in the paper, here’s the code:

```
from numpy import *

def denoise(im,U_init,tolerance=0.1,tau=0.125,tv_weight=100):
    """ An implementation of the Rudin-Osher-Fatemi (ROF) denoising model
        using the numerical procedure presented in eq (11) of A. Chambolle
        (2005).

        Input: noisy input image (grayscale), initial guess for U, tolerance
        for the stop criterion, steplength, weight of the TV-regularizing term.

        Output: denoised and detextured image, texture residual. """

    m,n = im.shape # size of noisy image

    # initialize
    U = U_init
    Px = im # x-component to the dual field
    Py = im # y-component of the dual field
    error = 1

    while (error > tolerance):
        Uold = U

        # gradient of primal variable
        GradUx = roll(U,-1,axis=1)-U # x-component of U's gradient
        GradUy = roll(U,-1,axis=0)-U # y-component of U's gradient

        # update the dual variable
        PxNew = Px + (tau/tv_weight)*GradUx
        PyNew = Py + (tau/tv_weight)*GradUy
        NormNew = maximum(1,sqrt(PxNew**2+PyNew**2))

        Px = PxNew/NormNew # update of x-component (dual)
        Py = PyNew/NormNew # update of y-component (dual)

        # update the primal variable
        RxPx = roll(Px,1,axis=1) # right x-translation of x-component
        RyPy = roll(Py,1,axis=0) # right y-translation of y-component

        DivP = (Px-RxPx)+(Py-RyPy) # divergence of the dual field
        U = im + tv_weight*DivP # update of the primal variable

        # update of error
        error = linalg.norm(U-Uold)/sqrt(n*m)

    return U,im-U # denoised image and texture residual
```

In this example, we used the function `roll()`, which, as the name suggests, “rolls” the values of an array cyclically around an axis. This is very convenient for computing neighbor differences, in this case for derivatives. We also used `linalg.norm()`, which measures the difference between two arrays (in this case, the image matrices *U* and *Uold*). Save the function `denoise()` in a file *rof.py*.
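To see what `roll()` does, here is a one-dimensional sketch (the array values are made up):

```python
import numpy as np

a = np.array([0.0, 1.0, 4.0, 9.0])
shifted = np.roll(a, -1)   # [1., 4., 9., 0.] -- values wrap around cyclically
diff = shifted - a         # forward differences: [1., 3., 5., -9.]

# linalg.norm() gives the norm of an array; applied to a difference,
# it measures how far apart two arrays are (the stop criterion above)
err = np.linalg.norm(diff)
```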

Let’s start with a synthetic example of a noisy image:

```
from numpy import *
from numpy import random
from scipy.ndimage import filters
import rof

# create synthetic image with noise
im = zeros((500,500))
im[100:400,100:400] = 128
im[200:300,200:300] = 255
im = im + 30*random.standard_normal((500,500))

U,T = rof.denoise(im,im)
G = filters.gaussian_filter(im,10)

# save the result
from scipy.misc import imsave
imsave('synth_rof.pdf',U)
imsave('synth_gaussian.pdf',G)
```

The resulting images are shown in Figure 1-13 together with the original. As you can see, the ROF version preserves the edges nicely.

Figure 1-13. An example of ROF de-noising of a synthetic example: (a) original noisy image; (b) image after Gaussian blurring (σ = 10); (c) image after ROF de-noising.

Figure 1-14. An example of ROF de-noising of a grayscale image: (a) original image; (b) image after Gaussian blurring (σ = 5); (c) image after ROF de-noising.

Now, let’s see what happens with a real image:

```
from PIL import Image
from pylab import *
import rof

im = array(Image.open('empire.jpg').convert('L'))
U,T = rof.denoise(im,im)

figure()
gray()
imshow(U)
axis('equal')
axis('off')
show()
```

The result should look something like Figure 1-14, which also shows a blurred version of the same image for comparison. As you can see, ROF de-noising preserves edges and image structures while at the same time blurring out the “noise.”

1. Take an image and apply Gaussian blur like in Figure 1-9. Plot the image contours for increasing values of *σ*. What happens? Can you explain why?

2. Implement an *unsharp masking* operation (*http://en.wikipedia.org/wiki/Unsharp_masking*) by blurring an image and then subtracting the blurred version from the original. This gives a sharpening effect to the image. Try this on both color and grayscale images.

3. An alternative image normalization to histogram equalization is a *quotient image*. A quotient image is obtained by dividing the image with a blurred version, *I*/(*I* ∗ *G*_σ_). Implement this and try it on some sample images.

4. Write a function that finds the outline of simple objects in images (for example, a square against a white background) using image gradients.

5. Use gradient direction and magnitude to detect lines in an image. Estimate the extent of the lines and their parameters. Plot the lines overlaid on the image.

6. Apply the `label()` function to a thresholded image of your choice. Use histograms and the resulting label image to plot the distribution of object sizes in the image.

7. Experiment with successive morphological operations on a thresholded image of your choice. When you have found some settings that produce good results, try the function `center_of_mass` in `measurements` to find the center coordinates of each object and plot them in the image.

From Chapter 2 and onward, we assume PIL, `NumPy`, and `Matplotlib` are included at the top of every file you create and in every code example as:

```
from PIL import Image
from numpy import *
from pylab import *
```

This makes the example code cleaner and the presentation easier to follow. In the cases when we use `SciPy` modules, we will explicitly declare that in the examples.

Purists will object to this type of blanket imports and insist on something like

```
import numpy as np
import matplotlib.pyplot as plt
```

so that namespaces can be kept (to know where each function comes from) and only the `pyplot` part of `Matplotlib` is imported, since the `NumPy` parts imported with `PyLab` are not needed. Purists and experienced programmers know the difference and can choose whichever option they prefer. In the interest of making the content and examples in this book easily accessible to readers, I have chosen not to do this.

Caveat emptor.

^{[2]} `PyLab` actually includes some components of `NumPy`, like the array type. That’s why we could use it in the examples in 1.2 Matplotlib.

^{[3] }Images courtesy of Martin Solli (*http://webstaff.itn.liu.se/~marso/*) collected and rendered from publicly available free fonts.

^{[4] }This image is actually the result of image “segmentation.” Take a look at 9.3 Variational Methods if you want to see how this image was created.

^{[5]} All `PyLab` figures can be saved in a multitude of image formats by clicking the “save” button in the figure window.
