Python Cookbook

by Alex Martelli, David Ascher

July 2002

Intermediate to advanced

608 pages

15h 46m

English

O'Reilly Media, Inc.

Python Cookbook
Foreword
Preface
The Design of the Book
The Implementation of the Book
A Note About Licenses
Audience
Organization
Further Reading
Conventions Used in This Book
How to Contact Us

Acknowledgments
David Ascher
Alex Martelli
1. Python Shortcuts
Introduction
Swapping Values Without Using a Temporary Variable
Constructing a Dictionary Without Excessive Quoting
Getting a Value from a Dictionary
Adding an Entry to a Dictionary
Associating Multiple Values with Each Key in a Dictionary
Dispatching Using a Dictionary
Collecting a Bunch of Named Items
Finding the Intersection of Two Dictionaries
Assigning and Testing with One Statement
Using List Comprehensions Instead of map and filter
Unzipping Simple List-Like Objects
Flattening a Nested Sequence
Looping in Parallel over Index and Sequence Items
Looping Through Multiple Lists
Spanning a Range Defined by Floats
Transposing Two-Dimensional Arrays
Creating Lists of Lists Without Sharing References
2. Searching and Sorting
Introduction
Searching and Sorting FAQ
Sorting a Dictionary
Processing Selected Pairs of Structured Data Efficiently
Sorting While Guaranteeing Sort Stability
Sorting by One Field, Then by Another
Looking for Items in a Sorted Sequence Using Binary Search
Sorting a List of Objects by an Attribute of the Objects
Sorting by Item or by Attribute
Selecting Random Elements from a List Without Repetition
Performing Frequent Membership Tests on a Sequence
Finding the Deep Index of an Item in an Embedded Sequence
Showing Off Quicksort in Three Lines
Sorting Objects Using SQL’s ORDER BY Syntax
3. Text
Introduction
What Is Text?
Basic Textual Operations
Sources of Text
String Basics
Processing a String One Character at a Time
Testing if an Object Is String-Like
Aligning Strings
Trimming Space from the Ends of a String
Combining Strings
Checking Whether a String Contains a Set of Characters
Filtering a String for a Set of Characters
Controlling Case
Reversing a String by Words or Characters
Accessing Substrings
Changing the Indentation of a Multiline String
Testing Whether a String Represents an Integer
Expanding and Compressing Tabs
Replacing Multiple Patterns in a Single Pass
Converting Between Different Naming Conventions
Converting Between Characters and Values
Converting Between Unicode and Plain Strings
Printing Unicode Characters to Standard Output
Dispatching Based on Pattern Matches
Evaluating Code Inside Strings
Replacing Python Code with the Results of Executing That Code
Module: Yet Another Python Templating Utility (YAPTU)
Module: Roman Numerals
4. Files
Introduction
File Basics
Portability and Flexibility
Reading from a File
Writing to a File
Searching and Replacing Text in a File
Reading a Particular Line from a File
Retrieving a Line at Random from a File of Unknown Size
Counting Lines in a File
Processing Every Word in a File
Reading a Text File by Paragraphs
Reading Lines with Continuation Characters
Reading Data from ZIP Files
Reading INI Configuration Files
Sending Binary Data to Standard Output Under Windows
Using Random-Access Input/Output
Updating a Random-Access File
Splitting a Path into All of Its Parts
Treating Pathnames as Objects
Creating Directories Including Necessary Parent Directories
Walking Directory Trees
Swapping One File Extension for Another Throughout a Directory Tree
Finding a File Given an Arbitrary Search Path
Finding a File on the Python Search Path
Dynamically Changing the Python Search Path
Computing Directory Sizes in a Cross-Platform Way
File Locking Using a Cross-Platform API
Versioning Filenames
Module: Versioned Backups
5. Object-Oriented Programming
Introduction
Overriding a Built-In Method
Getting All Members of a Class Hierarchy
Calling a Superclass __init__ Method if It Exists
Calling a Superclass Implementation of a Method
Implementing Properties
Implementing Static Methods
Implementing Class Methods
Delegating Automatically as an Alternative to Inheritance
Decorating an Object with Print-Like Methods
Checking if an Object Has Necessary Attributes
Making a Fast Copy of an Object
Adding Methods to a Class at Runtime
Modifying the Class Hierarchy of an Instance
Keeping References to Bound Methods Without Inhibiting Garbage Collection
Defining Constants
Managing Options
Implementing a Set Class
Implementing a Ring Buffer
Implementing a Collection
Delegating Messages to Multiple Objects
Implementing the Singleton Design Pattern
Avoiding the Singleton Design Pattern with the Borg Idiom
Implementing the Null Object Design Pattern
6. Threads, Processes, and Synchronization
Introduction
Storing Per-Thread Information
Terminating a Thread
Allowing Multithreaded Read Access While Maintaining a Write Lock
Running Functions in the Future
Synchronizing All Methods in an Object
Capturing the Output and Error Streams from a Unix Shell Command
Forking a Daemon Process on Unix
Determining if Another Instance of a Script Is Already Running in Windows
Processing Windows Messages Using MsgWaitForMultipleObjects
7. System Administration
Introduction
Running a Command Repeatedly
Generating Random Passwords
Generating Non-Totally Random Passwords
Checking the Status of a Unix Network Interface
Calculating Apache Hits per IP Address
Calculating the Rate of Client Cache Hits on Apache
Manipulating the Environment on Windows NT/2000/XP
Checking and Modifying the Set of Tasks Windows Automatically Runs at Logon
Examining the Microsoft Windows Registry for a List of Name Server Addresses
Getting Information About the Current User on Windows NT/2000
Getting the Windows Service Name from Its Long Name
Manipulating Windows Services
Impersonating Principals on Windows
Changing a Windows NT Password Using ADSI
Working with Windows Scripting Host (WSH) from Python
Displaying Decoded Hotkeys for Shortcuts in Windows
8. Databases and Persistence
Introduction
Serializing Data Using the marshal Module
Serializing Data Using the pickle and cPickle Modules
Using the cPickle Module on Classes and Instances
Mutating Objects with shelve
Accessing a MySQL Database
Storing a BLOB in a MySQL Database
Storing a BLOB in a PostgreSQL Database
Generating a Dictionary Mapping from Field Names to Column Numbers
Using dtuple for Flexible Access to Query Results
Pretty-Printing the Contents of Database Cursors
Establishing Database Connections Lazily
Accessing a JDBC Database from a Jython Servlet
Module: jet2sql—Creating a SQL DDL from an Access Database
9. User Interfaces
Introduction
Avoiding lambda in Writing Callback Functions
Creating Menus with Tkinter
Creating Dialog Boxes with Tkinter
Supporting Multiple Values per Row in a Tkinter Listbox
Embedding Inline GIFs Using Tkinter
Combining Tkinter and Asynchronous I/O with Threads
Using a wxPython Notebook with Panels
Giving the User Unobtrusive Feedback During Data Entry with Qt
Building GUI Solutions Independent of the Specific GUI Toolkit
Creating Color Scales
Using Publish/Subscribe Broadcasting to Loosen the Coupling Between GUI and Business Logic Systems
Module: Building GTK GUIs Interactively
10. Network Programming
Introduction
Writing a TCP Client
Writing a TCP Server
Passing Messages with Socket Datagrams
Finding Your Own Name and Address
Converting IP Addresses
Grabbing a Document from the Web
Being an FTP Client
Sending HTML Mail
Sending Multipart MIME Email
Bundling Files in a MIME Message
Unpacking a Multipart MIME Message
Module: PyHeartBeat—Detecting Inactive Computers
Module: Interactive POP3 Mailbox Inspector
Module: Watching for New IMAP Mail Using a GUI
11. Web Programming
Introduction
Testing Whether CGI Is Working
Writing a CGI Script
Using a Simple Dictionary for CGI Parameters
Handling URLs Within a CGI Script
Resuming the HTTP Download of a File
Stripping Dangerous Tags and Javascript from HTML
Running a Servlet with Jython
Accessing Netscape Cookie Information
Finding an Internet Explorer Cookie
Module: Fetching Latitude/Longitude Data from the Web
12. Processing XML
Introduction
Checking XML Well-Formedness
Counting Tags in a Document
Extracting Text from an XML Document
Transforming an XML Document Using XSLT
Transforming an XML Document Using Python
Parsing an XML File with xml.parsers.expat
Converting Ad-Hoc Text into XML Markup
Normalizing an XML Document
Controlling XSLT Stylesheet Loading
Autodetecting XML Encoding
Module: XML Lexing (Shallow Parsing)
Module: Converting a List of Equal-Length Lists into XML
13. Distributed Programming
Introduction
Making an XML-RPC Method Call
Serving XML-RPC Requests
Using XML-RPC with Medusa
Writing a Web Service That Supports Both XML-RPC and SOAP
Implementing a CORBA Client and Server
Performing Remote Logins Using telnetlib
Using Publish/Subscribe in a Distributed Middleware Architecture
Using Request/Reply in a Distributed Middleware Architecture
14. Debugging and Testing
Introduction
Reloading All Loaded Modules
Tracing Expressions and Comments in Debug Mode
Wrapping Tracebacks in HTML
Getting More Information from Tracebacks
Starting the Debugger Automatically After an Uncaught Exception
Logging and Tracing Across Platforms
Determining the Name of the Current Function
Introspecting the Call Stack with Older Versions of Python
Debugging the Garbage-Collection Process
Tracking Instances of Particular Classes
15. Programs About Programs
Introduction
Lexing
Parsing
PLY and SPARK
Using Python Itself as a Little Language
Introspection
Colorizing Python Source Using the Built-in Tokenizer
Importing a Dynamically Generated Module
Importing from a Module Whose Name Is Determined at Runtime
Importing Modules with Automatic End-of-Line Conversions
Simulating Enumerations in Python
Modifying Methods in Place
Associating Parameters with a Function (Currying)
Composing Functions
Adding Functionality to a Class
Adding a Method to a Class Instance at Runtime
Defining a Custom Metaclass to Control Class Behavior
Module: Allowing the Python Profiler to Profile C Modules
16. Extending and Embedding
Introduction
Implementing a Simple Extension Type
Translating a Python Sequence into a C Array with the PySequence_Fast Protocol
Accessing a Python Sequence Item-by-Item with the Iterator Protocol
Returning None from a Python-Callable C Function
Coding the Methods of a Python Class in C
Implementing C Function Callbacks to a Python Function
Debugging Dynamically Loaded C Extensions with gdb
Debugging Memory Problems
Using SWIG-Generated Modules in a Multithreaded Environment
17. Algorithms
Introduction
Testing if a Variable Is Defined
Evaluating Predicate Tests Across Sequences
Removing Duplicates from a Sequence
Removing Duplicates from a Sequence While Maintaining Sequence Order
Simulating the Ternary Operator in Python
Counting Items and Sorting by Incidence (Histograms)
Memoizing (Caching) the Return Values of Functions
Looking Up Words by Sound Similarity
Computing Factorials with lambda
Generating the Fibonacci Sequence
Wrapping an Unbounded Iterator to Restrict Its Output
Operating on Iterators
Rolling Dice
Implementing a First-In First-Out Container
Modeling a Priority Queue
Converting Numbers to Rationals via Farey Fractions
Evaluating a Polynomial
Module: Finding the Convex Hull of a Set of 2D Points
Module: Parsing a String into a Date/Time Object Portably
18. List of Contributors
A B C D F G H J K L M N P Q R S T U V W Y Z
Index
Colophon

Content preview from Python Cookbook

Resuming the HTTP Download of a File

Credit: Chris Moffitt

Problem

You need to resume an HTTP download of a file that has been partially transferred.

Solution

Large downloads are sometimes interrupted. However, an HTTP server that supports the Range header lets you resume the download from the point where it was interrupted. The standard Python module urllib gives you access to this functionality almost seamlessly. You need only add the Range header to the request and intercept the 206 (Partial Content) status code that the server sends to confirm that it will respond with a partial file:

import urllib, os

class myURLOpener(urllib.FancyURLopener):
    """ Subclass to override error 206 (partial file being sent); okay for us """
    def http_error_206(self, url, fp, errcode, errmsg, headers, data=None):
        pass    # Ignore the expected "non-error" code

def getrest(dlFile, fromUrl, verbose=0):
    loop = 1
    existSize = 0
    myUrlclass = myURLOpener(  )
    if os.path.exists(dlFile):
        outputFile = open(dlFile,"ab")
        existSize = os.path.getsize(dlFile)
        # If the file exists, then download only the remainder
        myUrlclass.addheader("Range","bytes=%s-" % (existSize))
    else:
        outputFile = open(dlFile,"wb")

    webPage = myUrlclass.open(fromUrl)

    if verbose:
        for k, v in webPage.headers.items(  ):
            print k, "=", v

    # If we already have the whole file, there is no need to download it again
    numBytes = 0
    webSize = int(webPage.headers['Content-Length'])
    if webSize == existSize:
        if verbose:
            print "File (%s) was already downloaded from URL (%s)"%(
                dlFile, fromUrl)
    else:
        if verbose:
            print ...
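
The recipe's listing targets the Python 2 urllib module and is cut short in this preview. As a rough illustration of the same idea (send a Range header, then append only if the server answers 206), here is a minimal Python 3 sketch using urllib.request. It is not the author's code: the names build_resume_request and resume_download are hypothetical, and treating a 416 response as "nothing left to fetch" is an assumption of this sketch.

```python
import os
import urllib.error
import urllib.request


def build_resume_request(url, dest_path):
    """Return (request, existing_size) for resuming a download into dest_path.

    Hypothetical helper: adds a Range header only when a partial file exists.
    """
    existing = os.path.getsize(dest_path) if os.path.exists(dest_path) else 0
    request = urllib.request.Request(url)
    if existing:
        # Ask only for the bytes we do not have yet.
        request.add_header("Range", "bytes=%d-" % existing)
    return request, existing


def resume_download(url, dest_path, chunk_size=8192):
    """Download url into dest_path, appending when the server honours Range."""
    request, existing = build_resume_request(url, dest_path)
    try:
        response = urllib.request.urlopen(request)
    except urllib.error.HTTPError as err:
        # Assumption: 416 (Range Not Satisfiable) with a partial file on disk
        # means the file is already complete.
        if err.code == 416 and existing:
            return existing
        raise
    with response:
        # 206 means partial content follows, so append. Any other success
        # code means the server ignored Range, so start the file over.
        mode = "ab" if response.status == 206 else "wb"
        with open(dest_path, mode) as out:
            while True:
                chunk = response.read(chunk_size)
                if not chunk:
                    break
                out.write(chunk)
    return os.path.getsize(dest_path)
```

Calling resume_download(url, path) repeatedly after interruptions should pick up where the last attempt stopped, provided the server supports Range and the remote file has not changed in the meantime.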

Publisher Resources

ISBN: 0596001673
