Python Standard Library

Name: Python Standard Library
Author: Fredrik Lundh
ISBN: 9780596000967

May 2001

Intermediate to advanced

304 pages

6h 12m

English

O'Reilly Media, Inc.

Python Standard Library
Preface
About This Book; What About Tkinter?; Production Details
Conventions Used in This Book
About the Examples
How to Contact Us
1. Core Modules
Introduction; Built-in Functions and Exceptions; Operating System Interface Modules; Type Support Modules; Regular Expressions; Language Support Modules
The __builtin__ Module
Calling a Function with Arguments from a Tuple or Dictionary; Loading and Reloading Modules; Looking in Namespaces; Checking an Object’s Type; Evaluating Python Expressions; Compiling and Executing Code; Overloading Functions from the __builtin__ Module
The exceptions Module
The os Module
Working with Files; Working with Directories; Working with File Attributes; Working with Processes; Working with Daemon Processes
The os.path Module
Working with Filenames; Traversing a Filesystem
The stat Module
The string Module
The re Module
The math Module
The cmath Module
The operator Module
The copy Module
The sys Module
Working with Command-line Arguments; Working with Modules; Working with Reference Counts; Checking the Host Platform; Tracing the Program; Working with Standard Input and Output; Exiting the Program
The atexit Module
The time Module
Getting the Current Time; Converting Time Values to Strings; Converting Strings to Time Values; Converting Time Values; Timing Things
The types Module
The gc Module
2. More Standard Modules
Overview; Files and Streams; Type Wrappers; Random Numbers; Digests and Encryption Algorithms
The fileinput Module
The shutil Module
The tempfile Module
The StringIO Module
The cStringIO Module
The mmap Module
The UserDict Module
The UserList Module
The UserString Module
The traceback Module
The errno Module
The getopt Module
The getpass Module
The glob Module
The fnmatch Module
The random Module
The whrandom Module
The md5 Module
The sha Module
The crypt Module
The rotor Module
The zlib Module
The code Module
3. Threads and Processes
Overview; Threads; Processes
The threading Module
The Queue Module
The thread Module
The commands Module
The pipes Module
The popen2 Module
The signal Module
4. Data Representation
Overview; Binary Data; Self-Describing Formats; Output Formatting; Encoded Binary Data
The array Module
The struct Module
The xdrlib Module
The marshal Module
The pickle Module
The cPickle Module
The copy_reg Module
The pprint Module
The repr Module
The base64 Module
The binhex Module
The quopri Module
The uu Module
The binascii Module
5. File Formats
Overview; Markup Languages; Configuration Files; Archive Formats
The xmllib Module
The xml.parsers.expat Module
The sgmllib Module
The htmllib Module
The htmlentitydefs Module
The formatter Module
The ConfigParser Module
The netrc Module
The shlex Module
The zipfile Module
Listing the Contents; Reading Data from a ZIP File; Writing Data to a ZIP File
The gzip Module
6. Mail and News Message Processing
Overview
The rfc822 Module
The mimetools Module
The MimeWriter Module
The mailbox Module
The mailcap Module
The mimetypes Module
The packmail Module
The mimify Module
The multifile Module
7. Network Protocols
Overview; Internet Time Protocol; Hypertext Transfer Protocol
The socket Module
The select Module
The asyncore Module
The asynchat Module
The urllib Module
The urlparse Module
The cookie Module
The robotparser Module
The ftplib Module
The gopherlib Module
The httplib Module
Posting Data to an HTTP Server
The poplib Module
The imaplib Module
The smtplib Module
The telnetlib Module
The nntplib Module
Listing Messages; Downloading Messages
The SocketServer Module
The BaseHTTPServer Module
The SimpleHTTPServer Module
The CGIHTTPServer Module
The cgi Module
The webbrowser Module
8. Internationalization
The locale Module
The unicodedata Module
The ucnhash Module
9. Multimedia Modules
Overview
The imghdr Module
The sndhdr Module
The whatsound Module
The aifc Module
The sunau Module
The sunaudio Module
The wave Module
The audiodev Module
The winsound Module
The colorsys Module
10. Data Storage
Overview
The anydbm Module
The whichdb Module
The shelve Module
The dbhash Module
The dbm Module
The dumbdbm Module
The gdbm Module
11. Tools and Utilities
The dis Module
The pdb Module
The bdb Module
The profile Module
The pstats Module
The tabnanny Module
12. Platform-Specific Modules
Overview
The fcntl Module
The pwd Module
The grp Module
The nis Module
The curses Module
The termios Module
The tty Module
The resource Module
The syslog Module
The msvcrt Module
The nt Module
The _winreg Module
The posix Module
13. Implementation Support Modules
The dospath Module
The macpath Module
The ntpath Module
The posixpath Module
The strop Module
The imp Module
The new Module
The pre Module
The sre Module
The py_compile Module
The compileall Module
The ihooks Module
The linecache Module
The macurl2path Module
The nturl2path Module
The tokenize Module
The keyword Module
The parser Module
The symbol Module
The token Module
14. Other Modules
Overview
The pyclbr Module
The filecmp Module
The cmd Module
The rexec Module
The Bastion Module
The readline Module
The rlcompleter Module
The statvfs Module
The calendar Module
The sched Module
The statcache Module
The grep Module
The dircache Module
The dircmp Module
The cmp Module
The cmpcache Module
The util Module
The soundex Module
The timing Module
The posixfile Module
The bisect Module
The knee Module
The tzparse Module
The regex Module
The regsub Module
The reconvert Module
The regex_syntax Module
The find Module
Index
Colophon

Content preview from Python Standard Library

The xmllib Module

The xmllib module provides a simple XML parser that uses regular expressions to pull the XML data apart, as shown in Example 5-1. The parser does basic checks on the document: it verifies that there is only one top-level element and that all tags are balanced.

You feed XML data to this parser piece by piece (as data arrives over a network, for example). The parser calls methods in itself for start tags, data sections, end tags, and entities, among other things.

If you’re only interested in a few tags, you can define special start_tag and end_tag methods, where tag is the tag name. The start functions are called with the attributes given as a dictionary.
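The naming convention can be pictured with a small dispatcher. This is only an illustrative sketch in Python 3 syntax, not xmllib's actual implementation; the `TagDispatcher`, `handle_start`, and `unknown_start` names are hypothetical.

```python
class TagDispatcher:
    """Toy dispatcher (hypothetical); not xmllib's real code."""

    def handle_start(self, tag, attrs):
        # Look for a method named start_<tag>; fall back to a generic hook.
        method = getattr(self, "start_" + tag, None)
        if method is not None:
            method(attrs)
        else:
            self.unknown_start(tag, attrs)

    def unknown_start(self, tag, attrs):
        # Tags without a dedicated handler are silently ignored here.
        pass


class QuotationWatcher(TagDispatcher):
    def __init__(self):
        self.ids = []

    def start_quotation(self, attrs):
        # Attributes arrive as a dictionary, as described above.
        self.ids.append(attrs.get("id"))


watcher = QuotationWatcher()
watcher.handle_start("document", {})              # no start_document: ignored
watcher.handle_start("quotation", {"id": "031"})  # routed to start_quotation
print(watcher.ids)
```

Only the tags you care about need a method; everything else falls through to the generic hook.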

Example 5-1. Using the xmllib Module to Extract Information from an Element

File: xmllib-example-1.py

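# Python 2 example; xmllib is not available in Python 3.
import xmllib

class Parser(xmllib.XMLParser):
    # get quotation number

    def __init__(self, file=None):
        xmllib.XMLParser.__init__(self)
        if file:
            self.load(file)

    def load(self, file):
        # feed the document to the parser in 512-byte chunks
        while 1:
            s = file.read(512)
            if not s:
                break
            self.feed(s)
        self.close()

    def start_quotation(self, attrs):
        # called for each <quotation> start tag; attrs is a dictionary
        print "id =>", attrs.get("id")
        raise EOFError  # stop parsing after the first match

try:
    c = Parser()
    c.load(open("samples/sample.xml"))
except EOFError:
    pass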

id => 031
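xmllib was removed in Python 3, so the example above runs only under Python 2. As a rough sketch of the same idea on Python 3, the following uses xml.parsers.expat instead (a different parser, not a drop-in replacement). The `find_quotation_id` helper, the `StopParsing` exception, and the inline `SAMPLE` document are assumptions made for illustration; the book's samples/sample.xml is not reproduced here.

```python
import io
import xml.parsers.expat


class StopParsing(Exception):
    """Raised to abandon parsing once the wanted element is found."""


def find_quotation_id(stream, chunk_size=512):
    """Return the id attribute of the first <quotation> element, or None."""
    found = []

    def start_element(name, attrs):
        # expat passes the attributes as a dictionary.
        if name == "quotation":
            found.append(attrs.get("id"))
            raise StopParsing

    parser = xml.parsers.expat.ParserCreate()
    parser.StartElementHandler = start_element
    try:
        while True:
            chunk = stream.read(chunk_size)
            if not chunk:
                parser.Parse(b"", True)   # signal end of input
                break
            parser.Parse(chunk, False)    # feed the next piece
    except StopParsing:
        pass
    return found[0] if found else None


# Hypothetical stand-in for the sample file.
SAMPLE = (b'<?xml version="1.0"?>'
          b'<document><quotation id="031">text</quotation></document>')

result = find_quotation_id(io.BytesIO(SAMPLE))
print("id =>", result)
```

As in Example 5-1, raising an exception from the handler is what stops the parser early; the rest of the input is never read.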

Example 5-2 contains a simple (and incomplete) rendering engine. The parser maintains an element stack (__tags), which it passes to the renderer, together with text fragments. The renderer looks up the current tag hierarchy in a style dictionary, and if it isn’t already there, ...

ISBN: 0596000960
