Developing with PDF

Name: Developing with PDF
Author: Leonard Rosenthol
ISBN: 9781449327910

October 2013

Intermediate to advanced

215 pages

4h 29m

English

O'Reilly Media, Inc.

Preface
Who Should Read This Book; Organization of Content; Conventions Used in This Book; Safari® Books Online; How to Contact Us; Acknowledgments
1. PDF Syntax
PDF Objects; Null Objects; Boolean Objects; Numeric Objects; Name Objects; String Objects; Array Objects; Dictionary Objects; Name trees; Number trees; Stream Objects; Direct versus Indirect Objects; File Structure; White-Space; The Four Sections of a PDF; Header; Trailer; Body; Cross-reference table; Incremental Update; Linearization; Document Structure; The Catalog Dictionary; The Page Tree; Pages; PDF units; Rects and boxes; Inheritance; The Name Dictionary; What’s Next
2. PDF Imaging Model
Content Streams; Graphic State; The Painter’s Model; Open versus Closed Paths; Clipping; Drawing Paths; Transformations; Basic Color; Marked Content Operators; Property Lists; Resources; External Graphic State; Basic Transparency; What’s Next
3. Images
Raster Images; Adding the Image; Image dictionaries; Images in content streams; JPEG Images; Transparency and Images; Soft Masks; Stencil Masks; Color-Keyed Masks; Vector Images; Adding the Form XObject; The Form Dictionary; Copying a Page to a Form XObject; What’s Next
4. Text
Fonts; Glyphs; Font Types; The Font Dictionary; Encodings; Text State; Font and Size; Rendering Mode; Drawing Text; Positioning Text; What’s Next
5. Navigation
Destinations; Explicit Destinations; Named Destinations; Actions; The Action Dictionary; GoTo Actions; URI Actions; GoToR and Launch Actions; Multimedia Actions; Nested Actions; Bookmarks or Outlines; What’s Next
6. Annotations
Introduction; Annotation Dictionaries; Appearance Streams; Markup Annotations; Text Markup; Drawing Markup; Attributes; Squares and circles; Lines; Polygons and polylines; Ink; Stamps Markup; Text Annotations and Pop-ups; Non-Markup Annotations; What’s Next
7. AcroForms
The Interactive Form Dictionary; The Field Dictionary; Field Names; Field Flags; Fields and Annotations; Field Classes; Button Fields; Text Fields; Plain text; Rich text; Text field flags; Choice Fields; MultiSelect flag; Options; Values; Scrolling lists; Combo boxes; Editable combo boxes; Signature Fields; Form Actions; SubmitForm; Submission formats; ResetForm; ImportData; What’s Next
8. Embedded Files
File Specifications; Embedded File Streams; URL File Specifications; Ways to Embed Files; FileAttachment Annotations; The EmbeddedFiles Name Tree; Collections; The Collection Dictionary; Collection Schema; GoToE Actions; What’s Next
9. Multimedia and 3D
Simple Media; Sound Annotations; Sound actions; Movie Annotations; The movie dictionary; The movie activation dictionary; Movie actions; Multimedia; Screen Annotation; The appearance characteristics dictionary; Rendition Actions; Rendition objects; 3D; 3D Annotations; The 3D annotation dictionary; 3D views; 3D streams; Markups on 3D; What’s Next

10. Optional Content
Optional Content Groups; Content State; Usage; Optional Content Membership; Visibility Policies; Visibility Expressions; Optional Content Configuration; Order Key; RBGroups; AS (Automatic State); Optional Content Properties; Marking Content as Optional; Optional Content in Content Streams; Optional Content for Form XObjects; Optional Content for Annotations; What’s Next
11. Tagging and Structure
Structured PDF; The Structure Tree; Structure Elements; Standard structure types; Grouping elements; Block-level structural elements; Inline-level structural elements; Artifacts; Role Mapping; Associating Structure to Content; Tagged PDFs; What’s Next
12. Metadata
The Document Information Dictionary; Metadata Streams; XMP; Schemas; XMP in PDF; XMP versus the Info Dictionary; What’s Next
13. PDF Standards
PDF (ISO 32000); PDF/X (ISO 15930); PDF/A (ISO 19005); PDF/E (ISO 24517); PDF/VT (ISO 16612-2); PDF/UA (ISO 14289); Other PDF-Related Standards; PAdES (ETSI TS 102 778); PDF Healthcare
Index
About the Author
Colophon
Copyright

Content preview from Developing with PDF

Chapter 1. PDF Syntax

We’ll begin our exploration of PDF by diving right into the building blocks of the PDF file format. Using these blocks, you’ll see how a PDF file is assembled into the page-based format that you are familiar with.

PDF Objects

The core part of a PDF file is a collection of “things” that the PDF standard (ISO 32000) refers to as objects, or sometimes COS objects.

Note

COS stands for Carousel Object System. Carousel was the original code name for Adobe’s Acrobat product.

These aren’t objects in the “object-oriented programming” sense of the word; instead, they are the building blocks on which PDF stands. There are nine types of objects: null, Boolean, integer, real, name, string, array, dictionary, and stream.

Let’s look at each of these object types and how they are serialized into a PDF file. From there, you’ll see how to use these object types to build higher-level constructs and the PDF format itself.
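
To make the idea of serialization concrete, here is a minimal Python sketch that writes several of the object types in PDF syntax. It is an illustration under stated assumptions, not a complete writer: the Name wrapper class and the serialize function are hypothetical names invented for this example, strings are assumed to be plain ASCII literal strings, names are assumed to need no escaping, and stream objects are left out entirely.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Name:
    """Hypothetical wrapper that marks a Python str as a PDF name object."""
    value: str


def serialize(obj) -> str:
    """Return a simplified PDF-syntax rendering of a Python value."""
    if obj is None:
        return "null"
    if isinstance(obj, bool):  # must come before int: bool is a subclass of int
        return "true" if obj else "false"
    if isinstance(obj, int):
        return str(obj)
    if isinstance(obj, float):
        # Use fixed-point notation (no exponent) and trim trailing zeros.
        text = f"{obj:.6f}".rstrip("0")
        return text + "0" if text.endswith(".") else text
    if isinstance(obj, Name):
        return "/" + obj.value  # assumption: no characters that need escaping
    if isinstance(obj, str):
        # Literal string: escape backslashes and parentheses.
        escaped = obj.replace("\\", "\\\\").replace("(", "\\(").replace(")", "\\)")
        return "(" + escaped + ")"
    if isinstance(obj, (list, tuple)):
        return "[" + " ".join(serialize(item) for item in obj) + "]"
    if isinstance(obj, dict):
        # Keys are expected to be Name instances.
        entries = " ".join(f"{serialize(k)} {serialize(v)}" for k, v in obj.items())
        return "<< " + entries + " >>"
    raise TypeError(f"no PDF serialization for {type(obj).__name__}")


print(serialize([1, 0.5, True, None, Name("Type"), "Hello"]))
# prints: [1 0.5 true null /Type (Hello)]
```

Checking bool before int matters in Python because True and False are also integers; without that ordering, a Boolean would be written as 1 or 0.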

Null Objects

The null object, if actually written to a file, is simply the four characters null. It is synonymous with a missing value, which is why it’s extremely rare to see one in a PDF. If you have reason to work with the null value, be sure to consult ISO 32000 carefully about the subtleties involving its handling.
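
As a rough illustration of "synonymous with a missing value," the sketch below assumes a parsed PDF dictionary is held as a Python dict in which the null object is represented by None. Both lookup and drop_nulls are hypothetical helper names, and the sketch covers only direct dictionary entries, not the other subtleties that ISO 32000 describes.

```python
def lookup(dictionary: dict, key: str):
    """Return the entry's value; None means the entry is absent or null."""
    return dictionary.get(key)


def drop_nulls(dictionary: dict) -> dict:
    """Return a copy without null-valued entries, as a writer might emit it."""
    return {key: value for key, value in dictionary.items() if value is not None}


with_null = {"Type": "Page", "Rotate": None}  # null written explicitly
without = {"Type": "Page"}                    # entry simply omitted

# A reader cannot tell the two apart when it asks for the value.
print(lookup(with_null, "Rotate") == lookup(without, "Rotate"))
# prints: True
```

A writer that normalizes dictionaries this way never needs to emit null for a dictionary entry, which is consistent with how rarely it appears in real files.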

Boolean Objects

Boolean objects represent the logical values true and false. They are written to the PDF as the keywords true and false, respectively.
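
The two keywords can be handled with a pair of small helpers. This is a sketch only: write_boolean and parse_boolean are hypothetical names, and the parser is deliberately strict, accepting exactly the lowercase keywords and nothing else. Whether a reader should tolerate other spellings is a separate policy decision that this example does not try to settle.

```python
_KEYWORDS = {"true": True, "false": False}


def write_boolean(value: bool) -> str:
    """Return the PDF keyword for a Python bool."""
    return "true" if value else "false"


def parse_boolean(token: str) -> bool:
    """Convert a PDF keyword token to a Python bool (strict, case-sensitive)."""
    try:
        return _KEYWORDS[token]
    except KeyError:
        raise ValueError(f"not a PDF boolean keyword: {token!r}") from None


print(write_boolean(True), parse_boolean("false"))
# prints: true False
```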

Note

When writing a PDF, you will always use true or false. However, if you are reading/parsing ...

Publisher Resources

ISBN: 9781449327903
