
RELAX NG

Name: RELAX NG
Author: Eric van der Vlist
ISBN: 9780596004217

by Eric van der Vlist

December 2003

Intermediate to advanced

506 pages

12h 26m

English

O'Reilly Media, Inc.

A Note Regarding Supplemental Files
Foreword by James Clark
Foreword by Murata Makoto
Preface
Who Should Read This Book?
Who Shouldn’t Read This Book?
Organization of This Book
Conventions Used in This Book
Comments and Questions
Powered by WikiML

Acknowledgments
I. Tutorial
1. What RELAX NG Offers
Diversity
Keeping Documents Independent of Applications
Validation Has Many Aspects
The Best Way to Validate XML Document Structures
RELAX NG’s Diverse Applications
RELAX NG as a Pivot Format
Why Use Other Schema Languages?
2. Simple Foundations Are Beautiful
Documents and Infosets
Different Types of Schema Languages
A Simple Example
A Strong Mathematical Background
Patterns, and Only Patterns
3. First Schema
Getting Started
First Patterns
The text Pattern
The attribute Pattern
The element Pattern
The optional Pattern
The oneOrMore Pattern
The zeroOrMore Pattern
Complete Schema
Constraining Number of Occurrences
Creating “Russian Doll” Schemas
4. Introducing the Compact Syntax
First Compact Patterns
The text Pattern
The attribute Pattern
Element
The optional Pattern
The oneOrMore Pattern
The zeroOrMore Pattern
Full Schema
XML or Compact?
5. Flattening the First Schema
Defining Named Patterns
Referencing Named Patterns
The grammar and start Elements
Assembling the Parts
Problems That Never Arise
Recursive Models
Escaping Named Pattern Identifiers in the Compact Syntax
6. More Complex Patterns
The group Pattern
The interleave Pattern
The choice Pattern
Pattern Compositions
Order Variation as a Source of Information
Text and Empty Patterns, Whitespace, and Mixed Content
Why Is It Called interleave?
Mixed Content Models with Order
A Restriction Related to interleave
A Missing Pattern: Unordered Group
7. Constraining Text Values
Fixed Values
Co-Occurrence Constraints
Enumerations
Whitespace and RELAX NG Native Datatypes
Using String Datatypes in Attribute Values
When to Use String Datatypes
Using Different Types in Each Value
Exclusions
Lists
Data Versus Text
8. Datatype Libraries
W3C XML Schema Type Library
The Datatypes
String datatypes
URIs
Qualified names
Binary string-encoded datatypes
Numeric datatypes
Date and time formats
Examples
Facets
DTD Compatibility Datatypes
Which Library Should Be Used?
Native Types Versus W3C XML Schema Datatypes
DTD Versus W3C XML Schema Datatypes
9. Using Regular Expressions to Specify Simple Datatypes
A Swiss Army Knife
The Simplest Possible Pattern Facets
Quantifying
More Atoms
Special Characters
Wildcard
Character Classes
Classical Perl character classes
Unicode character classes
User-defined character classes
Or-ing and Grouping
Common Patterns
String Datatypes
Unicode blocks
Counting words
URIs
Numeric and Float Types
Leading zeros
Fixed format
Datetimes
Time zones
10. Creating Building Blocks
Using External References
With Russian Doll Schemas
With Flat Schemas
Embedding Grammars
Referencing Patterns in Parent Grammars
Merging Grammars
Merging Without Redefinition
Merging and Replacing Definitions
Combining Definitions
Combining by choice
Combining by interleave
Why Can’t Definitions Be Defined by Group?
A Real-World Example: XHTML 2.0
Other Options
A Possible Use Case
XML Tools
Text Tools
11. Namespaces
A Ten-Minute Guide to XML Namespaces
The Two Challenges of Namespaces
Declaring Namespaces in Schemas
Using the Default Namespace
Using Prefixes
Accepting Foreign Namespaces
Constructing a Wildcard
Using Wildcards
Where Should Foreign Nodes Be Allowed?
Traps to Avoid
Adding Foreign Nodes Through Combination
Namespaces, Building Blocks, and Chameleon Design
Reexamining XHTML 2.0
Putting a Chameleon in the Library
Good Chameleon or Evil Chameleon?
12. Writing Extensible Schemas
Extensible Schemas
Working from a Fixed Result
Providing a grammar and a start element
Maximize granularity
Defining named patterns for content rather than for elements
Free Formats
Be cautious with attributes
Use order sparingly
Use containers
Restricting Existing Schemas
The Case for Open Schemas
More Name Classes
Extensible and Open?
13. Annotating Schemas
Common Principles for Annotating RELAX NG Schemas
Annotation Using the XML Syntax
Annotations Using the Compact Syntax
Grammar annotations
Initial annotations
Following annotations
Assembling the annotation syntax
When initial annotations turn into following annotations
Annotating Groups of Definitions
Alternatives and Workarounds
Why reinvent XML 1.0 comments and PIs?
Annotation of value and param patterns
Documentation
Comments
RELAX NG DTD Compatibility Comments
XHTML Annotations
DocBook Annotations
Dublin Core Annotations
SVG Annotations
RDDL Annotations
Annotation for Applications
Annotations for Preprocessing
Annotations for Conversion
Annotations to generate DTDs
Annotations to generate W3C XML Schema
Schema Adjunct Framework
Annotations for Extension
Embedded Schematron rules
XVIF
14. Generating RELAX NG Schemas
Examplotron: Instance Documents as Schemas
Ten-Minute Guide to Examplotron
Use Cases
Literate Programming
Out of the Box
Adding Bells and Whistles for RDDL
UML
Spreadsheets
15. Simplification and Restrictions
Simplification
Annotation Removal, Whitespace and Attribute Normalization, and Inheritance
Retrieval of External Schemas
Name Class Normalization
Pattern Normalization
First Set of Constraints
Grammar Merge
Schema Flattening
Final Cleanup
Restrictions
Constraints on Attributes
Bad example: attribute content model
Bad example: attribute duplication
Bad example: name class overlap
Constraints on Lists
Bad example: list and interleave
Constraints on Except Patterns
Constraints on Start Patterns
Constraints on Content Models
Limitations on interleave
Bad example: more than one text pattern in interleave
16. Determinism and Datatype Assignment
What Is Ambiguity?
Ambiguity Versus Determinism
Different Kinds of Ambiguity
Regular expression ambiguities
Ambiguous regular hedge grammars
Name class ambiguity
Ambiguous datatypes
The Downsides of Ambiguous and Nondeterministic Content Models
Instance Annotations
Compatibility with W3C XML Schema
Some Ideas to Make Disambiguation Easier
Generalizing the Except Pattern
Making Disambiguation Rules Explicit
Accepting Ambiguity
II. Reference
17. Element Reference
Elements
18. Compact Syntax Reference
EBNF Production Reference
19. Datatype Reference
xsd:anyURI — URI (Uniform Resource Identifier)
xsd:base64Binary — Binary content coded as “base64”
xsd:boolean — Boolean (true or false)
xsd:byte — Signed value of 8 bits
xsd:date — Gregorian calendar date
xsd:dateTime — Instant of time (Gregorian calendar)
xsd:decimal — Decimal numbers
xsd:double — IEEE 64-bit floating-point
xsd:duration — Time durations
xsd:ENTITIES — Whitespace-separated list of unparsed entity references
xsd:ENTITY — Reference to an unparsed entity
xsd:float — IEEE 32-bit floating-point
xsd:gDay — Recurring period of time: monthly day
xsd:gMonth — Recurring period of time: yearly month
xsd:gMonthDay — Recurring period of time: yearly day
xsd:gYear — Period of one year
xsd:gYearMonth — Period of one month
xsd:hexBinary — Binary contents coded in hexadecimal
xsd:ID — Definition of unique identifiers
xsd:IDREF — Definition of references to unique identifiers
xsd:IDREFS — Definition of lists of references to unique identifiers
xsd:int — 32-bit signed integers
xsd:integer — Signed integers of arbitrary length
xsd:language — RFC 1766 language codes
xsd:long — 64-bit signed integers
xsd:Name — XML 1.0 name
xsd:NCName — Unqualified names
xsd:negativeInteger — Strictly negative integers of arbitrary length
xsd:NMTOKEN — XML 1.0 name token (NMTOKEN)
xsd:NMTOKENS — List of XML 1.0 name tokens (NMTOKEN)
xsd:nonNegativeInteger — Integers of arbitrary length positive or equal to zero
xsd:nonPositiveInteger — Integers of arbitrary length negative or equal to zero
xsd:normalizedString — Whitespace-replaced strings
xsd:NOTATION — Emulation of the XML 1.0 feature
xsd:positiveInteger — Strictly positive integers of arbitrary length
xsd:QName — Namespaces in XML-qualified names
xsd:short — 16-bit signed integers
xsd:string — Any string
xsd:time — Point in time recurring each day
xsd:token — Whitespace-replaced and collapsed strings
xsd:unsignedByte — Unsigned value of 8 bits
xsd:unsignedInt — Unsigned integer of 32 bits
xsd:unsignedLong — Unsigned integer of 64 bits
xsd:unsignedShort — Unsigned integer of 16 bits
III. Appendixes
A. DSDL
A Multipart Standard
Part 1: Overview
Part 2: Regular Grammar-Based Validation
Part 3: Rule-Based Validation
Part 4: Selection of Validation Candidates
Part 5: Datatypes
Part 6: Path-Based Integrity Constraints
Part 7: Character Repertoire Validation
Part 8: Declarative Document Architectures
Part 9: Namespace- and Datatype-Aware DTDs
Part 10: Validation Management
What DSDL Should Bring You
B. The GNU Free Documentation License
GNU Free Documentation License
0. Preamble
1. APPLICABILITY AND DEFINITIONS
2. VERBATIM COPYING
3. COPYING IN QUANTITY
4. MODIFICATIONS
5. COMBINING DOCUMENTS
6. COLLECTIONS OF DOCUMENTS
7. AGGREGATION WITH INDEPENDENT WORKS
8. TRANSLATION
9. TERMINATION
10. FUTURE REVISIONS OF THIS LICENSE
Addendum: How to use this License for your documents
Glossary
Index
About the Author
Colophon
Copyright

Overview

As developers know, the beauty of XML is that it is extensible, even to the point that you can invent new elements and attributes as you write XML documents. Then, however, you need to define your changes so that applications will be able to make sense of them, and this is where XML schema languages come into play. RELAX NG (pronounced "relaxing"), the Regular Language Description for XML Core--New Generation, is quickly gaining momentum as an alternative to other schema languages. Designed to solve a variety of common problems raised in the creation and sharing of XML vocabularies, RELAX NG is less complex than the W3C's XML Schema Recommendation and much more powerful and flexible than DTDs.

RELAX NG is a grammar-based schema language that's both easy to learn for schema creators and easy to implement for software developers. In RELAX NG, developers are introduced to this unique language and will learn a no-nonsense method for creating XML schemas. This book offers a clear-cut explanation of RELAX NG that enables intermediate and advanced XML developers to focus on XML document structures and content rather than battle the intricacies of yet another convoluted standard.

RELAX NG covers the following topics in depth:

Introduction to RELAX NG
Building RELAX NG schemas using XML syntax
Building RELAX NG schemas using compact syntax, an alternative non-XML syntax
Flattening schemas to limit depth and provide reusability
Using external datatype libraries with RELAX NG
W3C XML Schema regular expressions
Writing extensible schemas
Annotating schemas
Generating schemas from different sources
Determinism and datatype assignment

and much more.

If you're looking for a schema language that's easy to use and won't leave you in a labyrinth of obscure limitations, RELAX NG is the language you should be using. And only O'Reilly's RELAX NG gives you the straightforward information and everything else you'll need to take advantage of this powerful and intelligible language.

Publisher Resources

ISBN: 0596004214
