book

eXist

by Erik Siegel, Adam Retter

December 2014

Beginner

584 pages

15h 13m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Preface
WelcomeWho Is This Book For?Conventions Used in This BookXQuery Filename ConventionsUsing Code ExamplesAccompanying Source CodeGetting the Source CodeBuilding and DeployingBuilding everythingBuilding the EXPath packageCompiling the Java examplesSafari® Books OnlineHow to Contact UsAcknowledgments
1. Introduction
What Is eXist?eXist Compared to Other Database SystemsHistoryCompetitorsOpen Source CompetitorsBaseXSednaClosed Source, Commercial Competitors28.ioMarkLogic ServerWho Is Using eXist, and for What?Contributing to the CommunityIndividuals Using eXistOrganizations Using eXistAuthors Using eXistDevelopers Using eXistAdditional Resources
2. Getting Started
Downloading and Installing eXistPreconditionsDownloading eXistThings to Decide Before InstallingInstalling eXistPost-Installation ChecksStarting and Stopping eXist with a GUIStarting and Stopping eXist from the Command LineA First Tour Around TownThe DashboardPlaying AroundWhat’s in Your DatabaseWhat’s on Your DiskThe Java Admin ClientGetting Files into and out of the DatabaseHello eXist!Hello DataHello XQueryHello XSLTHello XIncludeHello XForms
3. Using eXist 101
Preparations and Basic Application SetupeXist TerminologyExporting Documents from eXistDesigning an Application’s Collection Structure and Importing DataViewing the DataListing the Plays (XML)Listing with the collection FunctionListing with the xmldb Extension ModuleListing the Plays (HTML)Analyzing the PlaysLinking the Analysis to the Play OverviewSearching the PlaysSearching Using Straight XQuerySearching Using an IndexCreating a LogWhat’s Next?
4. Architecture
Deployment ArchitecturesEmbedded ArchitectureClient/Server Database ArchitectureWeb Application Platform ArchitectureStorage ArchitectureXML Document Storage and IndexingBinary Document StorageEfficient XML Processing ArchitectureCollectionsDocumentsDynamic Level Numbering of NodesDynamic Level Numbering and UpdatesPaging and Caching
5. Working with the Database
The Database’s ContentHelp: Where Is My XML?TerminologyProperties of Collections and ResourcesSystem CollectionsAddressing Collections, Resources, and FilesUse URIsRelative versus absolute pathsXMLDB URIsAccessing filesThe XPath Collection and Doc Functions in eXistThe collection FunctionThe doc FunctionQuerying the Database Using RESTSecurityGET RequestsPUT RequestsDELETE RequestsPOST RequestsExtended query requests XML formatAd Hoc QueryingQuerying using eXideQuerying using the eXist client toolUpdating DocumentseXist’s XQuery Update Extensionsupdate deleteupdate insertupdate renameupdate replaceupdate valueXUpdateXUpdate XML formatExecuting XUpdateControlling the Database from CodeSpecifying Collections and Resources for the xmldb Extension ModuleAccessing external databases using extended XMLDB URIsGetting InformationCreating Resources and CollectionsSetting PermissionsMoving, Removing, and Renaming
6. XQuery for eXist
eXist’s XQuery ImplementationXQuery 1.0 SupportXQuery 3.0 SupportXPath 3.0 functionstry/catchswitch expressionHigher-order functionsThe simple map operatorThe string concatenation operatorAnnotationsControlling serializationThe group by clauseOther XQuery ExtrasThe map data type proposed for XQuery 3.1Java bindingXQuery ExecutionSerializationControlling SerializationSerialization OptionsGeneral serialization optionsPost-processing serialization optionseXist-specific serialization optionsJSON serializationControlling the XQuery ExecutioneXist XQuery PragmasLimiting Execution Time and Output SizeOther OptionsXQuery Documentation with xqDoc
7. Extension Modules
Types of Extension ModulesExtension Modules Written in JavaExtension Modules Written in XQueryEnabling Extension ModulesEnabling Java Extension ModulesRebuilding eXistEnabling XQuery Extension Modules
8. Security
Security BasicsUsersGroupsPermissionsDefault PermissionsUser masksManaging Users and GroupsGroup ManagersTools for User and Group ManagementUsing the Java Admin ClientUsing the User Manager web appExecuting XQuery functionsModifying the security collectionUsing the APIs: XML-RPC, XML:DB, and SOAPUser and Group Management with the Java Admin ClientScenarioUser Management in the Java Admin ClientCreating a GroupCreating UsersSetting Group ManagersManaging PermissionsTools for Permission ManagementUsing the Java Admin ClientUsing the Collections Browser web appExecuting XQuery functionsUsing the XML-RPC or XML:DB APIUsing the eXist Ant tasksPermission Management with the Java Admin ClientAccess Control ListsAccess Control EntriesACLs by ExampleAllowing additional accessRestricting accessAllowing and restricting accessManaging ACLsACL management with the Java Admin ClientRealmsLDAP Realm ModuleLDAP configuration optionsLDAP configuration for Microsoft Active DirectoryOther Realm ModulesHardeningReducing Collateral DamageLinux platformsSolaris platformsWindows platformsReducing the Attack SurfaceDisabling extension modulesDisabling Java Binding from XQueryDisabling direct access to the REST ServerDisabling network services and APIsDisabling autodeployment of EXPath packagesRemoving preinstalled EXPath packagesSecuring eXist’s network servicesReverse proxyingUser Authentication in XQueryxmldb:authenticatexmldb:loginBackups
9. Building Applications
OverviewWhat Technology to Use?Application Building AspectsQuickly Getting Started?Where to Store Your Application?URL Mapping Using URL RewritingAnatomy of a URL Rewriting−Based ApplicationHow eXist Finds the ControllerThe URL Rewrite Controller’s EnvironmentThe Controller’s Output XML FormatIgnoring the requestRedirecting the requestForwarding the requestURL rewrite cachingAdvanced URL ControlChanging the URL for URL RewritingChanging Jetty Settings: Port Number and URL PrefixThe controller-config.xml Configuration FileProxying eXist Behind a Web ServerRequests, Sessions, and ResponsesThe Request Extension ModuleRequest parameters and attributesUploading filesThe Session Extension ModuleThe Response Extension ModuleCreating “download XML file” functionalityApplication SecurityRunning with Extra PermissionsGlobal Error PagesBuilding Applications with RESTXQConfiguring RESTXQRESTXQ AnnotationsHTTP method constraint annotationsURI path constraint annotationConsumes constraint annotationProduces constraint annotationParameter annotationsRESTXQ XQuery Extension FunctionsPackagingExamplesThe Packaging FormatThe expath-pkg.xml fileThe repo.xml fileThe Prepare and Finish ScriptsCreating PackagesAdditional Remarks About Packages

10. Other XML Technologies
XSLTEmbedding Stylesheets or NotInvoking XSLT with the Transform Extension ModulePassing XSLT ParametersInvoking XSLT by Processing InstructionStylesheet DetailsXIncludeIncluding DocumentsIncluding Query ResultsError Handling and FallbackValidationImplicit ValidationControlling implicit validationSpecifying catalogs for implicit validationExplicit ValidationPerforming explicit validationGrammar management in the JAXP (Xerces) parserCollationsSupported CollationsSpecifying CollationsXSL-FOXFormsXForms InstancesInstances and the REST ServerInstances and XQueryXForms SubmissionsSubmission to the REST ServerSubmission via XQuerySubmission authenticationbetterFormXSLTForms
11. Basic Indexing
Indexing ExampleIndex TypesStructural IndexRange IndexesNGram IndexesFull-Text IndexesConfiguring IndexesConfiguring Range IndexesConfiguring NGram IndexesMaintaining IndexesUsing IndexesUsing the Structural IndexUsing the Range IndexesUsing the NGram IndexesGeneral Optimization TipsDebugging IndexesChecking Index DefinitionsChecking Index UsageTracing the Optimizer
12. Text Indexing and Lookup
Full-Text Index and KWIC ExampleConfiguring Full-Text IndexesConfiguring the Search ContextChoosing the correct contextSearch context and performanceHandling Mixed ContentInline content and whitespaceIgnoring inline contentMaintaining the Full-Text IndexSearching with the Full-Text IndexBasic Search OperationsLucene native query syntaxThe full-text query XML specificationAdditional search parametersScoring SearchesLocating MatchesUsing Keywords in ContextDefining and Configuring the Lucene AnalyzerManual Full-Text Indexing
13. Integration
Choosing an APIRemote APIsWebDAVUsing WebDAV from Microsoft WindowsMapping a drive to eXist WebDAV from Windows ExplorerUsing WebDAV from Mac OS XMounting eXist WebDAV from FinderUsing WebDAV from LinuxUsing WebDAV from GNOME NautilusMounting eXist WebDAV from NautilusUsing WebDAV with FUSEInstalling davfs2 in Debian-based distributionsInstalling davfs2 in distributions with RPM packagesUsing WebDAV from JavaExamplesStore exampleRetrieve exampleREST Server APIRetrieving collections and documentsXSL transformationStoring a documentDeleting collections and documentsQuerying the databaseHTTP GET queriesHTTP POST queriesREST Server parameters and paging resultsUpdating the databaseExecuting stored queriesStore a JPEG image received over HTTP into the databaseRetrieve a stored image from the databaseRetrieve a thumbnail representation of an image from the database.Using the REST Server API from JavaExamplesStore exampleRetrieve exampleQuery exampleRemove exampleXML-RPC APIUsing the XML-RPC Client API from JavaExamplesClassic store exampleProxy store exampleUsing the XML-RPC Client API from PythonPython XML-RPC proxy store exampleXML:DB Remote APIUsing the XML:DB Remote API from JavaExamplesStore exampleRetrieve exampleQuery exampleRemove exampleRESTXQStore a JPEG image received over HTTP into the databaseRetrieve a stored image from the databaseRetrieve a thumbnail representation of an image from the databaseXQJExamplesQuery exampleDeprecated Remote APIsAtom ServletSOAP APISOAP ServerRemote API Libraries for Other LanguagesCommunity APIs for eXist by programming languageLocal APIsXML:DB Local APIExampleXML:DB local exampleFluent APIExampleFluent API example
14. Tools
Java Admin ClienteXideoXygenConnecting with oXygen Using WebDAVNatively Connecting with oXygenAnt and eXistTrying the Ant ExamplesPreparing an eXist Ant Build ScriptUsing Ant with eXistBasic example: Listing the main collectionsBackup and shutdownCreate separate backups for all subcollectionsRun an XQuery from Ant
15. System Administration
LoggingJMXMemory and Cache TuningUnderstanding Memory UseWeb Admin StatusXQueryVisualVMJava Mission ControlCache TuningBackup and RestoreClient-Side Data Export BackupJava Admin Client backupCommand-line backupAnt backup taskServer-Side Data Export BackupScheduled backupsBackups from XQueryDashboard backups appRestoring a Clean DatabaseEmergency Export ToolInstalling eXist as a ServiceSolarisWindows Linux and Other UnixHosting and the CloudEnticAmazon EC2eXist AMIInstallationServiceAdministeringOther Cloud ProvidersGreenQloudDigital OceanGetting SupportCommunity SupportCommercial Support
16. Advanced Topics
XQuery TestingVersioningHistorical ArchivalDocument RevisionsWrite conflict avoidanceScheduled JobsScheduling JobsXQuery JobsScheduled weather retrieval (XQuery)Java JobsJava user jobScheduled weather retrieval (Java)Java system task jobDatabase stats scheduled system taskStartup TriggersConfigured Modules Example Startup TriggerDatabase TriggersXQuery TriggersJava TriggersJava collection triggersNo delete example collection triggerJava document triggersExample Filtering TriggerInternal XQuery Library ModulesUsing the Hello Word ModuleTypes and CardinalityFunction Parameters and Return TypesVariable DeclarationsModule ConfigurationDeveloping eXistBuilding eXist from SourceDebugging eXistRemote debugging with NetBeans IDE
A. XQuery Extension Modules
Extension Modules by CategoryAdditional dataCoreDatatype ExtensionsDatabase FunctionalityIndexingProtocols/InterfacesXML TechnologiesXQueryExtension Module Descriptionscachecompressioncontentextractioncounterdatetimeexiexiftoolfileftftpclienthttphttpclientimageinspectjfreechartjndijsonjsonpkwicmailmapmathmetadatangramreporequestresponserestxqrestxqexschedulersequencessessionsmsortsqlsystemtexttransformutilvalidationversioningxmlcalabashxmldbxmldiffxmppxqjsonxqdmxslfozip
B. REST Server Processes
GET Process FlowHEAD Process FlowPUT Process FlowDELETE Process FlowPOST Process FlowREST Server ParametersHTTP GET ParametersHTTP POST ParametersCommon XML Grammars for ParametersWrap XML grammarProperties XML grammarText XML grammarVariables XML grammar
Index
Colophon
Copyright

Content preview from eXist

Chapter 4. Architecture

In this chapter we look in detail at how eXist is constructed and how it processes your XML documents and executes your XQueries. This information should be considered advanced, so if you are a beginner you may want to skip to another chapter. However, for those wishing to master eXist, this information can be invaluable in helping you understand how to use it efficiently.

eXist is a large software project that has evolved over the last 13 years, and is written predominantly in the Java programming language. Although extensions and add-ons to eXist are often written in pure XQuery and/or XSLT, the main body of eXist is written in Java.

Regardless of how you decide to deploy and use eXist, its architecture (see Figure 4-1) predominately remains the same, with various optional components depending on your use.

Figure 4-1. Complete high-level diagram of eXist’s architecture

Each connection from an API to eXist is a single thread that interacts with the broker pool, which is configured with a number of brokers (20 by default). Each broker is a thread that interacts with the database, and represents a database request; this might be a database update operation (add/delete/store/update) or a query operation (XPath, XQuery, or XSLT). Should you connect to eXist and all the brokers are busy, your request will pause until a broker becomes available to service your request. ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9781449337094Errata Page

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

eXist

by Erik Siegel, Adam Retter

Chapter 4. Architecture

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.