book

Graph Databases

by Jim Webber, Emil Eifrem, Ian Robinson

June 2013

Intermediate to advanced

221 pages

6h 16m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Foreword
Graphs Are Everywhere, or the Birth of Graph Databases as We Know Them
Preface
About This BookConventions Used in This BookUsing Code ExamplesSafari® Books OnlineHow to Contact UsAcknowledgments
1. Introduction
What Is a Graph?A High-Level View of the Graph SpaceGraph DatabasesGraph Compute EnginesThe Power of Graph DatabasesPerformanceFlexibilityAgilitySummary
2. Options for Storing Connected Data
Relational Databases Lack RelationshipsNOSQL Databases Also Lack RelationshipsGraph Databases Embrace RelationshipsSummary
3. Data Modeling with Graphs
Models and GoalsThe Property Graph ModelQuerying Graphs: An Introduction to CypherCypher PhilosophySTARTMATCHRETURNOther Cypher ClausesA Comparison of Relational and Graph ModelingRelational Modeling in a Systems Management DomainGraph Modeling in a Systems Management DomainTesting the ModelCross-Domain ModelsCreating the Shakespeare GraphBeginning a QueryDeclaring Information Patterns to FindConstraining MatchesProcessing ResultsQuery ChainingCommon Modeling PitfallsEmail Provenance Problem DomainA Sensible First Iteration?Second Time’s the CharmEvolving the DomainAvoiding Anti-PatternsSummary
4. Building a Graph Database Application
Data ModelingDescribe the Model in Terms of the Application’s NeedsNodes for Things, Relationships for StructureFine-Grained versus Generic RelationshipsModel Facts as NodesEmploymentPerformanceEmailingReviewingRepresent Complex Value Types as NodesTimeTimeline treesLinked listsVersioningIterative and Incremental DevelopmentApplication ArchitectureEmbedded Versus ServerEmbedded Neo4jServer modeServer extensionsClusteringReplicationBuffer writes using queuesGlobal clustersLoad BalancingSeparate read traffic from write trafficCache shardingRead your own writesTestingTest-Driven Data Model DevelopmentExample: A test-driven social network data modelTesting server extensionsPerformance TestingQuery performance testsApplication performance testsTesting with representative dataCapacity PlanningOptimization CriteriaPerformanceCalculating the cost of graph database performancePerformance optimization optionsRedundancyLoadSummary
5. Graphs in the Real World
Why Organizations Choose Graph DatabasesCommon Use CasesSocialRecommendationsGeoMaster Data ManagementNetwork and Data Center ManagementAuthorization and Access Control (Communications)Real-World ExamplesSocial Recommendations (Professional Social Network)Talent.net data modelInferring social relationsFinding colleagues with particular interestsAdding WORKED_WITH relationshipsAuthorization and Access ControlTeleGraph data modelFinding all accessible resources for an administratorDetermining whether an administrator has access to a resourceFinding administrators for an accountGeo (Logistics)Global Post data modelRoute calculationFinding the shortest delivery route using CypherImplementing route calculation with the traversal frameworkSummary
6. Graph Database Internals
Native Graph ProcessingNative Graph StorageProgrammatic APIsKernel APICore (or “Beans”) APITraversal APINonfunctional CharacteristicsTransactionsRecoverabilityAvailabilityScaleCapacityLatencyThroughputSummary
7. Predictive Analysis with Graph Theory
Depth- and Breadth-First SearchPath-Finding with Dijkstra’s AlgorithmThe A* AlgorithmGraph Theory and Predictive ModelingTriadic ClosuresStructural BalanceLocal BridgesSummary
A. NOSQL Overview
The Rise of NOSQLACID versus BASEThe NOSQL QuadrantsDocument StoresKey-Value StoresColumn FamilyQuery versus Processing in Aggregate StoresGraph DatabasesProperty GraphsHypergraphsTriples

Index
About the Authors
Colophon
Copyright

Content preview from Graph Databases

Chapter 1. Introduction

Although much of this book talks about graph data models, it is not a book about graph theory.^[2] We don’t need much theory to take advantage of graph databases: provided we understand what a graph is, we’re practically there. With that in mind, let’s refresh our memories about graphs in general.

What Is a Graph?

Formally, a graph is just a collection of vertices and edges—or, in less intimidating language, a set of nodes and the relationships that connect them. Graphs represent entities as nodes and the ways in which those entities relate to the world as relationships. This general-purpose, expressive structure allows us to model all kinds of scenarios, from the construction of a space rocket, to a system of roads, and from the supply-chain or provenance of foodstuff, to medical history for populations, and beyond.

Graphs are extremely useful in understanding a wide diversity of datasets in fields such as science, government, and business. The real world—unlike the forms-based model behind the relational database—is rich and interrelated: uniform and rule-bound in parts, exceptional and irregular in others. Once we understand graphs, we begin to see them in all sorts of places. Gartner, for example, identifies five graphs in the world of business—social, intent, consumption, interest, and mobile—and says that the ability to leverage these graphs provides a “sustainable competitive advantage.”

For example, Twitter’s data is easily represented ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9781449356255Errata Page

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Graph Databases

by Jim Webber, Emil Eifrem, Ian Robinson

Chapter 1. Introduction

What Is a Graph?

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.