NoSQL for Mere Mortals®

Name: NoSQL for Mere Mortals®
Author: Dan Sullivan
ISBN: 9780134029894

April 2015

Intermediate to advanced

650 pages

10h 47m

English

Addison-Wesley Professional

About This eBook
Title Page
Copyright Page
Dedication Page
About the Author
Contents
Preface
Acknowledgments
Introduction
    Who Should Read This Book?
    The Purpose of This Book
    How to Read This Book
    How This Book Is Organized
    Part I: Introduction
    Part II: Key-Value Databases
    Part III: Document Databases
    Part IV: Column Family Databases
    Part V: Graph Databases
    Part VI: Choosing a Database for Your Application
    Part VII: Appendices
Part I: Introduction

1. Different Databases for Different Requirements
    Relational Database Design
    E-commerce Application
    Early Database Management Systems
    Flat File Data Management Systems
    Hierarchical Data Model Systems
    Network Data Management Systems
    Summary of Early Database Management Systems
    The Relational Database Revolution
    Relational Database Management Systems
    Motivations for Not Just/No SQL (NoSQL) Databases
    Scalability
    Cost
    Flexibility
    Availability
    Summary
    Case Study
    Review Questions
    References
    Bibliography
2. Variety of NoSQL Databases
    Data Management with Distributed Databases
    Store Data Persistently
    Maintain Data Consistency
    Ensure Data Availability
    Balancing Response Times, Consistency, and Durability
    Consistency, Availability, and Partitioning: The CAP Theorem
    ACID and BASE
    ACID: Atomicity, Consistency, Isolation, and Durability
    BASE: Basically Available, Soft State, Eventually Consistent
    Types of Eventual Consistency
    Four Types of NoSQL Databases
    Key-Value Pair Databases
    Document Databases
    Column Family Databases
    Graph Databases
    Summary
    Review Questions
    References
    Bibliography
Part II: Key-Value Databases
3. Introduction to Key-Value Databases
    From Arrays to Key-Value Databases
    Arrays: Key Value Stores with Training Wheels
    Associative Arrays: Taking Off the Training Wheels
    Caches: Adding Gears to the Bike
    In-Memory and On-Disk Key-Value Database: From Bikes to Motorized Vehicles
    Essential Features of Key-Value Databases
    Simplicity: Who Needs Complicated Data Models Anyway?
    Speed: There Is No Such Thing as Too Fast
    Scalability: Keeping Up with the Rush
    Keys: More Than Meaningless Identifiers
    How to Construct a Key
    Using Keys to Locate Values
    Values: Storing Just About Any Data You Want
    Values Do Not Require Strong Typing
    Limitations on Searching for Values
    Summary
    Review Questions
    References
    Bibliography
4. Key-Value Database Terminology
    Key-Value Database Data Modeling Terms
    Key
    Value
    Namespace
    Partition
    Partition Key
    Schemaless
    Key-Value Architecture Terms
    Cluster
    Ring
    Replication
    Key-Value Implementation Terms
    Hash Function
    Collision
    Compression
    Summary
    Review Questions
    References
5. Designing for Key-Value Databases
    Key Design and Partitioning
    Keys Should Follow a Naming Convention
    Well-Designed Keys Save Code
    Dealing with Ranges of Values
    Keys Must Take into Account Implementation Limitations
    How Keys Are Used in Partitioning
    Designing Structured Values
    Structured Data Types Help Reduce Latency
    Large Values Can Lead to Inefficient Read and Write Operations
    Limitations of Key-Value Databases
    Look Up Values by Key Only
    Key-Value Databases Do Not Support Range Queries
    No Standard Query Language Comparable to SQL for Relational Databases
    Design Patterns for Key-Value Databases
    Time to Live (TTL) Keys
    Emulating Tables
    Aggregates
    Atomic Aggregates
    Enumerable Keys
    Indexes
    Summary
    Case Study: Key-Value Databases for Mobile Application Configuration
    Review Questions
    References
Part III: Document Databases
6. Introduction to Document Databases
    What Is a Document?
    Documents Are Not So Simple After All
    Documents and Key-Value Pairs
    Managing Multiple Documents in Collections
    Avoid Explicit Schema Definitions
    Basic Operations on Document Databases
    Inserting Documents into a Collection
    Deleting Documents from a Collection
    Updating Documents in a Collection
    Retrieving Documents from a Collection
    Summary
    Review Questions
    References
7. Document Database Terminology
    Document and Collection Terms
    Document
    Collection
    Embedded Document
    Schemaless
    Polymorphic Schema
    Types of Partitions
    Vertical Partitioning
    Horizontal Partitioning or Sharding
    Data Modeling and Query Processing
    Normalization
    Denormalization
    Query Processor
    Summary
    Review Questions
    References
8. Designing for Document Databases
    Normalization, Denormalization, and the Search for Proper Balance
    One-to-Many Relations
    Many-to-Many Relations
    The Need for Joins
    Executing Joins: The Heavy Lifting of Relational Databases
    What Would a Document Database Modeler Do?
    Planning for Mutable Documents
    Avoid Moving Oversized Documents
    The Goldilocks Zone of Indexes
    Read-Heavy Applications
    Write-Heavy Applications
    Modeling Common Relations
    One-to-Many Relations in Document Databases
    Many-to-Many Relations in Document Databases
    Modeling Hierarchies in Document Databases
    Summary
    Case Study: Customer Manifests
    Embed or Not Embed?
    Choosing Indexes
    Separate Collections by Type?
    Review Questions
    References
Part IV: Column Family Databases
9. Introduction to Column Family Databases
    In the Beginning, There Was Google BigTable
    Utilizing Dynamic Control over Columns
    Indexing by Row, Column Name, and Time Stamp
    Controlling Location of Data
    Reading and Writing Atomic Rows
    Maintaining Rows in Sorted Order
    Differences and Similarities to Key-Value and Document Databases
    Column Family Database Features
    Column Family Database Similarities to and Differences from Document Databases
    Column Family Database Versus Relational Databases
    Architectures Used in Column Family Databases
    HBase Architecture: Variety of Nodes
    Cassandra Architecture: Peer-to-Peer
    Getting the Word Around: Gossip Protocol
    Thermodynamics and Distributed Database: Why We Need Anti-Entropy
    Hold This for Me: Hinted Handoff
    When to Use Column Family Databases
    Summary
    Review Questions
    References
10. Column Family Database Terminology
    Basic Components of Column Family Databases
    Keyspace
    Row Key
    Column
    Column Families
    Structures and Processes: Implementing Column Family Databases
    Internal Structures and Configuration Parameters of Column Family Databases
    Old Friends: Clusters and Partitions
    Taking a Look Under the Hood: More Column Family Database Components
    Processes and Protocols
    Replication
    Anti-Entropy
    Gossip Protocol
    Hinted Handoff
    Summary
    Review Questions
    References
11. Designing for Column Family Databases
    Guidelines for Designing Tables
    Denormalize Instead of Join
    Make Use of Valueless Columns
    Use Both Column Names and Column Values to Store Data
    Model an Entity with a Single Row
    Avoid Hotspotting in Row Keys
    Keep an Appropriate Number of Column Value Versions
    Avoid Complex Data Structures in Column Values
    Guidelines for Indexing
    When to Use Secondary Indexes Managed by the Column Family Database System
    When to Create and Manage Secondary Indexes Using Tables
    Tools for Working with Big Data
    Extracting, Transforming, and Loading Big Data
    Analyzing Big Data
    Tools for Monitoring Big Data
    Summary
    Case Study: Customer Data Analysis
    Understanding User Needs
    Review Questions
    References
Part V: Graph Databases
12. Introduction to Graph Databases
    What Is a Graph?
    Graphs and Network Modeling
    Modeling Geographic Locations
    Modeling Infectious Diseases
    Modeling Abstract and Concrete Entities
    Modeling Social Media
    Advantages of Graph Databases
    Query Faster by Avoiding Joins
    Simplified Modeling
    Multiple Relations Between Entities
    Summary
    Review Questions
    References
13. Graph Database Terminology
    Elements of Graphs
    Vertex
    Edge
    Path
    Loop
    Operations on Graphs
    Union of Graphs
    Intersection of Graphs
    Graph Traversal
    Properties of Graphs and Nodes
    Isomorphism
    Order and Size
    Degree
    Closeness
    Betweenness
    Types of Graphs
    Undirected and Directed Graphs
    Flow Network
    Bipartite Graph
    Multigraph
    Weighted Graph
    Summary
    Review Questions
    References
14. Designing for Graph Databases
    Getting Started with Graph Design
    Designing a Social Network Graph Database
    Queries Drive Design (Again)
    Querying a Graph
    Cypher: Declarative Querying
    Gremlin: Query by Graph Traversal
    Tips and Traps of Graph Database Design
    Use Indexes to Improve Retrieval Time
    Use Appropriate Types of Edges
    Watch for Cycles When Traversing Graphs
    Consider the Scalability of Your Graph Database
    Summary
    Case Study: Optimizing Transportation Routes
    Understanding User Needs
    Designing a Graph Analysis Solution
    Review Questions
    References
Part VI: Choosing a Database for Your Application
15. Guidelines for Selecting a Database
    Choosing a NoSQL Database
    Criteria for Selecting Key-Value Databases
    Use Cases and Criteria for Selecting Document Databases
    Use Cases and Criteria for Selecting Column Family Databases
    Use Cases and Criteria for Selecting Graph Databases
    Using NoSQL and Relational Databases Together
    Summary
    Review Questions
    References
Part VII: Appendices
A. Answers to Chapter Review Questions
    Chapter 1
    Chapter 2
    Chapter 3
    Chapter 4
    Chapter 5
    Chapter 6
    Chapter 7
    Chapter 8
    Chapter 9
    Chapter 10
    Chapter 11
    Chapter 12
    Chapter 13
    Chapter 14
    Chapter 15
B. List of NoSQL Databases
Glossary
Index
Code Snippets

Overview

NoSQL was developed to overcome the limitations of relational databases in the largest Web applications at companies such as Google, Yahoo and Facebook. As it is applied more widely, developers are finding that it can simplify scalability while requiring far less coding and management overhead. However, NoSQL requires fundamentally different approaches to database design and modeling, and many conventional relational techniques lead to suboptimal results.

NoSQL for Mere Mortals is an easy, practical guide to succeeding with NoSQL in your environment. Following the classic, best-selling format pioneered in SQL Queries for Mere Mortals, enterprise database expert Dan Sullivan guides you step by step through choosing technologies, designing high-performance databases, and planning for long-term maintenance.

Sullivan introduces each type of NoSQL database, shows how to install and manage them, and demonstrates how to leverage their features while avoiding common mistakes that lead to poor performance and unmet requirements. He uses four popular NoSQL databases as reference models: MongoDB, a document database; Cassandra, a column family data store; Redis, a key-value database; and Neo4j, a graph database. You'll find explanations of each database's structure and capabilities, practical guidelines for choosing amongst them, and expert guidance on designing databases with them.

Packed with examples, NoSQL for Mere Mortals is today's best way to master NoSQL—whether you're a DBA, developer, user, or student.
