book

CouchDB: The Definitive Guide

by J. Chris Anderson, Jan Lehnardt, Noah Slater

January 2010

Intermediate to advanced

272 pages

6h 41m

English

O'Reilly Media, Inc.

Read now

Unlock full access

CouchDB: The Definitive Guide
Dedication
SPECIAL OFFER: Upgrade this ebook with O’Reilly
Foreword
Preface
Using Code ExamplesConventions Used in This BookSafari® Books OnlineHow to Contact UsAcknowledgmentsJ. ChrisJanNoah
I. Introduction
1. Why CouchDB?
RelaxA Different Way to Model Your DataA Better Fit for Common ApplicationsSelf-Contained DataSyntax and SemanticsBuilding Blocks for Larger SystemsCouchDB ReplicationLocal Data Is KingWrapping Up
2. Eventual Consistency
Working with the GrainThe CAP TheoremLocal ConsistencyThe Key to Your DataNo LockingValidationDistributed ConsistencyIncremental ReplicationCase StudyWrapping Up
3. Getting Started
All Systems Are Go!Welcome to FutonYour First Database and DocumentRunning a Query Using MapReduceTriggering ReplicationWrapping Up
4. The Core API
ServerDatabasesDocumentsRevisionsDocuments in DetailAttachmentsReplicationWrapping Up

II. Developing with CouchDB
5. Design Documents
Document ModelingThe Query ServerApplications Are DocumentsA Basic Design DocumentLooking to the Future
6. Finding Your Data with Views
What Is a View?Efficient LookupsFind OneFind ManyReversed ResultsThe View to Get Comments for PostsReduce/RereduceLessons LearnedWrapping Up
7. Validation Functions
Document Validation FunctionsValidation’s ContextWriting OneTypeRequired FieldsTimestampsAuthorshipWrapping Up
8. Show Functions
The Show Function APISide Effect–FreeDesign DocumentsQuerying Show FunctionsDesign Document ResourcesQuery ParametersAccept HeadersEtagsFunctions and TemplatesThe !json MacroThe !code MacroLearning ShowsUsing TemplatesWriting Templates
9. Transforming Views with List Functions
Arguments to the List FunctionAn Example List FunctionList TheoryQuerying ListsLists, Etags, and Caching
III. Example Application
10. Standalone Applications
Use the Correct VersionPortable JavaScriptApplications Are DocumentsStandaloneIn the WildWrapping Up
11. Managing Design Documents
Working with the Example ApplicationInstalling CouchAppUsing CouchAppDownload the Sofa Source CodeCouchApp CloneZIP and TAR FilesJoin the Sofa Development Community on GitHubThe Sofa Source TreeDeploying SofaPushing Sofa to Your CouchDBVisit the ApplicationSet Up Your Admin AccountDeploying to a Secure CouchDBConfiguring CouchApp with .couchapprc
12. Storing Documents
JSON Document FormatBeyond _id and _rev: Your Document DataThe Edit PageThe HTML ScaffoldSaving a DocumentValidationSave Your First PostWrapping Up
13. Showing Documents in Custom Formats
Rendering Documents with Show FunctionsThe Post Page TemplateDynamic Dates
14. Viewing Lists of Blog Posts
Map of Recent Blog PostsRendering the View as HTML Using a List FunctionSofa’s List FunctionThe Final Result
IV. Deploying CouchDB
15. Scaling Basics
Scaling Read RequestsScaling Write RequestsScaling DataBasics First
16. Replication
The MagicSimple Replication with the Admin InterfaceReplication in DetailContinuous ReplicationThat’s It?
17. Conflict Management
The Split BrainConflict Resolution by ExampleWorking with ConflictsDeterministic Revision IDsWrapping Up
18. Load Balancing
Having a Backup
19. Clustering
Introducing CouchDB LoungeConsistent HashingRedundant StorageRedundant ProxiesView MergingGrowing the ClusterMoving PartitionsSplitting Partitions
V. Reference
20. Change Notifications
Polling for ChangesLong PollingContinuous ChangesFiltersWrapping Up
21. View Cookbook for SQL Jockeys
Using ViewsDefining a ViewQuerying a ViewMapReduce FunctionsMap functionsReduce functionsLook Up by KeyLook Up by PrefixAggregate FunctionsGet Unique ValuesEnforcing Uniqueness
22. Security
The Admin PartyCreating New Admin UsersHashing PasswordsBasic AuthenticationUpdate Validations AgainCookie AuthenticationNetwork Server Security
23. High Performance
Good Benchmarks Are Non-TrivialHigh Performance CouchDBHardwareAn Implementation NoteBulk Inserts and Mostly Monotonic DocIDsOptimized Examples: Views and ReplicationBulk Document InsertsBatch ModeSingle Document InsertsHovercraftTrade-OffsBut…My Boss Wants Numbers!A Call to Arms
24. Recipes
BankingAccountants Don’t Use ErasersWrapping UpOrdering ListsA List of IntegersA List of FloatsPaginationExample DataA ViewSetupSlow Paging (Do Not Use)The dealbreakerFast Paging (Do Use)Jump to Page
VI. Appendixes
A. Installing on Unix-like Systems
Debian GNU/LinuxUbuntuGentoo LinuxProblems
B. Installing on Mac OS X
CouchDBXHomebrewMacPorts
C. Installing on Windows
D. Installing from Source
DependenciesDebian-Based (Including Ubuntu) SystemsMac OS XInstallingSecurity ConsiderationsRunning ManuallyRunning As a DaemonSysV/BSD-Style SystemsMac OS XTroubleshooting
E. JSON Primer
Data TypesNumbersStringsBooleansArraysObjectsNulls
F. The Power of B-trees
Index
About the Authors
Colophon
SPECIAL OFFER: Upgrade this ebook with O’Reilly
Copyright

Content preview from CouchDB: The Definitive Guide

Chapter 19. Clustering

OK, you’ve made it this far. I’m assuming you more or less understand what CouchDB is and how the application API works. Maybe you’ve deployed an application or two, and now you’re dealing with enough traffic that you need to think about scaling. “Scaling” is an imprecise word, but in this chapter we’ll be dealing with the aspect of putting together a partitioned or sharded cluster that will have to grow at an increasing rate over time from day one.

We’ll look at request and response dispatch in a CouchDB cluster with stable nodes. Then we’ll cover how to add redundant hot-failover twin nodes, so you don’t have to worry about losing machines. In a large cluster, you should plan for 5–10% of your machines to experience some sort of failure or reduced performance, so cluster design must prevent node failures from affecting reliability. Finally, we’ll look at adjusting cluster layout dynamically by splitting or merging nodes using replication.

Introducing CouchDB Lounge

CouchDB Lounge is a proxy-based partitioning and clustering application, originally developed for Meebo, a web-based instant messaging service. Lounge comes with two major components: one that handles simple GET and PUT requests for documents, and another that distributes view requests.

The dumbproxy handles simple requests for anything that isn’t a CouchDB view. This comes as a module for nginx, a high-performance reverse HTTP proxy. Because of the way reverse HTTP proxies work, this automatically ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9780596158156Errata Page

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

CouchDB: The Definitive Guide

by J. Chris Anderson, Jan Lehnardt, Noah Slater

Chapter 19. Clustering

Introducing CouchDB Lounge

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.