book

High Performance Drupal

by Jeff Sheltren, Narayan Newton, Nathaniel Catchpole

October 2013

Intermediate to advanced

261 pages

7h 4m

English

O'Reilly Media, Inc.

Read now

Unlock full access

High Performance Drupal
Dedication
Preface
Does Drupal Scale?Goals of This BookSupported Drupal VersionsHow This Book Is OrganizedPerformance AnalysisApplication PerformanceInfrastructureDatabasesWeb Servers and Reverse ProxiesOngoing TestingWhere to Next?Conventions Used in This BookUsing Code ExamplesSafari® Books OnlineHow to Contact UsAcknowledgmentsFrom JeffFrom NarayanFrom Nat
1. Beginning a Performance Project
Getting Started with Performance ImprovementsEstablishing a Performance BaselineSetting Goals for Website PerformanceThe Many Aspects of Drupal PerformanceCreating a Prioritized List of Improvements
2. Frontend Performance
Limiting HTTP RequestsAuditsImage RequestsMinificationMinification On the FlyPreminification for Modules/ThemesMinifying During the Build ProcessCompressionCacheable HeadersCDNsKeep Third-Party Libraries Up to DatejQuery UpdateExternal ScriptsSingle Points of Failure (SPOFs)
3. Drupal Performance Out of the Box
Page CachingWhen Should You Use Page Caching?Internal Page CachingReverse Proxy CachingCSS and JavaScript AggregationLoggingThe Cache and Other Swappable StorageCronViews
4. Drupal Coding for Optimal Performance
Context MattersFalse OptimizationsListing EntitiesentityQuery()Multiple Entity LoadingCachingStatic CachingPersistent CachingCache chainsCache binsgetMultiple()/setMultiple()/deleteMultiple()Cache tagsCacheArrayRender cachingQueues and WorkersCache Stampedes and Race Conditions
5. Drupal Coding for Abysmal Performance
variable_set() AbuseExternal RequestsSessionsExcessive Cache GranularityPHP ErrorsDebug Code in the Code BaseDevelopment Settings
6. Verifying Changes
Analyzing Frontend PerformanceYSlow and Google PageSpeedWaterfall ChartsReal User MonitoringAnalyzing Application PerformanceThe Devel ModulePage timingMemory usageQuery logXdebugXHProfstrace
7. Infrastructure Design and Planning
Horizontal and Vertical ScalingService CategorizationWorking Well TogetherExample Two-Layer ConfigurationExample Larger-Scale InfrastructureDevelopment and Staging EnvironmentsInternal Network LayoutUtility ServersHigh Availability and FailoverHosting ConsiderationsSummary

8. Service Monitoring
The Importance of Monitoring ServicesMonitoring Alerts with IcingaWhat to MonitorHow to Tune MonitoringGraphing Monitoring DataInternal Versus Remote Monitoring
9. “DevOps”: Breaking Down Barriers Between Development and Operations
Revision Control SystemsLocally Hosted or External ServiceNot Just for CodeConfiguration Management SystemsWhich System to UsePulling It Together: In-Depth Example with Puppet and GitDevelopment Virtual MachinesHow to Distribute Development VMs with VagrantDeployment WorkflowExample Workflow with GitDeployment with Jenkins CI
10. File Storage for Multiple Web Servers
rsyncGlusterFSExample ConfigurationSingle NFS ServerHA NFS ClusterExample ConfigurationSetting Up DRBDSetting Up HeartbeatSetting Up NFSTestingStorage Area Networks (SANs)
11. Drupal and Cloud Deployments
What Is the Cloud?Why Use the Cloud?Infrastructure OverheadPrepackaged CloudsCommon Issues with Cloud Deployments and Their Mitigations
12. Failover Configuration
IP Failover Versus DNS FailoverService-Level IssuesHeartbeatInstallationConfigurationUsage
13. MySQL
Drupal and MySQL EnginesVersions of MySQLOracle MySQLMariaDBPercona ServerGeneral ConfigurationGlobal ConfigurationPer-Thread ConfigurationStorage Engine ConfigurationReplicationVirtualized Deployments
14. Tools for Managing and Monitoring MySQL
Percona ToolkitOpenark KitmysqlreportPercona Monitoring Plug-Ins
15. MySQL Query Optimization
Index BasicsBase Tables and Join OrderCommon IssuesThe ORDER BY on an Unrelated TableThe Useless DISTINCT (“In Case of Accidents!”)Starfish Syndrome (All LEFT JOINS)Node Access
16. Alternative Storage and Cache Backends
Cache, Lock, and Session StorageMemcache In DepthPHP Extensions for MemcacheAssigning Memcached Servers and BinsMemcache Locking and Stampede ProtectionWhat to Store in MemcacheConfiguring the Memcache DaemonHow to Break Your Site with MemcacheInconsistent CachingConstant EvictionsVanishing SessionsEntity/Field StorageEntityFieldQuery/EntityQueryCRUDMongoDB
17. Solr Search
Performance and Scalability ConsiderationsIntegrating Solr with DrupalSolr ConfigurationIndexing ContentInfrastructure ConsiderationsSolr ReplicationDrupal Module Installation
18. PHP and httpd Configuration
APC: PHP Opcode Cachephp.ini SettingsPHP Apache Module Versus CGIApache MPM SettingsPrefork Thread SettingsKeepAliveCache HeadersLoggingServer SignatureAdministrative Directory or VirtualHostNginxWhy Not Use Nginx Everywhere?
19. Reverse Proxies and Content Delivery Networks
Using a Reverse Proxy with DrupalUnderstanding Varnish Configuration LanguageDefining a BackendDirectors: Dealing with Multiple Backend ServersBuilt-in VCL SubroutinesCustomizing SubroutinesCookies and VarnishCaching for Authenticated UsersEdge-Side IncludesServing Expired ContentError PagesMemory AllocationLogging and Monitoring VarnishSample VCL for DrupalContent Delivery NetworksServing Static Content Through a CDNWhen to Use a CDNChoosing Between a CDN and a Reverse Proxy
20. Load Testing
Different Types of Load TestsCreating a Valid TestWhen to TestContinuous Integration (CI)Periodic TestingManual Targeted TestingInterpreting Test ResultsServer Monitoring During Load TestsWhere to TestExample Load Test Using JMeterGlobal Test SettingsThread GroupsHandling CookiesLogin ControllerBrowse ControllerOutput ConfigurationRunning a TestReading Test Results
21. Where to Next?
Official Book WebsiteHigh Performance Drupal GroupDrupal WatchdogRevision Control with GitVarnishConfiguration ManagementVagrantJenkinsMySQL PerformanceInnoDB Index Structures
Index
About the Authors
Colophon
Copyright

Content preview from High Performance Drupal

Chapter 1. Beginning a Performance Project

So you’re ready to jump in and start improving your website’s performance. This can be a daunting task. There are so many services, underlying technologies, and possible problems that it can be difficult to pick a starting point. It is easy to run around in circles, checking and fixing many small issues but never addressing your major problems (or even discovering what they are). Knowing where to start and which issues are of high priority can be one of the most difficult parts of optimizing a site.

Due to these common issues of discovery and prioritization, good performance engineers and system administrators tend to do a lot more gathering of metrics and statistics than most people think. A complete understanding of the problem points of a website (problem pages, blocks, or views) and server metrics during average- and high-load situations is a requirement for making good decisions. Whenever we approach a new infrastructure or website project, the investigation and metrics collection period is often the most important time and will determine how effective the entire optimization project is.

Getting Started with Performance Improvements

We will discuss tools and methodologies for collecting performance information in later chapters. For now, let us assume we have a spreadsheet of problem pages or requests and some server information (CPU, load, I/O usage, etc.) during some peak load periods. The next important step in optimizing a site is ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9781449358013Errata Page

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

High Performance Drupal

by Jeff Sheltren, Narayan Newton, Nathaniel Catchpole

Chapter 1. Beginning a Performance Project

Getting Started with Performance Improvements

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.