book

Clojure High Performance Programming - Second Edition

Name: Clojure High Performance Programming - Second Edition
Author: Shantanu Kumar
ISBN: 9781785283642

by Shantanu Kumar

September 2015

Intermediate to advanced

198 pages

4h 52m

English

Packt Publishing

Read now

Unlock full access

Clojure High Performance Programming Second Edition
Table of Contents
Clojure High Performance Programming Second Edition
Credits
About the Author
About the Reviewers
www.PacktPub.com
Support files, eBooks, discount offers, and moreWhy subscribe?Free access for Packt account holders
Preface
What this book covers
What you need for this book
Who this book is for

Conventions
Reader feedback
Customer support
ErrataPiracyeBooks, discount offers, and moreQuestions
1. Performance by Design
Use case classificationThe user-facing softwareComputational and data-processing tasksA CPU bound computationA memory bound taskA cache bound taskAn input/output bound taskOnline transaction processingOnline analytical processingBatch processing
A structured approach to the performance
The performance vocabulary
LatencyThroughputBandwidthBaseline and benchmarkProfilingPerformance optimizationConcurrency and parallelismResource utilizationWorkload
The latency numbers that every programmer should know
Summary
2. Clojure Abstractions
Non-numeric scalars and interning
Identity, value, and epochal time model
Variables and mutationCollection types
Persistent data structures
Constructing lesser-used data structuresComplexity guaranteeO(<7) implies near constant timeThe concatenation of persistent data structures
Sequences and laziness
LazinessLaziness in data structure operationsConstructing lazy sequencesCustom chunkingMacros and closures
Transducers
Performance characteristics
Transients
Fast repetition
Performance miscellanea
Disabling assertions in productionDestructuringRecursion and tail-call optimization (TCO)Premature end of iterationMultimethods versus protocolsInlining
Summary
3. Leaning on Java
Inspecting the equivalent Java source for Clojure codeCreating a new projectCompiling the Clojure sources into Java bytecodeDecompiling the .class files into Java sourceCompiling the Clojure source without locals clearing
Numerics, boxing, and primitives
Arrays
Reflection and type hints
An array of primitivesPrimitivesMacros and metadataString concatenationMiscellaneous
Using array/numeric libraries for efficiency
HipHipprimitive-mathDetecting boxed math
Resorting to Java and native code
Proteus – mutable locals in Clojure
Summary
4. Host Performance
The hardwareProcessorsBranch predictionInstruction schedulingThreads and coresMemory systemsCacheInterconnectStorage and networking
The Java Virtual Machine
The just-in-time compilerMemory organizationHotSpot heap and garbage collectionMeasuring memory (heap/stack) usageDetermining program workload typeTackling memory inefficiency
Measuring latency with Criterium
Criterium and Leiningen
Summary
5. Concurrency
Low-level concurrencyHardware memory barrier (fence) instructionsJava support and the Clojure equivalent
Atomic updates and state
Atomic updates in JavaClojure's support for atomic updatesFaster writes with atom striping
Asynchronous agents and state
Asynchrony, queueing, and error handlingWhy you should use agentsNesting
Coordinated transactional ref and state
Ref characteristicsRef history and in-transaction deref operationsTransaction retries and bargingUpping transaction consistency with ensureLesser transaction retries with commutative operationsAgents can participate in transactionsNested transactionsPerformance considerations
Dynamic var binding and state
Validating and watching the reference types
Java concurrent data structures
Concurrent mapsConcurrent queuesClojure support for concurrent queues
Concurrency with threads
JVM support for threadsThread pools in the JVMClojure concurrency supportFuturePromise
Clojure parallelization and the JVM
Moore's lawAmdahl's lawUniversal Scalability LawClojure support for parallelizationpmappcallspvaluesJava 7's fork/join framework
Parallelism with reducers
Reducible, reducer function, reduction transformationRealizing reducible collectionsFoldable collections and parallelism
Summary
6. Measuring Performance
Performance measurement and statisticsA tiny statistics terminology primerMedian, first quartile, third quartilePercentileVariance and standard deviationUnderstanding Criterium outputGuided performance objectives
Performance testing
The test environmentWhat to testMeasuring latencyComparative latency measurementLatency measurement under concurrencyMeasuring throughputAverage throughput testThe load, stress, and endurance tests
Performance monitoring
Monitoring through logsRing (web) monitoringIntrospectionJVM instrumentation via JMX
Profiling
OS and CPU/cache-level profilingI/O profiling
Summary
7. Performance Optimization
Project setupSoftware versionsLeiningen project.clj configurationEnable reflection warningEnable optimized JVM options when benchmarkingDistinguish between initialization and runtime
Identifying performance bottlenecks
Latency bottlenecks in Clojure codeMeasure only when it is hotGarbage collection bottlenecksThreads waiting at GC safepointUsing jstat to probe GC detailsInspecting generated bytecode for Clojure sourceThroughput bottlenecks
Profiling code with VisualVM
The Monitor tab
The Threads tabThe Sampler tabSetting the thread nameThe Profiler tabThe Visual GC tabThe Alternate profilers
Performance tuning
Tuning Clojure codeCPU/cache boundMemory boundMulti-threadedI/O boundJVM tuningBack pressure
Summary
8. Application Performance
Choosing librariesMaking a choice via benchmarksWeb serversWeb routing librariesData serializationJSON serializationJDBC
Logging
Why SLF4J/LogBack?The setupDependenciesThe logback configuration fileOptimization
Data sizing
Reduced serializationChunking to reduce memory pressureSizing for file/network operationsSizing for JDBC query results
Resource pooling
JDBC resource pooling
I/O batching and throttling
JDBC batch operationsBatch support at API levelThrottling requests to services
Precomputing and caching
Concurrent pipelines
Distributed pipelines
Applying back pressure
Thread pool queuesServlet containers such as Tomcat and JettyHTTP KitAleph
Performance and queueing theory
Little's lawPerformance tuning with respect to Little's law
Summary
Index

Content preview from Clojure High Performance Programming - Second Edition

Concurrent pipelines

Imagine a situation where we have to carry out jobs at a certain throughput, such that each job includes the same sequence of differently sized I/O task (task A), a memory-bound task (task B) and, again, an I/O task (task C). A naïve approach would be to create a thread pool and run each job off it, but soon we realize that this is not optimum because we cannot ascertain the utilization of each I/O resource due to unpredictability of the threads being scheduled by the OS. We also observe that even though several concurrent jobs have similar I/O tasks, we are unable to batch them in our first approach.

As the next iteration, we split each job in stages (A, B, C), such that each stage corresponds to one task. Since the tasks ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Clojure: High Performance JVM Programming

Publisher Resources

ISBN: 9781785283642

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Clojure High Performance Programming - Second Edition

by Shantanu Kumar

Concurrent pipelines

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.