Summary
In this chapter, we learned to write Accelerate programs and to run them both with the interpreter and, compiled, on CUDA-capable GPUs. We saw that Accelerate performs code generation of its own internally. Because compiling CUDA kernels is expensive, it is crucial to structure programs so that cached kernels can be reused. We also learned that tuples are a free abstraction in Accelerate: tupled element types are compiled away into separate plain arrays, even though GPUs themselves don't directly support tupled elements.
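As a brief recap, the following sketch illustrates both points, assuming the `accelerate` and `accelerate-cuda` packages as used in this chapter; `minMax` is a hypothetical example program, not code from the chapter:

```haskell
import Data.Array.Accelerate      as A
import Data.Array.Accelerate.CUDA as CUDA

-- Tuples are a free abstraction: a value of type Scalar (Float, Float)
-- is represented as two separate Float arrays under the hood, so the
-- GPU never manipulates an actual tupled element.
minMax :: Acc (Vector Float) -> Acc (Scalar (Float, Float))
minMax = A.fold1 combine . A.map (\x -> A.lift (x, x))
  where
    combine a b =
      let (lo1, hi1) = A.unlift a :: (Exp Float, Exp Float)
          (lo2, hi2) = A.unlift b :: (Exp Float, Exp Float)
      in  A.lift (A.min lo1 lo2, A.max hi1 hi2)

-- run1 compiles the program once; every later call to minMaxGPU reuses
-- the cached CUDA kernels instead of triggering an expensive recompile.
minMaxGPU :: Vector Float -> Scalar (Float, Float)
minMaxGPU = CUDA.run1 minMax
```

Using `run1` (rather than calling `run` on a freshly built AST each time) is the simplest way to guarantee kernel reuse across repeated invocations.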
In the next chapter, we will dive into Cloud Haskell and distributed programming in Haskell. It turns out that Haskell is well suited to programming distributed systems. Cloud Haskell is an effort that streamlines building distributed applications, providing ...