CUDA for Engineers: An Introduction to High-Performance Parallel Computing

ISBN: 9780134177540

by Mete Yurtoglu, Duane Storti

November 2015

Intermediate to advanced

352 pages

9h 45m

English

Addison-Wesley Professional

About This E-Book
Title Page
Copyright Page
Praise for CUDA for Engineers
Dedication Page
Contents
Acknowledgments
About the Authors
Introduction
What Is CUDA?; What Does “Need-to-Know” Mean for Learning CUDA?; What Is Meant by “for Engineers”?; What Do You Need to Get Started with CUDA?; How Is This Book Structured?; Conventions Used in This Book; Code Used in This Book; User’s Guide; Historical Context; References
Chapter 1. First Steps
Running CUDA Samples; CUDA Samples Under Windows; CUDA Samples Under Linux; Estimating “Acceleration”; Running Our Own Serial Apps; dist_v1; dist_v2; Summary; Suggested Projects
Chapter 2. CUDA Essentials
CUDA’s Model for Parallelism; Need-to-Know CUDA API and C Language Extensions; Summary; Suggested Projects; References
Chapter 3. From Loops to Grids
Parallelizing dist_v1; Executing dist_v1_cuda; Parallelizing dist_v2; Standard Workflow; Simplified Workflow; Unified Memory and Managed Arrays; Distance App with cudaMallocManaged(); Summary; Suggested Projects; References
Chapter 4. 2D Grids and Interactive Graphics
Launching 2D Computational Grids; Syntax for 2D Kernel Launch; Defining 2D Kernels; dist_2d; Live Display via Graphics Interop; Application: Stability; Running the Stability Visualizer; Summary; Suggested Projects; References
Chapter 5. Stencils and Shared Memory
Thread Interdependence; Computing Derivatives on a 1D Grid; Implementing dd_1d_global; Implementing dd_1d_shared; Solving Laplace’s Equation in 2D: heat_2d; Sharpening Edges in an Image: sharpen; Summary; Suggested Projects; References
Chapter 6. Reduction and Atomic Functions
Threads Interacting Globally; Implementing parallel_dot; Computing Integral Properties: centroid_2d; Summary; Suggested Projects; References
Chapter 7. Interacting with 3D Data
Launching 3D Computational Grids: dist_3d; Viewing and Interacting with 3D Data: vis_3d; Slicing; Volume Rendering; Raycasting; Creating the vis_3d App; Summary; Suggested Projects; References
Chapter 8. Using CUDA Libraries
Custom versus Off-the-Shelf; Thrust; Computing Norms with inner_product(); Computing Distances with transform(); Estimating Pi with generate(), transform(), and reduce(); cuRAND; NPP; sharpen_npp; More Image Processing with NPP; Linear Algebra Using cuSOLVER and cuBLAS; cuDNN; ArrayFire; Summary; Suggested Projects; References
Chapter 9. Exploring the CUDA Ecosystem
The Go-To List of Primary Sources; CUDA Zone; Other Primary Web Sources; Online Courses; CUDA Books; Further Sources; CUDA Samples; CUDA Languages and Libraries; More CUDA Books; Summary; Suggested Projects
Appendix A. Hardware Setup
Checking for an NVIDIA GPU: Windows; Checking for an NVIDIA GPU: OS X; Checking for an NVIDIA GPU: Linux; Determining Compute Capability; Upgrading Compute Capability; Mac or Notebook Computer with a CUDA-Enabled GPU; Desktop Computer
Appendix B. Software Setup
Windows Setup; Creating a Restore Point; Installing the IDE; Installing the CUDA Toolkit; Initial Test Run; OS X Setup; Downloading and Installing the CUDA Toolkit; Linux Setup; Preparing the System Software for CUDA Installation; Downloading and Installing the CUDA Toolkit; Installing Samples to the User Directory; Initial Test Run
Appendix C. Need-to-Know C Programming
Characterization of C; C Language Basics; Data Types, Declarations, and Assignments; Defining Functions; Building Apps: Create, Compile, Run, Debug; Building Apps in Windows; Building Apps in Linux; Arrays, Memory Allocation, and Pointers; Control Statements: for, if; The for Loop; The if Statement; Other Control Statements; Sample C Programs; dist_v1; dist_v2; dist_v2 with Dynamic Memory; References
Appendix D. CUDA Practicalities: Timing, Profiling, Error Handling, and Debugging
Execution Timing and Profiling; Standard C Timing Methods; CUDA Events; Profiling with NVIDIA Visual Profiler; Profiling in Nsight Visual Studio; Error Handling; Debugging in Windows; Debugging in Linux; CUDA-MEMCHECK; Using Visual Studio Property Pages; References
Index
Code Snippets

Overview

CUDA for Engineers gives you direct, hands-on engagement with personal, high-performance parallel computing, enabling you to do computations on a gaming-level PC that would have required a supercomputer just a few years ago.

The authors introduce the essentials of CUDA C programming clearly and concisely, quickly guiding you from running sample programs to building your own code. Throughout, you’ll learn from complete examples you can build, run, and modify, complemented by additional projects that deepen your understanding. All projects are fully developed, with detailed building instructions for all major platforms.

Ideal for any scientist, engineer, or student with at least introductory programming experience, this guide assumes no specialized background in GPU-based or parallel computing. In an appendix, the authors also present a refresher on C programming for those who need it.

Coverage includes

Preparing your computer to run CUDA programs
Understanding CUDA’s parallelism model and C extensions
Transferring data between CPU and GPU
Managing timing, profiling, error handling, and debugging
Creating 2D grids
Interoperating with OpenGL to provide real-time user interactivity
Performing basic simulations with differential equations
Using stencils to manage related computations across threads
Exploiting CUDA’s shared memory capability to enhance performance
Interacting with 3D data: slicing, volume rendering, and ray casting
Using CUDA libraries
Finding more CUDA resources and code

Realistic example applications include

Visualizing functions in 2D and 3D
Solving differential equations while changing initial or boundary conditions
Viewing/processing images or image stacks
Computing inner products and centroids
Solving systems of linear algebraic equations
Performing Monte Carlo computations
