April 2016
Intermediate to advanced
256 pages
6h 48m
English
1.1 Framework of Swarm Intelligence Algorithms
2.1 Trend of CPU's Clock Frequency
2.2 The GPU Devotes More Transistors to Data Processing
2.3 Arithmetic Throughput and Memory Bandwidth for the CPU and GPU
2.4 Stream Programming Mode vs. SIMD
2.5 Programming Model for GPU
2.6 NVIDIA GPU
2.7 Thread Hierarchy in CUDA
2.8 Memory Hierarchy
2.9 Automatic Scalability
2.10 OpenCL Device Model
2.11 OpenCL Abstract Parallel Model
2.12 OpenCL Manage Kernel Execution Through Command Queue and Context
2.13 Gather and Scatter
2.14 Array of Structures vs. Structure of Arrays
3.1 Cycle Process of APOD
3.2 Naïve Parallel Model
3.3 Breakup Timing
3.4 The Overall Speedup ...