Appendix A

Figures and Tables

List of Figures

1.1 Framework of Swarm Intelligence Algorithms 2

2.1 Trend of CPU's Clock Frequency 10

2.2 The GPU Devotes More Transistors to Data Processing 11

2.3 Arithmetic Throughput and Memory Bandwidth for the CPU and GPU 12

2.4 Stream Programming Mode vs. SIMD 14

2.5 Programming Model for GPU 15

2.6 NVIDIA GPU 18

2.7 Thread Hierarchy in CUDA 19

2.8 Memory Hierarchy 20

2.9 Automatic Scalability 21

2.10 OpenCL Device Model 24

2.11 OpenCL Abstract Parallel Model 25

2.12 OpenCL Manage Kernel Execution Through Command Queue and Context 26

2.13 Gather and Scatter 28

2.14 Array of Structures vs. Structure of Arrays 29

3.1 Cycle Process of APOD 35

3.2 Naïve Parallel Model 37

3.3 Breakup Timing 38

3.4 The Overall Speedup ...

Get GPU-based Parallel Implementation of Swarm Intelligence Algorithms now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.