Parallel and High Performance Computing

by Yuliana Zamora, Robert Robey
July 2021
Content level: Intermediate to advanced
704 pages
21h 6m
English
Manning Publications
Content preview from Parallel and High Performance Computing

6 Vectorization: FLOPs for free

This chapter covers

  • The importance of vectorization
  • The kind of parallelization provided by a vector unit
  • Different ways you can access vector parallelization
  • Performance benefits you can expect

Processors have special vector units that can load and operate on more than one data element at a time. If an application is limited by floating-point operations, vectorization is necessary to reach peak hardware capability. Vectorization is the process of grouping operations together so that more than one can be done at a time. However, adding more flops to the hardware capability has limited benefit when an application is memory bound, and most applications are memory bound. Compilers can be powerful, but as ...
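To make the two ideas in this paragraph concrete, here is a minimal sketch in C. It is not taken from the book: the function names `triad` and `triad_flops_per_byte` are hypothetical, and the kernel is the familiar stream-triad pattern, chosen as a loop that compilers can typically vectorize on their own. The second function estimates the arithmetic intensity of that loop under the simplifying assumption that each iteration moves exactly three doubles to or from memory (cache effects are ignored).

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical example kernel: a[i] = b[i] + scalar * c[i].
 * The `restrict` qualifiers promise the compiler that the arrays do not
 * overlap, which removes a common obstacle to auto-vectorization. Each
 * iteration is independent, so a vector unit can process several
 * elements per instruction. */
void triad(size_t n, double *restrict a, const double *restrict b,
           const double *restrict c, double scalar)
{
    assert(a != NULL && b != NULL && c != NULL);
    for (size_t i = 0; i < n; i++) {
        a[i] = b[i] + scalar * c[i];
    }
}

/* Rough arithmetic intensity of the loop above, in flops per byte.
 * Assumption: every iteration loads b[i] and c[i] and stores a[i],
 * with no reuse from cache. */
double triad_flops_per_byte(void)
{
    const double flops = 2.0;                   /* one multiply, one add */
    const double bytes = 3.0 * sizeof(double);  /* two loads, one store */
    return flops / bytes;
}
```

With 8-byte doubles the estimate is 2 flops per 24 bytes, roughly 0.08 flops per byte. A loop this light on arithmetic is usually limited by memory traffic, so vectorizing it may yield less speedup than the vector width suggests. With GCC, for example, compiling at `-O3` with `-fopt-info-vec` reports which loops were vectorized.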



Publisher Resources

ISBN: 9781617296468