Portable Explicit Vectorization Intrinsics
Paulo Souza*; Leonardo Borges‡; Cedric Andreolli†; Philippe Thierry† * Petrobras, Brazil† Intel, France‡ Intel, USA
Abstract
This chapter presents OpenVec. OpenVec provides a powerful and simple way to express vectorization closer to assembly level but in a portable way, supporting SIMD architectures from over a decade ago up to the most recent architectures including AVX-512. Vectorization is a fundamental aspect of performance optimization. Good compiler vectorizers, abstract language extensions like Cilk Plus, OpenMP SIMD, and other compiler options can effectively take care of the majority of the source code base. Yet, applications may contain very specific and delimited region(s) ...
Get High Performance Parallelism Pearls Volume Two now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.