In Chapter 9, you learned how to carry out elementary arithmetic using packed floating-point operands and AVX instructions. You also learned how to code simple SIMD functions that perform computations using the elements of a floating-point array or matrix. In this chapter, you’ll study source code examples that perform more sophisticated floating-point calculations using AVX2 and fused multiply-add (FMA) instructions. The first section highlights an array-based algorithm ...
11. AVX2 Programming – Packed Floating-Point – Part 1
From Modern X86 Assembly Language Programming: Covers X86 64-bit, AVX, AVX2, and AVX-512.