In the preceding chapters, you learned the fundamentals of Armv8 assembly language programming. You also learned how to exploit the SIMD capabilities of the Armv8 platform to accelerate computationally intensive algorithms. To maximize the performance of your Armv8 assembly language code, it is often necessary to understand specific aspects of the inner workings of an Armv8 processor. In this chapter, you will explore the internal hardware components of a modern Armv8 multicore processor and its underlying microarchitecture. ...
17. Optimization Strategies and Techniques
Get Modern Arm Assembly Language Programming: Covers Armv8-A 32-bit, 64-bit, and SIMD now with the O’Reilly learning platform.