Skip to Content
Intel® Xeon Phi™ Coprocessor Architecture and Tools: The Guide for Application Developers
book

Intel® Xeon Phi™ Coprocessor Architecture and Tools: The Guide for Application Developers

by Rezaur Rahman
September 2013
Intermediate to advanced
232 pages
7h 30m
English
Apress

Overview

Intel® Xeon Phi™ Coprocessor Architecture and Tools: The Guide for Application Developers provides developers a comprehensive introduction and in-depth look at the Intel Xeon Phi coprocessor architecture and the corresponding parallel data structure tools and algorithms used in the various technical computing applications for which it is suitable. It also examines the source code-level optimizations that can be performed to exploit the powerful features of the processor.

Xeon Phi is at the heart of world's fastest commercial supercomputer, which thanks to the massively parallel computing capabilities of Intel Xeon Phi processors coupled with Xeon Phi coprocessors attained 33.86 teraflops of benchmark performance in 2013. Extracting such stellar performance in real-world applications requires a sophisticated understanding of the complex interaction among hardware components, Xeon Phi cores, and the applications running on them.

In this book, Rezaur Rahman, an Intel leader in the development of the Xeon Phi coprocessor and the optimization of its applications, presents and details all the features of Xeon Phi core design that are relevant to the practice of application developers, such as its vector units, hardware multithreading, cache hierarchy, and host-to-coprocessor communication channels. Building on this foundation, he shows developers how to solve real-world technical computing problems by selecting, deploying, and optimizing the available algorithms and data structure alternatives matching Xeon Phi's hardware characteristics. From Rahman's practical descriptions and extensive code examples, the reader will gain a working knowledge of the Xeon Phi vector instruction set and the Xeon Phi microarchitecture whereby cores execute 512-bit instruction streams in parallel.

What you'll learn

  • How to calculate theoretical Gigaflops and bandwidth numbers on the hardware and measure them through code segment

  • How to estimate latencies in fetching data from different cache hierarchies, including memory subsystems

  • How to measure PCIe bus bandwidth between the host and coprocessor

  • How to exploit power management and reliability features built into the hardware

  • How to select and manipulate the best tools to tune particular Xeon Phi applications

  • Algorithms and data structures for optimizing Xeon Phi performance

  • Case studies of real-world Xeon Phi technical computing applications in molecular dynamics and financial simulations

  • Who this book is for

    This book is for developers wishing to design and develop technical computing applications to achieve the highest performance available in the Intel Xeon Phi coprocessor hardware. It provides a solid base on the coprocessor architecture, as well as algorithm and data structure case studies for Xeon Phi coprocessor. The book may also be of interest to students and practitioners in computer engineering as a case study for massively parallel core microarchitecture of modern day processors.

    Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
    and much more.

    Read now

    Unlock full access

    More than 5,000 organizations count on O’Reilly

    AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

    QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
    Julian F.
    Head of Cybersecurity
    QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
    Addison B.
    Field Engineer
    QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
    Amir M.
    Data Platform Tech Lead
    QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
    Mark W.
    Embedded Software Engineer

    You might also like

    Optimizing HPC Applications with Intel® Cluster Tools

    Optimizing HPC Applications with Intel® Cluster Tools

    Alexander Supalov, Andrey Semin, Michael Klemm, Christopher Dahnken
    Linux® Kernel Primer, The: A Top-Down Approach for x86 and PowerPC Architectures

    Linux® Kernel Primer, The: A Top-Down Approach for x86 and PowerPC Architectures

    Claudia Salzberg Rodriguez, Gordon Fischer, Steven Smolski

    Publisher Resources

    ISBN: 9781430259268Purchase book