language-icon Old Web
English
Sign In

Profiling-Guided Optimization

2015 
The chapter focuses on a matrix transposition, a small and self-contained workload of great practical value. The optimization process applied to the code relies exclusively on programming in a high-level language plus utilization of the OpenMP framework. The result is a portable code that can run on both CPU (processor) and MIC (coprocessor) architectures, and can be recompiled for future generations of Intel architectures. The focus of the chapter is on the use of Intel® VTune™ Amplifier XE reports to understand where to apply optimization. Through VTune, the performance monitoring functionality of Intel Xeon Phi coprocessors is showcased not only to detect bottlenecks but also to point out overall performance issues and possible resolution methods. In short, this chapter demonstrates the familial relationship between Intel processors and coprocessors plus the Intel software development tools.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []