An early performance evaluation of many integrated core architecture based SGI rackable computing system

Subhash Saini,Haoqiang Jin,Dennis C. Jespersen,Huiyu Feng,M. Jahed Djomehri,William Arasin,Robert Hood,Piyush Mehrotra,Rupak Biswas

An early performance evaluation of many integrated core architecture based SGI rackable computing system

2013

Intel recently introduced the Xeon Phi coprocessor based on the Many Integrated Core architecture featuring 60 cores with a peak performance of 1.0 Tflop/s. NASA has deployed a 128-node SGI Rackable system where each node has two Intel Xeon E2670 8-core Sandy Bridge processors along with two Xeon Phi 5110P coprocessors. We have conducted an early performance evaluation of the Xeon Phi. We used microbenchmarks to measure the latency and bandwidth of memory and interconnect, I/O rates, and the performance of OpenMP directives and MPI functions. We also used OpenMP and MPI versions of the NAS Parallel Benchmarks along with two production CFD applications to test four programming modes: offload, processor native, coprocessor native and symmetric (processor plus coprocessor). In this paper we present preliminary results based on our performance evaluation of various aspects of a Phi-based system.

Keywords:

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations