|
[ Publications ]
[ Research Opportunities ]
[ Partners & Supporters ]
[ Earlier Work ]
|
|
Asymmetries in Multi-Core Systems -- Or Why We Need Better Performance Measurement Units
|
| Irina Tuduce,
Zoltan Majo,
Adrian Gauch,
Brad Chen,
Thomas R. Gross,
Asymmetries in Multi-Core Systems -- Or Why We Need Better Performance Measurement Units, The Exascale Evaluation and Research Techniques Workshop (EXERT) at ASPLOS 2010, March 2010.
[EXERT_2010.pdf]
|
| Future exascale systems will be based on multi-core processors, but even
today's multi-core processors can be asymmetric and exhibit limitations
and bottlenecks that are different from those found on a symmetric
multiprocessor. In this paper we investigate the performance of a
cluster node based on the Intel Xeon E5345 quad-core processor and
note that despite the symmetry implied by the programming model, the
available memory bandwidth is not shared equally among the
cores. Consequently, applications experience substantial performance
variance and slow-downs when the tasks (threads) are mapped to cores in
a naive manner. An operating system scheduler could mitigate these
effects by taking into account the memory bus structure but needs
accurate information from the performance monitoring unit as the
asymmetry is not directly exposed in the processor's instruction set
manual. Current performance monitoring units are quite inflexible and
change from one processor to the next, so higher levels of the software
tool chain are discouraged to use them. The next generation of
Nehalem-based multi-core systems poses similar challenges, and
the development of portable performance monitoring units will be crucial if
applications want to use the performance potential of exascale systems.
We expect this situation to remain unchanged as long as memory is slow
relative to the processor. |
|
[ Publications ]
[ Research Opportunities ]
[ Partners & Supporters ]
[ Earlier Work ]
|
|