Next: The HP/Compaq
GS series. Up: Recount of
(almost) available ... Previous: The HP 9000
Superdome.
Machine type |
RISC-based ccNUMA system. |
Models |
HP 9000 SuperDome. |
Operating system |
HP-UX (HP's usual Unix flavour) |
Connection structure |
Crossbar |
Compilers |
Fortran 77, Fortran 90, Parallel Fortran, HPF, C, C++ |
Vendors information Web page |
http://www.hp.com/products1/servers/integrity/superdome_high_end/ |
Year of introduction |
2000. |
System parameters:
Model |
Integrity SuperDome |
Clock cycle |
1.5 GHz |
Theor. peak performance |
Per proc. (64-bits) |
6 Gflop/s |
Maximal (64-bits) |
768 Gflop/s |
Main memory |
Memory/node |
≤ 128 GB |
Memory/maximal |
1 TB |
No. of processors |
≤ 128 |
Communication bandwidth |
aggregate (global) |
64 GB/s |
(cell—backplane) |
8 GB/s |
(within cell, see below) |
16 GB/s |
Remarks:
The Integrity Superdome is HP's investment in the future for
high-end servers. Within a timespan of a few years it should replace
the PA-RISC-based HP 9000 Superdome. HP has anticipated on this by
giving it exactly the same macro structure: cells are connected to a
backplane crossbar that enables the communication between cells. For
the backplane it is immaterial whether a cell contains PA-RISC or
Itanium processors. For a discussion of the macro architecture see
the Remarks part of the HP 9000
Superdome. Also within a cell the structure of the system is
very similar: an identical Cell Controller ASIC as in the HP 9000
Superdome is the central hub that controls local memory access, I/O
requests, and communication to the other cells via the backplane.
The difference lies in the processor, the 1.5 GHz Itanium 2. HP
managed to make a special processor socket that allows 2 Itaniums to
be placed in one socket. Therefore 8 Itaniums in total can be housed
in the four sockets on the cell board. As each Itanium contains 2
floating-point units that are able to execute a combined floating
multiply-add instruction, in favourable circumstances 4 flops/cycle
can be achieved and a Theoretical Peak Performance of 6 Gflop/s per
processor can be attained. This amounts to a peak speed of 768
Gflop/s for a full configuration. Because the similarity between
the both Superdome systems is so large, the Integrity Superdome has
the same ccNUMA characteristics as its relative. It therefore
supports OpenMP over its maximum of 128 processors. As the Integrity
Superdome is based on the Itanium 2 for which much Linux development
is done in the past few years, the system can also be run with the
Linux OS. In fact, because the machine can be partitioned, it is
possible to run both Linux and HP-UX in the different complexes of
the same machine. The remark as for the HP 9000 Superdome applies:
in different complexes one can employ cells with different types of
processors, making the system a hybrid Integrity and HP 9000
Superdome.
Measured Performances: In [42]
a speed of 1049 Gflop/s is reported for solving a full linear system
of unspecified size. This result is achieved on a complex of systems
with a total of 220 processors. As the Theoretical Peak Performance
of such a cluster is 1320 Gflop/s the efficiency is 79%.
Next: The HP/Compaq
GS series Up: Recount of
(almost) available ... Previous: The HP 9000
Superdome.
Aad van der Steen Wed Oct 13 11:46:16 CEST 2004
|