Idgraf
From Digitalis
Contents |
Overview
Idgraf is a Bi-Socket Intel Xeon Westmere machine featuring 8 Nvidia GPU
- 2x Intel Xeon X5650 (Westmere, 6 cores each, total 12 cores)
- 72 GB RAM
- 8x coprocesseur Nvidia Tesla C2050
See:
- http://www.tyan.com/Barebones_FT72B7015_B7015F72V2R-N825%20%5BBTO%5D
- http://www.tyan.com/datasheets/FT72-B7015_Datasheet.pdf
- http://www.tyan.com/manuals/FT72-B7015_V2.1.pdf
- http://www.nvidia.com/docs/IO/43395/NV_DS_Tesla_C2050_C2070_jul10_lores.pdf
Beware that to support 8 GPU on a bi-socket Nehalem based machine, in addition to the need of 2 IOH connected via QPI buses, 2 PCI-e switches are required (PLXTech PEX8647 here), possibly limiting the bandwidth to the PCI-e cards.
See:
- http://www.qdpma.com/systemarchitecture/systemarchitecture_qpi.html
- http://www.plxtech.com/products/expresslane/pex8647
- http://www.plxtech.com/download/file/586
How to experiment
Privileged commands
Currently, the following commands can be run via sudo in exclusive jobs:
- sudo /usr/bin/whoami (provided for testing the mechanism, should return "root")
- sudo /sbin/reboot
- sudo /usr/bin/schedtool
- sudo /usr/bin/opcontrol
- sudo /usr/bin/perf
- sudo /opt/likwid/bin/likwid-perfctr
- sudo /opt/likwid/bin/likwid-topology
- sudo /usr/bin/nvidia-smi (please notify other users via the digitalis mailing list if you change parameters on GPUs that will not be reset to default after a reboot, e.g. the memory ECC configuration)
- sudo /usr/local/bin/ipmi-reset
- sudo /usr/bin/lstopo
System changelog
Currently, the default system is outdated:
- Debian squeeze
- Cuda 4
- ...
System is to be updated. Help welcome.
Acknolegment
The idgraf machine was funded by the Mescal and Moais teams of LIG/Inria.